Difference between text-generation and text2text-generation? Canonical way to provide multiple demonstrations?

I have a sequence of demonstrations:

first demo => first output
second demo => second output
...
nth demo => nth output

Given an (n+1)th demo, I want the model to generate a candidate for the (n+1)th output.

Should I format this into one giant string, with the pairs separated by '\n', and append '=>' at the end so the model completes the next output? That's what I would do in the OpenAI Playground.

What's the appropriate pipeline for this task: text-generation, text2text-generation, or something else? All data (both demos and outputs) is plaintext (ASCII). I'm currently targeting gpt2-medium, which I'll probably have to fine-tune later.
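Here's a sketch of what I have in mind. The '\n' and '=>' formatting is just my guess from the Playground, and the demo strings are placeholders for my real data; the actual generation call is commented out since gpt2-medium is a sizable download:

```python
def generate_candidate(prompt: str, model_name: str = "gpt2-medium") -> str:
    """Complete the prompt with the text-generation pipeline (my current guess)."""
    # Lazy import so the prompt-building part runs without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_name)
    # return_full_text=False drops the prompt, leaving only the new tokens.
    out = generator(prompt, max_new_tokens=20, return_full_text=False)
    return out[0]["generated_text"]


# Placeholder demo/output pairs standing in for my real plaintext data.
demos = [
    ("first demo", "first output"),
    ("second demo", "second output"),
    ("nth demo", "nth output"),
]
next_demo = "n+1th demo"

# One line per pair, '\n'-separated, ending in '=>' so the model completes it.
prompt = "\n".join(f"{d} => {o}" for d, o in demos) + f"\n{next_demo} =>"
print(prompt)

# candidate = generate_candidate(prompt)  # downloads gpt2-medium on first run
```

Is this roughly the right shape, or is there a more canonical way to pack demonstrations into the prompt?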

GPT-3 (text-davinci-002 in the OpenAI Playground) can do this task without fine-tuning, but I have my doubts about GPT-2.