Fine-tune GPT2/LLaMA in a seq2seq manner

With GPT2/LLaMA, by default, fine-tuning means feeding the whole [prompt label] sequence to the model (model([prompt label])), calculating the CrossEntropy loss on the label part only, and reading predictions from model().logits.

Is there any way to input only the prompt and do the fine-tuning in a seq2seq manner (model(prompt)), so that we minimize only the loss -log p(y|x)?
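To make my current setup concrete, here is a minimal sketch of what I do now: feed the whole [prompt label] sequence and mask the prompt positions in labels with -100 so the CrossEntropy only covers the label tokens (the model name and strings are just placeholders):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

prompt = "Translate to French: Hello"  # illustrative example
label = " Bonjour"

prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
full_ids = tokenizer(prompt + label, return_tensors="pt").input_ids

# Copy the inputs as labels, then set the prompt positions to -100,
# which CrossEntropyLoss ignores, so the loss is only over the label tokens.
labels = full_ids.clone()
labels[:, : prompt_ids.shape[1]] = -100

loss = model(input_ids=full_ids, labels=labels).loss
loss.backward()
```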


Commenting to follow this thread.

Hi,

I assume you mean you want to train the model only on the completions, rather than on the instructions (prompts)? This is supported by the DataCollatorForCompletionOnlyLM collator in the TRL library. You can use it in combination with the SFTTrainer in order to train the model only on the completions.
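A minimal sketch along the lines of the TRL docs (the toy dataset, formatting string, and response template are just illustrative, and exact SFTTrainer arguments vary across TRL versions):

```python
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTTrainer, DataCollatorForCompletionOnlyLM

model_name = "gpt2"  # the same recipe applies to LLaMA checkpoints
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT2 has no pad token

# Tiny toy dataset; in practice, load your own prompt/completion pairs.
dataset = Dataset.from_dict({
    "prompt": ["What is the capital of France?"],
    "completion": ["Paris."],
})

def formatting_prompts_func(example):
    # Render each example into a single training string.
    texts = []
    for i in range(len(example["prompt"])):
        texts.append(
            f"### Question: {example['prompt'][i]}\n ### Answer: {example['completion'][i]}"
        )
    return texts

# Everything up to and including the response template is excluded from
# the loss, so only the answer tokens contribute to -log p(y|x).
collator = DataCollatorForCompletionOnlyLM(" ### Answer:", tokenizer=tokenizer)

trainer = SFTTrainer(
    model,
    train_dataset=dataset,
    formatting_func=formatting_prompts_func,
    data_collator=collator,
)
trainer.train()
```

Under the hood, the collator does the same thing as the manual masking in the sketch above: it sets the labels of all tokens before the response template to -100 so they are ignored by the loss.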
