I’m trying to fine-tune Llama 2, and I don’t want to train on the full sequence (instructions + completion), only on the completion. Because of that I’m using the DataCollatorForCompletionOnlyLM collator from TRL.
The completion in my training dataset is a very short sentence, so I was expecting faster training, because training would reduce to predicting the next tokens of the completion. Is this what happens when we use the DataCollatorForCompletionOnlyLM collator? The problem is that I’m not measuring any improvement in speed. Actually, I think it got slower.
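For context, here is roughly how I’m wiring things up. It’s a minimal sketch following the standard TRL pattern; the model checkpoint, dataset file, text column, and response_template are placeholders standing in for my actual setup:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from datasets import load_dataset
from trl import SFTTrainer, DataCollatorForCompletionOnlyLM

model_name = "meta-llama/Llama-2-7b-hf"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Hypothetical dataset file; each row's "text" column holds
# the full prompt string (instructions + completion).
dataset = load_dataset("json", data_files="train.jsonl", split="train")

# The collator sets the labels of every token before the response
# template to -100, so only the completion tokens contribute to the loss.
response_template = "### Answer:"  # placeholder; must match my prompt format
collator = DataCollatorForCompletionOnlyLM(response_template, tokenizer=tokenizer)

trainer = SFTTrainer(
    model,
    train_dataset=dataset,
    dataset_text_field="text",
    data_collator=collator,
)
trainer.train()
```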
Any comment on this would be very appreciated.