There are a few `*Trainer` objects available from `transformers`, `trl`, and `setfit`.
Beyond the standard answer of “it depends on the task and which library you want to use”, what are the best practices or general guidelines for choosing which `*Trainer` object to use to train/tune our models?
Together with the `*Trainer` object, we sometimes see suggestions to use the corresponding `*TrainingArguments` or the vanilla `TrainingArguments`.
For reference, we have:

- fine-tuning Llama:
  - https://github.com/philschmid/huggingface-llama-2-samples/blob/master/training/scripts/run_clm.py suggests the vanilla `Trainer`
- few-shot learning:
  - SetFit: Efficient Few-Shot Learning Without Prompts suggests `from setfit import SetFitTrainer`
- machine translation
- reinforcement learning:
  - Supervised Fine-tuning Trainer suggests `from trl import SFTTrainer`
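To make the comparison concrete, the references above can be summarized as a rough task-to-trainer map. This is just a sketch of what the links listed here suggest (the task labels are my own shorthand, not official names), not a general rule:

```python
# Rough task -> trainer mapping, taken only from the references above.
# Which one actually fits best still depends on the task and library,
# as the question itself notes.
TRAINER_FOR_TASK = {
    "causal-lm-finetuning": ("transformers", "Trainer"),    # vanilla Trainer + TrainingArguments
    "few-shot-classification": ("setfit", "SetFitTrainer"), # per the SetFit paper
    "supervised-finetuning": ("trl", "SFTTrainer"),         # per the TRL SFT docs
}

def suggested_trainer(task: str) -> str:
    """Return 'library.TrainerClass' for a known task, per the references."""
    library, trainer_cls = TRAINER_FOR_TASK[task]
    return f"{library}.{trainer_cls}"

print(suggested_trainer("few-shot-classification"))  # setfit.SetFitTrainer
```

All of these classes expose a similar surface (take a model and a `train_dataset`, then call `.train()`), so the choice is mostly about which library's abstractions match the task, which is exactly what the question is asking about.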