When should one opt for the Supervised Fine-tuning Trainer (SFTTrainer) from TRL instead of the regular Transformers Trainer when it comes to instruction fine-tuning Large Language Models (LLMs)? From what I gather, the regular Trainer is typically used for unsupervised fine-tuning, i.e., continued next-token training on raw text, whereas supervised fine-tuning is what you do on data formatted into an input-output schema such as instruction-response pairs. Yet there seem to be many fine-tuning examples with very similar characteristics, some employing the SFTTrainer and others the regular Trainer (see the sketch below for the two patterns I mean). Which factors should be considered when choosing between the two approaches?
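To make the comparison concrete, here is a minimal sketch of the two setups I keep running into. The model (facebook/opt-350m), the imdb dataset, and all hyperparameters are just placeholders, and depending on your trl version, arguments like dataset_text_field and max_seq_length may need to go into an SFTConfig rather than being passed to SFTTrainer directly:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)
from trl import SFTTrainer

dataset = load_dataset("imdb", split="train")  # placeholder corpus

# --- Pattern 1: regular Trainer, plain causal-LM objective on raw text ---
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")

tokenized = dataset.map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True,
    remove_columns=dataset.column_names,
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out-trainer"),
    train_dataset=tokenized,
    # mlm=False -> labels are the input ids, i.e., next-token prediction
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()

# --- Pattern 2: SFTTrainer, which handles tokenization/packing itself ---
sft_trainer = SFTTrainer(
    "facebook/opt-350m",        # model name (or a preloaded model)
    train_dataset=dataset,
    dataset_text_field="text",  # column containing the (formatted) text
    max_seq_length=512,
)
sft_trainer.train()
```

Both appear to end up doing next-token prediction on the same data, which is exactly why I'm unsure when SFTTrainer's extras (e.g., packing or prompt formatting) actually matter for instruction fine-tuning.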
Thank you!