Doubts regarding Tuner and SFTuner

jjovalle99 · July 10, 2023, 3:56pm

I’m following some tutorials that FourthBrain and DeepLearningAI conducted a few weeks ago. However, I’m a little bit confused with the fact that they make the distinction between Supervised Instruct-tuning and Fine-tuning with an “unsupervised” approach.

Supervised Instruct-tuning: Google Colab
Unsupervised fine-tuning:
Google Colab

The only difference that I can catch is the usage of SFTrainer and on the first notebook and Trainer on the second one. However, I noticed they are basically doing the same thing (ofc the data is different).

With this in mind, my questions are:

Is there a real difference between SFTrainer and Trainer in this scenario? I know SFTrainer is built on top of Trainer
How are these two approaches different? From what I understand this is basically doing a self-supervised fine-tuning (CausalLM)

Thank you for your time!

Topic		Replies	Views
[LMM Fine Tuning] Supervised Fine Tuning Trainer (SFTTrainer) vs transformers Trainer Intermediate	1	1662	November 29, 2023
Fine tune with SFTTrainer Intermediate	17	13547	September 12, 2024
Difference between calling model() and using Trainer()? Beginners	6	1419	November 19, 2020
When to use SFTTrainer 🤗Transformers	5	12030	December 6, 2023
Supervised Fine-tuning Trainer - where is the 'supervised' part? Beginners	0	448	July 3, 2023

Doubts regarding Tuner and SFTuner

Related topics