Finetuning with SFTtrainer

naviiiid · June 12, 2024, 2:14pm

Hell everyone!
I’ve been trying to finetune a GPT based model using SFT trainer from TRl library. it is my understanding that you could directly pass a non-tokenized dataset and the SFT trainer class handles the tokenization internally.
However after defining my dataset(with only one row named “text”), the error “you should provide a list of encodings but you have provided none” is raised.
what could be the problem here??
Intermediate #TRL #SFTTrainer

nielsr · June 12, 2024, 2:16pm

See my answer here: Fine tune with SFTTrainer - #8 by nielsr

Topic		Replies	Views
Fine tune with SFTTrainer Intermediate	17	14023	September 12, 2024
SFT Trainer and chat templates Beginners	3	356	March 26, 2025
Error using SFTTrainer: Make sure that your dataset has enough samples to at least yield one packed sequence Beginners	9	2996	November 1, 2024
Anyone have idea how we can finetune a model using Trainer API? 🤗Transformers	0	446	April 22, 2022
Questions when doing Transformer-XL Finetune with Trainer Beginners	3	1057	October 6, 2021

Finetuning with SFTtrainer

Related topics