Verification of a script to train an LLM on supervised data

Trainer does sound like a great tool, but I noticed it runs slower than a manual loop because of all the machinery built on top of it. Writing a manual loop also gives me more control over the training. As far as errors are concerned, the script runs fine :face_exhaling:. Also, HuggingFace doesn't provide a tutorial for supervised fine-tuning with the Trainer from transformers (only a trl-based SFTTrainer tutorial), while the script I posted attempts supervised fine-tuning with transformers directly. On top of that, the HuggingFace tutorials are not always up to date with the stable versions of the libraries they use, which has led to a lot of errors and debugging sessions on my end. For these reasons I prefer to do things like tokenization, data loading, and the training loop manually. Could you help me with the tokenization part and point out my mistake?
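
For context, the tokenization pattern I'm aiming for is the usual one for causal-LM supervised fine-tuning: concatenate prompt and response, and mask the prompt tokens out of the loss with -100 labels. Here is a minimal sketch of that pattern; the checkpoint name and the `tokenize_example` helper are just placeholders for illustration, not the exact code from my script:

```python
# Minimal sketch: manual tokenization for causal-LM supervised fine-tuning.
# Assumes a GPT-2-style tokenizer; swap in your own checkpoint and dataset fields.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder checkpoint
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default

def tokenize_example(prompt, response, max_length=512):
    # Tokenize prompt and response separately so the prompt tokens can be
    # masked out of the loss (label -100 is ignored by the cross-entropy loss).
    prompt_ids = tokenizer(prompt, add_special_tokens=False)["input_ids"]
    response_ids = tokenizer(response + tokenizer.eos_token, add_special_tokens=False)["input_ids"]

    input_ids = (prompt_ids + response_ids)[:max_length]
    labels = ([-100] * len(prompt_ids) + response_ids)[:max_length]
    attention_mask = [1] * len(input_ids)

    return {"input_ids": input_ids, "attention_mask": attention_mask, "labels": labels}

example = tokenize_example("Translate to French: Hello", " Bonjour")
print(len(example["input_ids"]), len(example["labels"]))  # lengths must match
```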
