Verifying a script to train an LLM on supervised data

I don’t think there are any major issues with the quantization. The LoRA settings also appear to be correct. If something were wrong there, it would usually raise an error anyway…:sweat_smile:

Regarding tokenization, I later noticed that the handling of special tokens and label masks may be incomplete.
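As a concrete illustration of the label-masking point: a common convention is to set prompt positions in `labels` to `-100` so that PyTorch's `CrossEntropyLoss` ignores them and the loss is computed only on the response. This is a minimal sketch with made-up token IDs, not the actual script:

```python
# Sketch of prompt masking for supervised fine-tuning.
# Token IDs below are placeholders; a real tokenizer would produce them.
IGNORE_INDEX = -100  # ignored by PyTorch's CrossEntropyLoss by default

def build_labels(prompt_ids, response_ids, eos_id):
    input_ids = prompt_ids + response_ids + [eos_id]
    # Mask the prompt so the model is trained only to predict
    # the response (and the trailing EOS token).
    labels = [IGNORE_INDEX] * len(prompt_ids) + response_ids + [eos_id]
    return input_ids, labels

prompt = [101, 7592, 102]   # placeholder prompt IDs
response = [2023, 2003]     # placeholder response IDs
input_ids, labels = build_labels(prompt, response, eos_id=2)
print(labels)  # → [-100, -100, -100, 2023, 2003, 2]
```

Forgetting this masking (or the EOS token) is exactly the kind of silent issue that won't raise an error but degrades training.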

Also, since the training loop is hand-written, debugging is difficult… I can help with fixes when it fails outright (e.g. syntax errors), but silent logical oversights are much harder to spot.

Given that doing all of this correctly by hand is quite tedious (there are many model-specific conventions and details that aren’t obvious), I think it would be simpler to use an existing trainer.
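For example, TRL's `SFTTrainer` handles tokenization, special tokens, and label masking for you and works with PEFT/LoRA out of the box. A rough configuration sketch (model and dataset names are placeholders, adjust to your setup):

```python
# Hedged sketch of the "existing trainer" route with TRL + PEFT.
# Names below are placeholders for illustration, not from the original script.
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")  # example dataset

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",  # placeholder model name
    train_dataset=dataset,
    args=SFTConfig(output_dir="./sft-out"),
    peft_config=LoraConfig(r=16, lora_alpha=32, target_modules="all-linear"),
)
trainer.train()
```

This replaces the manual training loop entirely, so the model-specific details are handled by code that is already widely tested.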