df['train'] = df['test']
This overwrites the train split with the test split, which shrinks the dataset and looks prone to overfitting, but it was most likely set up this way just for quick testing.
I don't see any obvious issues with the data preprocessing code. One thing I'm unsure about: should padding tokens be excluded from the labels, and does that convention differ between Seq2Seq and CausalLM?
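On the padding question, the usual Hugging Face convention is to replace padding positions in the labels with -100 so the loss ignores them (for Seq2Seq, `DataCollatorForSeq2Seq` does this for you; for CausalLM you often copy `input_ids` to labels and mask pads yourself). A minimal sketch, assuming a hypothetical pad token id of 0:

```python
PAD_TOKEN_ID = 0     # assumed for illustration; use tokenizer.pad_token_id
IGNORE_INDEX = -100  # the default ignore_index of PyTorch's CrossEntropyLoss

def mask_pad_labels(input_ids):
    """Build labels from input_ids, masking pad positions so they
    contribute nothing to the loss."""
    return [tok if tok != PAD_TOKEN_ID else IGNORE_INDEX for tok in input_ids]

print(mask_pad_labels([5, 9, 3, 0, 0]))  # [5, 9, 3, -100, -100]
```

If labels keep the raw pad token id instead, the model is also trained to predict padding, which can skew the loss on short sequences.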
The learning rate is slightly high (though I don't think it's a problem on its own), and the absence of LoRA dropout and weight decay settings may be contributing to the overfitting.
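If overfitting is the concern, both knobs are cheap to try. A config sketch using `peft` and `transformers`; the specific values below are illustrative, not tuned:

```python
from peft import LoraConfig
from transformers import TrainingArguments

# Illustrative values only -- not tuned for this dataset.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,   # regularizes the LoRA adapter layers
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="out",
    learning_rate=1e-4,  # a step down if the current LR feels high
    weight_decay=0.01,   # mild L2-style regularization via AdamW
)
```

Even small values of `lora_dropout` (0.05–0.1) and `weight_decay` (0.01) often help when the training set is this small.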