Loss to zero in the training

GenV · February 17, 2022, 10:21am

Hello,
I’m using the official code to fine-tune a T5-base model on my dataset (seq2seq).

After 3 epochs, the training loss goes to zero, while the eval loss is only close to zero. My model does not perform well on the test set, so what can I do to keep the training loss from going to zero? Can I change the loss function? Thanks.
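
To make the loss-function question concrete, here is a minimal sketch of one commonly used change, label smoothing, written in plain Python rather than taken from the official fine-tuning code. The function names and the `epsilon` value are hypothetical, chosen only for illustration. With `epsilon > 0` the loss cannot reach exactly zero, even when the model is very confident about the correct token. Whether this actually helps test-set performance in this setup is an assumption, not something shown here.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def label_smoothed_nll(logits, target, epsilon=0.1):
    """Cross-entropy for one token with label smoothing (illustrative helper).

    epsilon = 0.0 gives the ordinary negative log-likelihood.
    epsilon > 0.0 mixes in the mean negative log-probability over all
    classes, which penalizes over-confident predictions.
    """
    log_probs = log_softmax(logits)
    nll = -log_probs[target]                    # standard loss term
    uniform = -sum(log_probs) / len(log_probs)  # smoothing term
    return (1.0 - epsilon) * nll + epsilon * uniform


# A very confident, correct prediction: the plain loss is almost zero,
# while the smoothed loss stays clearly above zero.
confident_logits = [20.0, 0.0, 0.0]
plain_loss = label_smoothed_nll(confident_logits, target=0, epsilon=0.0)
smoothed_loss = label_smoothed_nll(confident_logits, target=0, epsilon=0.1)
print(plain_loss, smoothed_loss)
```

If the fine-tuning script is built on the Transformers `Trainer`, the `label_smoothing_factor` training argument applies a similar idea without a custom loss; whether the official script in question exposes it is an assumption.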
