Unable to update the weights / learn anything

nielsr · December 22, 2023, 9:20am

Hi,

I really recommend Andrej Karpathy's post "A Recipe for Training Neural Networks" for debugging your training run. The best tip for me: take a single training example and check whether the model can overfit it (i.e. reach 100% accuracy on that example, with the loss going to nearly zero). If it can't, there's a bug somewhere in your model or training code.
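To make the idea concrete, here is a minimal, framework-free sketch of that sanity check. It is not your model: the tiny logistic regression, the example values, and the names `overfit_single_example`, `lr` and `steps` are all assumptions made up for illustration. With a real model you would do the same thing with one example (or one small batch) and your actual optimizer.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def overfit_single_example(x, y, lr=0.5, steps=200):
    """Train a tiny logistic regression on ONE example; return weights, bias, loss history."""
    w = [0.0] * len(x)
    b = 0.0
    losses = []
    for _ in range(steps):
        # Forward pass on the single example.
        p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
        # Clamp to avoid log(0) once the prediction saturates.
        p_safe = min(max(p, 1e-12), 1.0 - 1e-12)
        # Binary cross-entropy loss.
        loss = -(y * math.log(p_safe) + (1 - y) * math.log(1.0 - p_safe))
        losses.append(loss)
        # Gradient of the loss with respect to the logit is (p - y).
        grad = p - y
        w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
        b -= lr * grad
    return w, b, losses


# Hypothetical single training example: three features, label 1.
x, y = [0.5, -1.2, 3.0], 1
w, b, losses = overfit_single_example(x, y)
pred = int(sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) > 0.5)
print(f"first loss {losses[0]:.4f}, last loss {losses[-1]:.6f}, prediction {pred}")
```

What you want to see is the loss dropping steadily towards zero and the prediction matching the label. If the loss stays flat on a single example, look at things like the learning rate, whether the parameters actually receive gradients, and whether the labels are what you think they are.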

See also the guide we wrote on debugging your training pipeline with the Trainer class: "Debugging the training pipeline" in the Hugging Face NLP Course.
