I'm getting a "nan" value for loss while following a tutorial from the documentation

sunny · October 14, 2020, 12:02pm

Hi,

I’m following the “Fine-tuning with Custom Dataset” tutorial for Question Answering on the SQuAD dataset (available here). I’ve copied and pasted all the code from the tutorial step by step. However, when my model starts training, the loss metrics are reported as “nan” instead of the numeric values I would normally expect. Here is the output when I run:
model.fit(train_dataset.shuffle(1000).batch(16), epochs=3, batch_size=16)

Epoch 1/3
5427/5427 [==============================] - 4604s 848ms/step - loss: nan - output_1_loss: nan - output_2_loss: nan
Epoch 2/3
365/5427 [=>…] - ETA: 1:11:28 - loss: nan - output_1_loss: nan - output_2_loss: nan

I don’t know what is wrong, and I don’t think this is the output I’m supposed to get.
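
One thing I could check is whether any of the answer-span labels fall outside the tokenized sequence, since an out-of-range label is one possible (unconfirmed) source of a “nan” cross-entropy loss. Below is a minimal sketch of such a check. The function name `find_bad_positions`, the `seq_len` parameter, and the toy label lists are hypothetical and not part of the tutorial; the real labels would come from the encodings built in the tutorial.

```python
def find_bad_positions(start_positions, end_positions, seq_len):
    """Return indices of examples whose answer span is not a usable label.

    Hypothetical helper: a span is flagged if either position is None,
    lies outside [0, seq_len), or if the start comes after the end.
    """
    bad = []
    for i, (start, end) in enumerate(zip(start_positions, end_positions)):
        # Both positions must be real indices inside the sequence.
        in_range = all(p is not None and 0 <= p < seq_len for p in (start, end))
        # Short-circuit keeps us from comparing None with an int.
        if not in_range or start > end:
            bad.append(i)
    return bad


# Toy labels (assumed values, for illustration only), sequence length 8.
starts = [1, None, 3, 8, 5]
ends = [2, 4, 7, 8, 4]
bad_indices = find_bad_positions(starts, ends, seq_len=8)
print(bad_indices)
```

If a check like this flags any examples, those labels would be the first thing to look at; if it flags none, the cause is presumably elsewhere (for example in the loss or optimizer setup).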

I would appreciate any help in this regard.
Thank you.
