How to Add Validation Loss to run_squad.py?

I’m using bert-base-uncased as the model on SQuAD v1.1 and have args.evaluate_during_training set to True. I tried adding "start_positions": batch[3] and "end_positions": batch[4] to the inputs in the evaluate() method so that BertForQuestionAnswering returns the total loss.
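For reference, this is roughly the change I made inside the evaluation loop (a sketch rather than a verbatim copy of the script; the variable names follow run_squad.py, and the eval_loss accumulation is just there to show what I want to track):

```python
# Sketch of my edit inside evaluate() in run_squad.py
for batch in tqdm(eval_dataloader, desc="Evaluating"):
    model.eval()
    batch = tuple(t.to(args.device) for t in batch)

    with torch.no_grad():
        inputs = {
            "input_ids": batch[0],
            "attention_mask": batch[1],
            "token_type_ids": batch[2],
            # Added these two entries so BertForQuestionAnswering also
            # returns a loss, mirroring what train() passes in:
            "start_positions": batch[3],
            "end_positions": batch[4],
        }
        outputs = model(**inputs)
        # With labels supplied, the first output is the total (start + end) loss
        eval_loss += outputs[0].item()
```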

However, when I try to do that, I get the following CUDA assertion failure:

cunn_ClassNLLCriterion_updateOutput_kernel(Dtype *, Dtype *, Dtype *, long *, Dtype *, int, int, int, int, long) [with Dtype = float, Acctype = float]: block: [0,0,0], thread: [0,0,0] Assertion t >= 0 && t < n_classes failed.

What might be the problem? The only difference from training is that, to get the loss on the dev set, I pass start_positions and end_positions to the model during evaluation, the same way they are passed during training.

I also noticed that the dev dataset contains multiple possible answers for each question. Is there a way to account for that when computing validation loss, or is there a better way to check whether the model is overfitting?
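To make the question concrete, something like the sketch below is what I had in mind for handling multiple gold answers: multi_answer_loss and the per-answer (start, end) token spans are placeholders I made up, and the start/end averaging just mirrors how BertForQuestionAnswering combines its two cross-entropy losses.

```python
import torch

def multi_answer_loss(start_logits, end_logits, answer_spans):
    """Hypothetical per-example loss against several annotated answers.

    start_logits, end_logits: 1-D tensors of length seq_len for one example.
    answer_spans: list of (start_position, end_position) token indices,
        one pair per acceptable answer in the dev set.
    Returns the smallest loss over the annotated spans, so matching any
    one of the gold answers is enough.
    """
    loss_fct = torch.nn.CrossEntropyLoss()
    losses = []
    for start_pos, end_pos in answer_spans:
        start_loss = loss_fct(
            start_logits.unsqueeze(0),
            torch.tensor([start_pos], device=start_logits.device),
        )
        end_loss = loss_fct(
            end_logits.unsqueeze(0),
            torch.tensor([end_pos], device=end_logits.device),
        )
        losses.append((start_loss + end_loss) / 2)
    return torch.stack(losses).min()
```

Taking the minimum means an example is not penalized as long as the model matches any one of its annotated answers, but I’m not sure whether that is a reasonable way to define validation loss, hence the question.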