Hi,
I tried to fine-tune ‘bert-base-cased’ on the CoLA task of the GLUE dataset.
Training went fine, but I ran into a problem when performing prediction on the ‘test’ split with the following code:
predictions = trainer.predict(test_split)
I got the following errors:
…/aten/src/ATen/native/cuda/Loss.cu:257: nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [0,0,0] Assertion `t >= 0 && t < n_classes` failed.
…/aten/src/ATen/native/cuda/Loss.cu:257: nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [1,0,0] Assertion `t >= 0 && t < n_classes` failed.
…/aten/src/ATen/native/cuda/Loss.cu:257: nll_loss_forward_reduce_cuda_kernel_2d: block: [0,0,0], thread: [2,0,0] Assertion `t >= 0 && t < n_classes` failed.
…
RuntimeError: CUDA error: device-side assert triggered
If I switch the prediction to the ‘validation’ split, everything works fine.
My questions are:
- Is this a known bug?
- Is there a good way to track down where this problem comes from?
- I have noticed that for this dataset (GLUE/CoLA), the ‘label’ values in the ‘test’ split differ from those in the ‘train’ and ‘validation’ splits. Do I need to modify these labels before evaluating the model?
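For context, here is a plain-Python sketch (my own illustration, not the actual CUDA kernel) of the range check that the assertion `t >= 0 && t < n_classes` performs. I am assuming `n_classes = 2` for CoLA and that the test split uses -1 as a placeholder label, which would make every test example trip the assert:

```python
def offending_labels(labels, n_classes):
    """Return indices of labels that would fail `t >= 0 && t < n_classes`."""
    return [i for i, t in enumerate(labels) if not (0 <= t < n_classes)]

train_like = [0, 1, 1, 0]   # valid binary CoLA labels
test_like = [-1, -1, -1]    # assumed placeholder labels in the test split

print(offending_labels(train_like, 2))  # no offenders
print(offending_labels(test_like, 2))   # every label offends
```

If that is the cause, dropping or replacing the label column before calling trainer.predict on the test split should avoid the device-side assert.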
Thanks.