Since you are getting an index out of range error, there indeed seems to be a mismatch between the ground-truth labels and the prediction layer of the model. Since you are working on a language classification task, this is probably caused by the tokenizer, which means you are facing both of the problems I mentioned.
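A quick way to confirm the first problem is to check whether any ground-truth label id is outside the range of the model's prediction layer. This is just a sketch with made-up numbers, since I don't know your actual label set or output size:

```python
# Hypothetical ground-truth label ids collected from the dataset
labels = [0, 2, 5, 1, 3]

# Assumed size of the model's classification head (num_labels)
num_labels = 5

# Any label id >= num_labels will trigger an index-out-of-range error
# when the loss indexes into the prediction layer's outputs
bad = [label for label in labels if label >= num_labels]
print(bad)  # → [5]
```

If `bad` is non-empty, either the label mapping is off by one (e.g. labels starting at 1) or the head was built with too few classes.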
Unfortunately, I have no experience with custom tokenizers, but if the model architecture needs special tokens (for example, BERT always needs `[MASK]` for MLM and `[SEP]` for NSP), I believe you will have to include them in the vocabulary of your newly trained tokenizer.
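If you are using the Hugging Face `tokenizers` library, the trainer accepts a `special_tokens` argument that reserves those tokens in the vocabulary. Here is a minimal sketch assuming a WordPiece model like BERT's; the vocab size and training corpus are placeholders:

```python
from tokenizers import Tokenizer, models, pre_tokenizers, trainers

# WordPiece model, as used by BERT-style architectures
tokenizer = Tokenizer(models.WordPiece(unk_token="[UNK]"))
tokenizer.pre_tokenizer = pre_tokenizers.Whitespace()

# Reserve the special tokens the architecture expects
trainer = trainers.WordPieceTrainer(
    vocab_size=1000,  # placeholder; use your real target size
    special_tokens=["[UNK]", "[CLS]", "[SEP]", "[PAD]", "[MASK]"],
)

# Placeholder corpus; train from your own text iterator or files
tokenizer.train_from_iterator(["hello world", "hello there"], trainer)

# The special tokens now have ids in the vocabulary
print(tokenizer.token_to_id("[MASK]"), tokenizer.token_to_id("[SEP]"))
```

After training, `token_to_id` should return a valid id (not `None`) for every special token, which you can use as a sanity check before wiring the tokenizer into the model.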