After some more digging I’ve discovered that the issue is related to this one.
The padding strategies are `['longest', 'max_length', 'do_not_pad']`, and the problem seems to be improper padding. If you set the strategy to `do_not_pad`, it works (for a single sentence): `tokenizer(example)`.
But even when I explicitly set the tokenizer's `max_length` to a number, the issue persists.
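For anyone following along, here is a minimal sketch of what the three strategies are supposed to do, written as a plain-Python `pad_batch` helper (a hypothetical function for illustration, not the transformers internals): `longest` pads each sequence up to the longest one in the batch, `max_length` pads up to a fixed length you must supply, and `do_not_pad` leaves sequences as-is.

```python
def pad_batch(sequences, strategy, max_length=None, pad_id=0):
    """Illustrate the three padding strategies on lists of token ids."""
    if strategy == "do_not_pad":
        # No padding: each sequence keeps its own length.
        return [list(s) for s in sequences]
    if strategy == "longest":
        # Pad everything up to the longest sequence in this batch.
        target = max(len(s) for s in sequences)
    elif strategy == "max_length":
        # Pad everything up to a fixed, caller-supplied length.
        if max_length is None:
            raise ValueError("'max_length' strategy requires max_length")
        target = max_length
    else:
        raise ValueError(f"unknown padding strategy: {strategy!r}")
    return [list(s) + [pad_id] * (target - len(s)) for s in sequences]


# 'longest' pads to the batch maximum (here, length 3).
print(pad_batch([[1, 2, 3], [4]], "longest"))       # → [[1, 2, 3], [4, 0, 0]]
# 'max_length' pads to the fixed length 5.
print(pad_batch([[1, 2, 3], [4]], "max_length", max_length=5))
# 'do_not_pad' returns ragged sequences unchanged.
print(pad_batch([[1, 2, 3], [4]], "do_not_pad"))    # → [[1, 2, 3], [4]]
```

Note that `max_length` on its own does nothing unless the padding (or truncation) strategy actually consumes it, which may be why setting it alone did not help here.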