I only have 25GB RAM and everytime I try to run the below code my google colab crashes. Any idea how to prevent his from happening. Batch wise would work? If so, how does that look like? max_q_len = 128 max_a_len = 64 def batch_encode(text, max_seq_len): return tokenizer.batch_encode_plus( …

Tokenizer.batch_encode_plus uses all my RAM

neuralpat March 22, 2021, 7:43am 2

Are you positive it’s actually the encoding that does it and not some other part of your code? Maybe you can show us the traceback?

Topic		Replies	Views
Tokenizer taking lot of memory 🤗Transformers	3	3517	April 16, 2023
Training tokenizer takes too much RAM 🤗Tokenizers	1	1338	February 21, 2022
Tokenizer.train() running out of memory 🤗Tokenizers	0	763	February 9, 2023
Huggingface distilbert-base-uncased-finetuned-sst-2-english runs out of ram with only a few kb? Beginners	0	375	May 12, 2022
Colab session crashing after using all available RAM Beginners	0	2424	January 16, 2021