Hi there! I am trying to work with a corpus I built that contains batch encodings of size 256 (I passed lists of 256 sentences at a time to a fast BERT tokenizer and pickled the outputs).
Unfortunately, when I try to pass this entire BatchEncoding object into my model to get prediction logits, the GPU runs out of memory.
I am now trying to “unpack” each BatchEncoding object into single inputs that can be fed to the model one at a time, but I cannot figure out how. Does anyone know how I can accomplish this?
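For context, here is roughly the kind of unpacking I have in mind. This is just a sketch using a plain dict of lists as a stand-in for the real BatchEncoding (the field names `input_ids` and `attention_mask` and the batch size of 8 are my assumptions); I am hoping the same slicing works on the actual tensor fields:

```python
# Hypothetical sketch: split a BatchEncoding-style dict into smaller
# mini-batches. `enc` stands in for the real pickled BatchEncoding;
# here it is a plain dict of lists so the example is self-contained.

def iter_minibatches(enc, batch_size=8):
    """Yield dicts whose values are aligned slices of every field."""
    n = len(enc["input_ids"])
    for start in range(0, n, batch_size):
        yield {k: v[start:start + batch_size] for k, v in enc.items()}

# Toy stand-in for one of my 256-example encodings.
enc = {
    "input_ids": [[101, 7592, 102]] * 256,
    "attention_mask": [[1, 1, 1]] * 256,
}

batches = list(iter_minibatches(enc, batch_size=8))
# I would then feed each batch to the model separately, e.g.
# `model(**batch)` inside a `torch.no_grad()` block.
```

With tensors the slice `v[start:start + batch_size]` should behave the same way, since slicing a tensor along the first dimension gives the corresponding rows, but I am not certain this is the intended way to take apart a BatchEncoding.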
Alternatively, does anyone know how much memory one of these BatchEncoding objects might take up? It does not seem reasonable to me for 256 BERT encodings to eat up 12 GB of GPU RAM.