I wanted to do a text classification task using TensorFlow, so I ran:
from transformers import TFBertForSequenceClassification

model = TFBertForSequenceClassification.from_pretrained("bert-base-uncased")
but when I immediately checked GPU memory usage with nvidia-smi (before the above operation only about 2 MB were in use), I saw:
| 0 NVIDIA GeForce ... On | 00000000:01:00.0 Off | N/A |
| 30% 53C P2 128W / 350W | 22999MiB / 24265MiB | 0% Default |
Is it normal that it takes roughly 20 GB? From what I found online, a plain BERT-base model should only need a few GB. This makes it very prohibitive to train with a large batch size (I wanted to use BATCH_SIZE=64). Any idea why this happens, or how to tell TensorFlow not to reserve so much memory?
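I've since come across tf.config.experimental.set_memory_growth, which supposedly makes TensorFlow allocate GPU memory on demand instead of reserving almost all of it up front. Would something like this sketch be the right fix (assuming it has to run before anything touches the GPU)?

import tensorflow as tf

# Enable on-demand allocation; this must run before any op initializes the GPU,
# otherwise the near-full reservation has already happened.
for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)

from transformers import TFBertForSequenceClassification
model = TFBertForSequenceClassification.from_pretrained("bert-base-uncased")

Or is it better to hard-cap the usage with tf.config.set_logical_device_configuration and a memory_limit?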