Hyperparameter tuning practical guide?

Hi, I have been having problems doing hyperparameter tuning on Google Colab, where it's always the GPU that runs out of memory.

Is there any practical advice you could give me for tuning BERT models? For example, in terms of environment settings, how many GPUs do I need so I don't run out of memory?

Note that tuning on the CPU works, but it takes ages.

I am using the Trainer API with Optuna.
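
Roughly, the setup looks like the sketch below (the model name and datasets are placeholders, and `fp16` is just one memory-saving option, not something specific to my problem):

```python
from transformers import (
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

def model_init():
    # hyperparameter_search needs model_init so each trial
    # starts from a fresh copy of the model
    return AutoModelForSequenceClassification.from_pretrained(
        "bert-base-uncased", num_labels=2
    )

training_args = TrainingArguments(
    output_dir="hp_search",
    fp16=True,  # optional: mixed precision to reduce GPU memory use
)

trainer = Trainer(
    model_init=model_init,
    args=training_args,
    train_dataset=train_dataset,  # placeholder: your tokenized train split
    eval_dataset=eval_dataset,    # placeholder: your tokenized eval split
)

# run the search with the default Optuna search space
best_run = trainer.hyperparameter_search(backend="optuna", n_trials=10)
```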

If your GPU can only take 16 as the batch size, then make sure the product of `per_device_train_batch_size` and `gradient_accumulation_steps` does not go beyond 16. You need to specify the ranges for both parameters such that no combination of values from the two ranges takes the effective batch size beyond 16, as in the sketch below.
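
One way to sketch that with the Trainer's `hyperparameter_search` is to suggest only the valid (batch size, accumulation) pairs as a single categorical, so Optuna can never sample a combination whose product exceeds 16. The `"batch_and_accum"` parameter name, the pair list, and the learning-rate range here are just illustrative:

```python
def hp_space(trial):
    # enumerate only pairs whose product stays at or below 16,
    # encoded as strings so the categorical choices stay fixed
    pair = trial.suggest_categorical(
        "batch_and_accum", ["4x1", "4x2", "4x4", "8x1", "8x2", "16x1"]
    )
    batch_size, accum_steps = (int(v) for v in pair.split("x"))
    return {
        "per_device_train_batch_size": batch_size,
        "gradient_accumulation_steps": accum_steps,
        # illustrative extra hyperparameter; adjust the range to your task
        "learning_rate": trial.suggest_float("learning_rate", 1e-5, 5e-5, log=True),
    }

best_run = trainer.hyperparameter_search(
    backend="optuna",
    hp_space=hp_space,
    n_trials=10,
)
```

The returned dict keys must be valid `TrainingArguments` fields, which is why the combined pair is unpacked into `per_device_train_batch_size` and `gradient_accumulation_steps` before returning.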