Trainer error for "albert-base-v2" due to batch size mismatch

saebom · July 7, 2022, 10:14pm

Hi all,

Previously I have been fine-tuning “bert-base-uncased” on my custom dataset (loaded from a csv file with datasets.load_dataset), and everything works fine when I use BERT with the Hugging Face Trainer. I have recently tried replacing “bert-base-uncased” with “albert-base-v2” (in both the model and tokenizer), but I have been stuck on this error message when trying to run ALBERT:

Expected input batch_size (4096) to match target batch_size (8)

Since per_device_train_batch_size=8, I am certain that the input dimension comes from 8 * 512 = 4096 where 512 is the length of an embedded vector, I think. It seems like the issue is that somewhere along the way in the model the batch matrix of embedded vectors gets smushed down to one vector.

I have tried everything to fix this bug, but I cannot work it out. My set up for the fine-tuning is exactly the same as the PyTorch set-up in [Fine-tune a pretrained model]. Any advice would be greatly appreciated

zzj0402 · April 11, 2023, 4:22am

You are not alone. Keep me posted. I am poking around the dataset batching mechanism but no luck.

Topic		Replies	Views
ValueError: Expected input batch_size (16) to match target batch_size (64) Beginners	7	5010	November 7, 2023
Albert giving OOM compared to Bert Models	0	327	December 10, 2020
Expected input batch_size (2048) to match target batch_size (4) Beginners	3	1603	May 23, 2022
AlbertForMaskedLM error- "view size is not compatible..." 🤗Transformers	1	1648	June 22, 2023
Error with runing bert question-answering fine-tuning 🤗Transformers	1	306	November 29, 2022

Trainer error for "albert-base-v2" due to batch size mismatch

Related topics