ValueError: Expected input batch_size (16) to match target batch_size (64)

I’ve modeled my training script on the fine-tuning with custom datasets documentation (https://huggingface.co/transformers/custom_datasets.html).
I have both a custom dataset and a custom model (I used the run_language_modeling.py script to pretrain roberta-base on our raw texts).

When I run trainer.train() I get the error ValueError: Expected input batch_size (16) to match target batch_size (64) while the model is computing the loss in a training_step.

I don’t know where target batch_size is being set. The input batch_size matches the value I have for per_device_train_batch_size.
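For context, my setup roughly follows the tutorial's pattern. This is a simplified sketch, not my exact script: the texts, labels, output path, and the "roberta-base" checkpoint name are placeholders (in reality I load my own pretrained checkpoint), and the dataset wrapper is the one from the custom-datasets guide:

```python
import torch
from transformers import (
    RobertaForSequenceClassification,
    RobertaTokenizerFast,
    Trainer,
    TrainingArguments,
)

# Placeholder data -- in reality these come from our own corpus.
train_texts = ["first example sentence", "second example sentence"]
train_labels = [0, 1]

tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
train_encodings = tokenizer(train_texts, truncation=True, padding=True)

class CustomDataset(torch.utils.data.Dataset):
    """Wraps the tokenizer output and labels, as in the custom-datasets guide."""
    def __init__(self, encodings, labels):
        self.encodings = encodings
        self.labels = labels

    def __getitem__(self, idx):
        item = {key: torch.tensor(val[idx]) for key, val in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

    def __len__(self):
        return len(self.labels)

train_dataset = CustomDataset(train_encodings, train_labels)

model = RobertaForSequenceClassification.from_pretrained("roberta-base", num_labels=2)
training_args = TrainingArguments(output_dir="./results", per_device_train_batch_size=16)
trainer = Trainer(model=model, args=training_args, train_dataset=train_dataset)
trainer.train()  # <- fails here with the batch_size mismatch
```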

Does anyone have an idea?


I tried this using roberta-base as the model as well, and got the same error.

Hey Laurb,

Sorry, I know this is an old post, but did you manage to resolve this? I’ve got the same issue when using DistilBERT with a custom dataset, as in their tutorial. :frowning:
ValueError: Expected input batch_size (16) to match target batch_size (2848).

Sorry, I did resolve it, but I have no memory of how. I’m on transformers 4.9.2 now, no longer have the issue, and don’t need to make changes to their run_classification script. (I’m using the PyTorch version.)


Hello Rainiefantasy,

I know this is an old issue, but have you also managed to resolve this problem? Maybe you remember. I have the exact same problem…

Thank you!

Hello,

I am having a similar issue while running trainer.train():

ValueError: Expected input batch_size (664) to match target batch_size (8).

Checkpoint: bert-base-uncased
Dataset: jmamou/augmented-glue-sst2
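
My preprocessing is roughly the following. This is a simplified sketch, not my exact script; I’m assuming the GLUE SST-2 column names (sentence, label), a standard train split, and num_labels=2:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

dataset = load_dataset("jmamou/augmented-glue-sst2")
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def tokenize(batch):
    # "sentence" is assumed to be the text column, as in GLUE SST-2
    return tokenizer(batch["sentence"], truncation=True, padding="max_length", max_length=128)

tokenized = dataset.map(tokenize, batched=True)

# num_labels=2 assumed, since SST-2 is a binary task
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
training_args = TrainingArguments(output_dir="./out", per_device_train_batch_size=8)
trainer = Trainer(model=model, args=training_args, train_dataset=tokenized["train"])
trainer.train()  # <- raises the batch_size mismatch here
```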

Can anyone please help?
Thanks,


In my case it was because I was training a multi-label classification model, encoding the labels with sklearn’s MultiLabelBinarizer, but forgot to set the config parameter that tells the model it is a multi-label problem:

model.config.problem_type = "multi_label_classification"
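
For anyone hitting the same thing, here is a minimal sketch of the fix (the checkpoint and label names are just for illustration, not my actual setup):

```python
from sklearn.preprocessing import MultiLabelBinarizer
from transformers import AutoModelForSequenceClassification

# Toy multi-label targets: each example can carry several labels at once.
raw_labels = [["sports", "politics"], ["tech"], ["sports"]]

mlb = MultiLabelBinarizer()
# BCEWithLogitsLoss (used for multi-label) expects float targets,
# so cast the 0/1 multi-hot matrix to float32.
labels = mlb.fit_transform(raw_labels).astype("float32")  # shape: (num_examples, num_classes)

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",          # checkpoint name is just for illustration
    num_labels=len(mlb.classes_),
)
# The crucial line: without it, integer multi-hot labels fall through to the
# single-label cross-entropy path, and the flattened (batch * num_classes)
# label tensor is what produces the batch_size mismatch.
model.config.problem_type = "multi_label_classification"
```

The multi-hot float vectors then go into the dataset as one labels entry per example.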

Hi,

I’m also facing the same issue while running trainer.train().

Using bert-base-uncased

Basically, I split the dataset into sliding windows because the tokenizer has a maximum sequence length.

The sliding windows are generated and each window is mapped to its respective label.
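
Roughly, the windowing looks like the sketch below (simplified, not my exact code; I’m assuming the tokenizer’s return_overflowing_tokens here, and the text/label column names are placeholders):

```python
from datasets import Dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Toy data; the real texts are long enough to overflow the 512-token limit.
raw = Dataset.from_dict({"text": ["a very long document ..."], "label": [1]})

def chunk_examples(batch):
    # Split long texts into overlapping windows of at most 512 tokens.
    enc = tokenizer(
        batch["text"],                      # column name assumed
        truncation=True,
        max_length=512,
        stride=128,
        return_overflowing_tokens=True,
    )
    # One input example can produce several windows, so every window needs its
    # own copy of the label; if the labels are not duplicated per window, the
    # number of inputs no longer matches the number of labels.
    sample_map = enc.pop("overflow_to_sample_mapping")
    enc["labels"] = [batch["label"][i] for i in sample_map]
    return enc

windowed = raw.map(chunk_examples, batched=True, remove_columns=raw.column_names)
```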

When running trainer.train(), I’m getting the error below:

ValueError: Expected input batch_size (2040) to match target batch_size (6392)

Can anyone please help with the error?

Thanks