AutoTrain Advanced

gbwsolutions · November 28, 2023, 11:59am

I am using AutoTrain Advanced with Task set to Text Classification and Base Model set to achimoraites/roberta-base_ag_news. My uploaded .csv file has 11 class labels in the target column, but the base model has only 4 class labels. When I start training in a Hugging Face Space, I get the error below.

RuntimeError: Error(s) in loading state_dict for RobertaForSequenceClassification:
size mismatch for classifier.out_proj.weight: copying a param with shape torch.Size([4, 768]) from checkpoint, the shape in current model is torch.Size([11, 768]).
size mismatch for classifier.out_proj.bias: copying a param with shape torch.Size([4]) from checkpoint, the shape in current model is torch.Size([11]).
You may consider adding ignore_mismatched_sizes=True in the model from_pretrained method.

I am using AutoTrain Advanced to train and create my own model without writing any code. Based on this error, it seems I have to add ignore_mismatched_sizes=True somewhere. Where can I add it, and how can I resolve this issue?
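
For reference, this is roughly where the flag from the error message goes when the model is loaded directly with the transformers library instead of through AutoTrain. It is only a sketch under assumptions: the helper names build_label_maps and load_with_new_head are made up for illustration, and whether AutoTrain Advanced exposes the same option in its UI is not something this shows. With the flag set, the 4-class classifier head from the checkpoint is dropped and a new 11-class head is randomly initialized, so the model still has to be fine-tuned.

```python
def build_label_maps(labels):
    """Map each distinct class label to an integer id and back.

    Hypothetical helper: `labels` is the target column of the CSV
    as a plain list of strings.
    """
    classes = sorted(set(labels))
    label2id = {name: i for i, name in enumerate(classes)}
    id2label = {i: name for name, i in label2id.items()}
    return label2id, id2label


def load_with_new_head(base_model, label2id, id2label):
    """Load a checkpoint but replace its classifier head.

    Hypothetical helper around the real `from_pretrained` call.
    """
    # Imported lazily so build_label_maps works without transformers installed.
    from transformers import AutoModelForSequenceClassification

    return AutoModelForSequenceClassification.from_pretrained(
        base_model,
        num_labels=len(label2id),      # e.g. 11 instead of the checkpoint's 4
        label2id=label2id,
        id2label=id2label,
        ignore_mismatched_sizes=True,  # skip the 4-class weights that no longer fit
    )
```

With the 11 labels from the CSV passed to build_label_maps, calling load_with_new_head("achimoraites/roberta-base_ag_news", label2id, id2label) would download the checkpoint and should load it without the size mismatch error, since only the encoder weights are reused.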

Also, on what basis should I select the base model? I couldn't find any documentation or information about AutoTrain Advanced, and there is not much in the Hugging Face documentation either. Could you please point me to where I can find AutoTrain Advanced documentation for training and creating my own models?
