ValueError with Trainer

deadbod-81 · August 29, 2023, 2:47pm

Getting the following error when running trainer.train()

ValueError: You have to specify either input_ids or inputs_embeds

Here is my trainer

trainer = Trainer(
model=model,
args=training_args,
train_dataset=train_test_valid_dataset[“train”],
eval_dataset=train_test_valid_dataset[“test”],
compute_metrics=compute_metrics
)

Using the AutoTkenizer

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(“distilbert-base-uncased”)

def tokenize_function(examples):
return tokenizer(examples[“title”], padding=“max_length”, truncation=True)

tokenized_datasets = train_test_valid_dataset.map(tokenize_function, batched=True)

Can someone help out here ?

Topic		Replies	Views
I got 'ValueError: You have to specify either input_ids or inputs_embeds' when I am training GPT2 using huggingface Trainer Beginners	2	4866	February 19, 2023
Train encoder/decoder error Beginners	1	424	July 16, 2022
I get a "You have to specify either input_ids or inputs_embeds" error, but I do specify the input ids Beginners	6	21601	October 31, 2021
Error of 'input_ids' when using Transformers Trainer class with Encoder/Decoder model 🤗Transformers	0	2041	July 11, 2023
Fine-Tuning AutoModelWithLMHead Model 🤗Transformers	1	714	January 10, 2022

ValueError with Trainer

Related topics