I trained a BERT-based encoder-decoder model: ed_model
I tokenized the input with:
txt = "I love huggingface"
inputs = input_tokenizer(txt, return_tensors="pt").to(device)
print(inputs)
The output clearly shows that input_ids is in the returned dict:
{'input_ids': tensor([[ 101, 5660, 7975, 2127, 2053, 2936, 5061, 102]], device='cuda:0'), 'token_type_ids': tensor([[0, 0, 0, 0, 0, 0, 0, 0]], device='cuda:0'), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1]], device='cuda:0')}
But when I try to predict, I get this error:
ed_model.forward(**inputs)
ValueError: You have to specify either input_ids or inputs_embeds
Any ideas?
ugoren
Yes, thank you!
Solved the issue.
Do you happen to have any thoughts on this as well?
Hi @ugoren, how did you solve this issue? I encountered the same issue trying to train the EncoderDecoderModel using the Seq2SeqTrainer.
ugoren
Add a “decoder_” prefix
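For example, something along these lines should get past that error (a rough sketch; output_tokenizer and target_text are placeholder names for the decoder-side tokenizer and target sentence, they are not from the original code):

# Rough sketch: output_tokenizer / target_text are placeholders for the
# decoder-side tokenizer and target sentence of your task.
target_text = "some target sentence"
decoder_inputs = output_tokenizer(target_text, return_tensors="pt").to(device)

outputs = ed_model(
    input_ids=inputs.input_ids,
    attention_mask=inputs.attention_mask,
    decoder_input_ids=decoder_inputs.input_ids,
    decoder_attention_mask=decoder_inputs.attention_mask,
)

For pure inference you can instead call ed_model.generate(input_ids=inputs.input_ids, attention_mask=inputs.attention_mask), which builds the decoder input ids itself (provided config.decoder_start_token_id is set).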
Yeah, I did just that, but still got the error (transformers==4.9.2):
batch["attention_mask"] = inputs.attention_mask
batch["input_ids"] = inputs.input_ids
batch["token_type_ids"] = inputs.token_type_ids
batch["decoder_input_ids"] = outputs.input_ids.copy()
batch["labels"] = outputs.input_ids.copy()
Here outputs comes from tokenizing the target translations. I guess the error I got was something else.
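In case it helps others wiring this up for Seq2SeqTrainer, here is a rough sketch of how such a batch can be built (assuming both tokenizers return PyTorch tensors, so .clone() rather than .copy(); src_texts, tgt_texts and output_tokenizer are placeholder names), with pad positions in the labels set to -100 so the loss ignores them:

enc = input_tokenizer(src_texts, return_tensors="pt", padding=True)
dec = output_tokenizer(tgt_texts, return_tensors="pt", padding=True)

batch = {}
batch["input_ids"] = enc.input_ids
batch["attention_mask"] = enc.attention_mask
batch["token_type_ids"] = enc.token_type_ids
batch["decoder_input_ids"] = dec.input_ids
batch["decoder_attention_mask"] = dec.attention_mask

# Mask pad token ids in the labels so they do not contribute to the loss
labels = dec.input_ids.clone()
labels[labels == output_tokenizer.pad_token_id] = -100
batch["labels"] = labels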
I am facing the same issue.
@ugoren can you please elaborate on your solution?