Fine-Tuning AutoModelWithLMHead Model

Stimmot · January 3, 2022, 1:58pm

Hi everyone,

I want to fine-tune the AutoModelWithLMHead model from this repository, which is a German GPT-2 model.

I have prepocessed a bunch of text passages for the fine-tuning, but when beginning training, I receive the following error (copied with a little context):

File "GPT\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
    return forward_call(*input, **kwargs)
  File "GPT\lib\site-packages\transformers\models\gpt2\modeling_gpt2.py", line 774, in forward
    raise ValueError("You have to specify either input_ids or inputs_embeds")
ValueError: You have to specify either input_ids or inputs_embeds

It’s asking for either input ids or embeddings, which I thought I provided by instantiating the trainer. Here’s my code for the preparation of the model:

# Load data
with open("Fine-Tuning Dataset/train.txt", "r", encoding="utf-8") as train_file:
    train_data = train_file.read().split("--")

with open("Fine-Tuning Dataset/test.txt", "r", encoding="utf-8") as test_file:
    test_data = test_file.read().split("--")

# Load pre-trained tokenizer and prepare input
tokenizer = AutoTokenizer.from_pretrained('dbmdz/german-gpt2')

tokenizer.pad_token = tokenizer.eos_token
train_input = tokenizer(train_data, padding="longest")
test_input = tokenizer(test_data, padding="longest")

# Define model

model = AutoModelWithLMHead.from_pretrained("dbmdz/german-gpt2")
training_args = TrainingArguments("test_trainer")

# Evaluation

metric = load_metric("accuracy")

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    predictions = numpy.argmax(logits, axis=-1)
    return metric.compute(predictions=predictions, references=labels)

# Train
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_input,
    eval_dataset=test_input,
    compute_metrics=compute_metrics,
)
trainer.train()
trainer.evaluate()

Does anyone know the cause for this? Any help is gladly appreciated! Thank you.

sgugger · January 10, 2022, 3:48pm

It looks like your train_input is not a dataset containing the "input_ids", as expected by the model. Look at train_input[0] for instance to see which keys it contains.

Topic		Replies	Views
Error when fine-tuning AutoModelWithLMHead Model 🤗Transformers	0	399	January 4, 2022
I got 'ValueError: You have to specify either input_ids or inputs_embeds' when I am training GPT2 using huggingface Trainer Beginners	2	4828	February 19, 2023
ValueError with Trainer Beginners	0	428	August 29, 2023
ValueError: You have to specify either decoder_input_ids or decoder_inputs_embeds 🤗Transformers	3	1769	November 14, 2023
ValueError: The model did not return a loss from the inputs, only the following keys: last_hidden_state, past_key_values. For reference, the inputs it received are input_ids, attention_mask Beginners	3	947	February 16, 2024

Fine-Tuning AutoModelWithLMHead Model

Related topics