"The model did not return a loss from the inputs"

Beginner-MLer · March 23, 2025, 12:03am

AI & tokenizer is GPT-2. You can ask me for more specific details about my code or Transformers setup.

Some parts were taken from the documentation and modified to my liking.

I am making a comment generator based on the provided comment list from my files. However, whenever I try and train the AI (GPT-2) on my data, it returns this error:

Please note that I’m new to this type of stuff, so the issue is most likely more clearer to you all experienced people. Here’s my code, sorry if it’s messy:

from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments, Trainer
from datasets import load_dataset
import evaluate, numpy, json
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2", is_split_into_words=True)

if tokenizer.pad_token is None:
    tokenizer.add_special_tokens({'pad_token': '[PAD]'})
    model.resize_token_embeddings(len(tokenizer))

listOfJSONS = []

with open("aCommentsList.txt", "r") as cmt:
        split = cmt.read().split("\n")

        current = {"text": []}
        stop = 2000

        for i, data in enumerate(split):
                if i == stop:
                        stop += 2000
                        open(f"input/json{stop / 2000}.json", "w").write(json.dumps(current))
                        current = {"text": []}
                        listOfJSONS.append(stop / 2000)

                if len(data.strip()) == 0: continue
                current["text"].append(data)

def tokenize(examples):
    if isinstance(examples["text"], list):
        examples["text"] = [str(text) for text in examples["text"]]
    else:
        examples["text"] = str(examples["text"])
    return tokenizer(examples["text"], padding="max_length", truncation=True, return_tensors="pt")

def metrics(_eval):
        logits, labels = _eval
        predictions = numpy.argmax(logits, axis=-1)
        return metric.compute(predictions=predictions, references=labels)

metric = evaluate.load("accuracy")
arguments = TrainingArguments(output_dir="AIOutput", eval_strategy="epoch")

def doSomeStuff():
        dataset = load_dataset("json", data_dir="input", split="train").train_test_split(train_size=1, test_size=1)
        name = ["name"] * len(dataset["train"])
        labels = ["label"] * len(dataset["train"])

        dataset["train"].add_column("name", name)
        dataset["train"].add_column("label", labels)

        tokenized = dataset.map(tokenize, batched=True)

        trainDataset = tokenized["train"].shuffle(seed=42).select(range(1))
        evalDataset = tokenized["test"].shuffle(seed=42).select(range(1))

        trainer = Trainer(
                model=model,
                args=arguments,
                train_dataset=trainDataset,
                eval_dataset=evalDataset,
                compute_metrics=metrics
        )

        trainer.train()

doSomeStuff()

As you can see, I attempted to combat this issue by attempting to create a name and label table, but it only put more gasoline on the fire. How do I prevent this issue?

Thanks for your support, this issue has been bugging me for hours.

John6666 · March 23, 2025, 5:00am

I don’t think the code is too confusing.
Anyway, it seems you have to give it “text” and “labels”. The error messages are hard to understand…

github.com/huggingface/transformers

ValueError: The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_ids,attention_mask,pixel_values,aspect_ratio_ids,aspect_ratio_mask,cross_attention_mask

opened 07:16PM - 24 Oct 24 UTC

closed 08:04AM - 02 Dec 24 UTC

youky860423

bug

### System Info ValueError: The model did not return a loss from the inputs, on…ly the following keys: logits. For reference, the inputs it received are input_ids,attention_mask,pixel_values,aspect_ratio_ids,aspect_ratio_mask,cross_attention_mask ### Who can help? _No response_ ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [ ] An officially supported task in the `examples` folder (such as GLUE/SQuAD, ...) - [X] My own task or dataset (give details below) ### Reproduction 1. Call trainer.train(); 2. system return error for no logits ### Expected behavior successfully running forward and compute loss

mahmutc · March 23, 2025, 8:01am

Have you tried adding a DataCollator?

Topic		Replies	Views
Key Error 'loss' while fine tuning GPT-2 with the Trainer utility 🤗Transformers	9	7477	May 10, 2022
I tired and can't solve this error , ValueError: The model did not return a loss from the inputs, only the following keys: logits. For reference, the inputs it received are input_ids,attention_mask Models	1	1166	March 29, 2023
ValueError: The model did not return a loss from the inputs, only the following keys: last_hidden_state, past_key_values. For reference, the inputs it received are input_ids, attention_mask Beginners	3	960	February 16, 2024
【Solved】How can I get loss by using trainer when training gpt2? Beginners	3	948	July 21, 2022
Model did not return a loss --- but why? 🤗Transformers	0	752	April 27, 2023

"The model did not return a loss from the inputs"

Some parts were taken from the documentation and modified to my liking.

Related topics