I tried debugging it a little. I found the cause, or rather the trigger, but I'm not sure what the right fix is…
The trigger is that the evaluate function is asking the tokenizer to decode something it can't decode: the predictions are raw logits (floats), not token IDs.
# Decode token IDs into text
print("predictions:", predictions)
print("labels:", labels)
# predictions_str = tokenizer.batch_decode(predictions.tolist(), skip_special_tokens=True)  # fails: predictions is a <class 'numpy.ndarray'> of floats, e.g. "predictions: [[[1.87890625, 8.796875, 11.3359375, 8.6328125, 7.796875, 10.8125, 11.3046875, 13.625, 9.8046875, 11.984375, ..."
labels_str = tokenizer.batch_decode(labels.tolist(), skip_special_tokens=True)
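Those float values look like per-token logits over the vocabulary, shape `(batch, seq_len, vocab_size)`. A common fix (a sketch, assuming that shape holds for your model) is to collapse the vocab axis with `argmax` first, so `batch_decode` receives integer token IDs; the toy `logits` array below stands in for the real `predictions`:

```python
import numpy as np

# Toy logits standing in for `predictions`:
# shape (batch=1, seq_len=3, vocab_size=5).
logits = np.array([[[0.1, 2.5, 0.3, 0.0, 0.2],
                    [1.9, 0.2, 0.1, 3.3, 0.0],
                    [0.0, 0.1, 4.2, 0.3, 0.5]]])

# Collapse the vocab dimension to integer token IDs,
# which the tokenizer CAN decode.
pred_ids = np.argmax(logits, axis=-1)
print(pred_ids)  # [[1 3 2]]

# Then, with the real tokenizer:
# predictions_str = tokenizer.batch_decode(pred_ids.tolist(), skip_special_tokens=True)
```

If `predictions` really does have that three-level nesting, `np.argmax(predictions, axis=-1)` before the commented-out `batch_decode` call should make it decodable.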