I have assembled this Colab notebook to fine-tune LLaMA 7B adapter weights. Within TrainingArguments, in line 62, I would like to use logging_steps=1 so I can see the logs at every step. However, when I do use logging_steps=1, I get

ValueError: expected sequence of length 24 at dim 1 (got 58)

and as a consequence, the model won't be trained.
Why is that? And how can I avoid that error but still log the loss?
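For reference, the relevant part of the setup looks roughly like this; only logging_steps is the argument in question, and the other values are illustrative placeholders rather than the exact values from the notebook:

from transformers import TrainingArguments

# Sketch of the relevant configuration; only logging_steps is the point
# in question, the other values are illustrative placeholders.
training_args = TrainingArguments(
    output_dir="./llama-adapter-out",   # placeholder path
    per_device_train_batch_size=4,      # placeholder
    num_train_epochs=3,                 # placeholder
    logging_steps=1,                    # log the training loss at every step
)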
Side note:
After trainer.train(), I save the model and the tokenizer. Apparently, this uses the GPU and may cause OutOfMemoryError: CUDA out of memory.
You can just comment out these lines (or pay Colab for more GPU memory).
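The saving step I mean looks roughly like this (the output directory is a placeholder, and model / tokenizer are the objects from the earlier training cells):

# Sketch of the save step after trainer.train(); the directory name is a
# placeholder, and model / tokenizer come from the notebook's training cells.
save_dir = "./llama-adapter-finetuned"
model.save_pretrained(save_dir)       # writes the (adapter) weights
tokenizer.save_pretrained(save_dir)   # writes the tokenizer files alongside them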