ValueError: expected sequence of length 1024 at dim 1 (got 507)

spencer-s · July 14, 2022, 5:41pm

I’ve been trying to use run_clm.py from the language modeling example section to fine tune GPT-neo 125m on a small data set I put together. (around 10 megs txt file) I’ve been trying to use sage maker to speed up the process using the example script given under the GPT-neo-125 train section. The problem I ran into was that I get the following message during training: “ValueError: expected sequence of length 1024 at dim 1 (got 507).” From what I can tell, the run_clm.py function group_texts is supposed to drop text that doesn’t conform to length requirements.

I started writing this post feeling at wit’s end, but as it turns out I managed to find a solution to the issue. It turns out that on lines 417-418 of run_clm.py there was a check that was allowing smaller inputs through. I’ve fixed it on my own branch seeing how I came across this board through my google searches, hopefully, someone else out there will find this useful

ChrisToukmaji · May 26, 2024, 7:17pm

Solved my issue, thanks! I was running an old version of run_clm.py
It has been fixed as of May 2023 for all future readers.

Topic		Replies	Views
ValueError: expected sequence of length 25 at dim 1 (got 43) Beginners	1	262	February 26, 2024
Fine-tung gpt-2 run_clm.py stops early Beginners	4	1560	November 3, 2020
Run_clm.py is very slow on gpu (used to take seconds) Beginners	0	895	May 20, 2021
Run_clm.py stops after some % with error Beginners	3	1410	August 8, 2022
ValueError: too many dimensions 'str' 🤗Transformers	0	3021	April 13, 2022

ValueError: expected sequence of length 1024 at dim 1 (got 507)

Related topics