Addition for Migration Documentation

ktangri · August 10, 2020, 7:09pm

Hello,
I’ve been working on research related to the Commonsense Explanations dataset (CoS-E), whose code uses transformers at commit e14c6b52e37876ee642ffde49367c51b0d374f41. I decided to update the Salesforce code to the latest version of transformers and ran into an issue: the perplexity of the text generated by GPT was massive.

Long story short, I realized that the older version of the library didn’t implicitly shift the language modeling labels (for the causal language modeling that GPT uses), whereas the newer version does. As a result, after migrating to the new transformers library, my labels were all shifted over by one position. It would be helpful if this were added to the migration documentation here.
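To illustrate the kind of mismatch I mean, here is a minimal pure-Python sketch. It does not call transformers at all: `implicit_shift`, `accuracy`, and the "perfect model" predictions are hypothetical stand-ins I made up for illustration, and I am assuming the usual convention that positions labelled -100 are ignored by the loss.

```python
IGNORE_INDEX = -100  # assumed convention: positions with this label are skipped


def implicit_shift(predictions, labels):
    """Pair the prediction at position t with the label at position t + 1,
    dropping pairs whose label is IGNORE_INDEX (hypothetical helper)."""
    pairs = zip(predictions[:-1], labels[1:])
    return [(p, y) for p, y in pairs if y != IGNORE_INDEX]


def accuracy(pairs):
    """Fraction of pairs where the prediction equals the label."""
    return sum(p == y for p, y in pairs) / len(pairs)


input_ids = [10, 11, 12, 13, 14]

# Hypothetical perfect model: at position t it predicts the token at t + 1.
predictions = [11, 12, 13, 14, 15]

# Labels passed unshifted (a copy of input_ids); the shift happens once,
# inside implicit_shift, so every prediction lines up with its target.
correct = accuracy(implicit_shift(predictions, input_ids))

# Labels already shifted by the caller, then shifted again implicitly:
# every prediction is now compared against the token two steps ahead.
pre_shifted = input_ids[1:] + [IGNORE_INDEX]
double_shifted = accuracy(implicit_shift(predictions, pre_shifted))

print(correct, double_shifted)
```

In this toy setup the same predictions score perfectly with unshifted labels and match nothing once the labels are shifted twice, which is consistent with the perplexity blow-up I saw.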

