We fine-tuned the GPT2Model (distilgpt2) some time ago. Due to tool vulnerability issues, we have to upgrade transformers 4.48.0 or above. However, the exact same GPT2 model produces different outputs for the exact same input after the upgrading. It seems to me that the masked portion of the model o…

Perhaps this? SDPA is now default attention. [`GPT2`] Add SDPA support (#31172) committed 07:40AM - 19 Jun 24 UTC (UTC) [image] vasqu +191 -11 …

GPT2Model model output inconsistency between different transformers versions

John6666 March 21, 2025, 6:31pm 2

Possibly related this phenomenon.

Also, the part that has changed a lot recently is the KV cache-related area, which seems to have changed quite a bit.

Topic		Replies	Views
Inconsistent GPT2Model results between transformers versions Intermediate	7	42	July 19, 2025
Hidden States of OpenAI GPT2 inconsistent 🤗Transformers	2	294	October 25, 2021
Api and parameters change from transofrmers 2.5.1 to 3.5.1 for GPT2 🤗Transformers	0	245	January 4, 2021
GPT2 Implementation from scratch 🤗Transformers	0	400	August 11, 2020
Difference in output logits when using a subsection of the input sentence 🤗Transformers	0	376	January 8, 2023