Hidden States of OpenAI GPT2 inconsistent

fgab · October 25, 2021, 12:57pm

Hi, I am trying to use the OpenAI GPT2 and I just realized that the hidden states change every time I run the model and I cannot figure out why. When I use BertModel this does not happen.
Does anyone have an explanation for that?
Thank you so much in advance!

nielsr · October 25, 2021, 1:43pm

Do you run the model in evaluation mode?

i.e. model.eval()

=> this will turn off any dropout modules

BramVanroy · October 25, 2021, 2:27pm

IIRC GPT-2 does sampling which is not token-level deterministic.

Topic		Replies	Views
Returned Tensors and Hidden State Beginners	4	2653	September 5, 2020
Can hidden states be passed instead of input_ids or inputs_embeds in Transformers OpenAI GPT2 🤗Transformers	0	483	July 6, 2021
GPT2Model model output inconsistency between different transformers versions Intermediate	6	22	March 22, 2025
Looking for help with GPT-2 code Models	0	226	February 7, 2024
GPT2: hidden states get by output_hidden_states is different from those by register_forward_hook 🤗Transformers	0	87	November 19, 2024

Hidden States of OpenAI GPT2 inconsistent

Related topics