I am using a Hugging Face model of type transformers.modeling_gpt2.GPT2LMHeadModel and beam search to generate text.
- Is there any way to get the probability that beam search computes for each returned sequence?
- Can I add a condition so that a text sequence is returned only when it crosses some threshold probability? (I sketched what I have in mind after the code below.)
The code below gives me the tokens of the 5 texts, but I need the probabilities of those 5 sequences.
import torch
from transformers import GPT2Tokenizer, GPT2LMHeadModel

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

model = GPT2LMHeadModel.from_pretrained("some/local/path").to(device)
tokenizer = GPT2Tokenizer.from_pretrained("some/local/path")

test_prefix = "Is this someth"
test_input_ids = tokenizer.encode(test_prefix, return_tensors="pt").to(device)

test_beam_outputs = model.generate(
    test_input_ids,
    max_length=len(test_prefix.split(' ')) + 6,  # note: max_length is measured in tokens
    num_beams=5,
    early_stopping=True,
    length_penalty=0.5,
    num_return_sequences=5,
    no_repeat_ngram_size=2
)
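From the docs, my understanding is that (in a recent version of transformers) passing return_dict_in_generate=True and output_scores=True to generate() makes it return the beam scores as well, in a sequences_scores field. I am not sure whether those are the actual sequence probabilities (I believe they are length-penalized sums of log-probabilities) or whether this is the right way to apply a threshold, but this is a rough sketch of what I have in mind; the 0.01 threshold is just a made-up example:

test_beam_outputs = model.generate(
    test_input_ids,
    max_length=len(test_prefix.split(' ')) + 6,
    num_beams=5,
    early_stopping=True,
    length_penalty=0.5,
    num_return_sequences=5,
    no_repeat_ngram_size=2,
    return_dict_in_generate=True,  # return a structured output instead of a plain tensor
    output_scores=True             # also return the scores of the returned sequences
)

sequences = test_beam_outputs.sequences                # the 5 returned token sequences
sequences_scores = test_beam_outputs.sequences_scores  # one score per sequence (log-probabilities with the length penalty applied, as far as I understand)

threshold = 0.01  # made-up threshold, just for illustration
for seq, score in zip(sequences, sequences_scores):
    prob = torch.exp(score)  # convert the log score back to a probability
    if prob >= threshold:
        print(tokenizer.decode(seq, skip_special_tokens=True), prob.item())

Is this the correct way to get and threshold the sequence probabilities, or is there a better approach?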