GPT-2 Perplexity Score Normalized by Sentence Length?

I am using the following code to calculate the perplexity of sentences, and I need to know whether the score is normalized by sentence length. If not, what do I need to change to normalize it?
Thanks!

import sys

import torch
import numpy as np
from transformers import GPT2Tokenizer, GPT2LMHeadModel

# Load pre-trained model (weights) and switch to evaluation mode
model = GPT2LMHeadModel.from_pretrained('gpt2')
model.eval()
# Load pre-trained model tokenizer (vocabulary)
tokenizer = GPT2Tokenizer.from_pretrained('gpt2')

def score(sentence):
    tokenize_input = tokenizer.encode(sentence)
    tensor_input = torch.tensor([tokenize_input])
    # disable gradient tracking around the forward pass
    # (wrapping the model loading in no_grad has no effect)
    with torch.no_grad():
        # when labels are given, the first element of the output is the loss
        loss = model(tensor_input, labels=tensor_input)[0]
    return np.exp(loss.item())

if __name__ == '__main__':
    for line in sys.stdin:
        if line.strip() != '':
            print(line.strip() + '\t' + str(score(line.strip())))
        else:
            break
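
If the returned loss turns out not to be normalized, I assume the change would be to sum the per-token negative log-likelihoods myself and divide by the number of predicted tokens before exponentiating. Something like this sketch is what I have in mind (reusing model and tokenizer from above; normalized_score is just a name I made up):

def normalized_score(sentence):
    tensor_input = torch.tensor([tokenizer.encode(sentence)])
    with torch.no_grad():
        logits = model(tensor_input).logits
    # position i predicts token i+1, so shift logits and labels by one
    shift_logits = logits[:, :-1, :]
    shift_labels = tensor_input[:, 1:]
    # summed negative log-likelihood over all predicted tokens
    total_nll = torch.nn.functional.cross_entropy(
        shift_logits.reshape(-1, shift_logits.size(-1)),
        shift_labels.reshape(-1),
        reduction='sum')
    # divide by the token count, then exponentiate: per-token perplexity
    return np.exp(total_nll.item() / shift_labels.numel())

Is that the right idea, or is the built-in loss already doing this averaging?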

Hey, did you find an answer to your question?
What is the right way (if there is a need at all) to normalize the perplexity number by sentence length? Should I divide by the number of tokens? I have reason to believe this averaging is already done inside the loss computation, but I'm not sure.
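
One way to check is to recompute the loss by hand and compare it to what the model reports. A minimal sketch (assuming a recent transformers version where the output exposes .loss; the test sentence is just a throwaway example):

import torch
import torch.nn.functional as F
from transformers import GPT2Tokenizer, GPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained('gpt2')
model = GPT2LMHeadModel.from_pretrained('gpt2').eval()

ids = torch.tensor([tokenizer.encode("The cat sat on the mat.")])
with torch.no_grad():
    out = model(ids, labels=ids)

# per-token negative log-likelihoods, computed by hand
per_token_nll = F.cross_entropy(
    out.logits[:, :-1, :].reshape(-1, out.logits.size(-1)),
    ids[:, 1:].reshape(-1),
    reduction='none')

# if these two numbers match, the built-in loss is the *mean* NLL per
# predicted token, so np.exp(loss) is already a length-normalized perplexity
print(out.loss.item())
print(per_token_nll.mean().item())

If they do match, there is no need to divide by the token count again; that would normalize twice.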

Hey! Where did you find this way of calculating the perplexity? Is it accurate?