How to calculate perplexity from the `generate` function?

NeoFelix · December 30, 2022, 2:08pm

Hi,
I am trying to calculate the perplexity of sequences returned by the generate function. I use beam search as the decoding strategy, and I would like to get the perplexity over all output tokens of the third returned sentence (or some other one, not only the first).

To calculate the perplexity, I first need to calculate the loss, but I didn’t find a way to extract the logits from the generate function with beam search. I found that the scores are the “Beam transition scores for each vocabulary token at each generation step. Beam transition scores consisting of log probabilities of tokens conditioned on log softmax of previously generated tokens in this beam.” According to this post: [Announcement] GenerationOutputs: Scores, Attentions and Hidden States now available as outputs to generate, scores now correspond to all processed lm head logits + the current beam_scores for each output token. So I am confused about how I can extract the logits to calculate the loss, or calculate the perplexity directly from the generate function.

NeoFelix · January 2, 2023, 3:04pm

Any thoughts, @patrickvonplaten?

NeoFelix · January 2, 2023, 3:40pm

I have made a function for calculating ppl for one generated sentence:


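import math

import torch


def calculate_ppl(scores, sequence, rank):
    """
    calculate_ppl calculates the perplexity for one sequence

    Args:
        scores (Tuple[Tensor]): generation scores, one tensor per generation step
        sequence (Tensor): sequence of tokens
        rank (int): rank of the sequence according to its sequence score

    Returns:
        float: ppl for one sequence
    """
    # For every generation step, take the largest score over the vocabulary
    # in row `rank` of that step's score tensor.
    log_probs = [torch.max(score[rank]).item() for score in scores]
    # Average over (sequence length - 1) tokens, then exponentiate the negated mean.
    ppl = math.exp(-1 * (sum(log_probs) / (sequence.shape[1] - 1)))
    return ppl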

But I am not sure this is correct, because the ppl is extremely low in my case.
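For comparison, here is a minimal sketch of the textbook definition: perplexity is the exponential of the mean negative log-probability of the tokens that were actually generated, not of the highest-scoring entry at each step. It uses only the standard library, and the names `log_softmax`, `perplexity` and `toy_logits` are made up for illustration. It assumes you have raw per-step vocabulary logits for the sequence; as noted above, the scores returned by generate under beam search may also include the running beam scores, so they may not be usable directly as logits. One possible way to get such logits would be a separate forward pass over the generated sequence, but that is an assumption and not something this thread confirms.

```python
import math


def log_softmax(logits):
    """Turn one step's raw logits into log-probabilities (numerically stable)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def perplexity(step_logits, token_ids):
    """Perplexity of `token_ids`, given one list of vocabulary logits per step.

    step_logits[t] holds the logits used to predict token_ids[t].
    """
    if len(step_logits) != len(token_ids):
        raise ValueError("need exactly one logits vector per generated token")
    if not token_ids:
        raise ValueError("sequence must contain at least one token")
    # Log-probability of the token that was actually chosen at each step.
    token_log_probs = [
        log_softmax(logits)[tok] for logits, tok in zip(step_logits, token_ids)
    ]
    # exp of the mean negative log-likelihood.
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


if __name__ == "__main__":
    # Hypothetical toy data: 3 steps over a 4-token vocabulary, all logits equal.
    toy_logits = [[0.0, 0.0, 0.0, 0.0] for _ in range(3)]
    # A uniform distribution over 4 tokens gives a perplexity close to 4.
    print(perplexity(toy_logits, [0, 1, 2]))
```

With tensors, the same idea would be a `torch.log_softmax` over the vocabulary dimension followed by a `gather` on the generated token ids. If the value you get from the per-step maximum is far below what a calculation like this gives, that would suggest the scores are not plain per-token log-probabilities.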
