Compute Perplexity using compute_metrics in SFTTrainer

How can I compute perplexity as a metric when using the SFTTrainer, logging it at the end of each epoch via the compute_metrics argument? I intend to pick the best checkpoint, i.e. the one with the lowest perplexity.

Here are the shapes of the logits and labels that go into the compute_metrics function: the logits are (50, 256, 50272), i.e. (total_records, seq_len, vocab_size), and the labels are (50, 256).

How can I compute perplexity in a compute_metrics function for a causal LM task?

from datasets import load_dataset
from trl import SFTTrainer

dataset = load_dataset("imdb")
trainer = SFTTrainer(
    "facebook/opt-350m",
    train_dataset=dataset["train"].select(range(50)),
    eval_dataset=dataset["test"].select(range(50)),
    dataset_text_field="text",
    max_seq_length=256,
    compute_metrics=...,  # logic goes here
    args=training_args,
)
trainer.train()
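
For the checkpoint-selection part, the training_args referenced above also need to evaluate and save at each epoch and to track the metric. A minimal sketch (the output directory is a placeholder, and metric_for_best_model must match the key returned by compute_metrics, which the Trainer prefixes with eval_):

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="opt-350m-sft",           # placeholder output directory
    evaluation_strategy="epoch",         # run evaluation (and compute_metrics) every epoch
    save_strategy="epoch",               # save a checkpoint at the same cadence
    load_best_model_at_end=True,         # reload the best checkpoint when training ends
    metric_for_best_model="Perplexity",  # logged as "eval_Perplexity"
    greater_is_better=False,             # lower perplexity is better
)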


Hi, I wrote this function to log perplexity at each logging step.

import torch
from torchmetrics.text import Perplexity  # assumes the torchmetrics Perplexity metric
from transformers import EvalPrediction

def compute_metrics(pred: EvalPrediction):
    # pred[0] holds the logits, pred[1] the labels; both arrive as numpy arrays
    predictions = torch.tensor(pred[0])
    targets = torch.tensor(pred[1])
    # ignore_index=-100 skips padded/masked label positions
    perplexity = Perplexity(ignore_index=-100)
    perplexity_score = perplexity(predictions, targets).item()
    return {"Perplexity": perplexity_score}

In the trainer, it is called like this:

trainer = SFTTrainer(
    **trainer_args,
    train_dataset=train_data,
    eval_dataset=valid_data,
    peft_config=peft_config,
    dataset_text_field="text",
    max_seq_length=config.block_size,
    compute_metrics=utils.compute_metrics,
)
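
If you would rather not depend on an external Perplexity metric, the same quantity can be computed directly as the exponential of the mean cross-entropy over the shifted, non-ignored tokens. A minimal sketch, assuming logits of shape (total_records, seq_len, vocab_size) and labels of shape (total_records, seq_len) padded with -100:

import math

import torch
import torch.nn.functional as F
from transformers import EvalPrediction

def compute_metrics(eval_pred: EvalPrediction):
    logits = torch.tensor(eval_pred.predictions)  # (total_records, seq_len, vocab_size)
    labels = torch.tensor(eval_pred.label_ids)    # (total_records, seq_len)
    # Shift so that the logits at position t are scored against the token at t + 1,
    # matching how the causal-LM training loss is computed.
    shift_logits = logits[:, :-1, :].contiguous()
    shift_labels = labels[:, 1:].contiguous()
    # Mean cross-entropy over real tokens only; -100 marks padding/ignored positions.
    loss = F.cross_entropy(
        shift_logits.view(-1, shift_logits.size(-1)),
        shift_labels.view(-1),
        ignore_index=-100,
    )
    return {"Perplexity": math.exp(loss.item())}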