Hi all, I’m trying to fine-tune LLaMA on a custom text dataset and want to check how it performs when evaluated on the MMLU dataset. Here’s what I’m using right now (eval_data and train_data are subsets of the custom dataset):
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_data,
    eval_dataset=eval_data,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False, pad_to_multiple_of=8),
)
If I replace eval_data with the MMLU dataset, i.e. by doing

eval_data = load_dataset('mmlu', 'abstract_algebra')

how do I tell the Trainer to use a binary loss (1 or 0 per example) instead of the cross-entropy loss it would use by default?
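Would I need to subclass Trainer and override compute_loss? Something like the sketch below is what I had in mind, but the class name, the label handling, and the "last token = answer" assumption are all just my own guesses, so I'm not sure this is the right hook (it would also affect the training loss, not just eval):

import torch
from transformers import Trainer

class BinaryLossTrainer(Trainer):
    def compute_loss(self, model, inputs, return_outputs=False, **kwargs):
        labels = inputs.get("labels")
        outputs = model(**inputs)
        # 0/1 per example: does the highest-logit token at the last position
        # match the gold label there? (not differentiable, so presumably
        # only meaningful for evaluation, not for training)
        preds = outputs.logits[:, -1, :].argmax(dim=-1)
        loss = (preds != labels[:, -1]).float().mean()
        return (loss, outputs) if return_outputs else loss

Or is the intended way to leave the loss alone and pass a compute_metrics function to the Trainer that reports per-question accuracy instead?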