Why does fine-tuning require creating two Trainers?

I’m following the tutorial here:

which creates two separate Trainers, one for training, the other for evaluation:


trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=small_train_dataset,
    eval_dataset=small_eval_dataset,
)

trainer.train()

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=small_train_dataset,
    eval_dataset=small_eval_dataset,
    compute_metrics=compute_metrics,
)
trainer.evaluate()
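
For comparison, here is what I expected to work instead, with a single Trainer (a minimal sketch reusing the tutorial's model, training_args, datasets, and compute_metrics; I haven't verified this is supported):

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=small_train_dataset,
    eval_dataset=small_eval_dataset,
    compute_metrics=compute_metrics,
)

trainer.train()     # fine-tune first
trainer.evaluate()  # then evaluate with the same Trainer instance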

Three questions:

  1. Why are two Trainers required? Why can’t we just create the Trainer once, call .train(), and then call .evaluate(), as in the single-Trainer sketch above?
  2. Why is the validation loss not computed during training? (See the TrainingArguments sketch after this list for what I expected.)
  3. Why is a custom compute_metrics function required? I’m referring to:

import numpy as np
from datasets import load_metric

metric = load_metric("accuracy")

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    predictions = np.argmax(logits, axis=-1)
    return metric.compute(predictions=predictions, references=labels)
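
Regarding question 2: if I understand the docs correctly, TrainingArguments has an evaluation_strategy option that should make the Trainer run evaluation during training, e.g. once per epoch. A sketch of what I mean (assuming a reasonably recent transformers version; output_dir here is just a placeholder):

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="test_trainer",    # placeholder output directory
    evaluation_strategy="epoch",  # run evaluation at the end of every epoch
)

Is this the intended way to get validation loss during training, and if so, why doesn’t the tutorial use it?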