The `Trainer` class is not built to optimize two models at the same time, so no, there is no easier way than subclassing and overriding `training_step`. In general, subclassing the `Trainer` and overriding the method(s) to fit your needs is the expected way, and we designed the `Trainer` API to make that as easy as possible.
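To make that concrete, here is a minimal sketch of what such a subclass could look like. `TwoModelTrainer`, `second_model`, and the way the two losses are combined are all illustrative assumptions, not an official recipe; the exact `training_step` signature and the mixed-precision/accumulation handling also vary between `transformers` versions, so check the implementation you are overriding.

```python
# Hypothetical sketch: a Trainer subclass that optimizes a second model
# alongside the main one. `second_model` and the 1:1 loss sum are
# assumptions for illustration.
import torch
from transformers import Trainer


class TwoModelTrainer(Trainer):
    def __init__(self, *args, second_model=None, **kwargs):
        super().__init__(*args, **kwargs)
        self.second_model = second_model

    def training_step(self, model, inputs):
        model.train()
        self.second_model.train()
        inputs = self._prepare_inputs(inputs)

        # Combine the losses of both models; the weighting between the
        # two terms depends entirely on your setup.
        loss = model(**inputs).loss + self.second_model(**inputs).loss

        if self.args.gradient_accumulation_steps > 1:
            loss = loss / self.args.gradient_accumulation_steps

        loss.backward()
        return loss.detach()
```

Note that for the second model's parameters to actually be updated, they must be in the optimizer, so you would typically also override `create_optimizer` (or pass a custom optimizer to the `Trainer`) to include both parameter sets.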
For predict/evaluate, yes: the `Trainer` needs tensors of the same size (with the exception of the batch dimension), otherwise it won’t be able to concatenate all the predictions. This is something we’ll look into more when we rewrite the token-classification examples (in the next few weeks).
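The constraint comes from how per-batch outputs are merged: `torch.cat` requires every dimension except the one being concatenated to match. A small sketch of the usual workaround, padding the sequence dimension to a common length (the shapes and the `-100` sentinel, often used to mark ignored token labels, are illustrative):

```python
import torch
import torch.nn.functional as F

# Two hypothetical batches of token-classification logits with
# different sequence lengths: (batch, seq_len, num_labels).
batch_a = torch.zeros(8, 12, 5)
batch_b = torch.zeros(8, 9, 5)

# Pad the shorter batch on the sequence dimension. F.pad takes pads
# from the last dimension backwards: (0, 0) leaves num_labels alone,
# (0, max_len - 9) pads seq_len on the right with the sentinel -100.
max_len = max(batch_a.shape[1], batch_b.shape[1])
padded_b = F.pad(batch_b, (0, 0, 0, max_len - batch_b.shape[1]), value=-100)

# Now all non-batch dimensions match, so concatenation works.
merged = torch.cat([batch_a, padded_b], dim=0)
print(merged.shape)  # torch.Size([16, 12, 5])
```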