Some functions when customizing trainer

ezio98 · July 16, 2021, 3:10am

Hi, it is glad to find the behavior of “Trainer” can be customized by overriding its methods. However, I am facing a problem with the originally existed functions. For example:

def training_step(self, model: nn.Module, inputs: Dict[str, Union[torch.Tensor, Any]]) -> torch.Tensor:
    ...
    if is_sagemaker_mp_enabled():
            scaler = self.scaler if self.use_amp else None
            loss_mb = smp_forward_backward(model, inputs, self.args.gradient_accumulation_steps, scaler=scaler)
            return loss_mb.reduce_mean().detach().to(self.args.device)
    ...
    loss = self.compute_loss(model, inputs)
    print(loss) # This is the only place I want to change
    return loss.detach()

This is the part of the code of method “training_step”, which I want to rewrite. Suppose I just want to print the loss in each training step without changing other codes. But apparently, I cannot import the function “is_sagemaker_mp_enabled()”, and thus I have to delete them. I don’t think this is a good solution and is there any elegant way?

Thanks for the help!

sgugger · July 16, 2021, 10:17am

There is no reason you shouldn’t be able to import is_sagemaker_mp_enabled from its location (transformers.file_utils).

ezio98 · July 19, 2021, 5:42am

Thanks! Now I can add extra logic without changing the original codes.

Topic		Replies	Views
Specify Loss for Trainer / TrainingArguments 🤗Transformers	5	21399	October 5, 2021
Supervised Fine-tuning Trainer - Loss function calculation Beginners	0	3339	September 6, 2023
Transformers replacing loss function 🤗Transformers	0	3371	March 26, 2022
Trainer code for token-wise prediction model Intermediate	0	436	June 6, 2022
Implementing a Trainer with custom loss produces key error 🤗Accelerate	2	3117	April 30, 2023

Some functions when customizing trainer

Related topics