Callbacks for logging results to GPT2

sachin · February 16, 2022, 4:41am

I have a GPT2 model which I’m training (and works) that takes in an example input sequence and outputs a target sequence. I am able to log the model results of the validation set after the model has finished training but was hoping I could log after the end of an epoch, and was hoping to use callbacks.

What I’m hoping to log goes something like this:

class MyCallback(transformers.TrainerCallback):
    def on_epoch_end(args: TrainingArguments, state: TrainerState, control: TrainerControl, **kwargs):
        valid_dl = trainer.get_eval_dataloader()
        
        for x, _ in valid_dl:
            x_tokenized = tokenizer(x)
            output = tokenizer.decode(model.generate(x_tokenized))
            break
       logging_fn(x, output)

I do not know how to access model, tokenizer and trainer in the above scenario. Any thoughts on how I could achieve above effect?

sgugger · February 16, 2022, 3:33pm

That is not supported as it would let the user modify the state of the Trainer which would then mess up with the rest of the training loop.

Topic		Replies	Views
Trainer API to log both Training and Validation Metrics 🤗Transformers	2	1724	July 1, 2021
Trainer: log token count 🤗Transformers	0	251	October 19, 2023
How to monitor both train and validation metrics at the same step? 🤗Transformers	21	15441	July 29, 2021
Export logs while training 🤗Transformers	1	1280	December 6, 2022
How to get per-eval-step score when using trainer? 🤗Transformers	4	1930	October 18, 2022

Callbacks for logging results to GPT2

Related topics