How to print a few examples at the beginning of training when using Trainer?

mralexis · June 29, 2021, 6:12am

I would like to see a few examples to manually verify the input and output. Previously in an older version of transformers there is a functionality of doing that. I am wondering whether there is something similar for trianer?

sgugger · June 29, 2021, 12:05pm

This is not something related to the Trainer, you just have to print some elements of your dataset. The functionality is implemented in all examples, see for instance this one

mralexis · June 29, 2021, 4:35pm

Thanks!
A follow-up question is how I could print out a few examples with output say per-batch? A concrete example would be given a seq2seqtrainer, I want to check the raw output during training as a measurement of progress. Would that be something doable?

sgugger · June 29, 2021, 5:06pm

You can get the first batch of the training dataloader by doing:

for batch in trainer.get_train_dataloader():
    break

and then access the sentences in batch. You will need to decode the inputs using your tokenizer though.

mralexis · June 29, 2021, 6:29pm

I see. But this only has the input, right? How could I access a few random examples of the output?

sgugger · June 29, 2021, 6:30pm

output = model(**batch)

will give you the output.

Topic		Replies	Views
Unpacking transformer's trainer.eval() to see every example's output, loss Intermediate	4	322	April 9, 2024
How to use the model from the chapter "Fine-tuning a model with the Trainer API" Course	0	320	April 17, 2024
Trainer does not print to console the loss (train and eval) Beginners	0	1735	June 24, 2023
BatchSampler - with trainer Beginners	0	199	July 20, 2023
How to get a Dataloader from a Trainer? Beginners	2	1880	March 5, 2024

How to print a few examples at the beginning of training when using Trainer?

Related topics