Evaluate model at saved checkpoint

Hi, I'm using `Seq2SeqTrainer` for question generation (QG) and trying to inspect sample output from the model at a saved checkpoint, like so:

model = ProphetNetForConditionalGeneration.from_pretrained('squad_training_latest/results/checkpoint-212000')
…
    contexts = [tdata['context'][i]]
    answers = [tdata['answer'][i]]
    questions = [tdata['question'][i], tdata['question2'][i]]

    encoder_inputs, decoder_inputs = preprocess_batch(contexts, questions, answers)
    decoder_inputs = decoder_inputs.contiguous()

    question_ids = model.generate(encoder_inputs, early_stopping=False, return_dict_in_generate=False, eos_token_id=102, min_length=64)
    rv = tokenizer.batch_decode(question_ids, skip_special_tokens=False)
    print(rv)

The results look very bad and degenerate, e.g. ['[SEP] what was the the the the the name of the? [X_SEP] the the? name of the?'] for every example.

However, the eval loss was low, so I'm wondering whether I'm evaluating correctly. Should I use Trainer.predict instead, or load the model a different way?

(Side question: I noticed that Trainer.evaluate() returns teacher-forced predictions, which differ from the output of model.generate(). I was wondering why that is, and whether the predict_with_generate parameter has anything to do with it.)
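To make the side question concrete, here's a self-contained toy sketch of what I suspect is going on (nothing to do with ProphetNet; the corpus and all names here are made up). Under teacher forcing, every step is conditioned on the gold prefix, so per-step predictions (and loss) can look fine, while free-running generation conditions on the model's own outputs and can fall into a repetitive loop like my "the the the" output:

```python
# Toy greedy bigram "model" illustrating teacher forcing vs. free-running
# generation. Everything here is a made-up sketch, not my actual setup.
from collections import Counter, defaultdict

corpus = "what was the name of the first university in the state".split()

# "Train": count bigrams; predict the most frequent next word.
nxt = defaultdict(Counter)
for a, b in zip(corpus, corpus[1:]):
    nxt[a][b] += 1

def predict(prev):
    # Greedy next-word prediction from bigram counts.
    return nxt[prev].most_common(1)[0][0] if nxt[prev] else "<eos>"

# Teacher forcing: each prediction is conditioned on the GOLD prefix,
# so most steps match the reference and the "loss" looks low.
teacher_forced = [predict(w) for w in corpus[:-1]]

# Free-running generation: the model conditions on its OWN previous
# output, so one ambiguous step sends it into a repeating cycle.
tok, generated = corpus[0], [corpus[0]]
for _ in range(8):
    tok = predict(tok)
    generated.append(tok)

print("teacher-forced:", teacher_forced)
print("generated:     ", generated)  # degenerates into "the name of the name of ..."
```

The teacher-forced predictions match the reference on 8 of 10 steps, while the generated sequence loops, which is (I think) the same mismatch I'm seeing between the low eval loss and the bad generate() output.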