When I use Seq2SeqTrainer to train and evaluate, how do I change the decoding strategy. For example, during evaluation after each epoch, I want to use top_p decoding, where do I pass arguments such as do_sample=, top_p=
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Generation_max_length, generation_num_beams meaning in seq2seq | 0 | 426 | December 25, 2023 | |
Stochastic Sampling with Trainer.evaluate() Logits | 3 | 327 | May 6, 2024 | |
Predict beam size on Seq2SeqTrainer | 1 | 277 | July 15, 2021 | |
Using Seq2SeqTrainer to eval during training? | 1 | 1049 | November 27, 2021 | |
How can i implement custom model to use Seq2SeqTrainer class | 0 | 444 | November 8, 2023 |