I’ve fine-tuned CTRL, and both greedy decoding (with repetition_penalty=1.2 and temperature=0) and top-k sampling ({'do_sample': True, 'max_length': 64, 'top_k': 50, 'top_p': 0.95}) often produce empty sequences when varied examples are provided as context.
What usually causes this? I tried playing with the parameters in model.generate, but they didn’t seem to help much. Oddly, some intermediate checkpoints are able to generate text for the same sample.
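For reference, here is a minimal sketch of the two decoding setups I described, assuming a standard Hugging Face transformers workflow (the checkpoint path and prompt are placeholders, not my actual values):

```python
from transformers import CTRLLMHeadModel, CTRLTokenizer

# Placeholder path to the fine-tuned checkpoint.
tokenizer = CTRLTokenizer.from_pretrained("./ctrl-finetuned")
model = CTRLLMHeadModel.from_pretrained("./ctrl-finetuned")

# Placeholder context; CTRL expects a control code prefix like "Links".
input_ids = tokenizer("Links Some example context", return_tensors="pt").input_ids

# Greedy decoding: with do_sample=False, temperature has no effect;
# repetition_penalty discounts tokens that were already generated.
greedy_out = model.generate(
    input_ids,
    do_sample=False,
    max_length=64,
    repetition_penalty=1.2,
)

# Top-k / top-p sampling with the parameters from the post.
sampled_out = model.generate(
    input_ids,
    do_sample=True,
    max_length=64,
    top_k=50,
    top_p=0.95,
)

print(tokenizer.decode(greedy_out[0], skip_special_tokens=True))
print(tokenizer.decode(sampled_out[0], skip_special_tokens=True))
```

In this sketch, an “empty” generation corresponds to the model emitting the EOS token immediately after the prompt, so the decoded output contains nothing beyond the context.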