I’ve been working through the guide from the PEFT docs, only slightly adjusting the preprocessing function to resolve one bug and substituting phi-2 for the original model. The trained model produces repeating output, for example:
['Tweet text : @NYTsupport i have complained a dozen times & yet my papers are still thrown FAR from my door. Why is this so hard to resolve? Label : complaintaintcomplaintaint']
I’m wondering what the cause could be and whether there is anything that can be done about it.
I found out where the problem was. It turns out the EOS token set in the model config doesn’t actually influence what happens during inference; it has to be passed as an argument to the generate method. [0] All credit for this solution should go to Eugenio-Schiavoni, who first wrote about it on github.com
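For anyone hitting the same wall, here is a minimal sketch of the fix, assuming the stock phi-2 checkpoint (swap in your fine-tuned adapter as needed); the prompt is just my example from above, and max_new_tokens and the pad_token_id fallback are my own choices, not from the guide:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/phi-2")
model = AutoModelForCausalLM.from_pretrained("microsoft/phi-2")

inputs = tokenizer(
    "Tweet text : @NYTsupport i have complained a dozen times & yet my papers "
    "are still thrown FAR from my door. Why is this so hard to resolve? Label : ",
    return_tensors="pt",
)

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=10,
        # The crucial part: pass the EOS token to generate() directly,
        # rather than relying on the value set in the model config.
        eos_token_id=tokenizer.eos_token_id,
        # Avoids a warning when no pad token is configured.
        pad_token_id=tokenizer.eos_token_id,
    )

print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```

With eos_token_id passed explicitly, generation stops after the label instead of repeating it.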