How to apply decoding method and penalty

Pivitin · December 22, 2023, 9:32pm

I’m trying to apply some parameters in my code below, anyone know how to apply them? Is it possible in google/flan-t5-xl model?

parameters I want to apply:

{
    "decoding_method": "greedy",
    "max_new_tokens": 5,
    "repetition_penalty": 1
}

Code from google/flan-t5-xl model:

from transformers import T5Tokenizer, T5ForConditionalGeneration
tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-xl", decoding_method="greedy",)
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-xl")
input_ids = tokenizer(prompt_input, return_tensors="pt", ).input_ids
outputs = model.generate(input_ids, max_new_tokens=5)
print(f"Sentimental: {tokenizer.decode(outputs[0], skip_special_tokens=True)}")

nielsr · December 23, 2023, 11:23am

Hi,

Sure. For decoding, the generate method can be used, and it uses greedy decoding by default, so that’s ok. You can pass the additional arguments as keyword arguments:

from transformers import T5Tokenizer, T5ForConditionalGeneration
tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-xl", decoding_method="greedy",)
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-xl")
input_ids = tokenizer(prompt_input, return_tensors="pt", ).input_ids

generation_kwargs = {"max_new_tokens": 5, "repetition_penalty": 1}
outputs = model.generate(input_ids, **generation_kwargs)
print(f"Sentimental: {tokenizer.decode(outputs[0], skip_special_tokens=True)}")

Topic		Replies	Views
Unable to use Constrained beam search with google/flan-t5-base 🤗Transformers	1	377	October 20, 2023
Confused about max_length and max_new_tokens 🤗Transformers	7	36058	September 5, 2024
How to run T5 with Accelerator/XLA 🤗Accelerate	0	592	May 18, 2023
Minimum number of tokens in generate Models	0	1062	March 10, 2023
T5 Model Generate and Model Outputs Vastly Different Beginners	1	813	September 11, 2022

How to apply decoding method and penalty

Related topics