Pass tokenizer or model arguments

nbroad · October 17, 2022, 1:28pm

When I use an ASR model, I get the following message in the logs:

/opt/conda/lib/python3.9/site-packages/transformers/generation_utils.py:1296: UserWarning: Neither max_length nor max_new_tokens has been set, max_length will default to 20 (self.config.max_length). Controlling max_length via the config is deprecated and max_length will be removed from the config in v5 of Transformers – we recommend using max_new_tokens to control the maximum length of the generation.

Typically with generative models, arguments like max_length can be passed to the pipeline object to control the tokenizer or model, but my attempts to do so in a request for an ASR endpoint did not work. I know I could create a custom handler, but I’m curious if there is a way to do it with the default endpoint.

Topic		Replies	Views
"What’s the Difference Between max_length and max_new_tokens?" 🤗Transformers	0	615	September 5, 2024
Confused about max_length and max_new_tokens 🤗Transformers	7	36277	September 5, 2024
OpenAi Whisper not giving full transcript using Interface Endpoint Models	0	481	November 17, 2022
Limit max # of tokens for inference in pipeline? Beginners	0	1080	April 7, 2023
How to set 'max_length' properly when using pipeline? 🤗Transformers	4	1617	November 18, 2024

Pass tokenizer or model arguments

Related topics