Limit max # of tokens for inference in pipeline?

PaulHoule · April 7, 2023, 3:00am

I’m following the first example for fine tuning a model, particularly I am tokenizing like so

# source is a dataset with text and label

tokenizer = AutoTokenizer.from_pretrained('bert-base-cased')
def tokenize_function(examples):
    return tokenizer(examples["text"], padding="max_length", truncation=True)
train = source.map(tokenize_function)
...
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train,
)
...

now I want to do some inference and I make a pipeline

p=pipeline("text-classification",model=model,tokenizer=tokenizer,device=0)

then I pass a list of strings through it

p(my_strings)

and get the error

RuntimeError: The size of tensor a (560) must match the size of tensor b (512) at non-singleton dimension 1

because one of the strings in the array is too long and makes too many tokens. When I was doing inference I was able to apply the max_length as a parameter to the tokenizer when I tokenized with the call method, but I can’t see how to configure it so that it is done inside the pipeline. I’ve tried some other approaches to doing inference without using the pipeline but the documentation keeps sending me back. What should I do?

Topic		Replies	Views
Why do Pipelines allow more than 512 tokens? Beginners	1	631	April 4, 2023
How to stop at 512 tokens when sending text to pipeline? 🤗Transformers	2	1446	February 7, 2024
Truncating sequence -- within a pipeline Beginners	7	5814	May 3, 2024
Why is the tensor produced by inference so big? Beginners	2	431	April 17, 2023
Tokenizer truncation Beginners	1	1791	June 14, 2022

Limit max # of tokens for inference in pipeline?

Related topics