Variable length batch decoding

valhalla · October 4, 2020, 4:36pm

All tokenizers offer this functionality, just pass the list of seqs to it

tokens = tokenizer([s1, s2])["input_ids"]

by default it’ll pad all the seqs to the maximum length in the batch if they are of different length. You can find more detailed info in this guide

Topic		Replies	Views
Decoder generate with prompts of variable lengths? 🤗Transformers	0	664	May 25, 2022
Issue with Decoding in HuggingFace 🤗Tokenizers	2	3878	March 24, 2022
Batch generation with GPT2 🤗Transformers	12	17189	January 16, 2024
Parallelize Mistral/ llama2 output 🤗Transformers	1	154	May 25, 2024
Llama2 pad token for batched inference Models	7	15664	March 31, 2024