Variable length batch decoding

Hi All,

Just want to know, is there any way to batch-decode variable-length sentences?

For example, [S1, S2], where S1 has 5 words and S2 has 10 words. Can we decode them using GPT-2, BERT, etc.?

Hi @s4sarath

What do you mean by decoding: decoding the tokens generated by GPT-2, or making predictions on a batch of sequences?

By decoding I mean generating a sequence of tokens, @valhalla.

All tokenizers offer this functionality; just pass the list of sequences to the tokenizer:

tokens = tokenizer([s1, s2], padding=True)["input_ids"]

With padding=True it will pad all the sequences to the maximum length in the batch if they are of different lengths. You can find more detailed info in this guide.
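For example, a minimal sketch (the model name and sentences are just placeholders; it assumes a tokenizer that already ships a pad token, such as BERT's):

from transformers import AutoTokenizer

# assumes a checkpoint whose tokenizer already has a pad token (BERT does, GPT-2 does not)
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

s1 = "A short sentence."
s2 = "A somewhat longer sentence with a few more words in it."

# padding=True pads every sequence to the longest one in the batch;
# attention_mask marks real tokens (1) versus padding (0)
batch = tokenizer([s1, s2], padding=True, return_tensors="pt")
print(batch["input_ids"].shape)    # (2, length_of_longest_sequence)
print(batch["attention_mask"])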

@valhalla thanks for this. I have seen that. But when I tried the pad token with GPT-2 it didn't work as expected.

There is no pad token for GPT-2; you can manually set the EOS token as the pad token:

tokenizer.pad_token = tokenizer.eos_token
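Put together, a minimal sketch for GPT-2 (the sentences are just placeholders):

from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships no pad token, so reuse EOS

# with a pad token set, variable-length sentences can be padded to a common length
batch = tokenizer(
    ["A short sentence.", "A somewhat longer sentence than the first one."],
    padding=True,
    return_tensors="pt",
)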

Hi @valhalla. Thanks for the suggestion. I have done as you say; it is the same thing I tried before. But as you can see in the screenshot, decoding a batch of variable-length sentences does not produce correct results.

As this is an autoregressive model that predicts the next token based on the previous tokens, it might not generate correct tokens when there are EOS (padding) tokens in the input.

I thought you were asking about batching at training time. Sorry about the misleading answer.

Right now generate does not support batched generation for GPT-2.

Pinging @lysandre
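For reference, in more recent transformers versions batched generation with GPT-2 does work if the batch is padded on the left (so no EOS/padding tokens end up inside the autoregressive context) and the attention mask is passed to generate. A rough sketch, assuming such a version (prompts and lengths are just placeholders):

import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "left"  # keep padding out of the autoregressive context

model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

prompts = ["Hello, my name is", "The weather today is"]
batch = tokenizer(prompts, padding=True, return_tensors="pt")

with torch.no_grad():
    out = model.generate(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],  # tells generate which positions are padding
        max_length=30,
        pad_token_id=tokenizer.eos_token_id,
    )

print(tokenizer.batch_decode(out, skip_special_tokens=True))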


No problem @valhalla, I appreciate your response.
I have implemented this feature locally.

Off topic, but I would like to know what your thoughts are on this.

Not a TF user 🙂

Great, can you share your fix if possible? Lots of other people are interested in batched prediction for GPT-2.


I have implemented it in TF 2.0. I had to make quite a few changes to make it work. I will share it.