BertForNextSentencePrediction with larger batch size

DLiebman · May 17, 2021, 8:33pm

Is there a way to use BertForNextSentencePrediction in inference mode with a batch size larger than 1? Here is my code:

encoding = self.tokenizer(prompt1, prompt2, return_tensors='pt', padding=True, truncation=True, add_special_tokens=True)
print(encoding)
outputs = self.model(**encoding, next_sentence_label=torch.LongTensor([1]), target_batch_size=10)
logits = outputs.logits
#print(logits)

Here prompt1 and prompt2 are lists of sentences, each 10 sentences long. I get an error like this:

ValueError: Expected input batch_size (10) to match target batch_size (1).

lewtun · May 18, 2021, 1:22pm

hey @DLiebman, I think the problem is that you're passing a batch of 10 examples but only a single label in the next_sentence_label argument, so the loss sees 10 predictions and 1 target. changing your code to the following works for me:

outputs = model(**encoding, next_sentence_label=torch.ones((10,1), dtype=torch.long), target_batch_size=10)
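The mismatch can be reproduced without BERT at all. This is a minimal sketch, assuming the next-sentence loss is an ordinary cross-entropy over logits of shape (batch, 2) and that the labels are flattened before the loss is taken, which is my understanding of the model code. The tensor names are illustrative only. Note also that target_batch_size does not appear to be a documented argument of the model's forward method, so it is probably being ignored here rather than doing anything useful.

```python
import torch
import torch.nn.functional as F

batch_size = 10
# Stand-in for the NSP head output: one row of two scores per sentence pair.
logits = torch.zeros(batch_size, 2)

# A single label for a batch of 10 rows: cross-entropy rejects the shapes.
mismatch_caught = False
try:
    F.cross_entropy(logits, torch.tensor([1]))
except (ValueError, RuntimeError):
    mismatch_caught = True

# One label per pair. A (10, 1) tensor flattened with view(-1) has the
# same 10 entries as a plain (10,) tensor, so either shape lines up.
labels = torch.ones((batch_size, 1), dtype=torch.long)
loss = F.cross_entropy(logits, labels.view(-1))
```

The point is only that the number of labels has to equal the number of sentence pairs in the batch; the label values themselves (all ones here) are placeholders.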

DLiebman · May 18, 2021, 3:46pm

yep. something like that will work. thanks.
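For pure inference, the labels can probably be left out altogether, since they are only needed to compute a loss. The sketch below assumes a recent transformers version in which the label argument is named labels (with next_sentence_label kept as a deprecated alias, as far as I know). It uses a tiny randomly initialised model with made-up sizes and random token ids in place of the tokenizer output, so that it runs without downloading anything; with a real checkpoint the encoding would come from the tokenizer call shown in the question.

```python
import torch
from transformers import BertConfig, BertForNextSentencePrediction

# Tiny, randomly initialised model (hypothetical sizes) so the sketch runs
# offline; real use would load weights with from_pretrained(...).
config = BertConfig(vocab_size=100, hidden_size=32, num_hidden_layers=1,
                    num_attention_heads=2, intermediate_size=64)
model = BertForNextSentencePrediction(config)
model.eval()

batch_size, seq_len = 10, 8
# Random ids stand in for the tokenizer's padded batch encoding.
encoding = {
    "input_ids": torch.randint(0, config.vocab_size, (batch_size, seq_len)),
    "attention_mask": torch.ones(batch_size, seq_len, dtype=torch.long),
    "token_type_ids": torch.zeros(batch_size, seq_len, dtype=torch.long),
}

with torch.no_grad():
    # Inference only: no labels are passed, so no loss is computed.
    outputs = model(**encoding)
    # With one label per pair, the output also carries a scalar loss.
    scored = model(**encoding, labels=torch.ones(batch_size, dtype=torch.long))

logits = outputs.logits          # shape: (batch_size, 2)
# Column 0: sentence B continues sentence A; column 1: B is a random sentence.
probs = logits.softmax(dim=-1)
```

Each row of probs then corresponds to one (prompt1, prompt2) pair in the batch, in the same order as the input lists.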
