Let's think about BERT pair classification

thigner1 · June 4, 2022, 1:28am

BERT's positional embeddings support at most 512 tokens.
Suppose each of my texts is 512 tokens long and I want to pair them up for a similarity check.

The BERT paper says the input should be [CLS] first sequence [SEP] second sequence [SEP].

That means the first and second sequences combined, together with the special tokens, must not exceed 512 tokens.

So each sequence could only be about 256 tokens long (slightly fewer once the special tokens are counted).
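
To make the arithmetic concrete, here is a minimal sketch in plain Python of how the 512-token budget is split across a pair. The function name build_pair and the token lists are hypothetical, made up for illustration. The truncation rule (drop tokens from the end of whichever sequence is currently longer) is an assumption that resembles, but is not, the Hugging Face tokenizer behaviour you get by passing truncation=True and max_length=512 with two texts.

```python
MAX_LEN = 512      # BERT's maximum input length in tokens
NUM_SPECIAL = 3    # [CLS], [SEP] after the first sequence, [SEP] after the second


def build_pair(tokens_a, tokens_b, max_len=MAX_LEN):
    """Truncate two token lists so [CLS] A [SEP] B [SEP] fits in max_len tokens.

    Hypothetical helper for illustration only; not a library function.
    """
    if max_len < NUM_SPECIAL:
        raise ValueError("max_len must leave room for the special tokens")
    a, b = list(tokens_a), list(tokens_b)
    budget = max_len - NUM_SPECIAL  # tokens left for actual content
    # Drop tokens from the end of whichever sequence is currently longer.
    while len(a) + len(b) > budget:
        if len(a) > len(b):
            a.pop()
        else:
            b.pop()
    return ["[CLS]"] + a + ["[SEP]"] + b + ["[SEP]"]


# Two made-up inputs of 512 tokens each, as in the question.
first = [f"a{i}" for i in range(512)]
second = [f"b{i}" for i in range(512)]
pair = build_pair(first, second)
print(len(pair))  # total length, including the three special tokens
```

With two 512-token inputs, the content budget is 512 - 3 = 509 tokens, so one sequence keeps 255 tokens and the other 254. Everything beyond that is simply cut off, which is exactly the loss the question is about.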

My question is: if each of my texts is 512 tokens long and I want to build a pair similarity classifier, what should I do?
