Referring to this Colab notebook: the RobertaConfig is initialized as follows:
from transformers import RobertaConfig

config = RobertaConfig(
    vocab_size=52_000,            # size of the trained tokenizer's vocabulary
    max_position_embeddings=514,  # why 514 rather than 512?
    num_attention_heads=12,
    num_hidden_layers=6,
    type_vocab_size=1,            # RoBERTa uses a single token type
)
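For what it's worth, the same two-position gap appears in the pretrained roberta-base checkpoint, so it looks deliberate rather than a typo in the notebook. A quick check (a minimal sketch, assuming transformers is installed and can download the checkpoint):

from transformers import AutoTokenizer, RobertaConfig

# The published roberta-base config also reserves 514 positions...
config = RobertaConfig.from_pretrained("roberta-base")
print(config.max_position_embeddings)  # 514

# ...while its tokenizer caps inputs at 512 tokens.
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
print(tokenizer.model_max_length)  # 512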
Why do we set max_position_embeddings to 514 when the maximum sequence length in the notebook is 512?
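For context, the 512 I'm referring to is the truncation limit the notebook applies when setting up the tokenizer, roughly like the sketch below (the vocab/merges paths are placeholders, not necessarily the notebook's exact ones):

from tokenizers import ByteLevelBPETokenizer

# Hypothetical paths; substitute the files produced by the tokenizer training step.
tokenizer = ByteLevelBPETokenizer(
    "./EsperBERTo/vocab.json",
    "./EsperBERTo/merges.txt",
)
# The notebook caps encoded sequences at 512 tokens here.
tokenizer.enable_truncation(max_length=512)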