Can we resize token embeddings with the new embedding weights initialized differently?

When we add new tokens, `resize_token_embeddings` automatically adds new embedding rows using a `torch.nn.Embedding`:
https://huggingface.co/transformers/_modules/transformers/modeling_utils.html#PreTrainedModel.resize_token_embeddings

The documentation says the resized embeddings are an `nn.Embedding`, which by default initializes its weights from N(0, 1) (https://pytorch.org/docs/stable/generated/torch.nn.Embedding.html). But when I checked, the resized embedding weights look closer to N(0, 0.01) or N(0, 0.02). How can I check the true distribution of the resized embedding weights?
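For reference, here is a minimal sketch of how I have been inspecting the new rows (assuming a BERT-style checkpoint; the specific model name and added tokens are just placeholders):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "bert-base-uncased"  # assumption: any HF checkpoint should behave similarly
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

old_vocab_size = model.get_input_embeddings().weight.shape[0]
tokenizer.add_tokens(["[NEW_TOKEN_1]", "[NEW_TOKEN_2]"])
model.resize_token_embeddings(len(tokenizer))

# Rows at indices >= old_vocab_size are the freshly initialized ones.
new_rows = model.get_input_embeddings().weight[old_vocab_size:]
print(f"mean={new_rows.mean():.4f}, std={new_rows.std():.4f}")
# For BERT this prints std ≈ 0.02 (config.initializer_range),
# not the N(0, 1) default of a bare torch.nn.Embedding.
```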

If I want the new embedding weights initialized differently, how can I achieve that efficiently?
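One approach I am considering is to resize first and then overwrite only the new rows afterwards. A minimal sketch under that assumption (the mean-of-existing-embeddings trick is just one option, not something `resize_token_embeddings` does for you):

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "bert-base-uncased"  # assumption: placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

old_vocab_size = model.get_input_embeddings().weight.shape[0]
tokenizer.add_tokens(["[NEW_TOKEN_1]"])
model.resize_token_embeddings(len(tokenizer))

# Overwrite only the freshly added rows; the pretrained rows stay intact.
with torch.no_grad():
    embeddings = model.get_input_embeddings().weight
    # Option A: initialize new tokens to the mean of the existing embeddings.
    embeddings[old_vocab_size:] = embeddings[:old_vocab_size].mean(dim=0)
    # Option B: draw from any distribution you prefer instead, e.g.:
    # embeddings[old_vocab_size:].normal_(mean=0.0, std=0.5)
```

Is there a cleaner way than post-hoc overwriting like this?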
