Reduce the number of features of BERT embeddings

Hi everyone,
I am using an XXL BERT for my project.
I would like to test the network with an embedding dimension smaller than 768, for example 300.
I think I could try to perform a PCA on the embeddings.
Is there an implemented solution which does this?

Many thanks in advance

Hi, actually you could use a Dense layer (from sentence-transformers, here) and go from 768 to 300 with a bit of fine-tuning.
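
A minimal sketch of what that could look like (assuming a standard checkpoint such as `bert-base-uncased`; adjust the model name and output dimension to your setup):

```python
from torch import nn
from sentence_transformers import SentenceTransformer, models

# Base transformer producing 768-dimensional token embeddings
word_embedding_model = models.Transformer("bert-base-uncased")

# Mean pooling over tokens -> one 768-dim sentence embedding
pooling_model = models.Pooling(word_embedding_model.get_word_embedding_dimension())

# Dense layer projecting 768 -> 300
dense_model = models.Dense(
    in_features=pooling_model.get_sentence_embedding_dimension(),
    out_features=300,
    activation_function=nn.Tanh(),
)

model = SentenceTransformer(modules=[word_embedding_model, pooling_model, dense_model])
embeddings = model.encode(["An example sentence."])  # shape: (1, 300)
```

You'd still want to fine-tune the whole thing on your task so the Dense layer learns a useful projection rather than a random one.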

If you still want to use PCA, Hugging Face (as far as I know) doesn't have its own implementation, so I'd advise you to pick the best Python library you know and use that implementation.

For example, if you want to use the scikit-learn library, it has PCA as well as other cool stuff (this is the first example of scikit-learn PCA that I happened to see).
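
A quick sketch of what that might look like on already-computed embeddings (dummy random data here, just to show the shapes):

```python
import numpy as np
from sklearn.decomposition import PCA

# embeddings: a (num_sentences, 768) array of BERT sentence embeddings
embeddings = np.random.rand(1000, 768)

pca = PCA(n_components=300)
reduced = pca.fit_transform(embeddings)  # shape: (1000, 300)

# Fraction of variance kept by the 300 components
print(pca.explained_variance_ratio_.sum())
```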


Is using a Dense layer with a lower feature space a legitimate way of doing dimensionality reduction?
Also, are PCA and t-SNE good options for dimensionality reduction of embeddings computed from transformer-based models? I see them used a lot with Word2Vec / TF-IDF embeddings.