Not a language model - removing word embedding weights from the model for lighter CUDA memory use

I'm working on a model that uses the BERT architecture, but it's not a language model.

I'm feeding the model precomputed embeddings via the `inputs_embeds` argument and reading the pooled output, so the word embedding layer is never used.
Is there a clean way to remove the word embedding weights (and any other layers I don't use)? It might make the model lighter and reduce the CUDA out-of-memory errors I keep running into.
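One approach I've tried as a sketch (assuming the Hugging Face `transformers` `BertModel`): when every forward pass supplies `inputs_embeds`, the `word_embeddings` lookup inside `BertEmbeddings` is never executed, so the submodule can be set to `None`. PyTorch allows assigning `None` over an existing submodule, which drops its parameters. The tiny `BertConfig` below is just for illustration; substitute your own config or checkpoint.

```python
import torch
from transformers import BertConfig, BertModel

# Small illustrative config so the example runs without downloading weights.
config = BertConfig(hidden_size=64, num_hidden_layers=2,
                    num_attention_heads=2, intermediate_size=128)
model = BertModel(config)

params_before = sum(p.numel() for p in model.parameters())

# We always pass `inputs_embeds`, so the word-embedding lookup is never
# called; dropping the submodule removes vocab_size * hidden_size weights.
model.embeddings.word_embeddings = None

params_after = sum(p.numel() for p in model.parameters())
print(f"removed {params_before - params_after} parameters")

# The forward pass still works with precomputed embeddings,
# and the pooled output is still available.
embeds = torch.randn(1, 8, config.hidden_size)
out = model(inputs_embeds=embeds)
pooled = out.pooler_output  # shape: (batch, hidden_size)
```

For `bert-base`-sized models the word embeddings are roughly 30k vocab x 768 hidden, about 23M parameters (~90 MB in fp32), so this helps, but note that CUDA OOM during training is often dominated by activations, so reducing batch or sequence length may matter more. Also note that position and token-type embeddings are still applied on top of `inputs_embeds`, which is usually what you want with this setup.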