Instantiating multiple heads at the same time

shensmobile · October 18, 2022, 6:29pm

I have been using the same backbone/language model (BERT) that I have used for several different classification heads. I’m working on a project that needs several models running at once. I am unfortunately running into CUDA out of memory errors.

Is it possible to “share” the common backbone of my model to take up less space on my GPU?

Topic		Replies	Views
Fine-tuning BERT with multiple classification heads 🤗Transformers	10	5490	January 19, 2024
Share a Multi-Task Model on the huggingface Hub Models	0	710	September 20, 2022
Model Parallelism, how to parallelize transformer? Beginners	3	12701	June 18, 2021
Unable to train Bert by splitting across GPUs 🤗Transformers	0	455	June 24, 2022
Multiple gpu not properly parallelized during model.generate() 🤗Transformers	4	1619	October 9, 2022

Instantiating multiple heads at the same time

Related topics