Ways to reduce memory consumption in Q&A tasks without damage (or at least, not that much) the accuracy?

victorescosta · October 13, 2021, 8:35pm

i’m facing this problem: I’m trying to spend less memory in my Q&A task using bert. I debugged my steps and saw that the start_logits and end_logits

start_logits, end_logits = model(**inputs)

costs more than 11gb of ram. Is there any ways to solve this? I mean, use less memory to perform this task without harm my model accuracy? If so, can someone share some of them? And some alternative ways in case is not possible to do this?

Topic		Replies	Views
How to reduce memory usage for inference while training models from scratch? Models	0	1390	January 30, 2021
Using Batch Encodings 🤗Transformers	0	694	July 12, 2022
Bert NextSentence memory leak Beginners	4	1553	May 29, 2021
The model I'm using for QA info extraction is too heavy Beginners	0	249	April 19, 2022
Seeking Advice on Optimizing Hardware Resources for Model Training Beginners	3	155	August 4, 2024

Ways to reduce memory consumption in Q&A tasks without damage (or at least, not that much) the accuracy?

Related topics