Resources required to fine-tune a large model?

dnrkdnrk · November 12, 2022, 2:35pm

Hi!

I am a student who has just started learning about NLP.
I recently found the model below on Hugging Face and was blown away by its performance.

So I have been playing with it, and I want to fine-tune it to change its tone and feed in more data so that it becomes fluent in a certain subject. The problem is that I have never dealt with such a large model, and it crashes due to insufficient GPU resources. I have no idea how much hardware is needed to handle a model this large, so I am not sure whether I should invest in a new GPU or get a cloud VM subscription.

If any of you have tested it, I would appreciate it if you could share how you did it and how many resources it required. Thanks for your time!
