LoRA fine-tuning a chatbot on a 6GB VRAM GPU

Hello, I am building a chatbot that works with data from IMDb.

I was thinking of using meta-llama/Llama-3.2-1B-Instruct and LoRA fine-tuning it on the BrightData/IMDb-Media dataset. I have a 6GB VRAM GPU and not much time.

My questions are: Is this the right model for the task, or should I use a smaller one? Does it pair well with this dataset? Will 6GB of VRAM be enough for LoRA fine-tuning it? And since the dataset has 250K rows, should I split it and fine-tune on a smaller subset?
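
For context, if I do go with a smaller subset, I was planning to subsample something like this (assuming the dataset exposes a `train` split and that ~10K rows is a reasonable first pass; both are guesses on my part):

```python
# Rough sketch: pull a small random slice of the 250K-row dataset
# for a quick first training run. Split name and subset size are
# assumptions; check the dataset card on the Hub for the real splits.
from datasets import load_dataset

ds = load_dataset("BrightData/IMDb-Media", split="train")
subset = ds.shuffle(seed=42).select(range(10_000))  # ~10K of 250K rows
print(subset)
```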


I’m not very familiar with it, but training takes a lot of VRAM, so even with LoRA at fp16 precision on a 1B model, 6GB of VRAM would be only just about enough.
It probably won’t crash outright if you have enough system RAM to offload to…
If you can use QLoRA, you can save a lot of VRAM, though I’ve never used it myself. 😅
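
I haven’t run this myself, so treat it as a rough sketch rather than a tested recipe, but a QLoRA setup with transformers + peft + bitsandbytes would look something like this (the rank, alpha, and target modules are guesses you’d want to tune):

```python
# Rough QLoRA sketch (untested): 4-bit quantized base model + LoRA adapters.
# Requires: pip install transformers peft bitsandbytes accelerate
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "meta-llama/Llama-3.2-1B-Instruct"

# Load the frozen base weights in 4-bit to cut VRAM use.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,  # compute in fp16
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Attach small trainable LoRA adapters; hyperparameters are guesses.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapters train, so VRAM stays low
```

With the base model in 4-bit and only the adapter weights trainable, a 1B model should fit comfortably in 6GB; the main remaining knobs are batch size and sequence length.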