Fine-tuning a small LLM on 32 GB RAM, 4 vCPUs

Is it possible to use microsoft/Phi-3-mini-4k-instruct on CPU only? (I have an i3.xlarge Databricks cluster at work with 4 vCPUs and 32 GB of memory.) The model card says this implementation uses flash attention, so I tried downloading microsoft/Phi-3-mini-4k-instruct-onnx instead. I'm hitting a few errors there, but I just want to double-check that I'm using the correct implementation, and whether I can simply set `device_map` to "auto" or "cpu" when I don't have a GPU instance.
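For what it's worth, here is a minimal sketch of how I understand the CPU-only load path would look with the regular (non-ONNX) checkpoint. The helper name `cpu_safe_load_kwargs` is something I made up for illustration; the kwargs themselves (`device_map`, `attn_implementation`) are the ones recent versions of `transformers` accept in `from_pretrained`, where `attn_implementation="eager"` should sidestep flash attention (which needs a CUDA GPU):

```python
# Hypothetical helper: pick from_pretrained kwargs for a CPU-only box.
def cpu_safe_load_kwargs(has_gpu: bool = False) -> dict:
    """Return load kwargs for AutoModelForCausalLM.from_pretrained."""
    if has_gpu:
        # With a GPU, let accelerate place weights and use flash attention.
        return {"device_map": "auto", "attn_implementation": "flash_attention_2"}
    # Flash attention requires a CUDA GPU; fall back to the eager
    # attention implementation and pin everything to CPU.
    return {"device_map": "cpu", "attn_implementation": "eager"}


# Usage sketch (not run here -- downloads several GB of weights):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     "microsoft/Phi-3-mini-4k-instruct", **cpu_safe_load_kwargs()
# )
```

Note that `device_map="auto"` relies on `accelerate` being installed; on a machine with no GPU it would resolve to CPU anyway, but being explicit with `"cpu"` avoids any ambiguity.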

The background: I was trying to fine-tune an open-source model, but without a GPU it seems like a pain…