How to merge fine-tuned LLaMA-3.1-8B (via LLaMA-Factory) into a single GGUF for LM Studio?

dasdawedWR · May 25, 2025, 9:48am

Hi everyone!

I successfully fine-tuned the meta-llama/Llama-3.1-8B-Instruct model using the dataset G-reen/TheatreLM-v2.1-Characters.
The training was done using LLaMA-Factory, since that was the only method that worked for me.

The training itself went fine. But now I’m stuck with a problem.

I don’t understand how to merge the base model and the fine-tuned files into a single .gguf file so I can use it in LM Studio.

Here’s how my files are organized:

Fine-tuned files (LoRA output):
D:\IA\LLaMA-Factory\saves\Llama-3.1-8B\lora\train_2025-05-24-18-39-59
Base model:
D:\IA\LLaMA-Factory\models\Llama-3.1-8B

I’ve tried different ways but nothing worked so far.
If anyone can explain how to properly combine these into a .gguf file — I would really appreciate the help!

Thanks in advance!

John6666 · May 25, 2025, 10:41am

Maybe similar case?

Topic		Replies	Views
Llama3 Fine-Tuning Consultation Beginners	1	96	February 12, 2025
Lama 3.23b performs great when I download and use using ollama but when I manually download the model or if I use the gguf model by unsloth, it gives me irrelevant response. Please help me out Beginners	9	1354	October 31, 2024
Finetuning Meta-Llama-3.1-8B using PEFT Models	4	3455	February 1, 2025
LLAMA-2 Finetune Models	0	528	July 27, 2023
How to train an already finetuned LLM(LLama2)? Intermediate	0	300	March 13, 2024

How to merge fine-tuned LLaMA-3.1-8B (via LLaMA-Factory) into a single GGUF for LM Studio?

Related topics