Performance problems with finetuned model (Llama 2 7B based)

taoofstefan · June 8, 2024, 8:49am

Hello,

I am trying to finetune Llama 2 7B to enhance its function calling abilities. I am not sure what I am doing wrong, but the model is not really converging while training and the performance, hence, isn’t good at all.

I am using this Jupyter Notebook and tried it with and without quantization.

These are the training and eval/test datasets. I made them, and they are designed to learn the ‘get_current_weather’ function.

As my datasets are small, I picked higher values for r and lora_alpha

r=256, lora_alpha=512

Does anyone have any ideas on how to improve the model? Am I doing something wrong?

I am very new to the topic of finetuning LLMs. So, any feedback or help would be very much appreciated.

nielsr · June 10, 2024, 9:24am

Hi,

i’d recommend taking a look at the following resources to fine-tune open-source LLMs:

Unsloth: GitHub - unslothai/unsloth: Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory. I see they recommend lower values for lora: Google Colab.
the Alignment handbook (optimized for chat fine-tuning): alignment-handbook/scripts at main · huggingface/alignment-handbook · GitHub.

I’d recommend starting with a super tiny dataset and see if the model is able to overfit it.

taoofstefan · June 10, 2024, 2:46pm

Thanks, I will have a look

I tried lower values for both before going up. The higher values seemed to work a little better but still not good.

I used a different approach, adding the SFTTrainer to the fine tuning process, which makes things work well.

nielsr · June 10, 2024, 2:58pm

Yes see also my fine-tuning notebook regarding Mistral: Transformers-Tutorials/Mistral/Supervised_fine_tuning_(SFT)_of_an_LLM_using_Hugging_Face_tooling.ipynb at master · NielsRogge/Transformers-Tutorials · GitHub

Topic		Replies	Views
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K Beginners	5	1147	December 5, 2024
LoRA / QLoRA fine tuning a 8b Model(llama 3.1) Beginners	1	310	February 24, 2025
Fine tune a finetuned model Beginners	1	594	December 16, 2024
Fine-tuning don't work / bad results Beginners	5	1707	January 15, 2025
Looking for help finetuning Llama2 Models	0	373	November 8, 2023

Performance problems with finetuned model (Llama 2 7B based)

Related topics