Hello,
I am attempting to fine-tune a Llama-2-7b model for a dialogue summarisation task. I would like to use PEFT LoRA fine-tuning, but I am unsure which parameters to use. My dataset contains about 4k examples.
Specifically:
- Which rank (r) should I use?
- What is a good lora_alpha? I have read that a common practice is to set it to twice the rank — is that right?
- Which target_modules should I select?
- What is a sensible lora_dropout value?
- Which learning rate should I use? Is there a standard value, or is it purely experimental?
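For reference, here is roughly what I have been experimenting with so far. All of these values are guesses on my part (picked up from various tutorials, including the "alpha = 2 × r" convention I mentioned), and the variable names mirror the keyword arguments I would pass to peft's LoraConfig:

```python
# My current guesses -- these are the values I'm asking about, not
# recommendations. They would be passed to peft.LoraConfig as keyword
# arguments, e.g. LoraConfig(r=r, lora_alpha=lora_alpha, ...).
r = 16                                 # rank of the LoRA update matrices
lora_alpha = 32                        # following the "alpha = 2 * r" convention
lora_dropout = 0.05                    # dropout applied to the LoRA layers
target_modules = ["q_proj", "v_proj"]  # attention projections only -- or all linear layers?
learning_rate = 2e-4                   # value I have seen in several LoRA examples

# As I understand it, LoRA scales the low-rank update BA by alpha / r,
# so with alpha = 2 * r the effective scaling stays constant at 2.0
# no matter which rank I pick:
scaling = lora_alpha / r
print(scaling)  # 2.0
```

Is my understanding of the alpha/r scaling correct, i.e. that raising r without raising alpha effectively shrinks the update?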
I have already read through the standard documentation for Llama and PEFT, but it does not clarify these parameters; it often just says "these are the parameters you will want to tune yourself". Can you provide answers, or point me to sources that explain which parameter values are a good fit?
Thank you in advance!