We’ve tried reproducing the results from alpaca_lora and are seeing odd behavior:
- the official adapter weights work fine with the 7B model
- the adapters we get when we finetune the same model with the official finetuning script produce nonsensical output when used with the official generation script (see the loading sketch below)
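
For reference, this is roughly how we load our adapters for generation; it mirrors the official generation script as far as we can tell, but the model ID and adapter path below are placeholders for our local setup, not necessarily the exact values from the repo:

```python
# Minimal sketch of our generation setup (paths/IDs are assumptions).
import torch
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import PeftModel

base_model_id = "decapoda-research/llama-7b-hf"  # assumed 7B base model
adapter_path = "./lora-alpaca"                   # output dir of our finetune run

tokenizer = LlamaTokenizer.from_pretrained(base_model_id)
model = LlamaForCausalLM.from_pretrained(
    base_model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)
# Swapping adapter_path for the published alpaca-lora adapter gives sensible
# output with the exact same code; our own adapters do not.
model = PeftModel.from_pretrained(model, adapter_path, torch_dtype=torch.float16)
model.eval()

prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nList three primary colors.\n\n### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```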
I suspect something is going wrong during finetuning: hyperparameter reproduction, RNG seeds, dataset construction or prompt formatting, or something else that’s hard to catch. One check we’ve been running for the formatting hypothesis is sketched below.
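
The idea is to rebuild the training prompt for a sample the way we believe the finetuning script formats it, and compare it against what the generation script sends (same headers, same trailing "### Response:" marker). The templates and field names below are our reading of the Alpaca data format, so treat them as assumptions:

```python
# Sanity check for train/generate prompt mismatch (templates are assumptions).
import json

TEMPLATE_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
TEMPLATE_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response:\n"
)

def build_prompt(example: dict) -> str:
    """Format one alpaca_data.json record the way we believe training does."""
    if example.get("input"):
        return TEMPLATE_WITH_INPUT.format(**example)
    return TEMPLATE_NO_INPUT.format(instruction=example["instruction"])

with open("alpaca_data.json") as f:
    sample = json.load(f)[0]
# repr() makes stray whitespace/newline differences visible
print(repr(build_prompt(sample)))
```

So far this hasn’t turned up an obvious mismatch on our side, which is why we’re asking here.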
Any input from the alpaca-lora people or the PEFT/LoRA developers would be welcome.