PEFT LoRA GPT-NeoX - LoraConfig

I have written a training script that uses the Accelerate and PEFT libraries to fine-tune GPT-NeoX. The training loop runs without error, but the script does not generate an adapter_config.json file. My intuition is that my LoraConfig object is not properly parameterized, resulting in a silent failure.

Does anyone have any suggestions on how best to parameterize LoraConfig for the GPT-NeoX family of models?

My current LoraConfig:

    peft_config = LoraConfig(
        target_modules=["query_key_value", "xxx"],
    )

My full training script can be found here.
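For comparison, a commonly used LoraConfig for GPT-NeoX-style models, whose attention projections live in a single fused `query_key_value` linear layer, might look like the sketch below. The rank, alpha, and dropout values are illustrative assumptions, not recommendations from this thread.

```python
from peft import LoraConfig, TaskType

# Illustrative values only; tune r / lora_alpha / lora_dropout for your task.
peft_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,   # causal language modeling
    r=16,                           # LoRA rank
    lora_alpha=32,                  # scaling factor
    lora_dropout=0.05,
    # GPT-NeoX fuses Q, K, and V into a single linear layer:
    target_modules=["query_key_value"],
)
```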

Hello @eusip!
Thanks for the issue!
Indeed, you need to slightly tweak the trainer and add a callback to properly save your PEFT models. Please have a look at what was suggested in Incorrect Saving Peft Models using HuggingFace Trainer · Issue #96 · huggingface/peft · GitHub and let us know if this works!
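If it helps, here is a minimal sketch of such a callback (the class name and checkpoint layout are assumptions, not the exact code from the linked issue). It hooks the Trainer's `on_save` event so each checkpoint also contains the adapter files (adapter_config.json plus the adapter weights):

```python
import os

# Sketch of a save callback for PEFT models (names are assumptions).
try:
    from transformers import TrainerCallback
except ImportError:  # stub base class so the sketch stays self-contained
    class TrainerCallback:
        pass

class SavePeftModelCallback(TrainerCallback):
    def on_save(self, args, state, control, **kwargs):
        # Mirror the Trainer's checkpoint naming scheme.
        checkpoint_dir = os.path.join(args.output_dir, f"checkpoint-{state.global_step}")
        # PeftModel.save_pretrained writes adapter_config.json and the adapter weights.
        kwargs["model"].save_pretrained(checkpoint_dir)
        return control
```

You would then pass it to the trainer, e.g. `Trainer(..., callbacks=[SavePeftModelCallback()])`.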


@ybelkada, again you have saved the day! Thanks for your help!
