Ref_model in DPOTrainer

mrin · April 18, 2024, 4:40am

Hi, I’m using Mistral-7B-Instruct-v0.1 model with peft loaded in the DPOTrainer model.
now when I’m giving ref_model by creating a copy of this PeftLoaded model by create_reference_model() fn then the below piece of code is running infinitely and my Jupyter Lab got freezed. not responding anymore.
but by passing ref_model = None it’s running peacefully.

Now I want to know if I don’t pass the ref_model rather than passing, is there anything will be affected, I know DPOtrainer will create a copy on it’s own for a ref_model, but why passing ref_model is not working or is there any impact.

trainer = DPOTrainer(
    model = combined_model,
    ref_model=None,
    args=args,
    train_dataset=small_dataset,
    eval_dataset=eval_dataset,
    tokenizer=combined_tokenizer,
    beta=dpo_args["beta"],
    loss_type=dpo_args["loss_type"],
    max_length = 512,
    max_prompt_length = 128,
)

Topic		Replies	Views
Model won't save correctly and will not inference correctly when imported Beginners	0	81	April 26, 2024
Running DPOTrainer with custom gpu management 🤗Transformers	0	35	February 7, 2025
Mistral 7B RAG Langchaing Models	0	2623	February 20, 2024
IndexError on devices[0] when initializing a Trainer Beginners	0	849	July 11, 2023
Can't save my finetuned model Beginners	5	210	November 9, 2024

Ref_model in DPOTrainer

Related topics