RewardTrainer Problem

AttributeError                            Traceback (most recent call last)
in <cell line: 0>()
      1 # Initialize RewardTrainer
----> 2 trainer = RewardTrainer(
      3     model=model,
      4     args=training_args,
      5     tokenizer=tokenizer,

/usr/local/lib/python3.11/dist-packages/trl/trainer/reward_trainer.py in __init__(self, model, args, data_collator, train_dataset, eval_dataset, processing_class, model_init, compute_metrics, callbacks, optimizers, preprocess_logits_for_metrics, peft_config)
    167
    168         # Disable dropout in the model
--> 169         if args.disable_dropout:
    170             disable_dropout_in_model(model)
    171

AttributeError: 'TrainingArguments' object has no attribute 'disable_dropout'


The error happens because RewardTrainer reads disable_dropout from its config, and a plain TrainingArguments object has no such field. Instead of TrainingArguments, use RewardConfig from trl:

from trl import RewardConfig

training_args = RewardConfig(
    output_dir="./results",
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    eval_strategy="steps",  # named evaluation_strategy in older transformers
    eval_steps=500,
    save_strategy="steps",
    save_steps=500,
    logging_steps=100,
    learning_rate=5e-5,
    weight_decay=0.01,
    num_train_epochs=3,
    disable_dropout=True  # the field the trainer was looking for (defaults to True)
)

trainer = RewardTrainer(
    model=model,
    args=training_args,
    processing_class=tokenizer,  # recent TRL takes processing_class instead of tokenizer (see the signature in your traceback)
    train_dataset=train_dataset,
    eval_dataset=eval_dataset
)
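
Then kick off training as usual:

trainer.train()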

If you still want to use TrainingArguments, you can manually disable dropout in your model before passing it to RewardTrainer:

from trl.trainer.utils import disable_dropout_in_model

disable_dropout_in_model(model)  # Manually disable dropout
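
For reference, that helper is essentially just a loop that zeroes the dropout probability of every nn.Dropout module. A minimal sketch of the same idea (the function name here is ours, not TRL's):

import torch.nn as nn

def zero_out_dropout(model: nn.Module) -> None:
    # Same effect as trl.trainer.utils.disable_dropout_in_model:
    # set every Dropout module's probability to 0
    for module in model.modules():
        if isinstance(module, nn.Dropout):
            module.p = 0.0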

Note, however, that this alone won't avoid the AttributeError above, since RewardTrainer still reads args.disable_dropout from the config it is given. Using RewardConfig is the recommended approach.


Thank you, but what happened here:


RewardConfig?

Actually, when I run the given code from Hugging Face, it throws an error. I just copy-pasted it.


Is it possible that you have an old version of trl?

pip install -U trl transformers peft accelerate huggingface_hub
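
After upgrading, you can confirm what's actually installed (both packages expose __version__):

import transformers
import trl

# RewardConfig only exists in recent TRL releases, so an old pin
# is a likely culprit for import or attribute errors
print("trl:", trl.__version__)
print("transformers:", transformers.__version__)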

Hello, use this snippet:

from trl import RewardConfig, RewardTrainer
training_args = RewardConfig(output_dir="Qwen2.5-0.5B-Reward", per_device_train_batch_size=2)
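
For a fuller picture, the quickstart in the TRL README pairs that config with a model, tokenizer, and preference dataset roughly like this (the model and dataset names are the README's examples; substitute your own):

from datasets import load_dataset
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from trl import RewardConfig, RewardTrainer

model = AutoModelForSequenceClassification.from_pretrained(
    "Qwen/Qwen2.5-0.5B-Instruct", num_labels=1  # one scalar reward per sequence
)
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-0.5B-Instruct")
model.config.pad_token_id = tokenizer.pad_token_id

# A preference dataset with "chosen"/"rejected" pairs
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

training_args = RewardConfig(output_dir="Qwen2.5-0.5B-Reward", per_device_train_batch_size=2)
trainer = RewardTrainer(
    model=model,
    args=training_args,
    processing_class=tokenizer,
    train_dataset=dataset,
)
trainer.train()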

Reference: https://github.com/huggingface/trl
