DPOTrainer cannot train an encoder-decoder VLM

In DPOTrainer's compute_loss function, the batch is required to contain a "prompt_pixel_values" key, but the default data collator, DPODataCollatorWithPadding, raises ValueError("Unexpected key in batch '{k}'") if your batch contains that key. I don't know how to build my dataset so that I can train an encoder-decoder VLM with DPOTrainer.


I can’t isolate the cause because the error message is so generic…
You could try a different dataset and model first and see whether the problem persists. If it isn't model-dependent, then it's a library or program issue.

Thanks for your reply. Here are more details of my code. My model and dataset are customized: the model is a subclass of BartPretrainedModel, and the dataset is a datasets.Dataset instance. The model accepts inputs such as "pixel_values" and "decoder_input_ids" and returns logits, etc. Each sample of the dataset is a dictionary containing "chosen", "rejected", "images", and "prompt". With this layout, DPOTrainer raises KeyError: 'prompt_pixel_values'. I read the source code and found that this is because the concatenated_inputs function accesses batch["prompt_pixel_values"]. However, when I rename the "images" key of the sample dictionary returned by the dataset to "prompt_pixel_values", ValueError("Unexpected key in batch '{k}'") is raised from the __call__ function of DPODataCollatorWithPadding.
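
To make the layout concrete, here is a minimal, hypothetical version of one of my samples (the texts and the image array are just placeholders; the keys are the ones listed above):

```python
import numpy as np
from datasets import Dataset

# Dummy stand-in for a real preprocessed image (channels x height x width).
dummy_image = np.zeros((3, 224, 224), dtype=np.float32)

# One sample of my dataset; the strings are placeholders.
sample = {
    "prompt": "Describe the image.",
    "chosen": "A caption that was preferred.",
    "rejected": "A caption that was rejected.",
    "images": [dummy_image],
}
train_dataset = Dataset.from_list([sample])

# With this layout, DPOTrainer fails with KeyError: 'prompt_pixel_values'.
# Renaming "images" to "prompt_pixel_values" instead makes
# DPODataCollatorWithPadding raise ValueError("Unexpected key in batch ...").
```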


I see. That's the usual problem when running an HF library's batch processing with a custom model. I've had trouble with that too.
Whether they are batch objects or argument objects, they are instances of existing classes in the library, so it's hard to imitate them without using those classes.
One option is to give up on batching and process the samples one by one in a for loop, but that workaround can't be used in every case.
This is a case where we'd like to configure the arguments manually if possible, but that will need some research. A rough idea of what that could look like is sketched below.
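
As a starting point, here is a rough, untested sketch of that idea: subclass the default collator so the pixel tensors bypass its strict key check. The import path of DPODataCollatorWithPadding, the key name "prompt_pixel_values", and the assumption that the remaining text fields collate normally are all guesses based on the error messages above and may differ between TRL versions.

```python
import torch
from trl.trainer.utils import DPODataCollatorWithPadding  # path may vary by TRL version

class VLMDPOCollator(DPODataCollatorWithPadding):
    # Hypothetical workaround: hide keys the stock collator does not know about,
    # let it pad the text fields, then re-attach the pixel tensors to the batch.
    PIXEL_KEY = "prompt_pixel_values"  # assumed key name, taken from the KeyError above

    def __call__(self, features):
        # Remove the pixel tensors so the parent collator never sees the unknown key.
        pixel_values = [f.pop(self.PIXEL_KEY, None) for f in features]
        batch = super().__call__(features)  # parent pads the keys it recognizes
        if all(v is not None for v in pixel_values):
            batch[self.PIXEL_KEY] = torch.stack(
                [torch.as_tensor(v, dtype=torch.float32) for v in pixel_values]
            )
        return batch
```

You'd pass an instance of this collator to DPOTrainer through its data_collator argument, and then still have to check whether concatenated_inputs and your model's forward signature line up with what it produces in your TRL version.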