Could it be that `remove_unused_columns=False` is specified?
If something is not working properly, I think it is safer to define and pass your own `DataCollator`. It takes a bit of effort, but…
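As an illustration, here is a minimal sketch of such a collator for causal-LM fine-tuning; the class name is made up, and it assumes each example already carries `input_ids` and `attention_mask`:

```python
from dataclasses import dataclass

from transformers import PreTrainedTokenizerBase


@dataclass
class CausalLMCollator:
    """Pads a batch and copies input_ids into labels for the causal-LM loss."""

    tokenizer: PreTrainedTokenizerBase

    def __call__(self, features):
        # Pad only the keys the model expects; extra dataset columns are ignored.
        batch = self.tokenizer.pad(
            [{"input_ids": f["input_ids"], "attention_mask": f["attention_mask"]}
             for f in features],
            padding=True,
            return_tensors="pt",
        )
        # Labels are the input ids; mask padding with -100 so the loss skips it.
        labels = batch["input_ids"].clone()
        labels[batch["attention_mask"] == 0] = -100
        batch["labels"] = labels
        return batch
```

An instance of this goes to the `Trainer` via `data_collator=CausalLMCollator(tokenizer)`.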
Hi there!
Currently, columns not used by the model are removed in `self.get_*_dataloader()` upon data loader creation, but one might want to have them in `compute_metrics` (when `include_inputs_for_metrics=True`).
My case is fine-tuning on prompt-completion pairs, and I use the tokenizer's `token_type_ids` as a mask to compute accuracy only on the completion tokens.
To this end, the best way I've come up with is to keep that column in the dataset and data loader with `remove_unused_columns=False`, and then remove it in an overridden `self._prepare_inputs()`.
Is there a better way to achieve this? More generally, wouldn't it be better to move the column-removal logic into `self._prepare_inputs`, since it only serves as a gatekeeper for `model(**inputs)`?
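For reference, a minimal sketch of the override described above, assuming the extra column is named `token_type_ids` (the subclass name is hypothetical):

```python
from transformers import Trainer


class CompletionMaskTrainer(Trainer):
    def _prepare_inputs(self, inputs):
        inputs = super()._prepare_inputs(inputs)
        # The mask column survives batching because remove_unused_columns=False;
        # drop it here so it never reaches model(**inputs).
        inputs.pop("token_type_ids", None)
        return inputs
```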
You can avoid this error by passing `remove_unused_columns=False` to `TrainingArguments`, but a cleaner solution is to use `map` to tokenize the dataset before passing it to the `Trainer` (instead of tokenizing lazily).
After this change, you should get the "The model did not return a loss from the inputs …" error, which you can fix by returning a `labels` column from the collate function (a copy of `input_ids` for causal LM).
(`DataCollatorForLanguageModeling` handles this automatically, so it's better to perform the tokenization with `map` and use it as the collator.)
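Putting that answer together, a sketch of the flow, assuming a causal LM and a plain text file (the model name and file path are placeholders):

```python
from datasets import load_dataset
from transformers import AutoTokenizer, DataCollatorForLanguageModeling

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

# Tokenize up front with map() so the Trainer sees ready-made model columns.
dataset = load_dataset("text", data_files={"train": "train.txt"})["train"]
tokenized = dataset.map(
    lambda batch: tokenizer(batch["text"], truncation=True),
    batched=True,
    remove_columns=["text"],
)

# mlm=False makes the collator copy input_ids into labels (padding masked to -100),
# so the model computes and returns a loss.
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)
```

`tokenized` and `collator` are then passed to the `Trainer` as usual.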