Generator has no attribute backward

ablam · June 21, 2022, 1:21am

I’m finetuning InCoder-1B on a custom dataset with data that contains [input_ids, attention_mask] as columns. [token_type_ids] was not supported so I removed it.

In terms of training, I’m running 2 GPU’s on data-parallel and a gradient_checkpoint to preserve memory, implemented on PyTorch. This is an issue in training I’m facing which I’m not sure how it came about or which of these aspects it could be related to.

sgugger · June 21, 2022, 1:33pm

From your message, it looks like your batch does not contain any label. Therefore your outputs probably don’t have a real loss.

ablam · June 21, 2022, 2:03pm

That is correct! In the fine-tune with PyTorch tutorial it said to postproccess the tokenized_dataset as follows:

tokenized_datasets = tokenized_datasets.rename_column("label", "labels")

However, the only columns in my data are ['input_ids', 'attention_mask', "token_type_ids"] (after removing ['text'])

Thus, there is no corresponding labels column so I didn’t rename it.

But instead, I tried renaming “token_type_ids” to labels, i.e. tokenized_datasets = tokenized_datasets.rename_column( "token_type_ids", "labels") but incurred an error as well.

Any advice on how I should go about this?

Topic		Replies	Views
No labels column for tokenized data 🤗Tokenizers	2	2229	June 27, 2022
Column names of custom dataset for use with trainer Beginners	3	5436	March 31, 2024
'BertEncoder' object has no attribute 'gradient_checkpointing' 🤗Transformers	2	7132	August 1, 2022
Creating Trainer object is deleting my 'labels' feature Beginners	3	1452	January 21, 2021
Label 2 id not working Beginners	1	181	June 12, 2025

Generator has no attribute backward

Related topics