Okay, so I need to override Trainer with a custom loss function that reduces the array of losses to a scalar. What is the meaning of this array of losses? Should it simply be summed?
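For context, here is roughly what I have in mind, a minimal sketch that assumes TransfoXLLMHeadModel returns the per-token losses under `outputs.losses` when labels are passed, and that `.mean()` is an acceptable reduction (that reduction choice is my guess, not something I have confirmed):

```python
from transformers import Trainer

class TransfoXLTrainer(Trainer):
    """Sketch: reduce Transformer XL's array of per-token losses to a scalar."""

    def compute_loss(self, model, inputs, return_outputs=False):
        outputs = model(**inputs)
        # `losses` holds one cross-entropy value per predicted token
        # (my assumption about TransfoXLLMHeadModel's output). Taking the
        # mean keeps the loss scale independent of sequence length,
        # whereas summing would grow with longer sequences.
        loss = outputs.losses.mean()
        return (loss, outputs) if return_outputs else loss
```

Whether `.mean()` or `.sum()` is the right reduction here is exactly what I am asking above.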
In any case, I do not believe that is the source of my error. I think I need to prepare my dataset differently so that it can be properly consumed by the Transformer XL model, but it is not clear to me how this should be done.
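What I imagine is something like the fixed-length chunking used in the language-modeling examples, where every example ends up the same length, but I do not know if that is the right preparation for Transformer XL. In this sketch the dataset, checkpoint name, and `block_size` are just placeholders for what I am actually using:

```python
from datasets import load_dataset
from transformers import TransfoXLTokenizer

# Placeholder dataset and checkpoint, just to make the sketch runnable.
tokenizer = TransfoXLTokenizer.from_pretrained("transfo-xl-wt103")
raw = load_dataset("wikitext", "wikitext-2-raw-v1", split="train")
block_size = 128

def tokenize(examples):
    return tokenizer(examples["text"])

def group_texts(examples):
    # Concatenate all token ids, then cut them into equal `block_size`
    # chunks (dropping the remainder) so every example has the same shape
    # and the default collator's torch.stack call can succeed.
    concatenated = {k: sum(examples[k], []) for k in examples.keys()}
    total_len = (len(concatenated["input_ids"]) // block_size) * block_size
    result = {
        k: [t[i : i + block_size] for i in range(0, total_len, block_size)]
        for k, t in concatenated.items()
    }
    result["labels"] = result["input_ids"].copy()
    return result

tokenized = raw.map(tokenize, batched=True, remove_columns=raw.column_names)
ds = tokenized.map(group_texts, batched=True)
```

With every example the same size, the `torch.stack` failure below should at least go away; whether this is how Transformer XL expects its data to be prepared is my question.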
Here is the full error message:
```
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
<ipython-input-4-3d56e61e3b4e> in <module>()
8 args=training_args,
9 train_dataset=ds)
---> 10 trainer.train()
5 frames
/usr/local/lib/python3.7/dist-packages/transformers/trainer.py in train(self, resume_from_checkpoint, trial, ignore_keys_for_eval, **kwargs)
1256 self.control = self.callback_handler.on_epoch_begin(args, self.state, self.control)
1257
-> 1258 for step, inputs in enumerate(epoch_iterator):
1259
1260 # Skip past any already trained steps if resuming training
/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py in __next__(self)
519 if self._sampler_iter is None:
520 self._reset()
--> 521 data = self._next_data()
522 self._num_yielded += 1
523 if self._dataset_kind == _DatasetKind.Iterable and \
/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py in _next_data(self)
559 def _next_data(self):
560 index = self._next_index() # may raise StopIteration
--> 561 data = self._dataset_fetcher.fetch(index) # may raise StopIteration
562 if self._pin_memory:
563 data = _utils.pin_memory.pin_memory(data)
/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/fetch.py in fetch(self, possibly_batched_index)
45 else:
46 data = self.dataset[possibly_batched_index]
---> 47 return self.collate_fn(data)
/usr/local/lib/python3.7/dist-packages/transformers/data/data_collator.py in default_data_collator(features, return_tensors)
64
65 if return_tensors == "pt":
---> 66 return torch_default_data_collator(features)
67 elif return_tensors == "tf":
68 return tf_default_data_collator(features)
/usr/local/lib/python3.7/dist-packages/transformers/data/data_collator.py in torch_default_data_collator(features)
103 if k not in ("label", "label_ids") and v is not None and not isinstance(v, str):
104 if isinstance(v, torch.Tensor):
--> 105 batch[k] = torch.stack([f[k] for f in features])
106 else:
107 batch[k] = torch.tensor([f[k] for f in features])
RuntimeError: stack expects each tensor to be equal size, but got [1] at entry 0 and [8] at entry 2
```