Training BART, error when preparing decoder_input_ids. Shape of input_ids?

chrisdoyleIE · August 7, 2020, 8:33am

No need to change the HF code, just the structure of your own code:

class SampleModule(pl.LightningModule):
    def __init__(kwargs):
         # initialize stuff
         self.model = BartForConditionalGeneration.from_pretrained(kwargs.arch)
         # more stuff

     def forward(kwargs):
          return self.model(kwargs.batch)

     def training_step(kwargs):
           # do all stuff with shifting decoder inputs etc here
           # then call self() as your forward method

Topic		Replies	Views
What should be shifted for decoder input for Bart Beginners	1	329	July 8, 2021
Is there a way to return the "decoder_input_ids" from "tokenizer.prepare_seq2seq_batch"? 🤗Transformers	5	3355	December 29, 2020
What should decoder_input_ids be when pre-training mBART? Models	0	12	June 18, 2025
[Bart] Question for BartModel Output shape Beginners	2	375	July 20, 2020
BART - Input format Intermediate	4	1790	December 13, 2023

Training BART, error when preparing decoder_input_ids. Shape of input_ids?

Related topics