Using decoder only part of pretrained MarianMT (Encoder-Decoder Translation model)

Ganga · September 29, 2021, 4:18am

Hi,

I’m trying to use only the decoder half of a pretrained MarianMT model. I can get the model loss for training the decoder half with my own encoder outputs using the following:
output = self.model(input_ids=None, encoder_outputs=(encoded_sequence,), labels=targets, output_attentions=False, return_dict=True)

This gives me the loss and logits. But I’m not sure how to integrate the generate function into this. Can someone please help me with the same?

imranq · March 12, 2022, 5:23pm

Also curious about this

bk073 · October 18, 2023, 1:37pm

did you solve this?
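
For anyone landing here later, one approach that seems to work is to wrap your own encoder states in a `BaseModelOutput` and pass them to `generate` through the `encoder_outputs` keyword. In the transformers versions I'm aware of, `generate` then skips the model's own encoder. This is a minimal sketch, not a confirmed fix from the original poster. It assumes your encoder output has shape (batch, src_len, d_model) matching the Marian decoder's hidden size. The tiny randomly initialised config, `encoder_mask`, and the tensor sizes are made up so the snippet runs offline; in practice you would load your checkpoint with `MarianMTModel.from_pretrained`.

```python
import torch
from transformers import MarianConfig, MarianMTModel
from transformers.modeling_outputs import BaseModelOutput

# Hypothetical tiny config so the sketch runs without downloading weights.
# Replace with MarianMTModel.from_pretrained(...) for a real checkpoint.
config = MarianConfig(
    vocab_size=100,
    d_model=16,
    encoder_layers=1,
    decoder_layers=1,
    encoder_attention_heads=2,
    decoder_attention_heads=2,
    encoder_ffn_dim=32,
    decoder_ffn_dim=32,
    max_position_embeddings=64,
    pad_token_id=99,
    eos_token_id=0,
    decoder_start_token_id=99,
)
model = MarianMTModel(config).eval()

# Stand-in for the output of your own encoder: (batch, src_len, d_model).
encoded_sequence = torch.randn(2, 5, config.d_model)
# Mask over encoder positions (1 = real position, 0 = padding).
encoder_mask = torch.ones(2, 5, dtype=torch.long)

# generate() expects an output object with a last_hidden_state field,
# so wrap the tensor instead of passing a bare tuple.
encoder_outputs = BaseModelOutput(last_hidden_state=encoded_sequence)

with torch.no_grad():
    generated = model.generate(
        encoder_outputs=encoder_outputs,
        attention_mask=encoder_mask,  # used for cross-attention in the decoder
        max_new_tokens=8,
        num_beams=1,      # plain greedy search; raise for beam search
        do_sample=False,
    )

print(generated.shape)  # (batch, generated_length)
```

With a real checkpoint, the returned ids can be turned into text with the matching tokenizer's `batch_decode(generated, skip_special_tokens=True)`. If `generate` does not accept this in your version, a manual loop that repeatedly calls `model(encoder_outputs=..., decoder_input_ids=...)` and appends the argmax token is a fallback.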
