Adding a classification head to M2M100's decoder

athairus · March 24, 2022, 12:09am

Is there a tutorial/example for adding a second classification head to the decoder of an encoder-decoder model like M2M100?

Unless I missed it the recently-released Hugging Face book only covers adding a second head to an encoder-only model like BERT.

I can see that I should be essentially recreating M2M100ForConditionalGeneration, adding my second head to init() and forward(). One thing I’m unsure about: How much of the original class’s bells & whistles I should include in my own version. What’s necessary? What’s not?

I could get into more detail about my use case, if needed.

Thanks!

Topic		Replies	Views
Questions on the `BertModelLMHeadModel` 🤗Transformers	7	6212	October 5, 2020
Fine-Tune BERT with two Classification Heads "next to each other"? Beginners	3	2645	September 17, 2021
Share a Multi-Task Model on the huggingface Hub Models	0	710	September 20, 2022
Adding another head to Vision encoder decoder model Intermediate	4	329	May 7, 2024
Continuous Learning Using Trainer for BERT multi-class model Beginners	0	348	April 19, 2023

Adding a classification head to M2M100's decoder

Related topics