How to parameter-efficiently fine-tune the decoder in an encoder-decoder model

I have been trying to fine-tune only the decoder by inserting an adapter after the FFN block of a large encoder-decoder model for the MT task, but the BLEU score is poor compared to full fine-tuning.
Can anyone suggest a better method, or point me to resources on fine-tuning the decoder of such a model?
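
For context, here is a minimal sketch of the kind of adapter insertion I mean, assuming PyTorch and a Hugging Face mBART-style decoder layout. The module paths (`model.model.decoder.layers`, `fc2`, `config.d_model`) and the bottleneck size are assumptions and may differ for your model:

```python
import torch
import torch.nn as nn


class BottleneckAdapter(nn.Module):
    """Down-project -> nonlinearity -> up-project, added back residually."""

    def __init__(self, d_model: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(d_model, bottleneck)
        self.act = nn.ReLU()
        self.up = nn.Linear(bottleneck, d_model)
        # Near-zero init for the up-projection so training starts
        # from (almost) the pretrained model.
        nn.init.zeros_(self.up.weight)
        nn.init.zeros_(self.up.bias)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return hidden_states + self.up(self.act(self.down(hidden_states)))


class LinearWithAdapter(nn.Module):
    """Wraps an existing linear layer (e.g. the FFN output projection)
    and applies the adapter to its output."""

    def __init__(self, inner: nn.Module, d_model: int, bottleneck: int = 64):
        super().__init__()
        self.inner = inner
        self.adapter = BottleneckAdapter(d_model, bottleneck)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.adapter(self.inner(x))


# Hypothetical usage with an mBART-style seq2seq model; adjust the
# module names to whatever your model actually uses.
#
# from transformers import AutoModelForSeq2SeqLM
# model = AutoModelForSeq2SeqLM.from_pretrained("facebook/mbart-large-50")
# for layer in model.model.decoder.layers:
#     layer.fc2 = LinearWithAdapter(layer.fc2, model.config.d_model)
# for name, p in model.named_parameters():
#     p.requires_grad = "adapter" in name   # train only the adapters
```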

Just a question: were you padding on the left or the right? I made a mistake with this at inference time and got degraded results compared to left padding. I am not experienced on the performance side, but have you tried tuning the hyperparameters, or tried the values others have reported? Those are usually good starting points.
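
If you want to check, here is a minimal sketch of inspecting and switching the tokenizer's padding side; the model name is just a hypothetical example. Note that left padding mainly matters for batched generation with decoder-only models; encoder-decoder models usually pad on the right and rely on the attention mask.

```python
from transformers import AutoTokenizer

# Hypothetical tokenizer choice, only to illustrate the setting.
tokenizer = AutoTokenizer.from_pretrained("facebook/mbart-large-50")

print(tokenizer.padding_side)    # see what you are currently using
tokenizer.padding_side = "left"  # switch if your setup needs left padding

batch = tokenizer(
    ["a short source sentence", "a much longer source sentence that forces padding"],
    padding=True,
    return_tensors="pt",
)
# Pass batch["attention_mask"] to model.generate(...) so padded positions are ignored.
```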