BART from finetuned BERT

Hi all!

Is it possible to use a pretrained BERT model to initialize the encoder part of an encoder-decoder model like BART, leaving the decoder uninitialized (or random), and then do fine-tuning on some seq2seq task?

How should I proceed if it's possible? Does anyone know of previous instances where something like this has been tried?

Thanks in advance!

I am not entirely sure about BART, but you can check out this: transformers/ at master · huggingface/transformers · GitHub

You can also read the publication linked in the comments; I think this is similar to what you want to achieve.
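For reference, a minimal sketch of the idea using the `EncoderDecoderModel` class in transformers (the checkpoint names and token settings here are illustrative, not the only option). Note that when the decoder is warm-started from BERT, its cross-attention layers don't exist in the checkpoint and are randomly initialized, so the model still needs seq2seq fine-tuning:

```python
# Warm-start a seq2seq model from pretrained BERT checkpoints.
# Encoder and decoder weights come from BERT; the decoder's
# cross-attention layers are newly created and random.
from transformers import BertTokenizer, EncoderDecoderModel

model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "bert-base-uncased", "bert-base-uncased"
)
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# BERT has no dedicated generation tokens, so set them explicitly
# before fine-tuning or generating.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id
```

After this, the model can be fine-tuned like any other seq2seq model (e.g. with `Seq2SeqTrainer`).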

Yeah, I eventually found that as well; it is indeed what I had in mind, and the linked papers were super insightful.