LED Config default encoder and decoder layers

ViljamiR · November 9, 2023, 8:29am

The transformers.LEDConfig documentation states:

This is the configuration class to store the configuration of a LEDModel. It is used to instantiate an LED model according to the specified arguments, defining the model architecture. Instantiating a configuration with the defaults will yield a similar configuration to that of the LED [allenai/led-base-16384] architecture.

By default, transformers.LEDConfig gives both the encoder and the decoder 12 layers (encoder_layers=12, decoder_layers=12), and according to the documentation these defaults should correspond to the LED base model configuration. In the original paper (https://arxiv.org/pdf/2004.05150.pdf), however, the LED base model has 6 layers in both the encoder and the decoder.

Is this a specific design choice or am I missing something?
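
For anyone who wants to check this directly, here is a minimal sketch that prints the layer counts of a default LEDConfig. The helper names layer_counts and hub_layer_counts are my own and not part of transformers; the second one downloads the checkpoint's config from the Hub, so it needs network access and is only defined here, not called.

```python
from transformers import AutoConfig, LEDConfig


def layer_counts(config):
    """Return (encoder_layers, decoder_layers) for an LED-style config."""
    return config.encoder_layers, config.decoder_layers


def hub_layer_counts(checkpoint="allenai/led-base-16384"):
    """Load a checkpoint's config from the Hub (needs network access)
    and return its layer counts. Hypothetical helper for illustration."""
    return layer_counts(AutoConfig.from_pretrained(checkpoint))


# Instantiating the config with no arguments uses the library defaults.
default_config = LEDConfig()
print("LEDConfig() defaults (encoder, decoder):", layer_counts(default_config))
print("LEDConfig() default d_model:", default_config.d_model)
```

Calling hub_layer_counts() would show what the published allenai/led-base-16384 checkpoint actually uses, which can then be compared against the defaults. If the paper's layer count is what you want, passing encoder_layers=6 and decoder_layers=6 to LEDConfig overrides the defaults.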
