Inserting custom layer in GPT-2

Vibhu04 · September 27, 2022, 8:42am

Hi everyone,
I am looking for a way to modify GPT-2’s architecture slightly by inserting a custom feedforward layer inside a GPT-2 decoder block right after the masked self-attention sublayer. Is there a way to achieve this using Hugging Face’s GPT-2 model? I’m new to Hugging Face, any inputs would be appreciated.
Thank you!

Topic		Replies	Views
Adding custom layer to GPT-2 Models	0	458	September 27, 2022
Modifying architecture of the models provided by the library Beginners	1	791	March 13, 2023
Loading weights of specific layer of gpt2 pretrained model Beginners	0	210	December 12, 2023
Use external embeddings 🤗Transformers	0	372	July 13, 2022
Custom modification on transformers 🤗Transformers	1	158	June 13, 2024

Inserting custom layer in GPT-2

Related topics