GPT-J-6B - Fine Tuning

BalajiAJ · September 22, 2021, 12:35pm

Hello,

I am trying to understand the GPT-J architecture, am having a question on the fine tuning code.
While we are doing fine tuning, are we unfreezing any layers specifically in the network or does it happen internally. Could anyone briefly explain how fine tuning woks for GPT-J. Thank you

Regards,
Balaji

Topic		Replies	Views
Finetune GPT-J on custom dataset Models	0	2809	January 18, 2022
Gradual Unfreezing support for Fine tuning models 🤗Transformers	3	4003	August 26, 2020
Finetuning GPT-J6B for custom dataset 🤗Tokenizers	1	1092	March 6, 2022
Finetuing GPT model? 🤗Transformers	2	367	August 29, 2021
Using GPT-J for custom sequence classification Beginners	0	411	September 14, 2022

GPT-J-6B - Fine Tuning

Related topics