I’m trying to read up on knowledge distillation, and as an exercise I’d like to fine-tune a GPT2-medium model on a specific generation task and then distill it down to a small GPT2 model. Could someone point me towards a Colab notebook or tutorial that I could use to learn hands-on how to do this? Thanks
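
For context, here is my rough understanding of the core training step I'd be implementing — a minimal sketch of the standard soft-target distillation loss (Hinton et al., 2015), assuming a fine-tuned GPT2-medium teacher and a GPT2 student; the temperature `T` and mixing weight `alpha` are illustrative values, not recommendations:

```python
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModelForCausalLM

# GPT2 and GPT2-medium share the same BPE vocabulary, so their
# logits are directly comparable token-for-token.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
teacher = AutoModelForCausalLM.from_pretrained("gpt2-medium").eval()  # frozen teacher
student = AutoModelForCausalLM.from_pretrained("gpt2")

optimizer = torch.optim.AdamW(student.parameters(), lr=5e-5)
T = 2.0      # softmax temperature (illustrative)
alpha = 0.5  # weight between distillation loss and hard-label loss (illustrative)

batch = tokenizer("example training text", return_tensors="pt")
input_ids = batch["input_ids"]

# Teacher forward pass without gradients.
with torch.no_grad():
    teacher_logits = teacher(input_ids).logits

# Student forward pass; passing labels gives the usual LM cross-entropy loss.
out = student(input_ids, labels=input_ids)

# KL divergence between temperature-softened teacher and student
# distributions, scaled by T^2 to keep gradient magnitudes comparable.
kd_loss = F.kl_div(
    F.log_softmax(out.logits / T, dim=-1),
    F.softmax(teacher_logits / T, dim=-1),
    reduction="batchmean",
) * (T ** 2)

loss = alpha * kd_loss + (1 - alpha) * out.loss
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

Does that match what the standard recipe looks like, and is there a worked notebook that builds this out properly (data loading, eval, etc.)?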