Primer on Fine Tuning Text generation models (like GPT)

vishalsingh1080 · November 14, 2022, 2:54pm

Hi! I am new to fine-tuning and am trying a small exercise: I would like to fine-tune a decoder-only model to capture the nuances of a specific domain, such as finance articles (domain adaptation). I have a very small dataset of about 500 articles from one domain, and I’d like to fine-tune an OPT model on it.

I tried the default method, as indicated here (HF Fine-tuning script), but the results were not great. I think the model didn’t actually fine-tune.
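
For context, here is a minimal sketch of how training examples for a decoder-only (causal) language model are commonly built: the tokenized articles are concatenated and cut into fixed-length blocks, and the labels are simply a copy of the inputs. It assumes the script in question works this way, which the post does not confirm. The function name `group_into_blocks` and the toy token ids are made up for illustration.

```python
from itertools import chain


def group_into_blocks(tokenized_docs, block_size):
    """Concatenate token-id lists and cut them into equal-sized blocks.

    Hypothetical helper for illustration. Tokens left over after the
    last full block are dropped.
    """
    all_ids = list(chain.from_iterable(tokenized_docs))
    usable = (len(all_ids) // block_size) * block_size
    blocks = [all_ids[i:i + block_size] for i in range(0, usable, block_size)]
    # For causal LM training the labels are a copy of the inputs;
    # the one-position shift is usually applied inside the model's loss.
    return [{"input_ids": b, "labels": list(b)} for b in blocks]


# Toy "tokenized articles" (made-up ids): 13 tokens in total.
docs = [[1, 2, 3, 4, 5], [6, 7, 8], [9, 10, 11, 12, 13]]
examples = group_into_blocks(docs, block_size=4)

for ex in examples:
    print(ex["input_ids"])
```

With an assumed 1,000 tokens per article, 500 articles would give only about 488 blocks of 1,024 tokens, which is a very small training set and may be part of why the effect of fine-tuning is hard to see.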

Then I researched the problem further and learnt that there are “parameter-efficient” fine-tuning methods, where extra layers called “adapters” are added and fine-tuned while the base pretrained model stays frozen.
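
To make the adapter idea concrete, the rough arithmetic below counts how many parameters would actually be trained. It is only a sketch under stated assumptions: each adapter is taken to be a bottleneck of two small linear layers (down-projection and up-projection, with biases), two adapters are inserted per transformer layer, and the model sizes are illustrative numbers for a model of roughly 125M parameters, not values from this post.

```python
def adapter_params(hidden_size, bottleneck):
    """Parameters in one bottleneck adapter (assumed design):
    a down-projection and an up-projection, each with a bias."""
    down = hidden_size * bottleneck + bottleneck
    up = bottleneck * hidden_size + hidden_size
    return down + up


def trainable_fraction(base_params, hidden_size, bottleneck,
                       num_layers, adapters_per_layer=2):
    """Return (added parameters, share of all parameters that is trained)
    when the base model is frozen and only the adapters are updated."""
    added = adapter_params(hidden_size, bottleneck) * num_layers * adapters_per_layer
    return added, added / (base_params + added)


# Illustrative, assumed sizes for a small decoder-only model.
added, fraction = trainable_fraction(
    base_params=125_000_000,
    hidden_size=768,
    bottleneck=64,
    num_layers=12,
)
print(f"trainable adapter parameters: {added:,}")
print(f"share of all parameters: {fraction:.2%}")
```

Under these assumptions only about 2% of the parameters receive gradient updates, which is the main appeal of such methods when the dataset is small.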

This left me quite confused, so I’m asking the wider community for help. Where should I start learning about fine-tuning LLMs for text generation so that I get a good grasp of the concepts? Most guides I came across only tackle sentiment analysis.
