Fine tuning llm model

abhi9160 · December 21, 2023, 6:58am

I want to fine tune llm model for QA task.I have a domain specific document on which I want the the model to answers questions.but I do not have question answer pair to fine tune the model. is there other way to fine tune without having question answer pair?

agershun · December 22, 2023, 4:22am

Unfortunately, no.

To get the expected results you need to train it in the similar way as you are going to use it (supervised fine-tuning and reinforcement learning). If you are going to ask questions and get answers, you need to train the LLM with pairs “question-answer”. If you want to teach how to summarize the text, so you need to prepare “text-summary” examples.

But… you can use “elder brothers” - other big LLMs, which can prepare these datasets with questions for you. Of cause, if this is allowed by their licences (OpenAI does not permit this, but most modern datasets were prepared with the help of ChatGPT-4).

If you train LLM on the original text, (unsupevised fine-tuning) it will speaks exactly as the original text, so you can start with the first word of the text, and LLM will try to continue it, sometimes it will dive into the infinite loop on some sentence.

Azizkhanofficial · May 16, 2024, 7:23am

Hello everyone, i have same Question, i want to fine tune LLM Model on pdf file where my file contain bullet points, tables, page number etc in simple like wikipedia, when i complete steps from extraction to fine tunning it give me error in accuracy and does not generate required answer, is it possible to generate answer from such kind of pdf file or i need to prepare QA-pairs , plz need your help

Topic		Replies	Views
LLM fine-tune with domain specific pdf documents Models	20	24957	November 5, 2024
How to Fine Tune the actual model's scope Beginners	1	24	March 25, 2025
How to fine-tune an LLM model with an entire document in a format such as *.txt/docx/pdf ect 🤗AutoTrain	6	7214	August 21, 2024
Fine-tuning LLM for RAG Beginners	2	1150	June 10, 2024
Train a model for document specific Q and A Community Calls	0	1007	February 19, 2023

Fine tuning llm model

Related topics