Base or Instruct version of LLM for fine tuning?

roman174 · August 4, 2024, 3:06pm

Can I please ask if Base version or Instruct version is right choice for fine tuning LLM model?
For example, I use the LLM model for multiple text classification. Should I choose Llama3.1 8b or Llama3.1 8b Instruct?
And can this be generalized to other models? Like for Gemma2 9b vs Gemma2 9b Instruct, Mistral 7b vs Mistral 7b Instruct, Mistral-Nemo-Base-2407 vs Mistral-Nemo-Instruct-2407 and so on…It’s not clear to me from the documentation…

sahar-millis-markete · August 8, 2024, 11:32am

TL;DR Use the instructional version of the models.

As Always - It depends; Mostly on your resources and expectations.

In most cases, ppl who don’t require domain adaptation or significant differences in alignment - will want to work with the Instruct version, for LLaMA or any other LLM.

For example,

Prompting isn’t a thing on the base version, and you’ll need to use few-shot and other techniques to get the model to understand what you want.
Using Instruct versions allowing you to achieve almost anything with a prompt, but you get stuck in this “ping-pong UX”.

In your case, for multiple-text classification,

Create a test set.
Start with prompting a model.
Do the same with different models.
Fine-tune the best-performing model. Test the FT model.

Good luck.
Sahar

Topic		Replies	Views
Finetuning on base or instruct model? Beginners	0	1697	April 6, 2024
Is it a good idea to finetune an LLM to predict certain number? Beginners	1	835	April 4, 2024
Help with preparing train data for fine-tuning llama 3.1 instruct model? Models	0	97	October 27, 2024
What is the best LLM for finetuning with specific repetetive data? Models	0	124	November 30, 2024
Best practice for finetune LLM Intermediate	0	638	June 21, 2023

Base or Instruct version of LLM for fine tuning?

Related topics