Train Llama on a domain-specific dataset and an instruction-format dataset

I was thinking about this for Polish. A few thoughts:

  1. I assume you mean fine-tuning with LoRA rather than full fine-tuning of Llama 2 itself (see the LoRA sketch after this list).

  2. If Llama 2 has not been trained on a significant portion of a foreign language, then fine-tuning a chat model with LoRA might not help much. I recently read a paper that says exactly this, though I don’t remember where. (The quick tokenizer check after this list is one way to gauge this.)

  3. People are still doing this, though I'm not sure how successful it is. For example: davidkim205/komt-Llama-2-13b-hf-lora · Hugging Face

  4. If you want to train the base model (not the chat model) to include more Bulgarian, then to end up with a usable chat model you still have to go through RLHF afterwards, which is difficult and costly.
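
On point 1, here is a minimal sketch of what I mean by LoRA fine-tuning, using the Hugging Face `peft` library. The rank, target modules, and other settings are placeholders I picked for illustration, not a tested recipe, and the checkpoint is gated on the Hub:

```python
# A minimal LoRA setup sketch with peft; hyperparameters are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-2-13b-hf"  # base model, not the -chat variant
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# LoRA freezes the base weights and trains small low-rank adapter matrices.
lora = LoraConfig(
    r=16,                                 # adapter rank (placeholder)
    lora_alpha=32,                        # scaling factor (placeholder)
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% of all weights
```

Note that this freezes the base model and only trains small adapter matrices, which is why point 2 matters: LoRA can steer what the model already knows, but it adds comparatively little new knowledge.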

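On point 2, one quick (and admittedly rough) way to gauge how much of a language made it into pre-training is to check how finely the tokenizer fragments text in that language; heavy fragmentation usually means the corpus contained little of it. The sentences below are just an illustrative pair:

```python
# Rough heuristic: tokenize the same sentence in English and in the target
# language and compare token counts. Many more tokens per word in the target
# language suggests the model saw little of it in pre-training.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-hf")

for text in ("The cat sat on the mat.",   # English
             "Kot siedział na macie."):   # the same sentence in Polish
    tokens = tokenizer.tokenize(text)
    print(f"{len(tokens):3d} tokens | {text}")
```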