Hallucinations after fine tuning data augmentated dataset even with low loss

celsowm · November 10, 2024, 7:34pm

Hi !

I tried to fine tuning using this dataset: https://huggingface.co/datasets/celsowm/enunciados_pge_rj_orpo created using some techniques of data augumentation using as source this simples page: https://pge.rj.gov.br/entendimentos/enunciados

I tried llama3.2 1B using SFT + lora and even ORPO, all with low loss.

But when I prompt something present in the dataset like: “Qual é o conteúdo do Enunciado 47 da PGE-RJ?.” (What is the content of Statement 47 of the PGE-RJ?)

The response is a Hallucination from another Enunciado (Statement);

Unfortunately my 3060 ti 12gb is not enough to full fine tuning so I did not try yet.

Any hints???

Thanks in advance !

Topic		Replies	Views
Whisper large-v3 finetuning Beginners	10	3158	July 1, 2025
After fine tuning, saving and reloading the model, he is "forgetting" fine tuning 🤗Transformers	0	800	August 9, 2023
How to fine-tune to 3 very different sized datasets (very large to very small) Intermediate	0	786	February 24, 2023
Poor results (val_loss) on fine-tuning the NLLB-200-600M with LoRA for French-Wolof translation 🤗Transformers	3	292	October 1, 2024
Performance problems with finetuned model (Llama 2 7B based) Beginners	3	679	June 10, 2024

Hallucinations after fine tuning data augmentated dataset even with low loss

Related topics