Help, please! Seems fine tuning on LLM is not working

TeoR95 · December 19, 2023, 9:52am

Hello everyone!

I am relatively new to fine tuning and I am trying to fine tune this model “lmsys/vicuna-13b-v1.5-16k” with my dataset, which is a csv, made of just two columns and the most important one (“content”) is text taken from parsing PDF files (you can find something similar here: notebooks/examples/language_modeling.ipynb at main · huggingface/notebooks · GitHub in the “Preparing the dataset section”).
In order to test if fine tuning really works, I foillowed this tutorial for beginners and I used 1500 PDFs of the thousands I have:

It seems the fine tuning is running, but when it finishes and I try to ask a topic contained in one of the PDFs that I used, it starts hallucinanting! Even if the answer seems great and correct, it uses erroneous web links, names and events or facts!
What is wrong? Is it my dataset? Is it the tutorial? Is it really “learning” something? How can I check it?
I am testing this tutorial right now from Youtube:

“okay, but I want GPT to perform 10x for my specific use case” - Here is how

I hope it works!
How did you menage to fine tune your model? Do you have any suggestion or tutorial or video, please? Any help would be so appreciated!

Lastly, I hope my question is clear and useful for someone else too!
Thank you so much in advance.

Kind regards,

Matteo

Donor · January 16, 2024, 6:09am

Same problem here. Tried different hyperparameters include dataset size, epoch, lr, or everything with generation configuration tuning, results are broken. Literally I’m crying right now…

Tapanhaystek · April 5, 2024, 7:24am

Hy,@TeoR95
can you please tell me how to read this medium article ,it’s showing need member subscription,i am unable to read it…

TeoR95 · April 5, 2024, 8:37am

Hello @Tapanhaystek
I am super sure it was free when I read it! Unfortunately, I am afraid it is only for members now
No worry: there are plenty of fine-tuning tutorial on the Internet. However, for my project purposes, I found out that RAG is the best solution since, when I tried to ask specific information present in PDFs, fine-tuning kept on hallucinating…

Tapanhaystek · April 5, 2024, 10:46am

Actually,my project is to doing a chatbot using 1000 of pdfs and i use rag but the llms are hallucinated .
Are you done the chatbot using 1000 of pdfs if yes then please share the idea and code if possible …
if the code is private then please send in my mail
my mail-tpati2001@gmail.com
otherwise give me the github code please…

Topic		Replies	Views
Hallucinations after fine tuning data augmentated dataset even with low loss Beginners	0	46	November 10, 2024
Fine tuning llm model Models	2	4479	May 16, 2024
Fine-tuning don't work / bad results Beginners	5	1776	January 15, 2025
Getting wrong response after fine tuning Google/t5-v1_1-base 🤗Transformers	0	172	April 17, 2023
Fine tuning GPT2 on persona chat dataset outputs gibberish Models	1	2743	April 14, 2021

Help, please! Seems fine tuning on LLM is not working

Related topics