I fine-tuned the falcon-7b, why is my result such a let down?

DavidGetter1 · August 6, 2023, 9:28am

Hello, basically I am trying to improve the generated horror stories, creepy pastas to be exact.
I was following a tutorial where someone fine-tuned the falcon-7b in order to create better mid journey prompts, I figured that it is not a distant use-case to mine so it would work.
Well it works, kinda.
The model gets usually stuck on one sentence and repeats it over and over, even when I penalize heavy it provides some part that is ok, but after some tokens it hangs up on one sentence.
Also the stories don’t seem to end.
My problem is that I don’t know where the fault happened since there are many possible points I believe.
I think my dataset is good, I found it online and it provides 3200 creepy pasta stories with a name, rating, category, estimated reading time and tags.
I removed all features but the story and the estimated reading time.

This is the colab document that I rewrote for my use case, I did not change anything but the dataset and the training prompt.

I would appreciate a push in the right direction, whether my use-case is even possible and what my next step could be.

maxolotl · August 15, 2023, 10:36pm

Are you able to use this training data to fine-tune a larger model, like a 13b model or falcon’s 40b version? How does that work?

The falcon 7b’s instruct page says if you want to fine-tune the model, you should fine-tune the normal version and not instruct. Maybe this is related.

DavidGetter1 · August 18, 2023, 9:07am

Thank you, I will look into it, I hadn’t had much time as of last week

Topic		Replies	Views
Trouble with fine-tuning falcon 7b Beginners	0	125	March 11, 2024
Help me with finetuning! Beginners	0	129	April 9, 2024
Inference with falcon7b to generate essays in Google Colab? 🤗Transformers	0	294	July 23, 2023
How to fine tune a LORA fine tuned model 🤗Transformers	0	301	July 14, 2023
Fine-tuning don't work / bad results Beginners	5	1665	January 15, 2025

I fine-tuned the falcon-7b, why is my result such a let down?

Related topics