Research on Hyperparameters for Fine Tuning

I fine-tuned databricks/Dolly-v2-3b on the b-mc2/sql-create-context dataset so that it would return a SQL query for a given question and context. After fine-tuning, however, the model actually gives worse results: instead of SQL queries it often returns random statements, and when it does produce SQL it omits the conditions (e.g., the WHERE clause):

SELECT count(*)
FROM head

So, how should the hyperparameters be configured, what is the relation between the hyperparameters and the model, and what is the best approach to fine-tuning?

You should start with a low learning rate; the right settings also depend on the size of your dataset.
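To make "low learning rate" concrete, a common pattern is a small peak learning rate (on the order of 1e-5 to 2e-5 for full fine-tuning) with linear warmup followed by linear decay. The sketch below computes such a schedule in plain Python; the specific values (`max_lr=2e-5`, 100 warmup steps) are illustrative assumptions, not tuned for Dolly-v2-3b:

```python
def lr_at_step(step, max_lr=2e-5, warmup_steps=100, total_steps=1000):
    """Linear warmup to max_lr, then linear decay to 0.

    The peak value and step counts are illustrative; in practice they
    depend on model size and dataset size.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to max_lr over the warmup phase.
        return max_lr * step / warmup_steps
    # Decay linearly from max_lr down to 0 over the remaining steps.
    return max_lr * (total_steps - step) / (total_steps - warmup_steps)

# The schedule starts at 0, peaks at max_lr, and ends at 0.
print(lr_at_step(0))     # start of warmup
print(lr_at_step(100))   # peak
print(lr_at_step(1000))  # end of training
```

Starting too high (e.g., 1e-3 on a 3B-parameter model) is a common cause of the kind of degradation described above, where the model forgets its pretraining and emits incoherent text.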

Hi @Pekka10, you can try using our model to see if it fits your needs. We use this particular prompt format: sql-eval/prompts/ at main · defog-ai/sql-eval · GitHub.
p.s. I work for defog and am aiming to improve our OSS model, so feel free to send any bugs my way.
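For readers who can't follow the link: text-to-SQL prompts generally pair the question with the schema (the CREATE TABLE statements) in a fixed instruction layout, and the same layout must be used at training and inference time. The template below is a hypothetical sketch of that idea, not the actual defog format, which lives in the linked sql-eval repo:

```python
# Hypothetical instruction-style template for text-to-SQL fine-tuning.
# The section headers and wording are illustrative assumptions; the
# real prompt format is in the defog-ai/sql-eval repository.
PROMPT_TEMPLATE = """### Task
Generate a SQL query to answer the following question: {question}

### Database Schema
{create_statements}

### SQL
"""

def build_prompt(question, create_statements):
    """Fill the template with a question and its CREATE TABLE context."""
    return PROMPT_TEMPLATE.format(
        question=question, create_statements=create_statements
    )

prompt = build_prompt(
    "How many heads of departments are older than 56?",
    "CREATE TABLE head (age INTEGER)",
)
print(prompt)
```

If the fine-tuning examples don't consistently include the schema in the prompt, the model has no way to recover column names or conditions at inference time, which matches the missing-WHERE-clause symptom in the original post.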