Expectations - What to expect from Unsloth training of a Phi-3.5 model

jduncalf · November 21, 2024, 9:08am

Hi, I’ve successfully fine-tuned the unsloth/Phi-3.5-mini-instruct model on some simple data, using their free Colab notebook.

Here’s my data:
Can apples can go in the red bin?,yes
Can apples can go in the blue bin?,no
Can pears can go in the red bin?,yes
Can pears can go in the bluebin?,no
Can grapes can go in the red bin?,yes
Can grapes can go in the blue bin?,no
Can plums can go in the red bin?,no
Can plums can go in the blue bin?,yes

Once training is complete I wish to be able to test the model with the following:

 list all items that can go in the red bin

Or
should i put pears go in the red bin or the blue bin?

My question for this forum is: am I expecting too much? It just doesn’t work. Should this work with Unsloth? How deeply is my data integrated into the model?

Any assistance in aiding my understanding is greatly appreciated.

Regards.

huggingfacepremium · November 21, 2024, 9:10am

We’re looking for a volunteer who could train the model hackint0sh/phi-3-clinical on the following dataset: openlifescienceai/medmcqa. The AI should answer clinical case questions, each with 4 options, and provide an accurate answer to each question.

The trained AI model can then be used to better guide doctors in helping patients.

jduncalf · November 21, 2024, 11:40am

After more research I’ve discovered that supplying context with my training data would help, e.g.:

 Context: apples can go in the red bin. pears can go in the red bin. grapes can go in the red bin. plums can go in the blue bin.
 Question: Can apples can go in the red bin?
 Answer: Yes.
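
A minimal sketch of how question,answer rows like the ones in the first post could be turned into Context/Question/Answer records in this layout. It uses only the Python standard library. The helper names (`build_example`, `build_dataset`), the two-row sample, and the choice of a single `"text"` field per record are assumptions for illustration; check which dataset field your training notebook actually reads.

```python
import csv
import io

# Two of the question,answer rows from the first post (sample only).
RAW_CSV = """\
Can apples can go in the red bin?,yes
Can plums can go in the red bin?,no
"""

# The shared context, written out once and prepended to every example.
CONTEXT = (
    "apples can go in the red bin. pears can go in the red bin. "
    "grapes can go in the red bin. plums can go in the blue bin."
)


def build_example(context, question, answer):
    """Return one training string in the Context/Question/Answer layout."""
    return (
        f"Context: {context}\n"
        f"Question: {question}\n"
        f"Answer: {answer.strip().capitalize()}."
    )


def build_dataset(raw_csv, context):
    """Parse question,answer rows and wrap each one with the context."""
    rows = csv.reader(io.StringIO(raw_csv))
    # Assumption: one "text" field per record; skip blank or malformed rows.
    return [
        {"text": build_example(context, row[0], row[1])}
        for row in rows
        if len(row) == 2
    ]


examples = build_dataset(RAW_CSV, CONTEXT)
print(examples[0]["text"])
```

Each record then carries the facts the model needs alongside the question, rather than relying on eight rows to teach them.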

My problem now, however, is that the Google Colab notebook doesn’t use this kind of template.

Is it possible to supply context for fine-tuning?

Regards.

jduncalf · November 24, 2024, 9:43am

Don’t worry guys, I have found the answer myself.

I was assuming that providing context would need a dedicated API call, overload, etc. The reality, of course, is ridiculously simple: you just supply the context in the body of the query. The following is a typical RAG template query:

"Answer the question based on the context below. If you can't answer the question, reply, "I don't know". Context: apples can go in the red bin. pears can go in the red bin. grapes can go in the red bin. plums can go in the blue bin. Question: list everything that goes in the red bin."
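
The same template can be assembled in code, as a rough sketch. `build_rag_prompt` is a hypothetical helper name, not part of Unsloth or any library; the resulting string is simply passed to the model as the user's message.

```python
def build_rag_prompt(context: str, question: str) -> str:
    """Assemble the query: instruction first, then context, then question."""
    return (
        "Answer the question based on the context below. "
        "If you can't answer the question, reply, \"I don't know\". "
        f"Context: {context} "
        f"Question: {question}"
    )


context = (
    "apples can go in the red bin. pears can go in the red bin. "
    "grapes can go in the red bin. plums can go in the blue bin."
)
prompt = build_rag_prompt(context, "list everything that goes in the red bin.")
print(prompt)
```

Because the facts travel with every query, changing which fruit goes in which bin only means editing the context string, with no retraining.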

Regards.

system · November 24, 2024, 9:43pm

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.
