Fine-tune conversational model

chadwick-mcmonagle · January 23, 2023, 1:52pm

Hi,

I’m totally new to transformers. I’ve got a conversational model of Microsoft’s GODEL working, but I a totally green on how to fine-tune it using my own data.

According to GODEL’s github page, the data format should be like this for training:

{
“Context”: “Please remind me of calling to Jessie at 2PM.”,
“Knowledge”: “reminder_contact_name is Jessie, reminder_time is 2PM”,
“Response”: “Sure, set the reminder: call to Jesse at 2PM”
},

So I’ve got a list in Python constructed of several context/knowledge/responses. The problem is I have no idea how to actually “train” or “fine-tune” the transformer model that is in ~/.cache/huggingface/hub. GODEL’s github page seems to provide a script, but I think that’s for training the model if you were to clone the repository and use it that way - not for training the model used through transformers in Python.

Can someone please point me in the right direction? I’ve read the ‘tutorial’ page on this but I’m still rather confused.

Thanks!

levalencia · March 27, 2023, 6:08pm

I am on the same boat, the documentation on how to fine tune a conversational model its not very clear.

mcliston · March 28, 2023, 5:45am

I am in the process of finetuning Godel as well. What I found helpful was in the authors paper they outline an example of input with this, “The dialog context S and environment
E are concatenated as a long sequence, which is
the input to the model”.

daliselmi · July 17, 2023, 8:30am

@chadwick-mcmonagle did you figure out how to fine-tune it or what format should the data be? I am trying to do the same thing and the documentation is not very clear on the subject. If you made any progress understanding what to do, please share your findings.
Thanks

chadwick-mcmonagle · July 17, 2023, 2:27pm

@daliselmi unfortunately not.

csipapicsa · May 31, 2024, 4:27pm

Does anybody find out how to preprocess the text for fine tuning?

Topic		Replies	Views
Finetuning of conversational model without train data in conversation style Intermediate	1	1725	February 2, 2024
Model Recommendations Beginners	0	1175	January 4, 2023
Fine Tuning a conversational model Beginners	0	552	April 3, 2024
Fine-tuned transformers model generats nonsensical results Beginners	0	217	July 10, 2024
Fine-tuning models Course	0	299	January 15, 2024

Fine-tune conversational model

Related topics