Token-by-Token Fine-Tuning of the phi-2 Model for Code Generation

Hi! I’m doing supervised fine-tuning (SFT) of the phi-2 model, but I’ve run into a problem. My goal is to update the model’s knowledge of Pandas functions.

Here’s an example of the prompt I’m using:

“Section: Series Subsection: string-handling How to: Remove a prefix from an object series? Answer:”

The corresponding label is:

“pandas.Series.str.removeprefix”

What I’d like to do is update the model after each generated token. For example, I would give the model the prompt, it would generate the next token (e.g., ‘pan’), and then I’d compute the loss on that single token and backpropagate to update the weights before generating the next one.
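If I understand the standard causal-LM objective correctly, one forward pass with teacher forcing already produces a loss at every answer position, and the usual training loss is just the mean of the per-token cross-entropies I’d otherwise compute one at a time. The only real difference in my scheme is that stepping the optimizer after each token changes the weights mid-sequence. Here’s a toy sketch of that equivalence (hand-picked logits standing in for a real model’s output):

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def token_loss(logits, target_id):
    # Cross-entropy for a single position: -log p(target)
    return -math.log(softmax(logits)[target_id])

# Toy vocabulary of 4 tokens; pretend the model produced these logits
# at three answer positions (teacher forcing: the gold prefix is fed
# in regardless of what the model would have sampled).
logits_per_position = [
    [2.0, 0.5, -1.0, 0.1],
    [0.3, 1.7, 0.0, -0.5],
    [-0.2, 0.4, 2.2, 0.8],
]
gold_ids = [0, 1, 2]

# "Token-by-token": one loss per generated position
per_token = [token_loss(l, t) for l, t in zip(logits_per_position, gold_ids)]

# Standard causal-LM training loss: the mean over those same positions,
# computed from a single forward pass
full_sequence = sum(per_token) / len(per_token)

print(per_token)
print(full_sequence)
```

So unless the per-token optimizer steps in between are the point, the single-pass loss should cover the same signal.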

My questions are:

  • Does this process make sense for fine-tuning?
  • If so, is there a Trainer class from Hugging Face that supports this kind of token-by-token fine-tuning? From what I understand, the default Trainer workflow might work a bit differently than what I described.
  • If you have any general tips for this type of task, please share them.
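For reference, here is my understanding of what the default Trainer workflow expects: prompt and answer concatenated into one sequence, with the prompt positions masked to -100 so the loss is computed only on the answer tokens. A minimal sketch with a toy whitespace tokenizer (the real code would use phi-2’s tokenizer via `AutoTokenizer`):

```python
# Toy whitespace "tokenizer" for illustration only; the real setup
# would use AutoTokenizer.from_pretrained("microsoft/phi-2").
vocab = {}
def encode(text):
    ids = []
    for tok in text.split():
        ids.append(vocab.setdefault(tok, len(vocab)))
    return ids

prompt = ("Section: Series Subsection: string-handling "
          "How to: Remove a prefix from an object series? Answer:")
answer = "pandas.Series.str.removeprefix"

prompt_ids = encode(prompt)
answer_ids = encode(answer)

# Standard SFT sample: one sequence, with prompt positions excluded
# from the loss via the -100 ignore label
input_ids = prompt_ids + answer_ids
labels = [-100] * len(prompt_ids) + answer_ids

print(input_ids)
print(labels)
```

If my token-by-token idea is redundant, I’d happily fall back to this.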