Fine-Tune Llama on main and auxiliary task

Dmts93 · July 15, 2023, 7:14pm

Hello everyone,

I am trying to fine-tune a Llama model on two tasks at the same time:

Main task: causal language modelling, which is what the model was originally trained for.
Auxiliary task: classification based on the whole input sequence (recommending an article). For this task I am using the LlamaForCausalLM class as a reference and overriding its __init__ and forward functions.

However, I want to combine the two tasks above into a single training process. The main problem is that language modelling is an iterative process where the loss is calculated for every new context token in the input sequence, while for the classification task the loss should only be calculated once.

How can I hold back the classification loss so that it is calculated only once, after the language modelling part has been completed? Is there any example you can recommend for combining a main LM task with an auxiliary classification task?
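
For reference, here is a minimal sketch of one common way to frame this, not a confirmed recipe for Llama. It relies on the fact that, during training, the per-token language modelling loss is usually computed for all positions in a single forward pass (labels shifted by one) and averaged into one scalar, so a classification loss computed once per sequence can simply be added to it. Everything named here is an assumption for illustration: the small GRU is a hypothetical stand-in for the Llama decoder stack, and `LMWithClassifier`, `cls_head`, `aux_weight`, and pooling on the last non-padded token are choices made for this sketch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LMWithClassifier(nn.Module):
    """Toy model with an LM head and an auxiliary classification head.

    The GRU is a hypothetical stand-in for the Llama decoder; it is causal
    because it only reads tokens from left to right.
    """

    def __init__(self, vocab_size=100, hidden_size=32, num_classes=5, aux_weight=0.5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.backbone = nn.GRU(hidden_size, hidden_size, batch_first=True)
        self.lm_head = nn.Linear(hidden_size, vocab_size, bias=False)
        self.cls_head = nn.Linear(hidden_size, num_classes)  # assumed extra head
        self.aux_weight = aux_weight  # assumed weight of the auxiliary loss

    def forward(self, input_ids, attention_mask, class_labels):
        hidden, _ = self.backbone(self.embed(input_ids))  # (batch, seq, hidden)

        # LM loss: every position predicts the next token; padding is ignored.
        logits = self.lm_head(hidden)
        lm_labels = input_ids.masked_fill(attention_mask == 0, -100)
        lm_loss = F.cross_entropy(
            logits[:, :-1].reshape(-1, logits.size(-1)),
            lm_labels[:, 1:].reshape(-1),
            ignore_index=-100,
        )

        # Classification loss: computed once per sequence, from the hidden
        # state of the last non-padded token (assumes right padding).
        last_idx = attention_mask.sum(dim=1) - 1
        batch_idx = torch.arange(hidden.size(0), device=hidden.device)
        cls_logits = self.cls_head(hidden[batch_idx, last_idx])
        cls_loss = F.cross_entropy(cls_logits, class_labels)

        # One scalar, so a single backward pass updates both heads and the backbone.
        total_loss = lm_loss + self.aux_weight * cls_loss
        return total_loss, lm_loss, cls_loss


torch.manual_seed(0)
model = LMWithClassifier()
input_ids = torch.randint(0, 100, (2, 6))
attention_mask = torch.tensor([[1, 1, 1, 1, 1, 1], [1, 1, 1, 1, 0, 0]])
class_labels = torch.tensor([3, 1])

total_loss, lm_loss, cls_loss = model(input_ids, attention_mask, class_labels)
total_loss.backward()
```

With this framing there is nothing to freeze: both losses come out of the same forward pass, and `aux_weight` controls how strongly the classification task influences the shared weights. In the real setting, the same two loss terms would be computed inside the overridden forward function.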

This is my first question here, so thanks everyone for your understanding.
