I have a value network that uses Llama-2-7B as the base model, and I want to attach a linear value head on top. I know I could wrap everything in a new nn.Module, but I don't like that approach because I'd lose all the nice features of the Hugging Face model, like save_pretrained, etc. Is there a way to just add a layer or two while preserving all those properties?
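To be concrete, here is the kind of plain nn.Module wrapper I mean. This is just a sketch with illustrative names (ValueModel, value_head are mine, not from any library), and I use nn.Identity as a stand-in base so it runs without downloading Llama-2; in reality self.base would be the AutoModelForCausalLM and the hidden states would come from its output:

```python
import torch
import torch.nn as nn

class ValueModel(nn.Module):
    """Plain wrapper: base model plus a scalar value head.

    Works, but the wrapper itself is a bare nn.Module, so it has no
    save_pretrained / from_pretrained of its own.
    """
    def __init__(self, base_model: nn.Module, hidden_size: int):
        super().__init__()
        self.base = base_model                 # would be Llama-2-7B in practice
        self.value_head = nn.Linear(hidden_size, 1)

    def forward(self, x):
        hidden = self.base(x)                  # stand-in for last_hidden_state
        return self.value_head(hidden).squeeze(-1)

# tiny stand-in base so the sketch is runnable
base = nn.Identity()
model = ValueModel(base, hidden_size=16)
values = model(torch.randn(2, 5, 16))          # one scalar value per token
print(values.shape)                            # torch.Size([2, 5])
print(hasattr(model, "save_pretrained"))       # False -- this is my complaint
```

This does what I want functionally, but the outer model loses the Hugging Face serialization machinery, which is exactly what I'd like to keep.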
I may be a bit lost here, as I'm pretty new to this, so sorry if this sounds like an obvious task. But I looked at, for example, the TRL value head, and they do some fairly complex stuff to achieve this, even though it seems like it should be simple. I can't figure out what is preventing a straightforward change.