LoRA finetuning for nvidia/NV-Embed-v2

yaksh123 · December 31, 2024, 8:32am

Problem Summary:
I am trying to apply LoRA finetuning to the nvidia/NV-Embed-v2 model. The ultimate goal is to use the model for creating embeddings. I am getting this error: TypeError: NVEmbedModel.forward() got an unexpected keyword argument 'inputs_embeds'

I checked the model's files and it does expect the inputs_embeds kwarg. I am using the FEATURE_EXTRACTION task type for the PEFT config, and loading the model with AutoModel and 8-bit quantization. How can I solve this?
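A minimal sketch of one way to double-check which keyword arguments a `forward()` accepts, and of how a wrapper that always forwards `inputs_embeds` can trigger this kind of TypeError. Everything here is a toy stand-in: `ToyEmbedModel`, `ToyStrictWrapper`, `ToyPassthroughWrapper`, and `accepts_kwarg` are hypothetical names, and the toy signature is not claimed to match the real NVEmbedModel. It assumes the FEATURE_EXTRACTION wrapper in PEFT passes `inputs_embeds` explicitly to the base model, while the generic wrapper (no task type) passes arguments through unchanged; if that holds for your PEFT version, omitting `task_type` from the LoRA config may be worth trying.

```python
import inspect


def accepts_kwarg(fn, name):
    """Return True if `fn` can be called with keyword argument `name`."""
    params = inspect.signature(fn).parameters
    p = params.get(name)
    if p is not None and p.kind in (p.POSITIONAL_OR_KEYWORD, p.KEYWORD_ONLY):
        return True
    # A **kwargs catch-all also accepts any keyword.
    return any(q.kind is q.VAR_KEYWORD for q in params.values())


class ToyEmbedModel:
    """Hypothetical stand-in for a model whose forward() lacks inputs_embeds."""

    def forward(self, input_ids=None, attention_mask=None):
        return {"input_ids": input_ids, "attention_mask": attention_mask}


class ToyStrictWrapper:
    """Mimics a wrapper that always forwards inputs_embeds (assumed behavior)."""

    def __init__(self, base):
        self.base = base

    def forward(self, input_ids=None, attention_mask=None, inputs_embeds=None):
        return self.base.forward(
            input_ids=input_ids,
            attention_mask=attention_mask,
            inputs_embeds=inputs_embeds,  # rejected if the base does not accept it
        )


class ToyPassthroughWrapper:
    """Mimics a wrapper that forwards only what the caller supplied."""

    def __init__(self, base):
        self.base = base

    def forward(self, *args, **kwargs):
        return self.base.forward(*args, **kwargs)


base = ToyEmbedModel()
print(accepts_kwarg(base.forward, "inputs_embeds"))

try:
    ToyStrictWrapper(base).forward(input_ids=[1, 2, 3])
except TypeError as err:
    print("strict wrapper failed:", err)

print(ToyPassthroughWrapper(base).forward(input_ids=[1, 2, 3]))
```

With the real model loaded, the same check would be `accepts_kwarg(model.forward, "inputs_embeds")`, run on the base model before it is wrapped by PEFT.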

cc4718 · January 10, 2025, 4:47pm

Same error here. I'm forced to finetune without LoRA and with a smaller batch size.

tahaw863 · June 9, 2025, 2:20pm

Can you share code for the finetuning? : )
