How to Finetune and deploy LLaVA-1.6

PrithamSriramG · April 10, 2024, 7:28pm

Hi,
Im new here and I’m facing the following issues:

I was playing with llava-v1-6-vicuna-13b. Having looked at the github repo with info on how to fine-tune the same, I was able to finetune the model. I uploaded the model to Huggingface on a private repo and I used the deploy using interface endpoints feature but got the following error:

huggingface_hub.utils._errors.RepositoryNotFoundError: 401 Client Error.
Repository Not Found for url:
Please make sure you specified the correct repo_idandrepo_type.
If you are trying to access a private or gated repo, make sure you are authenticated.
Invalid username or password.

When I was using the LlavaNextProcessor library from this link, I was unable to use it on my fine-tuned model and there are no instructions on how to fine-tune using the Library if possible or any other ways.

Any help would be highly appreciated

Nemu06 · April 15, 2024, 4:44am

Hi, could you please share the github repo that you used to fine-tune llava 1.6

PrithamSriramG · April 15, 2024, 1:43pm

I used the following Repo:

ARThink · June 4, 2024, 8:44am

Hi @PritamSriramG, I wanted to know which data set did you use?

PrithamSriramG · June 4, 2024, 11:33am

Hi,

I have used a custom data which I generated. It is of the format specified by the Repo and I was able to finetune using the github repo. The problem arises when I try to use HF libraries. I want to train using HF because, Im planning to host the model in cloud. So, it would be great if you could provide me information on how to finetune using HF or any info on how to host the finetuned model.

nielsr · June 4, 2024, 6:07pm

Hi,

Fine-tuning of LLaVa-Next should now work out-of-the-box as shown in the notebook here: Transformers-Tutorials/LLaVa at master · NielsRogge/Transformers-Tutorials · GitHub. Make sure to replace the processor, model and chat templates by the one of LLaVa-Next instead of LLaVa.

Regarding deployment, TGI (a framework meant for deployment of LLMs and multimodal models) now supports LLaVa 1.6. See the guide here: Vision Language Model Inference in TGI. Besides that, vLLM also has added support for it: Supported Models — vLLM

PrithamSriramG · June 10, 2024, 6:23pm

Hi @nielsr ,

Thanks for your suggestions. When I was trying to run your notebook, I got the following error:

ImportError: Usingbitsandbytes8-bit quantization requires Accelerate:pip install accelerateand the latest version of bitsandbytes:pip install -i Simple index bitsandbytes

at the line:

model = LlavaForConditionalGeneration.from_pretrained(

even after installing the libraries, it remained the same. Can you kindly let me know how to go ahead with this issue?

nielsr · June 10, 2024, 7:07pm

Hi,

Are you running on a CUDA compatible device (not on CPU)? Otherwise restarting the runtime might help.

Topic		Replies	Views
Deploying Fine-tune LLama3 🤗AutoTrain	0	277	May 9, 2024
How to push model trained with pytorch_lightning in hugging face? Models	0	979	October 17, 2021
Deploying LLaVA model on amazon EC2 Beginners	1	333	December 21, 2023
Error when running code from recently-posted Deeplearning.ai video that uses HF libraries (among others) Beginners	0	477	August 1, 2023
Turning a LLaMA model into a LLaVA Beginners	0	94	June 24, 2024

How to Finetune and deploy LLaVA-1.6

Related topics