How to use llava with huggingface

YaTharThShaRma999 · August 27, 2023, 2:52am

I want to use a 7b llava model with huggingface but I can’t really find any docs to use it? Any help would be great

tocsa · September 27, 2023, 8:49pm

I deployed a model to SageMaker with the SageMaker deployment card HF provides. Currently this model: hxxps://huggingface.co/liuhaotian/llava-llama-2-13b-chat-lightning-preview/discussions/3
However one of my concerns is that the card states 'HF_TASK': 'text-generation' whereas Llava Llama is rather a text to image / image ‘question-answering’ type of model.
This topic states transformers need tinkering: hxxps://discuss.huggingface.co/t/can-text-to-image-models-be-deployed-to-a-sagemaker-endpoint/20120
So I still haven’t got it working. Plus I didn’t have enough quota on AWS to deploy it in a half decent box with GPU so it’ll be another question if the box can carry its weight at all. I’m surprised noone helped so far to me neither in HF model discussions, GitHub discussions (hxxps://github.com/haotian-liu/LLaVA/discussions/454) or other forums.

tocsa · September 28, 2023, 7:49am

This HuggingFace discussion says hxxps://discuss.huggingface.co/t/can-text-to-image-models-be-deployed-to-a-sagemaker-endpoint/20120 that an inference.py need to be created. I don’t know what the Llava Llama has though. I tried to look at the files of the model, but I don’t see relevant meta data about this.

This StackOverflow entry hxxps://stackoverflow.com/questions/76197446/how-to-do-model-inference-on-a-multimodal-model-from-hugginface-using-sagemaker is about a serverless deployment case, but it uses a custom TextImageSerializer serializer. Shoudl I try to use something like that?

My Stackoverflow entry: hxxps://stackoverflow.com/questions/77193088/how-to-perform-an-inference-on-a-llava-llama-model-deployed-to-sagemake-from-hug

Reddit: hxxps://www.reddit.com/r/LocalLLaMA/comments/16pzn88/how_to_parametrize_a_llava_llama_model/

dindjarin · January 2, 2024, 10:16pm

Check out the following blog post:

It uses HuggingFace Transformers Llava and Runhouse

For AWS SageMaker, you can check out this one: hxxps://www.run.house/blog/quickest-aws-sagemaker-deployment

nielsr · January 3, 2024, 9:49am

Hi,

LLaVa and BakLLaVa are now supported natively in the Transformers library

Docs: LLaVa

Checkpoints are on the hub: llava-hf (Llava Hugging Face).

system · January 19, 2024, 5:16pm

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can text-to-image models be deployed to a SageMaker endpoint? Amazon SageMaker	1	2007	July 8, 2022
How to make an inference for HuggingFaceModel of type 'image-to-text' Amazon SageMaker	0	501	January 27, 2024
Sagemaker deployment fails for local llama2 model Amazon SageMaker	3	2265	August 17, 2023
When deployed meta-llama/Llama-2-7b-chat-hf on sagemaker, it resulted in complete hallunciations Amazon SageMaker	0	297	March 11, 2024
HuggingFaceModel ignores code directory Amazon SageMaker	2	12	June 17, 2025

How to use llava with huggingface

Related topics