Hey there,
my goal is to run Efficient-Large-Model/VILA-7b on a Jetson device through Ollama. As far as I can tell, there's no out-of-the-box way to convert the model weights into the .gguf format without losing the vision component. I did some digging and found /app/examples/convert_legacy_llama.py in llama.cpp, but it still doesn't handle the vision tower.
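For context, the closest workflow I've found is the LLaVA one in llama.cpp's examples/llava, which splits a checkpoint into a language-model GGUF plus a separate multimodal projector (mmproj) GGUF. Roughly what I tried is below — the model paths are just placeholders, and I'm not certain the surgery script understands VILA's checkpoint layout, which may be exactly where it breaks:

    # split the projector tensors out of the checkpoint (LLaVA workflow, untested on VILA)
    python ./examples/llava/llava_surgery.py -m ./VILA-7b

    # convert the vision encoder + projector into an mmproj GGUF
    # (the CLIP path is a guess -- whichever vision tower the checkpoint actually uses)
    python ./examples/llava/convert_image_encoder_to_gguf.py \
        -m ./clip-vit-large-patch14-336 \
        --llava-projector ./VILA-7b/llava.projector \
        --output-dir ./VILA-7b

    # convert the language model itself, skipping the tensors the converter doesn't recognize
    python ./examples/convert_legacy_llama.py ./VILA-7b --skip-unknown

If that ever produced both GGUFs, my understanding is that Ollama can attach the projector via a second FROM line in the Modelfile (that's how the LLaVA imports I've seen do it, so treat this as an assumption for VILA; filenames are placeholders too):

    FROM ./VILA-7b/ggml-model-f16.gguf
    FROM ./VILA-7b/mmproj-model-f16.gguf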
Maybe I'm missing some obvious solution, but so far I haven't been able to work around it. Any ideas on how to convert the model properly?
Any pointers, guides, tutorials, etc. toward achieving the above would be much appreciated!