Can't load the model using AutoModelForCausalLM

Hi,

I tried to load Llama Vision using AutoModelForCausalLM.from_pretrained, but I am getting the error below.
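For reference, the call looks roughly like this (the model ID is a placeholder for the vision checkpoint I'm using; substitute your own):

from transformers import AutoModelForCausalLM

# Placeholder model ID for illustration; the actual checkpoint may differ.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-11B-Vision")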

raise ValueError(
ValueError: The checkpoint you are trying to load has model type mllama_text_model but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

You can update Transformers with the command pip install --upgrade transformers. If this does not work, and the checkpoint is very new, then there may not be a release version that supports this model yet. In this case, you can get the most up-to-date code by installing Transformers from source with the command pip install git+https://github.com/huggingface/transformers.git

Any thoughts?
Thank you

I think it’s as the error message says…
If the error persists after upgrading, there might be another cause, but upgrading comes first.

pip install -U transformers
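If it still errors after upgrading, it's worth confirming which version actually gets imported (a stale environment is a common culprit):

python -c "import transformers; print(transformers.__version__)"

If I remember correctly, Mllama support landed in transformers v4.45.0, so any older version will raise this error.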

I upgraded the Transformers package, yet the issue remains.

Hmm, would the same situation occur if you used MllamaForConditionalGeneration or MllamaForCausalLM instead of AutoModelForCausalLM?
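Something like this, as a rough untested sketch (I'm assuming the meta-llama/Llama-3.2-11B-Vision checkpoint; adjust to whichever one you're loading):

import torch
from transformers import AutoProcessor, MllamaForConditionalGeneration

model_id = "meta-llama/Llama-3.2-11B-Vision"  # assumed checkpoint; substitute yours

# The Llama vision checkpoints are Mllama models (vision + text), so the
# dedicated class should resolve the architecture directly.
model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)

You'll still need a recent enough Transformers for the Mllama classes to exist, though.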