The model mistralai/Mistral-7B-Instruct-v0.1 is too large to be loaded automatically (14GB > 10GB)

I was using the model with the Inference API. Initially it was working fine, but after 2-3 hours, in the evening, I started to get this error.
How can I resolve this and use the model with a free access token?

The same happened with Qwen 2.5 VL 7B, and I think they have removed it from the Hugging Face serverless service (the Hugging Face Inference API). Try a different provider for the API call, or use a dedicated endpoint (this worked for me).
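For reference, a minimal sketch of what a direct serverless call looks like, so you can see where the token and model ID go. The model ID is the one from this thread; the `HF_TOKEN` environment variable and the `hf_xxx` placeholder are assumptions for illustration. Note that model availability on the serverless tier changes over time, which is exactly the problem described above.

```python
import json
import os
import urllib.request

# Model from this thread; whether it is still served on the free tier varies.
MODEL_ID = "mistralai/Mistral-7B-Instruct-v0.1"
API_URL = f"https://api-inference.huggingface.co/models/{MODEL_ID}"


def build_request(prompt: str, token: str):
    """Assemble URL, headers, and JSON body for a serverless Inference API call."""
    headers = {
        "Authorization": f"Bearer {token}",  # your HF access token goes here
        "Content-Type": "application/json",
    }
    body = json.dumps({"inputs": prompt}).encode("utf-8")
    return API_URL, headers, body


# "hf_xxx" is a placeholder; set HF_TOKEN to use a real token.
url, headers, body = build_request("Hello!", os.environ.get("HF_TOKEN", "hf_xxx"))

# Only send the request if a real token is configured.
if os.environ.get("HF_TOKEN"):
    req = urllib.request.Request(url, data=body, headers=headers)
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp))
```

If the model has been removed from the serverless tier, this call will return an error no matter what token you use, and switching provider or deploying a dedicated endpoint is the workaround.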

I think this will only be fixed if someone applies to have the model served again.