The inference widget for text generation stays stuck at model loading for a while and eventually fails with a “model time out” error.
This happens for every model I fine-tuned with LoRA using Unsloth and pushed to the Hub merged to float16, for example: bmi-labmedinfo/Igea-1B-Instruct-v0.1
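For context, the merge-and-push step looks roughly like this (a sketch, not my exact script; the adapter directory, sequence length, and token are placeholders, and I'm assuming Unsloth's `FastLanguageModel` / `push_to_hub_merged` API here):

```python
from unsloth import FastLanguageModel

# Load the LoRA-fine-tuned model (directory and values are placeholders)
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="lora_output_dir",  # directory containing the trained adapters
    max_seq_length=2048,
    load_in_4bit=True,
)

# Merge the adapters into float16 weights and push the result to the Hub
model.push_to_hub_merged(
    "bmi-labmedinfo/Igea-1B-Instruct-v0.1",
    tokenizer,
    save_method="merged_16bit",
    token="hf_...",  # placeholder token
)
```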
Other info:
- same issue for gated and ungated models
- no issue running locally with AutoModelForCausalLM.from_pretrained() followed by model.generate() (minimal snippet below)
- no issue with quantized versions running in HF Spaces
- the browser console shows ‘503 (Service Unavailable)’ after the page loads, then ‘504 (Gateway Timeout)’
- the problem has persisted since last week
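For reference, this is roughly the local setup that works without issues (a minimal sketch; the prompt and generation parameters are placeholders, not my exact ones):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bmi-labmedinfo/Igea-1B-Instruct-v0.1"

# Load tokenizer and merged float16 model from the Hub
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Run a simple generation; this works fine locally
inputs = tokenizer("Ciao, come stai?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```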