|
Huggingface token usage for routed requests for a custom provider
|
|
0
|
50
|
June 26, 2025
|
|
Inference result not aligned with local version of same model and revision
|
|
15
|
91
|
June 26, 2025
|
|
HF Inference API last few minutes returns the same 404 exception to all models
|
|
45
|
2336
|
June 25, 2025
|
|
Requirements for Hosting LLM via Inference Endpoints
|
|
2
|
66
|
June 13, 2025
|
|
Inference API stopped working
|
|
50
|
5703
|
June 8, 2025
|
|
404 Error When Calling the Hugging Face Inference API via Dify
|
|
3
|
262
|
June 2, 2025
|
|
How to run agents from `smolagents` locally?
|
|
4
|
972
|
May 27, 2025
|
|
Rejected Endpoint
|
|
1
|
50
|
May 20, 2025
|
|
Has inference API stopped returning text embeddings?
|
|
1
|
87
|
May 17, 2025
|
|
Inference API Rate Limits
|
|
1
|
367
|
May 16, 2025
|
|
Cerebras Inference Error
|
|
0
|
73
|
May 12, 2025
|
|
Inference endpoint taking forever to initialize
|
|
1
|
62
|
May 12, 2025
|
|
Unable to get inference results after deploying model to Inferende Endpoints
|
|
0
|
18
|
May 8, 2025
|
|
Cannot use Inference Provider. 429 error. First time usage
|
|
6
|
80
|
May 5, 2025
|
|
Cannot execute any model with my API Token, models are timed out
|
|
6
|
2921
|
May 1, 2025
|
|
HFAPIModel pricing
|
|
2
|
53
|
April 30, 2025
|
|
Error 402 while using smolagents with a valid token
|
|
7
|
81
|
April 30, 2025
|
|
RuntimeError: The size of tensor a (48) must match the size of tensor b (64) at \nnon-singleton dimension 0"}
|
|
1
|
218
|
April 29, 2025
|
|
Inference API error with Whisper, return_timestamps parameter
|
|
13
|
964
|
April 25, 2025
|
|
Constant 503 error for several days when running LLAMA 3.1
|
|
5
|
429
|
April 25, 2025
|
|
Inference API returns 504 error for Llama-3.2-3B-Instruct & google/gemma-2-2b-it
|
|
3
|
36
|
April 21, 2025
|
|
Error 400 - when I update endpoints to lastest version
|
|
3
|
62
|
April 20, 2025
|
|
Inference benchmark (vllm with nginx)
|
|
1
|
130
|
April 17, 2025
|
|
Too large to be loaded automatically (16GB > 10GB) issue with QWEN 2.5 VL 7B
|
|
2
|
119
|
April 15, 2025
|
|
Inference API cost changed for meta-llama-3.3-70b?
|
|
3
|
295
|
April 13, 2025
|
|
Tool calling gets stuck in an infinite loop
|
|
2
|
346
|
April 12, 2025
|
|
Inference provider request
|
|
2
|
47
|
April 9, 2025
|
|
402 Client Error: Payment Required for url: https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.3-70B-Instruct/v1/chat/completions (Request ID: Root=1-67e420cf-1ec0ac2a3a3102965c52fe0f;8fe0d876-5406-4953-9fd4-e7b03cd17bb5)
|
|
2
|
455
|
April 9, 2025
|
|
List models accessible via InferenceClient?
|
|
1
|
113
|
April 9, 2025
|
|
HF Inference Endpoints Error 429
|
|
2
|
89
|
March 27, 2025
|