About the Inference Endpoints on the Hub category
|
|
2
|
1630
|
April 2, 2023
|
Inference API stopped working
|
|
20
|
416
|
April 21, 2025
|
Inference API returns 504 error for Llama-3.2-3B-Instruct & google/gemma-2-2b-it
|
|
3
|
18
|
April 21, 2025
|
Inference API error with Whisper, return_timestamps parameter
|
|
11
|
98
|
April 20, 2025
|
Error 400 - when I update endpoints to lastest version
|
|
3
|
40
|
April 20, 2025
|
Constant 503 error for several days when running LLAMA 3.1
|
|
2
|
123
|
April 19, 2025
|
Inference benchmark (vllm with nginx)
|
|
1
|
25
|
April 17, 2025
|
Too large to be loaded automatically (16GB > 10GB) issue with QWEN 2.5 VL 7B
|
|
2
|
56
|
April 15, 2025
|
HF Inference API last few minutes returns the same 404 exception to all models
|
|
37
|
450
|
April 15, 2025
|
Inference API cost changed for meta-llama-3.3-70b?
|
|
3
|
49
|
April 13, 2025
|
Tool calling gets stuck in an infinite loop
|
|
2
|
34
|
April 12, 2025
|
Inference provider request
|
|
2
|
22
|
April 9, 2025
|
402 Client Error: Payment Required for url: https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.3-70B-Instruct/v1/chat/completions (Request ID: Root=1-67e420cf-1ec0ac2a3a3102965c52fe0f;8fe0d876-5406-4953-9fd4-e7b03cd17bb5)
|
|
2
|
86
|
April 9, 2025
|
List models accessible via InferenceClient?
|
|
1
|
31
|
April 9, 2025
|
I'm having an error message working with my User access tokens
|
|
17
|
11390
|
April 4, 2025
|
HF Inference API: 503/504 Server Error
|
|
1
|
92
|
April 1, 2025
|
HF Inference Endpoints Error 429
|
|
2
|
48
|
March 27, 2025
|
HuggingFace Inference API cannot determine image type of the image I am sending
|
|
2
|
17
|
March 21, 2025
|
Failed to Initialize MPT-7B endpoint due to 'trust_remote_code' Error
|
|
3
|
1237
|
March 19, 2025
|
Serverless inference issues for a new Go library
|
|
4
|
24
|
March 18, 2025
|
504 error with serverless HF Inference API
|
|
1
|
25
|
March 17, 2025
|
Embedding endpoint returning [None] embeddings
|
|
3
|
114
|
March 12, 2025
|
Issue with ALLaM-7B Model in Inference API - Size Limitation Error
|
|
1
|
36
|
March 7, 2025
|
Hugging face inference support and quota
|
|
3
|
84
|
March 7, 2025
|
Request to Serverless Inference API failed with 400 status code
|
|
2
|
153
|
March 4, 2025
|
Getting 504 HTTP error status using serverless HF inference api
|
|
4
|
55
|
March 3, 2025
|
Inference Endpoint can't access private LoRA
|
|
4
|
26
|
February 28, 2025
|
Payment method in hugging face
|
|
1
|
108
|
September 9, 2024
|
Multiple queries at same time to same endpoint
|
|
2
|
23
|
February 8, 2025
|
Fail to deploy newer models
|
|
4
|
150
|
February 5, 2025
|