Serving AWQ models without a custom container | 2 | 210 | November 13, 2023
How to Query the Progress of Inference on a Custom Endpoint Handler? | 1 | 395 | November 10, 2023
Model working on free api but not on paid one | 0 | 202 | November 9, 2023
https://huggingface.co/LegacyAI/VT/blob/main/handler.py | 0 | 131 | November 9, 2023
ModuleNotFoundError: No module named 'cv2' | 1 | 823 | November 8, 2023
Custom containers - setting args | 0 | 191 | November 7, 2023
Llama 70b returning incomplete responses | 0 | 348 | November 7, 2023
How to access binary files in for custom inference endpoints? | 1 | 206 | November 6, 2023
Inference end points - add payment failing | 11 | 1182 | November 2, 2023
Can't create endpoint for private model | 3 | 625 | October 30, 2023
Image to Text API Inference - Input Error | 0 | 305 | October 30, 2023
Endpoint not returning stop token on mistral models | 2 | 2869 | October 27, 2023
How can I deploy a Llama2-like model in int4/int8 on inference endpoints? | 0 | 972 | October 27, 2023
Error Deploying Private Endpoint | 2 | 244 | October 23, 2023
[Server message]Load balancer not ready yet | 6 | 655 | September 20, 2023
How Huggingface pricing works for model Deployment? | 2 | 1703 | October 20, 2023
Same prompt to zephyr-chat provides different results from two interfaces. What is the difference? | 0 | 511 | October 17, 2023
Image-To-Text task on Inference Endpoint | 13 | 1660 | October 17, 2023
How to Pass the Conversation as Input in the Mistral Instruct Inference API | 3 | 2171 | October 12, 2023
Stable Diffusion Inpaint Pipeline | 0 | 240 | September 29, 2023
Endpoint with adapter-transformers won't start up | 0 | 234 | September 25, 2023
Model output is cutoff | 4 | 2600 | September 25, 2023
Can inference endpoints be used in Spaces? | 1 | 414 | September 25, 2023
TextGeneration Inference Model | 2 | 252 | September 22, 2023
What is 'Killed uvicorn webservice_starlette' Error? | 0 | 201 | September 21, 2023
Inference Endpoints - Best thing since sliced bread? | 1 | 302 | September 20, 2023
Errors running Inference Endpoint with quantized model | 2 | 626 | September 14, 2023
Handler.py not executed in Inference Endpoint | 0 | 212 | September 13, 2023
Calculate costs for multiple models in same machine | 0 | 251 | September 5, 2023
Inference API CORS blocked | 0 | 373 | September 1, 2023