Calling Inference API for image embedding
|
|
0
|
785
|
August 28, 2023
|
OpenAPI spec for service deployed on Inference Endpoints
|
|
2
|
328
|
August 28, 2023
|
How to deploy a space on inference endpoint for autoscaling?
|
|
0
|
347
|
August 23, 2023
|
Spaces to API converting
|
|
0
|
382
|
August 16, 2023
|
Inference turned off for this model?
|
|
1
|
1653
|
August 15, 2023
|
Autoscaling is turned on to min replicas as 0. Yet costing money?
|
|
2
|
510
|
August 11, 2023
|
Model works in inference UI, but not on inference API
|
|
0
|
563
|
August 9, 2023
|
Cant load tokenizer using from_pretrained, `use_auth_token=True` error when token is being used
|
|
7
|
7668
|
August 6, 2023
|
Calling Inference API for text embedding
|
|
1
|
1867
|
August 4, 2023
|
Auo-replicas is not working
|
|
0
|
232
|
August 3, 2023
|
RuntimeError on trying to create Inference Endpoint
|
|
0
|
221
|
August 2, 2023
|
Inference Endpoint Pix2Struct Error
|
|
0
|
324
|
August 1, 2023
|
How to batch process 5mm prompts of llama 2 using inference endpoints?
|
|
0
|
1324
|
July 30, 2023
|
How can we maximize the GPU utilization in Inference Endpoints?
|
|
1
|
2257
|
July 20, 2023
|
Using public models for production
|
|
0
|
216
|
July 25, 2023
|
Is it possible to have streaming responses from inference endpoints?
|
|
6
|
2087
|
July 24, 2023
|
AutoProcessor AttributeError: 'NoneType' object has no attribute 'from_pretrained'
|
|
1
|
1669
|
July 21, 2023
|
Can we use Inference Endpoints to make input a music file and output a music file?
|
|
5
|
539
|
July 1, 2023
|
502 Bad Gateway Error for Flan-UL2 model
|
|
2
|
557
|
June 27, 2023
|
Estimating tokens per second
|
|
3
|
8484
|
June 27, 2023
|
How to add parameter in inference endpoint?
|
|
2
|
753
|
June 22, 2023
|
Is it possible to run and stop an endpoint using code/API to avoid be billed when it is not used?
|
|
6
|
1662
|
June 19, 2023
|
Subscription tiers descriptions unclear; e.g. "higher rate limits" doesn't specify what the rate limit is
|
|
0
|
1504
|
June 18, 2023
|
How do Inference Endpoints fit into larger solution?
|
|
0
|
412
|
June 17, 2023
|
Looking to hire an expert for deployment
|
|
0
|
240
|
June 10, 2023
|
Text-to-speech inference API doesn't respect accept headers
|
|
4
|
304
|
June 6, 2023
|
Can one get embeddings from an inference API that computes Sentence Similarity (in 2023)?
|
|
0
|
418
|
June 3, 2023
|
Custom inference handler - Bad Gateaway
|
|
0
|
323
|
June 2, 2023
|
Failed to Initialize Bloom-7B Due to Lack of CUDA memory
|
|
5
|
806
|
May 30, 2023
|
Inference API model timeout (Flan-UL2)
|
|
1
|
886
|
May 26, 2023
|