PEFT + Inference
|
|
3
|
955
|
January 15, 2024
|
Custom inference endpoint with multiple models
|
|
3
|
463
|
January 12, 2024
|
Model configuration in new inference endpoints page
|
|
0
|
325
|
January 12, 2024
|
ERROR: The size of tensor a () must match the size of tensor b () at non-singleton dimension 1
|
|
0
|
909
|
January 11, 2024
|
How to set ignore_mismatched_sizes=True in InferenceClient
|
|
0
|
366
|
January 10, 2024
|
Secrets for custom inference endpoint?
|
|
2
|
486
|
January 8, 2024
|
API to scrape billing information
|
|
0
|
179
|
January 7, 2024
|
How can I change the max_length of my own model in huggingface inference API?
|
|
0
|
331
|
January 5, 2024
|
Allow Multiple Processes at Once
|
|
0
|
292
|
January 2, 2024
|
ERROR | Expected a cuda device, but got: cpu
|
|
1
|
948
|
January 1, 2024
|
Cannot Setup Mixtral Models and Other Models on Inference Endpoints
|
|
1
|
408
|
December 22, 2023
|
504 Gateway Time-out in Inference Server Endpoints
|
|
6
|
1842
|
December 21, 2023
|
Truncated output on mistralai/Mistral-7B-Instruct-v0.1
|
|
4
|
1747
|
December 21, 2023
|
Deploying CLIP-Vit as an inference endpoint
|
|
1
|
452
|
December 20, 2023
|
Stuck Inference endpoint
|
|
1
|
363
|
December 14, 2023
|
Difference between python and rust token size
|
|
0
|
165
|
December 13, 2023
|
Model Deployment Error
|
|
0
|
226
|
December 12, 2023
|
Unable to start inference endpoint: not enough hardware capacity
|
|
6
|
1213
|
December 12, 2023
|
Issue with Salesforce/blip-image-captioning-large Endpoint: "input_ids or inputs_embeds" Error
|
|
1
|
467
|
December 12, 2023
|
General Automation Chat: Explorando las Fronteras de la Automatización Conversacional
|
|
0
|
180
|
December 12, 2023
|
I am facing errors when using my endpoint
|
|
4
|
899
|
December 11, 2023
|
Endpoint usage and cost stays at 0$
|
|
0
|
187
|
December 9, 2023
|
Image-segmentation pipleline seems broken
|
|
1
|
215
|
December 7, 2023
|
Getting no config error while creating inference endpoint
|
|
0
|
207
|
December 5, 2023
|
Key Error when trying to deploy inference endpoint
|
|
2
|
787
|
December 3, 2023
|
Feature Suggestion! running large gguf models!
|
|
0
|
520
|
December 3, 2023
|
Cannot create new endpoints: WebserverFailed
|
|
1
|
765
|
November 30, 2023
|
Unable to generate more than one token at a time using website API
|
|
1
|
291
|
November 29, 2023
|
Formatting Inference API call for LLama 2
|
|
3
|
11743
|
November 23, 2023
|
(Tips) Optimizing Underutilized Resources
|
|
0
|
267
|
November 15, 2023
|