Serverless Inference API Token Limits/Settings

Hi,

As the topic title says:
My request: where can I find information about the token limits/settings for the free Serverless Inference API?

Context:
I am building an automation for testing purposes in Make.com that calls HF models, and the responses the models generate are incomplete/truncated.
Maybe there are limits, or maybe there are settings to adjust in the Make module "HTTP: Make an API Key Auth Request".
NB: my knowledge is limited.

NB: before opening this topic I consulted the links below without finding an answer.
https://huggingface.co/docs/api-inference/rate-limits
https://discuss.huggingface.co/search?q=API%20limit%20token

Any help is welcome.
Thank you.


For the record, useful documentation:
https://huggingface.co/docs/api-inference/tasks/text-generation?code=js#api-specification
https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1?inference_api=true

It would appear that there is a default limit of 500 tokens.
I still need to figure out how to send the `max_new_tokens` parameter along with my request from a Make module.
I am continuing my research on
https://community.make.com/c/how-to/66
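For anyone landing here later, here is a minimal sketch of what the request body needs to look like, based on the text-generation API specification linked above. The model name and the `max_new_tokens` value of 1000 are just examples; in Make you would paste the JSON body string into the HTTP module rather than run this code.

```python
import json

# Example model from this thread; swap in the model you are calling.
MODEL = "mistralai/Mixtral-8x7B-Instruct-v0.1"
API_URL = f"https://api-inference.huggingface.co/models/{MODEL}"

def build_request(prompt: str, max_new_tokens: int = 1000) -> dict:
    """Build the pieces of an 'API Key Auth' HTTP request.

    Setting 'parameters.max_new_tokens' raises the default generation
    cap (reported above to be 500 tokens) so responses are not truncated.
    The API token itself goes in the Authorization header, configured
    separately in the Make module.
    """
    return {
        "url": API_URL,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({
            "inputs": prompt,
            "parameters": {"max_new_tokens": max_new_tokens},
        }),
    }

request = build_request("Explain token limits in one paragraph.")
print(request["body"])
```

The key point is that `max_new_tokens` sits inside a `parameters` object next to `inputs`, not at the top level of the body.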

Regards

