Inference API Context Window and ToS

Hello!

I was reading the Hugging Face serverless Inference API documentation, and there are plenty of warm models readily available to experiment with.

I do wonder, though: some of these models support context windows ranging from 32k to 128k tokens. Can one actually use these models at their full context length?

I’m currently under the impression that some limit is imposed on it, given the resources needed to run an LLM.
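
For what it’s worth, here is roughly how I’ve been probing the limit (a minimal sketch; the model name, the `HF_TOKEN` environment variable, and the ~20k-token estimate are just assumptions I picked for illustration):

```python
import os

from huggingface_hub import InferenceClient

# Example model with an advertised 32k context window; swap in whichever warm model you're testing.
client = InferenceClient(
    model="mistralai/Mistral-7B-Instruct-v0.2",
    token=os.environ.get("HF_TOKEN"),  # optional for public models, but avoids stricter rate limits
)

# Build a deliberately long prompt -- repeating "word " gives very roughly ~20k tokens,
# well inside the model's advertised window but possibly beyond what a capped endpoint accepts.
long_context = "word " * 20_000

try:
    out = client.text_generation(
        long_context + "\n\nSummarize the text above in one sentence.",
        max_new_tokens=128,
    )
    print(out)
except Exception as e:
    # If the serverless API enforces a lower input limit, the error surfaces here.
    print("Request rejected:", e)
```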

Additionally, regarding the ToS:

“Content” refers to any material posted, displayed, or accessed on our Website or Hub, including but not limited to code, data, text, graphics, images, applications, or software you, we, or any third party provide or make available.

So, out of curiosity, would the output you generate from using the Inference API fall under “Content”?

As in, you own the content you create and are solely responsible for it?

A clarification on this would be greatly appreciated. Thank you in advance!
