Hosted inference API - Limit output, text classification

Hey there,
I am currently building a model for text classification with around 30,000 classes.

Currently, the hosted Inference API breaks my model card because it tries to load all 30,000 labels (even though most of the scores are essentially 0). Is there a way to limit the API output to the 5 most relevant classes, like the `top_k` argument in the transformers text-classification pipeline?

I already checked the API documentation, but there seems to be no option to control this.
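To illustrate the behavior I'm after, here is a small sketch of the top-k truncation I'd like the API to do (label names and scores are made up; this is plain Python, not an actual API call):

```python
import heapq

def top_k_labels(scores, k=5):
    """Keep only the k highest-scoring labels from a full
    {label: score} prediction, similar to the pipeline's top_k."""
    return heapq.nlargest(k, scores.items(), key=lambda kv: kv[1])

# Hypothetical prediction over 30,000 classes, mostly near-zero scores
scores = {f"LABEL_{i}": 1.0 / (i + 1) for i in range(30_000)}
top5 = top_k_labels(scores, k=5)
print(top5)  # only 5 entries instead of 30,000
```

Something like this on the API side would keep the model card widget responsive instead of rendering all 30,000 labels.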

Thanks for your help!

Greetings, Philipp

P.S. I am talking about this model