New pipeline for zero-shot text classification

joeddav · December 15, 2020, 2:37pm

Hey @charly, here’s a previous thread about that. The main tricks are going to be:

1. Use one of these distilled models, which are smaller and faster but give similar results.
2. Run with the ONNX Runtime. One way you can do this is with this project created by @valhalla before he joined Hugging Face.
3. If you're classifying long sequences, you can try truncating to just part of each sequence. That'll give you a speedup, but you'll have to evaluate how it affects your accuracy.
4. If you have a large number of candidate labels, try to come up with a heuristic or use a super lightweight classifier to identify the most likely candidates, and then feed in only those candidates rather than all of them.
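
To make the last two tricks concrete, here's a rough sketch of truncating the input and shortlisting labels before calling the zero-shot model. Everything in it is an illustrative assumption rather than part of the pipeline: `truncate_words`, `shortlist_labels`, the keyword-overlap heuristic, and the example keyword lists are all hypothetical, and a real heuristic would need to be tuned and evaluated on your own data.

```python
import re
from typing import Dict, List, Sequence, Set


def truncate_words(text: str, max_words: int) -> str:
    """Keep only the first `max_words` whitespace-separated words."""
    return " ".join(text.split()[:max_words])


def _tokens(text: str) -> Set[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def shortlist_labels(
    text: str,
    candidate_labels: Sequence[str],
    keywords: Dict[str, List[str]],
    top_k: int,
) -> List[str]:
    """Hypothetical heuristic: rank labels by keyword overlap with the text.

    Each label is scored by how many of its keywords (plus the words of the
    label itself) appear in the text. Ties keep the original label order.
    """
    text_tokens = _tokens(text)
    scored = []
    for position, label in enumerate(candidate_labels):
        label_keywords = _tokens(label) | {k.lower() for k in keywords.get(label, [])}
        overlap = len(label_keywords & text_tokens)
        # Negative overlap so that an ascending sort puts the best match first.
        scored.append((-overlap, position, label))
    scored.sort()
    return [label for _, _, label in scored[:top_k]]


# Made-up example data, for illustration only.
text = "The new graphics card renders games at 4K but the fan is loud"
labels = ["hardware", "cooking", "politics", "gaming", "travel"]
keywords = {
    "hardware": ["graphics", "card", "fan"],
    "gaming": ["games", "4k"],
    "cooking": ["recipe", "oven"],
}

short_text = truncate_words(text, max_words=8)
shortlist = shortlist_labels(text, labels, keywords, top_k=2)
print(short_text)
print(shortlist)
```

You'd then pass `short_text` and `shortlist` to the zero-shot pipeline instead of the full text and the full label list. Since the model is run once per candidate label, cutting five labels down to two should reduce the work roughly in proportion, but whether the shortlist still contains the right label is something you'd have to check on your data.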

Btw, if it's public, would you mind linking to your Streamlit app? It's always fun to see the ways that people are using it.
