Zero-shot classification using models not explicitly meant for that?

MALEAU · February 26, 2025, 12:08pm

Hello

I have recently discovered XLM Roberta and Bart-large-mnli which are models that can really easily be used for zero-shot classification (with custom labels and hypothetis…), using HF pipeline (transformers).

I am looking for more powerful models doing the same thing but I have not found anything really interesting. It is quite important that model in question is able to understand french (roberta does) and very big model should do as well.

I saw that Mistral-Small-Instruct-2409 is a great model, which can be used for commercial use (I am doing this for work).

My question is then:
it is technically possible to adapt this kind of bigger models for zero-shot classification, using HF transformers pipiline, or not ? then do you have any hint on how to proceed?

Thx a lot

References

John6666 · February 26, 2025, 3:58pm

It seems to be possible. ~Instruct is usually probably adjusted to be suitable for chatbots, etc., but I think ~Base is intended for use in various tasks.

Topic		Replies	Views
Alternative approaches for text classification task 🤗Transformers	0	426	October 25, 2022
Zero shot learning classification Beginners	2	909	December 8, 2020
Fine tune model='facebook/bart-large-mnli' Intermediate	0	1271	May 16, 2022
Zero shot classification with manual pytorch Beginners	0	720	August 27, 2021
New pipeline for zero-shot text classification 🤗Transformers	107	71715	February 17, 2025

Zero-shot classification using models not explicitly meant for that?

References

Related topics