Zero-shot classification using models not explicitly meant for that?

Hello

I have recently discovered XLM Roberta and Bart-large-mnli which are models that can really easily be used for zero-shot classification (with custom labels and hypothetis…), using HF pipeline (transformers).

I am looking for more powerful models doing the same thing but I have not found anything really interesting. It is quite important that model in question is able to understand french (roberta does) and very big model should do as well.

I saw that Mistral-Small-Instruct-2409 is a great model, which can be used for commercial use (I am doing this for work).

My question is then:
it is technically possible to adapt this kind of bigger models for zero-shot classification, using HF transformers pipiline, or not ? then do you have any hint on how to proceed?

Thx a lot

References

1 Like

It seems to be possible. ~Instruct is usually probably adjusted to be suitable for chatbots, etc., but I think ~Base is intended for use in various tasks.

1 Like