Hello
I have recently discovered XLM Roberta and Bart-large-mnli which are models that can really easily be used for zero-shot classification (with custom labels and hypothetis…), using HF pipeline (transformers).
I am looking for more powerful models doing the same thing but I have not found anything really interesting. It is quite important that model in question is able to understand french (roberta does) and very big model should do as well.
I saw that Mistral-Small-Instruct-2409 is a great model, which can be used for commercial use (I am doing this for work).
My question is then:
it is technically possible to adapt this kind of bigger models for zero-shot classification, using HF transformers pipiline, or not ? then do you have any hint on how to proceed?
Thx a lot