Unlabelled zero-shot-classification


I am working on a project that aims to classify surveys into categories, and I'm currently using zero-shot-classification for this. With zero-shot-classification I can specify the labels I want it to classify into, but is there a way to avoid specifying the labels and have the model label the content automatically? What would I need to do to achieve this? Train a model? Use a model built specifically for this?

Perhaps you could extract the labels automatically first?

I must confess that, even after a lot of parameter tuning, this did not work well for me on a very messy email dataset.

What I ended up doing was fine-tuning a pre-trained model for keyword extraction. That was not great either, but somewhat better.

You can then couple that with your approach…
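To make the coupling concrete, here is a rough sketch of the two steps: extract candidate labels from the texts, then pass them to the zero-shot classifier. Note that it is not the fine-tuned keyword extractor mentioned above. It swaps in a very simple TF-IDF word ranking so the example stays self-contained. The names `STOPWORDS`, `extract_candidate_labels` and `classify_with_labels` are made up for illustration, and the second function assumes the Hugging Face `transformers` library is installed and can download the pipeline's default model.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real project would use a fuller one).
STOPWORDS = {"the", "and", "was", "for", "this", "that", "with", "too", "are", "not", "but"}


def tokenize(text):
    """Lowercase, keep alphabetic words longer than 2 characters, drop stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return [w for w in words if len(w) > 2 and w not in STOPWORDS]


def extract_candidate_labels(docs, top_k=5):
    """Rank words by their TF-IDF score summed over all documents.

    This is a crude stand-in for a proper keyword-extraction model.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: in how many documents each word appears.
    doc_freq = Counter(w for toks in tokenized for w in set(toks))
    scores = Counter()
    for toks in tokenized:
        if not toks:
            continue  # skip documents with no usable words
        for word, count in Counter(toks).items():
            tf = count / len(toks)
            idf = math.log((1 + n_docs) / (1 + doc_freq[word])) + 1  # smoothed IDF
            scores[word] += tf * idf
    return [word for word, _ in scores.most_common(top_k)]


def classify_with_labels(docs, labels):
    """Assign each document its best-scoring label via zero-shot classification."""
    # Imported here so the label extraction above works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("zero-shot-classification")
    # The pipeline returns labels sorted by score, highest first.
    return [classifier(doc, candidate_labels=labels)["labels"][0] for doc in docs]
```

You would call `extract_candidate_labels(texts)` once over the whole dataset, review or prune the resulting words by hand, and then pass them to `classify_with_labels(texts, labels)`. On messy data the extracted words are often noisy, so the manual review step is probably worth keeping.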

I am an absolute beginner in this field, by the way.