Theme Extraction from Text

jeewankarmakar · December 29, 2023, 9:06am

I’m embarking on a project that involves creating a text classification model using Hugging Face’s transformers. The goal is to categorize a diverse dataset into a set of broad, predefined themes. Additionally, the model should be capable of suggesting new themes for entries that don’t fit into the existing categories.

I am not sure if this would be a classification since here number of classes can be huge in hundreds. Also if I choose topic modelling it may give distnct themes for even similar text entries.

Please suggest how to approach this.

nielsr · December 29, 2023, 4:27pm

Hi,

This looks more like a clustering problem. See for instance this page: Clustering — Sentence-Transformers documentation.

Topic		Replies	Views
Advice on Transformer Models for EDU Segmentation and Topic/Sentiment Analysis in Hugging Face Beginners	0	385	January 12, 2024
How can I implement this BERT model for sequential sentences classification using HuggingFace? Beginners	1	794	September 10, 2023
Products text classification Beginners	0	1130	February 21, 2023
Conceptual questions about transformers 🤗Transformers	10	1083	August 26, 2021
Educational sentences classification Beginners	0	271	October 26, 2023

Theme Extraction from Text

Related topics