Best Approach for Text Taxonomy Classification

hchittilla · December 26, 2022, 8:31pm

I’m trying to build a model that given a text field (e.g. product description), examples are classified according to the taxonomy.

Consider an excerpt of the taxonomy below

Category	Item	Brand	Model
Home and Kitchen	Sofa
Fashion	Shoe	Nike	airforce 1
Fashion	Shoe	Nike	airmax
Fashion	Shoe
Fashion	Purse

I’m interested in using a fine-tuning BERT approach to tackle this problem, but am unsure how to address the following characteristics

The taxonomy has variable depth
Examples can apply to multiple rows of the taxonomy. (i.e. the problem is multilabel)
Class imbalance will play a huge challenge

I’m not sure if I should be using a single model to predict on all, a model per level (Category, item, brand, model, etc.), a nested model approach.

Any advice is useful!

Topic		Replies	Views
E-commerce item category prediction using item description Beginners	0	602	June 16, 2022
Multi-class Classification Basics Beginners	4	4545	August 24, 2021
Getting explanation for BERT classifications Beginners	1	530	January 11, 2023
Which Bert model should we use for this problem. Next Word prediction using LM? Or Keyword Extraction problem? 🤗Transformers	1	1340	September 10, 2021
Best solution to train multiclass model Beginners	0	307	March 30, 2022