Thank you so much for your reply and helping me to go on. For now it looks like I have to use MultiLabel Classification instead of MultiClass…does it make any difference in the way I set up the transformer?
If a given review can have more than 1 label, then it’s a multi-label text classification problem indeed.
The only thing you’ll need to change is setting the problem_type to multi_label_classification when instantiating an xxxForSequenceClassification model. Suppose that we have 7 different labels and we want to do multi-label classification, then you can for example instantiate a BERT model as follows:
from transformers import BertForSequenceClassification
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", problem_type="multi_label_classification", num_labels=7)