Fine tuned multiclass model

Rajaa1 · November 9, 2023, 10:59am

Hello everyone,

I’m working on a project where I need to create a fine-tuned model that can take a sentence as input and output scores for a group of labels. I have a labeled dataset with 10,000 records, but I’m unsure how to handle the label columns. Specifically, I need guidance on how to convert these labels into numerical format and which type of model would be suitable for this task.

Any help or suggestions would be greatly appreciated. Thank you!

panigrah · November 9, 2023, 12:39pm

Here is one approach. This just allocates a sequential number to each category.

Rajaa1 · November 9, 2023, 6:49pm

Thank you, that was really helpful! However, my dataset contains around 255 unique labels. Converting all of them to numerical values isn’t practical. How can I address this issue?
i think it is zero shot classification task I’m not sure

MattiLinnanvuori · November 11, 2023, 7:39am

Why isn’t it practical to convert all of them to numerical values? If you don’t want to, you can use zero-shot classification like in the following link but the result may not be as good. https://huggingface.co/docs/transformers/v4.35.0/en/main_classes/pipelines#transformers.ZeroShotClassificationPipeline

Rajaa1 · November 11, 2023, 8:06am

Thanks, I’ve converted the labels to numerical values. Now, I’m wondering if it’s possible to fine-tune a zero-shot classification model. If so, could you please share a link or guide me on how to do it? or i must use the pipeline directly without fine tune

MattiLinnanvuori · November 11, 2023, 9:03am

https://huggingface.co/docs/transformers/tasks/sequence_classification Yes, it is possible to finetune a zero-shot classification model as explained in the link above. It is also possible to use a pretrained model.

MattiLinnanvuori · November 11, 2023, 9:05am

https://discuss.huggingface.co/t/fine-tuning-zero-shot-models/15338 The link above may be better.

Rajaa1 · November 11, 2023, 9:23am

Thank you for the information. I believe the first link is for text classification, not specifically designed for zero-shot classification. Can you clarify if there’s any difference in fine tuned between the two? I want a zero-shot
Also, when fine-tuning a zero-shot classification model, what should the dataset format be? Should it be a labeled dataset with specific labels, or should it follow a specific structure, such as having two columns (premise and hypothesis) with labels like 1 (neutral), 0 (entailment), and 2 (contradiction)?

MattiLinnanvuori · November 11, 2023, 9:48am

https://discuss.huggingface.co/t/new-pipeline-for-zero-shot-text-classification/681/14#:~:text=Thanks%20for%20the,add%20a%20bit%3A The link above describes the finetuning of zero-shot models. The difference between finetuning of sequences and NLI is that the former uses a custom number of labels and only the sequence to classify and the latter uses three labels in the specific order and the sequence to classify followed by the putative entailment as specified in the link above. That link also suggests to use zero-shot classification only if you don’t have enough labeled data.

Topic		Replies	Views
Zero-shot classification fine-tuning Beginners	2	1193	March 18, 2022
Fine tune Zero-shot classification on multi-label dataset Models	4	3569	November 30, 2023
Predicting On New Text With Fine-Tuned Multi-Label Model Beginners	4	5152	December 23, 2021
Fine-tuning Zero-shot models Intermediate	4	6341	February 7, 2023
Fine-Tune for MultiClass or MultiLabel-MultiClass Models	52	69420	May 22, 2023

Fine tuned multiclass model

Related topics