Fine tune Zero-shot classification on multi-label dataset

ltrovato · November 27, 2023, 4:16pm

Hi all,
I started a small project in which I am trying to fine-tune a zero-shot classification model on a proprietary dataset. I was thinking of using the NLI approach, building entailment and contradiction statements for each of my sentence/label pairs.

I have a dataset with sentences and for each of them multiple true labels.

However, I am not sure what the best way to approach this is, given that in the literature I have only seen the case where there is a single label per sentence.

For example:

Sentence 1. Classes = [‘A’,‘B’,‘C’]

Should I build my dataset by generating three different samples:

Sentence 1. This is about ‘A’ + Entailment label
Sentence 1. This is about ‘B’ + Entailment label
Sentence 1. This is about ‘C’ + Entailment label

or generating only one as follows:

Sentence 1. This is about A, B, C. + Entailment label
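To make the two options concrete, here is a minimal sketch of how each one would turn a sentence and its labels into NLI training pairs. The helper names and the hypothesis template are purely illustrative and do not come from any library.

```python
# Hypothetical hypothesis template, matching the wording used in the examples above.
TEMPLATE = "This is about {}."

def one_pair_per_label(sentence, labels):
    """Option 1: one (premise, hypothesis, label) triple per true label."""
    return [(sentence, TEMPLATE.format(label), "entailment") for label in labels]

def one_pair_all_labels(sentence, labels):
    """Option 2: a single triple whose hypothesis lists every true label."""
    return [(sentence, TEMPLATE.format(", ".join(labels)), "entailment")]

option_1 = one_pair_per_label("Sentence 1.", ["A", "B", "C"])
option_2 = one_pair_all_labels("Sentence 1.", ["A", "B", "C"])
```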

I am happy to hear any other ideas on this.

Thanks a lot!

panigrah · November 28, 2023, 9:53am

Here is one approach, depending on the number of labels you have:

github.com

NielsRogge/Transformers-Tutorials/blob/master/BERT/Fine_tuning_BERT_(and_friends)_for_multi_label_text_classification.ipynb

ltrovato · November 28, 2023, 12:10pm

Hi,
thanks for pointing to this resource, but this is useful only for a classic multi-label classification problem.

I am looking into fine-tuning a zero-shot classification model using the entailment-contradiction approach.

panigrah · November 28, 2023, 12:44pm

I'm curious why you wouldn’t treat it as a multi-label classification problem. Is there a reason it needs to be NLI? It will help me learn! Thanks

panigrah · November 30, 2023, 2:32pm

According to this here

The recommended approach is what you are suggesting in option 1: present each of your multiple labels as a separate entailment. The author also suggests presenting an equal number of contradictions.
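A rough sketch of that idea, assuming you know the full label set so that contradictions can be sampled from the labels that do not apply to a sentence. The function name, the template, and the example label set are all made up for illustration. The string labels would also need to be mapped to whatever integer ids your NLI checkpoint uses.

```python
import random

def build_nli_examples(sentence, true_labels, all_labels,
                       template="This is about {}.", rng=None):
    """Build one entailment pair per true label, plus an equal number of
    contradiction pairs sampled from the labels that do not apply."""
    rng = rng or random.Random(0)  # fixed seed so the sketch is reproducible

    examples = [
        {"premise": sentence, "hypothesis": template.format(label), "label": "entailment"}
        for label in true_labels
    ]

    # Candidate negatives: every known label that is not a true label.
    negatives = [label for label in all_labels if label not in true_labels]
    # Cannot sample more negatives than exist.
    k = min(len(true_labels), len(negatives))
    for label in rng.sample(negatives, k):
        examples.append(
            {"premise": sentence, "hypothesis": template.format(label), "label": "contradiction"}
        )
    return examples

examples = build_nli_examples(
    "Sentence 1.",
    true_labels=["A", "B", "C"],
    all_labels=["A", "B", "C", "D", "E", "F", "G"],  # hypothetical full label set
)
```

If a sentence has more true labels than there are remaining labels, this sketch simply emits fewer contradictions, so the balance is only approximate in that case.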
