I want to use the “facebook/bart-large-mnli” model for an NLI task.
I have a dataset with premise and hypothesis columns and labels [0, 1, 2].
How can I use this model for that NLI task?
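For reference, a row of the dataset looks roughly like this (the text values here are made up just for illustration):

{'premise': 'A man is playing a guitar on stage.',
 'hypothesis': 'A person is performing music.',
 'label': 0}  # label is one of 0, 1, 2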
I wrote the following code:
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
nli_model = AutoModelForSequenceClassification.from_pretrained('facebook/bart-large-mnli')
tokenizer = AutoTokenizer.from_pretrained('facebook/bart-large-mnli')
nli_model.to(device)
i = 0  # check the first example
premise = tokenized_datasets['TRAIN'][i]['premise']
hypothesis = tokenized_datasets['TRAIN'][i]['hypothesis']
x = tokenizer.encode(premise, hypothesis, return_tensors='pt', truncation='only_first')
logits = nli_model(x.to(device))[0]
entail_contradiction_logits = logits[:,[0,2]]
probs = entail_contradiction_logits.softmax(dim=1)
probs
but I got only 2 values: tensor([[8.8793e-05, 9.9991e-01]], device='cuda:0', grad_fn=<SoftmaxBackward0>) instead of 3 values (contradiction, neutral, entailment).
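My guess is that the logits[:, [0, 2]] slicing (which I believe comes from the zero-shot classification example on the model card) is what throws away the third class. If I simply take the softmax over all three logits, I assume I would get three probabilities, but I'm not sure which index corresponds to which label:

probs = logits.softmax(dim=1)   # softmax over all 3 logits instead of only [0, 2]
pred = probs.argmax(dim=1)      # predicted class index, but which index is which label?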
How can I use this model for NLI, i.e. predict the correct one of the 3 labels?