I have successfully fine-tuned TFDistilBertForSequenceClassification to distinguish toxic comments from non-toxic ones in my datasets. Is there a way to use the same model to gauge which sentence in a pair of toxic sentences is more (or less) toxic? Is there a way to access the probability produced by the classifier to compare the toxicity of two toxic sentences?
Hi,
You can access the probabilities as follows:
from transformers import DistilBertTokenizer, TFDistilBertForSequenceClassification
import tensorflow as tf

# Load the tokenizer and model; in your case, pass the path to your
# fine-tuned checkpoint instead of the base 'distilbert-base-uncased' weights.
tokenizer = DistilBertTokenizer.from_pretrained('distilbert-base-uncased')
model = TFDistilBertForSequenceClassification.from_pretrained('distilbert-base-uncased')

inputs = tokenizer("Hello, my dog is cute", return_tensors="tf")
outputs = model(inputs)

# Turn the raw logits into per-class probabilities.
probabilities = tf.math.softmax(outputs.logits, axis=-1)
print(probabilities)
The probabilities are a tensor of shape (batch_size, num_labels), containing the probabilities per class for every example in the batch.
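So to gauge which of two toxic sentences is more toxic, you can tokenize both sentences as a single batch, take the softmax probability of the toxic class for each, and compare the two scores. Below is a minimal sketch; the checkpoint path "./toxicity-model" and the assumption that label index 1 corresponds to the toxic class are placeholders you would replace with your own fine-tuned model and label mapping.

from transformers import DistilBertTokenizer, TFDistilBertForSequenceClassification
import tensorflow as tf

# Hypothetical path to the fine-tuned checkpoint and index of the "toxic" label.
model_dir = "./toxicity-model"
toxic_label_index = 1

tokenizer = DistilBertTokenizer.from_pretrained(model_dir)
model = TFDistilBertForSequenceClassification.from_pretrained(model_dir)

sentences = ["first toxic comment", "second toxic comment"]
inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="tf")
outputs = model(inputs)

# Probability of the toxic class for each sentence in the batch.
probabilities = tf.math.softmax(outputs.logits, axis=-1)
toxic_scores = probabilities[:, toxic_label_index].numpy()

# The sentence with the higher toxic-class probability is treated as "more toxic".
more_toxic = sentences[int(toxic_scores.argmax())]
print(toxic_scores, more_toxic)

Keep in mind the model was trained for a binary decision, so the class probability is only a rough proxy for how toxic a sentence is, but it does give you an ordering between two sentences.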