Hello. Sentiment analysis models such as BERTweet give two outputs: a classification and a number. Is that number a measure of confidence in the classification, or is it a probability?
Also, do I need to do any sort of tokenizing or preprocessing before feeding these models text? Or does transformers.pipeline do some of this preprocessing for me?
Thanks.