The documentation in the link above leads me to believe that scores is just a “processed” version of logits. This raises the question: how exactly are these logits processed?
I took a sample of these scores myself, and they look no different in meaning to logits. They don’t look like probabilities since some of them are clearly negative or above 1.0.
tensor([[-7.5898, -5.9922, 18.5625, ..., -8.4844, -4.7539, -4.6758]],
device='cuda:0'))
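To make the "not probabilities" observation concrete, here is a minimal sanity check in plain Python. It is only a sketch: it uses just the six values visible in the printout above (the elided "..." entries are not included, so the resulting probabilities are illustrative and not the model's real distribution), and the helper names are my own. It shows that the raw values fail a basic probability test, while a softmax over them passes it, which is how unnormalized logits would behave.

```python
import math

# Only the six values visible in the printed tensor. The elided "..." entries
# are NOT included, so the numbers below are illustrative only.
visible_scores = [-7.5898, -5.9922, 18.5625, -8.4844, -4.7539, -4.6758]


def softmax(values):
    """Numerically stable softmax: subtract the max before exponentiating."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def looks_like_probabilities(values, tol=1e-6):
    """True if every value lies in [0, 1] and the values sum to 1."""
    in_range = all(0.0 <= v <= 1.0 for v in values)
    return in_range and abs(sum(values) - 1.0) <= tol


probs = softmax(visible_scores)

print(looks_like_probabilities(visible_scores))  # raw values as printed
print(looks_like_probabilities(probs))           # same values after softmax
print(probs.index(max(probs)))                   # position of the largest score
```

If the real scores behave the same way over the full vocabulary, that would be consistent with them being logits that have been modified but not yet normalized, though this check alone cannot say what the processing step actually does.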
Can someone please explain to me what these scores really are?