I am trying you r code but i get an error at line " logits.append(scores.gather(dim=2, index=class_ids.T.expand(len(chunk), -1, -1))) can you tell me why you use gather? and my text is not divided into chunks so how can I GET THE PROBABILITIES DIRECTLY.Can you help me with this please