Thank you, I just found the solution:
output = model.generate(**inputs, max_new_tokens=40, return_dict_in_generate=True, output_scores=True)
sequence = output.sequences
scores = output.scores # logits of the generated tokens
Thank you, I just found the solution:
output = model.generate(**inputs, max_new_tokens=40, return_dict_in_generate=True, output_scores=True)
sequence = output.sequences
scores = output.scores # logits of the generated tokens