Generation Probabilities: How to compute probabilities of output scores for GPT2

Looks like the pull request is here: https://github.com/huggingface/transformers/pull/14654 and is implemented in transformers v4.16.0