Generation Probabilities: How to compute probabilities of output scores for GPT2

@patrickvonplaten

Could you elaborate on why you chose gen_probs.prod(-1) as your method of obtaining a unique probability per sequence? Why not use gen_probs.mean(-1) to get the average probability score per sequence?
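
To make the comparison concrete, here is a minimal sketch of what the two reductions compute. It assumes gen_probs holds the conditional probability of each generated token given the tokens before it; the numbers are made up, plain Python lists stand in for the tensor, and the helper names (seq_prod, seq_mean, seq_geo_mean) are hypothetical. Under that assumption the product is the joint probability of the whole sequence by the chain rule, while the arithmetic mean is an average per-token score that is not itself a sequence probability. A length-normalized alternative is the geometric mean, which is the exponential of the mean log-probability.

```python
import math

# Hypothetical per-token probabilities for two generated sequences
# (rows = sequences, columns = generated tokens). Values are invented.
gen_probs = [
    [0.9, 0.8, 0.5],
    [0.6, 0.7, 0.7],
]

def seq_prod(probs):
    """Joint probability of the sequence: product of per-token probabilities."""
    return math.prod(probs)

def seq_mean(probs):
    """Arithmetic mean of per-token probabilities (not a sequence probability)."""
    return sum(probs) / len(probs)

def seq_geo_mean(probs):
    """Geometric mean: exp of the average log-probability (length-normalized)."""
    return math.exp(sum(math.log(p) for p in probs) / len(probs))

for i, probs in enumerate(gen_probs):
    print(
        f"sequence {i}: prod={seq_prod(probs):.4f} "
        f"mean={seq_mean(probs):.4f} geo_mean={seq_geo_mean(probs):.4f}"
    )
```

In this toy example the first sequence has the higher product and the higher mean, but the two scores need not rank sequences the same way in general, and the product shrinks as sequences get longer, which is why I am asking which one is intended.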