I want to use the input_id level scores provided by model.generate(return_scores=True) with the decode function of the Tokenizer.
The logit scores are input_id level and the decoder decodes to token level.
My desired output for a prompt of “hello, how are you” would be "doing (p=0.8) today (p=0.8 * 0.7) sir (p=0.8 * 0.7 * 0.7) for example.
How can I achieve this?