@patrickvonplaten @sgugger For a simple greedy decoding in gpt2 small tensorflow is taking 7 seconds, while pytorch is taking less than a second.
1 Like
@patrickvonplaten @sgugger For a simple greedy decoding in gpt2 small tensorflow is taking 7 seconds, while pytorch is taking less than a second.