Why are TensorFlow models way slower than PyTorch models for autoregressive generation?

Hi,

I was experimenting with many models, including GPT-2, T5, etc., but it seems the TensorFlow models are much slower than their PyTorch counterparts for the same type of generation, whether greedy, beam search, etc.

Any specific reasons for this?

Thanks

@patrickvonplaten @sgugger For simple greedy decoding with GPT-2 small, TensorFlow takes 7 seconds, while PyTorch takes less than a second.


@lysandre - Can anyone shed some light on this? That would be great.

Are you measuring epoch time or total runtime? I have no direct experience with TensorFlow, but I remember that setting up the graph can take quite a bit of time on TF. You should probably time only the steps themselves, not the setup.
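(A minimal sketch of that timing approach, assuming the standard transformers TF GPT-2 API: run one warm-up call so the one-time graph/tracing cost is excluded, then time only the generation itself.)

```python
import time
from transformers import GPT2Tokenizer, TFGPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = TFGPT2LMHeadModel.from_pretrained("gpt2")
input_ids = tokenizer("Hello, my dog is", return_tensors="tf").input_ids

# Warm-up: the first call absorbs any one-time graph construction cost.
model.generate(input_ids, max_length=50)

# Now time only the decoding itself.
start = time.perf_counter()
model.generate(input_ids, max_length=50)
print(f"generation took {time.perf_counter() - start:.2f}s")
```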


I mean inference time only. Here is the snippet. Something is wrong.
@thomwolf - Any thoughts? Thanks…
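(For reference, a minimal sketch of the kind of side-by-side comparison in question, assuming GPT-2 small, greedy decoding, and the standard transformers APIs; this is not the exact original snippet.)

```python
import time
import torch
from transformers import GPT2Tokenizer, GPT2LMHeadModel, TFGPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
prompt = "The quick brown fox"

# PyTorch: model and inputs moved to GPU.
pt_model = GPT2LMHeadModel.from_pretrained("gpt2").to("cuda")
pt_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda")
start = time.perf_counter()
pt_model.generate(pt_ids, max_length=50)
print(f"PyTorch greedy: {time.perf_counter() - start:.2f}s")

# TensorFlow: same prompt, same greedy decoding.
tf_model = TFGPT2LMHeadModel.from_pretrained("gpt2")
tf_ids = tokenizer(prompt, return_tensors="tf").input_ids
start = time.perf_counter()
tf_model.generate(tf_ids, max_length=50)
print(f"TensorFlow greedy: {time.perf_counter() - start:.2f}s")
```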

Maybe @jplu has some insights!


I think it is because PyTorch is more awesome :hugs:

Just had a look at the example code; maybe the .to('cuda') call makes things much faster :thinking:

The .to('cuda') call is there; I added it when I initialised the model. :slight_smile:
I expected a technical answer though: why is TF slower for generation?

@huggingface team, does this mean the TensorFlow implementations are not suitable for production, since the latency is higher? Can we conclude that?

Hello!

The reason is that, for now, the TF models are not optimized for NLG, including the generate function, and we don't recommend using them for that task in production. This is something we are working on, but we cannot give you a specific date.


I agree, there is a lot more to do to optimize it, especially on the caching side (reusing past key/values across decoding steps), as sketched below.
It would be great if there were a warning when using the TF generate. Thanks for the most valuable reply here, @jplu. :slight_smile:
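(To illustrate the caching point: a rough PyTorch sketch of how past key/values are reused so that each decoding step only runs the newest token through the model. This uses the standard transformers interface as an illustration; it is not the library's actual generate implementation.)

```python
import torch
from transformers import GPT2Tokenizer, GPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

input_ids = tokenizer("The quick brown fox", return_tensors="pt").input_ids
generated = input_ids
past_key_values = None
for _ in range(20):
    with torch.no_grad():
        out = model(input_ids, past_key_values=past_key_values, use_cache=True)
    next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
    generated = torch.cat([generated, next_token], dim=-1)
    past_key_values = out.past_key_values  # reuse cached keys/values next step
    input_ids = next_token                 # only the new token is fed in
print(tokenizer.decode(generated[0]))
```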

@jplu: Calling .generate on TF models is still a lot slower (up to 80x) compared to PyTorch. Are the TF models now optimized for NLG? Am I missing something here?