I have a custom dataset and I trained a DistilBERT model on it, and now I would like to run inference. In my inference .py file, I load the model with the default DistilBERT weights from the Hub. Then, if my fine-tuned weights are present in the directory, I load them on top of the model (this should simply overwrite the default weights, and I get no errors). Finally, I instantiate the Trainer and call its predict function so I can work with batches the same way as during training.
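For context, here is a minimal sketch of the "load defaults, then overwrite with fine-tuned weights" pattern I mean, using a toy torch module instead of DistilBERT (the file name is illustrative, not my actual code):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-in for the model initialised from the default Hub weights.
model = nn.Linear(4, 2)

# Stand-in for the fine-tuned checkpoint saved after training.
finetuned = nn.Linear(4, 2)
torch.save(finetuned.state_dict(), "finetuned_weights.pt")

# load_state_dict replaces the default parameters in place;
# strict=True raises if any key is missing, so no error means a clean match.
state = torch.load("finetuned_weights.pt")
model.load_state_dict(state, strict=True)

overwritten = torch.equal(model.weight, finetuned.weight)
```

With `strict=True`, silence really does mean every parameter was overwritten, which matches the "no errors" behavior I see.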
Now I have a very large dataset, and inference takes a long time. But I noticed something interesting:
A subset of 5,000 samples is inferred in 36 seconds with batch size 64
A subset of 50,000 samples is inferred in 12.36 minutes with batch size 64
So that's roughly a 21x time increase for a 10x larger dataset. Does anyone know why?
I’m running this on a simple laptop with an RTX 3060 GPU.
This behavior seems to be correlated with the type of dataset: I tested another dataset of the same size and it was 8 times faster. So I suspect it's something related to the dataset itself. Maybe the tokenizer? The complexity of the sentences in the other dataset? I'm using a pre-trained fast tokenizer.
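One plausible mechanism for this (an assumption on my part, not something I've profiled): batches are padded to the longest sequence they contain, and self-attention cost grows roughly quadratically with that padded length, so a dataset with longer or more variable sentences does strictly more work per batch even at the same sample count. A toy cost model with made-up sequence lengths illustrates the effect:

```python
import random

random.seed(0)

def padded_attention_cost(lengths, batch_size=64):
    """Rough proxy: each batch costs batch_size * (padded max length)**2."""
    cost = 0
    for i in range(0, len(lengths), batch_size):
        batch = lengths[i:i + batch_size]
        cost += len(batch) * max(batch) ** 2
    return cost

# Two hypothetical datasets of identical size but different sentence lengths.
short_docs = [random.randint(10, 40) for _ in range(5000)]
long_docs = [random.randint(100, 400) for _ in range(5000)]

ratio = padded_attention_cost(long_docs) / padded_attention_cost(short_docs)
```

If something like this is the cause, sorting the dataset by length before batching (so each batch pads to a similar length) would reduce wasted computation.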