I am trying to compare the inference time of different Hugging Face models at different batch sizes, but I'm not sure whether I should use per_device_train_batch_size, per_device_eval_batch_size, both, or some other method. What is the correct way to do this with the Trainer API?