How to make single-input inference faster? Create my own pipeline?

+1 to what @Narsil said. I was just going to suggest the golden rule of optimizing: measure first! Time how long each piece of code takes, over a few runs and a few different configurations (input lengths, how many inputs you’re predicting at once, that kind of thing). Then you’ll know where you can best focus your efforts — see the rough sketch below for what I mean.
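
Here’s a minimal timing sketch, just as an illustration and not a definitive benchmark: I’m assuming a text-classification pipeline, and the model name, the `time_call` helper, and the input lengths are all placeholders — swap in whatever you’re actually running.

```python
import time
from transformers import pipeline

# Placeholder task/model -- replace with the pipeline you actually use.
pipe = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

def time_call(fn, *args, runs=10, warmup=2, **kwargs):
    """Time a callable over several runs, ignoring a couple of warmup calls."""
    for _ in range(warmup):
        fn(*args, **kwargs)
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        fn(*args, **kwargs)
        timings.append(time.perf_counter() - start)
    return min(timings), sum(timings) / len(timings)

# Try a few input lengths to see how latency scales with your real workload.
for n_words in (8, 64, 256):
    text = "hello " * n_words
    best, mean = time_call(pipe, text)
    print(f"{n_words:4d} words: best {best * 1000:.1f} ms, mean {mean * 1000:.1f} ms")
```

Even something this crude will usually tell you whether tokenization, the forward pass, or something else entirely is eating your time.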

I totally get that it’s annoying to measure. I also often drag my feet before doing this. But I’m always glad I did. Otherwise, you might spend a bunch of effort speeding up one part a tiny bit, when the bottleneck is actually somewhere else!