What's the best way to speed up inference on a large dataset?

Hi,

I have been trying to run inference with a model I’ve fine-tuned on a large dataset.
I’ve done it the way shown in the "Summary of the tasks" docs, iterating over all the questions and contexts one by one, but it’s too slow.

The approach below, from the course, seems to be quite OK, but I run into memory issues, presumably because the whole dataset is passed to the model as a single dict?

import torch
from transformers import AutoModelForQuestionAnswering

# The whole tokenized eval set goes to the device as one giant batch
batch = {k: eval_set_for_model[k].to(device) for k in eval_set_for_model.column_names}
trained_model = AutoModelForQuestionAnswering.from_pretrained(trained_checkpoint).to(
    device
)

# Single forward pass over the entire dataset
with torch.no_grad():
    outputs = trained_model(**batch)

Is there some way I can pass the dataset directly, like I would in Lightning, and iterate over the batches dynamically?
I.e. instead of building the batch manually as above, do something like

for batch in iter(dataset):
    pred = model(**batch)

?
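
Something like a plain PyTorch DataLoader is what I have in mind, e.g. this rough sketch (reusing eval_set_for_model / trained_model / device from above, and assuming the set only contains the model input columns in torch format; not sure if this is the recommended way with datasets):

import torch
from torch.utils.data import DataLoader
from transformers import default_data_collator

# Build batches on the fly instead of one giant dict
loader = DataLoader(eval_set_for_model, batch_size=32, collate_fn=default_data_collator)

start_logits, end_logits = [], []
with torch.no_grad():
    for batch in loader:
        batch = {k: v.to(device) for k, v in batch.items()}
        outputs = trained_model(**batch)
        # Move results back to CPU so GPU memory doesn't accumulate
        start_logits.append(outputs.start_logits.cpu())
        end_logits.append(outputs.end_logits.cpu())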

Thanks a lot in advance

You may find the discussion on pipeline batching useful. I think batching is usually only worth it when running on GPU. If you are doing inference on CPU, looking into ONNX might make sense (it’s probably only worth the effort if you are going to run inference multiple times – if it’s a one-time thing you might just prefer to wait a bit longer!)
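
Roughly, the pattern from that discussion looks like the sketch below (untested; the classification model and the "text" column are just placeholders, the point is setting batch_size and streaming the dataset through the pipeline):

from transformers import pipeline
from transformers.pipelines.pt_utils import KeyDataset

# batch_size groups examples into GPU batches; iterating over the pipeline
# output streams the dataset instead of materialising everything at once
pipe = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
    device=0,
    batch_size=32,
)

for out in pipe(KeyDataset(dataset, "text")):
    print(out)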


Thanks for the answer!
Yes, I’ve tried pipeline batching, but I can’t seem to feed the dataset into the pipeline directly. It works for classification etc.; just for QA I get asked to make a dict out of it.
It’s running on GPU.
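
From the error it sounds like each example has to be a dict, so I guess something like this is what’s expected (questions and contexts are placeholder lists here, and I haven’t verified that it actually batches on the GPU):

from transformers import pipeline

qa_pipe = pipeline("question-answering", model=trained_checkpoint, device=0, batch_size=16)

# One {"question": ..., "context": ...} dict per example
examples = [{"question": q, "context": c} for q, c in zip(questions, contexts)]
preds = qa_pipe(examples)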

I’m doing it for the Kaggle student NLP project, so I’m just up against the 9-hour inference limit. :sweat_smile:

Do you know how to do pipeline batching for QA?