Pipeline very slow

I just trained a BertForSequenceClassification classifier, but I ran into problems when trying to predict.

When I use the trainer's predict method on encodings I precomputed, I can obtain predictions for ~350 test-set samples in under 20 seconds.

trainer.predict(test_encodings)

However, when I load the model from storage and use a pipeline, the code runs for more than 10 minutes, and even adding batching doesn't seem to help.

classifier = pipeline("text-classification", model=model, tokenizer=tokenizer, max_length=512, truncation=True)
classifier(X_test.Text.to_list(), batch_size=10)
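For scale, the gap between the two code paths comes down to batched versus per-sample work: trainer.predict pushes the whole test set through the model in large batches, while a pipeline with a small batch_size pays per-call overhead for nearly every sample. A toy sketch of that difference (pure NumPy, hypothetical BERT-base-like shapes, not the actual transformers code):

```python
import numpy as np

# Hypothetical shapes roughly matching BERT-base with 2 labels and
# the ~350 test samples mentioned above.
rng = np.random.default_rng(0)
n_samples, hidden, n_labels = 350, 768, 2
features = rng.standard_normal((n_samples, hidden))   # stand-in for pooled encodings
weights = rng.standard_normal((hidden, n_labels))     # stand-in for the classifier head

# Per-sample "pipeline-style" loop: one small matmul per input,
# paying Python/framework overhead 350 times.
loop_logits = np.stack([x @ weights for x in features])

# Batched "trainer.predict-style" call: one big matmul for all inputs.
batch_logits = features @ weights

# Both paths compute the same logits; only the overhead differs.
assert np.allclose(loop_logits, batch_logits)
```

The outputs are identical; the per-sample loop just repeats fixed overhead for every input, which is why batching (and keeping the model on the same device the Trainer used) matters so much at inference time.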

What can explain this difference?


Can't offer any solutions, but I'm experiencing a similar problem: I trained a model on the Hugging Face Hub, loaded it locally with the from_pretrained method to make predictions, and I'm averaging 6 s per observation, which is not something I could ever use in production.

This SO post reports basically the same issue with from_pretrained…