New pipeline for zero-shot text classification

Hi @valhalla, thanks for developing onnx_transformers. I have tried it with the zero-shot-classification pipeline and benchmarked ONNX against plain PyTorch, following the benchmark_pipelines notebook. I tried several SageMaker instances with various numbers of cores and CPU types. It seems that instances with more CPU cores give more speed-up, but those instances are also more expensive, and at a certain point the price is almost the same as using a GPU.
I wonder if there are other ways to speed things up while keeping the cost minimal. I found that quantization may help, but it seems that onnx_transformers doesn't support ONNX quantization yet. Do you have plans to support it? Could you kindly point me to a reference for using ONNX quantization with the zero-shot-classification pipeline (with or without onnx_transformers)?
Thanks in advance!