Recommended Hardware for NER Pipeline Model

I'm trying to get started running the pretrained NER pipeline model (https://huggingface.co/transformers/task_summary.html#named-entity-recognition) on about 10 million instances of text. Would you recommend using a CPU or GPU?

Previously, I used spaCy: I broke my data up into batches of 1k and ran those batches on a C5 instance on AWS. That EC2 instance was compute-optimized and had 96 cores. I'm able to use any instance type on AWS. Thanks!

If you have 10M examples and access to a GPU, then definitely use the GPU. If you want fast inference on CPU for the NER pipeline, you could try onnx_transformers, which provides the same API as `pipeline` but leverages ONNX for accelerated inference.
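A minimal sketch of what this could look like with the `transformers` pipeline, picking the GPU when one is available and feeding the corpus in batches (the `batches` helper and the batch size of 1000 are my own assumptions, mirroring the spaCy setup described above; the onnx_transformers alternative is shown commented out since its API may differ by version):

```python
from transformers import pipeline

import torch


def batches(items, size=1000):
    """Yield fixed-size chunks of a list, so 10M texts aren't passed at once."""
    for i in range(0, len(items), size):
        yield items[i:i + size]


if __name__ == "__main__":
    # device=0 selects the first GPU; -1 falls back to CPU
    device = 0 if torch.cuda.is_available() else -1
    ner = pipeline("ner", device=device)

    # CPU alternative (hypothetical usage, check the onnx_transformers docs):
    # from onnx_transformers import pipeline
    # ner = pipeline("ner", onnx=True)

    texts = ["Hugging Face Inc. is a company based in New York City."]
    for batch in batches(texts, size=1000):
        results = ner(batch)
        print(results)
```

Running the pipeline per batch rather than per document amortizes the per-call overhead, which matters at the 10M scale.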