Speed up translation model

Altabus · September 28, 2023, 5:54am

Hi there!

I’m using a translation model (say, a wmt model), but it translates only about 20 sentences per second on CUDA, and that’s too slow for me. I need to translate 1 million sentences per day, or get as close to that as possible. Can I speed up my code somehow without upgrading the server?
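To put the numbers above side by side, here is a rough back-of-the-envelope sketch. It assumes the 20 sentences per second rate could be sustained for the whole time window, which is an assumption and not something measured; the function names and the `hours_available` parameter are made up for illustration.

```python
SECONDS_PER_HOUR = 60 * 60
SECONDS_PER_DAY = 24 * SECONDS_PER_HOUR  # 86,400


def sentences_per_day(rate_per_second: float) -> float:
    """Daily volume if the given rate is sustained around the clock (assumption)."""
    return rate_per_second * SECONDS_PER_DAY


def required_rate(target_per_day: float, hours_available: float = 24.0) -> float:
    """Sentences per second needed to hit the target in the available hours."""
    return target_per_day / (hours_available * SECONDS_PER_HOUR)


if __name__ == "__main__":
    # Volume at the observed rate, if it held for a full 24 hours.
    print(sentences_per_day(20))                 # 1728000
    # Rate needed for 1 million sentences with 24 hours available.
    print(round(required_rate(1_000_000), 2))    # 11.57
    # Rate needed if only 8 hours per day are available (hypothetical window).
    print(round(required_rate(1_000_000, 8), 2)) # 34.72
```

So the gap depends heavily on how many hours per day the GPU can actually spend translating and on whether the measured rate holds for real inputs.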

Also, I’d like to use my own model, so I’d like to know whether these methods would work for it too. If it’s easier to speed up existing models, that’s fine as well.

I’m thinking about running the translation with threading or multiprocessing, but I don’t understand them well, so any tips would be very useful.

Also, is translator_model(text) thread safe?
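For concreteness, this is roughly the kind of setup being asked about, as a minimal sketch only. `fake_translate_batch` is a hypothetical stand-in for the real model call (it just upper-cases text), and `batched` and `translate_all` are illustrative names. Since it is not established here whether `translator_model(text)` is thread safe, the sketch takes the conservative route and guards the call with a lock.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterator, List, Sequence


def batched(items: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most batch_size items."""
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def fake_translate_batch(batch: List[str]) -> List[str]:
    """Hypothetical stand-in for the real model; it only upper-cases the text."""
    return [sentence.upper() for sentence in batch]


def translate_all(
    sentences: Sequence[str],
    translate_batch: Callable[[List[str]], List[str]],
    batch_size: int = 32,
    workers: int = 4,
) -> List[str]:
    """Translate sentences in batches from a thread pool, one model call at a time."""
    lock = threading.Lock()  # conservative guard: thread safety of the model is unknown

    def run(batch: List[str]) -> List[str]:
        with lock:
            return translate_batch(batch)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in input order.
        results = pool.map(run, batched(sentences, batch_size))
        return [text for batch in results for text in batch]


if __name__ == "__main__":
    print(translate_all(["hello", "good morning", "thanks"], fake_translate_batch, batch_size=2))
```

With the lock in place the threads do not run the model in parallel, so any gain in this sketch would have to come from sending whole batches per call instead of single sentences.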
