Researching ways to speed up WhisperAI startup

orielsy · January 20, 2024, 9:26am

Seasoned Front End Dev with background in C++. New to Python, Docker, WhisperAI. Using an RTX-3080 and Ryzen 5800X with 32GB or RAM for development.

Got a Docker image/container of WhisperX going that I can tap for transcriptions.[WhisperX Docker Images]

It works really well but there’s something I’m curious about. Whisper takes about 10 seconds to start transcribing each time I send over an audio file (using the medium model).

Any way to mitigate those 10 seconds? Via Docker? Via built in functionality of Whisper/WhisperX?

Wish there was a way to keep those startup scripts spun up so that the next Whisper request didn’t have to start from zero.

Screenshot of Whisper’s output when initiating a transcription. It’s this startup process that I would like to know if it can be mitigated/reduced.

Topic		Replies	Views
Help about Whisper chunk_length Beginners	1	189	February 15, 2025
Duration of audio sequence ingested by Whisper Inference Endpoints on the Hub	2	1681	January 17, 2023
Fine tuning whisper on custom dataset Beginners	3	934	January 11, 2024
Disable timestamps for Whisper Beginners	1	2665	May 26, 2024
Whisper fine tuning on custom audio data Beginners	4	2744	February 15, 2025

Researching ways to speed up WhisperAI startup

Related topics