Whisper on long audio files -- support for chunking?

The transformer library supports chunking (concatenation of multiple segments) for transcribing long audio files with Wav2Vec2, as described here: Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

The OpenAI repository contains code for chunking with Whisper: whisper/transcribe.py at main · openai/whisper · GitHub

Is chunking with Whisper supported in the transformers library?


Is there any update for this issue?

1 Like