Whisper on long audio files -- support for chunking?

The transformer library supports chunking (concatenation of multiple segments) for transcribing long audio files with Wav2Vec2, as described here: Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers

The OpenAI repository contains code for chunking with Whisper: whisper/transcribe.py at main · openai/whisper · GitHub

Is chunking with Whisper supported in the transformers library?

1 Like

Is there any update for this issue?

2 Likes

any update on this? @sanchit-gandhi please help

Indeed there is! See openai/whisper-large-v2 · Hugging Face