The transformers library supports chunking (splitting a long recording into segments and concatenating the per-segment transcriptions) for transcribing long audio files with Wav2Vec2, as described here: Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers
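To make clear what I mean by chunking, here is a minimal sketch of the general idea in plain Python: cut the audio into overlapping windows, then keep only the non-overlapping middle of each window so the kept parts cover the recording exactly once. The function names (`chunk_with_stride`, `kept_ranges`) and parameters are made up for illustration; this is not the code from transformers or from the OpenAI repository.

```python
def chunk_with_stride(num_samples, chunk_len, stride_left, stride_right):
    """Split [0, num_samples) into overlapping windows.

    Returns a list of (start, end, left, right) tuples, where `left` and
    `right` are the number of samples at each edge of the window that only
    serve as context and whose predictions would be discarded.
    Hypothetical helper, for illustration only.
    """
    step = chunk_len - stride_left - stride_right
    if step <= 0:
        raise ValueError("chunk_len must exceed stride_left + stride_right")

    chunks = []
    start = 0
    while start < num_samples:
        end = min(start + chunk_len, num_samples)
        is_last = end >= num_samples
        # The first window has no left context and the last has no right context.
        left = 0 if start == 0 else stride_left
        right = 0 if is_last else stride_right
        chunks.append((start, end, left, right))
        if is_last:
            break
        start += step
    return chunks


def kept_ranges(chunks):
    """Return the (start, end) sample range kept from each window."""
    return [(s + left, e - right) for s, e, left, right in chunks]


if __name__ == "__main__":
    windows = chunk_with_stride(100, chunk_len=40, stride_left=5, stride_right=5)
    print(windows)
    print(kept_ranges(windows))
```

The transcriptions of the kept ranges would then be concatenated in order. Whether Whisper's own chunking works the same way is exactly what I am unsure about.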
The OpenAI repository contains code for chunking with Whisper: whisper/transcribe.py at main · openai/whisper · GitHub
Is chunking with Whisper supported in the transformers library?