Hi all,
Excuse me, I am a newbie. My problem is about the finetuning of ASR models (especially facebook/s2t-small-librispeech-asr · Hugging Face) on my custom dataset consisting of my recording. Where can I find a piece of example for that?
Thanks in advance,
Davide