I have fine-tuned Wav2Vec2-960h on my custom Italian speech dataset. At inference time, I get some transcription errors. Can I fix them using an LLM? Any ideas or practical examples?
Thanks in advance,
Davide