Correct Wav2Vec2 ASR output

I have fine-tuned Wav2Vec2-960h on my custom Italian speech dataset. At the inference, I have some transcription errors. Can I fix them using a LLM? Any idea or practical example?

Thanks in advance,

Davide