Trouble in boosting ASR performance by adding LM

yilmazay · May 5, 2023, 12:03pm

Hi,
I built my own XLS-R ASR model by fine-tuning facebook/wav2vec2-xls-r-300m on about 150 hours of transcribed Turkish audio. I currently get about 18% WER.
I wanted to boost my model’s performance by adding an LM, following the steps described in the blog post "Boosting Wav2Vec2 with n-grams in 🤗 Transformers".
Although I followed the blog exactly, I cannot get any better performance.
In fact, the model with the LM performs worse than the model without it.
It seems to me that the LM is either not being applied or is not contributing anything positive to the output.
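To make the comparison concrete, here is a minimal sketch of how the two decoding setups could be scored on the same evaluation set. It is plain Python with no dependencies; the function name `word_error_rate` and the example transcripts are placeholders for illustration only, not my actual data or results.

```python
def word_error_rate(references, hypotheses):
    """Corpus-level WER: total word-level edits / total reference words."""
    total_edits = 0
    total_words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words = ref.split()
        hyp_words = hyp.split()
        # Levenshtein distance over words, keeping only one previous row.
        prev = list(range(len(hyp_words) + 1))
        for i, ref_word in enumerate(ref_words, start=1):
            curr = [i]
            for j, hyp_word in enumerate(hyp_words, start=1):
                cost = 0 if ref_word == hyp_word else 1
                curr.append(min(prev[j] + 1,          # deletion
                                curr[j - 1] + 1,      # insertion
                                prev[j - 1] + cost))  # substitution or match
            prev = curr
        total_edits += prev[-1]
        total_words += len(ref_words)
    return total_edits / total_words


# Placeholder transcripts (hypothetical, for illustration only).
references = ["bugün hava çok güzel", "yarın okula gidiyorum"]
greedy_output = ["bugün hava çok güzel", "yarın okul gidiyorum"]
lm_output = ["bugün hava güzel", "yarın okula gidiyor"]

wer_greedy = word_error_rate(references, greedy_output)
wer_lm = word_error_rate(references, lm_output)
print(f"WER without LM: {wer_greedy:.3f}")
print(f"WER with LM:    {wer_lm:.3f}")
```

Scoring both outputs against the same references, with the same text normalization applied to each, should show whether the LM really hurts or whether the gap comes from how the outputs are compared.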
Could I be missing something while adding LM to the ASR model?
I would appreciate any suggestions or guidance on this issue.
Thanks in advance.
