I’m trying to build an Automatic Speech Recognition model for Indian English ( accents, dialect, etc.). I have around 15 hours of labeled data.
The trained model outputs blank for every file in the test set and I don’t know where it is going wrong.
Any help would be much appreciated. Is anyone else attempting this?