my organisation in the trouble my whole business is based on STT I need more accurate stt seamless m4t is not able to convert any audio fully I have little bit of noises audios. I have testing platform for student who is preparing for various exam in that I need to take answer of given question by student in audio form and for evaluating their answer I need to fully accurate text of those audio so I can analyse their grammar, mistakes so here I have used whisper in frontend but problem is whisper is doing auto correction and sometime stocking on one word and repeating again and again. I have used web speech api as well but it get stuck in between. I have huge amount of transcription thing in a month approx 80000 hours/ month. Can hugging face make some help . I want any solution in frontend can be helpful I just want whatever user speak wrong or right same in text without autocorrection or filler sententence and without sticking. Any input will be very helpful.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Text to Speech Alignment with Transformers | 2 | 5217 | April 20, 2022 | |
AI to improve voice | 10 | 209 | April 10, 2025 | |
Can Wav2Vec2 distinguish music during speech-to-text? | 1 | 338 | August 27, 2023 | |
How to match locserver performance with Hugging face V3 | 0 | 30 | October 22, 2024 | |
Approach for Creating a Real-Time Speech-to-Speech Model with Emotions, Laughter, and Crying—aka "The Perfect Voice Changer" | 1 | 143 | February 24, 2025 |