Japanese keyword audio dataset

ShashankR · July 4, 2023, 5:10am

Hello,

I was working on an Audio training model for keyword spotting.
Looking for a Japanese Keyword audio dataset.
Please guide me by providing reference links to the dataset.

Thank you

AkimfromParis · July 5, 2023, 9:45am

Hello ShashankR,

I doubt that you will find good-quality audio training in Japanese on Hugging Face. Japanese communication giants such as Softbank, NTT, and Docomo are keeping their datasets private.

Maybe you should ask around on Japanese websites such as:

Good luck!

ShashankR · July 5, 2023, 11:22am

Hello AkimfromParis,

Thank You for the quick reply. I will note that point.
But in Hugging Face there are Japanese audio datasets, but the datasets available are of conversions between two people or a monolog.
As per your guidance, I will check Japanese websites.

Thank you

jgetner · April 1, 2025, 8:48pm

Another option would be to use a language model capable of generating japanese to build a synthetic dataset in text which you can then convert to audio using text to speech models(caveat they would need to produce correct linguistic audio). This would allow for a high quality dataset that is targeted for your end goal. You can add diversity of audio by using different voice generators and add noise so your model can learn to generalize instead of copy.

Topic		Replies	Views
How to create a dataset for "audio-like" files for ASR Beginners	0	402	April 10, 2023
Wake word detection Research	6	181	April 5, 2025
Run on single local file rather than dataset Beginners	1	316	January 30, 2024
A service to translate datasets into other languages 🤗Datasets	1	860	June 6, 2023
Is there a data set on huggingface that has classified audio emotion? 🤗Datasets	1	28	March 13, 2025

Japanese keyword audio dataset

Related topics