I want to fix all metadata and loading for my audio datasets (like
comodoro/vystadial2016_asr), but cannot find specific documentation for audio datasets, in particular how to load into an
librispeech_asr/blob/main/librispeech_asr.py, but audio data there is only referenced in feature metadata and
_generate_examples. I tried to emulate it at least, but I got stuck at these lines with
[Errno 2] No such file or directory: 'data_voip_cs_2016'; I do not know what exactly the download manager returns and how to access it, not even looking at the source.
- Is there any more documentation?
- Can I debug the script locally instead of adding a commit for every fix attempt?
- I would also like to add a loading script for some json (HF hosted) audio datasets like datasets/comodoro/pscr, is that even possible?