Hi, I am looking for a help with my test dataset. I plan to upload large dataset on Hugging Face but first I wanted to experiment and learn how to stream audio data. I have a problem with Dataset Preview. I cannot get the audio probes to listen in preview. My dataset will consist of audio in mp3 format packed in tar.gz file and a metadata in .tsv file format with audio name and some description. Can you please help me and maybe tell what I did wrong in my dataloader test.py · j-krzywdziak/test at main that I cannot preview the audio correctly. Any tip will be helpful, because this is my first time with Hugging Face and I am still a little bit unsure about how it works. Unfortunately documentation did not help. Thanks in advance!
cc @severo do you know what could be the issue ?
The dataset script is correct and the dataset is streamable, but it seems to return one mp3 and one wav for the preview for some reason
@lhoestq @j-krzywdziak hey! this is weird but I tried to change config name from “Test Dataset” to “test-dataset” (to exclude space) and it worked (I cloned the repo, you can check here: polinaeterna/test-user · Datasets at Hugging Face)
1 Like
I created an issue: Support all the characters in dataset, config and split · Issue #853 · huggingface/datasets-server · GitHub
1 Like