Hi All, in this paper:
DISCO-10M: A Large-Scale Music Dataset
The authors state the dataset will be hosted on Hugging Face datasets starting June 2023.
I’ve searched but cannot find it. Does anyone know where it is hosted?
Many thanks!
Michael
Did you end up finding it?
Hi, unfortunately this dataset has been taken down.
Ref: This Reddit comment thread
The closest dataset for the use cases that DISCO-10M enabled seems to be LAION-DISCO-12M (currently available on Hugging Face!).
The features of LAION-DISCO-12M are:
['song_id', 'title', 'artist_names', 'artist_ids', 'album_name', 'album_id', 'isExplicit', 'views', 'duration']
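In case it helps, here is a minimal sketch for checking that a record carries the columns listed above. The column names come from the list in this post; the helper names (`missing_features`, `load_disco_12m_stream`), the repo id `laion/LAION-DISCO-12M`, and the `train` split are my assumptions, so please verify them on the Hub before relying on this.

```python
from typing import Any, Dict, List

# Column names as listed in this post.
EXPECTED_FEATURES = [
    "song_id", "title", "artist_names", "artist_ids",
    "album_name", "album_id", "isExplicit", "views", "duration",
]


def missing_features(record: Dict[str, Any]) -> List[str]:
    """Return the expected column names that are absent from a record."""
    return [name for name in EXPECTED_FEATURES if name not in record]


def load_disco_12m_stream(repo_id: str = "laion/LAION-DISCO-12M"):
    """Stream the dataset from the Hugging Face Hub without a full download.

    ASSUMPTION: the repo id and the "train" split name are guesses and
    should be checked against the dataset page.
    """
    # Imported lazily so the helper above works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train", streaming=True)
```

With `datasets` installed, something like `missing_features(next(iter(load_disco_12m_stream())))` should return an empty list if the columns match the list above. As far as the listed columns show, the dataset holds metadata rather than audio or embeddings.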
Any links to pre-computed CLAP embeddings for this dataset would be much appreciated!