DISCO-10M DOI: 10.57967/hf/0754 not found

Hi All, in this paper:
DISCO-10M: A Large-Scale Music Dataset

The authors state the dataset will be hosted on Hugging Face datasets starting June 2023.
I’ve search but cannot find it. Does anyone know where to find it?

Many thanks!
Michael

Did you end up finding it?

Hi, unfortunately this dataset has been taken down :frowning:
Ref: This reddit comment thread

It seems like the closest dataset for use-cases enabled by DISCO-10M is LAION-DISCO-12M (currently available on Huggingface!).

The features of LAION-DISCO-12M are

['song_id', 'title', 'artist_names', 'artist_ids', 'album_name', 'album_id', 'isExplicit', 'views', 'duration']

Any links about pre-computed CLAP embeddings for this dataset are much appreciated!

1 Like