Hi! I’m working/creating a vision dataset of medical images (cytology) and will release a new version in ~3 months. But i need to publish the “official” dataset at mendeley data because it’s an academic work. So, i want to know if i also can share this dataset at hugging faces datasets? (By part of mendelay data, i know that doesn’t have a problem)
Hi ! yes definitely
It’s actually pretty common for researchers to host their data elsewhere than Hugging Face, and still, it’s possible to have a Hugging Face dataset that downloads from this other host (mendeley for example)
For example web_of_science is hosted on mendeley (see the download defined in the dataset script here: web_of_science.py · web_of_science at main) and you can load it in python with load_dataset
1 Like
Ok, thanks for the answer, and also for the example!