uploaded in the .tar format, following the WebDataset conventions.
When we check the dataset on our local server, it contains 80k samples. However, the Hugging Face platform currently shows a total of only 59.1k samples.
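For reference, here is a minimal sketch of how the local count could be reproduced, assuming the shards sit in one directory and follow the usual WebDataset grouping rule (files that share the same path up to the first dot of the file name form one sample). The function names and the `*.tar` pattern are hypothetical and only for illustration; the script uses nothing beyond the Python standard library.

```python
import re
import tarfile
from pathlib import Path

# WebDataset-style key: everything up to the first dot of the file name,
# including any directory prefix (e.g. "dir/000.seg.png" -> "dir/000").
_KEY_RE = re.compile(r"^((?:.*/)?[^/.]+)\.[^/]*$")


def sample_keys(tar_path):
    """Return the set of distinct sample keys found in one .tar shard."""
    keys = set()
    with tarfile.open(tar_path) as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories, links, etc.
            match = _KEY_RE.match(member.name)
            if match:
                keys.add(match.group(1))
    return keys


def count_samples(shard_dir, pattern="*.tar"):
    """Sum the per-shard sample counts for every shard in shard_dir.

    Assumption: a key is not repeated across shards; if it is, it is
    counted once per shard.
    """
    shards = sorted(Path(shard_dir).glob(pattern))
    return sum(len(sample_keys(shard)) for shard in shards)
```

Calling `count_samples` on the local shard directory gives a number that can be compared directly with the count shown on the Hub. Printing `len(sample_keys(shard))` for each shard individually may also help to narrow down which shards account for the difference.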