uploaded in the .tar format following the webdataset conventions.
Upon checking the dataset on our local server, it clearly contains 80k samples. However, the Hugging Face platform is currently displaying totally 59.1k samples.
uploaded in the .tar format following the webdataset conventions.
Upon checking the dataset on our local server, it clearly contains 80k samples. However, the Hugging Face platform is currently displaying totally 59.1k samples.