Trying to download the ROOTS corpus (bigscience-data (BigScience Data)).
Anybody knows if this the entire ROOTS corpus? I am asking because somewhere in the paper it is said that “we release a large subset of the ROOTS”. So trying to find if this is the large subset or all of it.
Also maybe I am missing something, but i cannot find an easy way to download all the data. Do I have to go through the sections one by one and download each file individually? Or there is a one-click way to do so. Thanks.