I am new to the datasets library.
I have a quick question: does the datasets library provide any out-of-the-box alternatives to common pandas functions such as value_counts, groupby, mean, etc.? Essentially, anything that requires an operation over whole columns rather than individual rows.
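To make concrete what I mean, here is a minimal pandas sketch of the kind of column-wise operations I am after. The toy columns `label` and `score` and their values are made up purely for illustration; they are not from any real dataset.

```python
import pandas as pd

# Toy data; the column names and values are hypothetical.
df = pd.DataFrame(
    {
        "label": ["pos", "neg", "pos", "pos"],
        "score": [0.9, 0.2, 0.7, 0.8],
    }
)

# Frequency of each distinct value in a column.
counts = df["label"].value_counts()

# Aggregate over an entire column.
mean_score = df["score"].mean()

# Aggregate one column per group of another column.
group_means = df.groupby("label")["score"].mean()

print(counts)
print(mean_score)
print(group_means)
```

Each of these needs to see a whole column (or groups within it) at once, which is what I can't find an equivalent for.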
A quick search via Google/ChatGPT doesn't reveal a straightforward solution. I also couldn't find one in the Hugging Face documentation: map, select, and filter all operate row-wise.
If there is no native way to do this in datasets, is it because these operations are yet to be incorporated, or because they can't be done due to fundamental limitations of how datasets is built, i.e. trading versatile functions for speed/performance?