Hi! Which version of `datasets` are you using? We’ve made some improvements in the latest release (2.8.0) to optimize decoding, so use this version for the best performance.
Also, unlike `select` (which creates an indices mapping), `filter` writes a new dataset to disk/memory, which can take some time for larger datasets (in return, you get benefits such as faster indexing).