Sharding depends on the dataset: one shard corresponds to one file.
Sharding is quite important for making use of num_workers in data loaders, so feel free to use datasets that are already sharded, or shard a dataset yourself, e.g. using:
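from datasets import load_dataset
# Load the dataset you want to re-shard
ds = load_dataset(...)
# Upload it to the Hub, split into the requested number of shards
ds.push_to_hub(repo_id, num_shards=...)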
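
To illustrate why the number of shards matters for num_workers, here is a simplified model in plain Python. It is a sketch, not the library's actual implementation: it assumes each worker reads whole shard files and that shards are handed out round-robin. The helper `assign_shards` and the file names are hypothetical.

```python
def assign_shards(shard_files, num_workers):
    """Split shard files across workers round-robin.

    Illustrative model only: real data loaders may distribute
    shards differently.
    """
    if num_workers < 1:
        raise ValueError("num_workers must be at least 1")
    # Worker w takes every num_workers-th shard, starting at index w.
    return [shard_files[w::num_workers] for w in range(num_workers)]


# Hypothetical shard file names, for illustration only.
shards = [f"shard-{i}.parquet" for i in range(4)]

# With 4 shards and 2 workers, each worker gets 2 files.
print(assign_shards(shards, 2))

# With a single shard and 2 workers, the second worker gets nothing.
print(assign_shards(shards[:1], 2))
```

Under this model, a dataset stored as a single file leaves any extra workers idle, which is why picking num_shards at least as large as the intended num_workers is a reasonable rule of thumb.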