I am creating a dataset for which splits make no semantic sense: it is an Information Retrieval corpus that should not be split. I currently load the entire corpus into the “train” split because that is what other dataset loading scripts do, and it works, but I would rather drop splits altogether for the corpus config.
Can I use strings other than the standard split names to name the splits? And can I leave out splits altogether for a given self.config.name?