Pekka10
1
Can anyone help me how to split the dataset into training and test like the way it was defined earlier-:
# load dataset
train_dataset, test_dataset = load_dataset(dataset_name, split=[‘train’, ‘test’])
This is not working and i am using b-mc2/sql-create-context dataset and need to split it into train and test.
Hi! b-mc2/sql-create-context
has a single train
split (built from a single JSON file), so you need to use Dataset.train_test_split
to achieve that:
train_dataset = load_dataset("b-mc2/sql-create-context ", split="train")
train_test_dataset = train_dataset.train_test_split(test_size=0.2)
train_dataset, test_dataset = train_test_dataset["train"], train_test_dataset["test"]