Hello and welcome @laro1!
You can use the train_test_split()
function and specify the test_size
parameter to determine the size of the split. For example:
ds.train_test_split(test_size=0.3)
DatasetDict({
train: Dataset({
features: ['premise', 'hypothesis', 'label'],
num_rows: 525
})
test: Dataset({
features: ['premise', 'hypothesis', 'label'],
num_rows: 225
})
})
Check out the docs here and let me know if that helps!