Script to prepare and load my own data into a Dataset

Can someone point me to a tutorial or documentation/API reference that shows how to load my own data into a Dataset so that I can use it for a text-classification task?

I’ve looked through the tutorials and examples, but they always use an already prepared standard dataset from the datasets package.

Thanks in advance!

Hi there! 🙂

The Local and remote files section of the documentation should help; the exact call depends on your data file format. For example, if you have just one CSV file, you can do:

from datasets import load_dataset
dataset = load_dataset('csv', data_files='my_file.csv')
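
For a text-classification task you will usually want separate train and test files. Here is a rough sketch of how that could look, assuming your raw data is a list of (text, label) pairs. The column names `text` and `label`, the toy examples, and the helper names `write_examples` and `load_splits` are all made up for illustration; only `load_dataset('csv', data_files=...)` comes from the library, and passing a dict to `data_files` maps split names to files.

```python
import csv
import tempfile
from pathlib import Path


def write_examples(path, examples):
    """Write (text, label) pairs to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        # Column names are an assumption; pick whatever suits your task.
        writer.writerow(["text", "label"])
        writer.writerows(examples)


def load_splits(train_path, test_path):
    """Load two CSV files as the 'train' and 'test' splits."""
    # Imported here so the CSV-writing helper above also runs
    # on a machine where the datasets library is not installed.
    from datasets import load_dataset

    return load_dataset(
        "csv",
        data_files={"train": str(train_path), "test": str(test_path)},
    )


# Hypothetical toy data: (text, integer label) pairs.
train_examples = [("great movie", 1), ("terrible plot", 0)]
test_examples = [("not bad at all", 1)]

# Write both files into a temporary directory.
data_dir = Path(tempfile.mkdtemp())
train_path = data_dir / "train.csv"
test_path = data_dir / "test.csv"
write_examples(train_path, train_examples)
write_examples(test_path, test_examples)
```

With the datasets library installed, `dataset = load_splits(train_path, test_path)` should then give you an object with a `train` and a `test` split, and you can look at single rows with something like `dataset["train"][0]`.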