I’m following the guidance on training a text-classification model with my own dataset,
with reference to notebooks/sagemaker-notebook.ipynb at master · huggingface/notebooks · GitHub.
I have two questions:
- Does the label column in the dataset only support integers? In other words, do I need to preprocess my data and convert the categories to 1, 2, 3…?
- Do I need to specify the number of classes? If so, where?
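
To make the questions concrete, here is a minimal sketch of the preprocessing I have in mind. The sample rows and the column names "text" and "label" are made up for illustration, and I'm assuming the ids should start at 0 rather than 1. The resulting num_labels is the class count I'm asking about in the second question; I don't know where the notebook's training script expects it.

```python
# Hypothetical sample rows; the column names "text" and "label" are assumptions.
rows = [
    {"text": "great product", "label": "positive"},
    {"text": "arrived broken", "label": "negative"},
    {"text": "it is okay", "label": "neutral"},
    {"text": "love it", "label": "positive"},
]

# Sort the distinct categories so the mapping is stable across runs.
categories = sorted({row["label"] for row in rows})

# Map each category name to an integer id (0-based, an assumption) and back.
label2id = {name: idx for idx, name in enumerate(categories)}
id2label = {idx: name for name, idx in label2id.items()}

# Replace each string category with its integer id.
encoded = [{"text": r["text"], "label": label2id[r["label"]]} for r in rows]

# Number of distinct classes found in the data.
num_labels = len(categories)
```

Keeping id2label around would let me turn predicted ids back into category names afterwards.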