Hey there,
so I’ve been stuck the entire day and can not find anything to help me. Maybe I am blind, but I am also completely new to this (ML and Python that is)
Basically I have made a dataset that looks like this:
2ndBestKiller/DrugTest · Datasets at Hugging Face (this is exactly mine)
Now I wanted to do a NER Tutorial from the HF page, this one here:
For the most part, my dataset looks exactly like theirs, except for the ClassLabels in my ner_labels column. There should be references to the corresponding IOB entities, but alas there is nothing.
Here is what I am talking about:
ner_feature = raw_datasets[“train”].features[“ner_tags”]
when you print “ner_feature” is should look like this:
Sequence(feature=ClassLabel(num_classes=9, names=[‘O’, ‘B-PER’, ‘I-PER’, ‘B-ORG’, ‘I-ORG’, ‘B-LOC’, ‘I-LOC’, ‘B-MISC’, ‘I-MISC’], names_file=None, id=None), length=-1, id=None)
But mine looks like this:
Sequence(feature=Value(dtype=‘int64’, id=None), length=-1, id=None)
And I can not figure out how to set this up. Non of the tutorials or documentation seem to talk about this, but again, maybe I am just blind.
Can anyone help me (and please make it easy to understand since my knowledge in this area is very limited)