Hello,
I am having trouble with the ClassLabel features for Token Classification. I am working via Pandas data frame for my dataset. And I am loading the data frame with the dataset. I cannot see the 9 custom IOB labels inside ClassLabel.
df = pd.DataFrame(df)
dataset = Dataset.from_pandas(df)
dataset = dataset.train_test_split(test_size=0.1)
Output:
DatasetDict({
train: Dataset({
features: ['tokens', 'labels', 'id'],
num_rows: 10000
})
test: Dataset({
features: ['tokens', 'labels', 'id'],
num_rows: 1000
})
})
Output:
{'tokens': Value(dtype='string', id=None),
**'labels': Value(dtype='string', id=None),**
'id': Value(dtype='int64', id=None)}
I already tried the “cast” method → dataset.cast_column(“labels”
here … ClassLabel Error · Issue #5737 · huggingface/datasets · GitHub
And the “new_features” in the package reference.
here… Main classes
Thank you guys!