Hi,
Yes that’s the right approach. You can take a look at Spacy’s offsets_to_biluo_tags
method. It’s great to convert character index-level annotations to token annotations (in BILOU-format, which is a bit more exotic than IOB).
Hi,
Yes that’s the right approach. You can take a look at Spacy’s offsets_to_biluo_tags
method. It’s great to convert character index-level annotations to token annotations (in BILOU-format, which is a bit more exotic than IOB).