Hugging Face inference API

romjansen · June 20, 2022, 5:27pm

Hi all!

Today I’ve published a model fine-tuned for token classification of legislation references to the Hugging Face Hub and set up a model card including a widget (romjansen/robbert-base-v2-NER-NL-legislation-refs · Hugging Face). Using Hugging Face’s inference API widget this model can be quickly tested on the provided examples.

However, the hosted inference API widget incorrectly presents the last token of a legislation reference as a seperate entity due to the workings of its ‘simple’ aggregation_strategy. While this model was fine-tuned on training data labelled in accordence with the BILOU scheme, the hosted inference API groups entities by merging B- and I- tags when the tag is similar (thereby omitting the L- tags). Does anybody know if I can adjust the aggregation_strategy to use the right tagging scheme?

Kind regards,

Rens

Topic		Replies	Views
TokenClassificationPipeline produce entities with "##" characters 🤗Transformers	6	25	May 19, 2025
HuggingFace Inference Endpoints: Pipeline Args Inference Endpoints on the Hub	5	581	January 22, 2024
NER tag , aggregation stratergy 🤗Tokenizers	2	7318	February 1, 2022
Output of 'bert-base-NER-uncased' is different when using website and different when used via python 🤗Transformers	1	504	November 10, 2021
Why is aggregation_strategy="simple" not combining subwords properly in Hugging Face token classification (DeBERTa fine-tuned model) Intermediate	0	19	April 22, 2025

Hugging Face inference API

Related topics