Is there a reason why the new ModernBERT does not support the question answering task?
@tomaarsen Maybe you know?
Hello!
ModernBERT may not support question answering out of the box because of its design and intended use cases. Many newer BERT variants, ModernBERT included, are optimized for, or primarily evaluated on, tasks like classification, retrieval, or token tagging.
Question answering (QA) typically requires fine-tuning the model on datasets labeled specifically for span extraction (e.g., SQuAD), which may not have been a primary focus of ModernBERT's release. If you're interested in using ModernBERT for QA, you could potentially fine-tune it on a QA dataset yourself, or look at other BERT variants like RoBERTa or DistilBERT, which have strong QA support thanks to well-established fine-tuning recipes and checkpoints.
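For context, extractive QA fine-tuning adds a small head that predicts start/end token positions over the context, and at inference time the two logit vectors are decoded back into a span. A minimal sketch of that decoding step (function name and toy numbers are illustrative, not from any particular library):

```python
def best_span(start_logits, end_logits, max_answer_len=30):
    """Pick the (start, end) token pair with the highest combined score,
    subject to start <= end and a maximum answer length."""
    best = (0, 0)
    best_score = float("-inf")
    for s, s_logit in enumerate(start_logits):
        for e in range(s, min(s + max_answer_len, len(end_logits))):
            score = s_logit + end_logits[e]
            if score > best_score:
                best_score = score
                best = (s, e)
    return best

# Toy logits over a 6-token context: tokens 2..3 should win.
start = [0.1, 0.2, 5.0, 0.3, 0.1, 0.0]
end = [0.0, 0.1, 0.2, 4.0, 0.3, 0.1]
print(best_span(start, end))  # → (2, 3)
```

The head that produces these logits is the same for any encoder, which is why fine-tuning mostly comes down to having the right `*ForQuestionAnswering` wrapper class available.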
I hope that helps clarify things!
Hi!
I’m talking about fine-tuning it on a QA dataset.
But the class `ModernBertForQuestionAnswering`, which is required for that fine-tuning, simply does not exist.
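Until such a class ships, one workaround is to put a small custom head on top of the base encoder yourself. A minimal sketch, assuming `torch` is installed and that the ModernBERT backbone returns `last_hidden_state` like other `transformers` encoders (the class name and shapes here are my own, not an official API):

```python
import torch
import torch.nn as nn


class QAHead(nn.Module):
    """Maps encoder hidden states to start/end logits, mirroring what a
    *ForQuestionAnswering class typically does internally."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.qa_outputs = nn.Linear(hidden_size, 2)  # 2 = start, end

    def forward(self, hidden_states: torch.Tensor):
        logits = self.qa_outputs(hidden_states)           # [batch, seq, 2]
        start_logits, end_logits = logits.unbind(dim=-1)  # each [batch, seq]
        return start_logits, end_logits


# With a real backbone this would look roughly like:
#   backbone = AutoModel.from_pretrained("answerdotai/ModernBERT-base")
#   hidden = backbone(**inputs).last_hidden_state
# Here we just check shapes with random hidden states:
head = QAHead(hidden_size=768)
hidden = torch.randn(2, 16, 768)  # [batch=2, seq=16, hidden=768]
start, end = head(hidden)
print(start.shape, end.shape)
```

Training then means computing cross-entropy between these logits and the gold start/end positions, exactly as the existing `BertForQuestionAnswering` does.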