Build a question answering system in your own language

OK I think a German version of SQuAD v1 already exists in the xquad dataset and there’s also a custom QA dataset called germanquad.

For French there’s a custom QA dataset called fquad. So for these languages, it would be really cool to focus on fine-tuning a German / French model one of these datasets or using a multilingual model like XLM-RoBERTa that can answer questions in both at once.

What do you think about doing something like that?