Build a question answering system in your own language

lewtun · November 17, 2021, 11:43am

Hey @Endre cool to hear that you’re interested in this project!

Given that it might be time consuming to translate all of SQuAD into Hungarian or Romanian, it might make sense to first start by training a model on an existing dataset in one of those languages.

For example, there is the mqa dataset which is a different type of question answering called “community question answering”. It has subsets in both your languages and this way you can get a model trained / Space up and running faster than creating the dataset from scratch.

Community QA is more of a retrieval based approach and you can find an example of what it involves here with the haystack library (based on transformers).

Of course you’re welcome to create your own SQuAD dataset, but thought I should provide an alternative just in case

Topic		Replies	Views
RAG Class for Question Answering 🤗Transformers	0	446	October 22, 2020
Question answering using Large Language model Models	2	405	February 25, 2024
Creating t5 for language Beginners	0	243	April 9, 2022
Evaluate question answering with squad dataset Beginners	2	1327	October 10, 2021
Fine tune Albert, RoBERTa or ELECTRA on SQuAD2.0 and need a model Models	0	400	April 29, 2021

Build a question answering system in your own language

Related topics