How to fine-tune the mT5 model for a QA task?

NLPHeb · July 28, 2023, 5:11pm

Hello everyone!

I want to fine-tune mT5 (specifically mT5-small) for a question-answering (QA) task.
I have downloaded the data in my language, and I now have:
train_questions, train_contexts, train_answers.

I do not know which tokenizer to use or how to use it, and I do not know how to train the model on my dataset.
I tried Google and GPT-4 with no luck.
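For context, here is a minimal sketch of how the three lists could be turned into text-to-text pairs, since mT5 is a sequence-to-sequence model that maps an input string to an output string. The function name `to_text_pairs` and the `question: ... context: ...` layout are assumptions for illustration (the layout is borrowed from how T5 formats SQuAD; mT5 was not pretrained with it, so any consistent format should work). It also assumes each entry in train_answers is a plain string.

```python
def to_text_pairs(questions, contexts, answers):
    """Build (input, target) strings for a text-to-text QA model.

    Hypothetical helper: assumes three parallel lists of plain strings.
    """
    if not (len(questions) == len(contexts) == len(answers)):
        raise ValueError("questions, contexts and answers must have the same length")

    # The input carries both the question and the passage it refers to.
    inputs = [
        f"question: {q.strip()} context: {c.strip()}"
        for q, c in zip(questions, contexts)
    ]
    # The target is simply the answer text the model should generate.
    targets = [a.strip() for a in answers]
    return inputs, targets
```

Calling `to_text_pairs(train_questions, train_contexts, train_answers)` would give two lists of strings that a tokenizer can then encode.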

My first attempt was to build a new class like:

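import torch.nn as nn
from transformers import MT5Model

class mT5(nn.Module):
  def __init__(self):
    super().__init__()
    self.mT5 = MT5Model.from_pretrained("google/mt5-small")
    self.hidden_size = self.mT5.config.hidden_size

  def forward(self):
    # unfinished: not sure what inputs to take or what to compute here
    raise NotImplementedError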

But I was stuck here too.
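For reference, a wrapper class may not be needed at all: `MT5ForConditionalGeneration` in transformers already includes the language-modeling head and returns a loss when `labels` are passed. Below is a rough sketch of a single training step under stated assumptions: `inputs` and `targets` are lists of strings (for example "question: ... context: ..." and the answer text), the helper names `mask_pad_labels` and `train_one_step` are hypothetical, the learning rate and maximum lengths are arbitrary placeholders, and the `sentencepiece` package is installed for the tokenizer. It is not a tuned recipe.

```python
def mask_pad_labels(label_ids, pad_token_id):
    """Replace padding ids with -100 so the loss ignores those positions."""
    return [[-100 if t == pad_token_id else t for t in row] for row in label_ids]


def train_one_step(inputs, targets, model_name="google/mt5-small"):
    # Imported here so the helper above works without torch installed.
    import torch
    from transformers import AutoTokenizer, MT5ForConditionalGeneration

    # The tokenizer that ships with the checkpoint is the one to use.
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = MT5ForConditionalGeneration.from_pretrained(model_name)
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)  # placeholder lr

    # Encode the source strings as padded tensors.
    enc = tokenizer(inputs, max_length=512, truncation=True,
                    padding=True, return_tensors="pt")
    # Encode the answers as targets; padding=True makes rows equal length.
    tgt = tokenizer(text_target=targets, max_length=32, truncation=True,
                    padding=True)
    labels = torch.tensor(mask_pad_labels(tgt["input_ids"], tokenizer.pad_token_id))

    model.train()
    # With labels given, the model builds decoder inputs and computes the loss.
    out = model(input_ids=enc["input_ids"],
                attention_mask=enc["attention_mask"],
                labels=labels)
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return out.loss.item()
```

In practice this step would run inside a loop over mini-batches for several epochs, and answers would be produced afterwards with `model.generate(...)` followed by `tokenizer.batch_decode(...)`.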

I would very much appreciate a good explanation of this!

Thanks a lot!
