Prepare data to fine-tune T5 model on unsupervised objective

Adi-0-0-Gupta · June 10, 2021, 12:35pm

Hi, I couldn’t find a way to fine-tune the T5 model on a dataset in a specific domain (let’s say medical domain) using the unsupervised objective. Does the current version of Huggingface support this? Basically, all I need is to prepare the dataset to train the T5 model on the unsupervised objective, which could itself be very tricky. Any pointer on this is highly appreciated. P.S: I am looking for something in PyTorch and not Tensorflow. @valhalla @clem

naumov-al · September 27, 2021, 1:27pm

Hi! Have you found a solution? I also can’t figure out how to fine-tune the pretrained model (mT5) on unlabeled domain specific data using Transformer library. Thank you.

pierreguillou · November 3, 2021, 10:53pm

Hello @Adi-0-0-Gupta and @naumov-al.

There is an overview of the training task of Language Modeling for T5 in the T5 page on the Hugging Face site at Unsupervised denoising training.

And you will get scripts for training as said in this text from Hugging Face:

If you’re interested in pre-training T5 on a new corpus, check out the run_t5_mlm_flax.py script in the Examples directory.

To train the T5 tokenizer vocab on your specific domain, this script should help: t5_tokenizer_model.py

Source: Example scripts in the T5 page

pre-training: the run_t5_mlm_flax.py script allows you to further pre-train T5 or pre-train T5 from scratch on your own data. The t5_tokenizer_model.py script allows you to further train a T5 tokenizer or train a T5 Tokenizer from scratch on your own data. Note that Flax (a neural network library on top of JAX) is particularly useful to train on TPU hardware.

Topic		Replies	Views
Training T5 on mlm task from scratch 🤗Transformers	4	3264	July 29, 2022
No Improvement in Results after Implementing Unsupervised Denoising Training Technique for T5 Model using Hugging Face Models	0	120	April 25, 2024
Finetuning T5 for Summarisation - Poor results Intermediate	1	530	April 28, 2024
Example of how to pretrain T5? 🤗Transformers	15	16007	March 16, 2023
Question on HuggingFace's T5 documenation 🤗Transformers	0	320	May 18, 2023

Prepare data to fine-tune T5 model on unsupervised objective

Related topics