Example of how to pretrain T5?

nielsr · September 21, 2021, 9:45am

T5 pre-training is now supported in JAX/FLAX. You can check out the example script here: transformers/examples/flax/language-modeling at master · huggingface/transformers · GitHub. It actually includes 2 scripts:

t5_tokenizer_model.py, to train a T5 tokenizer (i.e. SentencePiece) from scratch.
run_t5_mlm_flax.py, to pre-train T5. It’s suited to run on TPUs (for which you can obtain access for free by applying to Google’s TFRC program).

@patrickvonplaten also demonstrates how to run the script in this video (starts around 13:35).

This script was developed for the JAX/FLAX community event. It would be really cool if someone contributes the PyTorch version of it. It would mean translating the script from FLAX to PyTorch, which is probably straightforward.

Topic		Replies	Views
How is T5 pretrained? 🤗Transformers	3	509	July 12, 2021
Prepare data to fine-tune T5 model on unsupervised objective 🤗Transformers	2	3927	November 3, 2021
PreTrain T5 from scratch in Bengali Flax/JAX Projects	5	2205	July 26, 2022
Pre-training googlebyt5small 🤗Transformers	0	228	October 26, 2022
No Improvement in Results after Implementing Unsupervised Denoising Training Technique for T5 Model using Hugging Face Models	0	119	April 25, 2024

Example of how to pretrain T5?

Related topics