@lewtun @valhalla @nielsr @patrickvonplaten I am planning to pretrain a multilingual T5 small and/or medium model from scratch. I came across this post and the Hugging Face implementation of T5. My question is: can I use the same pretraining script from T5, just replacing T5Config with MT5Config? Would this work?
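To make the question concrete, this is roughly what I had in mind: keep the existing T5 pretraining script and only swap the config/model classes, starting from randomly initialized weights. The tokenizer path below is just a placeholder for a SentencePiece tokenizer trained on my own corpus (mT5's own tokenizer could also be reused).

```python
from transformers import AutoTokenizer, MT5Config, MT5ForConditionalGeneration

# Placeholder: a tokenizer trained on the multilingual corpus and saved locally
tokenizer = AutoTokenizer.from_pretrained("./my-mt5-tokenizer")

# Reuse the mt5-small architecture hyperparameters, but adjust the vocab size
# to the custom tokenizer; no pretrained weights are loaded this way.
config = MT5Config.from_pretrained("google/mt5-small", vocab_size=len(tokenizer))
model = MT5ForConditionalGeneration(config)  # randomly initialized, not from_pretrained

print(f"Parameters: {model.num_parameters():,}")
```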
Also, how should the dataset be arranged for multilingual pretraining? Should the languages be arranged sequentially, with all sequences of one language followed by the next (e.g. [French, German, Italian]), or should all the languages be randomly shuffled/interleaved? (See the sketch at the end of this post for what I mean by interleaving.)
For the record, I am planning to pretrain mT5 on Indian languages using the OSCAR corpus plus some additionally sourced text corpora.
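Regarding the data arrangement question, here is the kind of mixing I was thinking of. This is only a rough sketch of my assumptions, not a confirmed recipe: the language codes are examples, and the temperature-based sampling with alpha = 0.3 follows what the mT5 paper describes, implemented here via `datasets.interleave_datasets`.

```python
from datasets import load_dataset, interleave_datasets

# Example OSCAR subsets; config names follow "unshuffled_deduplicated_<lang>"
langs = ["hi", "ta", "te"]
raw = [load_dataset("oscar", f"unshuffled_deduplicated_{l}", split="train") for l in langs]

# Temperature-based sampling (alpha = 0.3, as in the mT5 paper) so that
# low-resource languages are up-sampled rather than drowned out.
sizes = [ds.num_rows for ds in raw]
alpha = 0.3
weights = [n ** alpha for n in sizes]
probs = [w / sum(weights) for w in weights]

# Draw examples randomly across languages according to these probabilities
mixed = interleave_datasets(raw, probabilities=probs, seed=42)
```

With `probabilities` set, examples are drawn randomly across languages according to the weights, which corresponds to the shuffled option rather than the strictly sequential one.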