T5 Finetuning Tips

Since this is super old thread I opened a follow up here Finetuning mT5 for specific language pair