TPU slow finetuning T5-base

GenV · February 11, 2022, 3:27pm

Hi,
I am trying to train a T5-base on Colab with the TPU. I am using the official code to perform a fine-tuning on the T5-base (with my dataset), but the training with TPU is extremely slow! I’m using the offical code.

I am attaching the colab code with the various libraries I have installed: notebook.

Also, if I try to increase the batch size as >= 64, I get a memory error, as there seems to be only about 8 Gb available.

Can someone help me? Thank you!

Topic		Replies	Views
T5 evaluation via Trainer `predict_with_generate` extremely slow on TPU? Beginners	1	773	November 2, 2023
TPU trainer with multi-core Intermediate	5	2195	April 21, 2022
Trainer with TPUs Beginners	3	2761	April 13, 2022
How to fine-tune T5-base model? Beginners	10	4584	July 28, 2021
Tutorials for using Colab TPUs with Huggingface Transformers? 🤗Transformers	16	20549	June 3, 2024

TPU slow finetuning T5-base

Related topics