Tutorials for using Colab TPUs with Huggingface Transformers?

At first glance, you have loss.item() in your training loop, which you should absolutely avoid on TPUs (it's a big slowdown, since every call forces a device-to-host transfer). You should use loss.detach() to accumulate your losses on the TPU, then only call .item() at the very end of your epoch.
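
A minimal sketch of what that looks like, assuming `model`, `optimizer`, and `train_loader` are already set up (the names here are placeholders, not your actual code):

```python
import torch

def train_one_epoch(model, optimizer, train_loader, device):
    model.train()
    total_loss = torch.zeros(1, device=device)  # accumulator stays on the TPU
    num_batches = 0

    for batch in train_loader:
        batch = {k: v.to(device) for k, v in batch.items()}
        optimizer.zero_grad()
        outputs = model(**batch)
        loss = outputs.loss
        loss.backward()
        optimizer.step()

        # .detach() drops the autograd graph and keeps the value on device,
        # so there's no TPU -> host transfer every step (unlike .item()).
        total_loss += loss.detach()
        num_batches += 1

    # A single device-to-host sync, once per epoch.
    return (total_loss / num_batches).item()
```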