Thanks. I did as you suggested, but the training loop is still making very slow progress.
OLD:
epoch_loss = 0.0
for i, batch in enumerate(dl):
    loss = loss_fn(yhat, y)
    loss.backward()
    epoch_loss += loss.item()
return epoch_loss / len(dl)
NEW:
epoch_loss = 0.0
for i, batch in enumerate(dl):
    loss = loss_fn(yhat, y)
    loss.backward()
    epoch_loss += loss.detach()  # <-- NEW: accumulate as a tensor on-device instead of calling .item() every batch
return epoch_loss.item() / len(dl)  # <-- NEW: single device-to-host sync at the end of the epoch
A single batch still takes a very long time to complete. I suspect it's running on the CPU rather than the TPU, even though I think I followed all the XLA setup steps correctly. If this turns out to be outside Transformers' domain, I'll go bug the XLA folks.
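
For reference, this is the quick check I plan to run to confirm which hardware the tensors actually land on. It's just a sketch: I'm assuming my model object is called model and that batches unpack as (x, y), and xm.xla_device_hw is the helper I remember from torch_xla, so the exact call may differ across versions:

import torch_xla.core.xla_model as xm

device = xm.xla_device()
print("XLA device:", device)                              # e.g. xla:0 or xla:1
print("Underlying hardware:", xm.xla_device_hw(device))   # expect 'TPU', not 'CPU'

# Where do the model parameters live? They should report the xla device.
print("Model params on:", next(model.parameters()).device)

# Where does a batch end up after the transfer I do in the loop?
x, y = next(iter(dl))
print("Batch lands on:", x.to(device).device)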