TPU out of memory with Pix2StructForConditionalGeneration

I am trying to train the Pix2StructForConditionalGeneration model on a Google Colab TPU, but I run out of memory when training with a large patch size and a large output token length. Is it possible to parallelize the model across the cores of a single TPU device? (I am only testing on Colab for now, but I also have access to the Google TPU Research Cloud.)
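For context on why the patch budget hits memory so hard: Pix2Struct's encoder sequence length equals the number of image patches, and self-attention activations grow quadratically with sequence length. A rough back-of-envelope sketch (the head/layer counts below are assumptions loosely based on a base-sized model, not measured values):

```python
def attention_activation_floats(seq_len: int, num_heads: int = 12, num_layers: int = 12) -> int:
    # Each self-attention layer materializes a (num_heads, seq_len, seq_len)
    # score matrix, so activation memory grows quadratically with seq_len.
    return num_layers * num_heads * seq_len * seq_len

# Encoder sequence length corresponds to max_patches in Pix2Struct.
small = attention_activation_floats(512)   # modest patch budget
large = attention_activation_floats(2048)  # 4x the patches
print(large // small)  # -> 16: 4x patches costs ~16x attention memory
```

So quadrupling the patch count (or the decoder's output length, by the same argument) can multiply activation memory by roughly sixteen, which is why a budget that fits at small settings OOMs at large ones.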