Using TRL on TPU

pere · February 7, 2025, 12:58pm

Does anyone know if it is possible to use TRL on TPU. I am particulary interested in reinforcement learning and GRPO, as there currently does not seem to be any jax-alternatives out there.

steve-rogers56 · February 11, 2025, 2:13pm

TRL (Transformers Reinforcement Learning) primarily supports PyTorch, which can be challenging to run efficiently on TPUs. While JAX-based alternatives for GRPO are limited, you might try running TRL on TPU via PyTorch/XLA. However, native JAX support would require custom implementation.

Topic		Replies	Views
How to use TPU for BERT training Colab Beginners	1	954	July 30, 2022
How to run GPT Neo on TPU using transformer? 🤗Transformers	0	231	February 10, 2022
GRPO or PPO or some RL Research	1	53	May 19, 2025
TypeError: Cannot convert a MPS Tensor to float64 dtype as the MPS framework doesn't support float64. Please use float32 instead 🤗Transformers	2	8694	July 6, 2023
Trainer with Google Colab TPU? Beginners	0	650	April 25, 2022

Using TRL on TPU

Related topics