Hello,
we plan on creating examples for model-parallel training that support many more model architectures and are more modular than the current example (transformers/examples/research_projects/jax-projects/model_parallel at main · huggingface/transformers · GitHub). We would like the outcome of this project to be a series of articles/blog posts that detail model-parallel and data-parallel training of HuggingFace transformers on TPUs/GPUs. It would be great if this project could be combined with other projects that aim to pretrain a large model.
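
To make the intended scope a bit more concrete, here is a rough sketch of combined data- and model-parallel sharding using JAX's `jax.sharding` API. The mesh layout, axis names (`"data"`, `"model"`), and tensor sizes are illustrative assumptions, not code from the existing example:

```python
# A minimal sketch, assuming JAX >= 0.4 with the jax.sharding API;
# axis names, mesh layout, and sizes are illustrative only.
import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

devices = np.array(jax.devices())
n = len(devices)
# Split devices into a 2D mesh: "data" axis for data parallelism,
# "model" axis for model (tensor) parallelism.
dp = 2 if n % 2 == 0 and n > 1 else 1
mesh = Mesh(devices.reshape(dp, n // dp), axis_names=("data", "model"))
dp_size, mp_size = mesh.shape["data"], mesh.shape["model"]

# Shard activations along the batch ("data") axis and the weight matrix
# along its output-feature ("model") axis; sizes are chosen to divide evenly.
x = jnp.ones((8 * dp_size, 512))        # (batch, hidden)
w = jnp.ones((512, 256 * mp_size))      # (hidden, ffn)
x = jax.device_put(x, NamedSharding(mesh, P("data", None)))
w = jax.device_put(w, NamedSharding(mesh, P(None, "model")))

@jax.jit
def forward(x, w):
    # Under jit, XLA's GSPMD partitioner propagates the input shardings
    # and inserts the required collectives automatically.
    return jnp.dot(x, w)

y = forward(x, w)
print(y.shape, y.sharding)  # sharded (batch, ffn) output
```

The articles could then walk through extending this kind of sharding pattern from a single matmul to full transformer blocks across different architectures.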