Trying to follow the instructions for training an Roberta-base mlm-model, as described here: huggingface/transformers master/examples/flax/language-modeling 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX. Everyt…

Flax - core dump when starting training

Norod78 August 4, 2021, 4:41pm 5

I had this problem too. What helped me was

Also, if it’s relevant, I’ve wrote down this list of things I did which allowed me to train directly on the TPU VM:

Topic		Replies	Views
Ideas for beginner-friendlier TPU-VM clm training Beginners	3	1311	July 28, 2021
BERT pre-training run_mlm_flax.py questions Beginners	0	253	November 3, 2021
PreTrain RoBERTa from scratch in Portuguese Flax/JAX Projects	16	2408	October 4, 2021
I'm making ROBERTA dumber, and I don't know why Beginners	1	341	March 8, 2021
Accelerate / TPU with bigger models: process 0 terminated with signal SIGKILL 🤗Accelerate	2	3726	May 13, 2022