LM example run_mlm.py: memory usage is inconsistent across GPUs

When I run run_mlm.py, the memory usage of GPU 0 is twice that of the other GPUs.
Is this normal?
Thanks for any advice!
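For reference, my launch command looks roughly like this (the model name, dataset, batch size, and output path below are placeholders, not my exact values):

```bash
python run_mlm.py \
    --model_name_or_path bert-base-uncased \
    --dataset_name wikitext \
    --dataset_config_name wikitext-2-raw-v1 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 8 \
    --fp16 \
    --output_dir ./mlm-output
```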

Could it be because I used fp16?

I would also like to know what to do about this.