Unique text generation per-GPU

enzokro · June 9, 2022, 2:35am

Hello,

I had a quick question about properly generating text on multiple GPUs.
Right now I am generating texts by splitting GPT-j-6B across a few nodes.

Usually I set a fixed seed for my jobs. When I do that, the generated output text from each job is identical so it only makes sense to save one of them. I believe that is done in the last step of this deepspeed example: deepspeed for GPT-Neo-2.7B

But when I don’t set an explicit seed each job gets different RNG settings. Thanks to this I instead get as many unique text outputs as there are GPUs.

Is not setting the shared seed a valid way to “cheat” and increase text generation throughput? Or should seeds be set for another reason (i.e. something in synced_gpus=True or deepspeed needs it).

Apologies if this has been asked before I could not find it via keywords.

Thank you!

Topic		Replies	Views
Generate text on multiple GPU 🤗Transformers	2	1301	May 10, 2021
Best practices for improving text generation speed? Beginners	0	2756	April 30, 2022
I have a question about multi-GPU inference DeepSpeed	0	1518	March 9, 2023
[deepspeed] bigscience/T0* multi-gpu text generation Intermediate	0	475	September 8, 2022
Same seed across different gpus in multiple workers Intermediate	0	274	March 8, 2024

Unique text generation per-GPU

Related topics