How does Accelerate ensure uniqueness of data samples across GPUs?

mdrpanwar · June 21, 2023, 1:55pm

Hi,

I have the following questions about the inner workings of Accelerate. If there is an existing document answering these, please link to it.

1. When using DDP, how does Accelerate sample data from the dataloader so that each data instance is used exactly once across all GPUs (i.e. the batches on different GPUs don’t share examples)? Does this behaviour change when loading from an IterableDataset?
2. When the same seed is passed to accelerate.utils.set_seed(), is reproducibility guaranteed for DDP training runs? That is, is the k-th batch on each GPU the same across different training runs?

I am using an IterableDataset for training in a DDP setting and want to ensure the reproducibility of the training runs. If it does not come out of the box with Accelerate, please guide me on how to achieve the same.

Thanks.

muellerzr · June 21, 2023, 1:59pm

1. Check out our sampler, which splits up the data as we grab it: https://github.com/huggingface/accelerate/blob/main/src/accelerate/data_loader.py (see BatchSamplerShard and IterableDatasetShard)

2. Yes.
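
To make the idea of "splitting up the data as we grab it" concrete, here is a minimal pure-Python sketch of the two sharding patterns. It is not Accelerate's code: the function names shard_batches and shard_iterable are hypothetical, and the sketch assumes whole batches are dealt out round-robin in the map-style case and that an iterable stream is cut into chunks of batch_size * num_processes in the iterable case. The real BatchSamplerShard and IterableDatasetShard also handle incomplete final batches and other options that are ignored here.

```python
from itertools import islice


def shard_batches(batches, num_processes, process_index):
    """Map-style sketch (hypothetical helper): deal whole batches round-robin."""
    for i, batch in enumerate(batches):
        if i % num_processes == process_index:
            yield batch


def shard_iterable(stream, batch_size, num_processes, process_index):
    """Iterable sketch (hypothetical helper): every process reads the same
    stream, cuts it into chunks of batch_size * num_processes, and keeps
    only its own contiguous slice of each chunk."""
    it = iter(stream)
    chunk_size = batch_size * num_processes
    while True:
        chunk = list(islice(it, chunk_size))
        if len(chunk) < chunk_size:
            return  # simplification: the incomplete tail is dropped
        start = process_index * batch_size
        yield chunk[start:start + batch_size]


data = list(range(12))
batches = [data[i:i + 2] for i in range(0, len(data), 2)]

# What each of 2 processes would see under each scheme.
per_rank_map = [list(shard_batches(batches, 2, rank)) for rank in range(2)]
per_rank_iter = [list(shard_iterable(data, 2, 2, rank)) for rank in range(2)]

for rank in range(2):
    print(rank, per_rank_map[rank], per_rank_iter[rank])
```

In both cases every process walks the same underlying order and keeps a disjoint part of it, which is why no example appears on two GPUs within an epoch.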

mdrpanwar · June 21, 2023, 2:20pm

Thanks for your reply and for sharing the reference. To confirm my understanding, let me provide an example.

Assume two setups:
(1) DDP on 2 GPUs, and a per_gpu_batch_size of 16
(2) Training on a single GPU with a batch size of 32

It seems that what the Accelerate samplers do for setup (1) is equivalent to setup (2): the sampler just splits the data across GPUs, so the model is trained on the same examples in both setups. Is that correct?

However, I tried these two setups and the DDP case gives a higher loss than the single GPU setup. What could be the reason for that?
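
The data-level part of this comparison can be checked with a small sketch. It assumes both setups draw the same shuffled order (same seed, same sampler) and that batches of 16 are dealt alternately to the two GPUs; global_order and toy_loss are hypothetical stand-ins, not Accelerate APIs. Under those assumptions, the two per-GPU batches at each step together form exactly the single-GPU batch of 32. Because DDP averages gradients across processes, the mean over two equally sized shards also matches the mean over the full batch.

```python
import random


def global_order(n, seed):
    """Hypothetical stand-in for a seeded shuffling sampler."""
    order = list(range(n))
    random.Random(seed).shuffle(order)
    return order


def toy_loss(x):
    """Hypothetical per-example loss, used only to compare averages."""
    return (x % 7) / 7.0


def mean(values):
    return sum(values) / len(values)


order = global_order(64, seed=0)

# Setup (2): one GPU, batch size 32.
single_gpu = [order[i:i + 32] for i in range(0, 64, 32)]

# Setup (1): batches of 16, dealt alternately to 2 GPUs.
small_batches = [order[i:i + 16] for i in range(0, 64, 16)]
gpu0 = small_batches[0::2]
gpu1 = small_batches[1::2]

# At every step, the two shards together are the single-GPU batch.
same_examples = all(
    gpu0[step] + gpu1[step] == big for step, big in enumerate(single_gpu)
)

# Mean of per-shard means equals the full-batch mean for equal shard sizes.
loss_single = mean([toy_loss(x) for x in single_gpu[0]])
loss_ddp = mean([
    mean([toy_loss(x) for x in gpu0[0]]),
    mean([toy_loss(x) for x in gpu1[0]]),
])

print(same_examples, abs(loss_single - loss_ddp) < 1e-12)
```

This only covers which examples are seen and how they are averaged. It does not rule out other differences between the two runs, so it does not by itself explain the loss gap described above.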
