Hi team! I am trying to speed up inference with a diffusion model by running it across multiple processes.
Consider this (slightly modified) example from https://huggingface.co/docs/diffusers/en/training/distributed_inference:
```python
import torch
import datasets
from accelerate import PartialState
from diffusers import DiffusionPipeline

pipeline = DiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16, use_safetensors=True
)
distributed_state = PartialState()
pipeline.to(distributed_state.device)

ds = datasets.load_from_disk("/path/to/my/dataset")
with distributed_state.split_between_processes(ds) as proc_ds:
    proc_ds = proc_ds.map(lambda e: {
        "image": pipeline(e["prompt"]).images[0]
    })
```
How can I now combine the `proc_ds` shards from all processes back into one global dataset that I can continue working with on the main process? Can I just declare a list `results` outside of the `with` block and then, after the `map` inside the `with` block, do `results.append(proc_ds)`? Is that even safe, given that these are separate processes rather than threads? And how would that list ever end up in the main process?
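To make the question concrete, here is a sketch of what I have in mind. The `results` list, the `is_main_process` check, and the final `concatenate_datasets` call are just my guesses at how this might look, not something I know to be correct:

```python
results = []  # this script runs once per process, so every process gets its own copy of this list

with distributed_state.split_between_processes(ds) as proc_ds:
    proc_ds = proc_ds.map(lambda e: {
        "image": pipeline(e["prompt"]).images[0]
    })
    results.append(proc_ds)  # each process appends only its own shard

# On the main process I would then hope to merge the shards, e.g.:
if distributed_state.is_main_process:
    merged = datasets.concatenate_datasets(results)
    # ...but I don't see how the shards computed by the other
    # processes would ever show up in this process's list.
```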
Sorry if this is obvious. Thank you for your help!