Can a diffuser pipeline run on multiple GPUs?

Hey, we have this sample using the Instruct-pix2pix diffuser. We observe that inference is faster on a multi-GPU instance than on a single-GPU instance. Is pipe.to("cuda:" + gpu_id) running the pipeline on multiple GPUs? If not, what explains the speedup on a multi-GPU machine compared with a single-GPU machine?
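
For reference, here is a minimal sketch of how the two instances could be timed on equal footing. The helper names (`time_call`, `cuda_device`) and their parameters (`warmup`, `runs`, `sync`) are hypothetical and not part of Diffusers. The sketch assumes warm-up calls should be excluded and that a synchronization callable such as `torch.cuda.synchronize` is passed in, so that queued GPU work finishes before the clock is read.

```python
import statistics
import time
from typing import Callable, Optional


def cuda_device(gpu_id) -> str:
    """Build a device string such as "cuda:0" (hypothetical helper).

    An f-string accepts an int or a str, whereas "cuda:" + gpu_id
    raises TypeError when gpu_id is an int.
    """
    return f"cuda:{gpu_id}"


def time_call(fn: Callable[[], object], warmup: int = 1, runs: int = 5,
              sync: Optional[Callable[[], None]] = None) -> float:
    """Return the median wall-clock seconds of fn() over `runs` timed calls."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    for _ in range(warmup):
        fn()  # warm-up calls are not timed
    if sync is not None:
        sync()  # make sure warm-up work has finished
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        if sync is not None:
            sync()  # wait for queued GPU work before stopping the clock
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


# Assumed usage with an already-loaded pipeline `pipe`; `prompt` and `image`
# are placeholders for the sample's own inputs:
#
#   import torch
#   pipe.to(cuda_device(0))
#   seconds = time_call(lambda: pipe(prompt, image=image),
#                       sync=torch.cuda.synchronize)
```

A device string like "cuda:0" names a single device, so running the same measurement on both instances with the same gpu_id should show whether the difference tracks the GPU count or something else about the machines (GPU model, CPU, disk).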