I don’t see many diffusion models that take multiple images as input…
Perhaps IP-Adapter or ControlNet…