Which interface and model for my consumer product customization❓

Guys,
I’m building a tool letting users customize a base img of a product (mug, cap,…) with their choice of colored tones and basic design (eg: ‘pink flower’ here / brand logo there / tagline there,…). Prompt requests sent from client side to server via API.
Doesn’t seem too heavy an img2img operation.
Any kind soul can recommend the ideal interface: A1111, Vlad, Comfy, InvokeAI,…?
And which model? Do I need to train one?

Thanks all.