Hi everyone!
We’re building a Web3 customization tool and are currently working on an AI pipeline where the model should:
- Understand that the UI layout in the center (a wallet login screen) is not to be redrawn
- Generate only one object or character that interacts with the interface (e.g., leans toward a button, sits beside it, etc.)
- Return a transparent PNG, without adding background or modifying the UI
- Ideally support prompt + guide image + layout-awareness
What we already tried:
We created a full JSON representation of the wallet layout, including positions, button labels, sizes, safe zones, and colors.
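To give an idea of the shape, here is a simplified TypeScript sketch of such a layout. The type names, field names, and every value other than the Unlock button position are illustrative placeholders, not the real schema.

```typescript
// Hypothetical layout schema: names and most values are placeholders.
interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

interface UiElement {
  id: string;
  kind: "button" | "input" | "logo" | "text";
  label?: string;
  rect: Rect;
  color?: string; // CSS-style hex color
}

interface WalletLayout {
  canvas: { width: number; height: number };
  elements: UiElement[];
  // Regions where a generated character is allowed to appear.
  safeZones: Rect[];
}

const exampleLayout: WalletLayout = {
  canvas: { width: 1024, height: 1024 }, // placeholder canvas size
  elements: [
    {
      id: "unlock",
      kind: "button",
      label: "Unlock",
      // Only x and y come from the post; width/height are made up.
      rect: { x: 470, y: 490, width: 160, height: 48 },
      color: "#9988ff", // placeholder color
    },
  ],
  safeZones: [{ x: 0, y: 0, width: 300, height: 1024 }], // placeholder
};
```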
We also generate a guide image of the UI as a reference (a Phantom Wallet login mockup).
We built a promptBuilder.ts that merges:
- Hard-coded constraints ("Do not cover the interface", etc.)
- The layout as descriptive text ("Unlock button at x:470, y:490")
- The user prompt (e.g., “Pepe touches the unlock button”)
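The merge step looks roughly like the sketch below. This is a simplified illustration, not the actual promptBuilder.ts: the function names, the `LayoutElement` type, and the output format are assumptions, and only the first constraint string is taken from the description above.

```typescript
// Hypothetical minimal element description used for prompt text.
interface LayoutElement {
  label: string;
  x: number;
  y: number;
}

// "Do not cover the interface" is from the post; the others are
// illustrative stand-ins for the "etc.".
const HARD_CONSTRAINTS: string[] = [
  "Do not cover the interface",
  "Do not redraw the UI",
  "Transparent background",
];

// Turn layout elements into text like "Unlock button at x:470, y:490".
function describeLayout(elements: LayoutElement[]): string {
  return elements.map((e) => `${e.label} at x:${e.x}, y:${e.y}`).join("; ");
}

// Merge user prompt, layout description, and constraints into one prompt.
function buildPrompt(userPrompt: string, elements: LayoutElement[]): string {
  return [
    userPrompt.trim(),
    `Layout: ${describeLayout(elements)}`,
    `Constraints: ${HARD_CONSTRAINTS.join(". ")}.`,
  ].join("\n");
}

const prompt = buildPrompt("Pepe touches the unlock button", [
  { label: "Unlock button", x: 470, y: 490 },
]);
```

Note that everything here ends up as plain text in the prompt, so the coordinates are only a hint to the model and nothing enforces them.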
Then we tested:
- lucataco/sdxl-controlnet (now returns 404)
- stability-ai/stable-diffusion-xl-base-1.0 via the HuggingFace API
- IPAdapter in local pipelines
- ComfyUI to build manual graph workflows
Issues we face:
- Most models tend to redraw the UI layout, even when told not to
- A background often reappears (even when the prompt asks for transparency)
- Character generation isn’t aware of UI boundaries (like “don’t cover the Unlock button”)
- IPAdapter respects style, but lacks fine-grained interaction control
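The boundary problem can at least be detected after generation. Below is a minimal sketch, assuming the output is decoded into a flat RGBA byte array (4 bytes per pixel, row-major); `opaqueCoverage`, the `Rect` type, and the alpha threshold are hypothetical names and values chosen for illustration. It measures how much of a protected rectangle, such as the Unlock button, is covered by non-transparent pixels.

```typescript
interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Fraction (0..1) of pixels inside `rect` whose alpha exceeds the threshold.
// `rgba` is assumed to be row-major RGBA, 4 bytes per pixel.
function opaqueCoverage(
  rgba: Uint8Array,
  imgW: number,
  imgH: number,
  rect: Rect,
  alphaThreshold = 8, // assumed cutoff for "visibly opaque"
): number {
  // Clip the rectangle to the image bounds.
  const x0 = Math.max(0, Math.floor(rect.x));
  const y0 = Math.max(0, Math.floor(rect.y));
  const x1 = Math.min(imgW, Math.ceil(rect.x + rect.width));
  const y1 = Math.min(imgH, Math.ceil(rect.y + rect.height));
  const total = Math.max(0, x1 - x0) * Math.max(0, y1 - y0);
  if (total === 0) return 0;

  let opaque = 0;
  for (let y = y0; y < y1; y++) {
    for (let x = x0; x < x1; x++) {
      const alpha = rgba[(y * imgW + x) * 4 + 3] ?? 0;
      if (alpha > alphaThreshold) opaque++;
    }
  }
  return opaque / total;
}
```

A result above some tolerance for a protected element would mean the character covers it and the sample should be rejected or regenerated. This only detects violations; it does not make the model layout-aware.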
Our ideal model:
We’re looking for a model (or combo) that can:
- Accept an image, a prompt, and an optional JSON layout or mask
- Draw only the new character (no background, no UI duplication)
- Ideally support a ControlNet mask or fine spatial constraints
- Return a PNG with transparency
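For the mask input, the layout rectangles could be rasterized directly into a binary mask. The sketch below is a minimal illustration: `buildEditMask`, the `Rect` type, and the `padding` parameter are hypothetical, and it assumes the common inpainting convention that white (255) marks pixels the model may paint and black (0) marks pixels to keep. That convention should be checked against whichever pipeline consumes the mask.

```typescript
interface Rect {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Single-channel mask, row-major: 255 = model may draw, 0 = protected UI.
function buildEditMask(
  width: number,
  height: number,
  protectedRects: Rect[],
  padding = 0, // extra margin in pixels around each protected element
): Uint8Array {
  const mask = new Uint8Array(width * height).fill(255);
  for (const r of protectedRects) {
    // Expand by padding, then clip to the canvas.
    const x0 = Math.max(0, Math.floor(r.x - padding));
    const y0 = Math.max(0, Math.floor(r.y - padding));
    const x1 = Math.min(width, Math.ceil(r.x + r.width + padding));
    const y1 = Math.min(height, Math.ceil(r.y + r.height + padding));
    for (let y = y0; y < y1; y++) {
      // Black out one row of the protected rectangle.
      mask.fill(0, y * width + x0, y * width + x1);
    }
  }
  return mask;
}
```

The same array can be written out as a grayscale image and passed as the mask, so the spatial constraint is enforced by the pipeline instead of described in the prompt.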
What we’d love from the community:
- Any suggestions for models or pipelines that could help?
- Has anyone tried layout-aware generation like this?
- Would custom ControlNet training or a DreamBooth variant help here?
We’re happy to share more screenshots or JSON layouts if needed.
Thanks in advance — this forum has been super helpful for us so far