There are many models that accept your condition.
1.OpenAI GPT 4
this is perhaps the most advanced option for multimodal capabilities.
2.Google DeepMind’s Gemini
3. Midjourney and stable diffusion
4. CLIP and Artbreeder
There are many models that accept your condition.
1.OpenAI GPT 4
this is perhaps the most advanced option for multimodal capabilities.
2.Google DeepMind’s Gemini
3. Midjourney and stable diffusion
4. CLIP and Artbreeder