Model that can generate both text and image as output

ValdeJunior · December 31, 2024, 12:15am

There are many models that accept your condition.

1.OpenAI GPT 4
this is perhaps the most advanced option for multimodal capabilities.
2.Google DeepMind’s Gemini
3. Midjourney and stable diffusion
4. CLIP and Artbreeder

Topic		Replies	Views
New Stable Diffusion 🔒 Gradio	0	843	September 2, 2023
Diffuser API Inference Community Limited to 1 Image Return Inference Endpoints on the Hub	0	486	April 8, 2023
Multimodal LLM with Image and Text sequentially in its prompt 🤗Transformers	2	12465	January 1, 2024
Image to Text model that can take an additional text as input for context 🤗Hub	1	493	September 5, 2023
I'm looking for an 'image to text' model Beginners	0	826	April 2, 2023

Model that can generate both text and image as output

Related topics