Postprocess using CLIP

cnut1648 · October 24, 2022, 4:43am

Hello, I wonder if there is a way to postprocess synthesized images with CLIP. For example, given the prompt “A photo of a chair”, the generated images are sometimes inconsistent with the prompt (e.g. a twisted chair), so I believe it would be helpful to postprocess them using an existing model like CLIP. Basically, we could generate 10 images and keep only the 5 with the highest CLIP scores against the text prompt.
I wonder if such functionality is already implemented. Thanks.

pcuenq · October 24, 2022, 6:21am

Hi @cnut1648, that’s certainly possible! It’s not implemented in the diffusers library, but you can do it yourself in exactly the way you are describing. You can start by taking a look at the CLIP usage documentation and go from there. Please let us know if you have any questions.
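
For reference, here is a minimal sketch of the reranking described above, assuming the `transformers` CLIP classes and the `openai/clip-vit-base-patch32` checkpoint (neither is named in this thread). The helper names `clip_scores`, `top_k_indices`, and `keep_best` are hypothetical, and `images` is assumed to be a list of PIL images such as the ones a diffusers pipeline returns.

```python
from typing import Any, List, Sequence


def clip_scores(prompt: str, images: Sequence[Any],
                model_name: str = "openai/clip-vit-base-patch32") -> List[float]:
    """Return one CLIP image-text similarity score per image (higher is better)."""
    # Imported lazily so the ranking helper below can be used without these packages.
    import torch
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(model_name)
    processor = CLIPProcessor.from_pretrained(model_name)
    inputs = processor(text=[prompt], images=list(images),
                       return_tensors="pt", padding=True)
    with torch.no_grad():
        outputs = model(**inputs)
    # logits_per_image has shape (num_images, num_texts); there is a single text here.
    return outputs.logits_per_image[:, 0].tolist()


def top_k_indices(scores: Sequence[float], k: int) -> List[int]:
    """Indices of the k highest scores, best first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:k]


def keep_best(prompt: str, images: Sequence[Any], k: int = 5) -> List[Any]:
    """Score every image against the prompt and keep the k best matches."""
    scores = clip_scores(prompt, images)
    return [images[i] for i in top_k_indices(scores, k)]
```

With 10 generated images, `keep_best("A photo of a chair", images, k=5)` would return the 5 images that CLIP scores highest against the prompt. Loading the model inside `clip_scores` keeps the sketch short; for repeated calls you would probably load it once and reuse it.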
