CLIP score for text-to-image model

goldenalcheese · December 19, 2024, 2:13pm

Hi, I am reading this page about the CLIP score used to evaluate text-to-image models: Evaluating Diffusion Models. I noticed that the CLIP score computed there is larger than 1, while in the literature it is usually reported at around 0.3. Should I just divide the CLIP score by 100 to match the scale used in the literature?
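
To make the question concrete, here is a minimal sketch of the rescaling I have in mind. It assumes the score on that page is the image-text cosine similarity multiplied by 100 and clamped at zero, which I believe is the convention torchmetrics follows, but that is an assumption worth checking against its documentation. The helper names and the toy embeddings are hypothetical and only stand in for real CLIP embeddings.

```python
import math


def cosine_similarity(a, b):
    """Plain cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def scaled_clip_score(image_emb, text_emb, scale=100.0):
    """Assumed convention: max(scale * cos(image, text), 0)."""
    return max(scale * cosine_similarity(image_emb, text_emb), 0.0)


# Toy 2-D vectors standing in for CLIP image/text embeddings (hypothetical).
image_emb = [3.0, 4.0]
text_emb = [4.0, 3.0]

score_100 = scaled_clip_score(image_emb, text_emb)  # score on the 0-100 scale
score_unit = score_100 / 100.0                      # same score on the 0-1 scale

print(score_100, score_unit)
```

If that assumption holds, dividing by 100 simply recovers the clamped cosine similarity, so the two scales would differ only by a constant factor.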
