Binary CLIP model

natankatz · March 9, 2023, 8:20am

Good Morning

I want to use clip as a binary model (namely give an input of a single sentence and an image)
The existing models don’t work since they output too high values in the last linear layer. Which means that they may provide high probability under the sigmoid.
Which model can solve this?

Topic		Replies	Views
Converting weights to .safetensors with HF format -> CLIP-L is ruined. Why? Beginners	18	1273	September 21, 2024
Image Captioning fine tuning 🤗Transformers	0	438	February 25, 2023
Can I pass multiple images in CLIP model? Models	1	1716	September 25, 2023
Getting token probabilities of a caption given an image from BLIP2 🤗Transformers	4	476	October 13, 2024
Convert model clip to onnx Models	0	203	July 5, 2024

Binary CLIP model

Related topics