Using ResNet50 weights inside `CLIPModel`

Hi.

The documentation for CLIP is really comprehensive, and along with a collaborator I was able to quickly cook something up.
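
For context, this is roughly what we have working right now. It's a minimal sketch based on the standard CLIP usage example from the docs, not our exact code:

```python
from PIL import Image
import requests
from transformers import CLIPModel, CLIPProcessor

# Current setup: the ViT-B/32 checkpoint, which works but is too heavy for our App Engine instance
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

inputs = processor(text=["a photo of a cat", "a photo of a dog"],
                   images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)
probs = outputs.logits_per_image.softmax(dim=1)  # image-text similarity as probabilities
```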

Now, in order to reduce the RAM requirements, particularly on App Engine (GCP), we need a model that is smaller than openai/clip-vit-base-patch32. Since OpenAI has pre-trained ResNet50 weights for CLIP (reference), I was wondering if it's possible to load those into CLIPModel. If so, could someone help me figure out how?
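
In case it helps, this is what I'd naively try: pull the RN50 weights with OpenAI's clip package and see whether any of the parameter names line up with the Hugging Face model. I'm fairly sure this isn't a real solution, since CLIPModel's vision tower looks like a ViT rather than a ResNet, so the state dict presumably won't match, but it shows the direction I'm thinking in:

```python
# pip install git+https://github.com/openai/CLIP.git
import clip
from transformers import CLIPModel

# OpenAI's pre-trained ResNet50 CLIP (the weights I'd like to reuse)
rn50, _ = clip.load("RN50", device="cpu")

# Hugging Face CLIP with a ViT vision tower
hf_model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")

# Naive attempt: copy over whatever parameter names happen to match.
# strict=False just to inspect how badly the key names diverge --
# almost certainly not a workable approach on its own.
missing, unexpected = hf_model.load_state_dict(rn50.state_dict(), strict=False)
print(f"missing keys: {len(missing)}, unexpected keys: {len(unexpected)}")
```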