CLIP like contrastive vision-language models for Spanish with pre-trained text and vision models For this project, a pre-trained image model like ViT and a pre-trained text model like BERT can be used as an image encoder and text encoder respectively. Model Pre-trained ViT , BERT models can be fou…

CLIP like contrastive vision-language models for Spanish with pre-trained text and vision models

patrickvonplaten June 29, 2021, 3:22pm 5

Awesome! Let’s define the project

1 Like

Topic		Replies	Views
CLIP like contrastive vision-language models for German with pre-traind text and vision models Flax/JAX Projects	5	1828	July 4, 2021
IndoClip : Pre Training Clip for Indonesian dataset Flax/JAX Projects	3	480	June 30, 2021
Image captioning for Spanish with pre-trained vision and text model Flax/JAX Projects	13	2486	July 19, 2021
Image captioning for Indonesia with pre-trained vision Flax/JAX Projects	4	486	June 29, 2021
Image captioning for Japanese with pre-trained vision and text model Flax/JAX Projects	0	1170	June 23, 2021