Image captioning for Spanish with pre-trained vision and text model

Dimitre · June 27, 2021, 12:44am

Hey there, this is very interesting, I have some experience with NLP and computer vision, and always wanted to get more experience with multi-modal models (text + vision), also since I saw the WIT dataset for the first time, I wanted to use it for some project, this seems a good opportunity.

If you want to know a little more about my background, check out my GitHub.

Topic		Replies	Views
Image captioning for Japanese with pre-trained vision and text model Flax/JAX Projects	0	1171	June 23, 2021
Image captioning for French with pre-trained vision and text model Flax/JAX Projects	6	2159	January 4, 2022
Image captioning for Indonesia with pre-trained vision Flax/JAX Projects	4	486	June 29, 2021
CLIP like contrastive vision-language models for Spanish with pre-trained text and vision models Flax/JAX Projects	4	397	June 29, 2021
Multilingual Image Captioning Flax/JAX Projects	10	1284	July 6, 2021

Image captioning for Spanish with pre-trained vision and text model

Related topics