Image captioning for low resource Indian Languages

There are many image captioning systems exist for english language, here in this project we will develop an Image captioning system for an Indian language

If we have time and resource, we can extend this to other languages as well.

Datasets:
Dataset can be created by translating captions of existing Flickr30k or any other image captioning dataset

An example:
https://www.amitavadas.com/Image2Tweet.html

Other resources:
Vision encoder Decoder model: Vision Encoder Decoder Models — transformers 4.12.2 documentation

Baseline:

Discord channel

To chat and organise with other people interested in this project, head over to our Discord and:

  • Follow the instructions on the #join-course channel
  • Join the #image-captioning channel

Just make sure you comment here to indicate that you’ll be contributing to this project :slight_smile:

5 Likes

Hey Sean - Looks really interesting. I am interested. Joined the Discord Channel.

1 Like

Hi Sean, sounds good. Count me in.

1 Like