There are many image captioning systems exist for english language, here in this project we will develop an Image captioning system for an Indian language
If we have time and resource, we can extend this to other languages as well.
Datasets:
Dataset can be created by translating captions of existing Flickr30k or any other image captioning dataset
An example:
https://www.amitavadas.com/Image2Tweet.html
Other resources:
Vision encoder Decoder model
: https://huggingface.co/transformers/model_doc/visionencoderdecoder.html
Baseline:
Discord channel
To chat and organise with other people interested in this project, head over to our Discord and:
- Follow the instructions on the
#join-course
channel - Join the
#image-captioning
channel
Just make sure you comment here to indicate that you’ll be contributing to this project