Please help me to fine tune image captioning. I want to fine tune CLIP, VIT and BLIp. If any other models are there please help to get.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Fine-tuning CLIP questions | 1 | 484 | May 21, 2024 | |
How to pass CLIP image embeddings to BLIP2 for captioning? | 1 | 1008 | November 15, 2023 | |
What would be the best image-to-text model for a lot of images? | 0 | 858 | November 8, 2023 | |
Image to Text model that can take an additional text as input for context | 1 | 469 | September 5, 2023 | |
Solution for Fine Tuning the Blip Model | 0 | 71 | December 13, 2024 |