Hi, can anyone recommend an image-to-text model that accepts an additional text input, so I can give it context before it generates the caption?

Image-to-text model that can take additional text as input for context

nadnadoni1234 September 5, 2023, 2:34pm 2

Anyone please?
