I’m using Kosmos-2, following the model card at
https://huggingface.co/microsoft/kosmos-2-patch14-224
with a simple prompt, “An image of”.
The process works well, but it is quite slow: 1000 images take about three and a half hours on an NVIDIA A100 GPU. Is it possible to enable batching so that multiple images are processed at once? At the moment, images are processed one at a time.
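To make the question concrete, here is a minimal, untested sketch of the batched loop I have in mind. It assumes the processor accepts a list of images together with a list of prompts and that `generate` handles the resulting batch. The helper names (`chunks`, `caption_batches`), the batch size, the token limit, and the use of float16 are my own illustrative choices, not something taken from the model card.

```python
from typing import Iterator, List, Sequence

MODEL_ID = "microsoft/kosmos-2-patch14-224"
PROMPT = "An image of"


def chunks(items: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def caption_batches(image_paths: Sequence[str],
                    batch_size: int = 16,        # assumption: tune to GPU memory
                    max_new_tokens: int = 64) -> List[str]:
    """Caption images in batches instead of one at a time (hypothetical helper)."""
    # Heavy imports are kept local so the batching helper is usable on its own.
    import torch
    from PIL import Image
    from transformers import AutoProcessor, Kosmos2ForConditionalGeneration

    device = "cuda" if torch.cuda.is_available() else "cpu"
    # Assumption: half precision is acceptable for captioning quality on GPU.
    dtype = torch.float16 if device == "cuda" else torch.float32

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = Kosmos2ForConditionalGeneration.from_pretrained(MODEL_ID, torch_dtype=dtype)
    model = model.to(device).eval()

    captions: List[str] = []
    for batch_paths in chunks(image_paths, batch_size):
        images = [Image.open(path).convert("RGB") for path in batch_paths]
        # Every sample uses the same prompt, so all sequences have equal length
        # and padding=True should not actually add any padding tokens.
        inputs = processor(text=[PROMPT] * len(images), images=images,
                           return_tensors="pt", padding=True).to(device)
        with torch.inference_mode():
            generated_ids = model.generate(
                pixel_values=inputs["pixel_values"].to(dtype),
                input_ids=inputs["input_ids"],
                attention_mask=inputs["attention_mask"],
                image_embeds=None,
                image_embeds_position_mask=inputs["image_embeds_position_mask"],
                use_cache=True,
                max_new_tokens=max_new_tokens,
            )
        texts = processor.batch_decode(generated_ids, skip_special_tokens=True)
        for text in texts:
            # post_process_generation returns (clean caption, entity list).
            caption, _entities = processor.post_process_generation(text)
            captions.append(caption)
    return captions
```

I would call it as `caption_batches(list_of_paths, batch_size=16)`. If the prompts differed in length, I assume the padding side would start to matter, but with a single fixed prompt that should not come up.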
Are there any other hints for improving processing speed? Thanks.