Feature extraction for image with a hosted model

A post was split to a new topic: Deploying CLIP-Vit as an inference endpoint