Image to Text API Inference - Input Error

Michele96 · October 30, 2023, 10:50am

Hello everyone!
I would like to create a Python script in which I send a POST request via the Hugging Face API Inference for an Image to Text model. The model is: nlpconnect/vit-gpt2-image-captioning link.
I’m having issues with sending the image, as the POST request is returning a 400 error.
The Python script is as follows:

# Call the function with your image path, model, and API key
# caption = image_to_text("your_image.jpg", "nlpconnect/vit-gpt2-image-captioning", "your_api_key")

import requests

def image_to_text(image_path, model, api_key):
url = f"https://api-inference.huggingface.co/models/{model}"
    headers = {
        "Authorization": f"Bearer {api_key}",
    }
    with open(image_path, 'rb') as image:
        files = {
            "file": image,
        }
        try:
            response = requests.post(url, headers=headers, files=files)
            response.raise_for_status()

            result = response.json()
            caption = result["result"][0]["caption"]
            return caption

        except Exception as e:
            print("Error during the API request:", str(e))
            return None

I’m struggling to identify the source of my error. Could someone please offer assistance? Thank you!

Topic		Replies	Views
Inference provider for captioning (image2text model) Beginners	3	22	June 16, 2025
What image type does inference text-to-image API return? Beginners	2	1396	June 27, 2023
How do I use Text-Image to Text models with Huggingface Inference? Beginners	3	258	October 12, 2024
How to make an inference for HuggingFaceModel of type 'image-to-text' Amazon SageMaker	0	502	January 27, 2024
Image-To-Text task on Inference Endpoint Inference Endpoints on the Hub	13	2331	October 17, 2023

Image to Text API Inference - Input Error

Related topics