How to Get Dutch Output for Dutch Audio Using Whisper Model via Hugging Face Inference Endpoint?

timtom12 · January 26, 2024, 5:13pm

Hello Hugging Face community,

I’m working with the Whisper model for a speech-to-text task, specifically handling Dutch audio files. I’m utilizing the dedicated inference endpoint provided by Hugging Face. However, I’m facing a challenge in ensuring that the transcription output aligns with the Dutch language of the audio input.

Below is the code snippet I’m currently using:

import requests

API_URL = "-----"
headers = {
    "Accept": "application/json",
    "Authorization": "Bearer ----",
    "Content-Type": "audio/wav"
}

def query(filename):
    with open(filename, "rb") as f:
        data = f.read()
    response = requests.post(API_URL, headers=headers, data=data)
    return response.json()

output = query("rec.wav")
print(output)

This setup successfully sends the audio file to the Hugging Face inference endpoint and receives a response. However, the language of the output transcription is not in Dutch but in Chinese.
How can I add a language code for dutch?

Topic		Replies	Views
How to configure the language in Whisper-large-v3 endpoint? Inference Endpoints on the Hub	1	519	January 27, 2024
How to use Inference API to perform speech recognition Beginners	1	206	October 12, 2024
How to run text to speech from inference endpoint given audio file url? Beginners	1	898	June 8, 2023
HuggingFace Inference endpoint 504 error Inference Endpoints on the Hub	3	793	January 30, 2024
Inference Model with API and Integrate to LM (Language Model) 🤗Transformers	0	636	June 7, 2022

How to Get Dutch Output for Dutch Audio Using Whisper Model via Hugging Face Inference Endpoint?

Related topics