Seeking detailed parameter docs on Wav2Vec via API

boxabirds · March 15, 2021, 9:59am

Hi y’all I’m trying to find docs on how to call the Wav2Vec model for TTS via the API.

The detailed API docs on parameters don’t seem yet to have information on the format of the API request for that model
Trying to infer it from the demo page fails because of a CORS error in Chrome:

Access to fetch at 'https://api-audio-frontend.huggingface.co/models/facebook/wav2vec2-large-960h-lv60-self' from origin 'https://huggingface.co' has been blocked by CORS policy: No 'Access-Control-Allow-Origin' header is present on the requested resource. If an opaque response serves your needs, set the request's mode to 'no-cors' to fetch the resource with CORS disabled.

bundle.7995df3.js:1 POST https://api-audio-frontend.huggingface.co/models/facebook/wav2vec2-large-960h-lv60-self net::ERR_FAILED
run_api @ bundle.7995df3.js:1
handleClick @ bundle.7995df3.js:1
async function (async)
handleClick @ bundle.7995df3.js:1
(anonymous) @ bundle.7995df3.js:1

valhalla · March 15, 2021, 4:02pm

cc @patrickvonplaten

patrickvonplaten · March 15, 2021, 4:27pm

Wav2Vec2 is not yet included in the official inference-api - it should be included soon though

boxabirds · March 16, 2021, 8:50am

Ah ok thanks @patrickvonplaten – sign me up for notifications when it is! Very interested in pricing too. The STT marketplace is rapidly maturing so I’m looking for the next price-driven challenger in this space.

Narsil · May 4, 2021, 3:19pm

ASR was added to the list of parameters. You basically need to send the raw audio file.

Cheers,
Nicolas

Topic		Replies	Views
Accelerated Inference API Automatic Speech Recognition Beginners	2	635	September 13, 2022
Facebook/wav2vec2-large-it-voxpopuli and /wav2vec2-large-it-voxpopuli seem broken Model cards	0	1540	December 28, 2021
Pre-training for Wav2Vec2-XLSR via Huggingface Models	15	5351	November 5, 2024
Text-to-speech inference API doesn't respect accept headers Inference Endpoints on the Hub	4	304	June 6, 2023
Resources on interpretability of wav2vec-style speech models Research	0	636	September 12, 2022

Seeking detailed parameter docs on Wav2Vec via API

Related topics