Serverless memory problem when deploying Wav2Vec2 with custom inference code

@diegoseto is there a particular reason why you are creating an inference.py script? You can directly provide your HF_API_TOKEN in the hub configuration next to your model ID and task. See HF_API_TOKEN
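
As a rough sketch of what that looks like (not taken from this thread), the token goes into the same `env`/hub dictionary as the model ID and task, so no custom inference script is needed. The model ID, framework versions, memory size, and concurrency below are placeholder assumptions; swap in your own values and IAM role.

```python
# Minimal sketch: serverless Hugging Face endpoint configured via the hub dict,
# with HF_API_TOKEN passed alongside HF_MODEL_ID and HF_TASK (values are examples).
import sagemaker
from sagemaker.huggingface import HuggingFaceModel
from sagemaker.serverless import ServerlessInferenceConfig

role = sagemaker.get_execution_role()  # assumes this runs inside SageMaker

hub = {
    "HF_MODEL_ID": "facebook/wav2vec2-base-960h",      # example model ID (assumption)
    "HF_TASK": "automatic-speech-recognition",
    "HF_API_TOKEN": "hf_xxx",                           # your Hugging Face token
}

huggingface_model = HuggingFaceModel(
    env=hub,                      # hub configuration: model ID, task, token
    role=role,
    transformers_version="4.26",  # example versions (assumptions)
    pytorch_version="1.13",
    py_version="py39",
)

predictor = huggingface_model.deploy(
    serverless_inference_config=ServerlessInferenceConfig(
        memory_size_in_mb=6144,   # serverless memory limit (assumption)
        max_concurrency=10,
    ),
)
```

With this setup the toolkit pulls the model from the Hub at startup using the token, which is usually simpler than packaging your own inference.py unless you need custom pre/post-processing.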