Serverless memory problem when deploying Wav2Vec2 with custom inference code

You mean other than serverless? Yes, there are actually four different inference options on SageMaker (real-time, serverless, asynchronous, and batch transform). @philschmid just released a blog post comparing the different options: https://www.philschmid.de/sagemaker-inference-comparison