Serveless memory problem when deploy Wav2Vec2 with custom inference code

I tried this but i’m still getting the same result.

inference.py:

import os
from transformers import pipeline

os.system('install kenlm')

def model_fn(model_dir):

    pipe = pipeline('automatic-speech-recognition', model_dir, chunk_length_s = 10)
    
    return pipe