Serveless memory problem when deploy Wav2Vec2 with custom inference code

Hi @marshmellow77

Cool! I will try this, thanks. What about the use of a language model in inference? There is another option?