Problem with sharing models among processes via multiprocessing

yilmazay · May 11, 2023, 9:00am

Hi All,
I have created a simple ASR decoder script that uses a pretrained XLSR model.
To speed up decoding, I want to do batch decoding through multiprocessing.
Namely, I want to give a batch of audio files to the script and have it spawn, say, 4 decoder processes that decode the files in parallel.
I have tried multiprocessing.Process and Pool. However, both get stuck at the process creation step, because I want to initialize (load) the ASR model once at the beginning and pass it to each process as a shared object, to avoid reloading the same model in every process.
During process creation it gives the following error:
RuntimeError: Cowardly refusing to serialize non-leaf tensor which requires_grad, since autograd does not support crossing process boundaries. If you just want to transfer the data, call detach() on the tensor before serializing (e.g., putting it on the queue).
Exception ignored in: <function Pool.__del__ at 0x0000029F8F88CE50>

The related code snippet is as follows:
if model is None or processor is None:
    load_model()  # loads and sets model and processor
…
with poolcontext(processes=4) as pool:
    results = pool.map(process, (processor, model, audio_list))

Using the Parallel class from joblib gives the same error.
The problem appears to be the serialization of the model and processor objects when a process is created.
How can I overcome this problem?
What is the correct way of sharing models while using multiprocessing?
I would appreciate any recommendations and guidance on this issue.
Thanks in advance for your time and support.
