I have been working on a multi-agent system with LangGraph, and so far everything has worked well with single-agent graphs. For RAG I use a CLIP embedding model for retrieval, but I have been running into an issue with model loading.
I am building a supervisor system in which a main agent calls different sub-agents. The supervisor agent itself loads the CLIP model and uses it without problems, but when one of the sub-agents is called and tries to load the same model from its tools, it gives me the error below.
I have checked multiple times that the model is downloaded and works normally on its own; I just can't figure out how to load it here.
Also, yes, I did try loading the model in a separate module and importing it from there, but my module-level constants are completely reset in the sub-process.
I also tried using a multiprocessing Manager, but that raised the same error, and if I add the if __name__ == "__main__" guard, execution never enters it.
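For reference, this is the guard idiom the error message is asking for, which works when the script really is the entry point (minimal sketch with a trivial worker, not my actual graph code):

```python
import multiprocessing as mp

def square(x: int) -> int:
    return x * x

if __name__ == "__main__":
    # With the "spawn" start method, child processes re-import this module,
    # so any process creation must sit behind this guard to avoid the
    # "bootstrapping phase" RuntimeError.
    ctx = mp.get_context("spawn")
    with ctx.Pool(2) as pool:
        print(pool.map(square, [1, 2, 3]))
```

My problem is that when LangGraph runs my code, my module is not `__main__`, which is why the body of the guard is never entered.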
I am completely lost at this point. I would appreciate help from Python multiprocessing experts, especially since LangGraph is the one creating the sub-processes, so I couldn't "use fork" as the error suggests.
Exception has occurred: OSError
Can't load the model for 'openai/clip-vit-base-patch32'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'openai/clip-vit-base-patch32' is the correct path to a directory containing a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.
RuntimeError:
An attempt has been made to start a new process before the
current process has finished its bootstrapping phase.
This probably means that you are not using fork to start your
child processes and you have forgotten to use the proper idiom
in the main module:
if __name__ == '__main__':
freeze_support()
...
The "freeze_support()" line can be omitted if the program
is not going to be frozen to produce an executable.
To fix this issue, refer to the "Safe importing of main module"
section in https://docs.python.org/3/library/multiprocessing.html
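What I understand the error above to be suggesting is something like the following: explicitly using a fork context (Unix-only) so that children inherit the already-loaded model instead of re-importing the main module. But since LangGraph creates the processes, I can't see where I would apply this:

```python
import multiprocessing as mp

def double(x: int) -> int:
    return 2 * x

def run_with_fork():
    # "fork" is only available on Unix. Forked children inherit the parent's
    # memory (including an already-loaded CLIP model) instead of re-importing
    # the main module, so no __main__ guard is needed for them.
    ctx = mp.get_context("fork")
    with ctx.Pool(2) as pool:
        return pool.map(double, [1, 2, 3])
```

This only helps if I can choose the start method before any process is created, which is exactly the part I don't control.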
There seem to be two kinds of cases here: simple bugs, which can usually be fixed by upgrading Transformers with pip, and harder design issues at the PyTorch level, which appear to be quite difficult to solve and unlikely to go smoothly.