Not able run all nodes on DML with optimum

Josephg17 · June 6, 2024, 4:38am

I’m new to HuggingFace/Optimum, in fact I started working on AI/ML stuff recently.

I tried to optimum models with DML EP (on my windows PC), for example take optimum/vit-base-patch16-224 · Hugging Face

model = ORTModelForImageClassification.from_pretrained(model_name, provider=“DmlExecutionProvider”)

onnx 1.16.1
onnxruntime 1.18.0
onnxruntime-directml 1.18.0
optimum 1.20.0

I see nodes are distributed between CPU EP & DML EP. Also, noticed different instances of same node are placed on both DML and CPU.

from verbose logs

2024-06-05 11:11:22.1833502 [V:onnxruntime:, session_state.cc:1152 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Node(s) placed on [DmlExecutionProvider]. Number of nodes: 335
2024-06-05 11:11:22.1936262 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Reshape (Reshape_8)
2024-06-05 11:11:22.2061078 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Concat (Concat_25)

2024-06-05 11:11:22.8286509 [V:onnxruntime:, session_state.cc:1152 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Node(s) placed on [CPUExecutionProvider]. Number of nodes: 9
2024-06-05 11:11:22.8322004 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Concat (Concat_7)
2024-06-05 11:11:22.8387018 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Reshape (Reshape_17)

For example take “Reshape” node/operator, I believe this node is supported on DML(Reshape_8 - is placed on DML), then why Reshape_17 instance of this node is placed on CPU

Why the few node instances are placed on CPU, even though DML have support for those nodes?

I expect, with provider=“DmlExecutionProvider” option, all nodes should be placed on DML only (exception - if there is no native support on DML for a particular node). But in the above case, all the nodes placed on CPU, support is present on DML

How can I force all nodes to be placed on DML? If the nodes got distributed b/w CPU and DML, I expect some overhead due to data transfer b/w CPU and DML

Thanks,

regisss · June 6, 2024, 8:41am

Looks like a duplicate of Not able to run on DML with pipeline

IlyasMoutawwakil · June 6, 2024, 9:28am

This is automatic behavior in onnxruntime:

Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf.

There was an issue in onnxruntime repo that was closed without giving a solution, maybe you’d wanna track it there.

github.com/microsoft/onnxruntime

[Performance] How can I forcefully assign nodes to DML EP

opened 07:09AM - 26 Jul 23 UTC

closed 06:34PM - 06 Sep 23 UTC

han-minhee

ep:DML model:transformer platform:windows

### Describe the issue I have a model with shape related operators including "R…eshape" and trying to run it using Dml EP, with .Net. When the session is run, it gives the warning: > [W:onnxruntime:, session_state.cc:1169 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf. When I add the CPU EP, the warning is gone, but I can see that the ops are run on CPU and the copy to host takes too much time in the performance profile. I want to see the performance when the nodes are assigned to DML EP. Would it be possible? ### To reproduce I'm using a model from: https://huggingface.co/JunkyByte/easy_ViTPose/tree/main ### Urgency _No response_ ### Platform Windows ### OS Version 11 ### ONNX Runtime Installation Released Package ### ONNX Runtime Version or Commit ID 1.15.1 ### ONNX Runtime API C# ### Architecture X64 ### Execution Provider DirectML ### Execution Provider Library Version _No response_ ### Model File _No response_ ### Is this a quantized model? No

Josephg17 · June 6, 2024, 4:21pm

No, it is different. As I mentioned in the other post, pipeline is forcing everything to CPU EP and I don’t see the issue with the latest packages

Josephg17 · June 6, 2024, 4:27pm

Thanks @IlyasMoutawwakil for your response.

Not just with shape related operators. simple ops like add, sub, equal are also getting placed on CPU EP with provider=“DmlExecutionProvider” option

with optimum/gpt2 · Hugging Face

2024-06-06 08:58:16.7010989 [V:onnxruntime:, session_state.cc:1152 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Node(s) placed on [CPUExecutionProvider]. Number of nodes: 502
2024-06-06 08:58:16.7177866 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Add (Add_241)
2024-06-06 08:58:16.7669495 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Pow (Pow_363)
2024-06-06 08:58:16.7753360 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Sub (Sub_380)
:
2024-06-06 08:58:18.0929380 [V:onnxruntime:, session_state.cc:1152 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Node(s) placed on [DmlExecutionProvider]. Number of nodes: 526
2024-06-06 08:58:18.1281232 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Sub (Sub_261)
2024-06-06 08:58:18.1397490 [V:onnxruntime:, session_state.cc:1154 onnxruntime::VerifyEachNodeIsAssignedToAnEp] Add (Add_266)

Topic		Replies	Views
Not able to run on DML with pipeline 🤗Optimum	2	340	June 6, 2024
Some nodes were not assigned to the preferred execution providers 🤗Optimum	1	3121	January 10, 2024
Some nodes were not assigned to the preferred execution providers which may or may not have an negative impact on performance. e.g. ORT explicitly assigns shape related ops to CPU to improve perf Models	0	935	November 10, 2023
Optimum library optimization and quantization fails 🤗Optimum	8	1552	February 22, 2025
\multi-node finetuning with Trainer 🤗Transformers	0	478	July 27, 2022

Not able run all nodes on DML with optimum

Related topics