Could you try after installing Optimum from source? You can do it as follows:
git clone https://github.com/huggingface/optimum.git
cd optimum/
pip install .
We recently added IOBinding in Optimum. A new version with this change will be released very soon but for now it is only available with a source install. You don’t have to change anything in your script, it should be used by default with the CUDAExecutionProvider
.