I have been trying to understand the performance benefits of oneDNN for CPU inference (reference).
To this end, I am trying to benchmark a pre-trained model with and without oneDNN optimizations. However, inference with the traced model fails with:
RuntimeError: Expected tensor for argument #1 'indices' to have one of the following scalar types: Long, Int; but got torch.FloatTensor instead (while checking arguments for embedding)
Any suggestions to resolve it?
Here’s my Colab Notebook: Google Colab.
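For context, here is a minimal sketch that reproduces the same error with a bare `nn.Embedding` layer (the layer sizes and input shapes are illustrative, not the ones from my notebook). The error seems to occur when the example input passed to `torch.jit.trace` is a float tensor, since embedding layers require `Long` or `Int` indices:

```python
import torch
import torch.nn as nn

# Illustrative embedding layer; sizes are arbitrary.
emb = nn.Embedding(num_embeddings=100, embedding_dim=16)

# Tracing with a float example input reproduces the RuntimeError,
# because nn.Embedding expects integer indices, not floats.
float_input = torch.rand(1, 10)
try:
    torch.jit.trace(emb, float_input)
except RuntimeError as e:
    print("trace failed:", type(e).__name__)

# Tracing with integer indices of the expected dtype works.
int_input = torch.randint(0, 100, (1, 10), dtype=torch.long)
traced = torch.jit.trace(emb, int_input)
print(traced(int_input).shape)  # torch.Size([1, 10, 16])
```

Is the fix simply to make sure the example inputs used for tracing have the same dtypes as the real inputs, or is there something oneDNN-specific going on here?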