Backend low level kernel libraries used in Transformers

Hello @ArthurZ @RaushanTurganbay ,
What are the backend libraries used for low level kernels operations(matmul, softmax etc) in transformers library?
Issue: If I ran same model(say mamba or llama) on x86 machine and aarch64 machine, I observe difference in the model timing. I suspect there are different paths for kernels in x86 and aarch64.
Please specify the backend libraries used for x86 and aarch64

1 Like