CUDA version conundrum

Hello,

Transformers relies on PyTorch, TensorFlow, or Flax. I typically use the first.

In any case, the latest versions of PyTorch and TensorFlow are, at the time of this writing, compatible with CUDA 11.8.

Lucky me, for CUDA 11.8 is supposed to be the first version to support the RTX 4090 cards.

Well, not fully, apparently:

MapSMtoCores for SM 8.9 is undefined.  Default to use 128 Cores/SM
MapSMtoCores for SM 8.9 is undefined.  Default to use 128 Cores/SM
MapSMtoArchName for SM 8.9 is undefined.  Default to use Hopper
GPU Device 0: "Hopper" with compute capability 8.9

I believe the 4090 to be an Ada Lovelace, not a Hopper.

Will the fact that the card is not correctly identified by CUDA have any effect on resource utilisation and/or performance?

Is there anything we could do about that?

Does anyone know if PyTorch works with a more recent CUDA? Or can the MapSMtoCores and MapSMtoArchName mappings somehow be hard-coded? Or is this completely irrelevant?
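For what it's worth, I believe those two messages come from the helper_cuda.h header shipped with the CUDA samples, where the SM-version-to-cores and SM-version-to-architecture mappings are simple lookup tables with a fallback. A rough Python sketch of that idea (the table entries and function name below are my own reconstruction from published per-architecture specs, not the actual header):

```python
# Sketch of the SM -> cores-per-SM lookup that the CUDA samples
# implement in C in helper_cuda.h. An unknown compute capability
# falls through to a default, which is what produces the
# "MapSMtoCores for SM 8.9 is undefined" warning.

SM_TO_CORES = {
    (7, 0): 64,    # Volta
    (7, 5): 64,    # Turing
    (8, 0): 64,    # Ampere (GA100)
    (8, 6): 128,   # Ampere (GA10x)
    (8, 9): 128,   # Ada Lovelace -- the entry missing for the RTX 4090
    (9, 0): 128,   # Hopper
}

SM_TO_ARCH = {
    (8, 0): "Ampere",
    (8, 6): "Ampere",
    (8, 9): "Ada",
    (9, 0): "Hopper",
}

def sm_to_cores(major: int, minor: int, default: int = 128) -> int:
    """Cores per SM for a compute capability, with a samples-style fallback."""
    try:
        return SM_TO_CORES[(major, minor)]
    except KeyError:
        print(f"MapSMtoCores for SM {major}.{minor} is undefined. "
              f"Default to use {default} Cores/SM")
        return default
```

If that reading is right, hard-coding the 8.9 entries would only fix the samples' reporting; and since Ada's SM 8.9 does, as far as I can tell, have 128 FP32 cores per SM, the 128 fallback happens to be correct anyway and the warning is cosmetic.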

Best,

Ed

Update: for PyTorch, the nightly version is compatible with CUDA 12.1, which fully supports the card and simplifies things considerably.