Hey thanks for that. Seems that tensorflow actually doesn’t register my GPU when I run
tf.test.gpu_device_name()
I get error as below:
2021-11-08 22:08:10.636081: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:939] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-11-08 22:08:10.636549: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1850] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
''
I can use the GPU with Pytorch definitely. When I change the code to Pytorch and place the model on GPU, and start training, I can see GPU usage increasing from nvidia-smi.
I’m using Tensorflow 2.7.0 as well, so it’s not any of the older versions where you need tensorflow-gpu as well.
I’m using cuda V11.0.221 and tried reinstalling but it seems that my account doesn’t have the permission to do that unfortunately.
Any ideas to tackle this would be appreciated highly