P40 and P100 for inference/training

Hi, I’m planning to build an inference/training workstation. I’ve heard that the Tesla P100 is better than the Tesla P40 for training, but that the situation is reversed for inference. Does it make sense to build a workstation with both types of card, or would only P40s or only P100s be enough? And why? (I’m planning to expand to a 4-GPU workstation within a year.)
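
For context, my understanding is that PyTorch addresses each card independently, so a mixed box could in principle run training on one card and inference on another. Something like this rough sketch (device indices are placeholders; the actual ordering depends on the system):

```python
import torch

# Rough sketch: pin different jobs to different cards in a mixed box.
# Device indices are placeholders; real ordering depends on the system.
train_dev = torch.device("cuda:0")   # say, the P100 (FP16-capable) for training
infer_dev = torch.device("cuda:1")   # say, the P40 (24 GB VRAM) for inference

model  = torch.nn.Linear(4096, 4096).to(train_dev)   # training copy
served = torch.nn.Linear(4096, 4096).to(infer_dev)   # inference copy

x = torch.randn(8, 4096, device=infer_dev)
with torch.no_grad():
    y = served(x)   # runs entirely on the inference card
```

What I can't tell is whether the mix is worth it versus just buying two of the same card, since a single job can't pool such different GPUs cleanly.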
And one more question: the P40 is compute capability 6.1 and the P100 is 6.0 (CUDA compute capability, not toolkit version), but current torch builds ship with CUDA 11.8/12.1… Will this cause a problem? I’ve seen people run LLMs on the P40, so given the CUDA situation I don’t understand how it works at all.
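
For what it's worth, here's how I'd check whether a given torch build was compiled for these cards (just a sketch, assuming a CUDA build of torch is installed; the values in the comments are what I'd expect, not verified):

```python
import torch

# CUDA toolkit the wheel was built against, e.g. "12.1"
print(torch.version.cuda)

# Compute capabilities this build was compiled for, e.g. ["sm_50", ..., "sm_90"].
# The P40 is sm_61 and the P100 is sm_60, so they'd need to appear here.
print(torch.cuda.get_arch_list())

# What the installed card actually reports, e.g. (6, 1) for a P40
print(torch.cuda.get_device_capability(0))
```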

I forgot to mention: what are the CPU requirements? I can’t allocate a big budget for the build, so I’m looking at inexpensive processors like the EPYC 7302P (it seems to have enough PCIe lanes to support 4 GPUs).
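
For the eventual 4-card build, I'd plan to sanity-check whether the CPU/board combination actually exposes all the cards to torch with something like this (a sketch, assuming the driver already sees the GPUs):

```python
import torch

# Sketch: list every GPU torch can see, with VRAM and compute capability,
# to confirm the platform exposes all four cards.
for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    print(f"{i}: {props.name}, "
          f"{props.total_memory / 2**30:.0f} GiB, "
          f"sm_{props.major}{props.minor}")
```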