Apologies if this is a common question. I have a test rig with an 8700 + 3060 12 GB running at the moment, but in order to run larger models, is it feasible to run an X99 Xeon (16 or 18 core) along with 128 GB RAM, plus 3 or 4 3060 12 GB cards (perhaps 3-4 5060 Ti 16 GB cards in future)?
Obviously bigger models could run in system RAM but run slower, and anything needing GPU VRAM would load into that when needed. Is this possible? Is the power draw going to be ridiculous?
I'm down an AliExpress rabbit hole right now and need some sanity lol!
For hardware purchase advice, I highly recommend the Hugging Face Discord.
Obviously bigger models could run in system RAM but run slower, and anything needing GPU VRAM would load into that when needed. Is this possible?
Yes, it is possible. Multi-GPU is harder than single-GPU in terms of code and environment setup, but I think many people run multi-GPU rigs for exactly that purpose. Just be careful with the IOMMU settings in BIOS/UEFI; I have heard that a wrong setting there can cause problems.
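To get a rough feel for how a model would be divided between the cards and system RAM, here is a minimal sketch. It assumes the model's layers are all about the same size and that each card keeps some VRAM in reserve for context and buffers. The function name `split_layers`, the 1.5 GB reserve, and the 40 GB / 80-layer model are made-up illustration values, not measurements from any real setup.

```python
def split_layers(model_gb, n_layers, gpu_vram_gb, reserve_gb=1.5):
    """Greedy estimate: fill each GPU with whole layers, leave the rest in system RAM.

    All inputs are assumptions supplied by the caller; real runtimes also need
    VRAM for the KV cache and scratch buffers, which reserve_gb only approximates.
    """
    per_layer_gb = model_gb / n_layers  # assume equally sized layers
    remaining = n_layers
    plan = []
    for vram in gpu_vram_gb:
        usable = max(vram - reserve_gb, 0.0)
        # whole layers that fit on this card, capped by what is still unplaced
        fit = min(int(usable // per_layer_gb), remaining)
        plan.append(fit)
        remaining -= fit
    return plan, remaining  # (layers per GPU, layers left in system RAM)


# Hypothetical 40 GB model with 80 layers on three 12 GB cards
gpu_plan, cpu_layers = split_layers(40.0, 80, [12, 12, 12])
print(gpu_plan, cpu_layers)  # [21, 21, 21] 17
```

With a fourth 12 GB card the same hypothetical model would fit entirely in VRAM. Runtimes such as llama.cpp expose this kind of split through an option for the number of layers offloaded to the GPU, so in practice you tune that number rather than computing it by hand.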
Is the power draw going to be ridiculous?
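With that spec, a single GeForce card seems to draw about 200 watts under load, so three or four of them would come to roughly 600-800 watts for the GPUs alone, before counting the CPU and the rest of the system. That is a lot of power, but it seems like it would be about the same as running an air conditioner at full power.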
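For a quick back-of-the-envelope check, something like the following could be used. The 200 W per card is the rough figure mentioned above; the 150 W for the CPU and 100 W for the motherboard, RAM, drives, and fans are my own assumptions, so swap in the numbers for your actual parts. These are worst-case load figures, and idle draw will be much lower.

```python
def estimate_power(n_gpus, gpu_watts=200.0, cpu_watts=150.0, other_watts=100.0):
    """Rough peak system draw in watts. All default wattages are assumptions."""
    return n_gpus * gpu_watts + cpu_watts + other_watts


def daily_kwh(total_watts, hours_per_day):
    """Energy used per day in kWh if the rig runs at that draw for the given hours."""
    return total_watts * hours_per_day / 1000.0


peak = estimate_power(4)   # four cards at about 200 W each, plus CPU and the rest
print(peak)                # 1050.0 watts
print(daily_kwh(peak, 4))  # kWh per day at 4 hours of full load
```

Multiply the daily kWh by your local electricity price to see what it costs to run, and leave some headroom above the peak figure when picking a power supply.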