The HF manual consists of its introduction and a type that is automatically generated from the library. (They extract what’s written as comments in the code.)
The introduction often contains theoretical ideals and information that was correct at the time it was written but is now incorrect, so ultimately it is quicker to read the library code or watch and steal the work of others who have done it well.
It would be easiest if it could be fixed by updating the library…
But when it comes to multi-GPUs, few people use them on HF’s Spaces, so if it’s buggy, you’ll have to do it manually with torch.
https://pytorch.org/tutorials/intermediate/model_parallel_tutorial.html