When I deployed Mistral-Large-Instruct-2407 on a multi-GPU server, I set the GPU device map to "auto", but responses came back very slowly. I wanted to run my 8× A100 80 GB GPUs at full speed, but every attempt at tuning the multi-GPU settings (workers, threads, per-GPU memory limits, etc.) ended with GPU memory fully occupied. Sometimes, when it did run, I hit errors whenever child threads tried to use the model that the main thread had pre-loaded onto the GPUs. I implemented a "swap" solution, but it still tells me to use "swap".
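For the child-thread errors, one pattern I'm considering is having a single worker thread own the model and funneling all requests to it through a queue, so no two threads ever touch the CUDA context at once. Here's a minimal sketch of that pattern; `DummyModel` is a hypothetical stand-in for the real loaded model, just to show the structure:

```python
import queue
import threading

class DummyModel:
    """Hypothetical stand-in for the real GPU-loaded model."""
    def generate(self, prompt: str) -> str:
        return prompt.upper()

def serve(model, requests: queue.Queue, results: dict) -> None:
    # Only this thread ever calls the model, so other threads
    # never touch the GPU state directly.
    while True:
        item = requests.get()
        if item is None:  # sentinel: shut down the worker
            break
        idx, prompt = item
        results[idx] = model.generate(prompt)

requests = queue.Queue()
results = {}
model = DummyModel()  # in reality, the model pre-loaded by the main thread

worker = threading.Thread(target=serve, args=(model, requests, results))
worker.start()

# Any number of producer threads can enqueue work safely.
for i, prompt in enumerate(["hello", "world"]):
    requests.put((i, prompt))

requests.put(None)  # signal shutdown
worker.join()
print(results)  # {0: 'HELLO', 1: 'WORLD'}
```

I'm not sure this is the intended way to share one pre-loaded model across threads, which is why I'm asking.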
I haven't seen any official sample code on Hugging Face or GitHub. I'm seeking guidance from everyone.