Any good code/tutorial that shows how to do inference with Llama 2 70B on multiple GPUs with accelerate?

Do you know of any good code/tutorial that shows how to do inference with Llama 2 70B on multiple GPUs with accelerate?

Hey @vbachi, you can check this doc: Handling big models for inference
