Have you seen slower performance on Llama2 70B vs 13B, even when running on a much bigger inference type?
What is an “inference type”, and which “inference types” did you use for 70B and 13B respectively?