Hello! I am running a laptop with an 11th Gen Intel Core i7-11800H @ 2.30 GHz (8 cores, 16 logical processors) and an RTX 3070 with 8 GB VRAM, overclocked to 1700 MHz. I can run 4-bit quantized 7B-parameter models (WizardLM, Vicuna) with AutoGPTQ or GPTQ-for-LLaMa very fast (>15 tokens/s), but when I move to the 13B versions of the same models it crawls below 1 token per second. Am I reaching the limits of what a 3070 can handle, or am I misconfiguring something and should keep looking for solutions? I just want to know so I'm not troubleshooting for no reason. I am running the webui with TheBloke's 4-bit quantized models. Thank you in advance for any answers.
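For reference, here is the back-of-the-envelope arithmetic behind my suspicion (a rough sketch: the 4-bits-per-weight figure ignores GPTQ group-size metadata, activations, and the KV cache, so real usage will be somewhat higher):

```python
# Rough VRAM estimate for 4-bit quantized models vs. an 8 GB card.
# Assumption: pure 4-bit weights; overhead (KV cache, CUDA context,
# quantization metadata) is NOT counted and can add a GB or more.
import torch

def approx_weight_gb(n_params: float, bits: int = 4) -> float:
    """Approximate size of the quantized weights in GB."""
    return n_params * bits / 8 / 1e9

for n in (7e9, 13e9):
    print(f"{n/1e9:.0f}B @ 4-bit ≈ {approx_weight_gb(n):.1f} GB")
# 7B  @ 4-bit ≈ 3.5 GB  -> fits comfortably in 8 GB
# 13B @ 4-bit ≈ 6.5 GB  -> very tight once cache/context are added

# Check what the card actually has free right now.
free, total = torch.cuda.mem_get_info()
print(f"free: {free/1e9:.1f} GB / total: {total/1e9:.1f} GB")
```

If the 13B weights plus cache spill past 8 GB, the driver can fall back to shared system memory, which would match the sub-1-token/s behavior I'm seeing.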