Training for GPTQ, possible?

zinomx · July 16, 2023, 7:24am

Apparently it’s not possible to train a Lora for a GPTQ model. If I’m incorrect please let me know. We’re using TheBloke/Wizard-Vicuna-30B-Superhot-8K-GPTQ which runs fast and does everything we need it to do on our Nividia A40 with 48GB of VRAM.

However, to train a Lora for this model, I’ve concluded we need to train for the original model, which is ehartford/wizard_vicuna_70k_unfiltered, which is roughly 130GB

In order to train the original model, we’ll need to lease a GPU server with 130GB of VRAM, which is roughly $3500 a month, so…can someone please tell me if a Lora trained from the original model will FOR SURE work on the GPTQ version of the model, or do I have any of this wrong?

ntheden · October 24, 2023, 10:11pm

This article says you can.

Topic		Replies	Views
LoRa fine tuning a chatbot on 6GB VRAM GPU Beginners	1	301	January 21, 2025
Hardware Requirement GPU Beginners	3	1164	January 27, 2025
LoRA / QLoRA fine tuning a 8b Model(llama 3.1) Beginners	1	297	February 24, 2025
Qunatized model with LORA takes much more GPU memory than the un-quantized model with LORA for the (E-5-Large Embedding Transformer) 🤗Transformers	4	1748	October 8, 2023
GPTQ+PEFT model running very slowly at inference Intermediate	4	1691	October 24, 2023

Training for GPTQ, possible?

Related topics