I have used the PEFT LoRA technique to fine-tune a Mistral model, with bitsandbytes for quantization. I wanted to know where my quantized model is being saved, and how I can use the quantized model together with my adapter layers. If there is any reference article, please share it.
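For context, a minimal sketch of the usual QLoRA reload pattern: bitsandbytes quantizes the base model on the fly at load time (the quantized weights are not written to disk as a separate checkpoint), and `PeftModel.from_pretrained` then attaches the saved adapter on top. The model ID and the adapter directory below are placeholder assumptions.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

base_id = "mistralai/Mistral-7B-v0.1"   # assumed base model
adapter_dir = "./my-lora-adapter"       # assumed: dir where the adapter was saved

# Quantization config matching a typical QLoRA setup (nf4, bf16 compute)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# The base model is quantized in memory as it loads; only the original
# full-precision checkpoint exists on disk.
base = AutoModelForCausalLM.from_pretrained(
    base_id, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_id)

# Wrap the quantized base with the LoRA adapter weights for inference.
model = PeftModel.from_pretrained(base, adapter_dir)
```

Note that saving a PEFT model (`model.save_pretrained(...)`) writes only the small adapter weights, which is why the quantized base never appears as its own saved file.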
Related topics
| Topic | Replies | Views | Activity |
|---|---|---|---|
| Finetuned LLM model conversion to GGUF - performance drop | 4 | 1848 | July 31, 2024 |
| How to load a model fine-tuned with QLoRA | 2 | 6666 | July 29, 2024 |
| Peft model from pretrained load in 8/4 bit | 6 | 17565 | October 12, 2023 |
| Quantizing a model on M1 Mac for qlora | 0 | 1672 | March 14, 2024 |
| Peft following bits and bytes seems to have no effect on LLM | 0 | 496 | January 31, 2024 |