I have quantized a model using BitsAndBytes, and I want to evaluate it on some benchmark tasks. For other (non-quantized) models, I use trainer.evaluate() to obtain the metrics.
This does not work for quantized models, however. I use the following code:
...
trainer = Trainer(
    model=model_quantized,
    ...
)
trainer.evaluate(test_data)
...
I obtain the following error:
ValueError: You cannot perform fine-tuning on purely quantized models. Please attach trainable adapters on top of the quantized model to correctly perform fine-tuning. Please see: https://huggingface.co/docs/transformers/peft for more details
I understand why the error is raised, since I am passing a quantized model to the Trainer, but I do not intend to train this model; I only want to perform evaluation.
Is there a way to avoid this error and run evaluation on a quantized model?
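For reference, I know I could write a manual evaluation loop and skip the Trainer entirely, along these lines (a minimal sketch: the dummy model and random data stand in for my model_quantized and test_data, and the metric is just an averaged loss, not my benchmark metrics), but I would prefer to keep using trainer.evaluate():

```python
import torch
import torch.nn as nn

# Stand-ins so the sketch is self-contained; in my case these would be
# model_quantized and the tokenized test_data from the question.
model = nn.Linear(8, 8)
data = [(torch.randn(4, 8), torch.randn(4, 8)) for _ in range(3)]
loss_fn = nn.MSELoss()

model.eval()  # inference mode; no adapters or gradients needed
total_loss, n_batches = 0.0, 0
with torch.no_grad():  # evaluation only, so no autograd graph is built
    for inputs, targets in data:
        outputs = model(inputs)
        total_loss += loss_fn(outputs, targets).item()
        n_batches += 1

avg_loss = total_loss / n_batches
print(f"eval_loss: {avg_loss:.4f}")
```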
Thanks in advance!