How to load a model with from_pretrained() without requiring gradients

marshmellow77 · October 31, 2022, 7:51am

I have an int8 model that I saved with .save_pretrained(). When trying to load the model with .from_pretrained() I get error “RuntimeError: Only Tensors of floating point and complex dtype can require gradients”.

I tried

with torch.no_grad()
    model = AutoModelForCausalLM.from_pretrained("./model-8bit", device_map="auto")

but to no avail.

Any advice would be appreciated

WeyinmiA · July 15, 2023, 11:14pm

Has anyone solved this?

Topic		Replies	Views
If I use trainer.train() and then save the model, is that still useful? Beginners	4	2701	June 24, 2022
How to load a pretrained custom model using `from_pretrained` Beginners	4	7345	June 21, 2023
Saving a model and loading it Models	3	56957	July 5, 2024
How to load finetuned model in TF Beginners	2	449	September 28, 2020
Load quantized model in memory Beginners	1	585	December 8, 2023

How to load a model with from_pretrained() without requiring gradients

Related topics