I trained a BERT model using PyTorch Lightning, and now I want to load it into Optimum for inference. How can I do that?
I tried to save it with
torch.save(model.bertmodel.state_dict(), 'bert.pth')
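For reference, here is a minimal self-contained sketch of that save step. TinyModel is a hypothetical stand-in for the BERT module inside my LightningModule; the point is that torch.save of a state_dict produces a single file of tensors, with no config.json or folder layout.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the Lightning module's inner model
# (the real code saves `model.bertmodel.state_dict()`).
class TinyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 2)

torch.save(TinyModel().state_dict(), "bert.pth")

# The .pth file contains only a dict of weight tensors, not a model folder.
state = torch.load("bert.pth")
print(sorted(state.keys()))  # → ['linear.bias', 'linear.weight']
```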
and then tried to load it in Optimum with
from optimum.onnxruntime import ORTQuantizer
from optimum.onnxruntime.configuration import AutoQuantizationConfig

# The type of quantization to apply
qconfig = AutoQuantizationConfig.arm64(is_static=False, per_channel=False)
quantizer = ORTQuantizer.from_pretrained('bert.pth', feature="sequence-classification")

# Quantize the model!
quantizer.export(
    onnx_model_path="model.onnx",
    onnx_quantized_model_output_path="model-quantized.onnx",
    quantization_config=qconfig,
)
The error it throws is
OSError: bert.pth is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
Is there a way to handle this without uploading the model to the Hugging Face Hub?