How to upload a quantized model?

In case it's useful, I've also described in another thread the main steps needed to reload quantized weights using PyTorch's state_dict: Pegasus Model Weights Compression/Pruning - #9 by lewtun
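
For a rough idea of what that reload looks like, here is a minimal sketch. It assumes dynamic quantization with `torch.quantization.quantize_dynamic`, and it uses a tiny `nn.Sequential` as a hypothetical stand-in for the real model (`build_model` and `quantize` are illustrative helpers, not taken from the linked thread). The key point is that a quantized state_dict cannot be loaded into the float model: the fresh model has to be quantized the same way first, so that its modules and keys match.

```python
import io

import torch
import torch.nn as nn


def build_model() -> nn.Module:
    # Hypothetical stand-in for the real architecture (e.g. a transformer).
    return nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))


def quantize(model: nn.Module) -> nn.Module:
    # Dynamic quantization: nn.Linear weights become int8,
    # activations are quantized on the fly at inference time.
    return torch.quantization.quantize_dynamic(model, {nn.Linear}, dtype=torch.qint8)


torch.manual_seed(0)

# 1. Quantize the trained float model and save its state_dict.
quantized = quantize(build_model().eval())
buffer = io.BytesIO()  # in-memory here; a file path works the same way
torch.save(quantized.state_dict(), buffer)
buffer.seek(0)

# 2. To reload: rebuild the float architecture, quantize it the same way
#    so the module types match, and only then load the saved weights.
reloaded = quantize(build_model().eval())
# weights_only=False is only appropriate for a file you created yourself.
reloaded.load_state_dict(torch.load(buffer, weights_only=False))

# 3. Sanity check: both models should give identical outputs.
x = torch.randn(2, 16)
with torch.no_grad():
    same = torch.equal(quantized(x), reloaded(x))
```

With a real model, the same pattern should apply: whoever downloads the uploaded weights needs to instantiate the original architecture, apply the same quantization call, and then call `load_state_dict`.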