Loading a quantized model on CPU only

I have a similar issue: "AssertionError: Torch not compiled with CUDA enabled".
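
For context, PyTorch raises this assertion when code tries to use a CUDA device on a CPU-only build. A minimal sketch of a device check that falls back to the CPU is below. The helper name `pick_device` is hypothetical and not part of any library, and whether a given quantized model can actually run on CPU depends on the quantization backend, which this sketch does not address.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" only if PyTorch is installed and reports a usable GPU.

    Hypothetical helper for illustration; falls back to "cpu" otherwise.
    """
    # If PyTorch is not installed at all, there is nothing to query.
    if importlib.util.find_spec("torch") is None:
        return "cpu"

    import torch

    # torch.cuda.is_available() is False on CPU-only builds, so checking it
    # first avoids requesting a CUDA device that the build cannot provide.
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"Selected device: {device}")
```

The resulting string can be passed wherever a device is expected (for example `tensor.to(device)`), in place of a hard-coded `"cuda"` or a call to `.cuda()`.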