@lewtun I chose dynamic quantization approach. And I dont think from_pretrained
support loading quantised models.
I’d be happy if you can take a look. Can we hop on a google meet call and you can help me out.
Hopefully.
Thanks
@lewtun I chose dynamic quantization approach. And I dont think from_pretrained
support loading quantised models.
I’d be happy if you can take a look. Can we hop on a google meet call and you can help me out.
Hopefully.
Thanks