I tried RoBERTa-Base and BERT-Base with random inputs. In both cases, the outputs of the dynamic-range quantized model are not close to the outputs of the original model.
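To make "close" concrete, here is a minimal sketch of the kind of check I have in mind. It is framework-agnostic and only assumes both models' outputs have been flattened into plain lists of floats. The helper name `compare_outputs` and the sample values are made up for illustration; they are not taken from the notebook.

```python
import math

def compare_outputs(reference, quantized):
    """Return (max absolute difference, cosine similarity) for two flat sequences."""
    if len(reference) != len(quantized) or len(reference) == 0:
        raise ValueError("outputs must be non-empty and have the same length")
    max_abs_diff = max(abs(r - q) for r, q in zip(reference, quantized))
    dot = sum(r * q for r, q in zip(reference, quantized))
    norm_ref = math.sqrt(sum(r * r for r in reference))
    norm_q = math.sqrt(sum(q * q for q in quantized))
    # Cosine similarity is undefined if either vector is all zeros.
    cosine = dot / (norm_ref * norm_q) if norm_ref and norm_q else float("nan")
    return max_abs_diff, cosine

# Made-up values standing in for flattened model outputs (assumption, not real data).
float_out = [0.12, -0.53, 0.98, 0.07]
quant_out = [0.10, -0.49, 1.01, 0.02]
max_diff, cos = compare_outputs(float_out, quant_out)
print(f"max abs diff: {max_diff:.4f}, cosine similarity: {cos:.4f}")
```

Reporting both numbers helps separate small element-wise noise (cosine similarity near 1, small max difference) from outputs that diverge outright.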
Here’s the Colab Notebook to reproduce the problem: Google Colab
Any solutions?