Hugging Face Forums
Optimum library optimization and quantization fails
🤗Optimum
ddahlmeier
February 10, 2024, 9:05am
Quantizing from the non-optimized model works.