Topic | Replies | Views | Activity
About the 🤗 Optimum category | 0 | 622 | March 25, 2022
Getting ValueError when exporting model to ONNX using Optimum | 4 | 61 | August 19, 2022
Optimize an ONNX Seq2Seq model | 0 | 18 | August 18, 2022
How to optimize ONNX seq2seq model? | 2 | 190 | August 17, 2022
Regarding Quantizing gpt2-xl, gpt2-large, etc. | 2 | 74 | August 10, 2022
Load PyTorch trained model via Optimum | 5 | 454 | August 10, 2022
Support for MPNet models | 2 | 85 | August 8, 2022
What does the decoder with past values mean? | 1 | 134 | August 5, 2022
Recommended Approach for Distributed Inference | 3 | 182 | August 1, 2022
Optimum & RoBERTa: how far can we trust a quantized model against its PyTorch version? | 10 | 477 | July 27, 2022
Symlink error when importing ORTSeqClass model via Pipeline | 4 | 184 | July 22, 2022
Cannot import classes ORTModelFor(...) in AWS SageMaker | 4 | 251 | July 14, 2022
Quantized Model size difference when using Optimum vs. ONNX Runtime | 3 | 226 | July 14, 2022
Optimum & T5 for inference | 14 | 974 | July 8, 2022
Use_auth_token and revision with the class ORTModelForSequenceClassification? | 1 | 209 | July 5, 2022
Unexpected input data type | 1 | 348 | June 29, 2022
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64)) | 1 | 399 | June 29, 2022
ONNX vs. Optimum | 1 | 269 | June 28, 2022
Pass CPU cores to speed up inference | 1 | 373 | June 14, 2022
Quantization on customized model | 1 | 463 | May 10, 2022
Optimum v1.1.0 breaking problems | 1 | 487 | April 26, 2022