About the 🤗 Optimum category
|
|
0
|
990
|
March 25, 2022
|
Optimize AND quantize with Optimum
|
|
9
|
565
|
March 27, 2023
|
Potential Memory Leak for ORTModelForCausalLM with TensorRT Providor
|
|
0
|
21
|
March 22, 2023
|
Optimisation and Quantization of Tensorflow Model
|
|
0
|
22
|
March 21, 2023
|
AttributeError: 'NoneType' object has no attribute 'pad_token'
|
|
0
|
46
|
March 15, 2023
|
When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx?
|
|
8
|
147
|
March 12, 2023
|
How to Prune Transformer based Model?
|
|
1
|
160
|
March 9, 2023
|
Optimum vs Accelerate
|
|
5
|
153
|
March 2, 2023
|
CUDA OOM when export a large model to ONNX
|
|
3
|
87
|
February 17, 2023
|
How does the ONNX exporter work for GenerationModel with `past_key_value`?
|
|
9
|
168
|
February 17, 2023
|
Optimum arm64 quantized models on Apple Silicon (M1)
|
|
0
|
74
|
February 16, 2023
|
Optimum & T5 for inference
|
|
18
|
2355
|
February 8, 2023
|
AutoModelForCausalLM and Openvino
|
|
5
|
209
|
February 3, 2023
|
Failed to create CUDAExecutionProvider
|
|
4
|
1247
|
January 31, 2023
|
ONNX on GPU memory footprint
|
|
2
|
124
|
January 30, 2023
|
Longformer Optimum ONNX bug: "ValueError: Model requires 3 inputs. Input Feed contains 2"
|
|
1
|
229
|
December 21, 2022
|
Fail: [ONNXRuntimeError] : 1 : FAIL : Deserialize tensor onnx:
|
|
4
|
462
|
December 7, 2022
|
Getting ValueError when exporting model to ONNX using optimum
|
|
16
|
1149
|
November 25, 2022
|
Optimize an ONNX Seq2Seq model
|
|
3
|
708
|
November 17, 2022
|
Use of from_pretrained design pattern
|
|
5
|
305
|
November 3, 2022
|
How to use Pipeline with re-ranker model and ORTForSequenceClassification
|
|
1
|
259
|
November 3, 2022
|
How to use optimum with encoder-decoder models
|
|
1
|
473
|
October 16, 2022
|
Dynamic quantization problems
|
|
4
|
477
|
October 16, 2022
|
Transformers.onnx vs optimum.onnxruntime
|
|
1
|
376
|
September 12, 2022
|
How to optimize ONNX seq2seq model?
|
|
2
|
986
|
August 25, 2022
|
Exporting Optimum Pipeline for Triton
|
|
1
|
431
|
August 20, 2022
|
Regarding Quantizing gpt2-xl, gpt2-large, &c
|
|
2
|
497
|
August 10, 2022
|
Load pytorch trained model via optimum
|
|
5
|
1268
|
August 10, 2022
|
Support for Mpnet models
|
|
2
|
480
|
August 8, 2022
|
What does the decoder with past values means
|
|
1
|
522
|
August 5, 2022
|