| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the 🤗 Optimum category | 0 | 1041 | March 25, 2022 |
| Potential Memory Leak for ORTModelForCausalLM with TensorRT Provider | 4 | 148 | June 2, 2023 |
| Static Quantization with Own dataset | 2 | 35 | June 1, 2023 |
| How to make optimum make use of all available GPUs? | 7 | 209 | June 1, 2023 |
| ONNX vs. Apache TVM | 0 | 28 | June 1, 2023 |
| Huggingface Optimizer | 2 | 65 | May 25, 2023 |
| Optimum Exporter TFLite error | 4 | 88 | May 18, 2023 |
| Export a BetterTransformer to ONNX | 1 | 125 | May 15, 2023 |
| Custom model export to onnx-runtime | 7 | 150 | May 9, 2023 |
| Optimisation and Quantization of TensorFlow Model | 1 | 101 | May 3, 2023 |
| AttributeError: 'NoneType' object has no attribute 'pad_token' | 1 | 419 | May 3, 2023 |
| Optimum arm64 quantized models on Apple Silicon (M1) | 1 | 222 | May 3, 2023 |
| ONNX Flan-T5 Model OOM on GPU | 1 | 325 | April 13, 2023 |
| Intel Xeon vs AMD EPYC for inference on CPU | 0 | 171 | March 29, 2023 |
| Optimize AND quantize with Optimum | 9 | 792 | March 27, 2023 |
| When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx? | 8 | 404 | March 12, 2023 |
| How to Prune Transformer based Model? | 1 | 488 | March 9, 2023 |
| Optimum vs Accelerate | 5 | 279 | March 2, 2023 |
| CUDA OOM when exporting a large model to ONNX | 3 | 332 | February 17, 2023 |
| How does the ONNX exporter work for GenerationModel with `past_key_value`? | 9 | 377 | February 17, 2023 |
| Optimum & T5 for inference | 18 | 3023 | February 8, 2023 |
| AutoModelForCausalLM and OpenVINO | 5 | 402 | February 3, 2023 |
| Failed to create CUDAExecutionProvider | 4 | 2407 | January 31, 2023 |
| ONNX on GPU memory footprint | 2 | 329 | January 30, 2023 |
| Longformer Optimum ONNX bug: "ValueError: Model requires 3 inputs. Input Feed contains 2" | 1 | 356 | December 21, 2022 |
| Fail: [ONNXRuntimeError] : 1 : FAIL : Deserialize tensor onnx: | 4 | 973 | December 7, 2022 |
| Getting ValueError when exporting model to ONNX using optimum | 16 | 1672 | November 25, 2022 |
| Optimize an ONNX Seq2Seq model | 3 | 881 | November 17, 2022 |
| Use of from_pretrained design pattern | 5 | 385 | November 3, 2022 |
| How to use Pipeline with re-ranker model and ORTForSequenceClassification | 1 | 326 | November 3, 2022 |