How to configure ONNX models from Hugging Face to use model options in C++? | 0 | 354 | November 10, 2023
Is a wheel to be released with the 1.14.0 release? | 1 | 314 | November 7, 2023
Donut fine tuning question | 0 | 1071 | October 16, 2023
Optimum export ONNX failure | 0 | 484 | September 30, 2023
Improving Whisper for Inference | 11 | 2560 | September 20, 2023
Order between optimization and quantization | 1 | 404 | September 19, 2023
Optimum-Cli [ Task Manager Error ] | 1 | 491 | September 18, 2023
Custom data preparation for LayoutLM model | 1 | 741 | September 18, 2023
Static quantization of gpt2-style models with ORTQuantizer | 3 | 637 | September 18, 2023
How to Prune Transformer based Model? | 2 | 2908 | August 25, 2023
How to ensure that while running with llama2-70B, we use parallelism? | 11 | 1335 | August 22, 2023
How to load checkpoint shards with gaudi instead of cpu? | 1 | 814 | August 21, 2023
Error while Trying to run inference using gaudi on a finetuned llama2 model using habana repo | 9 | 549 | August 21, 2023
ORT CLI vs. Programmatic | 12 | 1012 | August 17, 2023
4 Bit quantization | 4 | 441 | August 11, 2023
Static quantization of activations for transformers | 2 | 1108 | August 11, 2023
Export a BetterTransformer to ONNX | 3 | 2183 | August 11, 2023
Exporting model wav2vec2 not supported? | 3 | 815 | August 10, 2023
BLIP-2 on Optimum | 4 | 940 | July 21, 2023
No module named 'optimum.neuron'; 'optimum' is not a package | 2 | 1259 | July 21, 2023
Dmesg: read kernel buffer failed: Operation not permitted :- Running gaudi-enabled habana model inference on kubernetes cluster | 1 | 2811 | July 13, 2023
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64) | 2 | 3850 | July 9, 2023
Static Quantization with Own dataset | 3 | 609 | July 1, 2023
Is there a way to include the text_projection and/or embedding normalization in an optimum-optimized CLIPTextModelWithProjection? | 8 | 870 | July 1, 2023
Support onnx opset 9 for T5 & GPT_neox | 1 | 366 | July 1, 2023
Are object detection models supported in optimum? | 3 | 379 | June 22, 2023
How to use the cache_dir along with optimum-cli export | 5 | 503 | June 16, 2023
ONNX Flan-T5 Model OOM on GPU | 2 | 1797 | June 15, 2023
Potential Memory Leak for ORTModelForCausalLM with TensorRT Provider | 4 | 711 | June 2, 2023
How to make optimum make use of all available GPUs? | 7 | 2442 | June 1, 2023