|
Need advice for implementing Greedy Search for ORTModelForSeq2SeqLM
|
|
1
|
673
|
January 17, 2024
|
|
Optimum roberta base quantization model recall drop 10%
|
|
5
|
511
|
January 15, 2024
|
|
packaging.version.InvalidVersion: Invalid version: ' '
|
|
1
|
1489
|
January 10, 2024
|
|
How to convert Speech Encoder Decoder to onnx
|
|
1
|
988
|
January 10, 2024
|
|
Some nodes were not assigned to the preferred execution providers
|
|
1
|
3678
|
January 10, 2024
|
|
Can bloom-7b1 be fine tuned using gaudi 1?
|
|
11
|
968
|
January 9, 2024
|
|
Optimum warnings while quantizing
|
|
0
|
626
|
January 6, 2024
|
|
FlashAttention-2's 16 bit requirement
|
|
2
|
2711
|
December 26, 2023
|
|
How to configure ONNX models from Hugging Face to use model options in C++?
|
|
0
|
507
|
November 10, 2023
|
|
Is a wheel to be released with the 1.14.0 release?
|
|
1
|
395
|
November 7, 2023
|
|
Donut fine tuning question
|
|
0
|
1705
|
October 16, 2023
|
|
Optimum export ONNX failure
|
|
0
|
757
|
September 30, 2023
|
|
Improving Whisper for Inference
|
|
11
|
4135
|
September 20, 2023
|
|
Order between optimization and quantization
|
|
1
|
543
|
September 19, 2023
|
|
Optimum-Cli [ Task Manager Error ]
|
|
1
|
792
|
September 18, 2023
|
|
Custom data preparation for LayoutLM model
|
|
1
|
1159
|
September 18, 2023
|
|
Static quantization of gpt2-style models with ORTQuantizer
|
|
3
|
970
|
September 18, 2023
|
|
How to Prune Transformer based Model?
|
|
2
|
6034
|
August 25, 2023
|
|
How to ensure that while running with llama2-70B, we use parallelism?
|
|
11
|
1683
|
August 22, 2023
|
|
How to load checkpoint shards with gaudi instead of cpu?
|
|
1
|
989
|
August 21, 2023
|
|
Error while Trying to run inference using gaudi on a finetuned llama2 model using habana repo
|
|
9
|
694
|
August 21, 2023
|
|
ORT CLI vs. Programmatic
|
|
12
|
1419
|
August 17, 2023
|
|
4 Bit quantization
|
|
4
|
611
|
August 11, 2023
|
|
Static quantization of activations for transformers
|
|
2
|
1672
|
August 11, 2023
|
|
Export a BetterTransformer to ONNX
|
|
3
|
2987
|
August 11, 2023
|
|
Exporting model wav2vec2 not supported?
|
|
3
|
1326
|
August 10, 2023
|
|
BLIP-2 on Optimum
|
|
4
|
1374
|
July 21, 2023
|
|
No module named 'optimum.neuron'; 'optimum' is not a package
|
|
2
|
2292
|
July 21, 2023
|
|
Dmesg: read kernel buffer failed: Operation not permitted :- Running gaudi-enabled habana model inference on kubernetes cluster
|
|
1
|
3358
|
July 13, 2023
|
|
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64)
|
|
2
|
4959
|
July 9, 2023
|