How to convert Speech Encoder Decoder to onnx
|
|
1
|
478
|
January 10, 2024
|
Some nodes were not assigned to the preferred execution providers
|
|
1
|
1338
|
January 10, 2024
|
Can bloom-7b1 be fine tuned using gaudi 1?
|
|
12
|
832
|
January 9, 2024
|
UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value
|
|
3
|
1017
|
January 9, 2024
|
Optimum warnings while quantizing
|
|
0
|
411
|
January 6, 2024
|
FlashAttention-2's 16 bit requirement
|
|
2
|
1007
|
December 26, 2023
|
How to configure ONNX models from Hugging Face to use model options in C++?
|
|
0
|
382
|
November 10, 2023
|
Is a wheel to be released with the 1.14.0 release?
|
|
1
|
331
|
November 7, 2023
|
Donut fine tuning question
|
|
0
|
1146
|
October 16, 2023
|
Optimum export ONNX failure
|
|
0
|
527
|
September 30, 2023
|
Improving Whisper for Inference
|
|
11
|
2758
|
September 20, 2023
|
Order between optimization and quantization
|
|
1
|
423
|
September 19, 2023
|
Optimum-Cli [ Task Manager Error ]
|
|
1
|
532
|
September 18, 2023
|
Custom data preparation for LayoutLM model
|
|
1
|
812
|
September 18, 2023
|
Static quantization of gpt2-style models with ORTQuantizer
|
|
3
|
687
|
September 18, 2023
|
How to Prune Transformer based Model?
|
|
2
|
3211
|
August 25, 2023
|
How to ensure that while running with llama2-70B, we use parallelism?
|
|
11
|
1389
|
August 22, 2023
|
How to load checkpoint shards with gaudi instead of cpu?
|
|
1
|
842
|
August 21, 2023
|
Error while Trying to run inference using gaudi on a finetuned llama2 model using habana repo
|
|
9
|
572
|
August 21, 2023
|
ORT CLI vs. Programmatic
|
|
12
|
1072
|
August 17, 2023
|
4 Bit quantization
|
|
4
|
466
|
August 11, 2023
|
Static quantization of activations for transformers
|
|
2
|
1158
|
August 11, 2023
|
Export a BetterTransformer to ONNX
|
|
3
|
2265
|
August 11, 2023
|
Exporting model wav2vec2 not supported?
|
|
3
|
863
|
August 10, 2023
|
BLIP-2 on Optimum
|
|
4
|
994
|
July 21, 2023
|
No module named 'optimum.neuron'; 'optimum' is not a package
|
|
2
|
1363
|
July 21, 2023
|
Dmesg: read kernel buffer failed: Operation not permitted :- Running gaudi-enabled habana model inference on kubernetes cluster
|
|
1
|
2917
|
July 13, 2023
|
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64)
|
|
2
|
3959
|
July 9, 2023
|
Static Quantization with Own dataset
|
|
3
|
649
|
July 1, 2023
|
Is there a way to include the text_projection and/or embedding normalization in an optimum-optimized CLIPTextModelWithProjection?
|
|
8
|
948
|
July 1, 2023
|