🤗 Optimum

Topic	Replies	Views	Activity
How to convert Speech Encoder Decoder to onnx	1	478	January 10, 2024
Some nodes were not assigned to the preferred execution providers	1	1338	January 10, 2024
Can bloom-7b1 be fine tuned using gaudi 1?	12	832	January 9, 2024
UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value	3	1017	January 9, 2024
Optimum warnings while quantizing	0	411	January 6, 2024
FlashAttention-2's 16 bit requirement	2	1007	December 26, 2023
How to configure ONNX models from Hugging Face to use model options in C++?	0	382	November 10, 2023
Is a wheel to be released with the 1.14.0 release?	1	331	November 7, 2023
Donut fine tuning question	0	1146	October 16, 2023
Optimum export ONNX failure	0	527	September 30, 2023
Improving Whisper for Inference	11	2758	September 20, 2023
Order between optimization and quantization	1	423	September 19, 2023
Optimum-Cli [ Task Manager Error ]	1	532	September 18, 2023
Custom data preparation for LayoutLM model	1	812	September 18, 2023
Static quantization of gpt2-style models with ORTQuantizer	3	687	September 18, 2023
How to Prune Transformer based Model?	2	3211	August 25, 2023
How to ensure that while running with llama2-70B, we use parallelism?	11	1389	August 22, 2023
How to load checkpoint shards with gaudi instead of cpu?	1	842	August 21, 2023
Error while Trying to run inference using gaudi on a finetuned llama2 model using habana repo	9	572	August 21, 2023
ORT CLI vs. Programmatic	12	1072	August 17, 2023
4 Bit quantization	4	466	August 11, 2023
Static quantization of activations for transformers	2	1158	August 11, 2023
Export a BetterTransformer to ONNX	3	2265	August 11, 2023
Exporting model wav2vec2 not supported?	3	863	August 10, 2023
BLIP-2 on Optimum	4	994	July 21, 2023
No module named 'optimum.neuron'; 'optimum' is not a package	2	1363	July 21, 2023
Dmesg: read kernel buffer failed: Operation not permitted :- Running gaudi-enabled habana model inference on kubernetes cluster	1	2917	July 13, 2023
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64)	2	3959	July 9, 2023
Static Quantization with Own dataset	3	649	July 1, 2023
Is there a way to include the text_projection and/or embedding normalization in an optimum-optimized CLIPTextModelWithProjection?	8	948	July 1, 2023