About the 🤗 Optimum category
|
|
0
|
1145
|
March 25, 2022
|
Improving Whisper for Inference
|
|
11
|
330
|
September 20, 2023
|
Order between optimization and quantization
|
|
1
|
27
|
September 19, 2023
|
Optimum-Cli [ Task Manager Error ]
|
|
1
|
39
|
September 18, 2023
|
Custom data preparation for LayoutLM model
|
|
1
|
62
|
September 18, 2023
|
Static quantization of gpt2-style models with ORTQuantizer
|
|
3
|
120
|
September 18, 2023
|
Optimize AND quantize with Optimum
|
|
10
|
1146
|
August 27, 2023
|
How to Prune Transformer based Model?
|
|
2
|
1088
|
August 25, 2023
|
How to ensure that while running with llama2-70B, we use parallelism?
|
|
11
|
198
|
August 22, 2023
|
How to load checkpoint shards with gaudi instead of cpu?
|
|
1
|
124
|
August 21, 2023
|
Error while Trying to run inference using gaudi on a finetuned llama2 model using habana repo
|
|
9
|
167
|
August 21, 2023
|
ORT CLI vs. Programmatic
|
|
12
|
198
|
August 17, 2023
|
4 Bit quantization
|
|
4
|
121
|
August 11, 2023
|
Static quantization of activations for transformers
|
|
2
|
159
|
August 11, 2023
|
Export a BetterTransformer to ONNX
|
|
3
|
1020
|
August 11, 2023
|
Exporting model wav2vec2 not supported?
|
|
3
|
148
|
August 10, 2023
|
Can bloom-7b1 be fine tuned using gaudi 1?
|
|
10
|
254
|
July 24, 2023
|
BLIP-2 on Optimum
|
|
4
|
201
|
July 21, 2023
|
No module named 'optimum.neuron'; 'optimum' is not a package
|
|
2
|
262
|
July 21, 2023
|
UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value
|
|
1
|
198
|
July 20, 2023
|
Dmesg: read kernel buffer failed: Operation not permitted :- Running gaudi-enabled habana model inference on kubernetes cluster
|
|
1
|
534
|
July 13, 2023
|
InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(int32)) , expected: (tensor(int64)
|
|
2
|
2861
|
July 9, 2023
|
Static Quantization with Own dataset
|
|
3
|
208
|
July 1, 2023
|
Is there a way to include the text_projection and/or embedding normalization in an optimum-optimized CLIPTextModelWithProjection?
|
|
8
|
273
|
July 1, 2023
|
Support onnx opset 9 for T5 & GPT_neox
|
|
1
|
173
|
July 1, 2023
|
Are object detection models supported in optimum?
|
|
3
|
143
|
June 22, 2023
|
When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx?
|
|
9
|
1003
|
June 20, 2023
|
How to use the cache_dir along with optimum-cli export
|
|
5
|
215
|
June 16, 2023
|
ONNX Flan-T5 Model OOM on GPU
|
|
2
|
880
|
June 15, 2023
|
Potential Memory Leak for ORTModelForCausalLM with TensorRT Providor
|
|
4
|
405
|
June 2, 2023
|