| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the 🤗 Optimum category | 0 | 1360 | March 25, 2022 |
| Device_map not working for ORTModelForSeq2SeqLM - Potential bug? | 0 | 35 | March 27, 2024 |
| "Protobuf parsing failed" when onnxruntime opens a quantized model | 0 | 41 | March 25, 2024 |
| What is the execution provider of optimum[onnxruntime]? | 0 | 60 | March 11, 2024 |
| Optimum library optimization and quantization fails | 6 | 348 | March 9, 2024 |
| When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx? | 12 | 2506 | March 7, 2024 |
| Darshan Hiranandani - How do I handle rate limits or throttling imposed by an API? | 0 | 55 | March 5, 2024 |
| Trying to use Transformers.js | 0 | 131 | March 1, 2024 |
| Should pruning shrink model?; adjusting sparsity didn't change inference time | 2 | 169 | February 29, 2024 |
| Cannot export to ONNX with optimum.onnxruntime | 0 | 95 | February 28, 2024 |
| Error when running examples in optimum habana | 1 | 135 | February 27, 2024 |
| Optimize AND quantize with Optimum | 11 | 2182 | February 10, 2024 |
| How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete? | 0 | 224 | February 9, 2024 |
| Improving Quantization Accuracy for ONNX Models with Optimum | 0 | 175 | February 8, 2024 |
| Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script? | 3 | 564 | January 30, 2024 |
| Audio classifier in TFLite format | 0 | 194 | January 25, 2024 |
| Need advice for implementing Greedy Search for ORTModelForSeq2SeqLM | 2 | 205 | January 17, 2024 |
| Optimum roberta base quantization model recall drop 10% | 5 | 294 | January 15, 2024 |
| packaging.version.InvalidVersion: Invalid version: ' ' | 1 | 594 | January 10, 2024 |
| How to convert Speech Encoder Decoder to onnx | 1 | 330 | January 10, 2024 |
| Some nodes were not assigned to the preferred execution providers | 1 | 988 | January 10, 2024 |
| Can bloom-7b1 be fine tuned using gaudi 1? | 12 | 745 | January 9, 2024 |
| UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value | 3 | 874 | January 9, 2024 |
| Optimum warnings while quantizing | 0 | 291 | January 6, 2024 |
| FlashAttention-2's 16 bit requirement | 2 | 540 | December 26, 2023 |
| How to configure ONNX models from Hugging Face to use model options in C++? | 0 | 289 | November 10, 2023 |
| Is a wheel to be released with the 1.14.0 release? | 1 | 270 | November 7, 2023 |
| Donut fine tuning question | 0 | 966 | October 16, 2023 |
| Optimum export ONNX failure | 0 | 429 | September 30, 2023 |
| Improving Whisper for Inference | 11 | 2301 | September 20, 2023 |