About the 🤗 Optimum category
|
|
0
|
1412
|
March 25, 2022
|
ONNX only faster at lower sequence lengths
|
|
0
|
8
|
May 7, 2024
|
Error exporting T5 model to ONNX with optimum-cli
|
|
2
|
46
|
May 7, 2024
|
Export pretrained MT5 model to ONNX
|
|
5
|
62
|
May 3, 2024
|
Regarding input_ids and labels while grouping texts
|
|
3
|
69
|
April 24, 2024
|
AttributeError: OV_ModelForTokenClassificatio' object has no attribute 'modules'
|
|
0
|
33
|
April 20, 2024
|
FLOPS computation
|
|
1
|
59
|
April 16, 2024
|
Wandb integration run_clm.py
|
|
3
|
80
|
April 17, 2024
|
Regarding max steps, streaming in language modeling
|
|
3
|
99
|
April 13, 2024
|
Device_map not wokring for ORTModelForSeq2SeqLM - Potential bug?
|
|
0
|
119
|
March 27, 2024
|
"Protobuf parsing failed" when onnxruntime opens a quantized model
|
|
0
|
496
|
March 25, 2024
|
What is the execution provider of optimum[onnxruntime]?
|
|
0
|
138
|
March 11, 2024
|
Optimum library optimization and quantization fails
|
|
6
|
559
|
March 9, 2024
|
When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx?
|
|
12
|
2898
|
March 7, 2024
|
Darshan Hiranandani - How do I handle rate limits or throttling imposed by an API?
|
|
0
|
135
|
March 5, 2024
|
Trying to use Transformers.js
|
|
0
|
288
|
March 1, 2024
|
Should pruning shrink model?; adjusting sparsity didn't change inference time
|
|
2
|
320
|
February 29, 2024
|
Cannot export to ONNX with optimum.onnxruntime
|
|
0
|
253
|
February 28, 2024
|
Error when running examples in optimum habana
|
|
1
|
216
|
February 27, 2024
|
Optimize AND quantize with Optimum
|
|
11
|
2442
|
February 10, 2024
|
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete?
|
|
0
|
346
|
February 9, 2024
|
Improving Quantization Accuracy for ONNX Models with Optimum
|
|
0
|
292
|
February 8, 2024
|
Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script?
|
|
3
|
874
|
January 30, 2024
|
Audio classifier in TFLite format
|
|
0
|
305
|
January 25, 2024
|
Need advice for implementing Greedy Search for ORTModelForSeq2SeqLM
|
|
2
|
321
|
January 17, 2024
|
Optimum roberta base quantization model recall drop 10%
|
|
5
|
359
|
January 15, 2024
|
packaging.version.InvalidVersion: Invalid version: ' '
|
|
1
|
771
|
January 10, 2024
|
How to convert Speech Encoder Decoder to onnx
|
|
1
|
437
|
January 10, 2024
|
Some nodes were not assigned to the preferred execution providers
|
|
1
|
1251
|
January 10, 2024
|
Can bloom-7b1 be fine tuned using gaudi 1?
|
|
12
|
822
|
January 9, 2024
|