About the 🤗 Optimum category
|
|
0
|
1521
|
March 25, 2022
|
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete?
|
|
1
|
538
|
June 28, 2024
|
Optimum - exporting Tensorflow based transformers to openvino
|
|
0
|
54
|
June 27, 2024
|
Is it possible to make the first batch as fast as the subsequent ones?
|
|
1
|
67
|
June 25, 2024
|
Are object detection models supported in optimum?
|
|
7
|
607
|
June 21, 2024
|
UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value
|
|
4
|
1236
|
June 13, 2024
|
Optimum library optimization and quantization fails
|
|
7
|
1000
|
June 9, 2024
|
Not able run all nodes on DML with optimum
|
|
4
|
162
|
June 6, 2024
|
Not able to run on DML with pipeline
|
|
2
|
232
|
June 6, 2024
|
How to quantize and run inference for CLIP using optimum
|
|
1
|
164
|
June 3, 2024
|
Device_map not wokring for ORTModelForSeq2SeqLM - Potential bug?
|
|
1
|
306
|
May 22, 2024
|
AttributeError: OV_ModelForTokenClassificatio' object has no attribute 'modules'
|
|
1
|
181
|
May 21, 2024
|
ONNX only faster at lower sequence lengths
|
|
2
|
200
|
May 21, 2024
|
Quantization GPTQ
|
|
1
|
157
|
May 21, 2024
|
Trocr after onnx quantisation conversion using optimum-cli , im getting this error
|
|
1
|
171
|
May 20, 2024
|
Optimum onnx-gpu not working inside a docker container
|
|
2
|
381
|
May 20, 2024
|
Error exporting T5 model to ONNX with optimum-cli
|
|
3
|
333
|
May 7, 2024
|
Export pretrained MT5 model to ONNX
|
|
5
|
329
|
May 3, 2024
|
Regarding input_ids and labels while grouping texts
|
|
3
|
206
|
April 24, 2024
|
FLOPS computation
|
|
1
|
136
|
April 16, 2024
|
Wandb integration run_clm.py
|
|
3
|
204
|
April 17, 2024
|
Regarding max steps, streaming in language modeling
|
|
3
|
203
|
April 13, 2024
|
"Protobuf parsing failed" when onnxruntime opens a quantized model
|
|
0
|
1126
|
March 25, 2024
|
What is the execution provider of optimum[onnxruntime]?
|
|
0
|
273
|
March 11, 2024
|
When exporting seq2seq models with ONNX, why do we need both decoder_with_past_model.onnx and decoder_model.onnx?
|
|
12
|
3498
|
March 7, 2024
|
Darshan Hiranandani - How do I handle rate limits or throttling imposed by an API?
|
|
0
|
302
|
March 5, 2024
|
Trying to use Transformers.js
|
|
0
|
514
|
March 1, 2024
|
Should pruning shrink model?; adjusting sparsity didn't change inference time
|
|
2
|
518
|
February 29, 2024
|
Cannot export to ONNX with optimum.onnxruntime
|
|
0
|
592
|
February 28, 2024
|
Error when running examples in optimum habana
|
|
1
|
384
|
February 27, 2024
|