About the 🤗 Optimum category
|
|
0
|
1537
|
March 25, 2022
|
Difference between model.onnx and model.onnx.data
|
|
3
|
26
|
July 8, 2025
|
What does the decoder with past values means
|
|
2
|
1960
|
July 7, 2025
|
How do I upstream a brand-new hardware backend to 🤗 Transformers/Optimum?
|
|
2
|
28
|
July 2, 2025
|
ONNX export failed for Qwen/Qwen3-Embedding-0.6B with "invalid unordered_map<K, T> key"
|
|
5
|
82
|
June 27, 2025
|
CUDA OOM when export a large model to ONNX
|
|
5
|
2141
|
June 26, 2025
|
[Guide] Quantize LLM CoreML to int8 on Mac ARM (TinyLlama, May 2025, tested workflow & script)
|
|
0
|
56
|
May 26, 2025
|
Trying to convert DeepSeek-R1 into onnx
|
|
1
|
68
|
March 13, 2025
|
Optimum library optimization and quantization fails
|
|
8
|
1552
|
February 22, 2025
|
Paligemma2 onnx export KeyError: "Unknown task: image-text-to-text
|
|
4
|
128
|
February 11, 2025
|
Optimum-habana not working!
|
|
2
|
22
|
February 10, 2025
|
Incorrect Cross Attention Values from Generate Function of ORTModelForVision2Seq
|
|
3
|
57
|
February 1, 2025
|
Inference on models with custom head
|
|
1
|
20
|
January 28, 2025
|
Supported models
|
|
6
|
99
|
January 14, 2025
|
Qwen/Qwen1.5-7B-Chat RuntimeError: The serialized model is larger than the 2GiB ORTModelForCausalLM
|
|
2
|
426
|
January 1, 2025
|
Error when running examples in optimum habana
|
|
2
|
611
|
October 30, 2024
|
Question about the infernce flow for optimum exported decoder merged onnx model
|
|
4
|
51
|
October 11, 2024
|
Compiling SD1.5 for Neuron with resolution other than 512x512 fails
|
|
5
|
96
|
September 26, 2024
|
Error while optimizing seq2seq model using optimum
|
|
1
|
61
|
September 16, 2024
|
Neuron StableDiffusion ControlNet Pipeline fails when used with 2 controlnets
|
|
4
|
65
|
September 11, 2024
|
How can I export a transformers model into onnx that not supported with optimum yet
|
|
9
|
521
|
August 30, 2024
|
Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script?
|
|
5
|
1775
|
August 26, 2024
|
Optimum Failed download of jina-embeddings-v2-base-es
|
|
4
|
403
|
August 19, 2024
|
Optimum/Neuron: RuntimeError: forward() is missing value for argument 'argument_4'
|
|
2
|
33
|
August 13, 2024
|
What value should the sequence_length parameter be when converting to TFLite
|
|
0
|
19
|
August 10, 2024
|
How to export a fine-tuned SDXL model?
|
|
5
|
201
|
August 9, 2024
|
Make Text Embedding Server compatible
|
|
2
|
254
|
August 8, 2024
|
Exporting SegFormer Image Processor to ONNX Format Using "optimum.exporters.onnx.onnx_export_from_model"
|
|
0
|
98
|
July 30, 2024
|
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete?
|
|
1
|
648
|
June 28, 2024
|
Optimum - exporting Tensorflow based transformers to openvino
|
|
0
|
85
|
June 27, 2024
|