| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the 🤗 Optimum category | 0 | 1537 | March 25, 2022 |
| How do I upstream a brand-new hardware backend to 🤗 Transformers/Optimum? | 1 | 11 | July 1, 2025 |
| ONNX export failed for Qwen/Qwen3-Embedding-0.6B with "invalid unordered_map<K, T> key" | 5 | 33 | June 27, 2025 |
| CUDA OOM when exporting a large model to ONNX | 5 | 2128 | June 26, 2025 |
| [Guide] Quantize LLM CoreML to int8 on Mac ARM (TinyLlama, May 2025, tested workflow & script) | 0 | 43 | May 26, 2025 |
| Trying to convert DeepSeek-R1 into onnx | 1 | 59 | March 13, 2025 |
| Optimum library optimization and quantization fails | 8 | 1528 | February 22, 2025 |
| Paligemma2 onnx export KeyError: "Unknown task: image-text-to-text" | 4 | 114 | February 11, 2025 |
| Optimum-habana not working! | 2 | 22 | February 10, 2025 |
| Incorrect Cross Attention Values from Generate Function of ORTModelForVision2Seq | 3 | 55 | February 1, 2025 |
| Inference on models with custom head | 1 | 19 | January 28, 2025 |
| Supported models | 6 | 97 | January 14, 2025 |
| Qwen/Qwen1.5-7B-Chat RuntimeError: The serialized model is larger than the 2GiB ORTModelForCausalLM | 2 | 395 | January 1, 2025 |
| Error when running examples in optimum habana | 2 | 599 | October 30, 2024 |
| Question about the inference flow for optimum exported decoder merged onnx model | 4 | 51 | October 11, 2024 |
| Compiling SD1.5 for Neuron with resolution other than 512x512 fails | 5 | 96 | September 26, 2024 |
| Error while optimizing seq2seq model using optimum | 1 | 59 | September 16, 2024 |
| Neuron StableDiffusion ControlNet Pipeline fails when used with 2 controlnets | 4 | 65 | September 11, 2024 |
| How can I export a transformers model into onnx that is not supported with optimum yet | 9 | 498 | August 30, 2024 |
| Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script? | 5 | 1769 | August 26, 2024 |
| Optimum Failed download of jina-embeddings-v2-base-es | 4 | 399 | August 19, 2024 |
| Optimum/Neuron: RuntimeError: forward() is missing value for argument 'argument_4' | 2 | 33 | August 13, 2024 |
| What value should the sequence_length parameter be when converting to TFLite | 0 | 17 | August 10, 2024 |
| How to export a fine-tuned SDXL model? | 5 | 191 | August 9, 2024 |
| Make Text Embedding Server compatible | 2 | 249 | August 8, 2024 |
| Exporting SegFormer Image Processor to ONNX Format Using "optimum.exporters.onnx.onnx_export_from_model" | 0 | 94 | July 30, 2024 |
| How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete? | 1 | 647 | June 28, 2024 |
| Optimum - exporting Tensorflow based transformers to openvino | 0 | 85 | June 27, 2024 |
| Is it possible to make the first batch as fast as the subsequent ones? | 1 | 84 | June 25, 2024 |
| Are object detection models supported in optimum? | 7 | 833 | June 21, 2024 |