🤗Optimum

Topic	Replies	Views	Activity
About the 🤗 Optimum category	0	1541	March 25, 2022
CUDA OOM when exporting a large model to ONNX	6	2236	July 26, 2025
Difference in the vector generated by the int8 quantized model vs base onnx model	4	46	July 25, 2025
Difference between model.onnx and model.onnx.data	3	239	July 8, 2025
What does the decoder with past values mean	2	1996	July 7, 2025
How do I upstream a brand-new hardware backend to 🤗 Transformers/Optimum?	2	36	July 2, 2025
ONNX export failed for Qwen/Qwen3-Embedding-0.6B with "invalid unordered_map<K, T> key"	5	321	June 27, 2025
[Guide] Quantize LLM CoreML to int8 on Mac ARM (TinyLlama, May 2025, tested workflow & script)	0	160	May 26, 2025
Trying to convert DeepSeek-R1 into onnx	1	109	March 13, 2025
Optimum library optimization and quantization fails	8	1647	February 22, 2025
Paligemma2 onnx export KeyError: "Unknown task: image-text-to-text"	4	185	February 11, 2025
Optimum-habana not working!	2	32	February 10, 2025
Incorrect Cross Attention Values from Generate Function of ORTModelForVision2Seq	3	97	February 1, 2025
Inference on models with custom head	1	28	January 28, 2025
Supported models	6	115	January 14, 2025
Qwen/Qwen1.5-7B-Chat RuntimeError: The serialized model is larger than the 2GiB ORTModelForCausalLM	2	543	January 1, 2025
Error when running examples in optimum habana	2	629	October 30, 2024
Question about the inference flow for optimum exported decoder merged onnx model	4	72	October 11, 2024
Compiling SD1.5 for Neuron with resolution other than 512x512 fails	5	112	September 26, 2024
Error while optimizing seq2seq model using optimum	1	73	September 16, 2024
Neuron StableDiffusion ControlNet Pipeline fails when used with 2 controlnets	4	69	September 11, 2024
How can I export a transformers model into onnx that is not supported with optimum yet	9	683	August 30, 2024
Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script?	5	1796	August 26, 2024
Optimum Failed download of jina-embeddings-v2-base-es	4	474	August 19, 2024
Optimum/Neuron: RuntimeError: forward() is missing value for argument 'argument_4'	2	39	August 13, 2024
What value should the sequence_length parameter be when converting to TFLite	0	30	August 10, 2024
How to export a fine-tuned SDXL model?	5	259	August 9, 2024
Make Text Embedding Server compatible	2	297	August 8, 2024
Exporting SegFormer Image Processor to ONNX Format Using "optimum.exporters.onnx.onnx_export_from_model"	0	123	July 30, 2024
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete?	1	661	June 28, 2024