🤗Optimum

Topic	Replies	Views	Activity
About the 🤗 Optimum category	0	1537	March 25, 2022
Difference between model.onnx and model.onnx.data	3	17	July 8, 2025
What does the decoder with past values means	2	1958	July 7, 2025
How do I upstream a brand-new hardware backend to 🤗 Transformers/Optimum?	2	28	July 2, 2025
ONNX export failed for Qwen/Qwen3-Embedding-0.6B with "invalid unordered_map<K, T> key"	5	64	June 27, 2025
CUDA OOM when export a large model to ONNX	5	2138	June 26, 2025
[Guide] Quantize LLM CoreML to int8 on Mac ARM (TinyLlama, May 2025, tested workflow & script)	0	47	May 26, 2025
Trying to convert DeepSeek-R1 into onnx	1	63	March 13, 2025
Optimum library optimization and quantization fails	8	1541	February 22, 2025
Paligemma2 onnx export KeyError: "Unknown task: image-text-to-text	4	124	February 11, 2025
Optimum-habana not working!	2	22	February 10, 2025
Incorrect Cross Attention Values from Generate Function of ORTModelForVision2Seq	3	56	February 1, 2025
Inference on models with custom head	1	19	January 28, 2025
Supported models	6	99	January 14, 2025
Qwen/Qwen1.5-7B-Chat RuntimeError: The serialized model is larger than the 2GiB ORTModelForCausalLM	2	415	January 1, 2025
Error when running examples in optimum habana	2	609	October 30, 2024
Question about the infernce flow for optimum exported decoder merged onnx model	4	51	October 11, 2024
Compiling SD1.5 for Neuron with resolution other than 512x512 fails	5	96	September 26, 2024
Error while optimizing seq2seq model using optimum	1	60	September 16, 2024
Neuron StableDiffusion ControlNet Pipeline fails when used with 2 controlnets	4	65	September 11, 2024
How can I export a transformers model into onnx that not supported with optimum yet	9	515	August 30, 2024
Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script?	5	1771	August 26, 2024
Optimum Failed download of jina-embeddings-v2-base-es	4	402	August 19, 2024
Optimum/Neuron: RuntimeError: forward() is missing value for argument 'argument_4'	2	33	August 13, 2024
What value should the sequence_length parameter be when converting to TFLite	0	17	August 10, 2024
How to export a fine-tuned SDXL model?	5	198	August 9, 2024
Make Text Embedding Server compatible	2	252	August 8, 2024
Exporting SegFormer Image Processor to ONNX Format Using "optimum.exporters.onnx.onnx_export_from_model"	0	98	July 30, 2024
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete?	1	648	June 28, 2024
Optimum - exporting Tensorflow based transformers to openvino	0	85	June 27, 2024