Using Trainer at inference time

In terms of efficiency, the Trainer should be perfectly fine for inference. You may want to apply some inference-specific optimisations, though. See this post: Faster and smaller quantized NLP with Hugging Face and ONNX Runtime | by Yufeng Li | Microsoft Azure | Medium