What is best way to serve huggingface model with API?

jplu · December 17, 2020, 9:31am

Thanks for your question, unfortunately it is currently not possible to integrate the tokenization process along with inference directly inside a saved model. Nevertheless, it is part of our plans to make this available and we are currently rethinking the way the saved models are handled in transformers

Topic		Replies	Views
How can I adapt this code to deploy it in HuggingFace? Beginners	0	243	September 10, 2023
Using huggingface as a hosting / CDN for a pretrained model 🤗Transformers	0	138	November 29, 2024
Is that possible to embed the tokenizer into the model to have it running on GCP using TensorFlow Serving? 🤗Tokenizers	4	3240	January 12, 2023
Help for inference.py code Amazon SageMaker	10	4003	March 8, 2022
Productionizing HuggingFace Transformers? Beginners	1	3170	September 12, 2022

What is best way to serve huggingface model with API?

Related topics