Containerizing Huggingface Transformers for GPU inference with Docker and FastAPI

Hi everyone!

A while ago I was searching the HF forum and the web for how to build a GPU Docker image and deploy it on cloud services like AWS.
I couldn't find a comprehensive guide showing how to containerize and deploy transformers for GPU inference.

So I decided to write one myself and publish it, in the hope that it helps others who want to create a GPU Docker image with HF transformers and deploy it.

Just wanted to add the resource here.

GitHub: GitHub - ramsrigouthamg/GPU_Docker_Deployment_HuggingFace_Summarization: Huggingface inference with GPU Docker on AWS
Detailed YouTube video: Containerizing Huggingface Transformers for GPU inference with Docker and FastAPI on AWS - YouTube
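
To give a flavor of the approach, here is a minimal Dockerfile sketch for this kind of setup. This is not the exact file from the repo: the base image tag, package versions, the `app.py` filename, and the port are all assumptions for illustration; see the GitHub link above for the actual configuration.

```dockerfile
# Hypothetical sketch: CUDA-enabled base image so PyTorch can see the GPU.
# Base image tag and package choices are assumptions, not the repo's exact setup.
FROM nvidia/cuda:11.8.0-cudnn8-runtime-ubuntu22.04

RUN apt-get update && apt-get install -y python3 python3-pip \
    && rm -rf /var/lib/apt/lists/*

# Install the inference and serving stack
RUN pip3 install torch transformers fastapi uvicorn

# app.py is assumed to define a FastAPI app (e.g. with a summarization pipeline)
WORKDIR /app
COPY app.py .

# Serve the FastAPI app with uvicorn
CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "8000"]
```

When running the container, remember to pass the GPU through, e.g. `docker run --gpus all -p 8000:8000 <image>`, otherwise CUDA won't be visible inside the container.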

Happy learning!

Cheers,
Ramsri
