Has anyone here deployed a transformers model on Google Cloud using AI Platform?

farazk86 · January 15, 2021, 12:44am

Hi,

I have a fine-tuned distilgpt2 model that I want to deploy using GCP AI Platform.

I’ve followed all the documentation for deploying a custom prediction routine on GCP, but when creating the model version I get the error:

Create Version failed. Bad model detected with error: Model requires more memory than allowed. Please try to decrease the model size and re-deploy.

Here is my setup.py file:

from setuptools import setup

setup(
    name="generator_package",
    version="0.2",
    include_package_data=True,
    scripts=["generator_class.py"],
    install_requires=['transformers==2.8.0']
)

I then create a model version using:

gcloud beta ai-platform versions create v1 --model my_model \
 --origin=gs://my_bucket/model/ \
 --python-version=3.7 \
 --runtime-version=2.3 \
 --package-uris=gs://my_bucket/packages/gpt2-0.1.tar.gz,gs://cloud-ai-pytorch/torch-1.3.1+cpu-cp37-cp37m-linux_x86_64.whl \
 --prediction-class=model_prediction.CustomModelPrediction
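
For reference, the class named by --prediction-class is expected to follow the custom prediction routine interface: a predict(instances, **kwargs) method and a from_path(model_dir) classmethod. The following is only a minimal sketch of that shape, not the code from this post. The generate_fn argument, the generate_text helper, and max_length=50 are hypothetical choices made for illustration, and the transformers import is deferred to from_path so the class can be exercised locally with a stub.

```python
class CustomModelPrediction(object):
    """Sketch of a custom prediction routine class (illustrative only)."""

    def __init__(self, generate_fn):
        # generate_fn is a hypothetical callable: prompt string -> generated string.
        self._generate_fn = generate_fn

    def predict(self, instances, **kwargs):
        # The service passes a list of instances and expects a
        # JSON-serializable list back, one result per instance.
        return [self._generate_fn(prompt) for prompt in instances]

    @classmethod
    def from_path(cls, model_dir):
        # Imported here (assumption: transformers and torch are installed
        # in the serving environment) so the class itself stays importable
        # without them.
        from transformers import GPT2LMHeadModel, GPT2Tokenizer

        tokenizer = GPT2Tokenizer.from_pretrained(model_dir)
        model = GPT2LMHeadModel.from_pretrained(model_dir)
        model.eval()

        def generate_text(prompt):
            input_ids = tokenizer.encode(prompt, return_tensors="pt")
            output_ids = model.generate(input_ids, max_length=50)
            return tokenizer.decode(output_ids[0], skip_special_tokens=True)

        return cls(generate_text)
```

Calling predict with a stub, for example CustomModelPrediction(lambda p: p.upper()).predict(["hi"]), is a cheap way to check the request/response shape before uploading anything.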

I have tried every suggested route and can’t get this to work; I’m still getting the above error. I’m using the smallest GPT-2 model and am well within the memory limit.
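
One way to put a number on "well within memory" is to total the size of the files in the model directory before uploading it. The sketch below does only that. The function names and the limit_mb parameter are hypothetical, and the actual limit depends on the machine type, so it has to be taken from the AI Platform documentation rather than from this snippet. On-disk size is also only a rough proxy: the error is about memory, and the error message does not say what the service counts toward it (the extra packages passed in --package-uris may matter too).

```python
import os


def dir_size_bytes(root):
    """Return the total size in bytes of all regular files under root."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks so linked files are not counted twice.
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total


def fits_within(root, limit_mb):
    """True if the files under root total at most limit_mb mebibytes."""
    return dir_size_bytes(root) <= limit_mb * 1024 * 1024
```

For a local copy of gs://my_bucket/model/ in a directory called model, fits_within("model", limit_mb) with the documented limit gives a first check; if it passes and the error persists, the dependencies are the next thing to look at.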

Can anyone who has successfully deployed to GCP please give some insight here?

Thank you

hadrienj · July 10, 2021, 2:24pm

Hi @farazk86,

Any updates about this? Did you manage to use AI platform to serve your model’s predictions?

farazk86 · July 11, 2021, 11:07am

Unfortunately, I could not. There were too many issues and I eventually gave up on the project.
