Batch Transform with Custom Infrastructure

jparmet · August 1, 2023, 3:39pm

Hi,

I’m attempting to deploy a model that was trained with PyTorch but saved through Hugging Face, and that uses a custom class extending PreTrainedModel:

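from transformers import BertConfig, PreTrainedModel

# Custom config; model_type is the string stored in config.json
class BertClassifierConfig(BertConfig):
    model_type = "BertClassifier"
    problem_type = "single_label_classification"

# Custom model built on PreTrainedModel and tied to the config above
class BertClassifier(PreTrainedModel):
    config_class = BertClassifierConfig

    def __init__(self, labels, dropout=0.1):
        ...

    def forward(self, input_id, mask):
        ...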

I’m saving this model to Hugging Face and attempting to deploy it with Batch Transform in SageMaker, but am having problems. Specifically, I get KeyError: 'BertClassifier', which seems to be because the custom architecture isn’t defined.
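
One way to narrow this down: the Auto* classes choose a config class from the model_type field in the saved config.json, so a value that is not a built-in type, with no auto_map entry pointing at custom code, would be expected to fail the lookup with exactly this kind of error. Below is a minimal sketch, assuming the model files have been downloaded or unpacked to a local directory. describe_saved_config is a hypothetical helper name, not part of any library.

```python
import json
from pathlib import Path


def describe_saved_config(model_dir):
    """Return the config.json fields that decide which classes get loaded.

    model_dir is assumed to be a local directory holding the saved model files.
    """
    config = json.loads((Path(model_dir) / "config.json").read_text())
    return {
        # Key that the Auto* classes look up (here: "BertClassifier")
        "model_type": config.get("model_type"),
        # Class names recorded when the model was saved
        "architectures": config.get("architectures"),
        # Present only if the custom classes were registered for auto loading
        "auto_map": config.get("auto_map"),
    }
```

If model_type comes back as "BertClassifier" and auto_map is None, the saved files carry no pointer to the custom class, which would be consistent with the KeyError.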

Is there an easy way to address this problem? I’ve found a few solutions:

1. Creating a custom inference.py. I have no Docker experience, so this could be pretty rough, and I haven’t found any documentation or tutorials on creating an image that extends the HF image, only the PyTorch one.
2. Reconfiguring my architecture to extend nn.Module, skipping Hugging Face, and using PyTorch-based batch transform. (Does this still require a Docker image? Unsure.)
3. Reconfiguring the model so it doesn’t use a custom architecture. I haven’t been able to get this to match the performance of the custom architecture, so this is really not preferred.
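
For the first option, a rough sketch of what a custom inference.py could look like is below. It relies on the handler names that the SageMaker Hugging Face inference toolkit looks for (model_fn, input_fn, predict_fn, output_fn). Everything else is an assumption for illustration: the module name modeling_bert_classifier is hypothetical and would need to be shipped alongside this file, the tokenizer is assumed to be saved with the model, from_pretrained is assumed to work with the custom constructor, and forward is assumed to return a logits tensor.

```python
import json


def model_fn(model_dir):
    """Load the custom class directly instead of going through the Auto* lookup."""
    # Heavy imports stay inside the handlers so this module imports without them.
    from transformers import BertTokenizerFast
    # Hypothetical module holding BertClassifier and BertClassifierConfig.
    from modeling_bert_classifier import BertClassifier

    model = BertClassifier.from_pretrained(model_dir)
    model.eval()
    tokenizer = BertTokenizerFast.from_pretrained(model_dir)
    return model, tokenizer


def input_fn(input_data, content_type):
    """Parse a JSON body of the form {"inputs": "text"} or {"inputs": ["a", "b"]}."""
    if content_type != "application/json":
        raise ValueError(f"Unsupported content type: {content_type}")
    inputs = json.loads(input_data)["inputs"]
    if isinstance(inputs, str):
        inputs = [inputs]  # normalize a single string to a batch of one
    return {"inputs": inputs}


def predict_fn(data, model_and_tokenizer):
    """Tokenize the batch and run the forward(input_id, mask) shown in the post."""
    import torch

    model, tokenizer = model_and_tokenizer
    enc = tokenizer(data["inputs"], padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(enc["input_ids"], enc["attention_mask"])  # assumed to be logits
    return {"predictions": logits.argmax(dim=-1).tolist()}


def output_fn(prediction, accept):
    """Serialize the prediction dict as JSON."""
    return json.dumps(prediction)
```

As far as I know, the Hugging Face containers pick this file up from code/inference.py inside model.tar.gz, so this route should not require building a custom Docker image.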

Appreciate any and all input. I’m very new to SageMaker and have only a few months’ experience with Hugging Face!
