The expanded size of the tensor (22528) must match the existing size (1024) at non-singleton dimension 0

weyseing · July 25, 2023, 3:04pm

There is error below when trying to deploy HF model to Amazon SageMaker.

Error:

RuntimeError: The expanded size of the tensor (22528) must match the existing size (1024) at non-singleton dimension 0.  Target sizes: [22528, 8192].  Tensor sizes: [1024, 8192]

SageMaker Instance: ml.g4dn.2xlarge

Code to deploy HF model to Amazon SageMaker:

import os
import json
import sagemaker
import boto3
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

try:
    role = sagemaker.get_execution_role()
except ValueError:
    iam = boto3.client('iam')
    role = iam.get_role(RoleName='sagemaker_execution_role')['Role']['Arn']

hub = {
    'HF_MODEL_ID': 'meta-llama/Llama-2-7b-chat-hf',
    'SM_NUM_GPUS': json.dumps(1),
    'HUGGING_FACE_HUB_TOKEN': <HF_TOKEN>
}

huggingface_model = HuggingFaceModel(
    image_uri=get_huggingface_llm_image_uri("huggingface",version="0.8.2"),
    env=hub,
    role=role
)

predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.g4dn.2xlarge",
    container_startup_health_check_timeout=1800
  )

Topic		Replies	Views
Need help deploying a HF model to AWS Sagemaker Amazon SageMaker	3	151	September 27, 2024
Training model file too large and fail to deploy Amazon SageMaker	3	1377	July 3, 2023
Deploying Mixtral8x7B on AWS Sagemaker from S3 Amazon SageMaker	2	481	June 11, 2024
{"error":"The expanded size of the tensor (524) must match the existing size (514) at non-singleton dimension 1. Target sizes: [1, 524]. Tensor sizes: [1, 514]"} Models	0	152	November 22, 2024
Error when deploying GPT4-Alpaca on Sagemaker via HF model hub Beginners	8	1329	July 11, 2023

The expanded size of the tensor (22528) must match the existing size (1024) at non-singleton dimension 0

Related topics