Predict function ignore parameters

AliYZ · January 28, 2022, 10:15am

Hey,
I’m trying to deploy a Hugging Face Model (GPT-Neo) on the SageMaker endpoint. I followed the official example and this forum. but it seems that the generate function is totally ignoring my parameters (it generates just one word despite setting the min length to 10000!) Any idea what is wrong?

My code:

from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()
# Hub Model configuration. https://huggingface.co/models
hub = {
	'HF_MODEL_ID':'EleutherAI/gpt-neo-1.3B',
	'HF_TASK':'text-generation'
}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
	transformers_version='4.6.1',
	pytorch_version='1.7.1',
	py_version='py36',
	env=hub,
	role=role, 
)

# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
	initial_instance_count=1, # number of instances
	instance_type='ml.g4dn.xlarge' # ec2 instance type
)

prompt = "Some prompt"

gen_tex = predictor.predict({
	"inputs": prompt,
    "parameters" : {"min_length":10,}
})

print(gen_tex[0]['generated_text'])

philschmid · January 28, 2022, 10:45am

Could you please update to the latest version? which would be transformers_version="4.12" and pytorch_version="1.9" and test it again?

What is the output of gen_tex[0]['generated_text']? Is the , in the “parameters” on purpose?

AliYZ · January 28, 2022, 11:20am

Updated to the latest version but it still ignore the parameter:

predictor.predict({
    'inputs': "Can you please let us know more details about your",
    'parameters': {"min_length":1000}
})

output:

[{'generated_text': 'Can you please let us know more details about your account?\n\nHello,\nI am interested in the above-mentioned company and I have read some very interesting articles about it. I am interested in starting the work. Please let me know if'}]```

AliYZ · January 28, 2022, 11:24am

I guess it’s only for min_length since it works fine for max_length=3
output:

[{'generated_text': 'Can you please let us know more details about your research'}]

philschmid · January 28, 2022, 12:24pm

And when you run model.generate with your parameter in a colab or other environment with the same Transformers version it works?

AliYZ · January 28, 2022, 12:37pm

Yes, it works fine when I use it locally with from_pretrained model.

marshmellow77 · January 28, 2022, 12:57pm

Hi Ali, what happens if you set the min_length and the max_length parameters explicitly? I’m asking because I believe the text generator uses the max_length parameter from the model configuration if you don’t set it explicitly. And if the max_length parameter in the config file is smaller than you min_length the output would be truncated. So, just wondering if setting both (e.g. min_length=1000 and max_length=2000) helps?

AliYZ · January 28, 2022, 1:29pm

Thank you both for your help. That solved the problem @marshmellow77.

marshmellow77 · January 28, 2022, 1:46pm

Awesome, glad it worked. It’d be great if you could mark this thread as Answered/Solved - it would make it easier and quicker in the future for other users with the same problem to find the solution

Cheers
Heiko

Topic		Replies	Views
Inference Hyperparameters Amazon SageMaker	29	4831	October 8, 2021
Deploying Open AI's whisper on Sagemaker Amazon SageMaker	54	16181	April 12, 2024
Volume Size Parameter in HuggingFace Model Class Amazon SageMaker	1	685	February 29, 2024
How to pass `min_length` parameter in generate? 🤗Transformers	1	339	September 29, 2023
Truncated un-finished response after deploying hugging-face models Amazon SageMaker	0	379	January 19, 2024

Predict function ignore parameters

Related topics