Hey,
I’m trying to deploy a Hugging Face model (GPT-Neo) on a SageMaker endpoint. I followed the official example and this forum, but it seems the generate function is completely ignoring my parameters (it generates just one word despite setting min_length to 10000!). Any idea what is wrong?
My code:
```python
from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()

# Hub model configuration. https://huggingface.co/models
hub = {
    'HF_MODEL_ID': 'EleutherAI/gpt-neo-1.3B',
    'HF_TASK': 'text-generation'
}

# create Hugging Face Model class
huggingface_model = HuggingFaceModel(
    transformers_version='4.6.1',
    pytorch_version='1.7.1',
    py_version='py36',
    env=hub,
    role=role,
)

# deploy model to a SageMaker inference endpoint
predictor = huggingface_model.deploy(
    initial_instance_count=1,       # number of instances
    instance_type='ml.g4dn.xlarge'  # EC2 instance type
)

prompt = "Some prompt"
gen_text = predictor.predict({
    "inputs": prompt,
    "parameters": {"min_length": 10}
})
print(gen_text[0]['generated_text'])
```
I updated to the latest version, but it still ignores the parameter:
```python
predictor.predict({
    'inputs': "Can you please let us know more details about your",
    'parameters': {"min_length": 1000}
})
```
output:
```
[{'generated_text': 'Can you please let us know more details about your account?\n\nHello,\nI am interested in the above-mentioned company and I have read some very interesting articles about it. I am interested in starting the work. Please let me know if'}]
```
Hi Ali, what happens if you set the min_length and max_length parameters explicitly? I’m asking because I believe the text generator falls back to the max_length value from the model configuration if you don’t set it explicitly, and if that configured max_length is smaller than your min_length, the output gets truncated. So, just wondering if setting both (e.g. min_length=1000 and max_length=2000) helps? Something like the snippet below.
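To be concrete, here is a minimal sketch using the same predictor and example prompt from your snippet above; I haven’t verified the exact defaults for your container version, so treat the values as illustrative:

```python
# pass both bounds explicitly so generation doesn't fall back
# to whatever max_length is baked into the model config
predictor.predict({
    'inputs': "Can you please let us know more details about your",
    'parameters': {"min_length": 1000, "max_length": 2000}
})
```

And if you want to see which default the model ships with, you can inspect the hub config locally (this assumes the pipeline falls back to config.max_length, which I believe is the case but haven’t confirmed for transformers 4.6.1):

```python
from transformers import AutoConfig

# max_length here is the generation default stored in the config
config = AutoConfig.from_pretrained("EleutherAI/gpt-neo-1.3B")
print(config.max_length)
```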
Awesome, glad it worked! It’d be great if you could mark this thread as Answered/Solved - it would make it easier and quicker for other users with the same problem to find the solution in the future.