Llama3 8b instruct not answering question

swtb · May 8, 2024, 8:54am

What I meant was that on the module card they set that to be the end of sequence token within the models configuration (NOT in the prompt)

The model then generates this token itself and stops generating.

The logic is basically that the model keeps generating new words based on the previous words until it sees (generates) the end of sequence token OR it hits the token limit which I think defaults to 256 in this case (though I could be wrong on the exact number)

I’m unsure as to whether sagemaker looks after this or if you will need to set it yourself.

Topic		Replies	Views
Truncated un-finished response after deploying hugging-face models Amazon SageMaker	0	381	January 19, 2024
Sagemaker model generates incomplete responses (or even completely random output) Amazon SageMaker	0	185	May 23, 2024
Payload format for LeoLM/leo-mistral-hessianai-7b-chat Sagemaker Endpoint Amazon SageMaker	2	669	October 20, 2023
AWS Sagemaker doesn't return the full response Amazon SageMaker	1	131	July 17, 2024
When deployed meta-llama/Llama-2-7b-chat-hf on sagemaker, it resulted in complete hallunciations Amazon SageMaker	0	299	March 11, 2024

Llama3 8b instruct not answering question

Related topics