Code Llama Instruct 34B accepts only 4096 tokens on PRO

saideepak · January 11, 2024, 6:43am

The PRO page "Inference for PROs" states that Code Llama Instruct 34B has a context length of 16K. When I ran inference with a PRO account using 4900 tokens, it returned an error saying the token limit is 4096.

Is this expected behavior? If so, shouldn’t it be clearly stated on the PRO page before we subscribe and pay for it?
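To make the mismatch concrete, here is a minimal sketch of the budget check I assume the endpoint performs. The function name `fits` and the rule that prompt tokens plus `max_new_tokens` must stay within the limit are my assumptions, not something the PRO page or the error message confirms; only the numbers 4900, 4096, and 16K come from what I observed.

```python
# Limits taken from this post: 4096 is the value in the error message,
# 16K is the context length listed on the PRO page.
ENFORCED_LIMIT = 4096
ADVERTISED_LIMIT = 16 * 1024


def fits(prompt_tokens: int, max_new_tokens: int, limit: int) -> bool:
    """Hypothetical check: prompt plus requested output must fit in the limit."""
    return prompt_tokens + max_new_tokens <= limit


prompt_tokens = 4900  # size of the request that was rejected

# Under the enforced limit the request is rejected even with no new tokens.
print(fits(prompt_tokens, 0, ENFORCED_LIMIT))

# Under the advertised limit the same request would leave room for output.
print(fits(prompt_tokens, 1024, ADVERTISED_LIMIT))
```

With the advertised 16K limit the request would go through with room to spare, which is why the 4096 error was unexpected.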

Thanks
