Hi @miOmiO - the text length limitation could be an issue here. Note that the limit refers to the number of tokens, not the number of words. Because BERT models generally use subword tokenization, a single word can be split into two or more tokens. That is why you can still hit the limit even after reducing the input to 460 words.
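You can see the word-to-token blow-up directly with the tokenizer. This is just a sketch assuming a standard transformers tokenizer with `bert-base-uncased` as a placeholder model name; adapt it to the model you actually use:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # swap in your model

# A single word may come back as several subword pieces, e.g. ['token', '##ization'];
# the exact split depends on the model's vocabulary.
print(tokenizer.tokenize("tokenization"))
```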
To test this, you could run the model row by row and check whether the examples that fail are the same ones that fail in your batch job. If it is indeed the number of tokens that causes the model to fail, you should see an error message like "... sequence length is longer than the specified maximum sequence length for this model ..."
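A rough sketch of that check, assuming your inputs live in a Python list called `texts` (both the list name and the model name are placeholders):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # swap in your model

for i, text in enumerate(texts):  # `texts` = your list of input strings
    n_tokens = len(tokenizer(text)["input_ids"])  # includes [CLS] and [SEP]
    if n_tokens > tokenizer.model_max_length:     # usually 512 for BERT
        print(f"row {i}: {n_tokens} tokens > {tokenizer.model_max_length}")
```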
If this is indeed the source of the error, the easiest fix is usually to truncate the token sequence after tokenization (rather than cutting down the number of words before tokenization).
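With a transformers tokenizer that is a single flag at encoding time. Reusing the `tokenizer` and `texts` from the snippet above:

```python
encoded = tokenizer(
    texts,
    truncation=True,       # cut token sequences at max_length instead of failing later
    max_length=512,        # BERT's usual limit, counting [CLS] and [SEP]
    padding=True,
    return_tensors="pt",
)
```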
Hope that helps.