How are the inputs tokenized when deploying a model?

Hey @Oigres,

We have a separate section for batch transform in the documentation, which contains a YT video and a sample notebook. The notebook shows how you can create the JSONL file for your batch transform job.

As far as I know, batch transform works like this: SageMaker sends each "line" of the JSONL file as a normal HTTP request to the inference toolkit. That means each line must be a valid JSON document on its own, so `true` should be correct.
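To illustrate, here is a minimal sketch of building such a JSONL file in Python. The `"inputs"` key and the example sentences are assumptions for illustration; the key your endpoint expects depends on your model and inference script.

```python
import json

# Each line of the JSONL file must be a standalone, valid JSON document,
# because batch transform sends each line as its own HTTP request.
records = [
    {"inputs": "I love this product!"},   # hypothetical payload shape
    {"inputs": "The delivery was late."},
]

with open("batch_input.jsonl", "w") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")

# Sanity check: every line parses back as valid JSON on its own.
with open("batch_input.jsonl") as f:
    parsed = [json.loads(line) for line in f]
print(len(parsed))  # 2
```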

Could you share your CloudWatch logs? Maybe they say more about the error.