How are the inputs tokenized when the model is deployed?

Yes, I know it is not possible to predict with inputs longer than 512 tokens, and that is in fact what bothers me, because what I wanted to do is use my own tokenizer at inference time.
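
To make it concrete, here is a rough sketch of what I was hoping to do: run my own tokenizer client-side and truncate the text before calling the endpoint. The model name is only a placeholder for whichever checkpoint matches the deployed model.

from transformers import AutoTokenizer

# Sketch: truncate the input with my own tokenizer before sending it
# to the endpoint (the model name is just a placeholder)
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased-finetuned-sst-2-english")

def truncate_text(text, max_length=512):
    # Encode with truncation, then decode back to a plain string
    # that can be passed as 'inputs' to the endpoint
    ids = tokenizer(text, truncation=True, max_length=max_length)["input_ids"]
    return tokenizer.decode(ids, skip_special_tokens=True)

That said, the suggestion below with the 'truncation' parameter looks like a simpler way around it: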


long_sentence = "...."  # longer than 512 tokens

sentiment_input = {
    'inputs': long_sentence,
    'parameters': {'truncation': True}
}

predictor.predict(sentiment_input)

This suggested solution seems pretty helpful; I didn't know I could customize the request with parameters. Where can I learn more about this customization? I mean, what other parameters can I customize, and where is that documented?
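
For what it's worth, my current guess is that everything under 'parameters' gets forwarded as keyword arguments to the underlying transformers pipeline call, so any kwarg the pipeline's __call__ accepts might work the same way. A local sketch of that assumption (the model name is again a placeholder):

from transformers import pipeline

# If the guess above holds, the endpoint call should behave roughly
# like this local pipeline call (the model name is just a placeholder
# for the deployed model)
classifier = pipeline("sentiment-analysis", model="distilbert-base-uncased-finetuned-sst-2-english")
result = classifier(long_sentence, truncation=True)
print(result)  # e.g. [{'label': ..., 'score': ...}]

Is that understanding correct?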

Thank you very much for your time.