Looking for an overview of KLL sketches

ralemy · April 30, 2021, 4:17pm

I am trying to use Deequ library to profile my data, among other things. I can see that AWS has now implemented the KLL algorithm (from Karnin et.al Optimal Quantile Approximation in Streams https://arxiv.org/pdf/1603.05346.pdf)

The paper explains the algorithm, which I am trying to understand, but was wandering perhaps anyone in the community can provide a simpler overview, not about the mechanics of the algorithm (i.e. not so much how it works), but mostly on what the three parameters of the AWS implementation mean (sketch size, shrinksize, and bucket)

I appreciate any help or advice.
Cheers,
Reza

Cosmos218 · April 8, 2024, 12:40pm

I found the background section of those papers provides a good overview:

Topic		Replies	Views
Curl parameters for aws-whisper-large inference end point? Amazon SageMaker	2	1123	October 17, 2022
AWS Deep Learning Containers Amazon SageMaker	0	524	October 6, 2023
Offering a Technical Deep Dive on GRPO/DAPO/Dr. GRPO Algorithms Show and Tell	2	245	May 11, 2025
Deploying Stable Diffusion on s3-Memory issues Amazon SageMaker	0	446	September 15, 2023
Additional loss logging 🤗Transformers	1	643	January 4, 2024

Looking for an overview of KLL sketches

Related topics