Novel, provably optimal, lossy variant of speculative decoding

vivien · September 18, 2023, 8:02pm

Hi!

Speculative decoding is a hot topic right now!

Here is a blog post I wrote proposing mentored decoding, a new variant of speculative decoding. It maximizes the probability to accept the draft tokens while maintaining the Kullback-Leibler divergence between the resulting distribution and the target distribution below a constant D. Speculative decoding can be seen as a special case of mentored decoding for D = 0.

Feedback welcome!

Blog post
Summary in a Twitter thread

Thanks to @joaogante who got me interested in speculative decoding with his blog post.

Jung · December 9, 2023, 1:59am

I am a bit late, but want to praise that you wrote an excellent blog, which is highly appreciate to many researchers …

Would love to follow every blog you write!

vivien · January 11, 2024, 8:17pm

Hi @Jung. Many thanks, that’s very kind of you. When I publish a new blog post (quite soon hopefully), I’ll mention it on Twitter, so you can follow me (@vivien000000) there to be notified.

Topic		Replies	Views
Speculative Decoding: How to verify multiple tokens in a single forward pass? Beginners	0	339	January 4, 2024
Hugging Face Reads - 01/2021 - Sparsity and Pruning Research	14	7487	June 3, 2025
Custom Decoding Strategy Beginners	0	458	December 6, 2023
Constrained decoding based on position 🤗Transformers	0	36	October 4, 2024
Special tokens and inference Intermediate	0	333	November 16, 2020

Novel, provably optimal, lossy variant of speculative decoding

Related topics