Constraining an LLM output to match a regular expression

vivien · February 25, 2024, 11:04pm

Hi! I’m happy to share a new Community Blog Post introducing two algorithms that guarantee that an LLM output will match an arbitrary regex: https://huggingface.co/blog/vivien/llm-decoding-with-regex-constraints.

It comes with a detailed notebook and a technical appendix with the formal description of the algorithms and their correctness proof.

If you’re looking for a TL;DR, please have a look at this Twitter thread: https://x.com/vivien000000/status/1760933100798767314?s=20

Topic		Replies	Views
FineTune LLM for regex Intermediate	3	2164	April 21, 2024
Evaluate fine-tuned LLM for question answering Beginners	1	50	May 2, 2025
Best way to find a segment of code (output) that matches a given input segment? Beginners	1	17	February 24, 2025
Seeking Advice on Fine-Tuning LLMs for Generating Documents Beginners	1	121	February 15, 2025
Fine-tuning a language model on domain specific embeddings 🤗Transformers	1	1131	November 21, 2023