The apply_chat_template function is probably more primitive than you think. It's literally just a text template that ships alongside the model (as part of the tokenizer config), and at runtime the tokenizer renders it to convert the OpenAI-style list[dict] into a str.
I don’t think the OpenAI-style format is standardized in any way; it’s just convenient to use. Strictly speaking, it differs a little from one piece of software to the next…
Well, it’s easy to use.
Ah great. Thanks.
So it’s just a matter of serializing the structured messages into a text input.
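Right. As a minimal sketch of that idea (the role markers below are illustrative conventions, simplified from real templates, not what every model uses), the same list[dict] can serialize into quite different strings depending on the template:

```python
# Two hypothetical renderers serializing the same structured messages
# into different prompt strings.
messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hello!"},
]

def render_chatml_style(msgs):
    # ChatML-style markers: <|im_start|>role ... <|im_end|>
    return "".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in msgs
    )

def render_inst_style(msgs):
    # [INST]-style markers (heavily simplified; real templates add more)
    out = []
    for m in msgs:
        if m["role"] == "user":
            out.append(f"[INST] {m['content']} [/INST]")
        else:
            out.append(f" {m['content']} ")
    return "".join(out)

print(render_chatml_style(messages))
print(render_inst_style(messages))
```

Same input structure, two different serializations — which is exactly why the template has to travel with the model.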
Still, the question about standards remains. The docs you referred to mention it uses Jinja as the templating engine, yet it seems not all models follow that either.
E.g. for Gemma there’s a template in a Go-like style:
{{- range $i, $_ := .Messages }}
So how does apply_chat_template know how to render them in general?
And regarding the input structure itself: are .messages.role and .content at least some kind of fixed base structure, or could these vary as well?
Even though it is an OpenAI-style format, it is not an OpenAI class. It just has to be a list[dict] with a similar shape…
For more information on the template format, see below.
chat = [
{"role": "user", "content": "Hello, how are you?"},
{"role": "assistant", "content": "I'm doing great. How can I help you today?"},
{"role": "user", "content": "I'd like to show off how chat templating works!"},
]
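In transformers you would pass this list to tokenizer.apply_chat_template(chat, tokenize=False) and get back a single string. As a hand-rolled approximation of what a ChatML-style template would do with it (the markers are one common convention, assumed here for illustration):

```python
# Rough approximation of rendering the chat above with a ChatML-style
# template. The Jinja template stored with a tokenizer does essentially
# this loop, just written in Jinja syntax.
chat = [
    {"role": "user", "content": "Hello, how are you?"},
    {"role": "assistant", "content": "I'm doing great. How can I help you today?"},
    {"role": "user", "content": "I'd like to show off how chat templating works!"},
]

def apply_template(messages, add_generation_prompt=False):
    prompt = ""
    for m in messages:
        prompt += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Open an assistant turn so the model knows to reply next
        prompt += "<|im_start|>assistant\n"
    return prompt

print(apply_template(chat, add_generation_prompt=True))
```

The real thing swaps this hard-coded loop for whatever Jinja template the model author shipped, which is why the output format varies per model.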