Tokenizer.apply_chat_template vs formatting_func

From what I understand from https://huggingface.co/docs/transformers/main/en/chat_templating#what-are-generation-prompts, `tokenizer.apply_chat_template` serves exactly the same role as

```python
def formatting_prompts_func(example):
    output_texts = []
    for i in range(len(example['question'])):
        text = f"### Question: {example['question'][i]}\n ### Answer: {example['answer'][i]}"
        output_texts.append(text)
    return output_texts
```

in TRL's Supervised Fine-tuning Trainer (`SFTTrainer`). Is my understanding correct? If not, what are some of their differences?
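To make the comparison concrete, here is a minimal, self-contained sketch of what I mean. The `formatting_prompts_func` is the one above; the second function is only a rough stand-in for what `tokenizer.apply_chat_template(..., tokenize=False)` produces (a hypothetical ChatML-style template, not any specific model's actual template):

```python
def formatting_prompts_func(example):
    # Ad-hoc template I define myself: one formatted string per batch row.
    output_texts = []
    for i in range(len(example["question"])):
        text = f"### Question: {example['question'][i]}\n ### Answer: {example['answer'][i]}"
        output_texts.append(text)
    return output_texts


def apply_chat_template_sketch(messages, add_generation_prompt=False):
    # Hypothetical stand-in: renders a list of {"role", "content"} dicts
    # in a ChatML-like format, the way a tokenizer's built-in chat template
    # would render them with its own special tokens.
    out = "".join(
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    )
    if add_generation_prompt:
        # A generation prompt cues the model to respond as the assistant.
        out += "<|im_start|>assistant\n"
    return out


example = {"question": ["2+2?"], "answer": ["4"]}
print(formatting_prompts_func(example)[0])

messages = [
    {"role": "user", "content": "2+2?"},
    {"role": "assistant", "content": "4"},
]
print(apply_chat_template_sketch(messages))
```

Both turn a training example into a single string, which is why they look interchangeable to me; the obvious surface difference is that the chat template comes bundled with the tokenizer (roles plus special tokens) while the formatting func is something I write per dataset.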
