If the appropriate configuration (a `chat_template` entry in `tokenizer_config.json`) is present in the model repository, `apply_chat_template()` should work fine. If not, the fallback will probably be the ChatML-equivalent template.
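For reference, ChatML wraps each message in `<|im_start|>`/`<|im_end|>` markers. A minimal pure-Python sketch of that rendering (a hand-rolled illustration, not the actual Jinja template the tokenizer uses):

```python
def render_chatml(messages, add_generation_prompt=False):
    """Render a list of {"role", "content"} dicts in ChatML style.

    Illustrative only: the real fallback in transformers is a Jinja
    template that produces the same <|im_start|>/<|im_end|> structure.
    """
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages]
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)

messages = [
    {"role": "user", "content": "Hi"},
    {"role": "assistant", "content": "Hello!"},
]
print(render_chatml(messages))
```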
Hello, I’m implementing a framework for fine-tuning various LLMs using the TRL library’s SFTTrainer. I have a question about how chat templates work:
When using SFTTrainer with datasets in the standard formats (with “messages” array or “prompt”/“completion” fields), does the trainer automatically apply the tokenizer’s chat_template? The documentation suggests it does.
For models whose tokenizers don’t have a chat_template attribute set (or it’s empty), what template does SFTTrainer apply by default?
*(GitHub issue opened 16 Jan 24, closed 17 Jan 24 UTC)*
Hi! I am interested in using the `SFTTrainer` for instruction-tuning. Following … [the docs](https://huggingface.co/docs/trl/main/en/sft_trainer#dataset-format-support), I can see that I can provide examples in the following format to have the trainer format things for me:
```json
{"prompt": "<prompt text>", "completion": "<ideal generated text>"}
{"prompt": "<prompt text>", "completion": "<ideal generated text>"}
{"prompt": "<prompt text>", "completion": "<ideal generated text>"}
```
The docs also say:
> The [SFTTrainer](https://huggingface.co/docs/trl/main/en/trainer#trl.SFTTrainer) will then format the dataset for you using the defined format from the model’s tokenizer with the [apply_chat_template](https://huggingface.co/docs/transformers/main/en/chat_templating#templates-for-chat-models) method.
My question and confusion is, what does the trainer do if the tokenizer has no `chat_template`, as is the case with the [base llama model](https://huggingface.co/meta-llama/Llama-2-13b-hf/blob/main/tokenizer_config.json)?
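For a base model with no `chat_template`, one workable option (an assumption on my part, not something the docs mandate) is to format the prompt/completion pairs into plain text yourself before handing them to the trainer. A hypothetical sketch:

```python
def format_prompt_completion(example, eos_token="</s>"):
    """Join a {"prompt", "completion"} pair into one training string.

    eos_token here is illustrative; for a real model, pass
    tokenizer.eos_token so the sequence terminates correctly.
    """
    return {"text": example["prompt"] + example["completion"] + eos_token}

row = {"prompt": "Translate to French: cat ->", "completion": " chat"}
print(format_prompt_completion(row)["text"])
```

This sidesteps `apply_chat_template` entirely, which is often what you want for base (non-chat) checkpoints.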
A related snippet using `DataCollatorForCompletionOnlyLM`:
```python
from trl import SFTTrainer, DataCollatorForCompletionOnlyLM
from transformers import AutoTokenizer
from datasets import load_dataset
# Load Dataset and tokenizer
dataset = load_dataset('prince-canuma/tinyOrca', split='train')
tokenizer = AutoTokenizer.from_pretrained("prince-canuma/Damysus-2.7B-Chat")
```

*(snippet truncated in the original post)*
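For context, `DataCollatorForCompletionOnlyLM` masks out everything up to and including a response-template marker, so the loss is computed only on completion tokens. A minimal pure-Python sketch of that label-masking idea (the token ids are made up, and this is not TRL's internal implementation; `-100` is the label value PyTorch's cross-entropy loss ignores):

```python
IGNORE_INDEX = -100  # label value ignored by PyTorch cross-entropy

def mask_before_response(input_ids, response_template_ids):
    """Copy input_ids into labels, masking every position up to and
    including the first occurrence of the response template."""
    labels = list(input_ids)
    n = len(response_template_ids)
    for i in range(len(input_ids) - n + 1):
        if input_ids[i:i + n] == response_template_ids:
            for j in range(i + n):
                labels[j] = IGNORE_INDEX
            break
    return labels

# tokens: [prompt..., response marker (=9), completion...]
print(mask_before_response([5, 6, 9, 7, 8], [9]))  # → [-100, -100, -100, 7, 8]
```

The real collator does this on batched tensors and warns when the template is not found; the idea is the same.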