It is possible to train a single model that can switch between modes, but it seems to be quite difficult, and it also consumes the model's parameter capacity…
There are LLMs specialized for role-playing (RP), so it might be a good idea to search for one of those.
If you want to switch between modes with a short command, I think it would be more reliable to train several LoRA adapters separately with PEFT, etc., and then switch between them programmatically before passing input to the LLM, as sketched below.
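A minimal sketch of that routing with the PEFT library might look like the following; the base model name and the adapter paths are placeholders, and it assumes a "polite" and a "humor" adapter were already trained and saved:

```python
# Sketch: switching between separately trained LoRA adapters with PEFT.
# The model name and adapter paths below are placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

# Load the first adapter, then register a second one under its own name.
model = PeftModel.from_pretrained(base, "./adapters/polite", adapter_name="polite")
model.load_adapter("./adapters/humor", adapter_name="humor")

COMMANDS = {"/polite": "polite", "/humor": "humor"}

def generate(user_input: str) -> str:
    # Route on a leading short command, then strip it before generation.
    for cmd, adapter in COMMANDS.items():
        if user_input.startswith(cmd):
            model.set_adapter(adapter)
            user_input = user_input[len(cmd):].strip()
            break
    inputs = tokenizer(user_input, return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=100)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Since the command is handled by your own code rather than the model, this approach stays reliable even with very short commands.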
The following is by Hugging Chat:
To fine-tune an LLM to incorporate system prompt concepts as default behavior and recognize short commands, follow these steps:
- **System Prompt Integration with Prefix-Tuning**: Use prefix-tuning to embed the core identity and behaviors into the model. This method adds a "soft prompt" that guides the model without altering its core weights, making the system prompt concepts the model's default characteristics [2] (see the prefix-tuning sketch after this list).
- **Short Command Recognition with PEFT**: Implement parameter-efficient fine-tuning (PEFT) to train the model to recognize short commands. PEFT allows efficient adaptation without full retraining, enabling the model to respond to commands like `/polite` or `/humor` with specific behaviors [4] (see the training-data sketch below).
- **System Message Templates**: Define clear command interpretations in the system message. Outline how each command triggers a specific behavior, ensuring the model understands the context and function of each command [3] (see the message-template sketch below).
- **Retrieval-Augmented Generation (RAG) Integration**: After retrieval, fine-tune the model to use commands for style adjustments, enhancing responses with polite or humorous tones as needed [5] (see the RAG prompt sketch below).
- **Token Management with Few-Shot Examples**: Include concise examples in prompts to illustrate command use, helping the model learn without exceeding token limits [1]; the message-template sketch below also illustrates this.
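For the prefix-tuning point, a minimal setup with the PEFT library might look like this; the base model and the number of virtual tokens are illustrative:

```python
# Sketch of prefix-tuning with PEFT: learned "virtual tokens" are prepended
# to every input, steering behavior without touching the base weights.
from transformers import AutoModelForCausalLM
from peft import PrefixTuningConfig, TaskType, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder base model

config = PrefixTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    num_virtual_tokens=20,  # length of the learned soft prompt
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the prefix parameters are trainable
# Train `model` as usual (e.g., with transformers.Trainer) on dialogues that
# show the desired default persona; the "system prompt" behavior is absorbed
# into the learned prefix.
```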
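For command recognition, training data can simply pair each command with responses in the target style. The field names below are one possible convention, not a fixed schema:

```python
# Hypothetical training samples: each pairs a short command with a response
# demonstrating the style the command should trigger.
samples = [
    {"prompt": "/polite Tell me the time.",
     "completion": "Of course, it would be my pleasure. The current time is 3 p.m."},
    {"prompt": "/humor Tell me the time.",
     "completion": "Time? It's exactly one joke past three!"},
]
```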
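For the system message template and few-shot points, the system message can spell out each command and include one short demonstration per command to keep token usage low; the wording here is only an example:

```python
# Example chat-format messages: the system message defines the commands, and a
# single concise few-shot exchange illustrates command use.
messages = [
    {"role": "system", "content": (
        "You are a helpful assistant. Commands: "
        "/polite -> answer in a formal, courteous tone; "
        "/humor -> answer with light humor."
    )},
    # One short few-shot example per command is usually enough.
    {"role": "user", "content": "/polite What's the weather?"},
    {"role": "assistant", "content": (
        "Certainly! I cannot check live weather, but I would be glad to help "
        "you find a forecast."
    )},
]
```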
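For the RAG point, one way to combine a style command with retrieved context is to prepend both to the question when assembling the prompt; `retrieve` here is a hypothetical function returning relevant passages:

```python
# Sketch of combining a style command with retrieved context in a RAG pipeline.
def build_rag_prompt(command: str, question: str, retrieve) -> str:
    context = "\n".join(retrieve(question))  # retrieve() is assumed, not a real API
    return f"{command}\nContext:\n{context}\n\nQuestion: {question}"
```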
By integrating these methods, the model will naturally adopt system prompt behaviors and respond to short commands efficiently.
Answer:
To achieve the desired fine-tuning, employ prefix-tuning for default system behaviors, PEFT for recognizing short commands, clear command definitions in prompts, and efficient token management. This approach ensures the model understands and applies commands as default behaviors without requiring lengthy prompts each time [2][4][3][5][1].