How to Implement Few-Shot Prompting with the LLaMA-2 Chat Model

Hi, I want to know how to implement few-shot prompting with the LLaMA-2 chat model. Currently, I have a basic zero-shot prompt setup as follows:

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-chat-hf"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

messages = [
    {"role": "system", "content": "Please answer the math question."},
    {"role": "user", "content": "2+2=?"}
]

input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt")

generated_ids = model.generate(input_ids, max_new_tokens=1000, do_sample=True)
outputs = tokenizer.batch_decode(generated_ids)
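
For readability, I also strip the prompt tokens before decoding so that only the model's reply is printed (a minimal sketch, assuming a single sequence in the batch):

# Decode only the newly generated tokens, skipping the prompt portion
# (assumes generated_ids contains exactly one sequence).
prompt_length = input_ids.shape[-1]
response = tokenizer.decode(generated_ids[0, prompt_length:], skip_special_tokens=True)
print(response)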

I’m considering adding a few examples to the messages sequence for few-shot prompting. However, I haven’t found any specific guidelines on this for LLaMA-2. Drawing inspiration from a blog post about few-shot prompting with the OpenAI API, my idea is to insert several user/assistant interactions right after the system prompt. It looks like this:

messages = [
    {"role": "system", "content": "Please answer the math question."},
    {"role": "user", "content": "1+1=?"},  # example 1
    {"role": "assistant", "content": "2"},  # example 1
    {"role": "user", "content": "1+2=?"},  # example 2
    {"role": "assistant", "content": "3"},  # example 2
    {"role": "user", "content": "2+2=?"}
]

input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt")

generated_ids = model.generate(input_ids, max_new_tokens=1000, do_sample=True)
outputs = tokenizer.batch_decode(generated_ids)
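
To sanity-check how the example turns are flattened into a single prompt, I also render the conversation as plain text before generating (the exact [INST]/<<SYS>> markup comes from the tokenizer's bundled Llama-2 chat template, so I'm assuming it interleaves the extra user/assistant pairs as expected):

# Render the conversation as a string instead of token IDs, so the
# few-shot turns can be inspected by eye before generation.
prompt_text = tokenizer.apply_chat_template(messages, tokenize=False)
print(prompt_text)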

Is this approach correct?


I also have the same question. Were you able to find the best practice for doing this?