meta-llama/Llama-2-7b-chat-hf does not generate a response when the prompt is long

Okay, I finally found a solution by asking in the llama-recipes GitHub repo (see the issue Huggingface meta-llama/Llama-2-7b-chat-hf model not generate response when prompt is long · Issue #219 · facebookresearch/llama-recipes · GitHub).

To use the Llama 2 chat model, the prompt needs to follow a specific format, which includes the [INST] and <<SYS>> tags, the BOS and EOS tokens, etc. The format_tokens() function in llama-recipes (https://github.com/facebookresearch/llama-recipes/blob/main/examples/chat_completion/chat_completion.py#L83) shows how to do the formatting.
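
For reference, here is a minimal sketch of what that format looks like for a single system + user turn. The B_INST/B_SYS constants mirror the ones used in llama-recipes; the model name is the real HuggingFace ID, but the system prompt, user message, and generation settings are just illustrative placeholders:

```python
# Minimal sketch of the Llama 2 chat prompt format (single system + user turn).
# Assumes transformers is installed and you have access to the gated model.
from transformers import AutoModelForCausalLM, AutoTokenizer

B_INST, E_INST = "[INST]", "[/INST]"
B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"

def build_prompt(system_prompt: str, user_msg: str) -> str:
    # The Llama tokenizer prepends the BOS token (<s>) by default,
    # so the string itself only needs the INST/SYS markers.
    return f"{B_INST} {B_SYS}{system_prompt}{E_SYS}{user_msg.strip()} {E_INST}"

model_name = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# device_map="auto" assumes the accelerate package is installed.
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

prompt = build_prompt(
    "You are a helpful assistant. Answer concisely.",   # placeholder
    "Summarize the plot of Hamlet in two sentences.",   # placeholder
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
# Skip the prompt tokens so only the newly generated answer is decoded.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:],
                       skip_special_tokens=True))
```

For multi-turn conversations, each completed exchange also needs to be closed with the EOS token (</s>) and the next turn opened with a fresh BOS (<s>); the format_tokens() function linked above handles that bookkeeping, so for anything beyond a single turn it is safer to reuse it than to build the string by hand.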