We deployed both the llama2_7b and llama2_7b_hf models on a local network to compare their performance. Even though we asked the same questions, the two models returned different results; in my case, llama2_7b gave more satisfying answers.
The implementation code for llama2_7b_hf is as follows:

from transformers import AutoTokenizer, AutoModelForCausalLM

# Download the chat checkpoint from the Hugging Face Hub and save local copies.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
tokenizer.save_pretrained("Llama2-7b-tokenizer")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
model.save_pretrained("Llama2-7b-model")
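A possibly related detail: as far as I understand, the chat checkpoint expects Llama-2's [INST] prompt template, and a bare prompt string may confuse it. Here is a sketch of that template as I understand it (the helper function is my own, not code from my run, so please correct me if the format is wrong):

```python
def format_llama2_chat(user_msg, system_msg=None):
    # Llama-2 chat format as I understand it: <s>[INST] ... [/INST],
    # with an optional <<SYS>> system block inside the first user turn.
    if system_msg is not None:
        return f"<s>[INST] <<SYS>>\n{system_msg}\n<</SYS>>\n\n{user_msg} [/INST]"
    return f"<s>[INST] {user_msg} [/INST]"

prompt = format_llama2_chat("What is the capital of South Korea?")
print(prompt)
```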
When implementing llama2_7b, we converted the original Meta checkpoint to the Hugging Face format using the conversion script that ships with transformers:

python convert_llama_weights_to_hf.py
We then asked both models the same question:

prompt = "What is the capital of South Korea?"

and the results differed: the hf model gave a strange answer, while the 7b model answered correctly.
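To make "different answers" concrete: my understanding is that if generate() is called with do_sample=True, two runs can differ even with identical weights, while greedy decoding (do_sample=False) is deterministic. A toy sketch in plain Python of that distinction (toy probabilities, no model needed; the helper names are mine):

```python
import random

probs = [0.55, 0.25, 0.15, 0.05]  # toy next-token distribution

def greedy(p):
    # Greedy decoding: always pick the highest-probability token.
    return p.index(max(p))

def sample(p, rng):
    # Sampling: draw a token index according to the probabilities.
    return rng.choices(range(len(p)), weights=p, k=1)[0]

# Greedy decoding is deterministic across runs:
assert greedy(probs) == greedy(probs) == 0

# Sampling is only reproducible when the RNG is seeded identically:
assert sample(probs, random.Random(0)) == sample(probs, random.Random(0))
```

I am not sure which decoding settings each of my two setups actually used, which is partly why I am asking.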
I thought these two models were the same, so why do I get different answers to the same question? Is there anything I missed?