Help with Llama 2 Finetuning Setup

Hello!
I’m trying to follow the Llama 2 fine-tuning example provided by Databricks here: https://github.com/databricks/databricks-ml-examples/blob/b1ca47c058461f7fde214914d53b051990064d94/llm-models/llamav2/llamav2-7b/scripts/fine_tune_deepspeed.py#L94. However, I ran into the following error:

"lib/python3.9/site-packages/transformers/generation/configuration_utils.py", line 354, in validate
    raise ValueError(
ValueError: do_sample is set to False. However, temperature is set to 0.9 – this flag is only used in sample-based generation modes. Set do_sample=True or unset temperature to continue.

It is raised for the meta-llama/Llama-2-7b-chat-hf model on this line:

model = transformers.AutoModelForCausalLM.from_pretrained(
    pretrained_model_name_or_path,
    torch_dtype=torch.float16,
    trust_remote_code=True,
    use_auth_token=True,
)

I don’t set temperature anywhere in the script.
Does anyone have any idea what the issue is?
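
For what it’s worth, the temperature=0.9 in the message seems to come from the generation_config.json that ships with the checkpoint rather than from the script, while do_sample simply defaults to False, and the stricter GenerationConfig.validate() check in recent transformers rejects that combination. A quick sketch to see what the checkpoint actually ships (assuming huggingface_hub is installed and your token has access to the gated repo):

# Inspect the generation config bundled with meta-llama/Llama-2-7b-chat-hf.
import json
from huggingface_hub import hf_hub_download

path = hf_hub_download(
    "meta-llama/Llama-2-7b-chat-hf",
    "generation_config.json",
    token=True,  # reuse the token saved by `huggingface-cli login`
)
with open(path) as f:
    print(json.load(f))  # shows do_sample / temperature / top_p as stored in the repo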


Try setting the temperature parameter to 0.1 when initialising the Hugging Face pipeline.
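
A sketch of that suggestion (illustrative only, assuming access to the gated repo): pass the sampling flags as generation kwargs when building the pipeline, so temperature is only used together with do_sample=True.

# Build a text-generation pipeline with consistent sampling flags.
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="meta-llama/Llama-2-7b-chat-hf",
    use_auth_token=True,
    do_sample=True,   # sampling must be enabled for temperature to apply
    temperature=0.1,
)

Note that on the affected transformers versions the ValueError can still be raised while the checkpoint itself is being loaded, which is what the reply below runs into.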

ValueError: do_sample is set to False. However, temperature is set to 0.9 – this flag is only used in sample-based generation modes. Set do_sample=True or unset temperature to continue.

my code is:

import torch
from transformers import BitsAndBytesConfig, GenerationConfig, LlamaForCausalLM, LlamaTokenizer

MODEL_NAME = "meta-llama/Llama-2-7b-chat-hf"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_use_double_quant=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = LlamaForCausalLM.from_pretrained(
    MODEL_NAME,
    device_map="auto",
    trust_remote_code=True,
    use_auth_token=True,
    temperature=0.1,
    do_sample=True,
    quantization_config=bnb_config,
)

tokenizer = LlamaTokenizer.from_pretrained(MODEL_NAME)
tokenizer.pad_token = tokenizer.eos_token

I explicitly changed do_sample=True in configuration_utils.py, but it didn’t work.
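
A less invasive workaround than patching site-packages might be to download the checkpoint, fix its generation_config.json so the flags are consistent, and load from the local copy. A sketch, assuming huggingface_hub is installed and you have access to the gated repo; the local_dir name is just an example:

import json
from pathlib import Path
from huggingface_hub import snapshot_download

# Download a local copy of the checkpoint (large download).
local_dir = snapshot_download(
    "meta-llama/Llama-2-7b-chat-hf",
    local_dir="llama-2-7b-chat-hf",
    token=True,
)

# Make the stored generation config consistent: temperature/top_p only apply
# when sampling is enabled.
cfg_path = Path(local_dir) / "generation_config.json"
cfg = json.loads(cfg_path.read_text())
cfg["do_sample"] = True
cfg_path.write_text(json.dumps(cfg, indent=2))

# Then point from_pretrained at the local copy instead of the hub id:
# model = LlamaForCausalLM.from_pretrained(local_dir, quantization_config=bnb_config, device_map="auto")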

I am having the same issue. It was working without problems until last night.
I tried to change the config file and update it by adding do_sample=true, but that did not work.

!pip install -qqq bitsandbytes --progress-bar off

!pip install -qqq torch --progress-bar off

!pip install -q -U git+https://github.com/huggingface/transformers.git

from transformers import AutoModelForCausalLM, BitsAndBytesConfig

MODEL_NAME = "meta-llama/Llama-2-7b-chat-hf"

# bnb_config is built elsewhere in the notebook (not shown)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, device_map="auto", quantization_config=bnb_config)

Error msg:

ValueError                                Traceback (most recent call last)
in <cell line: 1>()
----> 1 model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, device_map="auto", quantization_config=bnb_config)

5 frames
/usr/local/lib/python3.10/dist-packages/transformers/generation/configuration_utils.py in validate(self)
    352                 )
    353             if self.temperature != 1.0:
--> 354                 raise ValueError(
    355                     greedy_wrong_parameter_msg.format(flag_name="temperature", flag_value=self.temperature)
    356                 )

ValueError: do_sample is set to False. However, temperature is set to 0.9 – this flag is only used in sample-based generation modes. Set do_sample=True or unset temperature to continue.

For anyone looking for a solution: this was an issue with the latest release of Hugging Face transformers. Downgrade to the previous version with !pip install git+https://github.com/huggingface/transformers@v4.31-release to fix the issue.
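
If you prefer pinning the PyPI release instead of installing from the GitHub branch, the following should be equivalent:

!pip install -q "transformers==4.31.0"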


Thank you.

Downgrading made it work :+1:

Thanks, it works!

I was using autotrain and got the same error, but downgrading transformers didn’t solve the issue.

!pip install huggingface_hub autotrain-advanced
!pip install git+https://github.com/huggingface/transformers@v4.31-release

This is how I installed the packages.
Is there any solution for this?
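
One thing worth double-checking is which transformers version is actually active in the runtime after both installs, since autotrain-advanced may pull in its own pinned transformers and the install order decides which version wins. A minimal check:

import transformers
print(transformers.__version__)  # confirm the v4.31 downgrade actually took effect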