ValueError fp16 lm_head.weight

TallBoi · October 1, 2021, 12:25pm

I am trying to run run_translation.py with mt5-large and DeepSpeed enabled. I use ds_config_zero3.json as the config file. However, when I try to run this, I get the following error:

ValueError: fp16 is enabled but the following parameters have dtype that is not fp16: lm_head.weight

Is there some config setting I’m missing that could help resolve this issue?

alexerdmann · October 24, 2021, 3:58pm

Hey did you figure out to resolve this? I’d be interested to learn what you did.

I ran the ASR example here, and it ran fine, but I noticed it had fp16 set to false. If I try to save memory by passing --fp16 at the command line or manually invoking fp16=True when calling TrainingArguments, I get the same error you report.

Topic		Replies	Views
DeepSpeed Zero3 and Peft LoRA fp16 issue Intermediate	3	2987	May 24, 2023
Explicitly disable bf16 for some layers 🤗Transformers	2	14	June 17, 2025
It says that `bfloat16.enabled` without `auto' needed to be specified when training T5, is anyone aware of how to solve that? DeepSpeed	0	255	February 20, 2024
Bitsandbytes `has_fp16_weights` issue 🤗Transformers	1	173	August 15, 2024
[Deepspeed] ZeRO-Infinity integration released and config changes DeepSpeed	2	2295	April 28, 2021

ValueError fp16 lm_head.weight

Related topics