While trying to test the microsoft/phi-2 model, I read that torch does not support float16 on CPU. Example threads:
- microsoft/phi-2 · RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'
- Better error message when trying to run fp16 weights on CPU · Issue #96292 · pytorch/pytorch · GitHub
Why does the following code work on a GPT-2 model, where I'm seemingly setting the same configuration? Does torch_dtype mean something different from what those threads are discussing?
import torch
import transformers

# Plain GPT-2 checkpoint used for model/tokenizer in my test
tokenizer = transformers.AutoTokenizer.from_pretrained("gpt2")
model = transformers.AutoModelForCausalLM.from_pretrained("gpt2")

pipe = transformers.pipeline('text-generation', model=model, tokenizer=tokenizer,
                             device="cpu", torch_dtype=torch.float16)
res = pipe("Here is a recipe for vegan banana bread:\n",
           max_new_tokens=128,  # example value
           do_sample=False,
           use_cache=True)
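For reference, the error in those threads comes from running a LayerNorm on half-precision tensors on CPU. A minimal probe like the sketch below (independent of transformers; the variable names are my own) shows whether a given PyTorch build supports that op, since support has varied across versions:

```python
import torch

# Probe: does this PyTorch build support LayerNorm in float16 on CPU?
# Older builds raise:
#   RuntimeError: "LayerNormKernelImpl" not implemented for 'Half'
ln = torch.nn.LayerNorm(8).to(torch.float16)
x = torch.randn(2, 8, dtype=torch.float16)
try:
    out = ln(x)
    status = f"ok: output dtype {out.dtype}"
except RuntimeError as e:
    status = f"unsupported: {e}"
print(status)
```

If this prints "unsupported", the pipeline call above would be expected to fail the same way whenever the model actually executes fp16 LayerNorms on CPU.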