The Llama 2 introduction states that the model was trained in bfloat16, yet the weights uploaded to Hugging Face are stored in float16. I'm wondering why current work based on Llama 2 loads it in float16 rather than bfloat16.
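For context, here is a minimal sketch of the two loading options I mean, assuming the `meta-llama/Llama-2-7b-hf` checkpoint and the standard `transformers` `torch_dtype` argument to override the dtype at load time:

```python
import torch
from transformers import AutoModelForCausalLM

model_id = "meta-llama/Llama-2-7b-hf"  # assuming the 7B base checkpoint

# What most current work appears to do: load the uploaded weights as float16.
model_fp16 = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
)

# The alternative I'm asking about: cast the weights to bfloat16,
# the dtype Llama 2 was reportedly trained in.
model_bf16 = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
)
```

Is there a reason to prefer the first over the second, given the original training precision?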