Morning, quick Q… Why would a model's estimated training time jump from ~111 hrs to ~400 hrs when I changed it to load in 8-bit, instead of loading the model straight from the Hub (32-bit by default, I assume)?
Originally the model was loaded as t5-xl, and I changed it to (t5-xl, load_in_8bit…) to put less strain on GPU memory, but the training time has gone up?
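For context, this is roughly the change, as a sketch assuming the usual transformers + bitsandbytes path (my actual call has a few more kwargs I've left out):

```python
from transformers import T5ForConditionalGeneration

# Before: plain load straight from the Hub (weights in full precision)
# model = T5ForConditionalGeneration.from_pretrained("t5-xl")

# After: 8-bit load via bitsandbytes, to cut GPU memory use
model = T5ForConditionalGeneration.from_pretrained(
    "t5-xl",
    load_in_8bit=True,   # quantizes the linear layers to int8 at load time
    device_map="auto",   # 8-bit loading expects a device map so weights land on GPU
)
```

My understanding is that int8 matmuls go through extra quantize/dequantize steps at runtime, which is where I suspect the slowdown comes from, but that's the part I'd like confirmed.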
It sort of sounds plausible now that I write it out, but just for clarity: is that normal?
Thanks.