Memory requirements

Hi, I am trying to fine-tune meta-llama/Llama-3.2-1B-Instruct. I loaded the model in 4-bit precision with the Transformers library and applied LoRA using the PEFT library and TRL. The issue comes when I start the training step: I constantly run out of memory, and I don’t know why. These are my training arguments:

training_args = SFTConfig(
    output_dir='/content/results',
    num_train_epochs=5,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    gradient_accumulation_steps=2,
    learning_rate=2e-4,
    bf16=True,
    logging_steps=50,
    eval_strategy='steps',
    eval_steps=500,
    save_strategy="steps",
    save_steps=500,
    warmup_steps=100,
    weight_decay=0.01,
    logging_dir="/content/logs",
    packing=True,
    report_to="none"
)

trainer = SFTTrainer(
    model=model,
    train_dataset=templated_dataset['train'],
    eval_dataset=templated_dataset['test'],
    args=training_args,
    tokenizer=tokenizer,
)
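For reference, the 4-bit + LoRA setup looks roughly like this. It is a sketch: the exact BitsAndBytesConfig and LoraConfig values below (quant type, r, alpha, dropout, target modules) are assumptions, not necessarily the ones I used.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit quantization config (NF4 values are an assumption)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

# Casts norms/embeddings appropriately and enables input grads for k-bit training
model = prepare_model_for_kbit_training(model)

# LoRA config (hyperparameters here are illustrative)
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```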

The sequence length is 2048, and there are 1,179,648 trainable (LoRA) parameters. I estimated that I would need around 3.57 GB, but I run out of memory even with the 15 GB I have. I don’t know if there is something wrong with my training argument configuration. Can you help me, please? Thanks in advance.
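To sanity-check my estimate, here is the back-of-the-envelope arithmetic in plain Python. All multipliers are rough assumptions (0.5 byte/param for 4-bit weights, fp32 Adam states for the LoRA params, and a crude per-layer activation multiplier), so treat the numbers as orders of magnitude, not measurements:

```python
# Rough memory estimate for QLoRA fine-tuning; every multiplier
# below is a back-of-the-envelope assumption, not a measured value.

base_params = 1.24e9     # ~1.24B parameters in Llama-3.2-1B
lora_params = 1_179_648  # trainable LoRA parameters from the post

# 4-bit quantized base weights: ~0.5 byte per parameter
base_gb = base_params * 0.5 / 1e9

# LoRA params: bf16 weights (2) + bf16 grads (2) + fp32 Adam m/v (4+4)
lora_gb = lora_params * (2 + 2 + 4 + 4) / 1e9

# Activations scale with batch * seq_len * hidden_size * num_layers;
# "12 bf16 tensors per layer" is a crude stand-in multiplier.
batch, seq_len, hidden, layers = 1, 2048, 2048, 16
act_gb = batch * seq_len * hidden * layers * 12 * 2 / 1e9

total_gb = base_gb + lora_gb + act_gb
print(f"weights ~{base_gb:.2f} GB, LoRA states ~{lora_gb:.3f} GB, "
      f"activations ~{act_gb:.2f} GB, total ~{total_gb:.2f} GB")
```

Even under these assumptions the LoRA optimizer states are tiny; the activation term dominates and grows quickly without gradient checkpointing, which may be where my 3.57 GB estimate falls short.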
