Guidance Scale for Flux LoRA

I have been training DreamBooth + LoRA for a while, following the training script format from diffusers/examples/dreambooth/README_flux.md (huggingface/diffusers on GitHub):

export MODEL_NAME="black-forest-labs/FLUX.1-dev"
export INSTANCE_DIR="dog"
export OUTPUT_DIR="trained-flux-lora"

accelerate launch train_dreambooth_lora_flux.py \
  --pretrained_model_name_or_path=$MODEL_NAME  \
  --instance_data_dir=$INSTANCE_DIR \
  --output_dir=$OUTPUT_DIR \
  --mixed_precision="bf16" \
  --instance_prompt="a photo of sks dog" \
  --resolution=512 \
  --train_batch_size=1 \
  --guidance_scale=1 \
  --gradient_accumulation_steps=4 \
  --optimizer="prodigy" \
  --learning_rate=1. \
  --report_to="wandb" \
  --lr_scheduler="constant" \
  --lr_warmup_steps=0 \
  --max_train_steps=500 \
  --validation_prompt="A photo of sks dog in a bucket" \
  --validation_epochs=25 \
  --seed="0" \
  --push_to_hub

My question is: why set the guidance scale to 1? As far as I know, a guidance scale of 1 is not ideal, and the usually preferred values are around 3.5, 7, etc.

For example, in the source code of huggingface/diffusers/blob/main/examples/dreambooth/train_dreambooth_lora_flux.py (around L460), the guidance scale defaults to 3.5:

    parser.add_argument(
        "--guidance_scale",
        type=float,
        default=3.5,
        help="the FLUX.1 dev variant is a guidance distilled model",
    )
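For context on how that flag is actually consumed: as far as I can tell, during training it is simply turned into one scalar per sample and fed to the transformer as a conditioning input (FLUX.1-dev is guidance-distilled, so this is an embedded value, not classifier-free guidance). A self-contained sketch of that step; the values and names below are illustrative, not the script's exact code:

    import torch

    # --guidance_scale just becomes a per-sample tensor that conditions the model.
    guidance_scale = 1.0   # what the README passes
    batch_size = 4         # stands in for model_input.shape[0]

    guidance = torch.tensor([guidance_scale]).expand(batch_size)
    print(guidance)        # tensor([1., 1., 1., 1.])
    # this tensor is what ends up in FluxTransformer2DModel.forward(..., guidance=...)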

Since I'm working on Flux, I also checked the source code of FluxTransformer2DModel, which is the model used in the Flux LoRA DreamBooth script. In its forward() function, I see that the guidance scale is multiplied by 1000 (diffusers/src/diffusers/models/transformers/transformer_flux.py in huggingface/diffusers on GitHub):

        if guidance is not None:
            guidance = guidance.to(hidden_states.dtype) * 1000
        else:
            guidance = None
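For what it's worth, a few lines above that, the timestep is scaled by 1000 in the same way, and both values then go through the same sinusoidal projection inside the combined time/guidance embedder, presumably so the embedding sees values in the familiar 0..1000 timestep range rather than 0..1. A rough, self-contained illustration of that projection step (embedding_dim=256 is just an assumption for the sketch):

    import torch
    from diffusers.models.embeddings import get_timestep_embedding

    # guidance is scaled exactly like the timestep before the sinusoidal projection
    guidance = torch.tensor([3.5]) * 1000
    emb = get_timestep_embedding(guidance, embedding_dim=256)
    print(emb.shape)  # torch.Size([1, 256])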

What's the reason here?


I don't understand the theory, but I understand the practical answer: set it to 1.0. If kohya-ss says so, it must be correct.

kohya-ss on Aug 29, 2024
I don’t know about “Distilled CFG at 3.5”, but a model trained with guidance_scale 1.0 should require a guidance scale of around 3.5 for inference, just like the original model.
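In practice, that just means you keep --guidance_scale=1 for training and pass a normal guidance scale (e.g. 3.5) at inference. A minimal sketch, assuming the LoRA was pushed to the Hub (the repo id below is a placeholder):

    import torch
    from diffusers import FluxPipeline

    pipe = FluxPipeline.from_pretrained(
        "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
    ).to("cuda")
    pipe.load_lora_weights("your-username/trained-flux-lora")  # placeholder repo id

    image = pipe(
        "A photo of sks dog in a bucket",
        guidance_scale=3.5,       # distilled guidance at inference, even though training used 1.0
        num_inference_steps=28,
    ).images[0]
    image.save("sks_dog_bucket.png")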


lol thanks for the information!
