I have trained a LoRA with kohya_ss on the base model (stabilityai/stable-diffusion-xl-base-1.0) using 10 images. I was wondering where the output comes from (the base model or my customized training).
What percentage of each is the final output composed of?
E.g.:
(Base Model:60%, Customized Training:40%)
(Base Model:70%, Customized Training:30%)
For example:
The prompt is: DNA has to be shown in the background with an Indian-Woman-with-Mouth-Cancer in the Foreground
from diffusers import AutoPipelineForText2Image, AutoencoderKL
import torch
import os
import numpy as np
from PIL import Image
print("vae")
# Clear GPU memory before starting
torch.cuda.empty_cache()
# Set seed for reproducibility
#torch.manual_seed(6666666)
#np.random.seed(6666666)
# Define the path to the directory containing your model and LoRA weights
print("Define the path to the directory containing your model and LoRA weights")
model_dir = "D:\\Ganu\\AIImage\\huggingface\\kohya_ss\\kohya_ss\\trained-model\\model\\"
lora_weights_path = os.path.join(model_dir, "last.safetensors")
# Load the base model using StableDiffusionPipeline
print("Load the base model using StableDiffusionPipeline")
model_id = "stabilityai/stable-diffusion-xl-base-1.0"
adapter_id = "wangfuyun/PCM_SDXL_LoRAs"
#vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)
# enable_sequential_cpu_offload manages device placement itself, so the pipeline should not be moved with .to() first
pipeline = AutoPipelineForText2Image.from_pretrained(model_id, torch_dtype=torch.float32, variant="fp16")
pipeline.enable_sequential_cpu_offload()
pipeline.enable_attention_slicing("max")
# Load the LoRA weights
print("Load the LoRA weights")
try:
    pipeline.load_lora_weights(lora_weights_path, weight_name="last.safetensors")
except ValueError as e:
    print("Invalid LoRA checkpoint. Please check the compatibility and format of the weights file.")
    raise e
# Generate an image from a text prompt
print("Generate an image from a text prompt")
text_prompt = "DNA has to be shown in the background with a Indain-Woman-with-Mouth-Cancer in the Foreground"
generated_image = pipeline(prompt=text_prompt).images[0]
generated_image.save("generated_image.png")
generated_image.show()
Good evening. That question is essentially impossible to answer…
The answer would be something like "it depends on the base model", "it depends on what you want to express with the LoRA (if it's something like the characteristics of a person or a character, then the LoRA will have a big impact)", or "it can't be expressed as a percentage in the first place".
This is because the base model and the LoRA are fused together when inference is executed. The mixed neural network is no longer suited to being expressed as a percentage.
LoRA is not the same as full fine-tuning, but it is one of several methods for training models, and there are various LoRA algorithms, each with its own strengths and weaknesses. (I am not familiar with each algorithm.)
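To see why, here is a minimal illustration (my own sketch, not kohya_ss or diffusers internals; the layer width and rank are made up) of how a LoRA is applied at inference: the low-rank update is added into the base weight, and every output is produced by the fused matrix as a whole.

import torch

d, r = 768, 8                  # hypothetical layer width and LoRA rank
W_base = torch.randn(d, d)     # frozen base-model weight
A = torch.randn(r, d)          # trained LoRA down-projection
B = torch.randn(d, r)          # trained LoRA up-projection
scale = 1.0                    # LoRA weight; can be lowered at inference

# Inference fuses the two into a single effective weight:
W_effective = W_base + scale * (B @ A)

x = torch.randn(d)
y = W_effective @ x            # the output comes from the fused matrix as a
                               # whole, so "X% base / Y% LoRA" is not well defined

Lowering scale blends the update in more weakly, which is exactly the knob discussed further down in this thread.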
Can I get the original last.safetensors weights file (for the model stabilityai/stable-diffusion-xl-base-1.0) without my customized training, so I can check the difference from my customized training? Here is my script with the LoRA applied:
from diffusers import AutoPipelineForText2Image, AutoencoderKL
import torch
import os
import numpy as np
from PIL import Image
print("vae")
# Clear GPU memory before starting
torch.cuda.empty_cache()
# Set seed for reproducibility
#torch.manual_seed(6666666)
#np.random.seed(6666666)
# Define the path to the directory containing your model and LoRA weights
print("Define the path to the directory containing your model and LoRA weights")
model_dir = "D:\\Ganu\\AIImage\\huggingface\\kohya_ss\\kohya_ss\\trained-model\\model\\"
lora_weights_path = os.path.join(model_dir, "last.safetensors")
# Load the base model using StableDiffusionPipeline
print("Load the base model using StableDiffusionPipeline")
model_id = "stabilityai/stable-diffusion-xl-base-1.0"
adapter_id = "wangfuyun/PCM_SDXL_LoRAs"
#vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)
# enable_sequential_cpu_offload manages device placement itself, so the pipeline should not be moved with .to() first
pipeline = AutoPipelineForText2Image.from_pretrained(model_id, torch_dtype=torch.float32, variant="fp16")
pipeline.enable_sequential_cpu_offload()
pipeline.enable_attention_slicing("max")
# Load the LoRA weights
print("Load the LoRA weights")
try:
    pipeline.load_lora_weights(lora_weights_path, weight_name="last.safetensors")
except ValueError as e:
    print("Invalid LoRA checkpoint. Please check the compatibility and format of the weights file.")
    raise e
# Generate an image from a text prompt
print("Generate an image from a text prompt")
text_prompt = "DNA has to be shown in the background, and a Indain Woman with Skin Disease in the Foreground"
generated_image = pipeline(prompt=text_prompt).images[0]
generated_image.save("generated_image.png")
generated_image.show()
And here is the same script without loading the LoRA weights:
from diffusers import AutoPipelineForText2Image, AutoencoderKL
import torch
import os
import numpy as np
from PIL import Image
print("vae")
# Clear GPU memory before starting
torch.cuda.empty_cache()
# Set seed for reproducibility
#torch.manual_seed(6666666)
#np.random.seed(6666666)
# Load the base model using StableDiffusionPipeline
print("Load the base model using StableDiffusionPipeline")
model_id = "stabilityai/stable-diffusion-xl-base-1.0"
adapter_id = "wangfuyun/PCM_SDXL_LoRAs"
#vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)
# enable_sequential_cpu_offload manages device placement itself, so the pipeline should not be moved with .to() first
pipeline = AutoPipelineForText2Image.from_pretrained(model_id, torch_dtype=torch.float32, variant="fp16")
pipeline.enable_sequential_cpu_offload()
pipeline.enable_attention_slicing("max")
# Generate an image from a text prompt
print("Generate an image from a text prompt")
text_prompt = "DNA has to be shown in the background, and a Indain Woman with Skin Disease in the Foreground"
generated_image = pipeline(prompt=text_prompt).images[0]
generated_image.save("generated_image.png")
generated_image.show()
I think this is because the latter code does not apply last.safetensors (the LoRA). Also, if you want to keep both the pre-training and post-training models in Kohya SS, you need to specify an option…
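If the goal is just to compare the base model with and without your customized training, you do not need a separate weights file: a recent diffusers (with the PEFT backend) can toggle a loaded LoRA on and off. A sketch reusing the paths and prompt from your scripts, with a fixed seed so the only difference between the two images is the LoRA:

import torch
from diffusers import AutoPipelineForText2Image

model_dir = "D:\\Ganu\\AIImage\\huggingface\\kohya_ss\\kohya_ss\\trained-model\\model\\"
pipeline = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float32, variant="fp16"
)
pipeline.enable_sequential_cpu_offload()
pipeline.load_lora_weights(model_dir, weight_name="last.safetensors")

prompt = "DNA has to be shown in the background, and an Indian Woman with Skin Disease in the Foreground"

pipeline.disable_lora()   # base model only
pipeline(prompt=prompt, generator=torch.Generator().manual_seed(42)).images[0].save("base_only.png")

pipeline.enable_lora()    # base model + your LoRA
pipeline(prompt=prompt, generator=torch.Generator().manual_seed(42)).images[0].save("with_lora.png")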
I am getting great images from the program without the LoRA. So if I want to retain the core design (without the LoRA) and then apply my LoRA fine-tuning on top of it for cosmetic changes (all in one go!), how can I achieve that?
I see. You want to train and apply the LoRA to the extent that it doesn't erase the goodness of the base model.
One way to do this is to lower the weight (scale) below 1.0 when applying a LoRA that has already been trained.
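For example (a sketch, assuming a recent diffusers with the PEFT backend, reusing pipeline, model_dir, and text_prompt from the scripts above; the adapter name "my_lora" is made up for illustration):

# Load the trained LoRA under an explicit adapter name, then dial its strength down
pipeline.load_lora_weights(model_dir, weight_name="last.safetensors", adapter_name="my_lora")
pipeline.set_adapters(["my_lora"], adapter_weights=[0.5])  # apply the LoRA at 50% strength
image = pipeline(prompt=text_prompt).images[0]

# Alternatively, pass the scale per call instead of per adapter:
# image = pipeline(prompt=text_prompt, cross_attention_kwargs={"scale": 0.5}).images[0]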
Another way is to control, through the training parameters, how strongly the training data influences the LoRA. In the case of Kohya SS, the relevant parameters are as follows.
There are a lot of "Training Parameters". Is there a default value for all of them, or will I have to do a lot of trial and error with each of them?
Existing semi-automatic training scripts such as Kohya SS and OneTrainer ship with parameter defaults that are already within an acceptable range.
So it would probably be faster to search for know-how on creating LoRAs for similar use cases and borrow the detailed parameters from there.
I think tools like Optuna are more like frameworks for finding parameters when fine-tuning models fully manually.
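For reference, a minimal Optuna sketch (my illustration; train_and_score is a hypothetical stand-in for launching a training run with the sampled parameters and scoring the resulting LoRA):

import optuna

def train_and_score(learning_rate, network_dim):
    # Hypothetical stand-in: in practice this would launch a Kohya SS run
    # with these parameters and return a quality metric for the result.
    return -abs(learning_rate - 1e-4)

def objective(trial):
    learning_rate = trial.suggest_float("learning_rate", 1e-5, 1e-3, log=True)
    network_dim = trial.suggest_categorical("network_dim", [4, 8, 16, 32])
    return train_and_score(learning_rate, network_dim)

study = optuna.create_study(direction="maximize")  # maximize the quality metric
study.optimize(objective, n_trials=20)
print(study.best_params)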