I have trouble understanding the following lines of code from the file /src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py#L692-L694
if do_classifier_free_guidance:
    noise_pred_uncond, noise_pred_text = noise_pred.chunk(2)
    noise_pred = noise_pred_uncond + guidance_scale * (noise_pred_text - noise_pred_uncond)
I get the part that, when we sample without negative prompts, noise_pred_uncond, as the name suggests, is the noise prediction from the unconditional distribution (the empty prompt ""), and the conditional-minus-unconditional difference (noise_pred_text - noise_pred_uncond) provides “guidance” for the sampling process toward the positive prompt.
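Just to make sure I read the formula correctly, here is a tiny numeric sketch (the tensor values are made up, only the arithmetic matters):

import torch

# made-up predictions for a single latent value
noise_pred_uncond = torch.tensor([0.10])  # UNet output for the empty prompt ""
noise_pred_text = torch.tensor([0.30])    # UNet output conditioned on the positive prompt

guidance_scale = 7.5
# extrapolate away from the unconditional prediction, in the direction of the text-conditioned one
noise_pred = noise_pred_uncond + guidance_scale * (noise_pred_text - noise_pred_uncond)
print(noise_pred)  # tensor([1.6000]) -> pushed well past the conditional prediction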
However, when we sample with negative prompts, noise_pred_uncond becomes a prediction conditioned on the negative prompts, according to the implementation:
if do_classifier_free_guidance and negative_prompt_embeds is None:
    uncond_tokens: List[str]
    if negative_prompt is None:
        uncond_tokens = [""] * batch_size
    elif type(prompt) is not type(negative_prompt):
        raise TypeError(
            f"`negative_prompt` should be the same type to `prompt`, but got {type(negative_prompt)} !="
            f" {type(prompt)}."
        )
    elif isinstance(negative_prompt, str):
        uncond_tokens = [negative_prompt]
    elif batch_size != len(negative_prompt):
        raise ValueError(
            f"`negative_prompt`: {negative_prompt} has batch size {len(negative_prompt)}, but `prompt`:"
            f" {prompt} has batch size {batch_size}. Please make sure that passed `negative_prompt` matches"
            " the batch size of `prompt`."
        )
    else:
        uncond_tokens = negative_prompt
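So, if I understand the batching correctly, the negative-prompt embeddings (or the embeddings of "" when no negative prompt is given) are concatenated in front of the positive-prompt embeddings, so that a single UNet forward pass covers both, and the later chunk(2) splits the output back into those two halves. A rough sketch of what I think happens, with made-up tensor shapes:

import torch

batch_size, seq_len, dim = 1, 77, 768

# embeddings for the negative prompt (or "" when no negative prompt is given)
negative_prompt_embeds = torch.randn(batch_size, seq_len, dim)
# embeddings for the positive prompt
prompt_embeds = torch.randn(batch_size, seq_len, dim)

# the two sets of embeddings are concatenated into one batch
prompt_embeds = torch.cat([negative_prompt_embeds, prompt_embeds])  # shape (2, 77, 768)

# ... single UNet forward pass on torch.cat([latents] * 2) with prompt_embeds ...
noise_pred = torch.randn(2 * batch_size, 4, 64, 64)  # stand-in for the UNet output

# chunk(2) therefore returns the negative-prompt ("uncond") half first
noise_pred_uncond, noise_pred_text = noise_pred.chunk(2)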
I don’t really get the part where the guidance is added on top of a prediction conditioned on the negative prompts. Why don’t we add the guidance between the +ve and -ve predictions to an unconditional prediction instead? Shouldn’t we worry that the final image will possess features described by the negative prompts?
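In other words, the alternative I would have expected (the names here are my own and this is not what the pipeline does) would look something like:

import torch

# hypothetical alternative: three separate UNet predictions instead of two
noise_pred_empty = torch.randn(1, 4, 64, 64)  # conditioned on the empty prompt ""
noise_pred_pos = torch.randn(1, 4, 64, 64)    # conditioned on the positive prompt
noise_pred_neg = torch.randn(1, 4, 64, 64)    # conditioned on the negative prompt

guidance_scale = 7.5
noise_pred = noise_pred_empty + guidance_scale * (noise_pred_pos - noise_pred_neg)

(I realize this would need three UNet passes per step instead of two, but conceptually that is what I would have expected.)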
Thanks!