SDXL custom pipeline - Input to unet? - Why 2 text encoders?

Did you figure it out with the two text encoders?