I found the Diffusers implementation to be problematic, as the results appear abnormal. When running the official example, the following warning is displayed:
'cross_attention_kwargs ['instdiff'] are not expected by AttnProcessor2_0 and will be ignored.'
