Thank you very much for your excellent work on Universal Guidance for Diffusion Models!
I have been attempting to reproduce your forward guidance pipeline from scratch using the 🤗 HuggingFace `diffusers` library. The general logic follows your original design (a minimal sketch of my implementation is included after the list):
- Computing \hat{z}_0 using the denoising UNet;
- Evaluating the task-specific loss (in my case, segmentation loss, in line with your implementation);
- Computing \nabla_{z_t} \mathrm{loss}(c, f(\hat{z}_0));
- Applying that gradient to update the predicted noise.
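For concreteness, here is roughly what my guided step looks like. This is only a sketch of my own reproduction, not your code; names such as `seg_model`, `seg_loss`, `target_mask`, and `strength` are placeholders I introduced myself:

```python
import torch

def guided_noise_pred(unet, vae, scheduler, latents, t, text_emb,
                      seg_model, seg_loss, target_mask, strength=1.0):
    # `t` is an integer timestep; `latents` is z_t in the VAE latent space.
    latents = latents.detach().requires_grad_(True)

    # 1) Predict the noise with the denoising UNet
    eps = unet(latents, t, encoder_hidden_states=text_emb).sample

    # 2) Estimate \hat{z}_0 from z_t and the predicted noise
    alpha_bar = scheduler.alphas_cumprod[t]
    z0_hat = (latents - (1 - alpha_bar).sqrt() * eps) / alpha_bar.sqrt()

    # 3) Decode \hat{z}_0 and evaluate the task-specific (segmentation) loss
    image = vae.decode(z0_hat / vae.config.scaling_factor).sample
    loss = seg_loss(seg_model(image), target_mask)

    # 4) \nabla_{z_t} loss(c, f(\hat{z}_0))
    grad = torch.autograd.grad(loss, latents)[0]

    # 5) Apply the gradient to the predicted noise (forward guidance)
    return eps + strength * grad
```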
However, at seemingly random timesteps, I occasionally encounter NaN values in the gradient.
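For reference, this is the plain check I run on the guidance gradient inside my sampling loop (my own debugging code, continuing the sketch above):

```python
# Flag non-finite gradients before they propagate into the next denoising step.
if torch.isnan(grad).any() or torch.isinf(grad).any():
    print(f"Non-finite gradient at timestep {int(t)}, loss = {loss.item():.4f}")
```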
May I ask:
- Have you (or others) observed similar numerical instability or NaN issues in your own experiments?
- If so, what techniques did you use to mitigate it?
- Or is it possible that there is something wrong in my reproduction setup?
Any insight you could share would be very helpful for debugging and improving the stability of my implementation.