failed daemon leaves workflow in bad state #14715

@rwong2888

Description

Pre-requisites

  • I have double-checked my configuration
  • I have tested with the :latest image tag (i.e. quay.io/argoproj/workflow-controller:latest) and can confirm the issue still exists on :latest. If not, I have explained why, in detail, in my description below.
  • I have searched existing issues and could not find a match for this bug
  • I'd like to contribute the fix myself (see contributing guide)

What happened? What did you expect to happen?

possibly related to #14544

[screenshot attached to the original issue]

I have transient-error retries configured for operations such as pod deletion.

The staging app server errored out with exit status 143, and the other app server's pod was deleted; I'm not sure why. It then looks like the pod was spun back up due to the transient-error retries we have configured for pod deletions, after which it attempted two more retries of the git clone steps. The workflows are now stuck in an inconsistent state: the workflow is marked Failed, but it still has pods running.
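The inconsistent state described above (workflow phase Failed while pods are still running) can be spotted with a short diagnostic script. This is a hypothetical sketch, not part of Argo Workflows itself; it assumes `kubectl` access to the namespace and that pods created by the workflow carry the standard `workflows.argoproj.io/workflow` label.

```python
import json
import subprocess

# Argo Workflows conventions assumed here: the workflow phase lives at
# .status.phase, and pods created by a workflow carry the
# workflows.argoproj.io/workflow label.
TERMINAL_PHASES = {"Succeeded", "Failed", "Error"}
ACTIVE_POD_PHASES = {"Pending", "Running"}


def is_stuck(workflow_phase: str, pod_phases: list[str]) -> bool:
    """True if the workflow reached a terminal phase but pods
    belonging to it are still pending or running."""
    return workflow_phase in TERMINAL_PHASES and any(
        p in ACTIVE_POD_PHASES for p in pod_phases
    )


def check_workflow(namespace: str, workflow: str) -> bool:
    """Query the cluster via kubectl (requires cluster access)."""
    wf = json.loads(subprocess.check_output(
        ["kubectl", "get", "workflow", workflow,
         "-n", namespace, "-o", "json"]
    ))
    pods = json.loads(subprocess.check_output(
        ["kubectl", "get", "pods", "-n", namespace,
         "-l", f"workflows.argoproj.io/workflow={workflow}",
         "-o", "json"]
    ))
    return is_stuck(
        wf["status"]["phase"],
        [p["status"]["phase"] for p in pods["items"]],
    )
```

Run against the example in this report, `check_workflow("frontend", "playwright-simple-site-dg7wr")` would return True whenever the workflow is terminal but pods linger.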

Version(s)

3.7.0

Paste a minimal workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflow that uses private images.

n/a

Logs from the workflow controller

Running into the character limit here, so I'm linking to the logs in a Slack snippet instead:
https://cloud-native.slack.com/archives/C01QW9QSSSK/p1753901691851319?thread_ts=1753901385.723079&cid=C01QW9QSSSK

Logs from in your workflow's wait container

I think garbage collection had already run:


kubectl logs -n frontend -c wait -l workflows.argoproj.io/workflow=playwright-simple-site-dg7wr,workflow.argoproj.io/phase!=Succeeded
No resources found in frontend namespace.
