Collaborator

@ethany-nv ethany-nv commented Dec 2, 2025

Description

In a large cluster with 1000+ pods in the workflow namespace and up to 40k pods across the other namespaces, workflow status updates become extremely slow. This change retrieves pod events only from the workflow namespace, which greatly speeds up workflow status updates to the service.
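
For illustration only, here is a minimal sketch of the difference between a cluster-wide and a namespace-scoped query. It is not the code in this PR. It assumes that "pod events" means Kubernetes `Event` objects whose involved object is a Pod, and that `core_v1` behaves like `kubernetes.client.CoreV1Api` from the official Python client. The helper name `list_pod_events` and the constant `POD_EVENT_SELECTOR` are hypothetical.

```python
from typing import Any, List, Optional

# Assumption: only events attached to Pods are of interest.
POD_EVENT_SELECTOR = "involvedObject.kind=Pod"


def list_pod_events(core_v1: Any, namespace: Optional[str] = None) -> List[Any]:
    """Return pod events, scoped to one namespace when one is given.

    `core_v1` is expected to expose the same methods as
    kubernetes.client.CoreV1Api (hypothetical wiring, not this PR's code).
    """
    if namespace is not None:
        # Scoped query: the API server only returns events from this namespace.
        response = core_v1.list_namespaced_event(
            namespace=namespace, field_selector=POD_EVENT_SELECTOR
        )
    else:
        # Cluster-wide query: returns events from every namespace.
        response = core_v1.list_event_for_all_namespaces(
            field_selector=POD_EVENT_SELECTOR
        )
    return list(response.items)
```

With a real client this might be called as `list_pod_events(client.CoreV1Api(), "workflows")`, where `"workflows"` is a placeholder namespace name.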

A possible follow-up, if necessary, is to add a separate process or worker that retrieves pod events from all namespaces in parallel.
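
One way such a follow-up could look, as a rough sketch under stated assumptions rather than a planned design: fan the per-namespace queries out over a thread pool. The function `fetch_events_in_parallel`, its `fetch` callback, and the `max_workers` default are all hypothetical names chosen for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Iterable, List


def fetch_events_in_parallel(
    namespaces: Iterable[str],
    fetch: Callable[[str], Iterable[Any]],
    max_workers: int = 8,
) -> Dict[str, List[Any]]:
    """Call `fetch(namespace)` for each namespace concurrently.

    Returns a mapping from namespace name to the list of events that
    `fetch` produced for it. An exception raised by any `fetch` call
    propagates to the caller.
    """
    names = list(namespaces)
    if not names:
        return {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as `names`.
        results = pool.map(fetch, names)
        return {name: list(events) for name, events in zip(names, results)}
```

Threads fit here because each call mostly waits on the API server. A separate process, as the description also suggests, would isolate the work from the status-update path more strongly.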

Issue - None

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@ethany-nv ethany-nv requested a review from a team December 2, 2025 19:56
@ethany-nv ethany-nv self-assigned this Dec 2, 2025
3 participants