[WIP] PartitionStoreManager exclusive ownership by Worker role #3606

AhmedSoliman · 2025-07-30T15:13:36Z

This allows the worker to flush rocksdb as soon as the worker role is stopped and before the rest of the system is shut down. In order to achieve this, datafusion queries will always use the remote scanner even on worker nodes. This adds serialization/memory cost that would have been otherwise avoided but potentially can be optimized in future PRs.

Stack created with Sapling. Best reviewed with ReviewStack.

github-actions · 2025-07-30T15:34:07Z

Test Results

7 files ±0 7 suites ±0 4m 51s ⏱️ + 1m 48s
54 tests ±0 53 ✅ ±0 1 💤 ±0 0 ❌ ±0
223 runs ±0 220 ✅ ±0 3 💤 ±0 0 ❌ ±0

Results for commit e713752. ± Comparison against base commit 1ef58b3.

♻️ This comment has been updated with latest results.

This introduces a few crucial changes to how we handle panics in restate. Prior to this change, we would abort the process at panic time without considering a clean shutdown nor rocksdb wal fsync. The summary of changes is as follows: - We now always unwind the stack on panics. TaskCenter is designed to catch panics of important tasks and trigger a clean shutdown and reports a non-zero exit code. - Ensure that on graceful shutdown timeout that we attempt to cleanly flush/shutdown rocksdb manager. This is important to avoid massive backfills of lost memtables on unclean shutdown. - Catch panics at top-level task-center runtime control loop and trigger an emergency rocksdb WAL fsync to ensure that we flush the WAL to avoid loss of in-memory WAL buffer if/when we add support to manual wal flushing in the future. - Makes sure that panics from network connection tasks do not trigger a system shutdown, instead, they are caught and properly logged. This avoids a situation where a network bad request/handler can cause the entire node to panic. - In situations where tracing might have been lost/dropped, ensure that we also log critical information on stderr. This hardens restate against unclean crashes and ensures we perform a clean handoff to other cluster members in case of an unrecoverable crash.

This allows the worker to flush rocksdb as soon as the worker role is stopped and before the rest of the system is shut down. In order to achieve this, datafusion queries will always use the remote scanner even on worker nodes. This adds serialization/memory cost that would have been otherwise avoided but potentially can be optimized in future PRs.

This was referenced Jul 30, 2025

[minor] Durability tracker shouldn't pin partition store manager #3604

Merged

[minor] Worker task management cleanup #3601

Merged

[minor] Minor improvements to partition snapshotting task #3605

Merged

AhmedSoliman force-pushed the pr3606 branch from 75ae7cb to 2d4bc92 Compare July 30, 2025 17:01

AhmedSoliman mentioned this pull request Jul 30, 2025

Remove ingress dedicated runtime #3607

Merged

AhmedSoliman force-pushed the pr3606 branch from 2d4bc92 to 14905f0 Compare July 31, 2025 21:49

This was referenced Jul 31, 2025

[Core] Always unwind and ensure rocksdb's WAL is flushed on panics #3611

Merged

[TaskCenter] spawn_unmanaged_child and scoped cancellations #3612

Merged

AhmedSoliman added 2 commits July 31, 2025 22:56

AhmedSoliman force-pushed the pr3606 branch from 14905f0 to e713752 Compare July 31, 2025 21:57

AhmedSoliman mentioned this pull request Jul 31, 2025

[Core] Use scoped cancellations and work towards clean graceful shutdown #3613

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] PartitionStoreManager exclusive ownership by Worker role #3606

[WIP] PartitionStoreManager exclusive ownership by Worker role #3606

Uh oh!

AhmedSoliman commented Jul 30, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

[WIP] PartitionStoreManager exclusive ownership by Worker role #3606

Are you sure you want to change the base?

[WIP] PartitionStoreManager exclusive ownership by Worker role #3606

Uh oh!

Conversation

AhmedSoliman commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

Uh oh!

AhmedSoliman commented Jul 30, 2025 •

edited

Loading

github-actions bot commented Jul 30, 2025 •

edited

Loading