Synchronise integration tests on IPC run-finished signal instead of sleeps by mostafaNazari702 · Pull Request #5988 · mochajs/mocha

mostafaNazari702 · 2026-05-22T14:52:28Z

PR Checklist

Addresses an existing open issue: fixes 🛠️ Repo: improve speed and reliability of watch mode tests #5714
That issue was marked as status: accepting prs
Steps in CONTRIBUTING.md were taken

Overview

runMochaWatchAsync used fixed 2 seconds delays between each "change" which was slow in CI.
Replace with an IPC mocha:watch:runFinished event to let tests to wait for completion instead of sleeping. Helper now always forks and exposes waitForRunFinished.

codecov · 2026-05-22T15:14:32Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.02%. Comparing base (6695fba) to head (9f670e3).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5988      +/-   ##
==========================================
+ Coverage   80.89%   81.02%   +0.12%     
==========================================
  Files          64       64              
  Lines        4602     4607       +5     
  Branches      976      997      +21     
==========================================
+ Hits         3723     3733      +10     
+ Misses        879      874       -5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

JoshuaKGoldberg

This PR is meant to work on the watch tests, but looking at the two failures in CI, both are:

  1) --watch
       when enabled
         reruns test when file and directory paths under --watch-files are added:
     Error: runMochaWatchAsync: timed out after 6000ms waiting for watch run to finish
      at Timeout._onTimeout (test/integration/helpers.js:490:13)
      at listOnTimeout (node:internal/timers:588:17)
      at process.processTimers (node:internal/timers:523:7)

The strategies of waiting for explicit signals (rather than hardcoded timing) sounds good to me. But it looks like this PR doesn't fully fix the issues.

…leeps

mostafaNazari702 · 2026-05-25T22:02:39Z

// Not ready yet, please review when i Re-request review

mostafaNazari702 · 2026-05-27T01:49:11Z

The in-PR tests failed even after the commit "wait for chokidar to start watching before first run" which was supposed to fix the test timing issues, and then when i pushed tracing instruments, suddenly the tests work, twice....This is very mind-boggling.

We are dealing with a bug that does not want to be caught.

mostafaNazari702 · 2026-05-27T13:40:36Z

We ( as in me ) are now re-running tests to confirm whether i hopefully fixed it to remove the tracing insturments.

Run 1 ( after committing ): 79 successful checks, all have passed and are green.

(Josh commented during this specific stage).

Updates:

Re-run 1: 80 checks, all green and passed.

Re-run 2: 80 checks, all green and passed.

Re-run 3: The issue has appeared again:

[Tests / lint / lint]: Failing after 24s

[Tests / Test integration in all environments / test-node:integration with node.js 20.19.4 on ubuntu-latest]: Failing after 3m

Last edit to this message: Evaluating whether i should give up or not, 6 hours of troubleshooting totally useless and returned partial positive results.

JoshuaKGoldberg · 2026-05-27T13:43:28Z

Swell. Whenever you think it's ready, feel free to re-request my review & mark this as ready / not draft. Exciting!

mostafaNazari702 · 2026-05-27T15:47:55Z

Swell. Whenever you think it's ready, feel free to re-request my review & mark this as ready / not draft. Exciting!

I recommend to read this comment first before continuing to read.

After instrumenting both sides and comparing passing vs failing CI runs, the issue turned out not to be in our code or chokidar itself but a linux inotify limitation.

inotify watches are per-inode and non-recursive so when a new subdirectory is created chokidar only starts watching it AFTER receiving the parent's IN_CREATE event. Any file events inside that new subdir before the child watch is installed are missed. Our test lands its touchFile exactly in that race window.

This matches prior chokidar work ( that made me give up after finding and going through them):

Race condition when watching dirs leads to missed files paulmillr/chokidar#1112 describes the same race condition
Fix for a race condition on Linux paulmillr/chokidar#1228 proposed a rescan-after-watch-install fix, but it was never merged/ported after the v4 rewrite
Files not detected if directory does not exist before watching paulmillr/chokidar#1422 and added CNAME record, fixes #1436 #1438 show the issue is still reproducible on Linux/Docker/Electron

i also checked alternatives (@parcel/watcher, nsfw, Watchman) and they all have equivalent limitations or unresolved races, so this doesn't appear solvable at the userspace watcher level.

So im stopping further fix attempts here as i have truely put a lot of effort that i should not have ( i don't mean that it does not deserve my time, but rather that i should have been smarter and actually tried to google my issue and find the impossibleness of this issue ). the IPC handshake changes are still a real improvement and stabilize the rest of the watch suite, this one test just hits the kernel race window.

mark-wiemer · 2026-05-28T21:42:56Z

a linux inotify limitation.

On main, these tests currently fail only on Windows (#5361), have we at least solved that? If we've solved that and introduced an issue unique to Linux (where these tests have passed consistently for the past 6 months), then I may have some ideas:

Our test lands its touchFile exactly in that race window.

Can we, um, move outside of this window? That is, add a 500-millisecond sleep (or whatever works) back into the test to workaround this? I know the PR title is currently "Synchronise integration tests on IPC run-finished signal instead of sleeps" but if we add "when possible" to the end I'm still a happy camper with passing tests.

Of course, would this open us up to no longer catching failing scenarios? If so we should detail those and see what we can do.

(I'm almost back to 100% capacity from my disability, haven't looked at this code yet, but I definitely want to fix this bug!)

…iments

mostafaNazari702 · 2026-05-28T23:26:59Z

On main, these tests currently fail only on Windows (#5361), have we at least solved that?

Windows is fixed, on this PR, the watch integration suite is green on windows-latest across node 20/22/24 in every CI re-run (and locally), where main is the flaky one (#5361). the watch suite is also faster locally (44 seconds in my branch vs 2 mins in main), which was the original goal of #5714

I have decided to drop my Linux-kernel-related issues and focus solely on #5714 and its goal. That issue will need a new and separate issue that addresses it. Only remaining failure is intermittent on ubuntu Node 20, the "…file and directory paths under --watch-files are added"-test.

…dd watch test

mostafaNazari702 · 2026-05-29T01:19:06Z

Fails again. I can not do anything more in here unfortuantely.

mark-wiemer · 2026-06-02T01:57:25Z

Yes, please do feel free to step away if you're ever frustrated with a PR :)

mostafaNazari702 · 2026-06-02T14:54:58Z

Yes, please do feel free to step away if you're ever frustrated with a PR :)

I wish you the best of luck with this PR, my friend. The best that can be done is basically extending the timeout time. For now, signal-based is not 100% feasible but you are the expert!

mostafaNazari702 force-pushed the watch-test-sync branch from 4b58f06 to bdb63e0 Compare May 22, 2026 15:08

JoshuaKGoldberg requested changes May 25, 2026

View reviewed changes

JoshuaKGoldberg added the status: waiting for author waiting on response from OP or other posters - more information needed label May 25, 2026

JoshuaKGoldberg reviewed May 25, 2026

View reviewed changes

Comment thread lib/cli/watch-run.js Outdated

mostafaNazari702 mentioned this pull request May 25, 2026

Testing, do not merge, read commit message #6018

Closed

3 tasks

mostafaNazari702 force-pushed the watch-test-sync branch from bdb63e0 to 76376b6 Compare May 25, 2026 21:50

Synchronise integration tests on IPC run-finished signal instead of s…

4ba8e3c

…leeps

mostafaNazari702 force-pushed the watch-test-sync branch from 76376b6 to 4ba8e3c Compare May 25, 2026 21:52

mostafaNazari702 added 2 commits May 26, 2026 13:54

backup

40a70d3

test for mochajs#5988 linux failure

55bd5c4

mostafaNazari702 changed the title ~~Synchronise integration tests on IPC run-finished signal instead of sleeps~~ Synchronise integration tests on IPC run-finished signal instead of sleeps May 26, 2026

mostafaNazari702 added 2 commits May 27, 2026 02:29

wait for chokidar to start watching before first run

522ae68

watch lifecycle for linux investigation

828267f

mostafaNazari702 changed the title ~~Synchronise integration tests on IPC run-finished signal instead of sleeps~~ Synchronise integration tests on IPC run-finished signal instead of sleeps May 27, 2026

JoshuaKGoldberg marked this pull request as draft May 27, 2026 04:21

mostafaNazari702 added 2 commits May 27, 2026 13:46

defer rerun on addDir until chokidar installs the new subdir watch

9161366

replacing setImmediate global with node:timers/promises.setImmediate

efca92d

Remove investigation tracing and revert production watch-run.js exper…

d5cdbe9

…iments

mark-wiemer removed the status: waiting for author waiting on response from OP or other posters - more information needed label May 28, 2026

mostafaNazari702 force-pushed the watch-test-sync branch from 0f3d2a2 to d5cdbe9 Compare May 29, 2026 00:17

bridge inotify watch-install window with a bounded sleep in the dir-a…

9f670e3

…dd watch test

mark-wiemer self-assigned this Jun 2, 2026

GrahamCampbell mentioned this pull request Jun 10, 2026

chore: synchronize watch integration tests on observed runs instead of sleeps #6058

Merged

3 tasks

Uh oh!

Conversation

mostafaNazari702 commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Checklist

Overview

Uh oh!

codecov Bot commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

JoshuaKGoldberg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mostafaNazari702 commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mostafaNazari702 commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mostafaNazari702 commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JoshuaKGoldberg commented May 27, 2026

Uh oh!

mostafaNazari702 commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mark-wiemer commented May 28, 2026

Uh oh!

mostafaNazari702 commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mostafaNazari702 commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mark-wiemer commented Jun 2, 2026

Uh oh!

mostafaNazari702 commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mostafaNazari702 commented May 22, 2026 •

edited

Loading

codecov Bot commented May 22, 2026 •

edited

Loading

mostafaNazari702 commented May 25, 2026 •

edited

Loading

mostafaNazari702 commented May 27, 2026 •

edited

Loading

mostafaNazari702 commented May 27, 2026 •

edited

Loading

mostafaNazari702 commented May 27, 2026 •

edited

Loading

mostafaNazari702 commented May 28, 2026 •

edited

Loading

mostafaNazari702 commented May 29, 2026 •

edited

Loading