Fix stale transform copy-chain leaks #3290
Merged
angeloskath merged 1 commit into ml-explore:main on Mar 24, 2026
Conversation
zcbenz requested changes on Mar 22, 2026
Force-pushed 5fdb196 to 314a742
zcbenz (Collaborator) approved these changes on Mar 23, 2026 and left a comment:
I wonder if we should make it a new primitive, but I think current approach is good enough. 👍
Fixes ml-explore#2841

vjp() and jvp() wrap each primal in copy(p, s) to create tracers. When user code stores a tracer into an external container and feeds it back as a primal in the next call, each iteration nests another Copy:

call 1: container[0] = copy(original)
call 2: container[0] = copy(copy(original))
call N: container[0] = copy^N(original)

The container keeps the head alive, which transitively keeps every intermediate Copy node alive: linear memory growth per call.

Fix: before creating the new tracer copy, peel off one stale Copy wrapper (a non-tracer Copy primitive with inputs). Active tracers (is_tracer() = true, from nested transforms) are never unwrapped, preserving nested-transform semantics. Copy's VJP is the identity, so flattening is gradient-safe.
Force-pushed 314a742 to 3e253ce
angeloskath (Member) approved these changes on Mar 24, 2026 and left a comment:
This looks good thanks.
However, it is mostly a convenience for the user imho, because the pattern that exhibits the issue has a bug: it doesn't evaluate the container. If it did, there would be no copy-chain.
Just to make it clear why the problem is not the copy. The following code will "leak" even with the "fix".
```cpp
auto grad_fn = grad([&container](const std::vector<array>& inputs) {
  container[0] = 2 * inputs[0];
  return sum(inputs[1]);
});
```

container[0] will no longer be just a copy but 2x of a copy, so we can't extract it from the input.
Another way of seeing this is that just calling the forward pass in a loop will end up building huge chains of arrays if you never evaluate anything.
vjp() and jvp() wrap each primal in copy(p, s) to create tracers. When user code stores a tracer into an external container and feeds it back as a primal in the next call, each iteration nests another Copy. The container keeps the head alive, which transitively keeps every intermediate Copy node alive: linear memory growth per call.

Fix: before creating the new tracer copy, unwrap_stale_copy_wrappers() peels off Copy nodes that have is_tracer() = false, collapsing the chain to depth 1. Active tracers (is_tracer() = true, from nested transforms) are never unwrapped, preserving nested-transform semantics. Copy's VJP is the identity, so flattening is gradient-safe.