[p5.strands] Significant refactor for p5.strands #8009

lukeplowden · 2025-07-30T12:05:27Z

Addresses #7868

Changes:

This (draft) PR is for a significant refactor for p5.strands I've been working on for the past month. Thank you to the contributors in other issues who have been patient in waiting for this update as it has blocked some progress in other areas. And thanks to all in Discord showing an interest too. I would love to get the thoughts of those who have been interested in contributing to p5.strands thus far (or any newcomers). This refactor is all about developer ergonomics for p5.strands:
@LalitNarayanYadav @perminder-17 @reshma045 @pratham-radadiya @ShaunakMishra25 @Orsenna187

At current, the refactor is just missing swizzles, a slight change needs to be made to the transpiler to make Unary operations work. Then, I need to do a once over and remove any extra types etc. which are left over from earlier stages in this refactor.

Overview of the refactor

The main purpose of this refactor is to make it more extendable to WGSL in the future, to modularise for developer ergonomics generally, and to make tests and FES easier to re-implement. It separates concerns throughout the p5.strands architecture and adds a much clearer type system. By modularising the codebase, a more straightforward roadmap and contributor documentation can be written up for p5.strands. More on that at the end of this PR, and I will leave some stubs for new issues related to this.

Entry point

p5.strands is still accessible through the same p5.Shader.modify method. The function override for this now exists in p5.strands.js, however. This file also initialises a strandsContext object, and also initialises the user API with this context. In the future, this file could potentially override createShader().

User API

The user API in strands_api.js includes all of the hooks, i.e. methods available p5.Shader.modify() such as getWorldInputs() , getFinalColor() and so on.

It also includes StrandsNode, a simplified class as compared to the previous implementation. Previously, the user had handles to classes derived from BaseNode. There were between 10-15 of these, each with slightly different methods and data, to handle all edge cases for both operations and also for code generation. This was confusing for developer experience, but also created the problem that it was hard to know where to document strands features, and what to document.

The StrandsNode class only contains user facing methods, like .add(), .mult(), and members for swizzling such as .xyz, .rrg etc. Apart from that, it has a this.id which corresponds to an ID in the compilers Intermediate Representation. More on this later, but overall the user API is less tied to backend specifics now.

This file also contains a few more functions like type constructors (vec3, float, also now with ivec3, bool etc.), strandsIf() and discard() which are in progress, and (now I'm reminded I need to add this:) instanceID() as before.

Finally, it also pulls in functions from strands_builtins.js. These are similar as in the previous implementation, except now with a more robust type system which is explained below. @LalitNarayanYadav, you might be interested in reviewing this and potentially re-porting lerp here (sorry!) and copying noise across too, which shouldn't need to change!

Stages of the compiler

The p5.strands compiler is broken more clearly into separate stages. These are similar, but a bit different, to the classic three stages of a compiler. Previously, These stages were shared between the BaseNode class and its children, the ShaderGenerator class, and the p5.Shader.modify() method. The resulting codebase was becoming difficult to extend, and also difficult to summarise.

1. Front-end: Transpile Stage

Overview: Transpiles from the p5.strands 'language' to the JavaScript API.
Files: strands_transpiler.js
External Dependencies: ESCodegen and Acorn

adds operator overloading to allow normal JS operators ([], +, -, == etc) to work on Strands Nodes.
It works by using Acorn to generate an AST, traversing the AST and replacing nodes. Then, it uses ESCodegen to turn this back into code.

2. Middle-end: Building the Intermediate Representation (IR)

Overview: Builds graphs which represent the user's code
Files: ir_dag.js, ir_cfg.js, ir_types.js, ir_builders.js

ir_builders.js is one step beyond the User API file. The functions in here do most of the heavy lifting in building up the IR graphs. All of the functions in the User API call to here.

When the user calls methods like .add() or vec3(), they are returned a user facing StrandsNode as mentioned above. However, this also builds a node in the IR's directed acyclic graph (DAG), which model data dependencies, and records its existence in the control flow graph (CFG), which models data flow. These graphs are implemented in the ir_dag.js and ir_cfg.js files respectively.

The users nodes are handles to nodes in the DAG. So this includes variables and operations (that's it for the most part). There are no 'no-ops' at current. Inside of strandsIf(), a new 'basic block' is made in the CFG. The strandsContext (via the builder functions) keeps track of the current block, and any user instructions (like a function call or addition) are recorded in the current block.

The ir_types.js file has a number of pseudo enums and look up tables for different types. These include BlockType or basic blocks, NodeType for variables vs operations (maybe name is too ambiguous now but use if obvious), etc.

The most obvious (and complex) of these are DataType's which model types such as float, int and their vector variations. As a shader DSL based in JS, I've arrived at objects with a shape: { baseType: 'float', dimension: '1', priority: '3' } etc. Therefore you can compose a final shader type by doing node.baseType + node.dimension, which just separates our types from GLSL a bit for down the road.

Once the user's code has finished running and all of the graphs are built, we do a topological sort on the CFG. We are able to topo sort because, although there are kind of back-edges in the graph, we don't really need to model goto's purely, we just need to output code gen if() in the codegen. This is still a work in progress, however.

3. Back-end: Code generation

Overview: Generates GLSL code from the intermediate representation
Files: strands_codegen.js, strands_glslBackend.js

This does as it says: generates GLSL code from the IR. We currently do the CFG sort in this section, and create generationContext object to store our lines of generated code, and temporary variable names. Next, we loop over the basic blocks and output the code for each visited node.

We only have to use some of the same types from the IR, but most of the heavy lifting is already done (as mentioned) and the code output is relatively simpler code. It is similarly structured to Acorn's visitor functions: we define an object with different visitor functions for different node types.

Importantly, the WGSL implementation should be a similarly simple process to add, and could be done by a direct port of the ``strands_glslBackend.js` file.

FES file

I have also disabled and reenabled FES in the strandsContext object as before, however I have also added a temporary strands_FES.js file here. There are several places in which I have added user errors, but I'm not sure on the best approach for this and have to look more deeply at the rest of FES before overriding it.

Next steps / input

Right now, there are few classes (only user facing ones, in order to have chainable methods easily). I was reading about data oriented design whilst making this (not saying its perfect) but ended up having few classes because of this. It also means that the graphs are structs of arrays, and nodes are just indices into them. If people feel strongly I can refactor these into classes. For example, strandsContext could become class StrandsRuntime or similar:

function initStrandsContext(ctx, backend) {
    ctx.dag = createDirectedAcyclicGraph();
    ctx.cfg = createControlFlowGraph();
    ctx.uniforms = [];
    ctx.hooks = [];
    ctx.backend = backend;
    ctx.active = true;
    ctx.previousFES = p5.disableFriendlyErrors;
    p5.disableFriendlyErrors = true;
}

I'm not sure how I feel about the current (broken) approach to swizzling. Maybe it was better to have Proxy objects as in the previous implementation. I don't like attaching hundreds of members of xyzw permutations to the StrandsNodes prototype, what do you think @davepagurek ?
There are fair number of new files and a new strands folder added to the repo in this PR. How do you feel about that and also naming conventions @ksen0?
In the coming weeks, this writeup could be adapted into a proper contributor docs outlining all of this in a succinct (and visual way).
It could be neat to represent the IR graphs in a p5.sketch (a shader) and use this as a visual test. Visualising the language as much as possible will help contributors to understand how its working.
Once I have figured out the if statements properly and finally, loops would follow a similar structure and a good issue for somebody to tackle if they want to
There is a possibility to optimize the shader code after the IR is built, for example there's a template for constant folding already in ir_types. I'm just not sure whether this will actually optimize anything, or whether the respective backend compiler (GLSL/ WGPU) will do a better job anyway.
As mentioned, would be good to get somebody from FES @IIITM-Jay looking at this at some point, although no rush for now.
As the type system mas matured, we are more capable of defining the input structs to the hooks in our IR. I can see a possibility that more and more of our internal shader workings could run on p5.strands

Will write anything down here as I think of more

PR Checklist

npm run lint passes
Inline reference is included / updated
Unit tests are included / updated

…gnments)

…not p5 defined structs such as Vertex inputs)

…strands-refactor

…ready a node.

…k on swizzles

lukeplowden · 2025-08-05T13:02:28Z

Apologies for the wall of text & let me know if any of this seems wrong, it's pretty complex and I'm drawing from compiler theory, but it is a bit unique as we don't parse the code. P.s. I will address the reviews above in code shortly.

Overall, this should be more comprehensible than the previous implementation. There are some fine details to be clarified as you mentioned, and it could be that the two graphs need more specific names.

Overview:

Basic blocks (CFG nodes) are created by user functions like getWorldInputs (function block) and If() etc.
These push a new block to be generated.
Variables, operations etc. (DAG nodes) are created by user functions like vec3, + etc.
These push instructions into the new block
At code gen time, we can just loop over block->block instructions and the code is already ordered

Do these have to be sorted by DAG order to be valid? (Are these naturally stored in sorted order already?)
The whole process is self sorting, because the users function necessarily create the nodes in order. We flatten all users code so that there are no jumps, i.e. loops and if's are transpiled out. This is something I recently realised about the DAG, which I was previously sorting. And I'm now realising the same is true for the CFG. I'll remove the sorting for the CFG.

The DAG doesn't store states of variables perse, as it doesn't track names. We would need to do something like createFloat('a', value) in order to have the a' or a0 concept from the diagram implemented in the DAG, as far as I understand the problem. That would be static single assignment form and requires a symbol table, which would usually be made during parsing. Currently we flush every used operation as a temporary variable, in case it is altered down the line.

What the DAG does is track data dependencies in a simple form, as in op(a + b) depends on a, and b. We already know that nodes which came before exist, because as stated we flatten the users code and JS runtime creates the nodes in execution order. So we just need to know what to print.

The control flow graph, which is maybe not the most accurate name, indicates that a certain code generation pattern is required for something related to control flow. I.e. output if () { and increase the indent. The inner body of conditionals is not implemented yet.

So are the control flow graph nodes sort of like the big IfElse block in there but that also draw a line around which values should be within the different parts of the if?

Yes, exactly. And the example would be like your first one, because the condition has its own block, and the c value would be recorded in the body of the if statement.

let a = 1
let b = 2
if (b > 1) {
  let c = 3
  a = c
}
return a

We will capture names in the If() case:

let assignments = If(condtion, () => {
  x: x.add(10),
  y: x
}).Else(() => {
  x,
  y,
});
{x, y} = assignments;

We could either use this context and introduce a symbol table (this is more 'correct' and probably more stable), or just grab the temp name for the node during codegen and sneakily reassign it. We wouldn't add names too all nodes, only if they're modified in this way.

…pes to strings for debug, attach API to fn instead of window

…strands-refactor

lukeplowden · 2025-09-15T16:49:34Z

Hey @davepagurek, I think this is pretty close now. When you have some time to review, let me know if there are any final changes. Maybe it is good to collapse the changes into fewer new files, for example.

…n hooks didnt cast from float->vec types

…s logged a warning

davepagurek · 2025-09-16T23:52:23Z

src/strands/ir_builders.js

+
+      const dim = target.dimension;
+
+      const lanes = new Array(dim);


are "lanes" like the underlying values for each dimension, which may go under different name aliases? e.g. lane 0 could be x or r? (If so maybe add a comment explaining)

davepagurek · 2025-09-16T23:53:10Z

src/strands/ir_builders.js

-      return Reflect.set(...arguments);
+
+      for (let j = 0; j < chars.length; j++) {
+        const canonicalIndex = basis.indexOf(chars[j]);


Possibly also worth a comment tying this back to the lane definition. e.g. map x and r to 0, y and g to 1, etc

davepagurek · 2025-09-16T23:56:16Z

src/strands/ir_builders.js

+
+      target.id = newID;
+      if (typeof onRebind === 'function') {
+        onRebind(newID);


This is the bit that ensures dependency ordering works after reassigning a property right? Maybe comment around here explaining what this is for

davepagurek · 2025-09-16T23:56:52Z

src/strands/strands_api.js

          get() {
            const propNode = getNodeDataFromID(dag, dag.dependsOn[structNode.id][i])
-            return createStrandsNode(propNode.id, propNode.dimension, strandsContext);
+            const onRebind = (newFieldID) => {


Worth reiterating what rebinding is for here too from the member assignment section

davepagurek · 2025-09-17T01:03:35Z

src/strands/ir_types.js

+};
+
+export const StructType = {
+  Vertex: {


Ok I think this is actually the last code thing we need: rather than hardcoding these, we need to map from the output of the hook info function to something in this format. There's one failing test and it's due to this

…strands-refactor

lukeplowden added 30 commits June 24, 2025 16:47

syntax/ remove unneccessary

23ff7e6

blocking out new modular strands structure

1511ffb

chipping away at DOD approach.

604c2dd

nested ifs

8950817

if/else semi working

f6369e7

change if/elseif/else api to be chainable and functional (return assi…

a355416

…gnments)

binary ops and contructors prototyped

3e1e149

simplify type system

f718717

SSA

24f0c46

Return type checking for hooks with native types reimplemented (i.e. …

0851285

…not p5 defined structs such as Vertex inputs)

declarations moved to backend, hook arguments fixed

9b84f6f

rename file

8509231

update api imports for new filename

47eda1a

move extractTypeInfo and rename to extractNodeTypeInfo

1088b4d

rename files for clarity

87e8a99

builtin function overloads type checking

e32fd47

function calls partially reimplemented. Still needs more error checking.

11a1610

update function calls to conform parameters when raw numbers are handed

e8f03d6

adding struct types

1ddd9a2

adding struct types

f3155e6

Merge branch 'strands-refactor' of github.com:lukeplowden/p5.js into …

babedfd

…strands-refactor

struct types working

afff707

comment old line. Should revisit structs if needs optimisation.

2e70e0e

fix wrong ID in binary op node

6d5913a

fix bug with binary op, and make strandsNode return node if arg is al…

2745bda

…ready a node.

fix function call bugs

4133fae

remove dag sort, use basic block instructions instead. Also start wor…

b3ce3ec

…k on swizzles

syntax/ remove unneccessary

9ebf77e

blocking out new modular strands structure

faae3aa

chipping away at DOD approach.

f6783d2

lukeplowden added 3 commits August 5, 2025 14:09

remove CFG sorting, make merge block use default behaviour, change ty…

347900f

…pes to strings for debug, attach API to fn instead of window

remove old file and imports

1ddd5f8

Merge branch 'strands-refactor' of github.com:lukeplowden/p5.js into …

085d1b8

…strands-refactor

lukeplowden mentioned this pull request Aug 29, 2025

[p5.strands] Reenabling and implementing FES in strands #7899

Open

17 tasks

ksen0 moved this from Open for Discussion to In Progress in p5.js 2.x 🌱🌳 Sep 3, 2025

ksen0 removed this from p5.js 2.x 🌱🌳 Sep 3, 2025

ksen0 removed this from the 2.1 milestone Sep 3, 2025

lukeplowden added 2 commits September 11, 2025 18:30

bug fixes, swizzle reads working, swizzle writes WIP

f806006

fix textures, struct bugs, and add swizzle assign.

d2c17af

lukeplowden marked this pull request as ready for review September 15, 2025 16:47

remove old shadergenerator file

d5c7fe8

lukeplowden added 3 commits September 15, 2025 17:51

remove dev console.log

100304f

add instance mode changes, fix bug where struct properties returned i…

2b863e6

…n hooks didnt cast from float->vec types

mark atan as p5 function, prevent bug where using atan outside strand…

6462345

…s logged a warning

lukeplowden self-assigned this Sep 16, 2025

lukeplowden added 2 commits September 16, 2025 20:06

add back documentation

37abf7f

add todo for internal parser options

f3afffc

davepagurek reviewed Sep 16, 2025

View reviewed changes

davepagurek added 3 commits September 16, 2025 20:49

Merge branch 'dev-2.0' into strands-refactor

d4d968a

Fix issue with strands being immediately active

bf92d1c

Add back alias for previous uniformVector2 syntax

a269bd7

davepagurek reviewed Sep 17, 2025

View reviewed changes

lukeplowden and others added 5 commits September 18, 2025 11:58

add comments for clarity on swizzling and onrebind

18eb43c

Merge branch 'strands-refactor' of github.com:lukeplowden/p5.js into …

bae0545

…strands-refactor

Parse hookTypes into a strands codegen type

e81920f

Merge branch 'dev-2.0' into strands-refactor

f43770c

Merge atan test file into trigonometry tests and get it working

ecc8061

davepagurek merged commit d2cda2a into processing:dev-2.0 Sep 18, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[p5.strands] Significant refactor for p5.strands #8009

[p5.strands] Significant refactor for p5.strands #8009

Uh oh!

lukeplowden commented Jul 30, 2025 •

edited

Loading

Uh oh!

lukeplowden commented Aug 5, 2025 •

edited

Loading

Uh oh!

lukeplowden commented Sep 15, 2025

Uh oh!

davepagurek Sep 16, 2025

Uh oh!

davepagurek Sep 16, 2025

Uh oh!

davepagurek Sep 16, 2025

Uh oh!

davepagurek Sep 16, 2025

Uh oh!

davepagurek Sep 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[p5.strands] Significant refactor for p5.strands #8009

[p5.strands] Significant refactor for p5.strands #8009

Uh oh!

Conversation

lukeplowden commented Jul 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Overview of the refactor

Entry point

User API

Stages of the compiler

1. Front-end: Transpile Stage

2. Middle-end: Building the Intermediate Representation (IR)

3. Back-end: Code generation

FES file

Next steps / input

PR Checklist

Uh oh!

lukeplowden commented Aug 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lukeplowden commented Sep 15, 2025

Uh oh!

davepagurek Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

davepagurek Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

davepagurek Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

davepagurek Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

davepagurek Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lukeplowden commented Jul 30, 2025 •

edited

Loading

lukeplowden commented Aug 5, 2025 •

edited

Loading