Unified traversal #CT-997 #2198

ubik2 · 2025-12-04T21:10:55Z

Changes the way we do traversal so the same system can be used by both the cell.get() implementation and the query implementation that determines which linked docs we need.

The cell.get() system, or validateAndTransform doesn't follow asCell links, just creating the cell objects that will let us access those in the future.

The query system does follow those links to determine what we need to have available.

To unify tracking, the query system was re-written to use the transaction interface we use for reactivity instead of directly accessing storage.

There's a lot more link magic that needed to go on for the validateAndTransform, and to make this all work, I split the code into a SchemaObjectTraverser portion that knows how to walk objects with rules like the schema, and an IObjectCreator portion that knows how to create the objects.

For the validateAndTransform, that IObjectCreator does all the magic of creating new cells.
For the querySchema, that IObjectCreator does very little, mostly just passing along information about whether the data matched the schema pattern.

Things I intend to address later:

Our typing is unsound. The IObjectCreator returns either the complex mix of proxy and cell objects, or simple JSONValue | undefined depending on the mode.
We're not trying to be perfect with schemas, but many anyOf/allOf that expect to combine object properties may break.
SchemaObjectTraverser should probably switch to a link interface instead of IAttribution.
We really need some documentation on how these cell.get calls work, and what's going on behind the scenes. A lot of the rules when we combine schemas from links that we're following should have unit tests.

Summary by cubic

Unifies traversal and validation to run on storage transactions and normalizes selectors to start at "value", improving consistency, default handling, and link behavior. Addresses CT-997 by removing tight coupling to manager classes, stabilizing array/link semantics, and preventing actions from running with invalid arguments.

Refactors
- Traversal/validation run on IExtendedStorageTransaction (via ManagedStorageTransaction); removed direct BaseObjectManager usage.
- Selectors and addresses are now rooted at ["value", …]; tests and callers updated.
- Improved schema handling: merges schemas across links, adds limited anyOf merging, applies defaults to missing/undefined values (including top-level), preserves asCell/asStream, follows first array element link when appropriate, unifies additionalProperties behavior, and treats broken links as undefined.
- ContextualFlowControl now supports prefixItems and false schemas, reuses a single instance per traversal for better performance, and tightens legacy cell link detection.
- validateAndTransform delegates to SchemaObjectTraverser and drops the unused sync flag; lowered /memory/transact logs to debug.
- Actions only run when arguments validate; still run if argumentSchema is false.
- Shared schema tracker maps space+doc to schema selectors, acting as a watch list and cache to avoid redundant re-traversals.
- Event streams with false/undefined schemas now use a permissive schema so handlers still run.
Migration
- Update SchemaPathSelector.path to start with "value".
- Use an ExtendedStorageTransaction for traversal instead of passing a manager.
- Note: charm list schema now defaults to [] (no action needed unless relying on undefined).

^{Written for commit b951078. Summary will update automatically on new commits.}

Added the better performance test in this branch

…ion instead of using the ObjectStorageManager classes.

…jects from validateAndTransform are not simply JSONValue objects. - Combined the annotateObject and createObject hooks. - Changed the cycle detector, so that we put in an object immediately, then populate it later. This makes it so we can walk into a object/array with a link to itself, and point the resulting object at itself, adding the special fields when we return. - Fix typo in Replica log message (already exists) - Addd TransformObjectCreator which implements the callbacks needed for SchemaObjectTraverser with the validateAndTransform functionality. - Added new validateAndTransform implementation, temporarily switching between them.

…on existence of properties field.

- Convert SchemaQuery SchemaPathSelectors to have the "value" prefix before passing them to traverse code. Do the same for the selectors created for validateAndTransform. - Changed factValue passed to getAtPath to not narrow to value

…thout value

…stead of just when undefined), since that was the previous behavior Tidy up the link value path in validateAndTransform

…ll use a cell, and not a query result proxy.

Added applyDefault helper function to SchemaObjectTraverser Added test for top level default

…ors.

Partial pass at type cleanup (still bad) Added temporary console logs Added cell test for defaults Added check for non-empty fact address path in loadFactsForDoc.

…ls (or spell or TYPE links). These should not be included for client traversal, and will break unit tests if we do, because we end up having a read assertion for a recipe that wasn't written.

…ink with the schema in our parent context. Use the id/type/path from the data. For server traversal, I should switch over to this as well, but I'm going to wait until more things work.

Move schema combination logic into traversePointerWithSchema

Added missing boolean handling to traverseWithSchemaContext

Fix typo in cache.ts error message Cleaned up some tests to be more correct

Code for array post-processing to follow first link (disabled) Changed navigate-to.ts to handle getting a proxy object instead of a cell (need to track this down)

…ating a proxy object instead of a cell. This removes the need for the workaround in navigate-to.ts.

…ments in an array. This needed to be done in traverse, since it should happen before we walk into the object.

…ng optional unused seen parameter to new version for now.

seefeldb · 2025-12-16T17:04:34Z

packages/runner/src/traverse.ts

+   *
+   * When properties are not specified in the schema, and additionalProperties
+   * is not specified, we have the standard JSONSchema behavior, where this is
+   * equivalent to additionalProperties of true.


do we actually see this at this point or would have this been collapsed to a true schema earlier?

I think we still see this. I don't think @mathpirate 's transformer code ever generates this style of schema, but you can specify it manually still.

… an error, since these won't be re-run

…e the object isn't a cell (this was just an error). There's some type issues here, but I'm deferring that. When creating an event stream, if the schema is false (which it often is if the event isn't used), create a simple object schema. Otherwise, we don't run the event handlers.

…ect for JSONSchema, but seems like a good compromise to get the bits we want.

…e anyOf matches

… often not present, and we do stricter validation now. Change the counter fallback test, so it doesn't rely on having undefined in the array. Uses null instead to test the same general thing. If the schema to be used for an event stream is undefined or false, change it to true instead. This is a temporary workaround until I track down why I'm not seeing that triggered.

JSONSchema doesn't allow NaN as a number (since JSON doesn't). I could alter traverse to allow this, but I'm just going to alter the test instead, since neither is ideal.

…rts shared.

ubik2 added 30 commits October 16, 2025 14:01

Use a single instance of cfc object when traversing

3d83281

Added the better performance test in this branch

Rework the traverse methods to operate on am IExtendedStorageTransact…

2f338e7

…ion instead of using the ObjectStorageManager classes.

Remove sync flag from validateAndTransform, since it wasn't used.

42df9c6

Just adding WIP in case I forget tomorrow

71637c4

Unified behavior for additionalProperties with behavior change based …

fde025e

…on existence of properties field.

Semi-working state without default or anyOf

0fb2599

Fix another path value mismatch where getAtPath used the link.path wi…

ad9a013

…thout value

Return a QueryResultProxy if the link schema is true or undefined (in…

c50a376

…stead of just when undefined), since that was the previous behavior Tidy up the link value path in validateAndTransform

Fix regression with asCell schema that are true-ish. These should sti…

8508e5f

…ll use a cell, and not a query result proxy.

Handle undefined doc.value in traverseWithSchemaContext

b6b4936

Added applyDefault helper function to SchemaObjectTraverser Added test for top level default

Update query tests to use the new "value" convention for their select…

df0ec18

…ors.

Updated traverse test to use the value in path

59dd84c

Added method to combine schema context

6d2bba4

Partial pass at type cleanup (still bad) Added temporary console logs Added cell test for defaults Added check for non-empty fact address path in loadFactsForDoc.

Use our traverseCells flag to determine whether to include source cel…

303dc5e

…ls (or spell or TYPE links). These should not be included for client traversal, and will break unit tests if we do, because we end up having a read assertion for a recipe that wasn't written.

When traversing cell links, combine the schema from the data in the l…

257d037

…ink with the schema in our parent context. Use the id/type/path from the data. For server traversal, I should switch over to this as well, but I'm going to wait until more things work.

Fix mergeSchemaFlags

1d82fd0

Move schema combination logic into traversePointerWithSchema

Updated test to reflect new additionalProperties/properties behavior

2e4d017

Added missing boolean handling to traverseWithSchemaContext

Improve combineSchema for primitives with flags

07e93be

more tests pass (applying property defaults)

ad668c4

Handle prefixItems in ContextualFlowControl.schemaAtPath

9afec1d

Fix typo in cache.ts error message Cleaned up some tests to be more correct

Removed a bunch of debugging

2828bd2

Code for array post-processing to follow first link (disabled) Changed navigate-to.ts to handle getting a proxy object instead of a cell (need to track this down)

Fix a case where I was treating an asCell schema as undefined and cre…

c65b577

…ating a proxy object instead of a cell. This removes the need for the workaround in navigate-to.ts.

Better asCell/asStream detection

c641881

Better array support in cfc schemaAtPath

d374278

Replicate the client behavior, where we follow the first link for ele…

5b23f5a

…ments in an array. This needed to be done in traverse, since it should happen before we walk into the object.

Leaving the existing validateAndTransform with the _ prefix, and addi…

f6ad462

…ng optional unused seen parameter to new version for now.

Merge branch 'main' into robin/traverse-tx

dd12bb8

Improve FIXME comment

de778f2

seefeldb reviewed Dec 16, 2025

View reviewed changes

Changes based on PR feedback

bee74d7

ubik2 temporarily deployed to ci December 16, 2025 20:34 — with GitHub Actions Inactive

ubik2 had a problem deploying to ci December 16, 2025 20:34 — with GitHub Actions Failure

ubik2 temporarily deployed to ci December 16, 2025 20:34 — with GitHub Actions Inactive

ubik2 had a problem deploying to ci December 16, 2025 20:34 — with GitHub Actions Failure

If the argument to a handler is undefined (invalid schema), log it as…

6bc5508

… an error, since these won't be re-run

ubik2 had a problem deploying to ci December 16, 2025 22:47 — with GitHub Actions Failure

ubik2 temporarily deployed to ci December 16, 2025 22:47 — with GitHub Actions Inactive

ubik2 had a problem deploying to ci December 16, 2025 22:47 — with GitHub Actions Failure

ubik2 had a problem deploying to ci December 17, 2025 08:06 — with GitHub Actions Failure

ubik2 temporarily deployed to ci December 17, 2025 08:06 — with GitHub Actions Inactive

ubik2 added 2 commits December 17, 2025 11:21

Limited implementation of an anyOf merge. This isn't technically corr…

7f0000a

…ect for JSONSchema, but seems like a good compromise to get the bits we want.

Added mergeMatches function to IObjectCreator for integrating multipl…

b549ed1

…e anyOf matches

ubik2 temporarily deployed to ci December 18, 2025 01:08 — with GitHub Actions Inactive

ubik2 had a problem deploying to ci December 18, 2025 01:08 — with GitHub Actions Failure

ubik2 added 4 commits December 18, 2025 12:15

Altered counter-nested-computed-totals test

cd6d644

JSONSchema doesn't allow NaN as a number (since JSON doesn't). I could alter traverse to allow this, but I'm just going to alter the test instead, since neither is ideal.

Break out the mergeAnyOfMatches function, so I can keep the common pa…

d3bf3b6

…rts shared.

Merge branch 'main' into robin/traverse-tx

b951078

ubik2 temporarily deployed to ci December 18, 2025 22:01 — with GitHub Actions Inactive

ubik2 had a problem deploying to ci December 18, 2025 22:01 — with GitHub Actions Failure

ubik2 temporarily deployed to ci December 18, 2025 22:01 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unified traversal #CT-997 #2198

Unified traversal #CT-997 #2198

Uh oh!

ubik2 commented Dec 4, 2025 •

edited by cubic-dev-ai bot

Loading

Uh oh!

seefeldb Dec 16, 2025

Uh oh!

ubik2 Dec 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Unified traversal #CT-997 #2198

Are you sure you want to change the base?

Unified traversal #CT-997 #2198

Uh oh!

Conversation

ubik2 commented Dec 4, 2025 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by cubic

Uh oh!

seefeldb Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

ubik2 Dec 16, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ubik2 commented Dec 4, 2025 •

edited by cubic-dev-ai bot

Loading