Add async iterator on result #234

abelcha · 2025-06-19T01:37:01Z

Summary

This PR adds a high-level abstraction for result streaming that follows JavaScript idioms and sits alongside the existing chunk-based APIs.
It lets you iterate over query results using for await loops.

Usage Example

const result = await connection.run('SELECT * FROM large_table');

for await (const row of result) {
  console.log(row);
}

Features Added

Async Iterator Implementation: Added [Symbol.asyncIterator]() method to DuckDBResult class

Technical Details

The async iterator fetches chunks progressively, reducing memory usage for large result sets
Maintains compatibility with existing DuckDBResult API
Properly handles edge cases like empty results and null values
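
To illustrate the progressive chunk fetching described above, here is a minimal sketch, not the PR's actual implementation. ChunkLike, ChunkSource, iterateRows, and MockSource are hypothetical names introduced for illustration; the sketch assumes a source whose fetchChunk() resolves to a chunk with zero rows (or null) once the result is exhausted.

```typescript
// Hypothetical stand-ins for a result and its chunks (names are assumptions).
interface ChunkLike {
  rowCount: number;
  getRows(): unknown[][];
}

interface ChunkSource {
  // Resolves to the next chunk; zero rows (or null) marks the end.
  fetchChunk(): Promise<ChunkLike | null>;
}

// Yields one row at a time and only fetches the next chunk when the
// consumer has drained the current one.
async function* iterateRows(source: ChunkSource): AsyncGenerator<unknown[]> {
  while (true) {
    const chunk = await source.fetchChunk();
    if (chunk === null || chunk.rowCount === 0) return;
    for (const row of chunk.getRows()) yield row;
  }
}

// In-memory source for demonstration; counts how many chunks were requested.
class MockSource implements ChunkSource {
  fetched = 0;
  private readonly chunks: unknown[][][];

  constructor(chunks: unknown[][][]) {
    this.chunks = chunks;
  }

  async fetchChunk(): Promise<ChunkLike> {
    const rows = this.chunks[this.fetched++] ?? [];
    return { rowCount: rows.length, getRows: () => rows };
  }
}
```

Because the generator is lazy, breaking out of a for await loop early means later chunks are never requested, which is where the memory saving for large results comes from.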

Testing

the tests verify:

Correct iteration behavior
Memory-efficient chunk fetching
Proper handling of edge cases
Early termination scenarios

jraymakers · 2025-06-19T04:50:55Z

Thanks for the PR! This is a very cool idea.

To make it even better, and to fit in with the rest of the API, it should allow iterating over either row arrays or row objects, and support the raw or converted (to JS, JSON, or custom) variants. To make that maintainable, we'd likely need an async chunk iterator as a building block.
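
One way the layering suggested here could look is sketched below. This is only an illustration under assumptions, not the API the maintainer has in mind: RawChunk, RawResult, Convert, chunks, rowArrays, rowObjects, and fakeResult are all hypothetical names, and the single per-value converter is a simplification of the raw/JS/JSON/custom variants.

```typescript
// Hypothetical stand-ins for the result and chunk types (assumptions).
interface RawChunk {
  rowCount: number;
  getRows(): unknown[][];
}

interface RawResult {
  columnNames(): string[];
  fetchChunk(): Promise<RawChunk | null>;
}

// A per-value converter; the identity function stands in for the "raw" variant.
type Convert = (value: unknown) => unknown;
const raw: Convert = (value) => value;

// Building block: an async iterator over chunks.
async function* chunks(result: RawResult): AsyncGenerator<RawChunk> {
  while (true) {
    const chunk = await result.fetchChunk();
    if (chunk === null || chunk.rowCount === 0) return;
    yield chunk;
  }
}

// Row arrays, layered on the chunk iterator.
async function* rowArrays(
  result: RawResult,
  convert: Convert = raw,
): AsyncGenerator<unknown[]> {
  for await (const chunk of chunks(result)) {
    for (const row of chunk.getRows()) yield row.map((v) => convert(v));
  }
}

// Row objects, layered on the row-array iterator.
async function* rowObjects(
  result: RawResult,
  convert: Convert = raw,
): AsyncGenerator<Record<string, unknown>> {
  const names = result.columnNames();
  for await (const row of rowArrays(result, convert)) {
    const obj: Record<string, unknown> = {};
    names.forEach((name, i) => {
      obj[name] = row[i];
    });
    yield obj;
  }
}

// In-memory result used only to exercise the sketch.
function fakeResult(names: string[], chunkList: unknown[][][]): RawResult {
  let next = 0;
  return {
    columnNames: () => names,
    fetchChunk: async () => {
      const rows = chunkList[next++] ?? [];
      return { rowCount: rows.length, getRows: () => rows };
    },
  };
}
```

With this shape, each variant is a thin wrapper over the one below it, so the end-of-result handling lives in a single place.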

If you'd like to give that a shot, go ahead, or I can try to outline the API I have in mind when I get some time.

abelcha · 2025-06-27T12:27:03Z

I tried wiring up support for all the variants, but it adds a lot of code to the codebase, and I feel that’s the kind of call that’s yours to make. This is just a minimal version that could serve as a base.

This binding is already a blessing compared to the first one, and I’d rather not mess it up.

Performance-wise, I was surprised how much per-row object creation adds up. With a template object plus Object.create for each row I got a ~10% improvement, though it’s hard to benchmark. At this level it’s best to let the consumer choose whether to pay that cost.

I’m working on a more experimental, fully typed high-level DuckDB TypeScript runtime, and this is the UX I’ve landed on based on the select return value:

I’m mapping BigInt to Number, which simplifies a lot.
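
As a hedged aside on what such a mapping involves: a Number can only represent integers exactly up to Number.MAX_SAFE_INTEGER, so a BigInt-to-Number conversion either accepts precision loss or checks the range. The sketch below shows the checking variant. toSafeNumber is a hypothetical helper; whether the runtime mentioned above performs any such check is not stated in this thread.

```typescript
// Hypothetical helper: converts a bigint to a number, refusing values that
// cannot be represented exactly as a JavaScript Number.
function toSafeNumber(value: bigint): number {
  const n = Number(value);
  if (!Number.isSafeInteger(n)) {
    throw new RangeError(`${value} cannot be represented exactly as a Number`);
  }
  return n;
}

// Small values convert without loss.
const small = toSafeNumber(BigInt(42));

// Values beyond Number.MAX_SAFE_INTEGER are rejected rather than rounded.
let rejected = false;
try {
  toSafeNumber(BigInt("9007199254740993"));
} catch (err) {
  rejected = err instanceof RangeError;
}
```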

jraymakers · 2025-06-28T05:05:53Z

Yes, the reason for the variants is to provide a choice between convenience and performance. Generally the column-oriented ones are going to perform better than the row-oriented ones, and raw arrays will perform better than objects, but for small results it doesn't matter, and rows and objects can be convenient at times.

Supporting all the variants without a lot of code duplication that's hard to maintain took some iteration. I think it could be done while also supporting async iterators, but it will take some experimentation, which I haven't had time for yet. (I still hope to, though probably not very soon.)

That library/runtime you're building looks interesting. How are you ensuring the results are correctly typed? I'd like to provide better typing for results, but I haven't discovered a good way yet. (See #140.)

abelcha · 2025-07-15T22:34:16Z

I follow a similar approach to convex.dev, where intermediate schemas are written to a local .buckdb/ directory.

Either it inspects .columnTypes() dynamically on first execution or, if you’re in a live environment, it can describe the schema ahead of time (e.g. https://buckdb.pages.dev).

It also codegens phantom types from duckdb_functions() and duckdb_types() to produce full method signatures and static type info for function calls.

Then it uses TS generics to handle joins, CTEs, name aliases, etc. to infer the return value:
src/build.types.ts

btw… are you guys hiring?
I genuinely love DuckDB and would be thrilled to contribute more to it.

Votre Nom added 2 commits June 19, 2025 03:35

Add async iterator

9f28519

Fix async iterator test imports

fc25c2c

abelcha changed the title ~~Add async iterator~~ Add async iterator on result Jun 19, 2025

fixes

c82f392

remove double assert import

6d148ef
