Add support for Aggregates#213
Conversation
|
@wtsnz the audit check can be ignored, but mind looking into the dialyzer issues? |
|
Alright @zachdaniel I think I've got it - could you run the CI again please? |
|
@wtsnz looks like just some reuse errors and compile warnings now. |
a7e1639 to
28b3b48
Compare
|
Alright @zachdaniel - third time's the charm - could you run it again please? Also before we merge this, do you have any thoughts on this - There's two possible approaches to add aggregates into AshSqlite -
Let me know which way you'd like to take it. For what it's worth I've been using the ash_sql aggregate version (2) in my local app to great success! |
|
Ah, right, I like the ash_sql based approach 😄 Lets go with that one instead. |
|
Those PRs are drafts, would you say they are ready for review? |
|
@zachdaniel yes, kinda! Although I was keeping them as draft while we figured out which direction - ash_sqlite vs ash_sql - we wanted to land. I'm just on vacation this next week so I'm happy if you want to close this PR unmerged - and we'll pursue the ash_sql approach when I get back to my desk. |
|
Sounds great! |
Contributor checklist
Leave anything that you believe does not apply unchecked.
This is my first proper dive into extending the Ash ecosystem, so I’m still getting my bearings with some of the internals and conventions.
I needed aggregates for a local app I’m building on SQLite, and spent the last couple of days exploring the existing SQL adapters and getting this branch into shape with Codex helping lots along the way. My goal was to get most of the common usage of aggregates working with sqlite so that I could continue building my app.
I took a look at what AshPostgres supports - the tests helped a lot to get an idea of what to support - and tried to get most of it working in SQLite.
For what it's worth I created this PR with a lot of help from AI; Codex and Claude.
Summary
This covers the aggregate kinds I most wanted to use day-to-day:
countsumavgminmaxexistsfirstlistcustomaggregatesThis also adds
AshSqlite.CustomAggregatethat follows the same general idea asAshPostgres.CustomAggregate- you've just got to make sure that you emit SQLite-compatible SQL.Approach
I originally hoped this could mostly reuse the shared
ash_sqlaggregate machinery, but the existing aggregate planner is built around SQL shapes SQLite does not support cleanly. Lateral joins and PostgreSQL-style list aggregation are the big ones. For SQLite, the safer route (according to the LLMs!) is grouped/windowed subqueries, JSON aggregation for lists, and explicit errors when a filter shape could multiply the rows being aggregated.So this different approach requires us to add our own SQLite-specific aggregate planner in
AshSqlite.Aggregate.The implementation builds aggregate queries as grouped subqueries or windowed subqueries, then joins those results back to the parent query. Scalar aggregates can often share grouped subqueries.
firstandlistneed window/query handling so ordering stays deterministic, andlistuses SQLite JSON aggregation before being decoded back into the loaded aggregate value.A lot of the work here is defensive so that AshSqlite does not generate SQL that returns subtly wrong results. Aggregate filters that join across to-many relationships can accidentally multiply rows and over-count or over-sum. For those cases, I've tried to either use a safe shape or return an explicit unsupported error. I've made sure to add enough tests to be confident in the implementation.
Future potential
This pr is intentionally SQLite-scoped rather than full AshPostgres parity, and already does pretty much everything I need for my project so far.
The main things I’ve left out are cases where SQLite would need a very different query shape, or where the generated SQL could quietly return the wrong answer. That includes things like parent-dependent aggregate filters, manual relationships, some mixed multi-hop/many-to-many paths, and fanout-prone filters over to-many relationships.
Those cases should fail with explicit unsupported errors rather than producing over-counted or over-summed results. The docs include the fuller list of current limitations.
Even more future
After quickly posting a concept of this to the Ash Discord, Zach brought up an interesting idea for a future refactor of aggregates support so that it lives in the ash_sql:
I like the idea, and am open to taking a look after it gets landed in ash_sqlite.
Validation
MIX_ENV=test mix test.resetMIX_ENV=test mix test