Commit a477537
committed
[BugFix] Push eventstats down by rewriting RexOver to Join + Aggregate (opensearch-project#5483)
PPL eventstats lowers to LogicalProject(RexOver(...)) above the scan. No
rule in OpenSearchIndexRules.OPEN_SEARCH_PUSHDOWN_RULES matches that
shape: every AggregateIndexScanRule config requires LogicalAggregate at
the operand root, and RareTopPushdownRule requires a ROW_NUMBER window
with a LESS_THAN_OR_EQUAL filter above it. The plan therefore reaches
Volcano with RexOver intact, gets converted to EnumerableWindow, and the
scan beneath it stays in _source-includes + requestedTotalSize=MAX_INT
mode, streaming every matching document to the coordinator just to
count it. On 47B-doc indices this times out.
This change rewrites Window AST nodes in CalciteRelNodeVisitor.visitWindow
into a Join + Aggregate plan: the right side is an Aggregate over a
re-pushed copy of the input, which matches AggregateIndexScanRule and
pushes down to OpenSearch as size:0 + track_total_hits (no-BY) or a
terms aggregation (BY). The left side returns rows as before. The join
broadcasts the aggregate value(s) onto each row, preserving the row type
[original cols, agg cols] that the legacy lowering produced so
downstream consumers see no shape change.
NULL-bucket semantics:
- bucketNullable=true: INNER join with IS NOT DISTINCT FROM on each
partition key, so the NULL bucket on each side matches and NULL-keyed
left rows still receive the NULL-bucket aggregate value.
- bucketNullable=false: LEFT join with simple equality, IS NOT NULL
filter pushed below the right aggregate to match the BUCKET_NON_NULL_AGG
pushdown shape stats already uses. NULL-keyed left rows survive with a
NULL aggregate value, matching the previous CASE-wrapped behavior.
The rewriteability predicate (canRewriteWindowAsAggregateJoin) rejects
non-aggregate window functions (ROW_NUMBER / LAG / etc.), non-empty sort
lists, non-default frames, and non-bare-field partition keys. Anything
outside the eventstats shape falls through to visitWindowAsRexOver,
preserving existing behavior for any future Window producer.
Follows the precedent in buildStreamWindowSelfJoinPlan: uses Join (not
LogicalCorrelate, which causes NPE in RelDecorrelator per the comment at
CalciteRelNodeVisitor.java:2348-2352) and mirrors the canonical NULL
bucket handling at lines 2442-2449. Reuses aggregateWithTrimming for
the right-side aggregate construction so agg-resolution semantics are
identical to stats and streamstats.
CalcitePPLEventstatsTest verifyLogical expectations are updated to the
new lowered shape. verifyPPLToSparkSQL assertions are temporarily
removed pending observation of the SparkSqlDialect output for the
join+aggregate form; the previous window-form expectations no longer
apply.
Draft: existing CalciteExplainIT eventstats expected-output files and
new NULL-bucket BY integration tests in CalcitePPLEventstatsIT will be
added in follow-up commits once CI confirms the lowered shape is exact.
Resolves opensearch-project#5483
Signed-off-by: Jialiang Liang <ryanleeang@gmail.com>
Signed-off-by: Jialiang Liang <jiallian@amazon.com>1 parent acd4437 commit a477537
2 files changed
Lines changed: 243 additions & 41 deletions
File tree
- core/src/main/java/org/opensearch/sql/calcite
- ppl/src/test/java/org/opensearch/sql/ppl/calcite
Lines changed: 198 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2105 | 2105 | | |
2106 | 2106 | | |
2107 | 2107 | | |
| 2108 | + | |
| 2109 | + | |
| 2110 | + | |
| 2111 | + | |
| 2112 | + | |
| 2113 | + | |
| 2114 | + | |
| 2115 | + | |
| 2116 | + | |
| 2117 | + | |
| 2118 | + | |
| 2119 | + | |
| 2120 | + | |
| 2121 | + | |
| 2122 | + | |
| 2123 | + | |
| 2124 | + | |
| 2125 | + | |
| 2126 | + | |
| 2127 | + | |
| 2128 | + | |
| 2129 | + | |
| 2130 | + | |
| 2131 | + | |
| 2132 | + | |
| 2133 | + | |
| 2134 | + | |
| 2135 | + | |
| 2136 | + | |
| 2137 | + | |
| 2138 | + | |
| 2139 | + | |
| 2140 | + | |
| 2141 | + | |
| 2142 | + | |
| 2143 | + | |
| 2144 | + | |
| 2145 | + | |
| 2146 | + | |
| 2147 | + | |
| 2148 | + | |
| 2149 | + | |
| 2150 | + | |
| 2151 | + | |
| 2152 | + | |
| 2153 | + | |
| 2154 | + | |
| 2155 | + | |
| 2156 | + | |
| 2157 | + | |
| 2158 | + | |
| 2159 | + | |
| 2160 | + | |
| 2161 | + | |
| 2162 | + | |
| 2163 | + | |
| 2164 | + | |
| 2165 | + | |
| 2166 | + | |
| 2167 | + | |
| 2168 | + | |
| 2169 | + | |
| 2170 | + | |
| 2171 | + | |
| 2172 | + | |
| 2173 | + | |
| 2174 | + | |
| 2175 | + | |
| 2176 | + | |
| 2177 | + | |
| 2178 | + | |
| 2179 | + | |
| 2180 | + | |
| 2181 | + | |
| 2182 | + | |
| 2183 | + | |
| 2184 | + | |
| 2185 | + | |
| 2186 | + | |
| 2187 | + | |
| 2188 | + | |
| 2189 | + | |
| 2190 | + | |
| 2191 | + | |
| 2192 | + | |
| 2193 | + | |
| 2194 | + | |
| 2195 | + | |
| 2196 | + | |
| 2197 | + | |
| 2198 | + | |
| 2199 | + | |
| 2200 | + | |
| 2201 | + | |
| 2202 | + | |
| 2203 | + | |
| 2204 | + | |
| 2205 | + | |
| 2206 | + | |
| 2207 | + | |
| 2208 | + | |
| 2209 | + | |
| 2210 | + | |
| 2211 | + | |
| 2212 | + | |
| 2213 | + | |
| 2214 | + | |
| 2215 | + | |
| 2216 | + | |
| 2217 | + | |
| 2218 | + | |
| 2219 | + | |
| 2220 | + | |
| 2221 | + | |
| 2222 | + | |
| 2223 | + | |
| 2224 | + | |
| 2225 | + | |
| 2226 | + | |
| 2227 | + | |
| 2228 | + | |
| 2229 | + | |
| 2230 | + | |
| 2231 | + | |
| 2232 | + | |
| 2233 | + | |
| 2234 | + | |
| 2235 | + | |
| 2236 | + | |
| 2237 | + | |
| 2238 | + | |
| 2239 | + | |
| 2240 | + | |
| 2241 | + | |
| 2242 | + | |
| 2243 | + | |
| 2244 | + | |
| 2245 | + | |
| 2246 | + | |
| 2247 | + | |
| 2248 | + | |
| 2249 | + | |
| 2250 | + | |
| 2251 | + | |
| 2252 | + | |
| 2253 | + | |
| 2254 | + | |
| 2255 | + | |
| 2256 | + | |
| 2257 | + | |
| 2258 | + | |
| 2259 | + | |
| 2260 | + | |
| 2261 | + | |
| 2262 | + | |
| 2263 | + | |
| 2264 | + | |
| 2265 | + | |
| 2266 | + | |
| 2267 | + | |
| 2268 | + | |
| 2269 | + | |
| 2270 | + | |
| 2271 | + | |
| 2272 | + | |
| 2273 | + | |
| 2274 | + | |
| 2275 | + | |
| 2276 | + | |
| 2277 | + | |
| 2278 | + | |
| 2279 | + | |
| 2280 | + | |
| 2281 | + | |
| 2282 | + | |
| 2283 | + | |
| 2284 | + | |
| 2285 | + | |
| 2286 | + | |
| 2287 | + | |
| 2288 | + | |
| 2289 | + | |
| 2290 | + | |
| 2291 | + | |
| 2292 | + | |
| 2293 | + | |
| 2294 | + | |
| 2295 | + | |
| 2296 | + | |
| 2297 | + | |
| 2298 | + | |
| 2299 | + | |
| 2300 | + | |
| 2301 | + | |
| 2302 | + | |
| 2303 | + | |
| 2304 | + | |
| 2305 | + | |
2108 | 2306 | | |
2109 | 2307 | | |
2110 | 2308 | | |
| |||
Lines changed: 45 additions & 41 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
18 | 30 | | |
19 | 31 | | |
20 | 32 | | |
21 | 33 | | |
22 | 34 | | |
23 | 35 | | |
24 | | - | |
25 | | - | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
26 | 41 | | |
27 | | - | |
28 | | - | |
29 | | - | |
30 | | - | |
31 | | - | |
32 | | - | |
33 | 42 | | |
34 | 43 | | |
35 | 44 | | |
36 | 45 | | |
37 | 46 | | |
38 | 47 | | |
| 48 | + | |
| 49 | + | |
39 | 50 | | |
40 | 51 | | |
41 | | - | |
42 | | - | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
43 | 59 | | |
44 | | - | |
45 | | - | |
46 | | - | |
47 | | - | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | 60 | | |
52 | 61 | | |
53 | 62 | | |
54 | 63 | | |
55 | 64 | | |
56 | 65 | | |
| 66 | + | |
| 67 | + | |
57 | 68 | | |
58 | 69 | | |
59 | | - | |
60 | | - | |
61 | | - | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
62 | 76 | | |
63 | | - | |
64 | | - | |
65 | | - | |
66 | | - | |
67 | | - | |
68 | | - | |
69 | | - | |
70 | | - | |
71 | | - | |
72 | 77 | | |
73 | 78 | | |
74 | 79 | | |
75 | 80 | | |
76 | 81 | | |
77 | 82 | | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
78 | 87 | | |
79 | 88 | | |
80 | | - | |
81 | | - | |
82 | | - | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
83 | 96 | | |
84 | | - | |
85 | | - | |
86 | | - | |
87 | | - | |
88 | | - | |
89 | | - | |
90 | | - | |
91 | | - | |
92 | | - | |
93 | 97 | | |
94 | 98 | | |
0 commit comments