[BugFix] Restore dedup pushdown when combined with WHERE clause (#5482) by ryan-gh-bot · Pull Request #5488 · opensearch-project/sql

ryan-gh-bot · 2026-05-29T01:48:13Z

Description

PPL dedup is initially compiled into a ROW_NUMBER() OVER (PARTITION BY field) window pattern with an IS NOT NULL(field) bucket-non-null filter directly above the scan. PPLSimplifyDedupRule folds that pattern into a LogicalDedup which DedupPushdownRule then rewrites into a composite + top_hits aggregation against the shard.

The HEP program registers FilterMergeRule.DEFAULT and PPLSimplifyDedupRule.DEDUP_SIMPLIFY_RULE via addRuleCollection. When a user where precedes dedup, the user's filter sits adjacent to the bucket-non-null filter; FilterMergeRule can merge them into AND(IS NOT NULL(field), <user predicate>). The previous mayBeFilterFromBucketNonNull operand predicate required every conjunct of the AND to be IS NOT NULL($ref) (allMatch), so the merged shape failed the match. PPLSimplifyDedupRule never fired, no LogicalDedup was produced, and DedupPushdownRule had nothing to match — dedup fell through to the in-memory ROW_NUMBER window form on the coordinator, which exceeds timeouts on large indices.

Fix

Loosen the operand predicate at the source rather than reordering HEP rules.

PlanUtils.mayBeFilterFromBucketNonNull now accepts the un-merged IS NOT NULL($ref) shape or any AND whose conjuncts contain at least one IS NOT NULL($ref) (anyMatch instead of allMatch). The predicate is order-independent — the simplify rule fires whether or not FilterMergeRule ran first.
PPLSimplifyDedupRule.apply splits the matched filter's conjuncts into (a) the IS NOT NULL on a partition key (absorbed by LogicalDedup) and (b) everything else (emitted as a separate filter below LogicalDedup), so any user predicate folded into the AND is preserved verbatim. A defensive bail-out: if no IS NOT NULL on a partition key is present in the matched filter, the rule returns without transforming.
PlanUtils.isNotNullOnRef is promoted to public so PPLSimplifyDedupRule.isNotNullOnPartitionKey can reuse it.

Verification

CalcitePPLDedupTest.testDedupAfterWhereProducesLogicalDedup runs the rule sequence (PPLSimplifyDedupRule, FilterMergeRule) on a where + dedup plan and asserts that LogicalDedup is produced regardless of rule order. The user predicate appears as a separate filter below LogicalDedup, ready for downstream pushdown:

LogicalProject(EMPNO=[$0], ..., DEPTNO=[$7])
  LogicalDedup(dedup_fields=[$7], allowed_dedup=1, keepEmpty=false, consecutive=false)
    LogicalProject(EMPNO=[$0], ..., DEPTNO=[$7])
      LogicalFilter(condition=[>($5, 1000)])
        LogicalTableScan(table=[[scott, EMP]])

CalcitePPLDedupTest.testDedupAfterWhereProducesLogicalDedupRegardlessOfRuleOrder exercises the swapped order (FilterMergeRule first) and asserts the same result, pinning the user-visible contract rather than any rule-ordering invariant.
CalciteExplainIT.testDedupAfterWherePushDown (new) — end-to-end against a live cluster: runs source=opensearch-sql_test_index_account | where age > 25 | dedup gender and asserts LogicalDedup is present in the explain output and EnumerableWindow is absent (i.e., dedup is actually pushed down and the in-memory window fallback the bug caused is gone).
All existing CalcitePPLDedupTest cases continue to pass.

Related Issues

Resolves #5482

Check List

New functionality includes testing.
New functionality has been documented.
New PPL command checklist all confirmed.
API changes companion pull request created.
Commits are signed per the DCO using --signoff or -s.
Public documentation issue/PR created.

…search-project#5482) Run PPLSimplifyDedupRule before FilterMergeRule in the HEP optimizer so the bucket-non-null filter PPL emits for dedup is matched as-is. With the previous order, an upstream user where filter sat adjacent to the bucket-non-null filter; FilterMergeRule fired first and merged them into a conjunction that no longer satisfied PPLSimplifyDedupRule's operand predicate, defeating dedup pushdown to the shard. Use sequential addRuleInstance phases for explicit ordering rather than addRuleCollection, which is documented as non-deterministic in firing order. Adds two regression tests in CalcitePPLDedupTest: one that asserts LogicalDedup is produced under the fixed order, and one that pins the buggy behavior under the swapped order. Signed-off-by: ryan-gh-bot <ryan-gh-bot@users.noreply.github.com>

github-actions · 2026-05-29T01:49:21Z

PR Reviewer Guide 🔍

(Review updated until commit `38fc6c3`)

Here are some key observations to aid the review process:

🧪 PR contains tests
🔒 No security concerns identified
✅ No TODO sections
🔀 No multiple PR themes
⚡ Recommended focus areas for review Possible Issue When the filter is only partially pushable, the code bails out without transforming. However, the original dedup pattern (with the filter still present) remains in the plan. If the residual filter is later removed or merged by another rule pass, a subsequent optimizer iteration might re-match this same dedup pattern, potentially causing an infinite loop or redundant rule applications. The code should either mark the node as transformed or ensure the pattern cannot re-match. if (call.rels[1] instanceof LogicalFilter filter) { // Push the filter into the scan, then synthesize an identity project so the standard // apply() can run on the resulting Dedup → Project → Scan shape. If the filter is only // partially pushable, pushDownFilter returns a Filter wrapping the residual condition over // the new scan; we can't strip a residual filter without breaking semantics, so bail. final CalciteLogicalIndexScan scan = call.rel(2); if (!(scan.pushDownFilter(filter) instanceof CalciteLogicalIndexScan newScan)) { return; } RelBuilder relBuilder = call.builder(); Possible Issue If `dedupColumns` contains a non-RexInputRef element (e.g., a literal or expression), `partitionKeyIndices` will not include it, but the dedup pattern may still be valid. The defensive check at line 148 will bail out even though a valid IS_NOT_NULL conjunct on an actual partition key might exist. This could silently skip legitimate dedup simplifications when partition keys include expressions. Set<Integer> partitionKeyIndices = new HashSet<>(); for (RexNode key : dedupColumns) { if (key instanceof RexInputRef ref) { partitionKeyIndices.add(ref.getIndex()); } } List<RexNode> bucketNonNullConjuncts = new ArrayList<>(); List<RexNode> remainingConjuncts = new ArrayList<>(); for (RexNode conjunct : RelOptUtil.conjunctions(bucketNonNullFilter.getCondition())) { if (isNotNullOnPartitionKey(conjunct, partitionKeyIndices)) { bucketNonNullConjuncts.add(conjunct); } else { remainingConjuncts.add(conjunct); } } // Defensive: if no IS_NOT_NULL conjunct on a partition key is present, this filter is not // actually a bucket-non-null filter — bail out without transforming. The loose operand // predicate may have matched on an unrelated AND that happens to contain an IS_NOT_NULL on // some other ref. if (bucketNonNullConjuncts.isEmpty()) { return; }

github-actions · 2026-05-29T01:49:33Z

PR Code Suggestions ✨

Latest suggestions up to 38fc6c3

Explore these optional code suggestions:

Category	Suggestion	Impact
General	Add type check before casting The code assumes `call.rel(2)` is always a `CalciteLogicalIndexScan` when `call.rels[1]` is a `LogicalFilter`, but this assumption isn't validated. If the operand structure changes or the rule matches unexpectedly, a `ClassCastException` could occur. Add a type check before casting. opensearch/src/main/java/org/opensearch/sql/opensearch/planner/rules/DedupPushdownRule.java [51-59] if (call.rels[1] instanceof LogicalFilter filter) { // Push the filter into the scan, then synthesize an identity project so the standard // apply() can run on the resulting Dedup → Project → Scan shape. If the filter is only // partially pushable, pushDownFilter returns a Filter wrapping the residual condition over // the new scan; we can't strip a residual filter without breaking semantics, so bail. - final CalciteLogicalIndexScan scan = call.rel(2); + if (!(call.rel(2) instanceof CalciteLogicalIndexScan scan)) { + return; + } if (!(scan.pushDownFilter(filter) instanceof CalciteLogicalIndexScan newScan)) { return; } Suggestion importance[1-10]: 7 __ Why: Valid safety improvement. The operand structure is defined by the rule configuration, but adding an explicit type check before casting `call.rel(2)` to `CalciteLogicalIndexScan` prevents potential `ClassCastException` if the rule configuration changes or matches unexpectedly, improving robustness.	Medium
Possible issue	Validate no predicates are lost When no bucket-non-null conjuncts are found, the method returns early without transforming. However, the extracted `remainingConjuncts` are discarded, potentially losing user predicates. Consider logging a warning or validating that `remainingConjuncts` is also empty to ensure no predicates are silently dropped. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [148-150] if (bucketNonNullConjuncts.isEmpty()) { + // Defensive: if no IS_NOT_NULL conjunct on a partition key is present, this filter is not + // actually a bucket-non-null filter — bail out without transforming. + if (!remainingConjuncts.isEmpty()) { + // Log or handle unexpected case where user predicates exist but no bucket guards found + throw new IllegalStateException("Expected bucket-non-null filter but found only user predicates"); + } return; } Suggestion importance[1-10]: 3 __ Why: The suggestion raises a valid concern about potentially discarding `remainingConjuncts`, but the current behavior is intentional: if no bucket-non-null conjuncts exist, the filter isn't a dedup pattern and the rule shouldn't transform. The `remainingConjuncts` remain in the original filter. Throwing an exception would be too aggressive for a defensive check.	Low

Previous suggestions

Suggestions up to commit 95e2aeb

Category	Suggestion	Impact
General	Handle partial filter pushdown explicitly The pattern match checks if `pushDownFilter` returns a `CalciteLogicalIndexScan`, but doesn't verify whether the filter was fully pushed down or if a residual filter remains. If `pushDownFilter` returns a `LogicalFilter` wrapping a scan (indicating partial pushdown), the cast fails silently and the method returns. Consider explicitly checking the return type and handling partial pushdown scenarios. opensearch/src/main/java/org/opensearch/sql/opensearch/planner/rules/DedupPushdownRule.java [57-59] -if (!(scan.pushDownFilter(filter) instanceof CalciteLogicalIndexScan newScan)) { +var pushResult = scan.pushDownFilter(filter); +if (!(pushResult instanceof CalciteLogicalIndexScan newScan)) { + // Partial pushdown or unsupported filter - bail out return; } Suggestion importance[1-10]: 4 __ Why: The suggestion correctly identifies that the code checks for partial pushdown via the `instanceof` pattern match and bails out if a residual filter remains (as stated in the comment on lines 53-55). Extracting to a variable improves readability slightly, but the logic is already correct and the comment explains the behavior. The improvement is marginal.	Low
General	Validate all conjuncts are classified The loop iterates over all conjunctions and classifies them into two lists. If the filter condition is complex or deeply nested, `RelOptUtil.conjunctions` may not flatten all levels of nested ANDs. Consider adding validation to ensure all conjuncts are properly classified, or document the assumption that only top-level ANDs are expected. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [137-143] -for (RexNode conjunct : RelOptUtil.conjunctions(bucketNonNullFilter.getCondition())) { +List<RexNode> allConjuncts = RelOptUtil.conjunctions(bucketNonNullFilter.getCondition()); +for (RexNode conjunct : allConjuncts) { if (isNotNullOnPartitionKey(conjunct, partitionKeyIndices)) { bucketNonNullConjuncts.add(conjunct); } else { remainingConjuncts.add(conjunct); } } +// Validate that all conjuncts were classified +assert bucketNonNullConjuncts.size() + remainingConjuncts.size() == allConjuncts.size(); Suggestion importance[1-10]: 2 __ Why: The suggestion adds an assertion to validate that all conjuncts are classified, but `RelOptUtil.conjunctions` is a standard Calcite utility that flattens top-level ANDs reliably. The assertion would always pass in normal operation and only catch programming errors. The concern about deeply nested ANDs is unfounded since `conjunctions` handles this. The added validation has minimal practical value.	Low
Possible issue	Validate remaining conjuncts before early return When no bucket-non-null conjuncts are found, the method returns early without transforming. However, the extracted `remainingConjuncts` list is not checked for emptiness. If `remainingConjuncts` is non-empty but `bucketNonNullConjuncts` is empty, user predicates could be silently dropped. Consider logging a warning or validating that `remainingConjuncts` is also empty before returning. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [148-150] if (bucketNonNullConjuncts.isEmpty()) { + if (!remainingConjuncts.isEmpty()) { + // Log or handle unexpected case: filter has user predicates but no bucket-non-null guards + } return; } Suggestion importance[1-10]: 3 __ Why: The suggestion raises a valid edge case where `remainingConjuncts` could be non-empty while `bucketNonNullConjuncts` is empty. However, the early return is intentional per the defensive comment (lines 144-147), and silently dropping user predicates is unlikely since the operand predicate should prevent matching unrelated filters. The suggestion to log or validate is reasonable but has low impact.	Low

Suggestions up to commit a66ad5b

Category	Suggestion	Impact
General	Handle non-RexInputRef partition keys The code only handles `RexInputRef` instances when building `partitionKeyIndices`, silently ignoring other node types. If `dedupColumns` contains non-`RexInputRef` nodes (e.g., expressions), those partition keys won't be tracked, potentially causing incorrect classification of conjuncts. Consider handling or explicitly documenting this limitation. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [130-134] for (RexNode key : dedupColumns) { if (key instanceof RexInputRef ref) { partitionKeyIndices.add(ref.getIndex()); + } else { + // Log or handle non-RexInputRef partition keys if they're expected + call.getPlanner().getContext().unwrap(org.slf4j.Logger.class) + .warn("Unexpected partition key type: {}", key.getClass().getSimpleName()); } } Suggestion importance[1-10]: 4 __ Why: The suggestion correctly identifies that non-`RexInputRef` nodes in `dedupColumns` are silently ignored. However, based on the PR context and the dedup pattern, partition keys are expected to be simple column references (`RexInputRef`). Adding logging for unexpected types could help catch edge cases, but the current behavior appears intentional for the dedup use case. The suggestion provides moderate value for defensive programming.	Low
	Add logging for defensive check When no bucket-non-null conjuncts are found, the method returns early without transforming the plan. However, this leaves the original plan unchanged, which may cause downstream rules to fail or produce incorrect results. Consider logging a warning or throwing an exception to make this defensive check more explicit and aid debugging. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [148-150] if (bucketNonNullConjuncts.isEmpty()) { + // Log or handle the case where the filter doesn't match the expected pattern + call.getPlanner().getContext().unwrap(org.slf4j.Logger.class) + .warn("Expected bucket-non-null filter pattern not found; skipping transformation"); return; } Suggestion importance[1-10]: 3 __ Why: While adding logging for the defensive check could aid debugging, the early return is already well-documented with a comment explaining why the transformation is skipped. The suggestion to add logging is a minor improvement that would help debugging but doesn't address a critical issue. The improved code also assumes a specific logging framework is available in the context, which may not be the case.	Low
	Log filter pushdown failures The `pushDownFilter` method may return null if the filter cannot be pushed down, but the code silently returns without any indication of why the transformation failed. This makes debugging difficult when pushdown unexpectedly fails. Consider logging the failure reason or the filter condition that couldn't be pushed. opensearch/src/main/java/org/opensearch/sql/opensearch/planner/rules/DedupPushdownRule.java [56-59] AbstractRelNode scanWithFilter = scan.pushDownFilter(filter); if (scanWithFilter == null) { + // Log when filter pushdown fails to aid debugging + call.getPlanner().getContext().unwrap(org.slf4j.Logger.class) + .debug("Filter pushdown failed for condition: {}", filter.getCondition()); return; } Suggestion importance[1-10]: 3 __ Why: Adding debug logging when `pushDownFilter` returns null could help with debugging, but this is a normal control flow path where the rule simply doesn't match. The null return indicates the filter cannot be pushed down, which is expected behavior in some cases. The suggestion provides marginal debugging value but doesn't fix any bug or improve correctness.	Low

Suggestions up to commit f67458f

Category	Suggestion	Impact
General	Add fallback for failed filter pushdown When `pushDownFilter` returns null, the method returns early without any transformation. This silently fails the optimization. Consider logging this case or falling back to the standard path to ensure the query still executes correctly, even if the filter cannot be pushed down. opensearch/src/main/java/org/opensearch/sql/opensearch/planner/rules/DedupPushdownRule.java [56-59] AbstractRelNode scanWithFilter = scan.pushDownFilter(filter); if (scanWithFilter == null) { + // Filter cannot be pushed down; fall back to standard dedup path + // or log the failure for debugging return; } Suggestion importance[1-10]: 4 __ Why: The suggestion identifies a valid concern about the early return when `pushDownFilter` returns null. However, the current behavior is intentional - when filter pushdown fails, the rule simply doesn't match and other rules can handle the query. Adding a log for debugging could be helpful, but the suggestion to "fall back to standard dedup path" is vague since the early return already allows the standard path (without the filter optimization) to proceed via the else branch at line 70. The comment addition would provide minimal value.	Low
General	Handle edge case for missing bucket guards When `bucketNonNullConjuncts` is empty, the method returns early without transforming. However, `remainingConjuncts` may contain valid user predicates that should still be processed. Consider logging a warning or handling the case where only non-partition-key predicates exist to avoid silently dropping valid filters. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [148-150] if (bucketNonNullConjuncts.isEmpty()) { + // Log or handle case where filter doesn't match expected dedup pattern + // but may contain valid user predicates return; } Suggestion importance[1-10]: 3 __ Why: The suggestion correctly identifies the early return when `bucketNonNullConjuncts` is empty, but the concern about "silently dropping valid filters" is misplaced. The code comment at lines 144-147 explicitly states this is a defensive check to bail out when the filter doesn't match the expected bucket-non-null pattern. The `remainingConjuncts` are only relevant when there are actual bucket guards present. Adding a log might be marginally useful for debugging, but doesn't address a real correctness issue.	Low

Suggestions up to commit 0bf8f7c

Category	Suggestion	Impact
General	Remove redundant operand check The `!rexCall.getOperands().isEmpty()` check is redundant because `IS_NOT_NULL` always has exactly one operand by definition. Accessing `getOperands().get(0)` without this check is safe and simplifies the logic. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [174-180] private static boolean isNotNullOnPartitionKey(RexNode rex, Set<Integer> partitionKeyIndices) { return rex instanceof RexCall rexCall && rexCall.isA(SqlKind.IS_NOT_NULL) - && !rexCall.getOperands().isEmpty() && rexCall.getOperands().get(0) instanceof RexInputRef ref && partitionKeyIndices.contains(ref.getIndex()); } Suggestion importance[1-10]: 4 __ Why: While `IS_NOT_NULL` typically has exactly one operand, the check `!rexCall.getOperands().isEmpty()` provides defensive programming against malformed expressions. Removing it offers minimal benefit and slightly reduces code robustness. The same pattern is used consistently in `PlanUtils.isNotNullOnRef` at line 826, suggesting it's an intentional defensive practice.	Low
General	Handle non-reference partition keys Handle non-RexInputRef partition keys gracefully. If `dedupColumns` contains complex expressions that aren't simple column references, they won't be added to `partitionKeyIndices`, potentially causing valid bucket-non-null filters to be incorrectly rejected. Consider logging a warning or adding a comment explaining this limitation. core/src/main/java/org/opensearch/sql/calcite/plan/rule/PPLSimplifyDedupRule.java [130-134] for (RexNode key : dedupColumns) { if (key instanceof RexInputRef ref) { partitionKeyIndices.add(ref.getIndex()); } + // Note: Complex expressions in dedupColumns are not tracked as partition keys. + // This may cause the rule to bail out if bucket-non-null filters reference them. } Suggestion importance[1-10]: 3 __ Why: The suggestion correctly identifies that non-`RexInputRef` partition keys won't be tracked, but this appears to be intentional design rather than a bug. Adding a comment could improve code clarity, but the impact is minimal since the defensive check at line 148 already handles this case by bailing out when no matching filters are found.	Low

Suggestions up to commit 19f506d

Category	Suggestion	Impact
General	Make filter assertion more specific The assertion checking for "1000" is too generic and may match unintended occurrences in the plan. Consider checking for a more specific pattern like ">($7, 1000)" or "SAL > 1000" to ensure the user's WHERE clause filter is actually present in the expected location. ppl/src/test/java/org/opensearch/sql/ppl/calcite/CalcitePPLDedupTest.java [380-386] String rawExplain = raw.explain(); Assert.assertTrue( "Raw plan should contain the bucket-non-null filter:\n" + rawExplain, rawExplain.contains("IS NOT NULL")); Assert.assertTrue( "Raw plan should contain the user where filter:\n" + rawExplain, - rawExplain.contains("1000")); + rawExplain.contains(">") && rawExplain.contains("1000")); Suggestion importance[1-10]: 4 __ Why: The suggestion correctly identifies that checking for "1000" alone could match unintended occurrences. However, the improved code `rawExplain.contains(">") && rawExplain.contains("1000")` is still too generic and doesn't guarantee the filter is in the expected location. A more specific pattern would be better, but the suggestion does improve robustness slightly.	Low

RyanL1997

Hi @ryan-gh-bot , thanks for the change, left some reviews.

…search-project#5488) Per maintainer review feedback, the regression-test JavaDoc for testDedupAfterWhereProducesLogicalDedup mentioned the originating issue URL. The remaining JavaDoc paragraphs already describe the bug shape and the rule-ordering invariant, so the explicit issue link is unnecessary noise. Signed-off-by: ryan-gh-bot <ryan-gh-bot@users.noreply.github.com>

github-actions · 2026-05-29T02:04:28Z

Persistent review updated to latest commit 19f506d

RyanL1997 · 2026-05-29T21:37:16Z

@ryan-gh-bot the rule-reordering fix works but treats a symptom rather than the root cause. The actual brittleness is in PlanUtils.mayBeFilterFromBucketNonNull — its allMatch(isNotNullOnRef) requirement means any future code path that produces a merged conjunction (a new rule, a different optimization order, a SQL-side caller) will silently re-disable dedup pushdown the same way. The HEP ordering is now a load-bearing invariant with nothing enforcing it; if anyone later adds a third rule to HEP_PROGRAM or refactors it, the bug comes back silently. The new testDedupAfterWhereWithFilterMergeFirstFailsToSimplify test pins the implementation strategy rather than the user-visible behavior.

Please revise to take option 1 from the issue instead:

Extend mayBeFilterFromBucketNonNull (core/.../utils/PlanUtils.java) to accept AND(IS_NOT_NULL(partition_col), <rest>) — i.e., recognize an AND whose operands contain an IS_NOT_NULL on the partition column rather than requiring every conjunct to be one.
In PPLSimplifyDedupRule.apply, split the non-IS_NOT_NULL conjuncts off and emit them as a separate filter below the new LogicalDedup, so the user predicate is preserved and pushdown still works.
Revert the addRuleCollection → addRuleInstance change in CalciteToolsHelper.HEP_PROGRAM — once the predicate is order-independent, rule order no longer matters.
Replace testDedupAfterWhereWithFilterMergeFirstFailsToSimplify (which pins buggy behavior) with a test that asserts the user-visible contract: with where upstream, dedup still produces a LogicalDedup regardless of rule order. Keep testDedupAfterWhereProducesLogicalDedup.

Please sanity-check that the loosened operand can't accidentally match user-written IS_NOT_NULL(x) AND ... filters outside the dedup pattern — the four-level operand chain (project → filter with row_number <= N → project containing ROW_NUMBER-dedup → filter) should be specific enough, but worth confirming.

ryan-gh-bot · 2026-05-29T21:37:39Z

Working on /@triage (invoked by @RyanL1997) — posting result when done.

…oject#5488) Address review feedback on opensearch-project#5488: extend mayBeFilterFromBucketNonNull to accept the merged conjunction shape FilterMergeRule produces, so PPLSimplifyDedupRule fires regardless of whether FilterMergeRule has already merged the user where clause into the bucket-non-null filter. PPLSimplifyDedupRule.apply now splits the bottom filter into IS_NOT_NULL conjuncts on partition keys (absorbed into LogicalDedup semantics) and any remaining conjuncts (preserved as a separate filter below the new LogicalDedup), so a user predicate that was folded in is no longer dropped. With the operand predicate order-independent, the HEP rule order is no longer a load-bearing invariant. Revert the addRuleCollection -> addRuleInstance change in CalciteToolsHelper.HEP_PROGRAM that the previous patch introduced. Replace the regression test that pinned the buggy rule order with one that asserts the user-visible contract: with where preceding dedup, a LogicalDedup is produced and the user predicate is preserved regardless of which order FilterMergeRule and PPLSimplifyDedupRule fire. Signed-off-by: ryan-gh-bot <ryan-gh-bot@users.noreply.github.com>

ryan-gh-bot · 2026-05-29T22:03:13Z

Done — pushed 0bf8f7c73.

github-actions · 2026-05-29T22:04:25Z

Persistent review updated to latest commit 0bf8f7c

dai-chen · 2026-06-03T17:41:03Z

@ryan-gh-bot the rule-reordering fix works but treats a symptom rather than the root cause. The actual brittleness is in PlanUtils.mayBeFilterFromBucketNonNull — its allMatch(isNotNullOnRef) requirement means any future code path that produces a merged conjunction (a new rule, a different optimization order, a SQL-side caller) will silently re-disable dedup pushdown the same way. The HEP ordering is now a load-bearing invariant with nothing enforcing it; if anyone later adds a third rule to HEP_PROGRAM or refactors it, the bug comes back silently. The new testDedupAfterWhereWithFilterMergeFirstFailsToSimplify test pins the implementation strategy rather than the user-visible behavior.

Please revise to take option 1 from the issue instead:

Extend mayBeFilterFromBucketNonNull (core/.../utils/PlanUtils.java) to accept AND(IS_NOT_NULL(partition_col), <rest>) — i.e., recognize an AND whose operands contain an IS_NOT_NULL on the partition column rather than requiring every conjunct to be one.

In PPLSimplifyDedupRule.apply, split the non-IS_NOT_NULL conjuncts off and emit them as a separate filter below the new LogicalDedup, so the user predicate is preserved and pushdown still works.

Revert the addRuleCollection → addRuleInstance change in CalciteToolsHelper.HEP_PROGRAM — once the predicate is order-independent, rule order no longer matters.

Replace testDedupAfterWhereWithFilterMergeFirstFailsToSimplify (which pins buggy behavior) with a test that asserts the user-visible contract: with where upstream, dedup still produces a LogicalDedup regardless of rule order. Keep testDedupAfterWhereProducesLogicalDedup.

Please sanity-check that the loosened operand can't accidentally match user-written IS_NOT_NULL(x) AND ... filters outside the dedup pattern — the four-level operand chain (project → filter with row_number <= N → project containing ROW_NUMBER-dedup → filter) should be specific enough, but worth confirming.

I'm weighing the pros and cons of these two options:

Option 1 — make PPLSimplifyDedupRule run first: If this rule runs before FilterMergeRule, it matches the original unmerged pattern and fixes the problem directly. The concern is that some other rule could run ahead of it and reintroduce the mismatch.
Option 2 - current PR approach: Does this cover all the plan shapes visitDedupe can produce, or does it only fix the plain where + dedup case? e.g. dedup N, multiple dedup fields, and the sort/collation cases. Or these cases are not supported by DSL query anyway?

penghuo

Add end-to-end test asserts dedup is actually pushed down when where precedes dedup.
Refactor PR description to match the implementation.

@penghuo

Per @penghuo review: PlanUtils.java - Revert mayBeFilterFromBucketNonNull to the original ternary form; drop the early-return refactor (no behavior change, just cleaner). - Drop the !rexCall.getOperands().isEmpty() guard before .get(0): IS NOT NULL is always unary in Calcite, so the check is dead. - Trim the JavaDoc to the essentials (un-merged vs merged-AND shape; concrete partition-key match happens in PPLSimplifyDedupRule#apply). - Promote isNotNullOnRef from package-private to public so the dedup rule can reuse it from a different package. PPLSimplifyDedupRule.java - isNotNullOnPartitionKey now delegates the IS NOT NULL($ref) structural check to PlanUtils.isNotNullOnRef and adds the partition-key index check on top. CalciteExplainIT.java - Add testDedupAfterWherePushDown: an end-to-end regression that runs the shape `... | where <pred> | dedup <field>` and asserts (a) LogicalDedup appears in the explain output (PPLSimplifyDedupRule fired even after FilterMergeRule had a chance to merge the two filters), and (b) EnumerableWindow does NOT appear (the in-memory ROW_NUMBER fallback the bug caused is gone). Signed-off-by: Jialiang Liang <jiallian@amazon.com>

github-actions · 2026-06-05T16:38:47Z

Persistent review updated to latest commit ae86ece

RyanL1997 · 2026-06-05T16:44:11Z

Hi @penghuo, thanks for the feedback:

Add end-to-end test asserts dedup is actually pushed down when where precedes dedup.

added CalciteExplainIT.testDedupAfterWherePushDown

Refactor PR description to match the implementation.

PR description refactored — now describes the actual implementation (loosen predicate at source, not reorder HEP rules) instead of the bot's earlier reorder approach.

RyanL1997 · 2026-06-05T17:40:46Z

Walked the axes against buildDedupNotNull / buildDedupOrNull and reasoned about each. The fix lives in two places — the mayBeFilterFromBucketNonNull operand predicate, and the conjunct split inside PPLSimplifyDedupRule#apply. As long as the bottom filter of the rule's 4-level operand chain still includes an IS NOT NULL($pk) somewhere (alone or in an AND), the rule fires and apply cleanly separates the partition-key guards from the user predicate.

Axis	Effect on bucket-non-null filter shape	Covered?	Why
`dedup N` (N ≥ 2)	`N` lives in the upper `_row_number_ <= N` filter, not the bucket-non-null filter. Bottom-filter shape is identical regardless of `N`.	Yes	Predicate match path is unchanged; `apply` reads `N` from the upper filter as before.
Multiple dedup fields (`dedup a, b`)	Bucket-non-null becomes `AND(IS NOT NULL(a), IS NOT NULL(b))`. With a user `where`, FilterMergeRule produces `AND(IS NOT NULL(a), IS NOT NULL(b), <user predicate>)`.	Yes	The `anyMatch` predicate matches (multiple `IS NOT NULL` conjuncts present). `apply` absorbs all `IS NOT NULL`-on-partition-key conjuncts into `LogicalDedup` and emits the user predicate as a separate filter below it.
Sort/collation (`\| sort x \| dedup a` / `\| where p \| sort x \| dedup a`)	`stripInputSort` pulls the prior sort out and threads it into `ROW_NUMBER`'s ORDER BY. Bucket-non-null filter shape is unchanged.	Yes	`apply` extracts collation from the window. The user `where` can sit below or above the sort; either way the bottom filter is the same bucket-non-null shape (possibly merged with the user predicate).
`keepempty=true` (with or without `where`)	`buildDedupOrNull` does not emit a bucket-non-null filter at all. The 4-level operand chain has no matching bottom filter.	No, by design	The rule's operand chain doesn't match, so the rule never fires. Documented in code: `PPLSimplifyDedupRule.java:101` (`Since we cannot push down dedup with keepEmpty=true, we don't simplify that pattern`) and `DedupPushdownRule.java:218` (`Cannot push dedup operator if keepEmpty=true`). Unchanged by this PR.

Net: this PR covers the keepempty=false cases (the default and only pushable shape) across all four where + dedup <variant> combinations — simple, dedup N, multi-field, and sort/collation. The keepempty=true case is an existing, intentional non-pushable shape, unchanged here.

RyanL1997 · 2026-06-05T18:20:50Z

Checking on the CI failures

github-actions · 2026-06-05T20:16:54Z

Persistent review updated to latest commit 7725e0b

github-actions · 2026-06-05T20:17:54Z

Persistent review updated to latest commit f67458f

PPLSimplifyDedupRule correctly produces Dedup -> Filter(user where) -> Scan when a `where` precedes `dedup`. The Filter between Dedup and Scan blocks DedupPushdownRule's strict Dedup -> Project -> Scan operand chain, so Volcano falls back to PPLDedupConvertRule and the plan ends up with an in-memory ROW_NUMBER window instead of the pushed-down composite + top_hits aggregation. Add a WITH_FILTER operand variant to DedupPushdownRule that matches Dedup -> Filter -> Scan, pushes the filter into the scan, then runs the standard apply() on the resulting Dedup -> Project -> Scan shape. Signed-off-by: Jialiang Liang <jiallian@amazon.com>

github-actions · 2026-06-05T20:19:12Z

Persistent review updated to latest commit a66ad5b

pushDownFilter returns a Filter (not a CalciteLogicalIndexScan) when the predicate analyzer can only partially push the condition. The previous cast would have thrown ClassCastException in that case. Use an instanceof-pattern check so the rule bails out cleanly and leaves the plan untouched, letting other rules handle the residual. Also drop a stale issue-link reference from a test comment. Signed-off-by: Jialiang Liang <jiallian@amazon.com>

github-actions · 2026-06-05T20:45:21Z

Persistent review updated to latest commit 95e2aeb

RyanL1997 · 2026-06-05T22:15:33Z

The current CI failure is unrelated.

The failure is:

 * What went wrong:
  Execution failed for task ':doctest:startPrometheus'.
  > java.net.SocketTimeoutException: Read timed out

Reflowed the JavaDoc on testDedupAfterWhereProducesLogicalDedupWithProductionHepProgram to match Google Java Format's preferred line break, fixing the spotlessJavaCheck violation that failed the unit-test matrix on CI. Signed-off-by: Jialiang Liang <jiallian@amazon.com>

github-actions · 2026-06-05T22:34:41Z

Persistent review updated to latest commit 38fc6c3

ryan-gh-bot requested review from Swiddis, dai-chen, joshuali925, mengweieric, penghuo, ps48 and vamsimanohar as code owners May 29, 2026 01:48

ryan-gh-bot mentioned this pull request May 29, 2026

[BUG] Dedup pushdown disabled when combined with WHERE clause (FilterMergeRule defeats PPLSimplifyDedupRule) #5482

Closed

1 task

ryan-gh-bot requested review from LantaoJin, RyanL1997, acarbonetto, ahkcs, anirudha, noCharger, qianheng-aws, songkant-aws, ykmr1224 and yuancu as code owners May 29, 2026 01:48

RyanL1997 reviewed May 29, 2026

View reviewed changes

Comment thread core/src/main/java/org/opensearch/sql/calcite/utils/CalciteToolsHelper.java

Comment thread ppl/src/test/java/org/opensearch/sql/ppl/calcite/CalcitePPLDedupTest.java Outdated

RyanL1997 added PPL Piped processing language bugFix labels May 29, 2026

ahkcs previously approved these changes May 29, 2026

View reviewed changes

RyanL1997 requested changes May 29, 2026

View reviewed changes

ryan-gh-bot dismissed ahkcs’s stale review via 0bf8f7c May 29, 2026 22:03

penghuo reviewed Jun 3, 2026

View reviewed changes

RyanL1997 force-pushed the bot-fix-opensearch-project-sql-5482 branch from 7725e0b to f67458f Compare June 5, 2026 20:16

RyanL1997 force-pushed the bot-fix-opensearch-project-sql-5482 branch from f67458f to a66ad5b Compare June 5, 2026 20:18

dai-chen approved these changes Jun 5, 2026

View reviewed changes

penghuo approved these changes Jun 8, 2026

View reviewed changes

penghuo enabled auto-merge (squash) June 8, 2026 16:23

RyanL1997 approved these changes Jun 8, 2026

View reviewed changes

penghuo merged commit e5de8f6 into opensearch-project:main Jun 8, 2026
40 of 41 checks passed

Conversation

ryan-gh-bot commented May 29, 2026 • edited by RyanL1997 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Fix

Verification

Related Issues

Check List

Uh oh!

github-actions Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Reviewer Guide 🔍

(Review updated until commit 38fc6c3)

Uh oh!

github-actions Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Code Suggestions ✨

Previous suggestions

Uh oh!

RyanL1997 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented May 29, 2026

Uh oh!

RyanL1997 commented May 29, 2026

Uh oh!

ryan-gh-bot commented May 29, 2026

Uh oh!

ryan-gh-bot commented May 29, 2026

Uh oh!

github-actions Bot commented May 29, 2026

Uh oh!

dai-chen commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

penghuo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

RyanL1997 commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RyanL1997 commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RyanL1997 commented Jun 5, 2026

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

RyanL1997 commented Jun 5, 2026

Uh oh!

github-actions Bot commented Jun 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ryan-gh-bot commented May 29, 2026 •

edited by RyanL1997

Loading

github-actions Bot commented May 29, 2026 •

edited

Loading

(Review updated until commit `38fc6c3`)

github-actions Bot commented May 29, 2026 •

edited

Loading

dai-chen commented Jun 3, 2026 •

edited

Loading

RyanL1997 commented Jun 5, 2026 •

edited

Loading

RyanL1997 commented Jun 5, 2026 •

edited

Loading