
SNOW-2257191: Bugfix join bug due to dataframe alias #3685

Closed
sfc-gh-aalam wants to merge 2829 commits into main from aalam-SNOW-2257191-cte-join-bugfix

Conversation

@sfc-gh-aalam

Contributor
  1. Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

    Fixes SNOW-2257191

  2. Fill out the following pre-review checklist:

    • I am adding a new automated test(s) to verify correctness of my new code
      • If this test skips Local Testing mode, I'm requesting review from @snowflakedb/local-testing
    • I am adding new logging messages
    • I am adding a new telemetry message
    • I am adding new credentials
    • I am adding a new dependency
    • If this is a new feature/behavior, I'm adding the Local Testing parity changes.
    • I acknowledge that I have ensured my changes to be thread-safe. Follow the link for more information: Thread-safe Developer Guidelines
    • If adding any arguments to public Snowpark APIs or creating new public Snowpark APIs, I acknowledge that I have ensured my changes include AST support. Follow the link for more information: AST Support Guidelines
  3. Please describe how your code solves the related issue.

    This PR fixes how `df_aliased_col_name_to_real_col_name` is propagated from child to parent: every dictionary inside the default dict is now copied by value instead of being shared by reference.
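
    The difference between copying by reference and by value can be illustrated with a minimal sketch. It is not the code from this PR: the helper `copy_alias_map`, the alias `"L"`, and the column names are hypothetical, and the only assumption taken from the description is that the mapping is a default dict whose values are dictionaries.

```python
from collections import defaultdict


def copy_alias_map(alias_map):
    """Hypothetical helper: copy a defaultdict of dicts so that the inner
    dicts are new objects rather than references to the originals."""
    copied = defaultdict(dict)
    for df_alias, col_map in alias_map.items():
        copied[df_alias] = dict(col_map)  # copy each inner dict by value
    return copied


# A child mapping: dataframe alias -> {aliased column name: real column name}
child = defaultdict(dict)
child["L"]["a"] = "A_REAL"

# defaultdict.copy() is shallow: the inner dicts are still shared.
shared = child.copy()
# The helper gives the parent its own inner dicts.
by_value = copy_alias_map(child)

shared["L"]["b"] = "B_REAL"    # also changes child["L"]
by_value["L"]["c"] = "C_REAL"  # leaves child["L"] untouched
```

    With a shallow copy, the write through `shared` shows up in `child`, while the write through `by_value` does not.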

sfc-gh-aalam added the NO-CHANGELOG-UPDATES label (This pull request does not need to update CHANGELOG.md) on Aug 21, 2025
sfc-gh-kgadomski and others added 29 commits August 25, 2025 09:40
…ine (#3610)

Co-authored-by: graphite-app[bot] <96075541+graphite-app[bot]@users.noreply.github.com>
…3706)

This is working towards running most of our snowpandas tests with hybrid mode.
…sts for the integration module (#3715)

The new test parameter is called '--enable_modin_hybrid_mode' and is only applied to the integration
modin module. It is not used yet, but it allows hybrid mode to be enabled in an ad hoc way. Eventually there
will be a new pre-commit test which enables hybrid mode just for the integration modin module.

This change also disables the sql_counter when running under hybrid mode, because virtually no SQL queries are issued.
…das (#3717)

SNOW-2305345 - Eliminate duplicate casing parameter checks in snowpandas

While working on SHOW OBJECT usage to see if we can fetch row size quickly I noticed we issue SHOW PARAMETERS LIKE 'QUOTED_IDENTIFIERS_IGNORE_CASE' IN SESSION queries every time we fetch the session. This is done to issue a warning, but we really only need to do this once.
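
The "only once" idea in this commit message could be sketched roughly as follows. This is an illustration, not the change made in #3717: `SessionStub`, `show_ignore_case_parameter`, `get_session`, and the `_casing_checked` flag are hypothetical names, and the sketch assumes "once" means once per session object.

```python
import warnings


class SessionStub:
    """Hypothetical stand-in for a session; it only counts the queries it runs."""

    def __init__(self, ignore_case):
        self.ignore_case = ignore_case
        self.queries_issued = 0
        self._casing_checked = False  # hypothetical flag: has the check run yet?

    def show_ignore_case_parameter(self):
        # Stands in for:
        # SHOW PARAMETERS LIKE 'QUOTED_IDENTIFIERS_IGNORE_CASE' IN SESSION
        self.queries_issued += 1
        return self.ignore_case


def get_session(session):
    """Return the session, running the casing check on the first fetch only."""
    if not session._casing_checked:
        session._casing_checked = True
        if session.show_ignore_case_parameter():
            warnings.warn("QUOTED_IDENTIFIERS_IGNORE_CASE is enabled in this session")
    return session


stub = SessionStub(ignore_case=False)
for _ in range(3):
    get_session(stub)  # only the first call issues the SHOW PARAMETERS query
```

Keeping the flag on the session object means a new session is checked again, which seems the safer reading since the parameter is session-scoped.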
sfc-gh-stramer and others added 26 commits October 30, 2025 14:50
…#3975)

While testing #3973, I noticed that aggregations on single-column frames/series were producing queries with JSON serialization and unnecessary UNPIVOT operations. The QC's `transpose_single_row` helper method is used in aggregations to skip a PIVOT operation used in the general transpose case, but for transposing a 1x1 frame, we don't even need to UNPIVOT and need only re-label the index since we already know that the column's dtype will not change.

This PR adds a fast path for 1x1 `transpose_single_row` operations, which replaces JSON/UNPIVOT operations with simple projections. It produces some modest performance improvements for operations on a 2000x1 frame:
- `DataFrame.count`: 1.48s -> 1.31s (11.2% improvement)
- `DataFrame.describe`: 2.64s -> 2.36s (10.9% improvement)
- `DataFrame.nunique`: 1.25s -> 1.21s (3.4% improvement)

These improvements are likely to be more noticeable on frames produced from more complex queries.

This PR also adds explicit row count caching for the general transpose case. We currently cannot directly use the `transpose_single_row` path for the `transpose` API itself since the helper function drops the column labels of the result.
…k/weekday/dayofyear/isocalendar (already supported in faster pandas) (#3992)
sfc-gh-aalam force-pushed the aalam-SNOW-2257191-cte-join-bugfix branch from 50b5998 to 74f91f2 on November 12, 2025 23:20
github-actions (bot) locked and limited conversation to collaborators on Nov 12, 2025
sfc-gh-aalam deleted the aalam-SNOW-2257191-cte-join-bugfix branch on November 12, 2025 23:29

Labels

NO-CHANGELOG-UPDATES This pull request does not need to update CHANGELOG.md
