fix(bigframes): Fix bugs compiling ambiguous ids and in subqueries by TrevorBergeron · Pull Request #16617 · googleapis/google-cloud-python

TrevorBergeron · 2026-04-10T22:53:27Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

- [ ] Make sure to open an issue as a [bug/issue](https://github.com/googleapis/google-cloud-python/issues) before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
- [ ] Ensure the tests and linter pass
- [ ] Code coverage does not decrease (if any source code was changed)
- [ ] Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

gemini-code-assist

Code Review

This pull request introduces several updates, including adding a fillna(False) to isin operations, refactoring BigQuery query generation to use explicit table aliasing (_bf_source), and adjusting the SQL parser to handle table aliases before time travel clauses. Feedback suggests qualifying the wildcard selector in BigQuery queries for consistency and warns that the parser change might cause regressions for SQL dialects that expect aliases after table samples.
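One plausible motivation for the `fillna(False)` on `isin` is SQL's three-valued logic: `NULL IN (...)` evaluates to NULL rather than FALSE, while a pandas-style `isin` is expected to return a plain boolean. The sketch below emulates that behavior in pure Python. The helper names `sql_in` and `isin_filled` are hypothetical and do not come from the PR; this illustrates the semantics only and is not the BigFrames implementation.

```python
from typing import Iterable, Optional


def sql_in(value: Optional[int], candidates: Iterable[Optional[int]]) -> Optional[bool]:
    """Emulate SQL `value IN (candidates)` under three-valued logic.

    None stands in for SQL NULL. Hypothetical helper for illustration only.
    """
    candidates = list(candidates)
    if value is None:
        # A NULL left operand can be neither proven equal nor unequal.
        return None
    if any(c is not None and c == value for c in candidates):
        return True
    # No match: the result is NULL if any candidate was NULL, otherwise FALSE.
    return None if any(c is None for c in candidates) else False


def isin_filled(value: Optional[int], candidates: Iterable[Optional[int]]) -> bool:
    """Coalesce a NULL result to False, similar in spirit to `.fillna(False)`."""
    result = sql_in(value, candidates)
    return False if result is None else result


if __name__ == "__main__":
    print(sql_in(None, [1, 2]))       # NULL left operand stays NULL
    print(isin_filled(None, [1, 2]))  # coalesced to a plain boolean
```

Without the coalescing step, a NULL on the left side would propagate into the result column instead of becoming False.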

gemini-code-assist · 2026-04-10T22:56:01Z

-        select_clause = "SELECT " + ", ".join(f"`{column}`" for column in columns)
+        select_clause = "SELECT " + ", ".join(f"`_bf_source`.`{column}`" for column in columns)
    else:
        select_clause = "SELECT *"


To maintain consistency with the qualified column selection in the if block and to further prevent ambiguity when this query is used as a subquery, consider qualifying the wildcard selector as well.

Suggested change

-        select_clause = "SELECT *"
+        select_clause = "SELECT _bf_source.*"
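A minimal sketch of what the suggestion amounts to, assuming the source table is always aliased as `_bf_source` in the `FROM` clause. The function names `build_select_clause` and `build_query`, and the `if columns:` condition, are assumptions made for illustration; only the two string templates come from the diff and the suggestion above.

```python
from typing import Optional, Sequence

# Alias named in the PR; everything else in this sketch is hypothetical.
SOURCE_ALIAS = "_bf_source"


def build_select_clause(columns: Optional[Sequence[str]] = None) -> str:
    """Build a SELECT clause whose columns are qualified by the source alias."""
    if columns:
        # Qualified column list, as in the diff under review.
        return "SELECT " + ", ".join(
            f"`{SOURCE_ALIAS}`.`{column}`" for column in columns
        )
    # Qualified wildcard, as in the reviewer's suggestion.
    return f"SELECT {SOURCE_ALIAS}.*"


def build_query(table: str, columns: Optional[Sequence[str]] = None) -> str:
    """Attach the aliased FROM clause so the qualifier always resolves."""
    return f"{build_select_clause(columns)} FROM `{table}` AS `{SOURCE_ALIAS}`"


if __name__ == "__main__":
    print(build_query("proj.ds.tbl", ["a", "b"]))
    print(build_query("proj.ds.tbl"))
```

Qualifying both branches keeps the generated SQL unambiguous when the query is later embedded as a subquery next to another relation that exposes the same column names.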

gemini-code-assist · 2026-04-10T22:56:01Z

+        alias = self._parse_table_alias(
+            alias_tokens=alias_tokens or self.TABLE_ALIAS_TOKENS
+        )
+        if alias:
+            this.set("alias", alias)


Moving the alias parsing to the top of _parse_table correctly addresses BigQuery's syntax requirements (where the alias precedes the FOR SYSTEM_TIME AS OF clause). However, this change unconditionally parses the alias before the table sample, which may break dialects that set ALIAS_POST_TABLESAMPLE = True. Since this is a vendored parser, if it's intended to support multiple dialects, consider making this move conditional on the dialect's settings or ensuring that the alias can still be parsed after the sample if needed.
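The concern can be illustrated with a toy parser. This is not the vendored parser's code: `parse_table`, `_parse_alias`, and `_parse_sample` are hypothetical helpers over a pre-split token list, and the `alias_post_tablesample` parameter merely mirrors the `ALIAS_POST_TABLESAMPLE` setting named in the comment. It sketches one way to make the alias position conditional on the dialect.

```python
from typing import Dict, List, Optional, Tuple


def _parse_alias(tokens: List[str], pos: int) -> Tuple[Optional[str], int]:
    """Consume `AS <alias>` at pos if present; otherwise leave pos unchanged."""
    if pos + 1 < len(tokens) and tokens[pos].upper() == "AS":
        return tokens[pos + 1], pos + 2
    return None, pos


def _parse_sample(tokens: List[str], pos: int) -> Tuple[Optional[str], int]:
    """Consume `TABLESAMPLE <arg>` at pos if present; otherwise leave pos unchanged."""
    if pos + 1 < len(tokens) and tokens[pos].upper() == "TABLESAMPLE":
        return tokens[pos + 1], pos + 2
    return None, pos


def parse_table(tokens: List[str], alias_post_tablesample: bool = False) -> Dict[str, object]:
    """Parse `<name> [AS alias] [TABLESAMPLE arg]`, or with the alias last."""
    node: Dict[str, object] = {"name": tokens[0], "alias": None, "sample": None}
    pos = 1
    if not alias_post_tablesample:
        # Alias first: the order BigQuery needs before a time travel clause.
        node["alias"], pos = _parse_alias(tokens, pos)
    node["sample"], pos = _parse_sample(tokens, pos)
    if alias_post_tablesample:
        # Dialects that place the alias after the sample parse it here.
        node["alias"], pos = _parse_alias(tokens, pos)
    # Anything left over was not understood in the chosen order.
    node["unparsed"] = tokens[pos:]
    return node


if __name__ == "__main__":
    print(parse_table(["t", "AS", "a", "TABLESAMPLE", "10"]))
    print(parse_table(["t", "TABLESAMPLE", "10", "AS", "a"], alias_post_tablesample=True))
```

Parsing the alias-last input with `alias_post_tablesample=False` leaves `AS a` unparsed, which is the kind of regression the comment warns about when the alias is always parsed first.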

chalmerlowe

LGTM

PR created by the Librarian CLI to initialize a release. Merging this PR will auto trigger a release.

Librarian Version: v0.13.0
Language Image: us-central1-docker.pkg.dev/cloud-sdk-librarian-prod/images-prod/python-librarian-generator@sha256:234b9d1f2ddb057ed7ac6a38db0bf8163d839c65c6cf88ade52530cddebce59e

<details><summary>bigframes: v2.40.0</summary>

## [v2.40.0](bigframes-v2.39.0...bigframes-v2.40.0) (2026-05-13)

### Features

* Add `bigframes.execution_history` API to track BigQuery jobs (#16588) ([fa20a74](fa20a740))

```python
import bigframes.pandas as bpd

bpd.options.compute.enable_execution_history = True
df = bpd.read_gbq("my_table")
# ... perform operations ...
history = bpd.execution_history
print(history.jobs)  # Access BigQuery job details for executed queries
```

* Implement `ai.similarity` and `ai.embed` for text embeddings and semantic similarity (#16771, #16759) ([d4afa2c](d4afa2c8), [fcb4579](fcb4579b))

```python
import bigframes.pandas as bpd

# Generate embeddings
df["embeddings"] = bpd.bigquery.ai.embed(df["text_col"])
# Compute similarity
df["similarity"] = bpd.bigquery.ai.similarity(df["embeddings_a"], df["embeddings_b"])
```

* Support `hparam_range` and `hparam_candidates` parameters for hyperparameter tuning in model creation (#16640) ([ca47835](ca47835c))
* Update `ai.score`, `ai.classify` and `ai.if_` parameters to match their SQL equivalents (#16919, #16990, #16857) ([9f42fe1](9f42fe14), [e9c52b1](e9c52b12), [f3cb4ad](f3cb4ad0))
* Support unstable sorting in `sort_values` and `sort_index` (#16665) ([bbdeb70](bbdeb70f))
* Support loading Avro and ORC data formats (#16555) ([6d46cba](6d46cba3))
* Add NumPy ufunc support directly on column expressions (#16554) ([2f792ab](2f792abd))

### Bug Fixes

* Fix bugs compiling ambiguous ids and in subqueries (#16617) ([479e44d](479e44dd))
* BigFrames respects bq default region (#16933) ([ef9945a](ef9945a5))
* avoid views when querying BigLake tables from SQL cells (#16562) ([fdd3e0d](fdd3e0de))
* avoid `copy` argument warning in `to_pandas` (#16917) ([fe5245b](fe5245b8))

### Performance Improvements

* Improve write api upload throughput (#16641) ([ef856b0](ef856b04))

### Documentation

* Add docs to the to_csv methods of dataframe and series (#16570) ([a8fccef](a8fccefd))

</details>

fix(bigframes): Fix bugs compiling ambiguous ids and in subqueries

9741f50

TrevorBergeron requested review from a team as code owners April 10, 2026 22:53

TrevorBergeron requested review from sycai and removed request for a team April 10, 2026 22:53

gemini-code-assist Bot reviewed Apr 10, 2026

TrevorBergeron added 5 commits April 10, 2026 23:00

reformat

6b28e7e

update snapshots

375f162

update snapshots again

564194a

update to_query tests

1708e33

fix isin behavior with left null

83f3119

chalmerlowe reviewed Apr 13, 2026

sycai approved these changes Apr 13, 2026

sycai merged commit 479e44d into main Apr 13, 2026
31 checks passed

sycai deleted the tbergeron_isin_test_fixing branch April 13, 2026 18:20

shuoweil mentioned this pull request May 13, 2026

chore: release bigframes 2.40.0 #17056

Merged

sycai merged 6 commits into main from tbergeron_isin_test_fixing

3 participants
