feat(bigframes): Add numpy ufunc support to col expressions by TrevorBergeron · Pull Request #16554 · googleapis/google-cloud-python

TrevorBergeron · 2026-04-03T20:34:27Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

gemini-code-assist

Code Review

This pull request introduces support for NumPy universal functions (ufuncs) in BigFrames by implementing the __array_ufunc__ method in the Expression class. It also refactors binary operation logic into a helper function _as_bf_expr and adds unit tests to verify the new functionality. Feedback was provided regarding the use of a non-standard type hint for the method parameter and an issue in the unit tests where non-standard pandas API calls were used to compute expected results.
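For readers unfamiliar with the protocol, the following is a minimal sketch of how `__array_ufunc__` lets NumPy hand a ufunc call over to a non-array object. The `Expr` class and the `col` helper here are hypothetical stand-ins written for illustration; they are not the BigFrames `Expression` class or its `_as_bf_expr` helper, and the string-building logic is an assumption chosen only to make the dispatch visible.

```python
import numpy as np


class Expr:
    """Hypothetical deferred expression; not the BigFrames Expression class."""

    def __init__(self, text: str):
        self.text = text

    def __array_ufunc__(self, ufunc: np.ufunc, method: str, *inputs, **kwargs):
        # Only plain calls such as np.add(a, b) are handled here; reductions,
        # accumulations, and keyword arguments like `out=` are declined.
        if method != "__call__" or kwargs:
            return NotImplemented
        # Render each operand: nested expressions by their text, scalars by repr.
        args = [x.text if isinstance(x, Expr) else repr(x) for x in inputs]
        return Expr(f"{ufunc.__name__}({', '.join(args)})")


def col(name: str) -> Expr:
    """Hypothetical helper that names a column."""
    return Expr(f"col({name!r})")


# NumPy dispatches to Expr.__array_ufunc__ whichever side the Expr appears on.
print(np.sqrt(col("float64_col")).text)
print(np.add(2.4, col("float64_col")).text)
```

Because NumPy checks every input for `__array_ufunc__`, the reversed call `np.add(2.4, col(...))` reaches the same method as the forward call, which is why a single implementation can cover both the `add_const` and `radd_const` cases exercised in the tests.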

gemini-code-assist · 2026-04-03T20:37:00Z

packages/bigframes/bigframes/core/col.py

        return strings.StringMethods(self)

+    def __array_ufunc__(
+        self, ufunc: numpy.ufunc, method: __builtins__.str, *inputs, **kwargs


Using __builtins__.str as a type hint is non-standard and potentially fragile. It is recommended to use the built-in str type directly.

Suggested change

-        self, ufunc: numpy.ufunc, method: __builtins__.str, *inputs, **kwargs
+        self, ufunc: numpy.ufunc, method: str, *inputs, **kwargs

References

Standard Python type hinting practices (PEP 484) recommend using built-in types like 'str' directly instead of accessing them through 'builtins'.
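One possible reason for the original annotation, offered here as an assumption rather than something the PR states, is that the class appears to define a `str` accessor (the preceding context line returns `strings.StringMethods(self)`), in which case a bare `str` inside the class body would refer to that accessor rather than the built-in type. If so, a sketch of a more portable spelling is to import the standard `builtins` module, since `__builtins__` is a CPython implementation detail that may be either a module or a dict. The `Expr` class below is hypothetical.

```python
import builtins
import typing


class Expr:
    """Hypothetical class exposing a `.str` accessor that shadows the builtin."""

    @property
    def str(self) -> builtins.str:
        return "string-accessor"

    # Inside the class body the bare name `str` now refers to the property
    # above, so the annotation spells out `builtins.str` explicitly.
    def describe(self, method: builtins.str) -> builtins.str:
        return f"method={method}"


# The class attribute is the property, while the resolved hint is the builtin.
shadowed = Expr.__dict__["str"]
hints = typing.get_type_hints(Expr.describe)
print(type(shadowed).__name__, hints["method"].__name__)
```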

gemini-code-assist · 2026-04-03T20:37:00Z

packages/bigframes/tests/unit/test_col.py

+    pd_kwargs = {
+        "sqrt": np.sqrt(pd.col("float64_col")),  # type: ignore
+        "add_const": np.add(pd.col("float64_col"), 2.4),  # type: ignore
+        "radd_const": np.add(2.4, pd.col("float64_col")),  # type: ignore
+        "add_cols": np.add(pd.col("float64_col"), pd.col("int64_col")),  # type: ignore
+    }


The pd_kwargs dictionary uses pd.col, which is not a standard pandas API. To correctly verify the BigFrames implementation against pandas, the expected results should be computed using standard pandas column access on scalars_pandas_df. Additionally, standard pandas assign does not support BigFrames Expression objects. To ensure dictionary keys remain sorted without manual effort, the dictionary should be programmatically sorted.

Suggested change

-    pd_kwargs = {
-        "sqrt": np.sqrt(pd.col("float64_col")),  # type: ignore
-        "add_const": np.add(pd.col("float64_col"), 2.4),  # type: ignore
-        "radd_const": np.add(2.4, pd.col("float64_col")),  # type: ignore
-        "add_cols": np.add(pd.col("float64_col"), pd.col("int64_col")),  # type: ignore
-    }
+    pd_kwargs = dict(sorted({
+        "sqrt": np.sqrt(scalars_pandas_df["float64_col"]),
+        "add_const": np.add(scalars_pandas_df["float64_col"], 2.4),
+        "radd_const": np.add(2.4, scalars_pandas_df["float64_col"]),
+        "add_cols": np.add(scalars_pandas_df["float64_col"], scalars_pandas_df["int64_col"]),
+    }.items()))

References

To ensure dictionary keys remain sorted without manual effort, programmatically sort the dictionary instead of relying on manual ordering in the code.
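As a rough illustration of the `dict(sorted(...))` pattern in the suggestion, the sketch below builds expected values with NumPy ufuncs and orders the entries by key. The plain arrays `floats` and `ints` are assumptions standing in for the `float64_col` and `int64_col` columns of `scalars_pandas_df`; the real fixture is not reproduced here.

```python
import numpy as np

# Stand-in data (an assumption); the real test reads columns from a fixture.
floats = np.array([1.0, 4.0, 9.0])
ints = np.array([1, 2, 3])

# Sorting the (key, value) pairs orders the dict by key. The keys are unique,
# so the comparison never falls through to the array values.
expected = dict(sorted({
    "sqrt": np.sqrt(floats),
    "add_const": np.add(floats, 2.4),
    "radd_const": np.add(2.4, floats),
    "add_cols": np.add(floats, ints),
}.items()))

print(list(expected))
```

Since Python dicts preserve insertion order, the resulting keys iterate alphabetically regardless of the order in which the literal was written.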

feat(bigframes): Add numpy ufunc support to col expressions

48f7979

gemini-code-assist bot reviewed Apr 3, 2026

TrevorBergeron wants to merge 1 commit into main from tbergeron_col_numpy
