You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Apr 1, 2026. It is now read-only.
Copy file name to clipboardExpand all lines: docs/index.rst
+37-3Lines changed: 37 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,16 +5,50 @@ Welcome to BigQuery DataFrames
5
5
6
6
**BigQuery DataFrames** (``bigframes``) provides a Pythonic interface for data analysis that scales to petabytes. It gives you the best of both worlds: the familiar API of **pandas** and **scikit-learn**, powered by the distributed computing engine of **BigQuery**.
7
7
8
+
BigQuery DataFrames consists of three main components:
9
+
10
+
* **bigframes.pandas**: A pandas-compatible API for data exploration and transformation.
11
+
* **bigframes.ml**: A scikit-learn-like interface for BigQuery ML, including integration with Gemini.
12
+
* **bigframes.bigquery**: Specialized functions for managing BigQuery resources and deploying custom logic.
8
13
9
14
Why BigQuery DataFrames?
10
15
------------------------
11
16
12
17
BigFrames allows you to process data where it lives. Instead of downloading massive datasets to your local machine, BigFrames translates your Python code into SQL and executes it across the BigQuery fleet.
13
18
14
-
* **Scalability:** Work with datasets that exceed local memory limits.
15
-
* **Efficiency:** Minimize data movement and leverage BigQuery's query optimizer.
19
+
* **Scalability:** Work with datasets that exceed local memory limits without complex refactoring.
20
+
* **Collaboration & Extensibility:** Bridge the gap between Python and SQL. Deploy custom Python functions to BigQuery, making your logic accessible to SQL-based teammates and data analysts.
21
+
* **Production-Ready Pipelines:** Move seamlessly from interactive notebooks to production. BigFrames simplifies data engineering by integrating with tools like **dbt** and **Airflow**, offering a simpler operational model than Spark.
22
+
* **Security & Governance:** Keep your data within the BigQuery perimeter. Benefit from enterprise-grade security, auditing, and data governance throughout your entire Python workflow.
16
23
* **Familiarity:** Use ``read_gbq``, ``merge``, ``groupby``, and ``pivot_table`` just like you do in pandas.
17
-
* **Integrated ML:** Access BigQuery ML (BQML) capabilities through a familiar estimator-based interface.
24
+
25
+
Quickstart
26
+
----------
27
+
28
+
Install the library via pip:
29
+
30
+
.. code-block:: bash
31
+
32
+
pip install --upgrade bigframes
33
+
34
+
Load and aggregate a public dataset in just a few lines:
0 commit comments