Skip to content
This repository was archived by the owner on Apr 1, 2026. It is now read-only.

Commit f1438c1

Browse files
update homepage
1 parent 772f2ac commit f1438c1

File tree

1 file changed

+37
-3
lines changed

1 file changed

+37
-3
lines changed

docs/index.rst

Lines changed: 37 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,16 +5,50 @@ Welcome to BigQuery DataFrames
55

66
**BigQuery DataFrames** (``bigframes``) provides a Pythonic interface for data analysis that scales to petabytes. It gives you the best of both worlds: the familiar API of **pandas** and **scikit-learn**, powered by the distributed computing engine of **BigQuery**.
77

8+
BigQuery DataFrames consists of three main components:
9+
10+
* **bigframes.pandas**: A pandas-compatible API for data exploration and transformation.
11+
* **bigframes.ml**: A scikit-learn-like interface for BigQuery ML, including integration with Gemini.
12+
* **bigframes.bigquery**: Specialized functions for managing BigQuery resources and deploying custom logic.
813

914
Why BigQuery DataFrames?
1015
------------------------
1116

1217
BigFrames allows you to process data where it lives. Instead of downloading massive datasets to your local machine, BigFrames translates your Python code into SQL and executes it across the BigQuery fleet.
1318

14-
* **Scalability:** Work with datasets that exceed local memory limits.
15-
* **Efficiency:** Minimize data movement and leverage BigQuery's query optimizer.
19+
* **Scalability:** Work with datasets that exceed local memory limits without complex refactoring.
20+
* **Collaboration & Extensibility:** Bridge the gap between Python and SQL. Deploy custom Python functions to BigQuery, making your logic accessible to SQL-based teammates and data analysts.
21+
* **Production-Ready Pipelines:** Move seamlessly from interactive notebooks to production. BigFrames simplifies data engineering by integrating with tools like **dbt** and **Airflow**, offering a simpler operational model than Spark.
22+
* **Security & Governance:** Keep your data within the BigQuery perimeter. Benefit from enterprise-grade security, auditing, and data governance throughout your entire Python workflow.
1623
* **Familiarity:** Use ``read_gbq``, ``merge``, ``groupby``, and ``pivot_table`` just like you do in pandas.
17-
* **Integrated ML:** Access BigQuery ML (BQML) capabilities through a familiar estimator-based interface.
24+
25+
Quickstart
26+
----------
27+
28+
Install the library via pip:
29+
30+
.. code-block:: bash
31+
32+
pip install --upgrade bigframes
33+
34+
Load and aggregate a public dataset in just a few lines:
35+
36+
.. code-block:: python
37+
38+
import bigframes.pandas as bpd
39+
40+
# Load data from BigQuery
41+
df = bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_2013")
42+
43+
# Perform familiar pandas operations at scale
44+
top_names = (
45+
df.groupby("name")
46+
.agg({"number": "sum"})
47+
.sort_values("number", ascending=False)
48+
.head(10)
49+
)
50+
51+
print(top_names.to_pandas())
1852
1953
2054
User Guide

0 commit comments

Comments
 (0)