Skip to content

Commit c980a16

Browse files
committed
extra?
1 parent 3ba85f0 commit c980a16

1 file changed

Lines changed: 0 additions & 46 deletions

File tree

mkdocs/docs/api.md

Lines changed: 0 additions & 46 deletions
Original file line numberDiff line numberDiff line change
@@ -1564,52 +1564,6 @@ print(ray_dataset.take(2))
15641564
]
15651565
```
15661566

1567-
### Bodo
1568-
1569-
PyIceberg interfaces closely with Bodo Dataframes (see [Bodo Iceberg Quick Start](https://docs.bodo.ai/latest/quick_start/quickstart_local_iceberg/)),
1570-
which provides a drop-in replacement for Pandas that applies query, compiler and HPC optimizations automatically.
1571-
Bodo accelerates and scales Python code from single laptops to large clusters without code rewrites.
1572-
1573-
<!-- prettier-ignore-start -->
1574-
1575-
!!! note "Requirements"
1576-
This requires [`bodo` to be installed](index.md).
1577-
1578-
```python
1579-
pip install pyiceberg['bodo']
1580-
```
1581-
<!-- prettier-ignore-end -->
1582-
1583-
A table can be read easily into a Bodo Dataframe to perform Pandas operations:
1584-
1585-
```python
1586-
df = table.to_bodo() # equivalent to `bodo.pandas.read_iceberg_table(table)`
1587-
df = df[df["trip_distance"] >= 10.0]
1588-
df = df[["VendorID", "tpep_pickup_datetime", "tpep_dropoff_datetime"]]
1589-
print(df)
1590-
```
1591-
1592-
This creates a lazy query, optimizes it, and runs it on all available cores (print triggers execution):
1593-
1594-
```python
1595-
VendorID tpep_pickup_datetime tpep_dropoff_datetime
1596-
0 2 2023-01-01 00:27:12 2023-01-01 00:49:56
1597-
1 2 2023-01-01 00:09:29 2023-01-01 00:29:23
1598-
2 1 2023-01-01 00:13:30 2023-01-01 00:44:00
1599-
3 2 2023-01-01 00:41:41 2023-01-01 01:19:32
1600-
4 2 2023-01-01 00:22:39 2023-01-01 01:30:45
1601-
... ... ... ...
1602-
245478 2 2023-01-31 22:32:57 2023-01-31 23:01:48
1603-
245479 2 2023-01-31 22:03:26 2023-01-31 22:46:13
1604-
245480 2 2023-01-31 23:25:56 2023-02-01 00:05:42
1605-
245481 2 2023-01-31 23:18:00 2023-01-31 23:46:00
1606-
245482 2 2023-01-31 23:18:00 2023-01-31 23:41:00
1607-
1608-
[245483 rows x 3 columns]
1609-
```
1610-
1611-
Bodo is optimized to take advantage of Iceberg features such as hidden partitioning and various statistics for efficient reads.
1612-
16131567
### Daft
16141568

16151569
PyIceberg interfaces closely with Daft Dataframes (see also: [Daft integration with Iceberg](https://docs.daft.ai/en/stable/io/iceberg/)) which provides a full lazily optimized query engine interface on top of PyIceberg tables.

0 commit comments

Comments
 (0)