Skip to content

Commit b73cc97

Browse files
authored
docs: update broken link paths
2 parents c121c8b + f0deac1 commit b73cc97

3 files changed

Lines changed: 5 additions & 2 deletions

File tree

README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -64,6 +64,8 @@ By leveraging the Polars Rust engine (Lazy API), the system achieves near-optima
6464
| 40M Snapshot (8GB / 4 vCPU) |
6565
| :---: |
6666
| ![engine-performance-8gb](assets/screenshots/engine-performance-8gb-4cpu.png) |
67+
> Benchmark data: [`40m_stats_log.csv`](assets/benchmarks/polars/)
68+
> Dataset : [`Dataset Information`](data/)
6769
6870
| Metric | Data |
6971
|:---|:---|
@@ -72,6 +74,7 @@ By leveraging the Polars Rust engine (Lazy API), the system achieves near-optima
7274
| Efficiency (Processing) | ~307k Rows / Second |
7375
| Total Runtime (Wall-Clock) | 130 Seconds |
7476

77+
7578
* **Maximized Memory Density:** The **Primitive Integer Pipeline** allows a ~5.34GB analytical model to process within the 8GB RAM limit by shrinking join-key overhead by ~16x.
7679
* **Near-Linear Performance Scaling:** The engine saturates available vCPUs, yielding high throughput during streaming execution.
7780
* **Zero-Idle Economics:** 100% serverless execution ensures zero billable time during idle periods.

assets/benchmarks/polars/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Measurement Methodology
22

3-
This section details the methodology used to capture the memory metrics in the [`GCP Stress-Test Metrics (Scaling Efficiency)`](../../../README.md#gcp-stress-test-metrics-scaling-efficiency)
3+
This section details the methodology used to capture the memory metrics in the [`GCP Stress-Test Metrics (Scaling Efficiency)`](../../../README.md###gcp-stress-test-metrics-scaling-efficiency)
44

55
The telemetry logger below was added to the orchestrator for a specific benchmarking run.
66

data/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
This directory serves as the local state provider for the pipeline when executing in a non-cloud environment. It mimics the structure of the Google Cloud Storage (GCS) buckets.
44

55
## Synthetic Dataset
6-
To replicate the high-volume environment described in the [GCP Stress-Test Metrics (Scaling Efficiency)](/README.md#gcp-stress-test-metrics-scaling-efficiency) section, you can download the 40M-row synthetic dataset here: [**Kaggle Dataset Link**](https://www.kaggle.com/datasets/melvidabryan/e-commerce-synthetic-dataset)
6+
To replicate the high-volume environment described in the [GCP Stress-Test Metrics (Scaling Efficiency)](/README.md###GCP-Stress-Test-Metrics) section, you can download the 40M-row synthetic dataset here: [**Kaggle Dataset Link**](https://www.kaggle.com/datasets/melvidabryan/e-commerce-synthetic-dataset)
77

88
> *Note: This upload contains the **Contracted Version** of the dataset. The original "Raw" state, totaling approximately ~26GB of unrefined CSVs was omitted to prioritize transfer efficiency.*
99

0 commit comments

Comments
 (0)