|
17 | 17 | - **Natural Language Pipelines** — Describe what you need in plain text; the AI agent generates and executes a validated YAML pipeline |
18 | 18 | - **11 Composable Services** — Extract (CSV, SQL, API, Excel), Transform (clean, filter, join, quality checks, outlier detection, LLM), Load (CSV, Excel, JSON, Parquet) |
19 | 19 | - **High-Performance Data Transfer** — Apache Arrow IPC binary format between all services (zero-copy, no CSV/JSON parsing overhead) |
20 | | -- **Visual Pipeline Builder** — Streamlit UI with YAML editor, real-time execution monitor, dataset explorer (browse outputs, preview, download), and service catalog |
| 20 | +- **Visual Pipeline Builder** — Streamlit UI with YAML editor, platform readiness checks, one-click Airflow triggers, real-time execution monitor, dataset explorer (browse outputs, preview, download), and service catalog |
21 | 21 | - **Airflow Orchestration** — Production-ready DAGs with file-based XCom for large datasets |
22 | 22 | - **Full Observability** — Prometheus metrics + Grafana dashboards + structured JSON logging + correlation ID tracing |
23 | 23 | - **Extensible** — Add a new service in minutes using the included scaffold template and step-by-step guide |
@@ -64,7 +64,16 @@ Trigger one of the pre-built DAGs from the Airflow UI: |
64 | 64 |
|
65 | 65 | Or paste a YAML from [`examples/pipelines/`](examples/pipelines/) into the Streamlit YAML Editor. |
66 | 66 |
|
67 | | -After execution, switch to the **Datasets** tab to browse output files, preview data, and download results. |
| 67 | +After execution, switch to the **Datasets** tab to browse output files, preview data, download results, and compare the latest run against the previous successful run. |
| 68 | + |
| 69 | +### New in Streamlit UX |
| 70 | + |
| 71 | +- **Platform Readiness** panel in Execution tab: live checks for Airflow, Streamlit, Prometheus, and Grafana, including Airflow scheduler heartbeat status |
| 72 | +- **Quick Airflow Triggers** in Execution tab: trigger `hr_analytics_pipeline`, `ecommerce_pipeline`, or `weather_api_pipeline` without leaving Streamlit |
| 73 | +- **Execution insights**: successful steps, processed data volume, slowest step, and orchestration overhead (%) |
| 74 | +- **Run diagnostics** in Datasets tab: per-run timeline of active processing vs. queue/orchestration gaps |
| 75 | +- **Run Comparison** in Datasets tab: deltas between the current run and the previous successful run for duration, final rows, and removed outliers |
| 76 | +- **Business KPI snapshot** from latest output file (domain-aware: HR, e-commerce, weather, or generic completeness) |
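The diff does not define how "slowest step" and "orchestration overhead (%)" are calculated. One plausible reading, sketched below, treats overhead as the share of wall-clock run time not spent inside any step. `StepTiming` and `execution_insights` are hypothetical names for illustration, not taken from the repository, and the sketch assumes steps run sequentially.

```python
from dataclasses import dataclass


@dataclass
class StepTiming:
    # Hypothetical record; the real UI may derive these from Airflow task instances.
    name: str
    duration_s: float  # time the step spent actively processing
    succeeded: bool


def execution_insights(steps: list[StepTiming], wall_clock_s: float) -> dict:
    """Summarise a run: successful steps, slowest step, orchestration overhead."""
    if wall_clock_s <= 0:
        raise ValueError("wall_clock_s must be positive")
    active_s = sum(s.duration_s for s in steps)
    # Assumption: steps are sequential, so any time outside active
    # processing is queueing or scheduler overhead.
    overhead_s = max(wall_clock_s - active_s, 0.0)
    slowest = max(steps, key=lambda s: s.duration_s, default=None)
    return {
        "successful_steps": sum(1 for s in steps if s.succeeded),
        "slowest_step": slowest.name if slowest else None,
        "overhead_pct": round(100.0 * overhead_s / wall_clock_s, 1),
    }


steps = [
    StepTiming("extract", 4.0, True),
    StepTiming("transform", 10.0, True),
    StepTiming("load", 2.0, True),
]
insights = execution_insights(steps, wall_clock_s=20.0)
print(insights)
```

With parallel steps the sum of step durations can exceed wall-clock time, which is why the overhead is clamped at zero here; a real implementation would likely need interval merging instead.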
68 | 77 |
|
69 | 78 | --- |
70 | 79 |
|
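The Run Comparison feature described in this hunk compares the current run against the previous successful run. A minimal sketch of that logic follows, assuming a run history ordered oldest to newest. `RunSummary`, `previous_successful`, and `compare_runs` are hypothetical names, and the fields only mirror the three metrics listed in the changed lines.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RunSummary:
    # Hypothetical fields mirroring the metrics named in the README change.
    run_id: str
    succeeded: bool
    duration_s: float
    final_rows: int
    removed_outliers: int


def previous_successful(history: list[RunSummary]) -> Optional[RunSummary]:
    """Return the most recent successful run before the last entry, if any."""
    for run in reversed(history[:-1]):
        if run.succeeded:
            return run
    return None


def compare_runs(current: RunSummary, baseline: RunSummary) -> dict:
    """Deltas are current minus baseline; a negative duration means faster."""
    return {
        "duration_s": current.duration_s - baseline.duration_s,
        "final_rows": current.final_rows - baseline.final_rows,
        "removed_outliers": current.removed_outliers - baseline.removed_outliers,
    }


history = [
    RunSummary("run_1", True, 30.0, 1000, 12),
    RunSummary("run_2", False, 5.0, 0, 0),  # failed run is skipped as a baseline
    RunSummary("run_3", True, 24.0, 1040, 9),
]
baseline = previous_successful(history)
deltas = compare_runs(history[-1], baseline)
print(deltas)
```

Skipping failed runs when choosing the baseline keeps the deltas meaningful: comparing against a run that produced no output would inflate every metric.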
@@ -215,7 +224,7 @@ Results including PNG charts and an interactive Plotly report are saved to `benc |
215 | 224 | ### Testing |
216 | 225 |
|
217 | 226 | ```bash |
218 | | -make test # Run all 208 tests (unit + integration) |
| 227 | +make test # Run all tests (unit + integration) |
219 | 228 | make test-coverage # With coverage report |
220 | 229 | make lint # Ruff linter |
221 | 230 | ``` |
|