coding-kitties
diff --git a/‎.gitignore‎
Lines changed: 1 addition & 0 deletions b/‎.gitignore‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docusaurus/docs/Advanced Concepts/pipelines-event-backtest.md‎
Lines changed: 239 additions & 0 deletions b/‎docusaurus/docs/Advanced Concepts/pipelines-event-backtest.md‎
Lines changed: 239 additions & 0 deletions
diff --git a/‎docusaurus/docs/Advanced Concepts/pipelines-live.md‎
Lines changed: 53 additions & 0 deletions b/‎docusaurus/docs/Advanced Concepts/pipelines-live.md‎
Lines changed: 53 additions & 0 deletions
diff --git a/‎docusaurus/docs/Advanced Concepts/pipelines-vector-backtest.md‎
Lines changed: 46 additions & 0 deletions b/‎docusaurus/docs/Advanced Concepts/pipelines-vector-backtest.md‎
Lines changed: 46 additions & 0 deletions
@@ -164,4 +164,5 @@ venv
 examples/tutorial/data
 examples/tutorial/backtest_results
 examples/tutorial/resources
+examples/**/resources/
 .data_cache/
@@ -0,0 +1,239 @@
+---
+sidebar_position: 10
+title: Pipelines — Event-driven backtest
+description: Use cross-sectional pipelines in the event-driven backtest engine (Phase 1, available today).
+---
+
+# Pipelines: Event-driven backtest
+
+This is the full Phase 1 reference. Read [Pipelines](pipelines.md) first
+for the high-level concept.
+
+## Quick start
+
+```python
+from typing import Any, Dict
+
+from investing_algorithm_framework import (
+    AverageDollarVolume,
+    BacktestDateRange,
+    Context,
+    DataSource,
+    Pipeline,
+    Returns,
+    TimeUnit,
+    TradingStrategy,
+    create_app,
+)
+
+
+class MomentumScreener(Pipeline):
+    dollar_volume = AverageDollarVolume(window=30)
+    momentum = Returns(window=30)
+
+    universe = dollar_volume.top(3)
+    alpha = momentum.rank(mask=universe)
+
+
+class CrossSectionalMomentum(TradingStrategy):
+    algorithm_id = "cross-sectional-momentum"
+    time_unit = TimeUnit.DAY
+    interval = 1
+    data_sources = [
+        DataSource(
+            data_type="OHLCV",
+            market="binance",
+            symbol=symbol,
+            warmup_window=60,
+            time_frame="1d",
+            identifier=f"{symbol}-ohlcv",
+        )
+        for symbol in ["BTC/EUR", "ETH/EUR", "SOL/EUR", "ADA/EUR", "XRP/EUR"]
+    ]
+    pipelines = [MomentumScreener]
+
+    def run_strategy(self, context: Context, data: Dict[str, Any]):
+        screen = data["MomentumScreener"]
+        top = screen.sort("alpha", descending=True).head(2)
+        for row in top.iter_rows(named=True):
+            print(row["symbol"], row["momentum"], row["alpha"])
+
+
+app = create_app()
+app.add_strategy(CrossSectionalMomentum)
+app.add_market(market="binance", trading_symbol="EUR", initial_balance=1000)
+
+if __name__ == "__main__":
+    app.run_backtest(
+        backtest_date_range=BacktestDateRange(
+            start_date="2024-01-01", end_date="2024-06-01"
+        ),
+    )
+```
+
+A complete runnable example lives in
+[`examples/pipeline_momentum_screener.py`](https://github.com/coding-kitties/investing-algorithm-framework/blob/dev/examples/pipeline_momentum_screener.py).
+
+## How it works
+
+On every iteration of the event loop:
+
+1. The framework discovers your strategy's OHLCV data sources from
+   `strategy.data_sources` (filtered by `DataType.OHLCV`).
+2. For each `Pipeline` listed in `strategy.pipelines`:
+   - Each per-symbol OHLCV frame is converted to Polars (Pandas inputs
+     are auto-converted) and lower-cased.
+   - The frames are stacked into a long-form panel and **truncated at
+     the current bar** (`datetime <= as_of`) — guaranteed no
+     look-ahead.
+   - Each declared `Factor` is computed in vectorised Polars over the
+     full panel.
+   - The pipeline's `universe` filter (if any) is applied as a top-level
+     mask; symbols failing the mask are dropped.
+   - The frame is sliced to the current bar, and the result is stored
+     under `data["YourPipelineClassName"]`.
+
+The output is a `polars.DataFrame` with columns:
+
+```
+symbol  | <factor_1> | <factor_2> | ... | <factor_n>
+```
+
+Symbols with no data at the current bar (e.g. listed late) are dropped.
+
+## Declaring a pipeline
+
+```python
+from investing_algorithm_framework import (
+    AverageDollarVolume, Pipeline, Returns, RSI, SMA, Volatility,
+)
+
+class MyScreen(Pipeline):
+    # Any factor declared as a class attribute becomes an output column
+    # named after the attribute.
+    momentum = Returns(window=30)
+    rsi = RSI(window=14)
+    vol = Volatility(window=30)
+
+    # Optional: a Filter assigned to `universe` becomes the master mask.
+    # Every other column is restricted to symbols where universe is True.
+    # The universe column itself is NOT exposed in the output.
+    universe = AverageDollarVolume(window=30).top(100)
+
+    # `rank` works inside the universe.
+    alpha = momentum.rank(mask=universe)
+```
+
+Rules enforced at class definition time:
+
+- A pipeline must declare at least one factor column (otherwise
+  `TypeError`).
+- The `universe` attribute, if present, **must** be a `Filter` (e.g.
+  `factor.top(n)` / `factor.bottom(n)`); using a plain `Factor` raises
+  `TypeError`.
+- Factors and filters are inherited via the MRO; subclass declarations
+  override parent ones with the same name.
+
+## Built-in factors
+
+| Class | Inputs | Notes |
+| --- | --- | --- |
+| `Returns(window)` | close | `close[t] / close[t - window] - 1` |
+| `AverageDollarVolume(window)` | close, volume | rolling mean of `close * volume` |
+| `SMA(window)` | close | simple moving average |
+| `RSI(window)` | close | Wilder's RSI; clamps to 100 when there are no losses |
+| `Volatility(window, periods_per_year=252)` | close | rolling stdev of log returns × √periods_per_year |
+
+All built-ins compute per-symbol via `over("symbol")` so symbols are
+independent.
+
+## Custom factors
+
+Subclass `CustomFactor` and implement `compute_panel`:
+
+```python
+import polars as pl
+from investing_algorithm_framework import CustomFactor
+
+class HighLowRange(CustomFactor):
+    inputs = ["high", "low"]
+    window = 1
+
+    def compute_panel(self, panel: pl.DataFrame) -> pl.Series:
+        return (panel["high"] - panel["low"]).rename("range")
+```
+
+`compute_panel` receives the full long-form panel and must return a
+`pl.Series` aligned with the panel rows. Set:
+
+- `inputs` — the OHLCV columns you read from the panel.
+- `window` — the lookback in bars (used for warmup sizing checks in
+  future phases; also exposed via `pipeline.required_window()`).
+
+## Cross-sectional ops
+
+```python
+factor.rank(mask=optional_filter)   # ascending ordinal rank per bar
+factor.top(n)                       # mask: top-n per bar by descending value
+factor.bottom(n)                    # mask: bottom-n per bar by ascending value
+```
+
+`rank` returns ordinal ranks (1, 2, 3, …) within each `datetime`. With
+a `mask`, symbols outside the mask receive `null`.
+
+## Reading the result
+
+```python
+def run_strategy(self, context, data):
+    screen: pl.DataFrame = data["MomentumScreener"]
+    if screen.is_empty():
+        return  # universe drained or warmup not yet satisfied
+
+    top = screen.sort("alpha", descending=True).head(5)
+    symbols = top["symbol"].to_list()
+```
+
+Common patterns:
+
+```python
+# Symbol → row dict
+rows = {row["symbol"]: row for row in screen.iter_rows(named=True)}
+
+# To pandas if you prefer:
+pdf = screen.to_pandas()
+```
+
+## Performance notes
+
+Phase 1 is **eager**: the panel is rebuilt on every iteration. That is
+fine for daily/hourly backtests with up to a few hundred symbols. If
+you need to push further, Phase 2 ([#502](https://github.com/coding-kitties/investing-algorithm-framework/issues/502))
+introduces a vector-mode pipeline executor that materialises factors
+once over the full backtest window.
+
+## Limitations (Phase 1)
+
+- No factor arithmetic (`a + b`, `a / b`, `(a - b).zscore()`); use a
+  `CustomFactor` for now.
+- No cached results between bars (rebuilt each iteration).
+- Only OHLCV inputs. External data joining is on the roadmap.
+
+These are intentional — the goal of Phase 1 is to nail down the public
+declarative surface (`Pipeline`, `Factor`, `Filter`, `top` / `bottom` /
+`rank`) before scaling the executor.
+
+## Troubleshooting
+
+- **`TypeError: <Pipeline>.universe must be a Filter`** — assign a
+  `Filter` (e.g. `factor.top(100)`), not a raw `Factor`.
+- **`KeyError: ... missing required column 'volume'`** — your data
+  source is not OHLCV; pipelines need full OHLCV frames.
+- **Empty result frame** — either your warmup hasn't yet satisfied the
+  largest `window`, or your universe filtered every symbol out.
+
+## See also
+
+- [Pipelines](pipelines.md) — concept page.
+- [Pipelines: Vector backtest](pipelines-vector-backtest.md) — Phase 2 roadmap.
+- [Pipelines: Live trading](pipelines-live.md) — Phase 3 roadmap.
+- Design doc: [`docs/design/pipeline-api.md`](https://github.com/coding-kitties/investing-algorithm-framework/blob/dev/docs/design/pipeline-api.md).
@@ -0,0 +1,53 @@
+---
+sidebar_position: 12
+title: Pipelines — Live trading (roadmap)
+description: Pipelines in live and paper trading. Tracked under #503.
+---
+
+# Pipelines: Live trading
+
+:::info Status: hardening (Phase 3)
+
+Because pipelines run inside the same event loop as backtests, they
+**already execute** when you run a strategy live or in paper mode.
+However, Phase 3 covers the production-grade work needed to declare
+this officially supported: streaming-friendly panel updates, tighter
+warmup-window guarantees, and explicit market-hours / partial-bar
+handling.
+
+Tracked under
+[#503](https://github.com/coding-kitties/investing-algorithm-framework/issues/503).
+:::
+
+## What works today
+
+Strategies with `pipelines = [...]` run end-to-end against any
+configured live market. The pipeline output appears in
+`data["YourPipelineClassName"]` exactly like in a backtest.
+
+If you want to experiment, the same example covers both:
+[`examples/pipeline_momentum_screener.py`](https://github.com/coding-kitties/investing-algorithm-framework/blob/dev/examples/pipeline_momentum_screener.py).
+
+## What's planned for Phase 3
+
+- A streaming panel that appends new bars instead of rebuilding from
+  scratch.
+- Validation of `warmup_window` against `pipeline.required_window()`
+  so you get a clear error at startup if any data source is too short.
+- First-class handling of partial bars (so you don't accidentally trade
+  on an unclosed candle).
+- Live observability hooks for pipeline output (print/log/snapshot).
+
+## What stays the same
+
+The same `Pipeline` subclasses you use in backtests run live.
+
+## Want to help?
+
+Track or comment on the implementation issue:
+[#503 — Pipeline API: Phase 3 (live hardening)](https://github.com/coding-kitties/investing-algorithm-framework/issues/503).
+
+## See also
+
+- [Pipelines](pipelines.md) — concept page.
+- [Pipelines: Event-driven backtest](pipelines-event-backtest.md) — full Phase 1 reference.
@@ -0,0 +1,46 @@
+---
+sidebar_position: 11
+title: Pipelines — Vector backtest (roadmap)
+description: Vector-mode pipeline execution. Tracked under #502.
+---
+
+# Pipelines: Vector backtest
+
+:::info Status: not yet shipped (Phase 2)
+
+Vector-mode pipelines are tracked under
+[#502](https://github.com/coding-kitties/investing-algorithm-framework/issues/502).
+The public API (`Pipeline`, `Factor`, `Filter`) defined in
+[Pipelines: Event-driven backtest](pipelines-event-backtest.md) is
+intentionally engine-agnostic, so strategies you write against Phase 1
+will keep working when Phase 2 lands.
+:::
+
+## What's planned
+
+- A vector executor that materialises every factor in the pipeline
+  once over the **entire** backtest window, instead of rebuilding the
+  panel on each event.
+- Integration with the existing vector backtester (see
+  [Vector backtesting](vector-backtesting.md)).
+- Cached intermediate frames so a `rank` of a `Returns` doesn't
+  recompute returns.
+- Optional Polars **lazy** execution path for memory-bound runs.
+
+## What stays the same
+
+The strategy author surface — declaring a `Pipeline` subclass, listing
+it on `strategy.pipelines`, and reading
+`data["YourPipelineClassName"]` inside `run_strategy` — does not
+change. Switching from event mode to vector mode is meant to be a
+runner choice, not a strategy rewrite.
+
+## Want to help?
+
+Track or comment on the implementation issue:
+[#502 — Pipeline API: Phase 2 (vector executor)](https://github.com/coding-kitties/investing-algorithm-framework/issues/502).
+
+## See also
+
+- [Pipelines](pipelines.md) — concept page.
+- [Pipelines: Event-driven backtest](pipelines-event-backtest.md) — what works today.