dougborg
diff --git a/‎katana_mcp_server/docs/adr/0019-tool-description-batch-conventions.md‎
Lines changed: 96 additions & 100 deletions b/‎katana_mcp_server/docs/adr/0019-tool-description-batch-conventions.md‎
Lines changed: 96 additions & 100 deletions
diff --git a/‎katana_mcp_server/docs/adr/0020-consistent-tool-surface-and-cache-unification.md‎
Lines changed: 17 additions & 2 deletions b/‎katana_mcp_server/docs/adr/0020-consistent-tool-surface-and-cache-unification.md‎
Lines changed: 17 additions & 2 deletions
@@ -16,27 +16,26 @@ shape exists. Three things drove that fallback.
 **Batch capability buried mid-docstring.** First-line summaries said what the tool
 *returned* but not that it accepted a batch. Agents sized the input to the field they
 saw on the first read pass and didn't scroll for a "Note: this also accepts a list"
-buried five paragraphs later. PR #393 moved batch hints into the first sentence;
-without a written rule the next tool will drift back.
+buried five paragraphs later. PR #393 moved batch hints into the first sentence; without
+a written rule the next tool will drift back.
 
-**Parallel singular + plural fields.** Tools like the original `check_inventory`
-exposed `sku` + `skus` + `variant_id` + `variant_ids` — four fields, mutually
-exclusive, runtime-validated. Agents guessed the wrong combination, sent both, or
-sent neither. PR #397 collapsed those four into a single
-`skus_or_variant_ids: list[str | int]`. Same fork in the road for every future tool;
-without a rule, contributors will reach for the four-field pattern again because
-"singular + plural" looks more discoverable.
+**Parallel singular + plural fields.** Tools like the original `check_inventory` exposed
+`sku` + `skus` + `variant_id` + `variant_ids` — four fields, mutually exclusive,
+runtime-validated. Agents guessed the wrong combination, sent both, or sent neither. PR
+#397 collapsed those four into a single `skus_or_variant_ids: list[str | int]`. Same
+fork in the road for every future tool; without a rule, contributors will reach for the
+four-field pattern again because "singular + plural" looks more discoverable.
 
 **`get_*` tools didn't advertise their `list_*` partner.** When an agent had multiple
 IDs in hand and reached for `get_sales_order`, nothing in the docstring pointed them at
 `list_sales_orders(ids=[...])` for a single round-trip. PR #397 added the cross-
 reference; without a rule, future tool pairs won't get it.
 
 **Help-resource drift.** `katana_mcp_server/src/katana_mcp/resources/help.py` mirrors
-the tool docstrings as a top-level resource. The PR #397 review caught it
-out-of-sync with the source-file docstrings — three workflow examples still used the
-old `{"sku": "WIDGET"}` shape after the schema collapse. Without a written sync
-expectation, this will recur on every tool change.
+the tool docstrings as a top-level resource. The PR #397 review caught it out-of-sync
+with the source-file docstrings — three workflow examples still used the old
+`{"sku": "WIDGET"}` shape after the schema collapse. Without a written sync expectation,
+this will recur on every tool change.
 
 ## Decision
 
@@ -57,16 +56,16 @@ class CheckInventoryRequest(BaseModel):
         min_length=1,
         description=(
             "SKUs (strings) or variant IDs (integers) to check — mix freely. "
-            "Pass one for a detailed stock card; pass many for a summary table. "
-            "Batching N items in a single call beats N separate invocations. "
-            "Output order matches input order."
+            "Pass one for a detailed Prefab stock card; pass many for a JSON "
+            "envelope of stock rows. Batching N items in a single call beats N "
+            "separate invocations. Output order matches input order."
         ),
     )
 ```
 
 The field is **required** (no `default_factory=list` — see PR #397 review) so the MCP
-schema and Pydantic validation agree the field is required. The implementation
-branches on `isinstance(item, str)` to dispatch.
+schema and Pydantic validation agree the field is required. The implementation branches
+on `isinstance(item, str)` to dispatch.
 
 **Homogeneous batch on a `list_*` tool — pass `ids=[...]`.**
 
@@ -78,20 +77,20 @@ class ListSalesOrdersRequest(BaseModel):
     )
 ```
 
-This is the cache-back pattern (PR #388 onward): every cache-backed `list_<entity>`
-tool exposes an `ids` filter that runs as `WHERE id IN (...)` against the typed
-cache. Agents holding a batch of IDs use the list tool instead of calling
-`get_<entity>` N times. **Exception:** `list_stock_transfers` does not yet expose
-`ids` (tracked as a follow-up — agents fall back to per-transfer
-`stock_transfer_number` filters until it lands). When adding `ids` to a list tool,
-update its docstring to the standard `ids=[1,2,3]` opening (see section 2).
+This is the cache-back pattern (PR #388 onward): every cache-backed `list_<entity>` tool
+exposes an `ids` filter that runs as `WHERE id IN (...)` against the typed cache. Agents
+holding a batch of IDs use the list tool instead of calling `get_<entity>` N times.
+**Exception:** `list_stock_transfers` does not yet expose `ids` (tracked as a follow-up
+— agents fall back to per-transfer `stock_transfer_number` filters until it lands). When
+adding `ids` to a list tool, update its docstring to the standard `ids=[1,2,3]` opening
+(see section 2).
 
-**What we don't do: parallel singular + plural fields.** No `sku` + `skus`,
-`variant_id` + `variant_ids`, `customer_id` + `customer_ids` on the request schema.
+**What we don't do: parallel singular + plural fields.** No `sku` + `skus`, no
+`variant_id` + `variant_ids`, no `customer_id` + `customer_ids` on the request schema.
 The four-field shape is mutually-exclusive at runtime, opaque at the MCP-schema layer,
 and the source of the agent fallback that motivated PR #397. Always a single field —
-either a list (always — `min_length=1` enforces non-empty when the field is required)
-or a heterogeneous list with a discriminator type.
+either a list (always — `min_length=1` enforces non-empty when the field is required) or
+a heterogeneous list with a discriminator type.
 
 ### 2. Docstring opening-sentence pattern
 
@@ -107,36 +106,34 @@ The em-dash (`—`, U+2014) separates the purpose clause from the batch hook. Th
 parenthetical notes the implementation (cache-backed, indexed SQL, etc.) when it
 materially affects the call shape.
 
-For `list_*` tools that don't yet expose `ids` (currently only
-`list_stock_transfers`), the hook describes the value of getting multiple rows
-back — keeping the same em-dash shape so the convention is recognizable:
+For `list_*` tools that don't yet expose `ids` (currently only `list_stock_transfers`),
+the hook describes the value of getting multiple rows back — keeping the same em-dash
+shape so the convention is recognizable:
 
 ```
 List stock transfers with filters — returns multiple transfers for discovery or bulk review.
 ```
 
-This is a **documented exception**, not a target. Adding `ids` to
-`list_stock_transfers` (and migrating to the standard `ids=[1,2,3]` form) is
-out-of-scope for this ADR — file a feature issue if/when the migration is
-prioritised.
+This is a **documented exception**, not a target. Adding `ids` to `list_stock_transfers`
+(and migrating to the standard `ids=[1,2,3]` form) is out-of-scope for this ADR — file a
+feature issue if/when the migration is prioritised.
 
-**`check_*` / `search_*` / single-item tools — short purpose line, then an
-input-shape paragraph that includes the em-dash batch hint.**
+**`check_*` / `search_*` / single-item tools — short purpose line, then an input-shape
+paragraph that includes the em-dash batch hint.**
 
 ```
 Check current stock levels for one or more SKUs or variant IDs.
 
 Pass a list of SKUs (strings) or variant IDs (integers) — or mix both — to
-``skus_or_variant_ids``. A single item returns a rich stock card; multiple
-items return a summary table. Batching N checks in one call is faster than
-N separate invocations.
+``skus_or_variant_ids``. A single item returns a rich Prefab stock card;
+multiple items return a JSON envelope of stock rows (see ADR-0020 / #719).
+Batching N checks in one call is faster than N separate invocations.
 ```
 
-Two paragraphs: a short purpose line first (no em-dash on this line — it just
-states what the tool does), then the input-shape paragraph that uses the em-dash
-to introduce the batch hint and the field name. `get_*` tools follow the same
-two-paragraph shape but their input-shape paragraph is the cross-reference
-described in section 3.
+Two paragraphs: a short purpose line first (no em-dash on this line — it just states
+what the tool does), then the input-shape paragraph that uses the em-dash to introduce
+the batch hint and the field name. `get_*` tools follow the same two-paragraph shape but
+their input-shape paragraph is the cross-reference described in section 3.
 
 ### 3. `get_*` ↔ `list_*` cross-references
 
@@ -145,67 +142,67 @@ cross-reference as the second paragraph of its docstring:
 
 ```
 For multiple <entities> at once, use ``list_<entity>(ids=[...])`` —
-it returns a summary table and supports all the same filters.
+it returns a JSON envelope of rows and supports all the same filters.
 ```
 
-Use double-backticks (RST inline-literal form) for the call; em-dash
-continuation. The cross-reference lives *after* the first-line summary and
-*before* the parameters / returns sections.
+Use double-backticks (RST inline-literal form) for the call; em-dash continuation. The
+cross-reference lives *after* the first-line summary and *before* the parameters /
+returns sections.
 
 When a `get_*` tool's per-call cost is non-trivial (e.g.,
 `get_manufacturing_order_recipe` fetches inline rows that aren't cached), the cross-
-reference also explains the cost trade-off: agents holding a batch of IDs should
-prefer the list tool's summary path unless they specifically need the rich detail.
+reference also explains the cost trade-off: agents holding a batch of IDs should prefer
+the list tool's summary path unless they specifically need the rich detail.
 
 **Exception** — `get_manufacturing_order_recipe` cross-references
-`get_manufacturing_order` (another `get_*` tool, not a `list_*`) because no
-`list_*` partner exists for inline recipe rows. The cross-reference still lives
-in the second paragraph; only the target tool differs from the standard form.
+`get_manufacturing_order` (another `get_*` tool, not a `list_*`) because no `list_*`
+partner exists for inline recipe rows. The cross-reference still lives in the second
+paragraph; only the target tool differs from the standard form.
 
 ### 4. Help-resource sync expectation
 
 `katana_mcp_server/src/katana_mcp/resources/help.py` is a hardcoded mirror of the tool
-docstrings — it powers the `katana://help` resource that agents read during
-onboarding. When a tool docstring or request schema changes:
+docstrings — it powers the `katana://help` resource that agents read during onboarding.
+When a tool docstring or request schema changes:
 
-- **Workflow examples** that reference the changed tool must be updated to use the
-  new field names / signatures.
+- **Workflow examples** that reference the changed tool must be updated to use the new
+  field names / signatures.
 - **Parameter sections** that enumerate the tool's fields must be re-aligned.
 - The change lands in the same PR as the tool change — never as a follow-up.
 
-PR #397 review caught three stale `{"sku": "WIDGET"}` workflow examples after the
-schema collapse landed in the source-file docstrings; treat that as the canonical
-miss to avoid. The `pr-preparer` agent's project-specific PR-readiness check is
-expected to flag help-resource drift; if a future drift slips past it, fold the
-detection into the agent's brief.
+PR #397 review caught three stale `{"sku": "WIDGET"}` workflow examples after the schema
+collapse landed in the source-file docstrings; treat that as the canonical miss to
+avoid. The `pr-preparer` agent's project-specific PR-readiness check is expected to flag
+help-resource drift; if a future drift slips past it, fold the detection into the
+agent's brief.
 
 ## Consequences
 
 ### Positive
 
-- Agents see batch capability in the first sentence, before deciding on N
-  single-item calls.
+- Agents see batch capability in the first sentence, before deciding on N single-item
+  calls.
 - Heterogeneous batch fields stop generating four-way fork choices for contributors
   adding new tools.
-- `get_*` ↔ `list_*` partner discovery is part of the tool itself, not the agent's
-  prior knowledge.
+- `get_*` ↔ `list_*` partner discovery is part of the tool itself, not the agent's prior
+  knowledge.
 - Help-resource drift is closed at PR review time, before it ships.
 
 ### Negative
 
-- One more thing for new tools to comply with — easy to miss until a reviewer
-  mentions it.
-- Existing tools that don't yet match the conventions look like outliers; back-fill
-  is gradual, not all-at-once.
+- One more thing for new tools to comply with — easy to miss until a reviewer mentions
+  it.
+- Existing tools that don't yet match the conventions look like outliers; back-fill is
+  gradual, not all-at-once.
 - The "single field" rule means that adding a homogeneous batch shape to a tool that
-  currently takes a single value is a breaking change to the request schema — see
-  PR #397 for the precedent.
+  currently takes a single value is a breaking change to the request schema — see PR
+  #397 for the precedent.
 
 ### Neutral
 
-- The conventions are codified in this ADR, not in lint rules. Future contributors
-  rely on review and on the `code-modernizer` agent's brief; if drift recurs, escalate
-  to a `code-modernizer` rewrite pass.
+- The conventions are codified in this ADR, not in lint rules. Future contributors rely
+  on review and on the `code-modernizer` agent's brief; if drift recurs, escalate to a
+  `code-modernizer` rewrite pass.
 
 ## Examples in tree
 
@@ -214,12 +211,12 @@ The following tools follow all four conventions and serve as canonical reference
 - `check_inventory` (`katana_mcp_server/.../foundation/inventory.py`) — heterogeneous
   collapse (`skus_or_variant_ids: list[str | int]`), order-preserving impl,
   required-field semantics.
-- `list_sales_orders` (`.../foundation/sales_orders.py`) — first-line `ids=[...]`
-  hook on a cache-backed list tool.
+- `list_sales_orders` (`.../foundation/sales_orders.py`) — first-line `ids=[...]` hook
+  on a cache-backed list tool.
 - `get_sales_order` (same file) — second-paragraph cross-reference to
   `list_sales_orders(ids=[...])`.
-- `list_manufacturing_orders` / `get_manufacturing_order` — same pair pattern with
-  the recipe-row caveat called out in `get_manufacturing_order_recipe`.
+- `list_manufacturing_orders` / `get_manufacturing_order` — same pair pattern with the
+  recipe-row caveat called out in `get_manufacturing_order_recipe`.
 
 When adding a new tool, copy the shape from one of these and adjust.
 
@@ -228,36 +225,35 @@ When adding a new tool, copy the shape from one of these and adjust.
 ### Alternative 1: Lint rule (ruff custom check / generated docstring linter)
 
 Catch first-sentence drift, parallel singular+plural fields, and missing cross-
-references mechanically. **Rejected for now** — the conventions aren't yet stable
-enough across the surface area of the tool set, and a lint rule that fires on every
-new tool's first commit would generate more friction than value. Revisit if drift
-recurs after this ADR lands; the bar for upgrading is "we reverted to the four-field
-pattern in a real PR."
+references mechanically. **Rejected for now** — the conventions aren't yet stable enough
+across the surface area of the tool set, and a lint rule that fires on every new tool's
+first commit would generate more friction than value. Revisit if drift recurs after this
+ADR lands; the bar for upgrading is "we reverted to the four-field pattern in a real
+PR."
 
 ### Alternative 2: Tool-description templates / scaffolding
 
-Add a `scripts/new_tool.py` that emits a docstring shell with the conventions baked
-in. **Rejected** — the surface area of differences between tools is too high for one
-scaffold to be useful (`list_*` vs. `check_*` vs. `get_*` vs. `create_*` /
-`update_*` / `delete_*` all have different opening shapes). Revisit when the
-write-tool conventions are also documented.
+Add a `scripts/new_tool.py` that emits a docstring shell with the conventions baked in.
+**Rejected** — the surface area of differences between tools is too high for one
+scaffold to be useful (`list_*` vs. `check_*` vs. `get_*` vs. `create_*` / `update_*` /
+`delete_*` all have different opening shapes). Revisit when the write-tool conventions
+are also documented.
 
 ### Alternative 3: Generate help.py from the tool docstrings
 
 Eliminate the manual sync requirement (section 4) by extracting the workflow examples
-and parameter sections from the tool docstrings at build time. **Rejected for
-now** — would require docstring sub-section markers (`:: Examples ::`,
-`:: Workflows ::`) that don't exist today, and the `katana://help` resource needs
-high-level workflow narrative that doesn't belong in any single tool's docstring.
-Track separately if the manual-sync bug recurs.
+and parameter sections from the tool docstrings at build time. **Rejected for now** —
+would require docstring sub-section markers (`:: Examples ::`, `:: Workflows ::`) that
+don't exist today, and the `katana://help` resource needs high-level workflow narrative
+that doesn't belong in any single tool's docstring. Track separately if the manual-sync
+bug recurs.
 
 ## References
 
 - PR #393 — surfaced batch shapes via first-line docstring updates (precursor)
-- PR #397 — schema collapse for `check_inventory`; reviewer flagged the missing
-  ADR; this ADR is the deferred Phase 1 work
-- ADR-0017 — automated tool documentation (the layered docs strategy this ADR
-  refines)
+- PR #397 — schema collapse for `check_inventory`; reviewer flagged the missing ADR;
+  this ADR is the deferred Phase 1 work
+- ADR-0017 — automated tool documentation (the layered docs strategy this ADR refines)
 - ADR-0016 — tool interface pattern (the Annotated/Unpack/Pydantic shape these
   conventions live on top of)
 - Issue #398 — this ADR's tracking issue
 
@@ -207,8 +207,23 @@ tracker), referenced from each umbrella issue.
 - **Bigger maintained surface.** Going from "modify is consistent" to "the whole surface
   is consistent" means more tools, more tests, more help-resource entries. The
   discipline is to keep them mechanically uniform — same param shapes (especially
-  `ids: CoercedIntListOpt`, `format: "markdown" | "json"`, `limit` / `page` semantics),
-  same response shapes, same empty-state UX.
+  `ids: CoercedIntListOpt`, `limit` / `page` semantics), same response shapes, same
+  empty-state UX. The `format: "markdown" | "json"` parameter that originally appeared
+  here was removed in #719 — every tool now returns JSON-only content. When a response
+  emits a Prefab card, `structured_content` carries the `PrefabApp` envelope (built via
+  `make_tool_result(response, ui=app)`); when a response is data-only,
+  `structured_content` carries the payload dict — typically built via
+  `make_json_result(response)`, or an inline `ToolResult(...)` when the tool needs to
+  thread request-driven kwargs through `model_dump_json` / `model_dump` (e.g.,
+  `get_manufacturing_order` composes an `exclude={...}` selector + `exclude_none=True`
+  from its include/verbose flags; the batch branches of `check_inventory` and
+  `get_variant_details` build `ToolResult` inline to share the JSON content text with
+  their single-item Prefab path). The `meta=UI_META` registration flag advertises that a
+  tool *may* emit a Prefab card — not that it always does (e.g., `check_inventory` and
+  `get_variant_details` attach Prefab on single-item requests and return a payload dict
+  on batches). Promising a card via `meta=UI_META` and then never emitting one crashes
+  MCP-Apps hosts looking for the widget they were advertised; the contract is the
+  runtime payload, not the registration flag.
 - **Cache migration is a multi-week lift on the most-used cache entity.** Variants are
   touched by every `search_items` / `get_variant_details` / `check_inventory` call.
   Migration risk is real — incremental rollout per entity with fallback to legacy until