Commit 306cebf

feat(runbook): surface ann calibration gate in workspace
1 parent 516ca4c commit 306cebf

15 files changed

Lines changed: 199 additions & 8 deletions

docs/diataxis/en/explanation/development-progress-dashboard.md

Lines changed: 1 addition & 1 deletion
@@ -23,7 +23,7 @@ It tracks what is already implemented, where the hard gaps remain, and how to ve
 - runtime capability/runbook governance now includes explicit ANN remote index-sync health (`query_vector_acceleration_index_sync_health`) in addition to prefilter, health, traceability, and circuit checks,
 - runtime capability governance now also includes explicit gate `query_vector_acceleration_calibration_readiness`, which formalizes whether the ANN path is even ready for release-grade threshold tuning,
 - `server.ts` now closes the corresponding operator loop: the index-sync gate participates in verification escalation, remediation action-queue generation, and per-check runbook history summaries,
-- the agent workspace runtime-runbook surfaces now render operator-facing ANN governance directly in the frontend shell: verify/checks now expose sync-health plus circuit-budget, traceability, and prefilter summaries, and they now also show threshold/signal drilldowns plus calibration-readiness state needed for budget tuning work, while action-queue keeps the index-sync incident drilldown,
+- the agent workspace runtime-runbook surfaces now render operator-facing ANN governance directly in the frontend shell: verify/checks now expose sync-health plus circuit-budget, traceability, and prefilter summaries, and they now also show threshold/signal drilldowns plus calibration-readiness state and the explicit calibration gate needed for budget tuning work, while action-queue keeps the index-sync incident drilldown,
 - the modular `src/routes/knowledge.ts` runtime-runbook surfaces now delegate to live server-side runbook ops with full query-parameter passthrough, so browser/runtime consumers no longer hit the old KLP placeholder payloads for verify/history/checks/action-queue/remediation/schedule flows,
 - the browser strict smoke gate now also proves those ANN runbook surfaces from real browser evidence: verify-card ANN sync/circuit/traceability/prefilter content plus threshold/signal and calibration-readiness labels, checks-card first-check ANN sync plus circuit/traceability/prefilter snapshots, and action-queue index-sync drilldown are now asserted end to end instead of remaining component-test-only,
 - locale governance for the agent workspace is now tighter on both static and runtime surfaces: bilingual locale bundles now cover the query/quality/runbook cards exercised by strict browser smoke, `src/agent_workspace.locale.contract.test.ts` blocks source-referenced `agentWorkspace.*` key drift, and startup-time translate helpers no longer emit false missing-key warnings before locale initialization finishes,
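The last bullet in this hunk says the startup-time translate helpers no longer emit false missing-key warnings before locale initialization finishes. A minimal sketch of that deferral pattern follows. `createDeferredTranslator`, `markLocaleReady`, and the fallback argument are hypothetical names chosen for illustration, and the only thing assumed about the i18n object is that it exposes a `t(key)` method.

```javascript
// Hypothetical sketch: these names are illustrative, not identifiers
// taken from the repository.
function createDeferredTranslator(i18n) {
  let localeReady = false;

  return {
    // Call once the locale bundles have finished loading.
    markLocaleReady() {
      localeReady = true;
    },

    // Before locale init, return the fallback without touching i18n.t(),
    // so no missing-key warning can fire for a key that is merely not
    // loaded yet.
    translate(key, fallback = key) {
      if (!localeReady) {
        return fallback;
      }
      return i18n.t(key);
    },
  };
}
```

With this shape, any lookup made during startup resolves to its fallback text, and the underlying translate call is only reached after `markLocaleReady()` has run.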

docs/diataxis/zh/explanation/development-progress-dashboard.md

Lines changed: 1 addition & 1 deletion
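@@ -13,7 +13,7 @@
 - runtime capability / runbook governance has also added an explicit ANN remote index-sync health check (`query_vector_acceleration_index_sync_health`), alongside prefilter, health, traceability, and circuit,
 - runtime capability governance now also adds explicit gate `query_vector_acceleration_calibration_readiness`, which formally answers whether the current ANN path meets the prerequisites for release-grade threshold calibration,
 - `server.ts` now closes the corresponding operator loop: the sync-health gate participates in verification escalation, the remediation action queue, and the per-check runbook history summary,
-- the agent workspace runtime-runbook frontend surfaces now push operator-facing ANN governance directly into the shell: verify/checks show not only sync-health but also circuit-budget, traceability, and prefilter summaries, plus the threshold/signal drilldowns used for calibration and the calibration-readiness state; action-queue continues to carry the index-sync incident drilldown,
+- the agent workspace runtime-runbook frontend surfaces now push operator-facing ANN governance directly into the shell: verify/checks show not only sync-health but also circuit-budget, traceability, and prefilter summaries, plus the threshold/signal drilldowns used for calibration, the calibration-readiness state, and the explicit calibration gate; action-queue continues to carry the index-sync incident drilldown,
 - the runtime-runbook route surfaces of the modular `src/routes/knowledge.ts` now also delegate to live server-side runbook ops and pass query parameters through in full, so browser/runtime consumers no longer hit the old KLP placeholder verify/history/checks/action-queue/remediation/schedule responses,
 - browser strict smoke now also proves these ANN runbook surfaces with real browser evidence: the verify card's ANN sync/circuit/traceability/prefilter content plus threshold/signal/calibration-readiness labels, the checks card's first-check ANN sync plus circuit/traceability/prefilter snapshots, and the action-queue index-sync drilldown are all asserted end to end rather than staying at the component-test level,
 - locale governance for the agent workspace is now stricter as well: the bilingual locale bundles now cover the query/quality/runbook card copy that strict browser smoke actually exercises, `src/agent_workspace.locale.contract.test.ts` blocks drift of source-referenced `agentWorkspace.*` keys, and startup-time `translate()` no longer emits false missing-key warnings before locale initialization finishes,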

docs/en/TEST_REPORT.md

Lines changed: 2 additions & 1 deletion
@@ -56,7 +56,8 @@
 - action-queue continues to carry the index-sync incident drilldown.
 2. `query_vector_acceleration_prefilter_effectiveness` now shares the ANN fast-lane escalation path instead of using the slower generic escalation branch.
 3. Runtime capability governance now has explicit gate `query_vector_acceleration_calibration_readiness`, which fails or warns until the ANN path has representative sync/prefilter/traceability/stability telemetry in the same runtime window.
-4. This refresh still does **not** prove release-grade Phase-2 closure:
+4. The workspace verify/checks cards now surface that calibration-readiness gate directly, instead of forcing operators to infer it only from the underlying budget signals.
+5. This refresh still does **not** prove release-grade Phase-2 closure:
 - it closes visibility and browser/runtime proof for the new ANN governance summaries,
 - it does **not** close workload/threshold calibration for those budgets.
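Item 3 in this hunk describes `query_vector_acceleration_calibration_readiness` as failing or warning until sync, prefilter, traceability, and stability telemetry are all present in the same runtime window. The sketch below is one way to picture such a gate. The signal field names and the fail/warn split (fail when nothing is present, warn when only some signals are present) are assumptions made for illustration; the diff does not show the real rule.

```javascript
// Hypothetical sketch: the signal names and the fail/warn rule are
// assumptions, not the project's actual implementation.
const REQUIRED_SIGNALS = ['sync', 'prefilter', 'traceability', 'stability'];

function evaluateCalibrationReadiness(windowSignals) {
  // A signal counts only if it is explicitly true for this runtime window.
  const missing = REQUIRED_SIGNALS.filter((name) => windowSignals[name] !== true);

  let status = 'pass';
  if (missing.length === REQUIRED_SIGNALS.length) {
    status = 'fail'; // no representative telemetry at all (assumed rule)
  } else if (missing.length > 0) {
    status = 'warn'; // partial telemetry (assumed rule)
  }

  return {
    checkId: 'query_vector_acceleration_calibration_readiness',
    status,
    missing,
  };
}
```

Returning the `missing` list alongside the status is a design choice that lets a card show the gate result together with the signals that are holding it back.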

docs/en/TODO.md

Lines changed: 1 addition & 0 deletions
@@ -15,6 +15,7 @@
 - [x] `query_vector_acceleration_prefilter_effectiveness` now shares the ANN fast-lane escalation path instead of lagging behind the other ANN governance checks.
 - [x] Agent-workspace runbook verify/checks now also surface ANN circuit budget flags and prefilter calibration-readiness cues, so budget tuning no longer depends on raw JSON inspection.
 - [x] Runtime capability matrix/runbook now has explicit gate `query_vector_acceleration_calibration_readiness` to formalize whether ANN threshold tuning can start.
+- [x] Agent-workspace runbook verify/checks now also surface the explicit `query_vector_acceleration_calibration_readiness` gate, not only the supporting budget signals.
 - [ ] Move the newly surfaced ANN governance budgets from visibility closure to workload/threshold calibration closure, then promote the new Phase-2 diagnostics to release-grade gates only after the same checks run on a release-grade graphdb/ANN baseline.
 - [ ] Extend tutor routing from the new local-first baseline into a production-proven multi-provider policy.
 - [ ] Continue FR-009 evidence freshness, Linux strict Tauri host provisioning, and final Electron decommission review.

docs/en/implementation_plan.md

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ Bring code truth, active progress docs, and next execution order back into align
 - runtime capability governance now treats ANN remote index sync as a first-class check: `query_vector_acceleration_index_sync_health` is emitted in the matrix/runbook alongside health, traceability, prefilter, and circuit checks.
 - `server.ts` now closes the runbook loop for that new gate: ANN index-sync health is included in verification escalation, remediation action-queue generation, and per-check history summaries.
 - runtime capability governance now also has an explicit ANN calibration prerequisite gate: `query_vector_acceleration_calibration_readiness` blocks release-grade threshold tuning until sync telemetry, stable connector state, prefilter sample readiness, evaluable candidate ratios, and external traceability signals are all present in the same runtime window.
-- the agent workspace runtime runbook surfaces now expose ANN sync-health metrics across verify/checks/action-queue flows, and the verify/checks cards now also surface ANN circuit-budget, traceability, and prefilter summaries plus threshold/signal drilldowns and calibration-readiness state, so operator-facing governance no longer stops at `index_sync_health`.
+- the agent workspace runtime runbook surfaces now expose ANN sync-health metrics across verify/checks/action-queue flows, and the verify/checks cards now also surface ANN circuit-budget, traceability, and prefilter summaries plus threshold/signal drilldowns, calibration-readiness state, and the explicit `query_vector_acceleration_calibration_readiness` gate, so operator-facing governance no longer stops at `index_sync_health`.
 - modular knowledge-route wiring for `runtime-capability-runbook/*` is now backed by live server-side runbook ops instead of KLP placeholder payloads, and the route layer now preserves `checkId` / `sinceMinutes` / queue-filter query params rather than dropping them.
 - the real browser smoke gate now proves those verify/checks/action-queue surfaces end to end: strict browser evidence must show the ANN sync-health verify card, the new verify/checks ANN circuit/traceability/prefilter drilldowns, the first-check ANN sync metric, and the index-sync action-queue drilldown instead of only proving that the cards can open.
 - agent-workspace locale hardening now covers the currently surfaced diagnostics cards/messages: source-referenced `agentWorkspace.*` keys are guarded by `src/agent_workspace.locale.contract.test.ts`, bilingual locale bundles now back the query/quality/runbook card labels that strict browser smoke actually exercises, and startup-time translate helpers defer `window.i18n.t()` until locale init to avoid false missing-key warnings before locales hydrate.
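The route-layer bullet in this hunk says `runtime-capability-runbook/*` requests now preserve `checkId` / `sinceMinutes` / queue-filter query params rather than dropping them. The following sketch shows the general passthrough idea using the standard `URL` and `URLSearchParams` APIs. `buildRunbookOpRequest`, the returned shape, and the placeholder base origin are hypothetical; the real wiring in `src/routes/knowledge.ts` is not shown in this commit.

```javascript
// Hypothetical sketch: the helper name and return shape are assumptions.
function buildRunbookOpRequest(requestUrl) {
  // A base origin is needed only so a relative path can be parsed.
  const url = new URL(requestUrl, 'http://localhost');

  // Copy every query parameter instead of picking a fixed subset, so new
  // filters reach the server-side op without route changes.
  const params = {};
  for (const [key, value] of url.searchParams) {
    params[key] = value;
  }

  // The last path segment names the runbook op (verify, checks, ...).
  const op = url.pathname.split('/').pop();
  return { op, params };
}
```

For a request such as `/runtime-capability-runbook/checks?checkId=query_vector_acceleration_index_sync_health&sinceMinutes=30`, this yields the op `checks` with both parameters intact as strings.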

docs/zh/TEST_REPORT.md

Lines changed: 2 additions & 1 deletion
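@@ -56,7 +56,8 @@
 - action-queue continues to carry the index-sync incident drilldown.
 2. `query_vector_acceleration_prefilter_effectiveness` has now joined the ANN fast escalation path and no longer uses the slower generic escalation branch.
 3. runtime capability governance now has explicit gate `query_vector_acceleration_calibration_readiness`, which reports fail/warn directly while the ANN path has not yet produced representative sync/prefilter/traceability/stability telemetry in the same window.
-4. This refresh still does **not** amount to release-grade Phase-2 closure:
+4. The workspace verify/checks cards now also show this calibration-readiness gate directly, instead of requiring operators to infer it on their own from the underlying budget signals.
+5. This refresh still does **not** amount to release-grade Phase-2 closure:
 - it closes visibility and browser/runtime proof for the new ANN governance summaries,
 - it does **not** close workload/threshold calibration for those budgets.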

docs/zh/TODO.md

Lines changed: 1 addition & 0 deletions
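@@ -8,6 +8,7 @@
 - [x] `query_vector_acceleration_prefilter_effectiveness` has now joined the ANN fast escalation path and no longer lags behind the other ANN governance checks.
 - [x] The agent-workspace runbook verify/checks now further expose ANN circuit budget flags and the prefilter calibration-readiness state, so budget tuning no longer depends on manually reading raw JSON.
 - [x] The runtime capability matrix/runbook now has explicit gate `query_vector_acceleration_calibration_readiness`, which formally constrains when ANN threshold calibration can start.
+- [x] The agent-workspace runbook verify/checks now also push the explicit gate `query_vector_acceleration_calibration_readiness` to the frontend, rather than showing only the budget signals that support it.
 - [ ] First move the newly exposed ANN governance budgets from "visible" to "calibratable", then, once the same checks run on a release-grade graphdb/ANN baseline, promote the new Phase-2 diagnostic surfaces to release-grade gates.

 - [x] The agent-workspace browser/runtime/Tauri verification loop is now real.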

docs/zh/implementation_plan.md

Lines changed: 1 addition & 1 deletion
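@@ -20,7 +20,7 @@
 - runtime capability governance now also treats ANN remote index sync as a first-class check: the matrix/runbook has added `query_vector_acceleration_index_sync_health`, at the same level as health, traceability, prefilter, and circuit.
 - `server.ts` has now wired this new gate into the full runbook loop: ANN index-sync health participates in verification escalation, the remediation action queue, and the per-check history summary.
 - runtime capability governance now also has an explicit ANN calibration prerequisite gate: `query_vector_acceleration_calibration_readiness` blocks release-grade threshold calibration when sync telemetry, a stable connector, prefilter sample readiness, an evaluable candidate ratio, or external traceability signals are missing within the same runtime window.
-- the agent workspace runtime runbook UI now shows ANN sync-health metrics across the verify/checks/action-queue flows, and the verify/checks cards further surface ANN circuit-budget, traceability, and prefilter summaries plus threshold/signal drilldowns and the calibration-readiness state, so the operator-facing ANN governance view no longer stops at `index_sync_health`
+- the agent workspace runtime runbook UI now shows ANN sync-health metrics across the verify/checks/action-queue flows, and the verify/checks cards further surface ANN circuit-budget, traceability, and prefilter summaries plus threshold/signal drilldowns, the calibration-readiness state, and the explicit gate `query_vector_acceleration_calibration_readiness`, so the operator-facing ANN governance view no longer stops at `index_sync_health`
 - the `runtime-capability-runbook/*` group of modular knowledge routes is now wired to live server-side runbook ops instead of returning KLP placeholder payloads; the route layer now also preserves query parameters such as `checkId` / `sinceMinutes` / queue-filter rather than silently dropping them.
 - the real browser smoke gate now also proves these three flows end to end: strict browser evidence must show the ANN sync-health verify card, the new verify/checks ANN circuit/traceability/prefilter drilldowns, the first-check ANN sync metric, and the index-sync action-queue drilldown, rather than only proving that the cards "can open".
 - agent-workspace locale hardening now also covers the diagnostics card/message space that is actually exposed today: `agentWorkspace.*` keys referenced in source are gated by `src/agent_workspace.locale.contract.test.ts`, the bilingual locale bundles now back the query/quality/runbook card labels that strict browser smoke actually exercises, and startup-time `translate()` waits for locale initialization to finish before calling `window.i18n.t()`, avoiding false missing-key warnings before locales hydrate.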

scripts/verify-agent-workspace-browser.js

Lines changed: 6 additions & 0 deletions
@@ -2342,6 +2342,9 @@ async function verifyAgentWorkspaceBrowser(options = {}) {
   if (!report.browserChecks.runtimeRunbookVerifyCardTextZh.includes('ANN 预筛选校准')) {
     failures.push(`runtimeRunbookVerifyCardTextZh='${report.browserChecks.runtimeRunbookVerifyCardTextZh}'`);
   }
+  if (!report.browserChecks.runtimeRunbookVerifyCardTextZh.includes('ANN 校准就绪态')) {
+    failures.push(`runtimeRunbookVerifyCardTextZh='${report.browserChecks.runtimeRunbookVerifyCardTextZh}'`);
+  }
   if (!report.browserChecks.runtimeRunbookVerifyCardTextZh.includes('query_vector_acceleration_index_sync_health')) {
     failures.push(`runtimeRunbookVerifyCardTextZh='${report.browserChecks.runtimeRunbookVerifyCardTextZh}'`);
   }
@@ -2378,6 +2381,9 @@ async function verifyAgentWorkspaceBrowser(options = {}) {
   if (!report.browserChecks.runtimeRunbookChecksCardTextZh.includes('ANN 预筛选校准快照')) {
     failures.push(`runtimeRunbookChecksCardTextZh='${report.browserChecks.runtimeRunbookChecksCardTextZh}'`);
   }
+  if (!report.browserChecks.runtimeRunbookChecksCardTextZh.includes('ANN 校准就绪态快照')) {
+    failures.push(`runtimeRunbookChecksCardTextZh='${report.browserChecks.runtimeRunbookChecksCardTextZh}'`);
+  }
   if (!report.browserChecks.runtimeRunbookChecksCardTextZh.includes('query_vector_acceleration_index_sync_health')) {
     failures.push(`runtimeRunbookChecksCardTextZh='${report.browserChecks.runtimeRunbookChecksCardTextZh}'`);
   }
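Both hunks in this file add one more `includes(...)` check following the same pattern: test a captured card text for a required label and, on a miss, push the whole text into `failures`. As an illustration only, that pattern could be expressed as a small table-driven helper. `collectMissingLabelFailures` is a hypothetical name, and unlike the original checks this sketch also records which label was missing.

```javascript
// Hypothetical sketch: not part of scripts/verify-agent-workspace-browser.js.
function collectMissingLabelFailures(browserChecks, field, requiredLabels) {
  // Treat an absent field as empty text so every label reports as missing.
  const text = String(browserChecks[field] ?? '');
  const failures = [];

  for (const label of requiredLabels) {
    if (!text.includes(label)) {
      failures.push(`${field}='${text}' (missing '${label}')`);
    }
  }
  return failures;
}

// Usage with the labels asserted for the verify card in the hunks above.
const exampleFailures = collectMissingLabelFailures(
  { runtimeRunbookVerifyCardTextZh: 'ANN 预筛选校准 ANN 校准就绪态' },
  'runtimeRunbookVerifyCardTextZh',
  ['ANN 预筛选校准', 'ANN 校准就绪态', 'query_vector_acceleration_index_sync_health'],
);
console.log(exampleFailures.length);
```

Naming the missing label in the failure message makes a failing smoke run easier to triage, since the original messages are identical for every label checked against the same card.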
