You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/diataxis/en/explanation/development-progress-dashboard.md
+5-2Lines changed: 5 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,6 +19,7 @@ It tracks what is already implemented, where the hard gaps remain, and how to ve
19
19
- What is real at HEAD:
20
20
- graph/store operations semantics exist in `src/learning/store.ts`, including file-backed ops, embedded SQLite graphdb persistence/query paths, and HTTP adapter paths with fallback diagnostics,
21
21
- the embedded sqlite baseline now also has restart-durability proof: shutdown closes the store cleanly, the adapter can reopen safely, and server integration covers ingest -> shutdown -> fresh module reload -> diagnostics/query/readiness continuity,
22
+
- a new host-level verifier now exercises that same embedded sqlite baseline through both `dist` runtime and packaged sidecar flows on the current Windows host: ingest -> store diagnostics/foundation readiness -> restart -> query continuity (`scripts/verify-foundation-sqlite-runtime.js`),
22
23
- ANN-style prefilter, representation telemetry, circuit health, remote index sync, and live `external_http` connector proof now exist in `src/learning/queryBackend.ts` and `src/learning/vectorAccelerationAdapter.ts`,
23
24
- runtime capability/runbook governance now includes explicit ANN remote index-sync health (`query_vector_acceleration_index_sync_health`) in addition to prefilter, health, traceability, and circuit checks,
24
25
- runtime capability governance now also includes explicit gate `query_vector_acceleration_calibration_readiness`, which formalizes whether the ANN path is even ready for release-grade threshold tuning,
@@ -30,12 +31,12 @@ It tracks what is already implemented, where the hard gaps remain, and how to ve
30
31
- Phase-2 runtime diagnostics are now materially implemented in `src/learning/KnowledgeLearningPlatform.ts` for query-backend comparison/history/trend, knowledge staleness diagnostics/rebuild planning, learning-quality history/trend, session-plan quality evaluation/history/trend/runtime-threshold diagnostics, query-backend config, and query-backend diagnostics,
31
32
- Phase-3 tutor/memory diagnostics remain real and now include an active default runtime tutor adapter path in `src/server.ts`, so normal server execution can emit adapter telemetry instead of staying catalog-only.
32
33
- What is not closed yet:
33
-
- Phase-1 A8 has advanced beyond a file-only default: `src/server.ts` now defaults to `graphdb/sqlite` with explicit file fallback, and restart durability is already proved, but packaged/runtime proof and heavier-workload hardening are still open before calling the local graph backend production-closed,
34
+
- Phase-1 A8 has advanced beyond a file-only default: `src/server.ts` now defaults to `graphdb/sqlite` with explicit file fallback, restart durability is already proved, and host-level dist/runtime + packaged sidecar proof is now in place, but heavier-workload hardening is still open before calling the local graph backend production-closed,
34
35
- Phase-1 A9 is now operational rather than scaffold-only, but recall/latency calibration and larger-workload validation are still open before calling the ANN layer production-closed,
35
36
- Phase-2 quality/session/query observability is now real, but it is not yet release-closed because these gates still require release-grade calibration on top of the current graph/ANN operational baseline; the new ANN calibration-readiness gate only formalizes prerequisites, not closure,
36
37
- default tutor routing is no longer catalog-only, but the runtime is still effectively `local`-first and retains explicit rule-engine fallback rather than a production-proven multi-provider routing policy.
37
38
- Active execution focus therefore shifts to truth-first foundation recovery:
38
-
- finish the remaining packaged/runtime + heavier-workload closure for the embedded graph backend baseline,
39
+
- finish the remaining heavier-workload closure for the embedded graph backend baseline while keeping the new dist/runtime + packaged sidecar proof green,
39
40
- finish the remaining workload/threshold closure for the now-live ANN connector baseline,
40
41
- move the newly surfaced ANN runbook visibility from operator-readable summaries to workload-calibrated release gates,
41
42
- keep the new diagnostic surfaces honest against the same runtime truth,
@@ -133,7 +134,9 @@ Current branch status for this slice:
133
134
## Latest Validation Snapshot (2026-05-14)
134
135
135
136
- Reconfirmed on the current Windows host in this turn: `node node_modules/jest/bin/jest.js src/learning/runtimeCapability.test.ts src/knowledge.api.contract.test.ts --runInBand --no-cache`, `node node_modules/jest/bin/jest.js src/agent_workspace.frontend.test.ts --runInBand --no-cache`, `npm run test:agent-workspace:contracts`, `npm run build:with-vite`, `npm run docs:diataxis:check`, `npm run docs:site:build`, `NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_DYNAMIC_STRICT=1 node scripts/verify-agent-workspace-browser.js`.
137
+
- Reconfirmed on the current Windows host in this turn: `npm run build:sidecar`, `npm run verify:foundation:sqlite-runtime`.
136
138
- The strict browser proof now explicitly verifies the bilingual runtime-runbook verify/checks ANN governance labels that were added in this slice: sync-health plus circuit, traceability, and prefilter summaries, along with the threshold/signal drilldowns and calibration-readiness cues that support budget-tuning work.
139
+
- The embedded sqlite graph baseline now also has a repeatable host-level runtime proof outside Jest integration scope: both `dist` runtime and packaged sidecar flows preserve ingest -> diagnostics/readiness -> restart -> query continuity on the same runtime data directory.
137
140
- Tauri strict evidence is implementation-closed but still host-dependent:
138
141
- the current Windows host proves non-strict tauri/runtime behavior and load-flow parity,
139
142
- Linux strict evidence commands (`verify:agent-workspace:tauri:rust:strict`, `verify:agent-workspace:tauri:window-evidence:strict`, strict evidence index/manifest) still require provisioned `webkit2gtk-4.1`, `javascriptcoregtk-4.1`, and `libsoup-3.0`.
3. 把当前已前推到 runbook 卡片中的 ANN 指标可见性,继续推进为带工作负载校准的发布级门禁,
31
32
4. 让这批新诊断面始终与同一份运行时真相保持一致,
@@ -145,7 +146,9 @@
145
146
## 最新验证快照(2026-05-14)
146
147
147
148
- 本轮已在当前 Windows 宿主重新确认通过:`node node_modules/jest/bin/jest.js src/learning/runtimeCapability.test.ts src/knowledge.api.contract.test.ts --runInBand --no-cache`、`node node_modules/jest/bin/jest.js src/agent_workspace.frontend.test.ts --runInBand --no-cache`、`npm run test:agent-workspace:contracts`、`npm run build:with-vite`、`npm run docs:diataxis:check`、`npm run docs:site:build`、`NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_DYNAMIC_STRICT=1 node scripts/verify-agent-workspace-browser.js`。
149
+
- 本轮已在当前 Windows 宿主重新确认通过:`npm run build:sidecar`、`npm run verify:foundation:sqlite-runtime`。
148
150
- 严格浏览器证据现在已显式校验本轮新增的双语 runtime-runbook verify/checks ANN 治理标签:不仅验证 sync-health,也验证熔断、可追踪性、预筛选摘要,以及支撑校准工作的阈值/信号钻取和校准就绪态。
Copy file name to clipboardExpand all lines: docs/en/TEST_REPORT.md
+14-6Lines changed: 14 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -41,6 +41,10 @@
41
41
- PASS
42
42
-[x]`npm run build:with-vite`
43
43
- PASS
44
+
-[x]`npm run build:sidecar`
45
+
- PASS
46
+
-[x]`npm run verify:foundation:sqlite-runtime`
47
+
- PASS
44
48
-[x]`npm run docs:diataxis:check`
45
49
- PASS
46
50
-[x]`npm run docs:site:build`
@@ -50,14 +54,18 @@
50
54
51
55
### What This Refresh Adds
52
56
53
-
1. The Phase-2 ANN governance slice is now operator-visible through the frontend runbook shell, not only backend JSON:
57
+
1. The embedded `graphdb/sqlite` baseline now also has repeatable host-level runtime proof on the current Windows machine:
58
+
-`npm run verify:foundation:sqlite-runtime` covers both `dist` runtime and packaged sidecar flows,
59
+
- both modes now prove ingest -> store diagnostics/foundation readiness -> restart -> query continuity on the same runtime data directory,
60
+
- A8 is therefore no longer blocked on missing packaged/runtime evidence; the remaining honest gap is heavier-workload hardening.
61
+
2. The Phase-2 ANN governance slice is now operator-visible through the frontend runbook shell, not only backend JSON:
54
62
- verify/checks now surface ANN sync-health, circuit-budget, traceability, and prefilter summaries plus threshold/signal drilldowns,
55
63
- they now also surface ANN circuit budget flags and prefilter calibration-readiness cues,
56
64
- action-queue continues to carry the index-sync incident drilldown.
57
-
2.`query_vector_acceleration_prefilter_effectiveness` now shares the ANN fast-lane escalation path instead of using the slower generic escalation branch.
58
-
3. Runtime capability governance now has explicit gate `query_vector_acceleration_calibration_readiness`, which fails or warns until the ANN path has representative sync/prefilter/traceability/stability telemetry in the same runtime window.
59
-
4. The workspace verify/checks cards now surface that calibration-readiness gate directly, instead of forcing operators to infer it only from the underlying budget signals.
60
-
5. This refresh still does **not** prove release-grade Phase-2 closure:
65
+
3.`query_vector_acceleration_prefilter_effectiveness` now shares the ANN fast-lane escalation path instead of using the slower generic escalation branch.
66
+
4. Runtime capability governance now has explicit gate `query_vector_acceleration_calibration_readiness`, which fails or warns until the ANN path has representative sync/prefilter/traceability/stability telemetry in the same runtime window.
67
+
5. The workspace verify/checks cards now surface that calibration-readiness gate directly, instead of forcing operators to infer it only from the underlying budget signals.
68
+
6. This refresh still does **not** prove release-grade Phase-2 closure:
61
69
- it closes visibility and browser/runtime proof for the new ANN governance summaries,
62
70
- it does **not** close workload/threshold calibration for those budgets.
63
71
@@ -69,7 +77,7 @@
69
77
- conversation memory lifecycle,
70
78
- memory-policy diagnostics/history/trend.
71
79
2. These passes do **not** prove Phase-1 A8/A9 closure:
72
-
- runtime no longer defaults to `local-file-graphdb`, and restart durability for the embedded `graphdb/sqlite` baseline is now integration-proved, but packaged/runtime proof and heavier workload hardening still remain,
80
+
- runtime no longer defaults to `local-file-graphdb`, restart durability for the embedded `graphdb/sqlite` baseline is integration-proved, and host-level dist/runtime + packaged sidecar proof is now in place, but heavier workload hardening still remains,
73
81
- ANN no longer stops at query-only scaffolding: the `external_http` path now has remote index sync plus live end-to-end query proof, but workload and threshold calibration still remain before production closure.
74
82
3. These passes do **not** prove Phase-2 quality-gate closure:
75
83
- query comparison, staleness, learning-quality, session-plan-quality, and query-backend diagnostics are now implementation-real, but they still require release-grade calibration on top of the current graph/ANN operational baseline.
Copy file name to clipboardExpand all lines: docs/en/TODO.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,8 @@
9
9
-[x] Query-backend comparison/history/trend, staleness diagnostics/rebuild planning, learning-quality history/trend, session-plan quality evaluate/history/trend/runtime-threshold diagnostics, query-backend config, and query-backend diagnostics are now live in `src/learning/KnowledgeLearningPlatform.ts`.
10
10
-[x] Default server runtime now injects an active local `tutorAdapter` while retaining the `local` + `cloud` adapter catalog.
11
11
-[x] Default runtime graph backend is no longer `local-file-graphdb`: the server now targets embedded `graphdb/sqlite` with explicit file fallback.
12
-
-[~] The new embedded `graphdb/sqlite` baseline is now restart-durability-proved, but packaged/runtime proof and heavier workload gates still remain before calling A8 production-closed.
12
+
-[x] Host-level dist runtime plus packaged sidecar verification now proves the embedded `graphdb/sqlite` baseline across ingest -> store diagnostics/foundation readiness -> restart -> query continuity on the current Windows host (`npm run verify:foundation:sqlite-runtime`).
13
+
-[~] The new embedded `graphdb/sqlite` baseline is now restart-durability-proved and host-level runtime-packaging-proved, but heavier workload gates still remain before calling A8 production-closed.
13
14
-[~] Phase-1 A9 now has a live `external_http` connector baseline with remote index sync and end-to-end query proof, but recall/latency thresholds and larger-workload validation still remain before production closure.
14
15
-[x] Agent-workspace runbook verify/checks now surface ANN index-sync, circuit, traceability, and prefilter summaries plus threshold/signal drilldowns, while action-queue keeps the index-sync incident drilldown.
15
16
-[x]`query_vector_acceleration_prefilter_effectiveness` now shares the ANN fast-lane escalation path instead of lagging behind the other ANN governance checks.
@@ -38,11 +39,12 @@ After syncing the repository to the latest upstream `main` baseline, we re-audit
38
39
-[x] ANN connector hardening does expose candidate normalization, representation telemetry, and prefilter effectiveness signals.
39
40
-[x] Default runtime now targets an embedded `graphdb/sqlite` backend with explicit file fallback.
40
41
-[x] Embedded sqlite restart durability is now covered by server integration proof (`ingest -> shutdown -> fresh module restart -> query/readiness continuity`).
42
+
-[x] Host-level dist runtime + packaged sidecar verification now proves the same embedded sqlite baseline across ingest/readiness/diagnostics/restart/query continuity (`npm run verify:foundation:sqlite-runtime`).
41
43
-[x] The `external_http` ANN path now syncs a remote prefilter index and serves live query traffic under integration proof.
42
-
-[ ]Packaged/runtime proof and heavier workload hardening still remain for A8.
44
+
-[ ]Heavier workload hardening still remains for A8.
43
45
-[ ] ANN still needs workload/threshold calibration before production closure.
44
46
-[ ] Next active implementation phase is now split:
45
-
-[ ] finish the remaining A8 packaged/runtime + workload closure,
47
+
-[ ] finish the remaining A8 heavier-workload closure on top of the new dist/runtime + packaged sidecar proof,
46
48
-[ ] finish the remaining A9 workload/threshold closure on top of the live connector baseline,
47
49
-[ ] move next into Phase-2 release-grade gate hardening,
48
50
-[ ] continue Phase-3 tutor/memory hardening in parallel.
0 commit comments