Skip to content

Commit 9871f43

Browse files
committed
feat: add sqlite soak verification gate
1 parent 66e9767 commit 9871f43

9 files changed

Lines changed: 518 additions & 50 deletions

docs/diataxis/en/explanation/development-progress-dashboard.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,26 @@ The next gap is narrower now:
7373
- keep the new render substrate honest under real browser/Tauri runtime verification,
7474
- preserve a clean downgrade/materialization boundary for later Godot reuse.
7575

76+
## 2026-05-27 Phase-1 SQLite Soak Gate Baseline
77+
78+
- The embedded `graphdb/sqlite` baseline no longer stops at restart continuity plus workload-envelope proof.
79+
- Current HEAD now also carries a dedicated host-level soak/performance verifier:
80+
- `npm run verify:foundation:sqlite-runtime:soak`
81+
- emits structured JSON reports under `output/verification/foundation-sqlite-runtime/`,
82+
- keeps release-grade evidence separate from the lighter `smoke` / `medium` / `heavy` matrix path.
83+
84+
What this newly proves:
85+
86+
- repeated restart cycles on both `dist` runtime and packaged sidecar paths,
87+
- structured startup / ingest / readiness / diagnostics / query duration summaries,
88+
- threshold-gated p95 / max latency checks for the sqlite baseline.
89+
90+
What it still does not prove:
91+
92+
- long-horizon multi-host evidence,
93+
- calibrated final release thresholds across heterogeneous machines,
94+
- a declaration that A8 is fully production-closed.
95+
7696
Code-vs-plan reality for this slice:
7797

7898
| Area | Prior expectation | Current HEAD reality | Status |

docs/diataxis/zh/explanation/development-progress-dashboard.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,26 @@ Tauri-first reply rendering 基线已交付:
7373
- 让新的共享 render substrate 在真实 browser/Tauri 运行时下持续经受验证,
7474
- 为后续 Godot 降级 / 物化保留干净边界,而不是重新把 Godot 约束塞回当前 Tauri UX。
7575

76+
## 2026-05-27 Phase-1 SQLite Soak Gate 基线
77+
78+
- embedded `graphdb/sqlite` 基线现在不再只停留在“重启连续性 + workload envelope”证明。
79+
- 当前 HEAD 还新增了一条专用的主机级 soak/performance verifier:
80+
- `npm run verify:foundation:sqlite-runtime:soak`
81+
- 会把结构化 JSON 报告写入 `output/verification/foundation-sqlite-runtime/`
82+
- 并把发布级证据与较轻量的 `smoke` / `medium` / `heavy` 矩阵验证分开。
83+
84+
这条新门禁当前能够证明:
85+
86+
- `dist` runtime 与 packaged sidecar 两条路径上的重复重启周期,
87+
- startup / ingest / readiness / diagnostics / query 五类时延统计摘要,
88+
- 面向 sqlite 基线的 p95 / max latency 阈值门禁。
89+
90+
这条门禁当前仍不能单独证明:
91+
92+
- 跨多宿主、长时间窗口的稳定证据,
93+
- 面向异构机器的最终发布阈值校准,
94+
- A8 已经完全达到 production-closed。
95+
7696
这条切片的代码 / 方案现实矩阵:
7797

7898
| 区域 | 先前期望 | 当前 HEAD 现实 | 状态 |

docs/en/implementation_plan.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,8 @@ Bring the implementation plan back in line with `main` reality at three levels:
3030
- avoid reintroducing transition-only assumptions into CI gates,
3131
- keep fixrisk closure criteria tied to repo-controlled reality.
3232
2. **P1: release-grade graph/store hardening**
33-
- extend the current operational `graphdb/sqlite` baseline from restart/workload proof into soak and performance closure.
33+
- extend the current operational `graphdb/sqlite` baseline from restart/workload proof into soak and performance closure,
34+
- keep a dedicated host-level soak gate (`verify:foundation:sqlite-runtime:soak`) so release-grade evidence is not diluted into the lighter matrix proof.
3435
3. **P2: release-grade ANN calibration**
3536
- keep the `external_http` connector green under workload proof, then close recall/latency threshold calibration.
3637
4. **P3: Tauri-first reply/render surface expansion**

docs/en/task.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,7 @@ Primary references:
2323
- [x] Agent-workspace browser/runtime/Tauri verification closure is real and repeatable on the current branch.
2424
- [x] Phase-3 tutor telemetry, tutor trace/provider trend diagnostics, conversation memory, and memory-policy diagnostics now have concrete backend implementations.
2525
- [~] Phase-1 A8 has advanced to an embedded `graphdb/sqlite` operational baseline and now has restart-durability proof, host-level dist/runtime + packaged sidecar proof, and a host-level workload matrix across `smoke` / `medium` / `heavy`; soak / longer-duration / performance hardening still remain before production closure.
26+
- [~] Phase-1 A8 now also has a dedicated host-level soak/performance verifier path (`verify:foundation:sqlite-runtime:soak`) with structured report output, but release-grade closure still requires sustained threshold tuning and repeated host evidence rather than one passing command.
2627
- [~] Phase-1 A9 now has a live `external_http` sync-backed connector baseline under real query traffic, host-level dist/runtime + packaged sidecar proof, and a host-level workload matrix across `smoke` / `medium` / `heavy`, but benchmark-backed rollout thresholds and release-grade calibration still remain before production closure.
2728
- [x] `KnowledgeLearningPlatform.ts` no longer uses placeholder-backed runtime surfaces for query comparison, staleness, learning-quality, and session-plan-quality diagnostics.
2829
- [x] Server bootstrap now injects an active local `tutorAdapter`; the remaining tutor gap is production-proven multi-provider routing rather than default activation.
@@ -38,6 +39,7 @@ Primary references:
3839
- [x] Runtime path adaptation has been integrated for sidecar and frontend data roots.
3940
- [x] Worker runtime resolution has been stabilized for packaged sidecar scenarios.
4041
- [ ] Soak / longer-duration / performance hardening for the embedded graph backend baseline remain pending after the new packaged/runtime and workload-matrix proofs.
42+
- [ ] Promote the new sqlite soak verifier from initial host-level gate to sustained release evidence with repeated runs and tuned thresholds.
4143
- [ ] Production ANN connector threshold convergence and release-grade calibration remain pending after the new host-level runtime and workload-matrix proofs.
4244
- [ ] Phase-2 quality/query/session diagnostics now need release-grade calibration on top of a release-grade graphdb/ANN baseline.
4345
- [ ] Tutor routing now needs multi-provider hardening beyond the active local-first adapter path.
@@ -59,6 +61,8 @@ Primary references:
5961
- Same orchestration path, but also restores transient tracked `src-tauri/bin/server-*` dirtiness introduced by the current verification run so the worktree can be kept clean.
6062
- `npm run verify:foundation:sqlite-runtime:matrix`
6163
- Highest-value host/runtime proof for the embedded sqlite graph backend across `smoke` / `medium` / `heavy` workloads.
64+
- `npm run verify:foundation:sqlite-runtime:soak`
65+
- Dedicated P1 host/runtime soak and performance gate for the embedded sqlite graph backend. Writes structured JSON reports under `output/verification/foundation-sqlite-runtime/`.
6266
- `npm run verify:foundation:ann-runtime:matrix`
6367
- Highest-value host/runtime proof for the `external_http` ANN connector across `smoke` / `medium` / `heavy` workloads.
6468
- `npm run verify:agent-workspace:browser`

docs/zh/implementation_plan.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,8 @@
3131
- 不再让过渡期假设重新进入 CI 门禁,
3232
- fixrisk 的闭环标准始终绑定仓库可控现实。
3333
2. **P1:release-grade graph/store 加固**
34-
- 把当前 `graphdb/sqlite` operational baseline 从重启/工作负载证明继续推进到 soak 与性能闭环。
34+
- 把当前 `graphdb/sqlite` operational baseline 从重启/工作负载证明继续推进到 soak 与性能闭环,
35+
- 保留独立的主机级 soak 门禁(`verify:foundation:sqlite-runtime:soak`),避免把发布级证据稀释到较轻量的矩阵验证里。
3536
3. **P2:release-grade ANN 校准**
3637
- 保持 `external_http` connector 在 workload proof 下稳定,再收口 recall/latency 阈值校准。
3738
4. **P3:Tauri-first reply/render surface 扩展**

docs/zh/task.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@
1919
- [x] agent-workspace 的 browser/runtime/Tauri 验证闭环已经是真实可重复的。
2020
- [x] Phase-3 中的 tutor telemetry、tutor trace/provider trend、conversation memory、memory-policy diagnostics 已有具体后端实现。
2121
- [~] Phase-1 A8 已推进到 embedded `graphdb/sqlite` operational baseline,并已具备 shutdown/fresh restart 的重启耐久性证明、主机级 dist/runtime + packaged sidecar 证明,以及覆盖 `smoke` / `medium` / `heavy` 的主机级 workload matrix;但 soak、长时段与性能级加固仍未达到生产闭环。
22+
- [~] Phase-1 A8 现在也新增了独立的主机级 soak/performance verifier(`verify:foundation:sqlite-runtime:soak`)并输出结构化报告,但发布级闭环仍需要持续阈值校准与多轮主机证据,而不是单次通过命令就结束。
2223
- [~] Phase-1 A9 现已具备 live `external_http` sync-backed connector baseline,并在真实 query 流量下得到证明;主机级 dist/runtime + packaged sidecar 证明和 `smoke` / `medium` / `heavy` workload matrix 也已具备;但 benchmark rollout 阈值与发布级校准仍未完成。
2324
- [x] `KnowledgeLearningPlatform.ts` 中 query compare / staleness / learning-quality / session-plan-quality 运行面已不再返回 placeholder。
2425
- [x] `server.ts` 已注入激活态本地 `tutorAdapter`;剩余导师缺口已不再是默认激活,而是生产级多 provider 路由。
@@ -34,6 +35,7 @@
3435
- [x] Sidecar 与前端数据根路径的运行时适配已集成。
3536
- [x] 打包 Sidecar 场景下的 Worker 路径解析已稳定。
3637
- [ ] embedded graph backend 基线在新的 packaged/runtime 与主机级 workload matrix 证明之外,仍待补齐 soak、长时段与性能级加固。
38+
- [ ] 把新的 sqlite soak verifier 从“初始主机门禁”继续推进为“可持续的发布级证据”,补齐多轮运行与阈值调优。
3739
- [ ] 生产级 ANN connector 在新的主机级 runtime 与 workload matrix 证明之外,仍待补齐阈值收敛与发布级校准。
3840
- [ ] query/quality/session 运行面虽已真实接通,但仍需在发布级 graphdb/ANN 基线上完成发布级校准。
3941
- [ ] tutor 运行路径接下来要从 local-first 激活态 adapter 扩展为生产级多 provider 路由。
@@ -55,6 +57,8 @@
5557
- 与上述统一编排相同,但会额外回滚本次验证新引入的受跟踪 `src-tauri/bin/server-*` 脏改动,用于保持工作区 clean。
5658
- `npm run verify:foundation:sqlite-runtime:matrix`
5759
- 当前 embedded sqlite 图后端最有价值的主机/runtime 证明,覆盖 `smoke` / `medium` / `heavy` 三档 workload。
60+
- `npm run verify:foundation:sqlite-runtime:soak`
61+
- 面向 P1 的专用 embedded sqlite 主机/runtime soak 与性能门禁,会把结构化 JSON 报告写到 `output/verification/foundation-sqlite-runtime/`
5862
- `npm run verify:foundation:ann-runtime:matrix`
5963
- 当前 `external_http` ANN connector 最有价值的主机/runtime 证明,覆盖 `smoke` / `medium` / `heavy` 三档 workload。
6064
- `npm run verify:agent-workspace:browser`

package.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -97,6 +97,7 @@
9797
"verify:foundation:sqlite-runtime": "npm run build && node scripts/ensure-sidecar-ready.js && node scripts/verify-foundation-sqlite-runtime.js",
9898
"verify:foundation:sqlite-runtime:heavy": "npm run build && node scripts/ensure-sidecar-ready.js && node scripts/verify-foundation-sqlite-runtime.js --heavy",
9999
"verify:foundation:sqlite-runtime:matrix": "npm run build && node scripts/ensure-sidecar-ready.js && node scripts/verify-foundation-sqlite-runtime.js --matrix",
100+
"verify:foundation:sqlite-runtime:soak": "npm run build && node scripts/ensure-sidecar-ready.js && node scripts/verify-foundation-sqlite-runtime.js --soak",
100101
"verify:core-real-machine": "node scripts/verify-core-real-machine-tests.js",
101102
"verify:core-real-machine:clean": "node scripts/verify-core-real-machine-tests.js --restore-sidecar-binaries",
102103
"verify:agent-workspace:browser": "node scripts/verify-agent-workspace-browser.js",

0 commit comments

Comments
 (0)