Explanation: Development Progress Dashboard

This page is the implementation-facing dashboard for the Knowledge Mastery evolution plan. It tracks what is already implemented, where the hard gaps remain, and how to verify progress from code and runtime behavior.

2026-06-10 Knowledge Workspace Durable Artifact and DAG Alignment

This slice reconciles the current codebase against the earlier lightweight-RAG and agent-workspace plans, with special focus on the existing DAG-backed learning substrate.

The key result is that the current code has moved further than the old plan wording suggested:

the Knowledge Workspace is no longer just a thin scoped-chat shell,
workflow artifacts now include durable flashcard_batch and knowledge_run records,
reply rendering now has a typed structured-answer path,
workflow-artifact review follow-up is part of the runtime surface,
graph focus can render original markdown with matched-span highlighting,
a dedicated evidence pane now carries grounding inspection plus durable knowledge_run and flashcard_batch inspection,
grouped conversation knowledge points now retain relation-path and temporal-validity signals,
the composer now materializes an explicit graphContext,
that graphContext now survives through conversation trace, snapshot persistence, and workspace export,
the grounding inspector now renders that persisted graphContext as structured graph evidence,
relation aggregation now spans the whole grouped knowledge-point set instead of only the leading anchor's local hint set,
direct grouped-knowledge-point relations are now preserved as first-class graph-context data instead of only implicit edge membership,
temporal-validity aggregation now preserves temporal edge kinds/details in addition to warning text,
supersession details now also influence explanation and next-action text instead of staying limited to warning-only evidence,
and grounding inspectability is now normalized per turn so later turns without grounding payload do not keep stale evidence-pane state alive.

At the same time, the product surface is still behind the intended final behavior:

the primary answer area is now contracted to user-facing reply blocks only,
left-side knowledge hits now route directly into graph focus as file-only entries,
the existing DAG substrate is real, but answer synthesis still behaves more like evidence-grouped text RAG than graph-native answer planning.

Code-vs-plan reconciliation for this slice:

Requirement	Current implementation evidence	Progress call
Structured grounded conversation with backward compatibility	`src/learning/types.ts`, `src/learning/conversationComposer.ts`, `src/learning/KnowledgeLearningPlatform.ts`, `src/frontend/agent_workspace.js`, and `src/frontend/workspace_panes.js` now support `answer`, `assistantBlocks`, `knowledgeRun`, grouped `knowledgePoints`, citations, and legacy `assistantMessage`; the primary assistant area now limits visible blocks to user-facing reply content.	Implemented current slice
Durable learning/review artifacts	`src/workflows/WorkflowArtifactStore.ts`, `src/learning/KnowledgeLearningPlatform.ts`, and `src/routes/knowledge.ts` now support durable `flashcard_batch` / `knowledge_run` artifacts plus `/api/knowledge/workflow-artifacts` and `/api/knowledge/workflow-artifacts/review-follow-up`; `workspace_panes.js` now routes their inspection into a dedicated evidence pane.	Implemented current slice
File-first scoped knowledge hits	`workspace_panes.js` renders grouped knowledge hits by source file and routes file selection into graph focus instead of inline preview/action expansion.	Implemented current slice
Right-pane evidence reading	Graph focus reuses the shared markdown runtime and highlights matched spans in rendered source markdown.	Implemented baseline
Answer area contraction to a single targeted answer	`agent_workspace.js` now keeps the full conversation result in runtime state while only rendering user-facing answer blocks (`structured_answer`, `main_markdown`, `html_artifact`) in the main chat surface.	Implemented current slice
Hide developer-heavy evidence from the primary hit list	`workspace_panes.js` no longer renders inline knowledge previews or visible typed capability buttons in the left-side hit list; those flows now route into graph focus or the dedicated evidence pane.	Implemented current slice
Durable evidence/claim inspector	`workspace_panes.js` now exposes a dedicated evidence pane for grounding metadata, `knowledge_run`, `knowledge_run_history`, `knowledge_run_compare`, and `flashcard_batch`; `agent_workspace.js` wires the API status strip into grounding inspection and normalizes per-turn inspectability so stale grounding is cleared on later turns without evidence payloads.	Implemented broader current slice
DAG-native answer planning	`AgentConversationKnowledgePoint` now carries grouped `relationPath`, `relationKinds`, `relationPathAtomIds`, and `temporalValidity`; `conversationComposer.ts` now materializes an explicit `graphContext`; `KnowledgeLearningPlatform.ts` and `WorkspaceExportBundle.ts` now preserve it through trace, persistence, and export surfaces; `workspace_panes.js` now renders that persisted `graphContext` in the evidence pane as a structured graph explanation surface; relation and temporal aggregation now span the whole grouped knowledge-point set and preserve source-atom plus temporal-edge detail; direct grouped-knowledge-point relations are now preserved and surfaced; supersession details now also flow into explanation and next-action text.	Implemented broader partial slice

Immediate next direction from this point:

Extend the new DAG-aware conversation slice into a dedicated graph-conditioned context-assembly layer instead of relying on grouped relation hints and the current still-thin graphContext alone.
Build on the new multi-point relation/temporal aggregation by adding richer predecessor/successor/path semantics and explicit supersession handling without expanding the primary chat transcript.
Continue ownership reduction across src/server.ts, KnowledgeLearningPlatform.ts, agent_workspace.js, and workspace_panes.js.

Verification for the current code-backed alignment:

npm.cmd exec -- tsc --noEmit
node --check src/frontend/agent_workspace.js
node --check src/frontend/workspace_panes.js
npm.cmd exec -- jest src/agent_workspace.frontend.test.ts src/agent_workspace.locale.contract.test.ts --runInBand --no-cache
npm.cmd exec -- jest src/learning/conversationComposer.test.ts src/learning/KnowledgeLearningPlatform.test.ts src/learning/KnowledgeLearningPlatform.persistence.test.ts src/learning/KnowledgeLearningPlatform.program-f.test.ts src/agent_workspace.frontend.test.ts src/knowledge.api.contract.test.ts src/routes/registry.contract.test.ts src/pathbridge.handshake.contract.test.ts src/server.port.fallback.contract.test.ts src/workflows/WorkflowArtifactStore.test.ts --runInBand --no-cache
npm.cmd exec -- jest src/agent_workspace.frontend.test.ts src/agent_workspace.locale.contract.test.ts src/agent_workspace.contract.parity.test.ts src/agent_workspace.runtime.behavior.test.ts --runInBand --no-cache
npm.cmd exec -- jest src/export/WorkspaceExportBundle.test.ts --runInBand --no-cache
npm.cmd exec -- jest src/learning/conversationComposer.test.ts src/learning/KnowledgeLearningPlatform.test.ts src/export/WorkspaceExportBundle.test.ts src/learning/KnowledgeLearningPlatform.persistence.test.ts src/learning/KnowledgeLearningPlatform.program-f.test.ts --runInBand --no-cache
npm.cmd run test:agent-workspace:contracts
npm.cmd run build

2026-06-06 Knowledge Workspace Scope Switcher and Evidence-Focused Hit UI

This slice closes the frontend usability gap that made the active RAG scope hard to see from inside the Knowledge Workspace itself. The prior implementation depended on the global folder selector and request payload state, so users could ask a scoped question without a local, task-adjacent confirmation of the actual retrieval boundary. It also rendered summaries, citations, matched sections, and actions in every hit card, which pushed the question/reply area out of view after results returned and increased first-level choice density.

Code-vs-plan reconciliation for this slice:

Requirement	Current implementation evidence	Progress call
Show and switch the Knowledge Workspace scope in the workspace window	`src/frontend/index.html` now includes an in-pane `agent-workspace-scope-select`; `src/frontend/agent_workspace.js` mirrors the global `folder-select`, publishes the same active-target event, updates `localStorage.nc_last_target`, and sends `activeTarget` plus `scope` with conversation requests.	Implemented
Keep the question/reply area visible after hits return	`src/frontend/styles.css` gives chat messages a stable flex floor and caps the knowledge-hit list height, so the conversation stream and input remain part of the active workspace instead of being displaced by result cards.	Implemented
Show only interactive knowledge-point filenames at the first level	`src/frontend/workspace_panes.js` now renders each knowledge point as a file button resolved from `sourcePath`, `citation.sourcePath`, or `matchedSpans[0].sourcePath`; summaries, scores, citations, and matched section snippets are no longer first-level card content.	Implemented
Move secondary actions behind a long-press/context menu	Typed capability buttons remain backward-compatible, but are hidden in `agent-knowledge-actions-menu` until long-press, context menu, or keyboard context-menu activation opens them.	Implemented
Highlight matched passages in the right pane	Clicking the filename opens the graph-focus pane with the knowledge point title and `matchedSpans`, rendering each matched passage as highlighted evidence.	Implemented
Bilingual locale and contract compatibility	`agentWorkspace.scope.*` and new knowledge action labels are present in both frontend locale bundles, and the locale contract test verifies key coverage and placeholder parity.	Preserved

Verification for this slice:

npm.cmd exec -- jest src/agent_workspace.frontend.test.ts -t "scope selector|knowledge hits as file entries" --runInBand
npm.cmd exec -- jest src/agent_workspace.frontend.test.ts --runInBand
npm.cmd exec -- jest src/agent_workspace.locale.contract.test.ts --runInBand
node --check src/frontend/agent_workspace.js
node --check src/frontend/workspace_panes.js
npm.cmd run build

2026-06-07 Graph-Focus Source Rendering and AhaDiff Comparison

This slice corrects a remaining mismatch between the intended Knowledge Workspace interaction and the actual right-side graph-focus pane behavior. The earlier implementation opened the right pane with matchedSpans, but only rendered a compact evidence list. That meant the user could not inspect the original knowledge point in its normal rendered form and therefore could not see matched passages highlighted in-place.

Code-vs-plan reconciliation for this slice:

Requirement	Current implementation evidence	Progress call
Render the original knowledge point in the right pane	`src/frontend/workspace_panes.js` now resolves `sourcePath`, reads the original markdown through `NoteConnectionStorage.readContent()`, and renders it through the shared `NoteConnectionMarkdownRuntime.renderMarkdownInto()` path instead of falling back to a snippet-only card.	Implemented
Highlight matched passages inside the normal rendered document	The graph-focus pane now scores rendered paragraphs/blocks against `matchedSpans[].snippet` terms and applies a restrained highlight style in `src/frontend/styles.css` via `.agent-focus-match` / `data-agent-focus-highlight`.	Implemented
Preserve backward compatibility when source rendering is unavailable	The previous summary + evidence-list view remains the fallback path when source markdown, storage access, or markdown runtime rendering is unavailable.	Preserved
Keep the pane inside the same Reader-aligned rendering substrate	The focus pane reuses the same markdown runtime owner already used by the Tauri reply surface rather than creating a second markdown/mermaid/math render stack.	Implemented

Verification for this slice:

npm.cmd exec -- jest src/agent_workspace.frontend.test.ts -t "graph focus|knowledge hits as file entries" --runInBand --no-cache
npm.cmd exec -- tsc --noEmit

AhaDiff comparison and implications

The latest ref/ahadiff codebase makes three moves that are directly relevant to this project's next phase.

It treats claims, evidence, runtime validation, and review state as first-class product surfaces, not hidden backend details. Evidence in ref/ahadiff/src/ahadiff/claims/verify.py is verified against concrete file/hunk anchors instead of being treated as a loose text summary problem. Viewer-side API boundaries are runtime-validated in ref/ahadiff/viewer/src/api/schemas.ts, which prevents UI drift from silently accepting malformed payloads.
It productizes memory/review/challenge loops as durable state, not just one-shot outputs. review.sqlite is guarded as a real persistence boundary in ref/ahadiff/src/ahadiff/review/database.py, and the challenge loop in ref/ahadiff/src/ahadiff/challenge/engine.py converts missed understanding back into new learning signals.
Its UI is organized around explicit task surfaces with typed stores, typed API clients, and route-level product pages. The viewer separates shell, API schemas, Zustand state, and focused pages/components rather than letting one large host file own too much of the interaction model.

For this project, that comparison sharpens the next gaps:

Area	Current NoteConnection position	Gap vs AhaDiff-style maturity	Next move
Evidence model	Scoped citations and `matchedSpans` exist, but evidence is still mostly conversation-output metadata.	Missing a first-class evidence ledger that can survive beyond one answer turn and be reused by later agent flows.	Introduce a durable evidence/claim projection for agent answers and learning artifacts.
Runtime contract validation	TypeScript contracts exist, but runtime response validation is still uneven across the frontend boundary.	UI can still trust structurally malformed payloads too often.	Expand runtime schema validation at the agent-workspace API boundary, especially for richer assistant payloads and future learning-state endpoints.
Durable learning loop	Conversation memory exists, but challenge/review/adaptation loops are still shallow.	Missing a mechanism that turns failed understanding or weak answers back into future tasks/review signals.	Build an explicit agent learning loop: answer -> inspect evidence -> mark confusion/gap -> schedule guided follow-up.
UI information architecture	Tauri workspace is improving, but large host files still own too much orchestration and the workspace still mixes retrieval, action, and diagnostics surfaces densely.	Interaction surfaces are less modular and less stateful than AhaDiff's page/store split.	Continue extracting workspace surfaces behind clear owners and move high-density diagnostics/learning views toward typed state modules.
Cross-run quality governance	Foundation and runbook gates exist, but answer quality and learning effectiveness are not yet tracked with the same durability.	Missing a stable quality ratchet for agent-grounded learning outcomes.	Add persistent answer-quality / evidence-coverage / follow-up-effectiveness history that can drive policy rather than just reporting snapshots.

2026-06-06 Active-Scope Miss Recovery and Document-Augmented RAG Patch

This patch resolves the live "what is water glass?" failure that reproduced while the WebView was already running on npm run tauri:dev:mini:gpu.

Runtime probes showed that the current sidecar could answer correctly when called with an explicit waterglass scope: it returned one grouped knowledge point, eight citations, and matchedSpans. The WebView, however, had folder-select=financial, localStorage.nc_last_target=financial, and window.__NC_ACTIVE_SOURCE_TARGET.scope.sourcePathPrefixes=["Knowledge_Base/financial"]. The user question was therefore sent as a scoped financial query. The existing planner found the global title-like water glass document, but then intersected that document id with the explicit financial workspace/corpus/prefix scope, reducing the retrieval candidate set to zero indexed atoms.

Code-vs-plan reconciliation for this patch:

Requirement	Current implementation evidence	Progress call
Positive answer when the selected scope misses but the query clearly names another knowledge point	`buildQueryBackendContext()` now distinguishes title hits inside the requested scope from title hits outside it. If an explicit scope has no compatible title hit but a document title/alias hit exists elsewhere, retrieval switches to a document-only `planner_scope_recovery` scope instead of intersecting incompatible corpus constraints.	Implemented
Return results by knowledge point, not duplicated sections	The prior document-level conversation grouping remains intact. The recovery query still returns segment-level evidence internally, then `mergeAgentConversationKnowledgePoints()` groups hits by `documentId` and exposes `matchedSpans` inside the single knowledge-point card.	Implemented
RSE + document augmentation direction	The implementation keeps Relevant Segment Extraction behavior at retrieval time while adding document augmentation at planning time: title-like queries can recover the target document, and section hits inside that document become marked evidence spans rather than duplicated cards.	Operational baseline
User-visible diagnosis of scope behavior	The Knowledge Workspace API status strip now includes the active scope label and, when recovery is used, the recovered source path. This directly exposes cases such as "Scope: financial" plus "Recovered: Knowledge_Base/waterglass/water glass.md".	Implemented
Backward compatibility	Public response fields remain additive. Existing `assistantMessage`, `answer`, `assistantBlocks`, citations, and legacy sync/SSE flows remain supported. `scopeSource` gains a new optional value, `planner_scope_recovery`, without removing existing values.	Preserved

Verification for this patch:

Red/green backend regression: KnowledgeLearningPlatform.test.ts now covers financial active scope plus a water glass title-like query recovering the waterglass document and returning one grouped knowledge point with multiple matched spans.
Red/green frontend regression: agent_workspace.frontend.test.ts now covers status-strip scope and recovered-source visibility.
Live root-cause evidence: CDP showed the running WebView was scoped to financial; direct sidecar probing with waterglass scope returned grouped evidence correctly.

2026-06-06 Knowledge Workspace RAG Answering and API Observability Slice

This update closes a practical Knowledge Workspace gap observed while npm run tauri:dev:mini:gpu was already running: the live sidecar could retrieve scoped waterglass evidence after hydration, but the user-facing answer still used the old "strongest scoped match" template and returned repeated section-level cards from the same knowledge point.

Code-vs-plan reconciliation for this slice:

Requirement	Current implementation evidence	Progress call
User-visible API call status	The Knowledge Workspace now renders a compact API status strip for `/api/knowledge/conversation`, including idle/pending/ok/error state, transport (`SSE` or sync fallback), latency, knowledge-point count, citation count, and memory count (`src/frontend/index.html`, `src/frontend/agent_workspace.js`, `src/frontend/styles.css`).	Implemented
Direct grounded answers	`agentConversation()` now composes the top-level `answer` and `Scoped Answer` block from the best retrieved evidence sentence first, then appends grounding counts and citations. It no longer forces successful explanation turns to start with "The strongest scoped match is..." (`src/learning/KnowledgeLearningPlatform.ts`).	Implemented
Knowledge-point grouping	Conversation responses preserve low-level `queryKnowledge()` atom granularity internally, but group user-facing `knowledgePoints` by `documentId`, exposing optional `atomIds`, `documentId`, `sourcePath`, `matchedSpans`, `citations`, and `matchCount` for document-augmented UI rendering (`src/learning/types.ts`, `src/learning/KnowledgeLearningPlatform.ts`).	Implemented
RSE/document augmentation direction	The response shape now separates retrieval evidence spans from the knowledge-point card identity, so repeated hits inside one document are shown as matched spans inside one card rather than duplicate cards. This is the current lightweight RSE-style boundary: retrieval remains segment-level; answer/UI presentation becomes evidence-grouped and document-augmented.	Operational baseline
Runtime caveat	A running `server.exe` sidecar does not hot-swap TypeScript/backend changes. `npm run build` and `npm run ensure:sidecar:dev` prepare the fixed dev sidecar, but the currently open `tauri:dev:mini:gpu` window must be restarted to load the rebuilt backend binary.	Operational note

Verification for this slice:

Red/green regression coverage: KnowledgeLearningPlatform.test.ts now verifies direct water glass answering plus grouped matched spans; agent_workspace.frontend.test.ts verifies grouped hit rendering and the API status panel.
Reconfirmed locally: npm.cmd exec -- jest src/learning/KnowledgeLearningPlatform.test.ts --runInBand, npm.cmd exec -- jest src/agent_workspace.frontend.test.ts --runInBand, npm.cmd exec -- jest src/server.migration.test.ts -t "conversation route auto-hydrates|knowledge/conversation" --runInBand, npm.cmd exec -- jest src/agent_workspace.contract.parity.test.ts src/agent_workspace.runtime.behavior.test.ts --runInBand, node --check src/frontend/agent_workspace.js, node --check src/frontend/workspace_panes.js, npm.cmd run build, and npm.cmd run ensure:sidecar:dev.

2026-06-06 Mainline Architecture Progress and Code-vs-Plan Reconciliation

This update reconciles the current main codebase against the May architecture plans and supersedes any stale reading that treats Program F delivery as the same thing as release-grade foundation closure.

Current source-of-truth references:

Detailed bilingual plan: Architecture Progress Alignment and Mainline Plan (2026-06-06)
Prior RAG/agent plan: Multiplatform Lightweight RAG and Agent Architecture Plan
Prior substrate plan: Deep Student Comparison Next-Phase Plan

Code-vs-plan reconciliation:

Plan requirement	Current `main` evidence	Progress call
Scoped retrieval and workspace/corpus boundaries	`KnowledgeQueryRequest.scope`, `KnowledgeCorpusScope`, workspace readiness, miss diagnostics, active-target hydration, and workspace/export substrate exist across `src/learning/types.ts`, `src/learning/KnowledgeLearningPlatform.ts`, `src/workspace/`, and `src/export/`.	Implemented baseline
Lightweight RAG and grounded conversation	`AgentConversationResponse` carries `answer`, citations, memory actions, trace, and backward-compatible `assistantBlocks`; frontend Tauri workspace renders typed blocks when present and still supports legacy `assistantMessage`.	Operational baseline
Durable resource/index/workspace/session/memory/export substrate	Program A-F code exists under `src/resources/`, `src/indexing/`, `src/workspace/`, `src/session/`, `src/workflows/`, `src/memory/`, and `src/export/`.	Implemented
Platform shell separation	`PlatformCapabilities`, `RenderMaterializer`, render routes, and workspace export bundles keep desktop/Godot/mobile materialization decisions explicit. Godot remains PNG-first because direct SVG import is unsafe.	Implemented
Runtime graphdb/ANN production closure	graphdb/sqlite, external graphdb HTTP, local-vector rollout controls, external HTTP vector acceleration, runtime capability checks, and rollout profile payloads exist.	Operational, not production-closed
Single route/runtime ownership	modular route registration exists, and runtime runbook modular-route operations now have a dedicated owner in `src/routes/runtimeRunbookRouteOps.ts`, but conversation, turn-cache, rollout, and other stateful fallback/orchestration logic still carry heavy ownership inside `src/server.ts`.	Partially complete
Architecture reduction	Current line-count scan shows `src/server.ts` about 15,920 lines and `src/learning/KnowledgeLearningPlatform.ts` about 10,351 lines; major frontend hosts remain large.	Behind target

Architecture progress map:

Layer	Current maturity	Next move
Graph/path core	Mature operational baseline	Preserve compatibility and avoid unrelated churn.
Runtime storage/retrieval	Operational baseline	Current Windows-host strict release-evidence history audit now passes at 3/3 for sqlite and ANN, readiness exposes it as `foundation_release_evidence_history`, and an opt-in multi-host audit script now exists; next movement is real multi-host evidence plus threshold calibration.
Scoped RAG/conversation	Operational baseline	Extract ownership from `server.ts` without removing legacy response fields.
Memory/session/workflow	Implemented substrate	Harden policy, audit, and workflow artifact quality before adding UI-only state.
Export/platform shell	Implemented baseline	Keep Godot/mobile materialization and export profile rules explicit.
Governance/CI	Strong but host-dependent	Keep FR-009, Linux strict Tauri evidence, graphdb/ANN calibration, and docs truth as separate gates.

Immediate next direction:

Keep docs truth synchronized across this dashboard, task, implementation plan, TODO, README, interface docs, and the new solution note.
Finish release-grade graphdb and ANN closure: use the new opt-in multi-host evidence gate to extend strict release-evidence history beyond the current Windows host, tighten graphdb connector budgets, calibrate ANN release-gate matrix recall/latency thresholds, and collect strict rollout evidence.
Reduce server.ts ownership pressure by moving turn-cache, alert trend, runbook bridge, and rollout helper logic behind explicit modules.
Continue KnowledgeLearningPlatform.ts domain extraction only when the new owner hides state or enforces a real invariant.
Expand assistant block coverage through typed, optional payloads while preserving assistantMessage and stream/sync/replay compatibility.
Keep Godot/mobile constraints at platform adapter boundaries, especially the no-direct-SVG rule.

Verification position for the 2026-06-06 alignment and P1 evidence slices:

The initial alignment update was documentation-only; the current P1 evidence-history/readiness slice changes verifier tooling, package scripts, tests, and readiness mandatory checks, not public runtime APIs.
Required gate for this slice: foundation release-evidence contract tests, foundation readiness regression tests, default/strict release-evidence verification, migration tests, docs map validation, docs site build, Mermaid fence guard, diff review, and clean worktree after commit.
Code slices that follow must continue to use the runtime gates listed in the active task docs, especially verify:foundation:sqlite-runtime:soak, verify:foundation:ann-runtime:matrix, verify:foundation:release-evidence, verify:foundation:release-evidence:strict, test:agent-workspace:contracts, and verify:core-real-machine:clean; release owners should add verify:foundation:release-evidence:multi-host when a release window needs host diversity.
ANN release calibration slices should use verify:foundation:ann-runtime:release; the full matrix release-gate path now has fresh Windows-host evidence, verify:foundation:release-evidence checks whether latest sqlite/ANN release reports are still fresh before they are used as release context, readiness exposes the strict history audit through foundation_release_evidence_history, and verify:foundation:release-evidence:multi-host can now enforce 2 distinct host keys per component. verify:foundation:release-evidence:strict now passes on the current Windows host with sqlite 3/3 and ANN 3/3, so the remaining evidence gap is actual multi-host release evidence and calibration rather than local report count.

2026-05-27 Workflow Gate Realignment and Progress Truth Sync

The current branch had already migrated repo-owned workflows to actions/setup-node@v4 with node-version: "24".
The actual failure on main was narrower than the old docs suggested:
- scripts/verify-fixrisk-issues.js was still enforcing the removed FORCE_JAVASCRIPT_ACTIONS_TO_NODE24 transition override,
- so FR-010 failed even though the workflow YAMLs themselves were already on the intended baseline.
This slice realigns three layers at once:
- the verifier now checks the current Node 24/no-override baseline,
- the fixrisk live-status docs now describe that baseline precisely,
- the active progress/task/implementation docs no longer treat “green once” as a timeless fact.

Code-vs-plan reality for this correction:

Area	Prior expectation	Current HEAD reality	Status
FR-010 workflow closure	keep the old Node24 compatibility override in place	repo-owned workflows already run on `setup-node@v4` + `node-version: "24"` and should stay free of the removed override	Corrected
Fixrisk code-level gate	workflow migration was already “done”	verifier logic lagged behind workflow reality and had to be updated to match the new baseline	Corrected
Residual Node 20 warnings	all deprecation noise should disappear once `setup-node` moved to 24	some annotations still come from marketplace action runtimes (`upload-artifact`, release helpers) and remain external non-blocking debt	Tracked
Progress reporting	prior docs could keep saying remote CI was green again	active docs now need to separate repo-owned closure from remaining external/runtime debt and live-run status	Corrected

Immediate next direction from this point:

keep Fixrisk Operational Readiness and Migration Gates green on live main runs,
continue Tauri-first rich reply expansion without breaking the shared Reader-derived render substrate,
keep Program F substrate work stable while Phase-1/Phase-2 release-grade calibration remains the real engineering frontier,
keep architecture reduction active because server.ts, KnowledgeLearningPlatform.ts, and the frontend host files are still oversized.

2026-05-27 Tauri-First Conversation Rendering Reality Sync

The branch has moved materially beyond the last Program F-only status snapshot.
Current code truth now spans five distinct slices that the older dashboard under-reported:
- scoped knowledge-workspace grounding is now real rather than graph-only optimistic,
- Tauri reader markdown/math/mermaid rendering is materially hardened,
- provider settings and TOML template management are now productized in the frontend,
- conversation preflight/CORS compatibility is fixed for turn-resume headers,
- desktop/runtime debug capture tooling is now part of the repo rather than an ad hoc local workflow.

Implemented now at current HEAD:

Knowledge workspace closure:
- src/frontend/source_manager.js now publishes the active source target,
- src/frontend/agent_workspace.js now forwards activeTarget and scoped prefixes with conversation requests,
- src/server.ts now performs active-target-aware workspace hydration plus selective title-like hydration,
- src/routes/data.ts no longer shadows the real build / restore-cache path with stub behavior,
- src/learning/KnowledgeLearningPlatform.ts now exposes workspaceReadiness, missDiagnostics, title-hit planning, and request inspection helpers.
Reader/runtime rendering hardening:
- src/frontend/reader.js now renders raw markdown, KaTeX, and Mermaid through one runtime path, with frontend render first and backend PNG fallback retained,
- src/frontend/app.js now suppresses leaked Mermaid error artifacts so long-lived error blocks stop occupying the visible Tauri surface,
- src/notemd/MermaidProcessor.ts, src/reader_renderer.ts, and src/routes/render.ts now strengthen parity and render-service behavior around Mermaid/Math handoff.
Provider and settings delivery:
- src/frontend/index.html, src/frontend/settings.js, src/notemd/AppConfigToml.ts, src/notemd/providerTemplates.ts, and the locale bundles now expose a dedicated agent/provider settings page with preset templates and TOML materialization support,
- the API-version field is now documented in-product instead of being left as unexplained raw config.
Runtime transport compatibility:
- src/middleware/cors.ts now explicitly allows x-agent-conversation-turn-id and x-agent-conversation-resume-turn-id, which closes the preflight failure that previously broke browser/Tauri conversation retries.
Debug and evidence tooling:
- the repository now ships runtime/webview/window capture helpers plus Mermaid stage/export tooling under scripts/, so live Tauri failures can be inspected with first-party evidence commands instead of one-off manual probing.

Tauri-first reply rendering baseline delivered:

src/learning/types.ts and src/learning/KnowledgeLearningPlatform.ts now expose backward-compatible assistantBlocks alongside legacy assistantMessage,
src/frontend/markdown_runtime.js now carries a shared markdown/math/mermaid runtime extracted from the Reader-side logic,
src/frontend/workspace_panes.js now mounts assistant replies through typed blocks instead of only plain text when structured payloads are present,
src/frontend/agent_workspace.js keeps legacy fallback behavior intact, so older assistantMessage-only flows still render.

The next real improvement beyond that baseline is now also in code:

src/learning/KnowledgeLearningPlatform.ts no longer treats assistantBlocks as a thin transport wrapper around the same old answer string,
the scoped conversation reply is now organized into explicit overview / explanation / evidence summary / memory notice / action guidance sections before citations and knowledge-action affordances are appended,
those sections now also consume real scoped data instead of only templated filler: explanation is anchored to the strongest scoped point, evidence summary reflects actual citations, and action guidance now carries memory follow-through hints,
the reply policy is now also intent-aware, so comparison-style and how-to-style prompts can shape the explanation and next-action sections differently,
which means the Tauri agent surface can now look materially different even when the underlying knowledge result set is unchanged.

The next architecture-quality improvement beyond that rendering/semantics baseline is also now started:

the agent-conversation reply composition path is no longer treated as permanent inline KnowledgeLearningPlatform.ts ownership,
a dedicated src/learning/conversationComposer.ts module now owns grouped knowledge-point composition plus scoped reply-section synthesis,
which lowers KLP pressure without changing the public AgentConversationResponse contract or the existing Tauri/browser rendering path.
this is intentionally a small, ownership-oriented extraction rather than a new abstraction layer: KLP still owns runtime state and persistence, while the new module owns pure data composition only.

The next gap is narrower now:

broaden block coverage where future endpoints emit richer assistant payloads,
keep the new render substrate honest under real browser/Tauri runtime verification,
preserve a clean downgrade/materialization boundary for later Godot reuse.

2026-05-27 Phase-1 SQLite Soak Gate Baseline

The embedded graphdb/sqlite baseline no longer stops at restart continuity plus workload-envelope proof.
Current HEAD now also carries a dedicated host-level soak/performance verifier:
- npm run verify:foundation:sqlite-runtime:soak
- emits structured JSON reports under output/verification/foundation-sqlite-runtime/,
- keeps release-grade evidence separate from the lighter smoke / medium / heavy matrix path.

What this newly proves:

repeated restart cycles on both dist runtime and packaged sidecar paths,
structured startup / ingest / readiness / diagnostics / query duration summaries,
threshold-gated p95 / max latency checks for the sqlite baseline.

What it still does not prove:

long-horizon multi-host evidence,
calibrated final release thresholds across heterogeneous machines,
a declaration that A8 is fully production-closed.

Code-vs-plan reality for this slice:

Area	Prior expectation	Current HEAD reality	Status
Knowledge workspace grounding	scoped queries should resolve the selected corpus instead of only the loaded graph	active target, scope prefixes, title-like hydration, workspace readiness, and miss diagnostics are now wired across frontend, server, and KLP	Operational
Tauri markdown reader parity	markdown, math, and Mermaid should survive real runtime conditions without persistent error overlays	reader/runtime paths now render Mermaid with frontend-first + backend-PNG fallback and suppress leaked error artifacts	Operational
Provider settings	provider/model/API-key controls should be isolated and write durable TOML configuration	dedicated agent/provider settings surface plus preset/TOML template helpers are now present	Operational
Conversation retry transport	turn/replay headers should survive browser/Tauri preflight	CORS now allows both conversation turn headers	Closed
Tauri agent reply rendering	assistant replies should render rich markdown instead of plain text	backward-compatible `assistantBlocks` plus shared markdown runtime now power rich assistant replies, while legacy `assistantMessage` remains supported	Operational

2026-05-26 Program F Closure

The deep-student-derived next-phase program is now implemented through Program F at current HEAD.
The new durable substrate is no longer only a plan artifact. It now exists in code:
- canonical resources and projections: src/resources/,
- unit/segment indexing lifecycle: src/indexing/,
- durable workspace/corpus entities: src/workspace/,
- session/workflow durability: src/session/, src/workflows/,
- typed memory governance and audits: src/memory/MemoryGovernance.ts,
- deterministic workspace export bundles: src/export/WorkspaceExportBundle.ts.
KnowledgeLearningPlatform.ts now persists and restores those substrate layers alongside graph, mastery, conversation, and telemetry state.
POST /api/knowledge/export/workspace now exposes the Program F bundle path.
Platform export semantics are now explicit rather than inferred:
- src/platform/PlatformCapabilities.ts describes bundle packaging mode (full vs slim) and indexed-readiness requirements,
- src/platform/RenderMaterializer.ts continues to enforce PNG-first materialization where SVG is unsafe.

Fresh verification evidence for this closure:

npm.cmd run build:mini
npm.cmd test -- --runInBand src/resources/ResourceRegistry.test.ts src/workspace/WorkspaceRegistry.test.ts src/indexing/IndexLifecycle.test.ts src/session/SessionStateStore.test.ts src/workflows/WorkflowArtifactStore.test.ts src/memory/MemoryGovernance.test.ts src/export/WorkspaceExportBundle.test.ts src/platform/PlatformCapabilities.test.ts src/platform/RenderMaterializer.test.ts src/routes/registry.contract.test.ts src/learning/store.test.ts src/learning/KnowledgeLearningPlatform.test.ts src/learning/KnowledgeLearningPlatform.persistence.test.ts src/learning/KnowledgeLearningPlatform.program-f.test.ts

Operational implication:

mobile slim export and desktop export now converge on the same workspace/resource/index/session/memory substrate,
lightweight local RAG and multi-platform export now share one durable scope model instead of parallel ad hoc state paths.

2026-05-10 Open-Goal Sync

This page is aligned with the repository-wide audit in Open Goal Audit (2026-05-10).
Current unresolved-goal decisions should be read from:
- docs/en/TODO.md
- docs/en/task.md
- docs/en/tauri_tasks.md
- docs/en/TEST_REPORT.md
Historical checklists in archive/historical docs remain traceability context and are not the canonical release gate.

2026-05-12 HEAD Reality Recalibration

The previous "Phase-1 closure" wording is too optimistic for current HEAD and is superseded by this section.
What is real at HEAD:
- graph/store operations semantics exist in src/learning/store.ts, including file-backed ops, embedded SQLite graphdb persistence/query paths, and HTTP adapter paths with fallback diagnostics,
- the embedded sqlite baseline now also has restart-durability proof: shutdown closes the store cleanly, the adapter can reopen safely, and server integration covers ingest -> shutdown -> fresh module reload -> diagnostics/query/readiness continuity,
- a host-level verifier now exercises that same embedded sqlite baseline through both dist runtime and packaged sidecar flows on the current Windows host: ingest -> store diagnostics/foundation readiness -> restart -> query continuity (scripts/verify-foundation-sqlite-runtime.js),
- a host-level workload-matrix verifier now extends that proof across smoke / medium / heavy profiles on the same two runtime paths, including snapshot metadata counts plus restart and multi-point query continuity (scripts/verify-foundation-sqlite-runtime.js --matrix),
- foundation readiness mandatory checks now include the release-facing sqlite soak alias (verify:foundation:sqlite-runtime:release), the ANN matrix release gate (verify:foundation:ann-runtime:release), the release-evidence freshness verifier (verify:foundation:release-evidence), and the strict release-evidence history verifier (verify:foundation:release-evidence:strict), so operator-facing readiness output matches the package scripts used for release evidence,
- a host-level ANN verifier now proves the external_http connector baseline on the same Windows host through both dist runtime and packaged sidecar flows: ingest -> live query-backend diagnostics -> restart -> query continuity (scripts/verify-foundation-ann-runtime.js),
- a host-level ANN workload-matrix verifier now extends that proof across smoke / medium / heavy profiles on the same two runtime paths, including sync/select telemetry, aligned representation metadata, and restart continuity (scripts/verify-foundation-ann-runtime.js --matrix),
- the ANN verifier now also has a release-gate mode and structured report output: scripts/verify-foundation-ann-runtime.js --release-gates records startup / ingest / diagnostics / query duration summaries plus targeted-query recall under output/verification/foundation-ann-runtime/, and npm run verify:foundation:ann-runtime:release wires the full matrix release path,
- scripts/verify-foundation-release-evidence.js now reads the latest sqlite soak and ANN release-gate reports, verifies bounded freshness, required profiles, both runtime modes, sqlite soak gates, ANN release gates, and expected recall, scans timestamped history reports, reports host-key coverage, and writes a compact release-evidence summary under output/verification/foundation-release-evidence/; the default audit requires 1 valid fresh report per component, verify:foundation:release-evidence:strict requires 3 and now passes on the current Windows host, and verify:foundation:release-evidence:multi-host additionally requires 2 host keys,
- ANN-style prefilter, representation telemetry, circuit health, remote index sync, and live external_http connector proof now exist in src/learning/queryBackend.ts and src/learning/vectorAccelerationAdapter.ts,
- runtime capability/runbook governance now includes explicit ANN remote index-sync health (query_vector_acceleration_index_sync_health) in addition to prefilter, health, traceability, and circuit checks,
- runtime capability governance now also includes explicit gate query_vector_acceleration_calibration_readiness, which formalizes whether the ANN path is even ready for release-grade threshold tuning,
- server.ts now closes the corresponding operator loop: the index-sync gate participates in verification escalation, remediation action-queue generation, and per-check runbook history summaries,
- the agent workspace runtime-runbook surfaces now render operator-facing ANN governance directly in the frontend shell: verify/checks now expose sync-health plus circuit-budget, traceability, and prefilter summaries, and they now also show threshold/signal drilldowns plus calibration-readiness state and the explicit calibration gate needed for budget tuning work, while action-queue keeps the index-sync incident drilldown,
- the modular src/routes/knowledge.ts runtime-runbook surfaces now delegate to live server-side runbook ops with full query-parameter passthrough, so browser/runtime consumers no longer hit the old KLP placeholder payloads for verify/history/checks/action-queue/remediation/schedule flows,
- the browser strict smoke gate now also proves those ANN runbook surfaces from real browser evidence: verify-card ANN sync/circuit/traceability/prefilter content plus threshold/signal and calibration-readiness labels, checks-card first-check ANN sync plus circuit/traceability/prefilter snapshots, and action-queue index-sync drilldown are now asserted end to end instead of remaining component-test-only,
- locale governance for the agent workspace is now tighter on both static and runtime surfaces: bilingual locale bundles now cover the query/quality/runbook cards exercised by strict browser smoke, src/agent_workspace.locale.contract.test.ts blocks source-referenced agentWorkspace.* key drift, and startup-time translate helpers no longer emit false missing-key warnings before locale initialization finishes,
- Phase-2 runtime diagnostics are now materially implemented in src/learning/KnowledgeLearningPlatform.ts for query-backend comparison/history/trend, knowledge staleness diagnostics/rebuild planning, learning-quality history/trend, session-plan quality evaluation/history/trend/runtime-threshold diagnostics, query-backend config, and query-backend diagnostics,
- Phase-3 tutor/memory diagnostics remain real and now include an active default runtime tutor adapter path in src/server.ts, so normal server execution can emit adapter telemetry instead of staying catalog-only.
What is not closed yet:
- Phase-1 A8 has advanced beyond a file-only default: src/server.ts now defaults to graphdb/sqlite with explicit file fallback, restart durability is already proved, host-level dist/runtime + packaged sidecar proof is in place, and a host-level workload matrix is now in place across smoke / medium / heavy, but soak / longer-duration / performance hardening is still open before calling the local graph backend production-closed,
- Phase-1 A9 is now operational rather than scaffold-only: host-level runtime proof, a host-level workload matrix, and matrix release-gate evidence are now in place, but repeated multi-host calibration and threshold convergence are still open before calling the ANN layer production-closed,
- Phase-2 quality/session/query observability is now real, but it is not yet release-closed because these gates still require release-grade calibration on top of the current graph/ANN operational baseline; the new ANN calibration-readiness gate only formalizes prerequisites, not closure,
- default tutor routing is no longer catalog-only, but the runtime is still effectively local-first and retains explicit rule-engine fallback rather than a production-proven multi-provider routing policy.
Active execution focus therefore shifts to truth-first foundation recovery:
- finish the remaining soak / longer-duration / performance closure for the embedded graph backend baseline while keeping the new dist/runtime + packaged sidecar proof and workload matrix green,
- finish the remaining workload/threshold closure for the now-live ANN connector baseline,
- move the newly surfaced ANN runbook visibility from operator-readable summaries to workload-calibrated release gates,
- keep the new diagnostic surfaces honest against the same runtime truth,
- then promote Phase-2 / Phase-3 gates as release-significant only after the graph/ANN baseline is release-grade.

Scope

Focus area: local-first knowledge mastery platform (ingest, retrieval, learning path, tutor, memory, governance).
Time window: v1.7.0 to current branch baseline.
Evidence rule: every progress claim must map to:
- contract surface (src/learning/api.ts, src/learning/types.ts)
- route wiring (src/server.ts)
- behavior tests (src/knowledge.api.contract.test.ts and domain tests)

2026-05-12 Architecture Metrics

File	Current Lines	Implication
`src/server.ts`	15,920	routing is modularized, but the main server monolith is still large and now also carries more runtime orchestration
`src/learning/KnowledgeLearningPlatform.ts`	10,043	KLP remains the dominant implementation gravity well and now also carries workspace-readiness and planner logic
`src/frontend/path_app.js`	4,943	path workbench/controller split is still incomplete and now additionally carries reader-parity responsibilities
`src/frontend/app.js`	5,953	graph runtime still keeps a large host-side control surface and now also carries Mermaid error-guard behavior
`src/frontend/reader.js`	1,334	reader/runtime rendering is now a first-class subsystem rather than a lightweight helper
`src/frontend/agent_workspace.js`	3,214	agent orchestration exists, but reply rendering still has not crossed from text shell to rich message surface
`src/routes/knowledge.ts`	690	knowledge routes need another split before claiming route-layer compaction

Use these numbers as the current HEAD truth. Older size-reduction tables later in this page remain useful as historical traceability, but they no longer describe the current branch state exactly.

Current Delivery Focus

The immediate L4 priority is no longer generic interaction expansion. It is the end-to-end delivery of:

frontend agent chat,
local knowledge-point listing,
docked Tauri graph focus mode pane,
docked learning-path pane that can coexist with graph focus and be promoted fullscreen.
Tauri-first assistant reply rendering that can reuse the mature Reader markdown/math/mermaid pipeline.

Execution reference:

Agent Conversation + Focus Mode Delivery Plan

Current branch status for this slice:

the main frontend now contains an agent workspace shell (src/frontend/index.html, src/frontend/styles.css),
the conversation route now returns actionable local knowledge points with typed capability descriptors, including execution / failure / UI-hint metadata for focus, learning path, tutor-side actions (generate_quiz, recap, generate_transfer, generate_counterexample, follow_up), query-side compare_query_backends / inspect_query_backend_diagnostics / inspect_query_backend_comparison_history / inspect_query_backend_comparison_trend, tutor diagnostics inspect_tutor_adapter_telemetry / inspect_tutor_trace_diagnostics, quality/session diagnostics (inspect_learning_quality_trend / inspect_learning_quality_history / inspect_session_plan_quality_trend / inspect_session_plan_quality_history), session-side inspect_session_history / build_study_session, and conversation-memory recall inspect_conversation_memory (src/server.ts, src/learning/KnowledgeLearningPlatform.ts, src/learning/types.ts),
the capability actions that inspect query comparison, staleness, learning quality, session-plan quality, and query-backend diagnostics are now backed by live KnowledgeLearningPlatform.ts implementations instead of empty placeholders,
default server bootstrap now injects a concrete local tutorAdapter while preserving the multi-adapter catalog (local + cloud), so normal runtime tutor execution can emit adapter telemetry and still degrade explicitly to guarded fallback behavior when needed,
conversation knowledge points are now typed-only (capabilities is the single action source), and legacy availableActions fallback telemetry/synthesis has been removed from both backend response shape and pane rendering (src/learning/types.ts, src/learning/KnowledgeLearningPlatform.ts, src/frontend/workspace_panes.js, src/agent_workspace.frontend.test.ts, src/knowledge.api.contract.test.ts),
agent workspace capability execution dispatch now enforces explicit execution-kind handlers without legacy action fallback execution; knowledge operations are split into independent transport and request-builder registries, while result presentation is split into custom presenters plus card-presentation descriptors and payload-builder registries with fail-fast unsupported_result_presentation* drift semantics, and parity/frontend diagnostics now cover transport/request-builder/custom-presentation/card-presentation/payload-builder/execution-kind completeness (src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts, src/agent_workspace.contract.parity.test.ts),
clicking Learning Path now mounts the existing path workspace (path-container + sidebars) into the docked learning-path pane instead of stopping at a text-only preview (src/frontend/workspace_panes.js, src/frontend/agent_workspace.js),
the graph surface now reserves workspace width so conversation + graph-focus + learning-path can coexist in one host-owned layout (src/frontend/styles.css),
graph-focus fullscreen now promotes the real graph workspace instead of only enlarging a metadata card (src/frontend/workspace_panes.js, src/frontend/styles.css),
the scoped-knowledge conversation flow is now materially more honest about corpus readiness: active folder target flows into the request contract, the server selectively hydrates likely title-matching documents into the workspace, and conversation traces now include readiness + miss diagnostics instead of only returning an empty top-k result (src/frontend/source_manager.js, src/frontend/agent_workspace.js, src/server.ts, src/routes/data.ts, src/learning/KnowledgeLearningPlatform.ts),
the new agent workspace shell now has i18n coverage for static shell strings plus runtime button/empty-state messaging, existing knowledge-card actions / localized system messages now re-render on language change instead of staying in the previous locale, conversation card rerender is now centralized through a card-kind renderer registry, and a source-level parity guard now checks append-kind vs registry alignment (src/frontend/index.html, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/frontend/workspace_panes.js, src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts),
provider/model/API-key settings are now isolated into a dedicated agent settings page with preset-template and TOML-template flows, and the same agent workspace now also has a typed rich-reply baseline instead of staying plain-text-only (src/frontend/index.html, src/frontend/settings.js, src/notemd/AppConfigToml.ts, src/notemd/providerTemplates.ts, src/frontend/markdown_runtime.js, src/frontend/workspace_panes.js, src/frontend/agent_workspace.js),
the reader/runtime render stack is now substantially more robust for Tauri markdown usage: raw markdown, KaTeX, Mermaid frontend render, Mermaid backend PNG fallback, leaked Mermaid error suppression, and a shared markdown runtime for agent replies are all present (src/frontend/reader.js, src/frontend/app.js, src/frontend/markdown_runtime.js, src/reader_renderer.ts, src/routes/render.ts, src/notemd/MermaidProcessor.ts),
locale governance now includes backend-to-frontend capability label-key parity blocking: emitted conversation capability labelKey values must resolve to non-empty bilingual agentWorkspace.actions.* entries (src/learning/KnowledgeLearningPlatform.ts, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.locale.contract.test.ts),
modular knowledge-route closure now includes live browser strict proof instead of snapshot-only recovery: conversation response wiring, capability-triggered request routes, card-title localization, and graph-focus compatibility now pass STRICT, UI_STRICT, and UI_DYNAMIC_STRICT against real browser/network traces (src/routes/knowledge.ts, src/learning/KnowledgeLearningPlatform.ts, src/frontend/app.js, src/frontend/locales/en.json, src/frontend/locales/zh.json, scripts/verify-agent-workspace-browser.js),
locale governance now also blocks capability failure-message drift: emitted failure.messageKey values must resolve to bilingual agentWorkspace.messages.* entries with aligned interpolation placeholders and stable fallback placeholder sets (src/learning/KnowledgeLearningPlatform.ts, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.locale.contract.test.ts),
backend capability descriptor contract now enforces execution completeness for knowledge_operation capabilities (operationId + resultPresentation) and mandatory failure metadata (messageKey + fallbackMessage) in the emitted conversation capability payload (src/learning/KnowledgeLearningPlatform.ts, src/agent_workspace.contract.parity.test.ts),
frontend capability execution now enforces an operation-scoped result-presentation allowlist (default + explicit overrides such as execute_tutor_action -> tutor_action_card) and fails fast on unsupported combinations before backend request dispatch and renderer dispatch (src/frontend/agent_workspace.js, src/agent_workspace.contract.parity.test.ts, src/agent_workspace.frontend.test.ts),
contract governance now blocks allowlist override drift: override operation keys must be a subset of AgentConversationCapabilityOperationId, and override values must be a subset of AgentConversationCapabilityResultPresentation (src/agent_workspace.contract.parity.test.ts),
parity governance now enforces allowlist shape: each operation allowlist must include its transport default presentation, and backend non-default operation presentations must be declared by explicit frontend overrides (src/agent_workspace.contract.parity.test.ts),
override hygiene now enforces non-default-only semantics: operation override entries may not repeat transport defaults (src/agent_workspace.contract.parity.test.ts, src/frontend/agent_workspace.js),
override governance now also blocks stale entries: each frontend override presentation must be observed in backend capability emission for the same operation (src/agent_workspace.contract.parity.test.ts),
frontend registry diagnostics now export per-operation override/default/allowlist presentation maps (operationResultPresentationOverrideMap, operationDefaultResultPresentations, operationAllowedResultPresentations) to support contract drift debugging (src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts),
frontend registry diagnostics now also export operationInvalidResultPresentationOverrideMap so runtime can surface default-duplicate/unknown override tokens if configuration drifts (src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts),
frontend registry diagnostics now also export operationUnknownResultPresentationOverrideMap so unknown override operation IDs remain visible in runtime drift debugging (src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts),
frontend registry diagnostics now also export override-drift summary signals (operationResultPresentationOverrideDriftDetected, invalid/unknown override token counters) for quick runtime health checks (src/frontend/agent_workspace.js, src/agent_workspace.frontend.test.ts),
frontend message-locale governance now blocks unresolved runtime message keys: every agentWorkspace.messages.* key referenced by agent_workspace.js must resolve to bilingual locale entries with aligned placeholders (src/frontend/agent_workspace.js, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.locale.contract.test.ts),
app-level tauri lifecycle observability now records pathmode-window-toggled payloads into a bounded frontend trace buffer and forwards them as noteconnection:pathmode-window-toggled DOM events for local diagnostics (src/frontend/app.js),
the desktop lifecycle verification stack now includes a first real app/window-handle evidence path (verify:agent-workspace:tauri:window-evidence) that runs dedicated Rust tests against mock-app webview window handles and emits structured artifacts under output/tauri/agent-workspace-window-evidence, with explicit degraded semantics when host system dependencies are missing (scripts/verify-agent-workspace-tauri-window-evidence.js, src-tauri/src/lib.rs, src/agent_workspace.tauri.contract.test.ts),
CI now has an always-on strict desktop evidence job in .github/workflows/migration-gates.yml (agent-workspace-tauri-strict-evidence) that runs verify:agent-workspace:tauri:rust:strict and verify:agent-workspace:tauri:window-evidence:strict on Linux hosts with explicit javascriptcoregtk-4.1 / libsoup-3.0 dependencies, and release workflow .github/workflows/release-desktop-multi-os.yml now enforces the same strict evidence gate on the Linux desktop build path before bundle generation; both workflows also generate a strict evidence index (verify:agent-workspace:tauri:evidence:index:strict), enforce a strict evidence manifest gate (verify:agent-workspace:tauri:evidence:manifest:strict), and upload tauri evidence artifacts (retention policy pinned to 30 days) for audit traceability, while the Linux release path now publishes release-fragment-latest.md into GitHub Release notes using marker-based idempotent upsert,
migration workflow now also includes a dedicated always-on agent-workspace-contract-gates job that runs test:agent-workspace:contracts (parity/frontend/tauri contract suites) plus test:conversation-turn-cache:durability (restart durability check for turn-cache trend index/export consistency), closing the CI drift-detection gap for agent-workspace contract evolution,
license governance now adds test:license:contract to enforce GPL-3.0-only parity across LICENSE, README, package.json, and src-tauri/Cargo.toml, and this gate is wired into migration-gates CI to block license drift,
browser smoke now exercises real conversation/path/query-compare/quality/session/runbook backend slices (including trend + history diagnostics plus runbook verify/checks/action-queue), real graph runtime, and real path runtime, and now asserts ANN sync-health plus verify/checks circuit/traceability/prefilter threshold/signal and calibration-readiness content from browser evidence before emitting screenshot/console/network-summary artifacts (scripts/verify-agent-workspace-browser.js, src/agent_workspace.browser.contract.test.ts),
scoped conversation-memory foundation is now wired end-to-end (typed contracts, backend normalizers/routes, capability operation registry, locale keys, lifecycle tests, browser/runtime verification) through /api/knowledge/conversation-memory/{list,add,search,delete,feedback} (src/learning/api.ts, src/learning/types.ts, src/learning/KnowledgeLearningPlatform.ts, src/server.ts, src/frontend/agent_workspace.js, src/knowledge.api.contract.test.ts, src/learning/KnowledgeLearningPlatform.test.ts, src/agent_workspace.frontend.test.ts),
unified turn streaming baseline is now delivered on /api/knowledge/conversation via Accept: text/event-stream negotiation with a minimal event set (turn_started/capability_planned/capability_progress/capability_result/turn_completed/turn_failed) and frontend stream-first + sync fallback behavior (src/server.ts, src/frontend/agent_workspace.js, src/knowledge.api.contract.test.ts, src/agent_workspace.frontend.test.ts),
M8.2 recovery semantics are now in place on top of the stream baseline: frontend requests propagate client turn IDs across stream-first + sync fallback, server route /api/knowledge/conversation now enforces replay-window idempotency with turn-level dedupe/conflict protection (turn_id_conflict), and resumed stream requests replay cached turn events instead of re-running execution (src/server.ts, src/frontend/agent_workspace.js, src/knowledge.api.contract.test.ts, src/agent_workspace.frontend.test.ts),
M8.3 operator baseline is now delivered: turn-cache lifecycle diagnostics are exposed at GET /api/knowledge/conversation/turn-cache/diagnostics (including TTL/capacity config, live state, hit ratio, conflict count, replay counters, and eviction counters), and TTL/capacity are runtime-tunable via NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_TTL_MS / NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_MAX_ENTRIES (src/server.ts, src/knowledge.api.contract.test.ts),
M8.4 operator productization baseline is now delivered: turn-cache diagnostics are wired into the agent workspace execution contract as inspect_conversation_turn_cache_diagnostics -> fetch_conversation_turn_cache_diagnostics -> conversation_turn_cache_diagnostics_card, including bilingual card rendering and language-switch re-render coverage (src/learning/types.ts, src/learning/KnowledgeLearningPlatform.ts, src/frontend/agent_workspace.js, src/frontend/workspace_panes.js, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.frontend.test.ts, src/agent_workspace.contract.parity.test.ts, src/knowledge.api.contract.test.ts, src/learning/KnowledgeLearningPlatform.test.ts),
M8.5 thresholded-governance baseline is now delivered: turn-cache diagnostics now include env-driven alert thresholds and policy checks (utilization_pct, execution_failure_ratio_pct, conflict_count, stale_eligible_entries) with summarized severity state (summaryStatus) and fail/warn counters; the workspace diagnostics card now renders alert summary/top-check/threshold-profile metrics with bilingual labels and re-render coverage (src/server.ts, src/frontend/agent_workspace.js, src/frontend/workspace_panes.js, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.frontend.test.ts, src/knowledge.api.contract.test.ts),
M8.6 alert-trend governance baseline is now delivered: GET /api/knowledge/conversation/turn-cache/diagnostics/trend now returns bounded alert-history snapshots, trend status (insufficient_data / stable / improving / regressing), escalation state (normal / watch / high / critical), and active-streak context; sampling and policy behavior are env-tunable via NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_HISTORY_LIMIT, NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_SAMPLE_MIN_INTERVAL_MS, NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_TREND_WINDOW_SIZE, NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_TREND_MIN_SAMPLES, NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_ESCALATION_WARN_STREAK, and NOTE_CONNECTION_AGENT_CONVERSATION_TURN_CACHE_ALERT_ESCALATION_FAIL_STREAK (src/server.ts, src/knowledge.api.contract.test.ts),
M8.6 operator productization follow-through is now delivered in the workspace contract path: inspect_conversation_turn_cache_alert_trend -> fetch_conversation_turn_cache_alert_trend -> conversation_turn_cache_alert_trend_card, including bilingual card rendering and language-switch re-render coverage (src/learning/types.ts, src/learning/KnowledgeLearningPlatform.ts, src/frontend/agent_workspace.js, src/frontend/workspace_panes.js, src/frontend/locales/en.json, src/frontend/locales/zh.json, src/agent_workspace.frontend.test.ts, src/agent_workspace.contract.parity.test.ts, src/learning/KnowledgeLearningPlatform.test.ts),
M8.7 durability + runbook-gate linkage baseline is now delivered: turn-cache alert trend history is persisted across restarts at runtime_data/agent_conversation_turn_cache_alert_history.v1.json with bounded compaction and async persist queueing, index/export endpoints are available at GET /api/knowledge/conversation/turn-cache/diagnostics/trend/index and GET /api/knowledge/conversation/turn-cache/diagnostics/trend/export, and escalation is now bridged into runtime runbook through synthetic check conversation_turn_cache_alert_trend with remediation actions inspect_conversation_turn_cache_alert_trend_index, stabilize_conversation_turn_cache_alert_pressure, and verify_conversation_turn_cache_alert_trend_recovery (src/server.ts, src/knowledge.api.contract.test.ts, src/notemd.server.integration.test.ts),
M8.8 operator drilldown + schedule guardrail baseline is now delivered: agent workspace now exposes explicit trend index/export operator actions (inspect_conversation_turn_cache_alert_trend_index / inspect_conversation_turn_cache_alert_trend_export) and corresponding operation wiring (fetch_conversation_turn_cache_alert_trend_index / fetch_conversation_turn_cache_alert_trend_export), trend/action-queue cards now surface storage/index/export/endpoint-hint drilldown context, and replay schedule config now applies cross-field guardrails (maxReplayChecksPerWindow >= replayLimit) with explicit telemetry reasons (config_guardrail_applied, schedule_config_guardrail:*) visible through server snapshot + workbench status text (src/learning/types.ts, src/learning/KnowledgeLearningPlatform.ts, src/frontend/agent_workspace.js, src/frontend/workspace_panes.js, src/frontend/path_app.js, src/server.ts, src/agent_workspace.frontend.test.ts, src/agent_workspace.contract.parity.test.ts, src/knowledge.api.contract.test.ts, src/learning/KnowledgeLearningPlatform.test.ts, src/notemd.server.integration.test.ts),
M8.9 replay-schedule proactive recommendation + policy-template baseline is now delivered: replay-schedule snapshots now include structured recommendation payloads (telemetry.recommendations) and policy-template candidates (telemetry.policyTemplates) for guardrail/budget/trigger/cooldown/skip-streak scenarios, schedule config update accepts policyTemplate as a first-class payload input, and workbench replay-schedule refresh/update/tick status text now surfaces both top recommendation and top policy template for operator action (src/server.ts, src/frontend/path_app.js, src/notemd.server.integration.test.ts, src/path_app.runtime_trace_filter.behavior.test.ts),
M9 replay-schedule safe auto-execution baseline is now delivered: schedule config now supports explicit autoExecution policy (enabled, mode, requireDryRunParity, minConsecutiveSkips), snapshot telemetry now reports structured gate diagnostics (eligible, blockedReasons[], decision, lastAttemptedAt, lastExecutedAt), and schedule tick now enforces gate-first execution semantics (auto_execution_blocked, auto_execution_dry_run_required, auto_execution_executed) on top of existing trigger/cooldown/budget guards (src/server.ts, src/notemd.server.integration.test.ts, src/knowledge.api.contract.test.ts),
M9.1 workbench operator explainability is now delivered: replay-schedule refresh/update/tick status text and remediation history formatting now include autoExecution(...) diagnostics, and config update flow now forwards autoExecution fields from UI preferences to backend payload (src/frontend/path_app.js, src/path_app.runtime_trace_filter.behavior.test.ts),
M10 foundation hardening bootstrap is now delivered as rollout controls: graphdb storage now supports provider-based adapter selection plus fallback policy (NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_ADAPTER_PROVIDER, NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_ADAPTER_ID, NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_FALLBACK_ENABLED) and local-vector acceleration now supports explicit failure semantics plus representation strictness (NOTE_CONNECTION_QUERY_VECTOR_ACCELERATION_FAILURE_MODE=fail_open|fail_closed, NOTE_CONNECTION_QUERY_VECTOR_ACCELERATION_REPRESENTATION_STRICT=true|false) with trace/runtime diagnostics propagation (src/learning/store.ts, src/learning/queryBackend.ts, src/learning/KnowledgeLearningPlatform.ts, src/server.ts, src/learning/store.test.ts, src/learning/queryBackend.test.ts, src/knowledge.api.contract.test.ts),
local-vector acceleration strictness now also treats external_http endpoint misconfiguration as a first-class adapter failure (external_http_endpoint_missing) so fail_closed rollout no longer silently downgrades to full-scan; strict paths now surface vector_acceleration_adapter_failure:* in trace/diagnostics (src/learning/vectorAccelerationAdapter.ts, src/notemd.server.rollout-boundary.integration.test.ts),
query-backend diagnostics/config endpoints now echo vector-acceleration rollout context (configuredVectorAccelerationProvider, configuredVectorAccelerationFailureMode, configuredVectorAccelerationRepresentationStrict, queryVectorAnnPrefilterEnabled, rolloutProfile) so workbench/runtime operators can reason about strictness without cross-calling /api/knowledge/state (src/server.ts, src/notemd.server.integration.test.ts, src/notemd.server.rollout-boundary.integration.test.ts),
M10.2 graphdb adapter baseline now includes an external_http provider path (NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_HTTP_ENDPOINT, NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_HTTP_TIMEOUT_MS, NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_HTTP_MAX_RETRIES, NOTE_CONNECTION_KNOWLEDGE_GRAPHDB_HTTP_RETRY_DELAY_MS) with connector diagnostics and strict fail-closed behavior when endpoint configuration is invalid (graphdb_http_endpoint_missing) (src/learning/store.ts, src/server.ts, src/learning/store.test.ts, src/notemd.server.rollout-boundary.integration.test.ts),
M10.3 graphdb external_http connector telemetry is now promoted to first-class runtime governance: store diagnostics now include structured connector health/circuit/request telemetry (healthStatus, circuitState, requestCount, retryCount, shortCircuitCount, lastRequestId, lastErrorCode, lastStatusCode, lastRetryAfterMs), runtime capability matrix now adds store_graphdb_connector_health with runbook/debug-trace wiring, and strict rollout integration/store tests now validate the healthy path plus circuit-open degradation semantics (src/learning/store.ts, src/learning/runtimeCapability.ts, src/learning/store.test.ts, src/learning/runtimeCapability.test.ts, src/notemd.server.rollout-boundary.integration.test.ts),
M10 rollout-boundary integration coverage is now extended with isolated server bootstrap tests for strict-mode behavior: vector acceleration fail_closed now verifies both adapter-failure surfacing and healthy external_http success-path telemetry (no backend fallback, healthStatus=ready, request-correlation propagation), graphdb provider=none + fallback=false now verifies fail-closed store API semantics, and graphdb provider=external_http + fallback=false now verifies the success path across /api/knowledge/store/reload + /api/knowledge/store-diagnostics (including rollout-context fields configuredGraphDbAdapterProvider / configuredGraphDbAdapterId / graphDbFallbackEnabled) (src/notemd.server.rollout-boundary.integration.test.ts, src/server.ts),
M10 rollout profile operator visibility is now wired end-to-end: runtime payload now exposes rolloutProfile (store/vector strictness + aggregate mode), runtime-capability-matrix plus runbook/verify/history/history-checks/action-queue/remediation-history/replay-schedule endpoints (including remediation POST flows: event/replay/schedule/tick) now echo the same profile, and learning-workbench runtime summary now surfaces rollout=<mode>(...) cue; integration/contract/frontend behavior coverage is in place (src/server.ts, src/notemd.server.integration.test.ts, src/knowledge.api.contract.test.ts, src/frontend/path_app.js, src/path_app.runtime_trace_filter.behavior.test.ts),
the legacy global Path Mode entry now clears the docked pane first so full-path entry remains deterministic (src/frontend/app.js).

Latest Validation Snapshot (2026-05-18)

Reconfirmed on the current Windows host in this turn: node node_modules/jest/bin/jest.js src/learning/runtimeCapability.test.ts src/knowledge.api.contract.test.ts --runInBand --no-cache, node node_modules/jest/bin/jest.js src/agent_workspace.frontend.test.ts --runInBand --no-cache, npm run test:agent-workspace:contracts, npm run build:with-vite, npm run docs:diataxis:check, npm run docs:site:build, NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_STRICT=1 NOTE_CONNECTION_AGENT_WORKSPACE_BROWSER_UI_DYNAMIC_STRICT=1 node scripts/verify-agent-workspace-browser.js.
Reconfirmed on the current Windows host in this turn: npm run build:sidecar, npm run verify:foundation:sqlite-runtime, npm run verify:foundation:sqlite-runtime:matrix, npm run verify:foundation:ann-runtime, npm run verify:foundation:ann-runtime:matrix.
The strict browser proof now explicitly verifies the bilingual runtime-runbook verify/checks ANN governance labels that were added in this slice: sync-health plus circuit, traceability, and prefilter summaries, along with the threshold/signal drilldowns and calibration-readiness cues that support budget-tuning work.
The embedded sqlite graph baseline now also has repeatable host-level runtime proofs outside Jest integration scope: the lighter verifier keeps dist runtime and packaged sidecar ingest -> diagnostics/readiness -> restart -> query continuity green, and the workload-matrix verifier proves the same two runtime paths across smoke / medium / heavy corpus sizes with snapshot metadata counts and multi-point restart queries.
The external_http ANN connector baseline now also has repeatable host-level runtime proofs outside Jest integration scope: the lighter verifier keeps dist runtime and packaged sidecar ingest -> query-backend diagnostics -> restart -> query continuity green, and the workload-matrix verifier proves the same two runtime paths across smoke / medium / heavy corpus sizes with sync/select telemetry and aligned representation metadata.
Tauri strict evidence is implementation-closed but still host-dependent:
- the current Windows host proves non-strict tauri/runtime behavior and load-flow parity,
- Linux strict evidence commands (verify:agent-workspace:tauri:rust:strict, verify:agent-workspace:tauri:window-evidence:strict, strict evidence index/manifest) still require provisioned webkit2gtk-4.1, javascriptcoregtk-4.1, and libsoup-3.0.
Sidecar/bootstrap readiness on the current Windows host is now offline-ready; remaining bootstrap work is policy hardening before strict no-LFS mode, not an immediate host-readiness blocker.
Practical Phase boundary:
- interaction/runtime verification infrastructure is strong enough to continue selective Phase-3 hardening in parallel,
but neither Phase-1 backend closure nor Phase-2 governance closure is actually done at HEAD,
remaining work is not only ops/release prerequisites; it still includes unresolved real graphdb/ANN delivery, stronger release-grade calibration for the new diagnostics, and continued architecture reduction.

Operational note:

the live server serves frontend assets from dist/src/frontend, so src/frontend/* changes do not reach runtime verification until a fresh npm run build copies them into the dist tree.
there is now a dedicated smoke command, npm run verify:agent-workspace:runtime, which copies the current frontend into a temporary runtime tree, boots a real sidecar/server, and verifies that the served root HTML and locale payload expose the agent workspace shell.
there is also a browser-driven smoke command, npm run verify:agent-workspace:browser, which seeds a minimal knowledge document through the real ingest API, writes a minimal data.js seed so the page boots real graph/path runtimes, opens the served shell in a real Chromium session, drives the agent-workspace conversation/action flow against the real conversation/path/query-compare/quality/session/runbook backend slice, verifies localized action/message re-rendering (including runbook checks/action-queue cards), checks graph-focus promotion enter/exit state, and emits screenshot/console/network-summary evidence paths for failure diagnosis.
there is now a Rust-targeted tauri contract command, npm run verify:agent-workspace:tauri:rust, which executes the pathmode_window_toggle_plan/pathmode_window_toggled_event_payload cargo tests when required system libs exist; local non-strict mode reports SKIP if webkit2gtk-4.1, javascriptcoregtk-4.1, or libsoup-3.0 are unavailable, while CI/strict mode fails hard.
there is now a real app/window evidence command, npm run verify:agent-workspace:tauri:window-evidence, which attempts dedicated Rust window-lifecycle evidence tests and writes structured reports/logs; if host dependencies are missing it reports degraded (non-strict) with explicit reasons, while strict mode hard-fails.
there is now a strict evidence index command, npm run verify:agent-workspace:tauri:evidence:index, which builds a normalized latest-report index across rust/window/smoke artifacts, validates the generated report against schemas/agent-workspace-tauri-evidence-index.schema.json, and in strict mode fails when required evidence is missing or not passed.
there is now a tauri evidence summary renderer, npm run verify:agent-workspace:tauri:evidence:summary, which writes output/tauri/agent-workspace-evidence-index/evidence-summary-latest.md for operator review and can be published into GITHUB_STEP_SUMMARY in CI workflows.
there is now a tauri evidence release-fragment renderer, npm run verify:agent-workspace:tauri:evidence:release-fragment, which emits output/tauri/agent-workspace-evidence-index/release-fragment-latest.md and can be appended to GITHUB_STEP_SUMMARY for release-gate audit context.
there is now a tauri evidence manifest renderer, npm run verify:agent-workspace:tauri:evidence:manifest, which emits output/tauri/agent-workspace-evidence-index/evidence-manifest-latest.json, validates the payload against schemas/agent-workspace-tauri-evidence-manifest.schema.json, and includes strict-validation diagnostics over required evidence artifacts.
there is now a release-note publisher, npm run verify:agent-workspace:tauri:evidence:publish-release-notes -- --tag <release_tag>, which upserts the latest tauri evidence fragment into the target GitHub Release body via stable begin/end markers.

Phase Snapshot (Revalidated 2026-05-12)

Phase	Plan Target	Current Status	Evidence
Phase 1	Knowledge parsing + graph backbone + staleness governance	Operational baseline	`src/learning/store.ts`, `src/learning/queryBackend.ts`, `src/learning/vectorAccelerationAdapter.ts`, `src/server.ts`
Phase 2	Mastery loop + divergence engine	Partial	`src/learning/KnowledgeLearningPlatform.ts`, `src/frontend/path_app.js`
Phase 3	Pluggable tutor + memory operating layer	Early operational	`src/learning/KnowledgeLearningPlatform.ts`, `src/learning/tutorAdapter.ts`, `src/server.ts`, `src/routes/knowledge.ts`

Layer-by-Layer Implementation Matrix

Layer	Goal	Implemented Baseline	Remaining Work
L0 Representation	Parse document content into atom/evidence units	Atom, evidence, source hash and staleness rebuild are implemented (`ingestKnowledge`, staleness APIs)	Add richer formula/code normalization and stronger parser telemetry granularity
L1 Structure	Build relation + temporal graph for learning reasoning	`RelationEdge` with `provenance`, `TemporalEdge` with active validity window are implemented	Improve relation quality scoring and cross-document conflict handling
L2 Retrieval	Evidence-first explainable retrieval	`local_hybrid`, `keyword_only`, and `local_vector` retrieval backends exist with ANN-style prefilter, fallback telemetry, and a live sync-backed `external_http` acceleration path; the default graph store baseline is now embedded `graphdb/sqlite` with restart-durability proof	Keep the remaining backend gap honest: packaged/runtime + heavier-workload hardening are still open for graphdb, and ANN still needs benchmarked rollout thresholds plus larger-workload validation
L3 Learning	Mastery diagnostics + actionable path generation	Mastery diagnostics, misconception summaries, dual-path recommendation, session execution primitives, and live quality/session-plan trend surfaces are implemented	Calibrate the now-live `learning quality` / `session plan quality` history-trend-evaluation surfaces on top of a release-grade graphdb/ANN baseline before claiming Phase-2 gate closure
L4 Interaction	Workbench for operations + tutoring + diagnostics	The agent workspace shell, focus/path panes, typed capability contract, runtime/browser smoke, and turn-cache operator surfaces are real	Keep the shell/runtime evidence healthy, but stop treating observability cards as release-closed while they still depend on an operational rather than release-grade graph/ANN baseline
L5 Governance	Runtime checks, trend gates, remediation loop	Runtime capability matrix, connector/circuit telemetry, ANN index-sync telemetry, remediation plumbing, and operator-facing verify/checks drilldowns for ANN sync/circuit/traceability/prefilter are real	Tie governance upgrades to non-empty live thresholds plus adapter-backed telemetry on top of a release-grade graph/ANN baseline

Architecture Refactoring Status (2026-05-05, FINAL)

A 12-phase refactoring (A→L) was executed against the baseline. The following modules are now delivered:

New Module Inventory

Module	Files	Purpose
`src/routes/`	10	Modular API route handlers (65 routes across knowledge, notemd, markdown, render, data, diagnostics)
`src/middleware/`	5	HTTP middleware (cors, auth, body-parser, request-trace)
`src/learning/domains/`	8	Domain classes extracted from KnowledgeLearningPlatform (7 classes + 7 Platform interfaces)
`src/frontend/*.mjs`	4	ES module versions of i18n, runtime_bridge, main entry, worker bridge
`src/utils/platform.ts`	1	Cross-platform detection (Linux XDG / macOS Library / Windows LOCALAPPDATA)
`src-tauri/tauri.{linux,macos,windows}.conf.json`	3	Platform-specific Tauri configs
`vite.config.ts`	1	Vite 5-entry multi-page build (4 chunks: main/graph-app/agent-workspace/path-mode)
`docs/solutions/`	2	Cross-platform refinement plan + implementation gap analysis
`docs/archive/`	3	Archived TODO.md files (448KB)
`docs/zh/analysis_ref.md`	1	Chinese translation of reference analysis (updated for Tauri v2)
`docs/release_notes_v1.6.6.md`	1	Canonical quality bar release notes (EN+ZH)

Refactoring Metrics

Metric	Before	After
Route modules	0 (inline if/else chain)	10 modules, 65 routes
Middleware modules	0 (inline functions)	5 independent modules
Domain classes	1 (13,370-line monolith)	7 classes with typed Platform interfaces
Frontend module system	`<script>` tag chain	ES modules + Vite 4-chunk
Platform configs	1 generic + 1 Android	5 configs (Linux/macOS/Windows/Android + generic)
Platform path logic	1-line `win32` check	`platform.ts` with proper XDG/Library/LOCALAPPDATA
Godot renderer	GL Compatibility	Forward+ (Vulkan) with Wayland fallback
Mobile build paths	2 (Capacitor + Tauri Android)	1 (Tauri Android, Capacitor deprecated)
TODO files	448KB in docs tree	Archived to `docs/archive/`
Bilingual doc pairs	21	24
CI jobs (migration-gates)	16	18
Route contract tests	0	10/10 passing
Runtime observability	No route migration metrics	registryHitRate + migrationProgress + 7 domain panels
Route migration coverage	0%	91.3% (73 modular + 7 terminal inline)
Inline chain complexity	Monolithic if/else chain	Clearly sectioned + [REGISTRY_COVERED] annotations
Domain class method bodies	0 (all in monolith)	7/7 complete (validate → delegate → augment → diagnostics)
Vite build time	N/A	437ms
Path-mode chunk size	N/A	93KB (from 430KB in legacy bundle)
tsc errors	255 (pre-existing M8-M10)	26 (-90%)
Domain method body migration depth	Delegation only	4-domain-method deep (Ingestor: staleness+guardrails, Querier: validation+cache, Mastery: path validation)

Domain Class Implementation Status

Domain Class	Platform Interface	Own Logic	Production Use
`KnowledgeIngestor`	`IngestPlatform`	4 domain gates, staleness analysis (freshnessScore/freshnessRating/staleBySource), staleness trend (100-snapshot history, getFreshnessTrend), latency tracking, guardrail pass rate, 10 diagnostics	✅ `POST /api/knowledge/ingest`
`KnowledgeQuerier`	`QueryPlatform`	Query validation (empty/length/max), _domain telemetry, cache (TTL+pruning), latency P95, 10 diagnostics	✅ `POST /api/knowledge/query`
`ConversationManager`	`ConversationPlatform`	Query+memory validation, turn count, response latency, memory ops, 6 diagnostics	✅ (instantiated)
`MasteryEngine`	`MasteryPlatform`	Path validation, _domain augmentation (pathLength/duration), session metrics, 6 diagnostics	✅ (instantiated)
`QualityEvaluator`	`QualityPlatform`	User validation, pass rate tracking (200-window), snapshot metrics, 5 diagnostics	✅ (instantiated)
`TutorRouter`	`TutorPlatform`	UserID+actionKind validation, action distribution, execution metadata, 4 diagnostics	✅ (instantiated)
`MemoryPolicyManager`	`MemoryPlatform`	UserID+layer validation, policy layer distribution, _domain augmentation, 5 diagnostics	✅ (instantiated)

Core API and Runtime Baseline

Contract layer

API interfaces: src/learning/api.ts
Core types: src/learning/types.ts
Public export boundary: src/learning/index.ts
Contract coverage: src/knowledge.api.contract.test.ts

Server layer

/api/knowledge/* routes are normalized and alias-compatible in src/server.ts.
Runtime diagnostics endpoint: GET /api/runtime-request-trace.
Runbook endpoints:
- GET /api/knowledge/runtime-capability-runbook
- GET /api/knowledge/runtime-capability-runbook/verify
- GET /api/knowledge/runtime-capability-runbook/history*
- POST /api/knowledge/runtime-capability-runbook/remediation-event
- POST /api/knowledge/runtime-capability-runbook/remediation-event/replay

State and storage layer

Store backends in src/learning/store.ts:
- file
- memory
- graphdb (now supports embedded SQLite, file, and HTTP adapter paths)
Current structural limit: the new default graphdb path is embedded SQLite with explicit fallback, but it still needs packaged/runtime proof and workload hardening before being called production-closed.

Retrieval layer

Query backend implementations in src/learning/queryBackend.ts:
- local_hybrid: keyword + semantic token similarity + relation degree + temporal filtering trace
- keyword_only: keyword-dominant retrieval with temporal filtering
- local_vector: TF-IDF-like local vector similarity + semantic overlap + graph relation bonus, with durable local index snapshot (knowledge_query_vector_index.v1.json), ingest-triggered invalidation, lazy refresh by atom signature, ANN-style token/signature prefilter (ann_prefilter) with automatic full-scan fallback, and a pluggable acceleration adapter boundary (local default + external_stub + external_http)
Known structural limits:
- current external adapter implementations are scaffolds for integration hardening (including external_http timeout/retry/circuit-breaker baseline), not production ANN engine connectors,
- larger-corpus workloads still need dedicated external ANN backends plus benchmarked threshold tuning.
Governance coverage:
- runtime checks now distinguish backend availability, vector index readiness, and persistence mode.
- vector acceleration circuit governance now evaluates warn/fail budgets on short-circuit count/ratio, consecutive failures, and half-open probe success rate.
- ANN prefilter effectiveness governance now tracks lastSelectionMode + lastCandidateCount and fails when ann_prefilter remains on persistent full_scan fallback under stable connector traffic.
- ANN prefilter effectiveness thresholds are now tunable via runtime env controls (minimum sample size + warn/fail candidate-ratio budgets) for corpus-specific rollout calibration.
- runbook action queue now adds a dedicated prefilter-risk tie-break inside the same priority band so query_vector_acceleration_prefilter_effectiveness remediation actions surface earlier when the check is warn|fail.
- remediation-event replay automation is now available via POST /api/knowledge/runtime-capability-runbook/remediation-event/replay to replay risk checks from remediation history into a fresh verify cycle.
- server integration now validates external_http circuit-open propagation across /api/knowledge/query-backend-diagnostics, /api/knowledge/runtime-capability-matrix, and runbook verify endpoints.

Workbench layer

Frontend orchestration and diagnostics entry: src/frontend/path_app.js.
Key observability integration:
- runtime runbook dashboards
- request trace filtering
- query backend diagnostics/config
- remediation replay controls in workbench (risk_only|all, replay limit 1-24) with local preference persistence
- remediation replay schedule orchestration controls in workbench (enable/interval/trigger policy + thresholds) with API-backed snapshot + manual tick
- vector acceleration governance visibility in runtime summary (queryVectorAcceleration(...) now surfaces live counters, circuit warn/fail threshold tuples, and embedded prefilter budget snapshot driven by matrix signals)
- runbook history/checks now exposes structured ANN prefilter effectiveness snapshots (queryVectorAccelerationPrefilter) at both summary and per-check levels for threshold-aware incident triage
- dedicated vector acceleration governance drilldown panel in the runbook section (structured status/threshold/flag/action view for query_vector_acceleration_circuit_state, remote ANN sync-health view for query_vector_acceleration_index_sync_health, prefilter selection/candidate telemetry + threshold budget/action view for query_vector_acceleration_prefilter_effectiveness, plus traceability coverage/action view for query_vector_acceleration_traceability), and the server-side runbook history/action-queue layer now treats the sync-health gate as a first-class incident object
- path strategy telemetry and session history analytics

Practice Runbook (Engineering Workflow)

1) Contract-first verification

npm test -- src/knowledge.api.contract.test.ts --runInBand

2) Docs governance and page stability

npm run docs:diataxis:check
npm run docs:site:build
npm run docs:site:serve

3) Runtime route and diagnostics sanity checks

Validate runbook reads:
- GET /api/knowledge/runtime-capability-runbook
- GET /api/knowledge/runtime-capability-runbook/verify?limit=20
Validate trace correlations:
- GET /api/runtime-request-trace
- GET /api/runtime-request-trace?requestId=<exact_request_id>
Validate turn-cache operator diagnostics and trend governance:
- GET /api/knowledge/conversation/turn-cache/diagnostics
- GET /api/knowledge/conversation/turn-cache/diagnostics/trend?limit=20&windowSize=6&minSamples=3
- GET /api/knowledge/conversation/turn-cache/diagnostics/trend/index?limit=20
- GET /api/knowledge/conversation/turn-cache/diagnostics/trend/export?limit=50
Validate copied-frontend runtime shell:

npm run verify:agent-workspace:runtime

Validate browser-rendered shell and interaction loop:

npm run verify:agent-workspace:browser

Validate desktop lifecycle proxy smoke (runtime + tauri config + source lifecycle contracts + promotion lifecycle test):

npm run verify:agent-workspace:tauri

Validate Rust-side tauri lifecycle contracts (pathmode_window_toggle_plan*, strict in CI):

npm run verify:agent-workspace:tauri:rust
npm run verify:agent-workspace:tauri:rust:strict

Validate app/window lifecycle evidence path (degraded-friendly local mode + strict mode):

npm run verify:agent-workspace:tauri:window-evidence
npm run verify:agent-workspace:tauri:window-evidence:strict

Build strict evidence index (local + CI strict mode):

npm run verify:agent-workspace:tauri:evidence:index
npm run verify:agent-workspace:tauri:evidence:index:strict
npm run verify:agent-workspace:tauri:evidence:summary
npm run verify:agent-workspace:tauri:evidence:release-fragment
npm run verify:agent-workspace:tauri:evidence:manifest
npm run verify:agent-workspace:tauri:evidence:manifest:strict
npm run verify:agent-workspace:tauri:evidence:publish-release-notes -- --tag <release_tag>

4) Query strategy sanity checks

Compare backends with same query and inspect explainability gaps:
- POST /api/knowledge/query/compare-backends
Review recent comparison history:
- GET /api/knowledge/query/compare-backends/history?limit=8
Review trend window:
- GET /api/knowledge/query/compare-backends/trend

5) Session strategy quality checks

Evaluate strategy-outcome consistency:
- GET /api/knowledge/session/history?pathStrategySelectionSource=strategy_trend&sinceMinutes=10080
- GET /api/knowledge/quality/trend
- GET /api/knowledge/session/plan/quality/trend

Next Increments (Priority Order)

Close CI coverage gaps for the agent-workspace contract suites: keep tauri strict evidence jobs, and add always-on parity/frontend contract gates (src/agent_workspace.contract.parity.test.ts, src/agent_workspace.frontend.test.ts, src/agent_workspace.tauri.contract.test.ts) as first-class CI blockers.
Carry the embedded graphdb/sqlite baseline from restart-durability proof to packaged/runtime + heavier-workload closure while preserving explicit fail-open/fail-closed rollout semantics; use verify:foundation:release-evidence after release report generation to confirm host evidence is still fresh.
Finish ANN release-grade closure by keeping the new sync-backed external_http path healthy under real traffic, then tightening workload/threshold calibration.
Move next into Phase-2 gate promotion only after the same checks run on a release-grade graphdb/ANN baseline.
Add CI-integrated durability checks for persisted turn-cache trend index/export consistency across restart flows.
Continue strict evidence artifact governance improvements (retention/indexability/export) for operator audits, and keep i18n expansion as a lower-priority stream unless it directly unblocks contract/foundation risk.

Cross-Platform Architecture Refinement (2026-05-02)

A comprehensive cross-platform compatibility audit and architecture health assessment was completed. Key findings and the unified remediation plan are documented in:

Cross-Platform Architecture Refinement Plan

The plan identifies 6 blocking-level cross-platform issues (Linux asset://localhost 403, missing Windows sidecar binaries, macOS arm64 signing requirement, Wayland + Godot GL crash, WebKitGTK dependency gaps, CI matrix gaps) and 3 monolithic code files (server.ts 16,848 lines, path_app.js 15,140 lines, KnowledgeLearningPlatform.ts 13,370 lines) that together constitute the highest-priority technical debt. The remediation is organized into three phases aligned with the M10 foundation hardening stream.

Key insight: Code monoliths and platform fragility are causally linked — splitting server.ts enables platform-specific route handling without touching 16,848-line files; extracting platform.ts eliminates the single process.platform === 'win32' pattern that currently handles all Unix platforms identically.

FilesExpand file tree

development-progress-dashboard.md

Latest commit

History

development-progress-dashboard.md

File metadata and controls

Explanation: Development Progress Dashboard

2026-06-10 Knowledge Workspace Durable Artifact and DAG Alignment

2026-06-06 Knowledge Workspace Scope Switcher and Evidence-Focused Hit UI

2026-06-07 Graph-Focus Source Rendering and AhaDiff Comparison

AhaDiff comparison and implications

2026-06-06 Active-Scope Miss Recovery and Document-Augmented RAG Patch

2026-06-06 Knowledge Workspace RAG Answering and API Observability Slice

2026-06-06 Mainline Architecture Progress and Code-vs-Plan Reconciliation

2026-05-27 Workflow Gate Realignment and Progress Truth Sync

2026-05-27 Tauri-First Conversation Rendering Reality Sync

2026-05-27 Phase-1 SQLite Soak Gate Baseline

2026-05-26 Program F Closure

2026-05-10 Open-Goal Sync

2026-05-12 HEAD Reality Recalibration

Scope

2026-05-12 Architecture Metrics

Current Delivery Focus

Latest Validation Snapshot (2026-05-18)

Phase Snapshot (Revalidated 2026-05-12)

Layer-by-Layer Implementation Matrix

Architecture Refactoring Status (2026-05-05, FINAL)

New Module Inventory

Refactoring Metrics

Domain Class Implementation Status

Core API and Runtime Baseline

Contract layer

Server layer

State and storage layer

Retrieval layer

Workbench layer

Practice Runbook (Engineering Workflow)

1) Contract-first verification

2) Docs governance and page stability

3) Runtime route and diagnostics sanity checks

4) Query strategy sanity checks

5) Session strategy quality checks

Next Increments (Priority Order)

Cross-Platform Architecture Refinement (2026-05-02)

Related Pages