Integrate LLM Wiki as a persistent knowledge layer for Nous campaigns #110

susiejojo · 2026-05-20T16:03:38Z

Proposal

Connect Nous campaign outputs to LLM Wiki so that results from multiple campaigns accumulate into a searchable, interlinked knowledge graph — rather than sitting as isolated JSON files under .nous/.

The Problem

Each Nous campaign produces rich structured artifacts (principles.json, ledger.json, report.md), but:

- They live in separate .nous/<run_id>/ directories with no cross-campaign visibility.
- Principles from campaign A cannot be discovered when designing campaign B (unless you manually read them).
- There is no way to query "what do we know about saturation detection across all experiments?"

What LLM Wiki Provides

LLM Wiki is a local-first desktop app that ingests documents and builds an interlinked wiki with:

- Knowledge graph with automatic cross-linking via [[wikilinks]]
- Semantic search across all ingested content
- Community detection (clusters related concepts automatically)
- Chat interface: ask questions against the accumulated knowledge

It stores everything as plain Markdown (Obsidian-compatible) and runs entirely locally.

Proposed Integration

Architecture

Nous campaign completes
       |
       v
export_campaign() renders principles/ledger/report to markdown
       |
       v
Writes to: <wiki_project>/raw/sources/nous/<campaign-name>/
       |
       v
LLM Wiki ingests -> generates wiki pages -> builds knowledge graph

Multiple campaigns feed one wiki project. LLM Wiki cross-links them.

What Gets Exported

| Nous Artifact | Wiki File | Why |
|---|---|---|
| `report.md` | Copied verbatim | Already narrative markdown, highest ingest quality |
| `principles.json` | Rendered to `principles.md` | Each principle with regime, mechanism, confidence, evidence |
| `ledger.json` | Rendered to `ledger.md` | Iteration progression table + per-iter detail |

Filter: only export campaigns where experiment_valid: true on at least one iteration.
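
A minimal sketch of the filter and the principles rendering, to make the table concrete. The field names regime, mechanism, confidence, and evidence come from the table above; the `iterations` key in the ledger and the `statement` key on each principle are assumptions about the JSON layout, not confirmed schema.

```python
def has_valid_iteration(ledger: dict) -> bool:
    """True if at least one iteration is marked experiment_valid."""
    # Assumption: ledger.json keeps its iterations under an "iterations" key.
    return any(it.get("experiment_valid") is True
               for it in ledger.get("iterations", []))


def render_principles(principles: list[dict]) -> str:
    """Render a list of principle records as a markdown document."""
    lines = ["# Principles", ""]
    for i, p in enumerate(principles, start=1):
        # "statement" is a hypothetical field name for the principle text.
        lines.append(f"## {i}. {p.get('statement', 'Untitled principle')}")
        lines.append("")
        for field in ("regime", "mechanism", "confidence", "evidence"):
            if field in p:
                lines.append(f"- **{field.capitalize()}**: {p[field]}")
        lines.append("")
    return "\n".join(lines)
```

Fields that are missing from a record are skipped rather than rendered as empty bullets, so partially filled principles still produce clean markdown.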

Where to Hook It

In run_campaign.py, right after _generate_report():

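from pathlib import Path

# "or {}" guards against a bare "wiki:" key, which YAML loads as None
wiki_path = (defaults.get("wiki") or {}).get("project_path")
if wiki_path:
    from nous2wiki import export_campaign
    # expanduser() resolves the "~" used in defaults.yaml; YAML does not expand it
    export_campaign(work_dir, Path(wiki_path).expanduser())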

Opt-in via defaults.yaml:

wiki:
  project_path: ~/.nous/Nous-research

The Export Script (`nous2wiki.py`)

~100 lines of Python. Renders principles.json and ledger.json to readable markdown, copies report.md verbatim, writes to the wiki raw/sources directory. No dependencies beyond stdlib + PyYAML.

Supports single campaign or batch export:

python nous2wiki.py .nous/epp-saturation-detector --wiki-dir ~/.nous/Nous-research
python nous2wiki.py .nous/ --wiki-dir ~/.nous/Nous-research --all
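
For reference, the command line above could be parsed roughly like this. The `--wiki-dir` and `--all` flags are from the examples; the helper names and the rule that "a campaign directory is any subdirectory containing ledger.json" are assumptions for illustration.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="nous2wiki.py")
    parser.add_argument("source", type=Path,
                        help="a campaign directory, or .nous/ with --all")
    parser.add_argument("--wiki-dir", type=Path, required=True,
                        help="root of the LLM Wiki project")
    parser.add_argument("--all", action="store_true",
                        help="export every campaign found under source")
    return parser


def campaigns_to_export(source: Path, export_all: bool) -> list[Path]:
    """Resolve the positional argument to a list of campaign directories."""
    if not export_all:
        return [source]
    # Assumption: a campaign directory is any subdirectory with a ledger.json.
    return sorted(d for d in source.iterdir()
                  if d.is_dir() and (d / "ledger.json").is_file())
```

The shell expands `~` in `--wiki-dir` before Python sees it, so no `expanduser()` call is needed on this path.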

Setup (Local, quick)

1. Install LLM Wiki: download the pre-built .dmg (macOS) or .AppImage (Linux) from releases. No build step needed.
2. Configure the LLM provider. In LLM Wiki Settings, set it as OpenAI-compatible and point it at your LiteLLM proxy:
   - API Base URL: your LiteLLM proxy endpoint
   - API Key: your proxy key
   - Model: e.g. claude-sonnet-4-6 (whatever your proxy exposes)

   LLM Wiki uses the standard /v1/chat/completions format, so any LiteLLM proxy works directly.
3. Create a project in LLM Wiki (e.g., "Nous Research"), then set that path in defaults.yaml.
4. Run campaigns as normal: results auto-export to the wiki on completion.

Value

- Cross-campaign discovery: "What principles about flow control have we established?" pulls from all campaigns
- Knowledge compounding: LLM Wiki graph shows how principles relate across experiments run weeks apart
- Searchable history: Instead of grepping JSON files, ask natural language questions
- Shared team knowledge: The wiki project is just a folder, so it can be synced via git or cloud storage for team access
- Zero overhead: Opt-in via one line in defaults.yaml, automatic after that

Future Extensions

- Bidirectional: Query the wiki during DESIGN phase to surface relevant prior knowledge (prevents re-testing refuted hypotheses)
- Wiki-as-context: Include relevant wiki pages in the planner prompt via LLM Wiki search API (localhost:19828)
- Auto-ingest trigger: After export, hit the LLM Wiki API to trigger ingest without manual UI click

Questions

- Should this live as a built-in feature (flag in run_campaign.py) or a separate post-processing script?
- Any interest in the bidirectional flow (wiki -> design prompt)?
- Other wiki/knowledge tools people are using that we should consider as alternatives?

sriumcp (Maintainer) · 2026-05-20T16:23:02Z

This is a great start. It will already unlock a lot of value, e.g., determining the highest-ROI campaign from previous runs.

We should include the commit IDs of nous & the target repo (e.g., BLIS) when available in the knowledge wiki.

In terms of future work, I think the campaigns produce a number of other artifacts, such as code patches and detailed JSON results from each run (e.g., simulation runs). We need a "raw" data store for these campaign artifacts as well. A pointer to the raw data in the wiki entry would also be useful.

During campaigns, we should be able to attach metadata, which should carry over to the wiki entries.

susiejojo (Author) · 2026-05-20T17:25:59Z

Metadata Capture in Nous

All three suggestions are feasible. Here is how Nous can capture and maintain this metadata so it carries through to LLM Wiki automatically.

Where it lives

state.json (already written at campaign init). Extend it with:

{
  "run_id": "epp-saturation-detector",
  "phase": "DONE",
  "owner": "sriumcp",
  "target_repo": "AI-native-Systems-Research/inference-sim",
  "target_commit": "4e88b200...",
  "nous_commit": "abc123def...",
  "metadata": {
    "goal": "30% capacity improvement at saturation onset",
    "tags": ["flow-control", "saturation", "aimd"]
  }
}

When it gets captured

In setup_work_dir() at campaign start:

import subprocess
from pathlib import Path

def _capture_git_meta(repo_path: str | None) -> dict:
    meta = {}
    # Owner from git username
    meta["owner"] = subprocess.check_output(
        ["git", "config", "user.name"], text=True
    ).strip()

    # Target repo commit + remote
    if repo_path:
        meta["target_commit"] = subprocess.check_output(
            ["git", "-C", repo_path, "rev-parse", "HEAD"], text=True
        ).strip()
        remote = subprocess.check_output(
            ["git", "-C", repo_path, "remote", "get-url", "origin"], text=True
        ).strip()
        # Extract org/repo from the remote URL. Check the HTTPS form first:
        # "https://github.com/..." also contains ":", so an SSH-style check
        # on ":" alone would mis-parse it.
        if "github.com/" in remote:
            meta["target_repo"] = remote.split("github.com/")[-1].removesuffix(".git")
        elif "github.com:" in remote:
            meta["target_repo"] = remote.split("github.com:")[-1].removesuffix(".git")
        else:
            meta["target_repo"] = remote

    # Nous commit (the framework itself)
    nous_dir = Path(__file__).parent
    meta["nous_commit"] = subprocess.check_output(
        ["git", "-C", str(nous_dir), "rev-parse", "HEAD"], text=True
    ).strip()

    return meta

This runs once at campaign init — before any experiments execute — so it captures the exact state the experiments ran against.

Custom metadata from campaign.yaml

Allow a metadata field in campaign.yaml that passes through:

research_question: >
  Can AIMD improve saturation onset by >30%?

metadata:
  tags: [flow-control, saturation]
  goal: "30% capacity improvement"
  jira: PERF-1234

This gets merged into state.json alongside the auto-captured fields.
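
One way the merge could look, as a sketch. The function name and the precedence rule (auto-captured fields at the top level, user metadata kept under `metadata` so it cannot overwrite them) are assumptions; the field layout follows the state.json example above.

```python
def merge_metadata(state: dict, git_meta: dict, campaign: dict) -> dict:
    """Return a new state dict with git fields and campaign metadata merged in."""
    merged = dict(state)
    # Auto-captured fields (owner, target_repo, target_commit, nous_commit)
    # sit at the top level, matching the state.json example.
    merged.update(git_meta)
    # User-supplied metadata stays namespaced under "metadata" so that it
    # cannot overwrite an auto-captured field. That precedence is an assumption.
    merged["metadata"] = {**state.get("metadata", {}),
                          **campaign.get("metadata", {})}
    return merged
```

Returning a new dict instead of mutating `state` keeps the function easy to test and leaves the decision of when to write state.json to the caller.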

How it flows to the wiki

nous2wiki.py reads state.json and emits YAML frontmatter on exported markdown:

---
campaign: epp-saturation-detector
owner: sriumcp
target_repo: AI-native-Systems-Research/inference-sim
target_commit: 4e88b200
nous_commit: abc123def
tags: [flow-control, saturation, aimd]
---
# EPP Saturation Detector Report
...

LLM Wiki preserves frontmatter as queryable page metadata — searchable by owner, target repo, or tags across all campaigns.
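
A stdlib-only sketch of the frontmatter step, assuming the state.json layout shown earlier. `render_frontmatter` and `FRONTMATTER_KEYS` are hypothetical names. Values are written with `json.dumps`, which yields quoted strings and bracketed lists that are also valid YAML, so the output is quoted where the example above is not.

```python
import json

FRONTMATTER_KEYS = ("owner", "target_repo", "target_commit", "nous_commit")


def render_frontmatter(state: dict) -> str:
    """Build a YAML frontmatter block from a state.json dict."""
    fields = {"campaign": state["run_id"]}
    fields.update({k: state[k] for k in FRONTMATTER_KEYS if k in state})
    tags = state.get("metadata", {}).get("tags")
    if tags:
        fields["tags"] = tags
    # Quoting keeps an all-digit abbreviated hash such as 12345678 from being
    # read back as an integer by a YAML parser.
    body = "\n".join(f"{key}: {json.dumps(value)}" for key, value in fields.items())
    return f"---\n{body}\n---\n"
```

The quoting matters mostly for commit hashes: an abbreviated hash that happens to contain only digits would otherwise change type on the way through the wiki.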

Raw data pointers

For the raw artifacts (patches, simulation result JSONs), the exported markdown includes a pointer section:

## Raw Data

- Campaign dir: `.nous/epp-saturation-detector-archive-20260519-115428/`
- Patches: `runs/iter-2/patches/h-main.patch`, `runs/iter-3/patches/h-main.patch`
- Result count: 87 JSON files across 12 iterations

LLM Wiki keeps the narrative as a wiki page; the user follows the path to actual files. We avoid dumping hundreds of raw JSON simulation outputs into the wiki (they would not ingest well anyway).
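
The pointer section could be generated by scanning the campaign directory. This is a sketch that assumes the `runs/iter-N/patches/*.patch` layout from the example above and counts every JSON file under `runs/` as a result; the function name is hypothetical.

```python
from pathlib import Path


def render_raw_data_section(campaign_dir: Path) -> str:
    """Build the '## Raw Data' pointer section for one campaign directory."""
    # Assumption: patches live at runs/iter-N/patches/*.patch and every JSON
    # file under runs/ counts as a result.
    runs = campaign_dir / "runs"
    patches = sorted(p.relative_to(campaign_dir).as_posix()
                     for p in runs.glob("iter-*/patches/*.patch"))
    results = list(runs.rglob("*.json"))
    iterations = [d for d in runs.glob("iter-*") if d.is_dir()]

    lines = ["## Raw Data", "", f"- Campaign dir: `{campaign_dir.as_posix()}/`"]
    if patches:
        lines.append("- Patches: " + ", ".join(f"`{p}`" for p in patches))
    lines.append(f"- Result count: {len(results)} JSON files "
                 f"across {len(iterations)} iterations")
    return "\n".join(lines) + "\n"
```

Only paths and counts go into the wiki page; the files themselves stay where the campaign wrote them.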

susiejojo (Author) · 2026-05-21T17:56:49Z

Design Tradeoffs — Findings from Hands-On Investigation

We spent a session probing LLM Wiki's API, testing its ingestion pipeline, and evaluating whether it can serve as a reasoning engine (not just documentation). Here's what we learned.

What's Programmatically Achievable via the API

LLM Wiki exposes a local HTTP API at 127.0.0.1:19828/api/v1:

| Endpoint | Method | What it does |
|---|---|---|
| `/projects` | GET | List all wiki projects |
| `/projects/{id}/graph` | GET | Full knowledge graph (nodes with type/linkCount + edges with weight) |
| `/projects/{id}/files/content?path=...` | GET | Read any wiki page content |
| `/projects/{id}/search` | POST | Keyword + vector search across content |
| `/projects/{id}/sources/rescan` | POST | Trigger re-indexing of raw/sources |

Auth: Bearer token configured in app-state.json under apiConfig.token.
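
A small stdlib client sketch for the endpoints above. The base URL, paths, and bearer-token scheme are from the table; the helper names are hypothetical, the location of app-state.json must be supplied by the caller, and sending an empty body to the rescan endpoint is an assumption we have not verified against the API.

```python
import json
import urllib.request
from pathlib import Path

API_BASE = "http://127.0.0.1:19828/api/v1"


def load_token(app_state_path: Path) -> str:
    """Read the bearer token from app-state.json (apiConfig.token)."""
    return json.loads(app_state_path.read_text())["apiConfig"]["token"]


def build_request(path: str, token: str, method: str = "GET") -> urllib.request.Request:
    """Build an authenticated request for one of the endpoints in the table."""
    # Assumption: POST /sources/rescan accepts an empty body.
    data = b"" if method == "POST" else None
    return urllib.request.Request(
        API_BASE + path,
        data=data,
        method=method,
        headers={"Authorization": f"Bearer {token}"},
    )


def trigger_rescan(project_id: str, token: str) -> int:
    """Ask the running app to re-index raw/sources; returns the HTTP status."""
    req = build_request(f"/projects/{project_id}/sources/rescan", token, "POST")
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.status
```

`trigger_rescan` only succeeds while the desktop app is running, since the API server lives in the app process.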

What works well:

- Querying the graph topology programmatically (nodes, edges, clusters)
- Reading generated wiki pages (concepts, entities, experiments)
- Triggering rescan after new files are dropped into raw/sources/
- The file watcher auto-detects new .md files without explicit rescan

What doesn't work:

- .json files are silently ignored: LLM Wiki only ingests markdown/text. Ledger and principles must be converted to .md before ingestion.
- No "generate page on demand" endpoint: you can't send a prompt and get a wiki page back via API.
- No headless/daemon mode: the desktop app MUST be running for any processing.
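
Because .json files are skipped, the ledger has to be written out as markdown before it lands in raw/sources/. A minimal sketch of that conversion follows; the ledger schema used here (`iterations`, `iteration`, `outcome`, `experiment_valid`) is hypothetical and should be adjusted to the real ledger.json.

```python
import json
from pathlib import Path


def render_ledger(ledger: dict) -> str:
    """Render a ledger dict as a markdown iteration table."""
    # Hypothetical schema: {"iterations": [{"iteration": 1, "outcome": ...,
    # "experiment_valid": ...}, ...]}.
    lines = [
        "# Ledger",
        "",
        "| Iteration | Outcome | Valid |",
        "|---|---|---|",
    ]
    for it in ledger.get("iterations", []):
        valid = "yes" if it.get("experiment_valid") else "no"
        lines.append(
            f"| {it.get('iteration', '?')} | {it.get('outcome', 'n/a')} | {valid} |"
        )
    return "\n".join(lines) + "\n"


def export_ledger(src_json: Path, dest_dir: Path) -> Path:
    """Convert ledger.json to ledger.md inside dest_dir and return the new path."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    out = dest_dir / "ledger.md"
    out.write_text(render_ledger(json.loads(src_json.read_text())), encoding="utf-8")
    return out
```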

When the App Must Be Open vs. Not

| Operation | App Required? |
|---|---|
| Ingesting new source files → generating wiki pages | Yes (the LLM calls happen inside the app) |
| Querying the graph/search/files via API | Yes (the API server lives in the app process) |
| Writing files to `raw/sources/` | No (just filesystem writes) |
| Reading generated wiki pages from disk | No (they're plain markdown files) |
| Running the JSON→markdown export script | No (pure Python) |

Bottom line: Any operation that involves the LLM (page generation, re-indexing) or the HTTP API requires the app. Reading and writing files on disk does not.
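
An export hook can check whether the app is up before attempting any API call, and fall back to a plain file write otherwise. A sketch, assuming that an open TCP port at 127.0.0.1:19828 means the app is running (`api_is_up` is a hypothetical helper):

```python
import socket


def api_is_up(host: str = "127.0.0.1", port: int = 19828,
              timeout: float = 0.5) -> bool:
    """True if something accepts TCP connections on the LLM Wiki API port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

This only confirms that a listener exists on the port; it does not verify that the listener is LLM Wiki or that the token is valid.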

What Value the Knowledge Graph Brings

The graph is computed FROM wiki pages (wikilinks between them). It provides:

For humans (browsing):

- Visual cluster detection: see which concepts group together
- Spot isolated nodes (under-explored topics)
- Navigate relationships at a glance

For automated pipelines:

- Node degree counts (which concepts are central vs. peripheral)
- Dangling wikilinks (referenced but no page exists = knowledge gap)
- Community structure (which clusters don't connect to others)

What the graph does NOT provide:

- Any information not already in the wiki pages themselves
- Experimental outcomes (CONFIRMED/REFUTED): these aren't captured in the graph structure
- Reasoning about what to try next: that requires an LLM, not a graph query

Assessment: The graph is a nice visualization layer for humans. For automated "suggest next experiment" pipelines, it adds marginal value over just reading the wiki pages directly — the LLM can reason about connections and gaps without needing quantitative topology.
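
The pipeline-facing signals mentioned above (link counts and dangling wikilinks) can also be recovered from the markdown files on disk, with no app running. A rough sketch, assuming Obsidian-style links and that a page is identified by its file stem; `scan_wiki` is a hypothetical helper and not how LLM Wiki computes its graph:

```python
import re
from collections import Counter
from pathlib import Path

# Captures the target of [[Target]], [[Target|alias]] and [[Target#heading]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)")


def scan_wiki(wiki_dir: Path) -> tuple[Counter, set[str]]:
    """Return (number of pages linking to each target, dangling targets)."""
    # Assumption: a page is identified by its file stem, case-sensitively.
    files = list(wiki_dir.rglob("*.md"))
    pages = {p.stem for p in files}
    inbound: Counter = Counter()
    for page in files:
        text = page.read_text(encoding="utf-8")
        # A set, so several links from one page to the same target count once.
        inbound.update({m.strip() for m in WIKILINK.findall(text)})
    dangling = set(inbound) - pages
    return inbound, dangling
```

Targets in `dangling` are referenced but have no page, which is the "knowledge gap" signal; high counts in `inbound` point at the central concepts.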

LLM Wiki as a Reasoning Engine — What We Tested

We tried customizing purpose.md and schema.md to make LLM Wiki generate "opportunity" pages (suggested next experiments) and "dead-end" pages (refuted approaches). Results:

- Custom page types are ignored. The ingestion pipeline has a hardcoded set of output types (concept, entity, source). Custom types defined in schema.md (dead-end, opportunity, principle) were never generated.
- The schema influences structure but not behavior. Frontmatter fields we defined were populated on the pages it DID generate, but it won't create new page types it doesn't know about internally.
- Purpose.md is context, not instruction. Writing "identify highest-ROI next experiments" in purpose.md didn't cause the LLM to shift from documentation to recommendation mode.
- One experiment page was created from final.md (which described a confirmed result with clear "Next Experiments" suggestions inline). But this was the LLM being smart about the source content, not following our schema directives.

Conclusion: LLM Wiki is a documentation tool. It converts source documents into structured reference pages. It is NOT a reasoning engine that can assess experimental outcomes and generate recommendations.

The Claude Code Skill (`llm_wiki_skill`)

nashsu/llm_wiki_skill is a Claude Code skill that documents the API contract. It lets Claude Code agents query the wiki during a session (search, read pages, traverse graph).

Where it helps: A Claude Code agent could query the wiki mid-conversation to pull in prior knowledge when designing new experiments.

Where it doesn't help: It's read-only (plus rescan trigger). It can't make the wiki generate pages or reason about gaps — it's just API access documentation.

Decision Framework: LLM Wiki vs. Custom Claude Skill

| Criterion | LLM Wiki | Custom Claude Skill |
|---|---|---|
| Ingestion (docs → structured pages) | Excellent: automatic, handles PDF/DOCX/MD | We'd build this ourselves |
| Knowledge graph visualization | Built-in, interactive | Would need separate tooling |
| Semantic search | Built-in (LanceDB vectors) | Would need to implement |
| Reasoning about next experiments | Cannot do this | Full control over prompts |
| Custom page types / output format | Limited to hardcoded types | Full flexibility |
| Headless operation | Not supported; app must be open | Runs anywhere Python runs |
| Team sharing | Wiki is just a folder (git/Box) | Same: markdown files on disk |
| Maintenance | Third-party app, updates may break API | We own and maintain it |
| Dependencies | macOS/Linux desktop app + Tauri | Python + LLM API endpoint |

When to Use LLM Wiki

- You want a browsable knowledge base for humans to explore visually
- You're ingesting diverse document types (PDFs, papers, DOCX) and want automatic structuring
- The graph visualization helps your team spot patterns
- You're OK keeping the app open during campaign runs

When to Build a Custom Skill

- The primary goal is automated reasoning (suggest next experiments, identify dead ends)
- You want full control over the prompt, output format, and page types
- You need headless/CI operation with no GUI dependency
- The output must be machine-readable by the orchestrator (structured YAML, not prose wiki pages)

Recommendation

Use both, with clear separation of concerns:

- LLM Wiki: the knowledge base layer. Accumulates campaign outputs as searchable, interlinked documentation. Good for human exploration and as context for future campaigns. Keep the export hook (nous2wiki.py) as proposed.
- Custom Claude skill: the reasoning layer. Reads raw campaign data (ledger, principles, findings) directly. Reasons about what worked, what failed, and what's unexplored. Outputs structured campaign.yaml candidates. No app dependency.

The two don't compete — LLM Wiki answers "what do we know?" and the custom skill answers "what should we do next?"

Alternatively, if the team decides the graph/visualization isn't worth the app-open constraint, a custom skill can handle both: accumulate knowledge as markdown files (same format LLM Wiki uses) and reason over them to suggest next experiments. This is simpler but loses the interactive graph UI.
