v1.0.6: Self-healing latest symlink, reliable script paths, hardened planner protocol

Jeehut · Jeehut · commit 636fffae8671 · 2026-04-14T16:23:42.000+02:00
All skill scripts now self-heal the cache/tandemkit/latest symlink at
the top of every run, so version updates never leave a stale pointer.
Planner/generator/evaluator SKILL.md switched from the fragile
CLAUDE_PLUGIN_ROOT/CLAUDE_SKILL_DIR fallback to an absolute path under
$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest — skills work
whether loaded via plugin system or via ~/.agents/skills/ symlinks.

Planner SKILL.md also tightened in three places:
- Clickable file:// link to the latest Claude-NN.md draft is now a
  HARD REQUIREMENT at every approval step, never optional.
- Last line before AskUserQuestion must be a brief plain-text prompt,
  so the AskUserQuestion overlay can never hide the spec link.
- Codex launch now states explicitly that run_in_background on the
  Agent is the ONLY backgrounding mechanism — passing --background
  to the Codex CLI produces double-backgrounding with empty output.
- Empty output / zero-byte Codex-NN.md is no longer treated as
  "Codex unavailable"; only EXPLICIT error messages trigger that
  branch, otherwise the planner flags a launch issue and asks.
diff --git a/.claude-plugin/plugin.json b/.claude-plugin/plugin.json
@@ -1,6 +1,6 @@
 {
   "name": "tandemkit",
-  "version": "1.0.5",
+  "version": "1.0.6",
   "description": "Describe your goal, approve the spec, then step away — Claude and Codex loop together until it's right.",
   "author": {
     "name": "Cihat Gündüz",
diff --git a/scripts/create-mission.sh b/scripts/create-mission.sh
@@ -1,6 +1,12 @@
 #!/usr/bin/env bash
 set -euo pipefail
 
+# Self-heal the 'latest' symlink so it always points at the newest installed version.
+_TC="$HOME/.claude/plugins/cache/FlineDev/tandemkit"
+_NV=$(ls "$_TC" 2>/dev/null | grep -v '^latest$' | sort -V | tail -1)
+[[ -n "$_NV" ]] && ln -sf "$_NV" "$_TC/latest" 2>/dev/null
+unset _TC _NV
+
 # TandemKit — create-mission
 # Scaffolds a new mission folder with State.json and updates Config.json.
 # Must be run from the project root (where TandemKit/ lives).
diff --git a/scripts/wait-for-state.sh b/scripts/wait-for-state.sh
@@ -1,6 +1,12 @@
 #!/usr/bin/env bash
 set -euo pipefail
 
+# Self-heal the 'latest' symlink so it always points at the newest installed version.
+_TC="$HOME/.claude/plugins/cache/FlineDev/tandemkit"
+_NV=$(ls "$_TC" 2>/dev/null | grep -v '^latest$' | sort -V | tail -1)
+[[ -n "$_NV" ]] && ln -sf "$_NV" "$_TC/latest" 2>/dev/null
+unset _TC _NV
+
 # TandemKit — wait-for-state
 # Blocks until a State.json field matches an expected value (or any change occurs).
 #
diff --git a/skills/evaluator/SKILL.md b/skills/evaluator/SKILL.md
@@ -55,7 +55,7 @@ Both Claude and Codex write their per-round outputs as files in `Evaluator/Round
 > **If you are Codex:** Run the setup script before anything else. It verifies that your `~/.agents/skills/` symlinks resolve correctly and auto-repairs them if stale — handles plugin upgrades transparently with no user involvement.
 
 ```bash
-bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/setup-codex-skills.sh"
+bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/setup-codex-skills.sh"
 ```
 
 Silent if everything is up to date. Prints what changed if repairs were made. Exits with an error if the TandemKit plugin is not installed.
@@ -89,7 +89,7 @@ The user invokes this skill with `/tandemkit:evaluator NNN-MissionName`. First r
    - `"ready-for-eval"` → Proceed to Step 3 immediately.
    - Any other value → Wait:
      ```bash
-     bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" generatorStatus ready-for-eval
+     bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" generatorStatus ready-for-eval
      ```
      Run with `run_in_background: true`. When it prints "READY", proceed to Step 3.
 
@@ -255,12 +255,12 @@ After writing your verdict, IMMEDIATELY start TWO background watchers:
 
 1. **Next round watcher:**
 ```bash
-bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" generatorStatus ready-for-eval --round N+1
+bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" generatorStatus ready-for-eval --round N+1
 ```
 
 2. **Completion watcher:**
 ```bash
-bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" phase complete
+bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" phase complete
 ```
 
 Run both with `run_in_background: true`. When either returns:
diff --git a/skills/generator/SKILL.md b/skills/generator/SKILL.md
@@ -48,7 +48,7 @@ The user invokes this skill with `/tandemkit:generator NNN-MissionName`. First r
    b. If `evaluatorStatus` is `null` → the Evaluator is not ready yet. Update State.json: `generatorStatus: "researching"`. You may read files, investigate the codebase, and prepare — but do NOT create or modify implementation files.
    c. Wait for the Evaluator to signal readiness:
       ```bash
-      bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus watching
+      bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus watching
       ```
       Run with `run_in_background: true`. When it prints "READY", update State.json: `generatorStatus: "working"`, `phase: "generation"`. Proceed.
 6. Check for `UserFeedback/` files — if they exist, read the latest (this is a feedback iteration)
@@ -81,7 +81,7 @@ The user invokes this skill with `/tandemkit:generator NNN-MissionName`. First r
 6. **Signal the Evaluator**: Update State.json — `generatorStatus: "ready-for-eval"`, `evaluatorStatus: "pending"`, `phase: "evaluation"`, `round: N`. Read-modify-write only your fields.
 7. **Wait for evaluation**: Use `wait-for-state.sh`:
    ```bash
-   bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus done --round N
+   bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus done --round N
    ```
    Run with `run_in_background: true`. When `evaluatorStatus` is `"done"` and round matches, read `Evaluator/Round-NN.md`.
 
@@ -213,7 +213,7 @@ If the user says "abort": confirm, set State.json `phase: "abandoned"`, Config.j
 Use `wait-for-state.sh` for ALL State.json watching. Do NOT use raw watchman-wait.
 
 ```bash
-bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus done --round N
+bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/wait-for-state.sh" "$(pwd)/TandemKit/NNN-MissionName" evaluatorStatus done --round N
 ```
 
 Run with `run_in_background: true`. The script checks immediately, then enters a watch loop. When it prints "READY", re-read State.json. The `--round N` parameter ensures you don't match stale values from a previous round.
diff --git a/skills/planner/SKILL.md b/skills/planner/SKILL.md
@@ -14,7 +14,7 @@ You are the Planner. Your job is to investigate the codebase, ask the right ques
 
 1. **Ask questions ONE AT A TIME** with 2-3 sentences of context before each AskUserQuestion call.
 2. **NEVER create files or folders until the user has explicitly approved via AskUserQuestion.** Do not infer approval from context.
-3. **Present a 5–10 line summary + a clickable file link to the latest `Claude-NN.md` draft.** Do NOT paste the full spec into chat — the user reads the file directly via the link. The same file becomes `Spec.md` byte-for-byte after approval (via `cp`), so what they read IS what they're approving. The clickable link must follow the `[name](file:///absolute/path)` format from the workspace AGENTS.md so it opens in the user's editor.
+3. **ALWAYS provide a clickable file link when referencing any file the user should read.** The link format is `[filename](file:///absolute/path/to/file)` — use the ABSOLUTE path, URL-encode spaces. This is a HARD REQUIREMENT for every `Claude-NN.md` draft, every `Spec.md`, and any other file you point the user to. If the user cannot click a link to open the file, you have failed this step. Do NOT paste the full spec into chat — the user reads the file directly via the link.
 4. **Use Variant 1 visual framing** for copyable content:
 
 ╔═══ UPPERCASE LABEL ══════════════════════════════════════════════════╗
@@ -72,7 +72,7 @@ Both Claude and Codex write their per-round outputs as files in `Planner-Discuss
 > **If you are Codex:** Run the setup script before anything else. It verifies that your `~/.agents/skills/` symlinks resolve correctly and auto-repairs them if stale — handles plugin upgrades transparently with no user involvement.
 
 ```bash
-bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/setup-codex-skills.sh"
+bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/setup-codex-skills.sh"
 ```
 
 Silent if everything is up to date. Prints what changed if repairs were made. Exits with an error if the TandemKit plugin is not installed.
@@ -104,10 +104,12 @@ Silent if everything is up to date. Prints what changed if repairs were made. Ex
 
    Only AFTER that block is in the response do you run the scaffolding script. This creates the mission folder, `Planner-Discussion/` subfolder, and `State.json` in one shot:
    ```bash
-   bash "${CLAUDE_PLUGIN_ROOT:-${CLAUDE_SKILL_DIR}/../..}/scripts/create-mission.sh" "NNN-MissionName"
+   bash "$HOME/.claude/plugins/cache/FlineDev/tandemkit/latest/scripts/create-mission.sh" "NNN-MissionName"
    ```
    Also create the feature branch if configured.
-9. **Immediately launch Codex in background** using the Agent tool with `run_in_background: true`. Do NOT also use `--background` in the Codex CLI flags — that creates double-backgrounding where the Agent "completes" immediately but Codex is still running, and you never get the result notification.
+9. **Immediately launch Codex in background** using the Agent tool with `run_in_background: true`.
+
+   **CRITICAL — NO DOUBLE-BACKGROUNDING:** The Agent tool's `run_in_background: true` is the ONLY backgrounding mechanism. Do NOT EVER pass `--background` in the Codex CLI flags. If you use both, the Agent completes instantly with zero output (because Codex itself backgrounded), you get no result notification, Codex-01.md stays empty, and the whole round breaks. This has happened before. The correct pattern: `run_in_background: true` on the Agent call, NO `--background` anywhere in the prompt text.
 
    The Codex prompt points at its template file plus the per-mission inputs. Substitute `NNN-MissionName`, the user's verbatim goal text, AND `{EFFORT}` (the `codex.effort` value you captured from `Config.json` in Step 0.5):
 
@@ -124,11 +126,13 @@ Silent if everything is up to date. Prints what changed if repairs were made. Ex
    - User goal (verbatim): [paste the user's goal text here]
    ```
 
-   **If Codex is unavailable**, distinguish two cases:
+   **If Codex is unavailable** — only if you receive an EXPLICIT error message indicating unavailability. **Empty output or zero-byte Codex-01.md is NOT evidence of unavailability** — it almost always means you double-backgrounded (see rule above) or the prompt was malformed. If Codex-01.md is empty and you did NOT receive an explicit error, tell the user: "Codex produced empty output — this is likely a launch issue, not a Codex problem. Want me to retry?" Do NOT write a placeholder or proceed Claude-only.
+
+   Only treat Codex as unavailable when you see one of these EXPLICIT signals:
 
-   - **Permanent** (CLI not installed, auth expired/invalid, `/codex:rescue` itself errors out before Codex even starts): STOP. Tell the user: "Codex is unavailable. Please run `/codex:setup` to fix, then say 'continue'." Do NOT proceed with Claude-only planning — TandemKit's value comes from the dual-model approach and a permanent failure is something the user needs to fix.
+   - **Permanent** (CLI not installed, auth expired/invalid, `/codex:rescue` itself errors out with a clear error message before Codex starts): STOP. Tell the user: "Codex is unavailable. Please run `/codex:setup` to fix, then say 'continue'." Do NOT proceed with Claude-only planning.
 
-   - **Temporary** (Codex returned a rate-limit / token-quota error like `"You've hit your usage limit. To get more access now... try again at <date>"`, or the `codex-companion` reported a quota/throttle error, or a network/timeout error): Do NOT keep retrying — token-limit errors won't clear within this session. Instead:
+   - **Temporary** (Codex returned an EXPLICIT rate-limit / token-quota error like `"You've hit your usage limit. To get more access now... try again at <date>"`, or the `codex-companion` reported a quota/throttle error with a clear message): Do NOT keep retrying — token-limit errors won't clear within this session. Instead:
      1. Write a **placeholder `Codex-01.md`** to `Planner-Discussion/` containing exactly:
         ```markdown
         # Codex-01 — Skipped (Codex Unavailable)
@@ -223,12 +227,13 @@ Codex is already running in background from Step 0.9. The `Planner-Discussion/`
 26. Present to the user, in chat:
     - **A 5–10 line summary** of what Claude and Codex converged on (mission goal in one sentence, key acceptance criteria, key decisions, anything noteworthy)
     - **Any remaining low-level differences** between Claude and Codex (one short bullet each)
-    - **A clickable file link** to the final draft, in this exact format (substitute the absolute path):
+    - **A clickable file link** to the final draft — this is MANDATORY, not optional:
       ```
       📄 [Claude-NN.md](file:///absolute/path/to/TandemKit/NNN-MissionName/Planner-Discussion/Claude-NN.md)
       ```
-      Use the absolute path (not a relative one) so the link opens in the user's editor regardless of their current directory. URL-encode any spaces or special characters per the workspace AGENTS.md conventions.
+      Construct the absolute path from the current working directory + the relative path. URL-encode spaces. If you present the approval step WITHOUT this clickable link, the user cannot review the spec and the step is broken. NEVER skip the link.
     - **One sentence** explaining: "This file becomes `Spec.md` exactly as written once you approve. If you want editorial changes (typos, wording), say so and I'll apply them after the approval. If you want substantive changes (new criteria, changed scope), say so and Codex will review them in another convergence round."
+    - **LAST LINE before AskUserQuestion** must be a brief plain-text prompt like: "Ready to finalize the spec?" — this acts as a buffer because AskUserQuestion can visually overlay the last line of chat output. The clickable file link MUST appear ABOVE this line, never as the last thing before AskUserQuestion.
 27. Ask for approval via AskUserQuestion. The options should be: **Approve as-is** / **Approve with editorial changes** / **Substantive changes needed**.
 28. Handle the response:
 

Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "tandemkit",`
`3`		`- "version": "1.0.5",`
	`3`	`+ "version": "1.0.6",`
`4`	`4`	`"description": "Describe your goal, approve the spec, then step away — Claude and Codex loop together until it's right.",`
`5`	`5`	`"author": {`
`6`	`6`	`"name": "Cihat Gündüz",`