egavrin
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 7 additions & 5 deletions b/‎CONTRIBUTING.md‎
Lines changed: 7 additions & 5 deletions
diff --git a/‎README.md‎
Lines changed: 64 additions & 10 deletions b/‎README.md‎
Lines changed: 64 additions & 10 deletions
diff --git a/‎SUPPORT.md‎
Lines changed: 10 additions & 2 deletions b/‎SUPPORT.md‎
Lines changed: 10 additions & 2 deletions
diff --git a/‎WORKFLOW.md‎
Lines changed: 3 additions & 2 deletions b/‎WORKFLOW.md‎
Lines changed: 3 additions & 2 deletions
diff --git a/‎packages/adapters/src/index.test.ts‎
Lines changed: 132 additions & 10 deletions b/‎packages/adapters/src/index.test.ts‎
Lines changed: 132 additions & 10 deletions
@@ -11,12 +11,11 @@ local execution behavior for the DevAgent stack.
 - Node `20+`
 - the four sibling repos checked out side by side
 
-For the supported setup path, start from [`devagent-hub`](../devagent-hub/README.md):
+For local development, bootstrap the sibling repos directly:
 
 ```bash
-cd ../devagent-hub
-bun install
-bun run bootstrap:local
+cd ../devagent-sdk && bun install
+cd ../devagent-runner && bun install
 ```
 
 ## Local checks before opening a PR
@@ -28,11 +27,14 @@ bun run test
 bun run check:oss
 ```
 
-If your change affects the live path, also run the Hub baseline checks from `../devagent-hub`.
+If your change affects a downstream integration path, run that consumer's baseline checks in
+addition to the runner checks above.
 
 ## Contribution rules
 
 - Keep the DevAgent path stable first.
 - Treat other adapters as experimental unless live validation proves parity.
+- Keep non-DevAgent adapter command resolution aligned with adapter constructor overrides and runner
+  env overrides.
 - Keep PRs small and lifecycle-focused.
 - Update docs if you change setup, adapter maturity, or validation claims.
@@ -5,7 +5,7 @@ Local execution substrate for DevAgent workflow tasks.
 ## Maturity
 
 Public alpha component. The repo is public, but the packages remain unpublished and are consumed
-through the sibling-repo bootstrap path documented in `devagent-hub`.
+through local workspace dependencies during development.
 
 ## Responsibilities
 
@@ -54,40 +54,94 @@ devagent-runner run --request /tmp/request-plan.json
 devagent-runner inspect <run-id>
 ```
 
+## Command Resolution
+
+The runner adapters resolve `codex`, `claude`, and `opencode` commands in this order:
+
+1. adapter constructor override or resolver
+2. runner env overrides
+3. PATH defaults
+
+Runner env overrides for the standalone CLI:
+
+```bash
+DEVAGENT_RUNNER_CODEX_BIN=/path/to/codex
+DEVAGENT_RUNNER_CLAUDE_BIN=/path/to/claude
+DEVAGENT_RUNNER_OPENCODE_BIN=/path/to/opencode
+```
+
+Default PATH command names are `codex`, `claude`, and `opencode`.
+
+If the runner is embedded as a library, callers can pass either a fixed command string or a
+request-aware resolver function to `CodexAdapter`, `ClaudeAdapter`, or `OpenCodeAdapter`.
+
 ## Local Development Wiring
 
 For local MVP work this repo consumes `@devagent-sdk/*` through file dependencies from
-`../devagent-sdk`, and `devagent-hub` consumes this runner through file dependencies from
-`../devagent-runner/packages/*`.
+`../devagent-sdk`. Downstream consumers can depend on `../devagent-runner/packages/*` during local
+development.
 
-The supported local setup path is the bootstrap flow documented in
-[`devagent-hub/README.md`](../devagent-hub/README.md) and
-[`devagent-hub/BASELINE_VALIDATION.md`](../devagent-hub/BASELINE_VALIDATION.md).
+Keep the runner repo self-contained: setup, validation, and support claims should be documented
+here rather than delegated to a consumer repo.
 
 ## Validated Flow
 
 The runner has been validated in the canonical path:
 
 ```text
-devagent-hub -> LocalRunnerClient -> LocalRunner -> DevAgentAdapter -> devagent execute
+TaskExecutionRequest -> LocalRunner -> DevAgentAdapter -> devagent execute
 ```
 
 Adapter maturity today:
 
 - `DevAgentAdapter`
   - live-validated and supported for the MVP path
 - `CodexAdapter`
+  - structured CLI integration with machine-readable event parsing
 - `ClaudeAdapter`
+  - structured CLI integration with streamed JSON event parsing
 - `OpenCodeAdapter`
-  - adapter-present and smoke-tested, but still experimental
+  - structured CLI integration with JSON event parsing
+
+All non-DevAgent adapters now normalize machine-readable CLI output into the SDK event/result
+model, write standard markdown artifacts, and rely on runner-side read-only enforcement for review
+and verify stages. Support claims still depend on live validation evidence.
+
+## Validation
+
+Use the shared SDK fixture shape or a generated request JSON and validate each executor through the
+debug CLI.
+
+Examples:
+
+```bash
+devagent-runner run --request /tmp/codex-request.json
+devagent-runner run --request /tmp/claude-request.json
+DEVAGENT_RUNNER_OPENCODE_BIN=/Applications/OpenCode.app/Contents/MacOS/opencode-cli \
+  devagent-runner run --request /tmp/opencode-request.json
+```
+
+The supported bar for promoting an executor path beyond experimental is:
+
+- live CLI validation for `triage`, `plan`, `implement`, `verify`, `review`, and `repair`
+- downstream integration validation through PR handoff
+- cancellation and failure drills still passing
+
+Current CLI smoke-validation snapshot as of 2026-03-11:
+
+- `devagent`: `triage`, `verify`
+- `codex`: `implement`, `review`
+- `claude`: `plan`, `repair`
+- `opencode`: `triage`, `plan`, `review`, `verify`
 
-Treat the experimental adapters as development surfaces, not production-equivalent executor paths.
+Those smoke passes confirm current CLI interoperability and artifact persistence. They do not by
+themselves promote `codex`, `claude`, or `opencode` beyond experimental status.
 
 ## Limitations
 
 - packages are not published to a registry yet
 - the supported contributor path is the four-repo sibling checkout flow
-- only the DevAgent adapter is live-validated today
+- only executor paths with current live validation evidence should be described as supported
 
 ## Development
 
 
@@ -12,8 +12,16 @@ Use the bug report template for workspace, eventing, cancellation, or adapter is
 
 Supported:
 
-- the DevAgent adapter path exercised by `devagent-hub -> devagent-runner -> devagent execute`
+- the DevAgent adapter path exercised through `devagent-runner -> devagent execute`
 
 Experimental:
 
-- `codex`, `claude`, and `opencode` adapters until they have comparable live validation
+- `codex`, `claude`, and `opencode` adapters remain validation-gated until they have comparable
+  live evidence through both `devagent-runner` CLI and at least one downstream integration
+- runner CLI smoke passes alone are not enough to treat those adapters as supported
+
+Binary overrides for standalone runner usage:
+
+- `DEVAGENT_RUNNER_CODEX_BIN`
+- `DEVAGENT_RUNNER_CLAUDE_BIN`
+- `DEVAGENT_RUNNER_OPENCODE_BIN`
@@ -27,12 +27,13 @@ bun run check:oss
 
 ## Done means
 
-- the DevAgent adapter path still works through Hub baseline smoke
+- the DevAgent adapter path still passes runner validation and downstream smoke coverage
 - artifacts and events are written predictably
 - cancellation, timeout, and cleanup behavior remain test-covered
+- non-DevAgent adapters keep structured event parsing and read-only enforcement intact
 - docs do not overstate experimental adapter maturity
 
 ## Supported vs experimental
 
-- Supported: `DevAgentAdapter` in the current Hub -> Runner -> DevAgent path
+- Supported: `DevAgentAdapter` in the current runner -> DevAgent path
 - Experimental: `CodexAdapter`, `ClaudeAdapter`, and `OpenCodeAdapter` until they have matching live validation
@@ -2,7 +2,7 @@ import assert from "node:assert/strict";
 import { join } from "node:path";
 import { chmod, mkdtemp, mkdir, readFile, writeFile } from "node:fs/promises";
 import { tmpdir } from "node:os";
-import { test } from "vitest";
+import { afterEach, test } from "vitest";
 import {
   ClaudeAdapter,
   CodexAdapter,
@@ -22,7 +22,10 @@ async function createWorkspace(): Promise<{ root: string; artifactDir: string; w
   return { root, artifactDir, workspacePath };
 }
 
-function createRequest(executorId: TaskExecutionRequest["executor"]["executorId"]): TaskExecutionRequest {
+function createRequest(
+  executorId: TaskExecutionRequest["executor"]["executorId"],
+  options: { model?: string; provider?: string; readOnly?: boolean } = {},
+): TaskExecutionRequest {
   return {
     protocolVersion: PROTOCOL_VERSION,
     taskId: `task-${executorId}`,
@@ -33,14 +36,23 @@ function createRequest(executorId: TaskExecutionRequest["executor"]["executorId"
       sourceRepoPath: "/tmp/repo",
       workBranch: `devagent/${executorId}/task`,
       isolation: "temp-copy",
+      readOnly: options.readOnly,
+    },
+    executor: {
+      executorId,
+      model: options.model ?? "test-model",
+      provider: options.provider,
     },
-    executor: { executorId, model: "test-model" },
     constraints: {},
     context: { summary: "smoke" },
     expectedArtifacts: ["triage-report"],
   };
 }
 
+afterEach(() => {
+  delete process.env.DEVAGENT_RUNNER_CODEX_BIN;
+});
+
 async function createStub(path: string, contents: string): Promise<void> {
   await writeFile(path, contents);
   await chmod(path, 0o755);
@@ -185,26 +197,31 @@ const fs = require("fs");
 const args = process.argv.slice(2);
 const outIndex = args.indexOf("-o");
 if (outIndex >= 0) fs.writeFileSync(args[outIndex + 1], "stub codex output\\n");
-process.stdout.write("{\\"type\\":\\"result\\",\\"message\\":\\"ok\\"}\\n");
+process.stdout.write("{\\"type\\":\\"thread.started\\"}\\n");
+process.stdout.write("{\\"type\\":\\"turn.started\\"}\\n");
+process.stdout.write("{\\"type\\":\\"item.completed\\",\\"item\\":{\\"type\\":\\"agent_message\\",\\"text\\":\\"stub codex output\\"}}\\n");
+process.stdout.write("{\\"type\\":\\"turn.completed\\"}\\n");
 `);
 
+  process.env.DEVAGENT_RUNNER_CODEX_BIN = `${process.execPath} ${stubPath}`;
   const { events, result } = await collectEvents(
-    new CodexAdapter(`${process.execPath} ${stubPath}`),
+    new CodexAdapter(),
     createRequest("codex"),
     workspacePath,
     artifactDir,
   );
 
   assert.equal(result.status, "success");
-  assert.equal(events.at(-1)?.type, "completed");
+  assert.deepEqual(events.map((event) => event.type), ["started", "progress", "progress", "progress", "progress"]);
   assert.match(await readFile(join(artifactDir, "triage-report.md"), "utf8"), /stub codex output/);
 });
 
 test("ClaudeAdapter smoke test with stub executable", async () => {
   const { root, artifactDir, workspacePath } = await createWorkspace();
   const stubPath = join(root, "claude-stub.js");
   await createStub(stubPath, `#!/usr/bin/env node
-process.stdout.write("claude stub output\\n");
+process.stdout.write(JSON.stringify({ type: "assistant", message: { content: [{ type: "text", text: "claude stub output" }] } }) + "\\n");
+process.stdout.write(JSON.stringify({ type: "result", subtype: "success", result: "claude stub output" }) + "\\n");
 `);
 
   const { events, result } = await collectEvents(
@@ -215,23 +232,128 @@ process.stdout.write("claude stub output\\n");
   );
 
   assert.equal(result.status, "success");
-  assert.equal(events.at(-1)?.type, "completed");
+  assert.deepEqual(events.map((event) => event.type), ["started", "progress", "progress"]);
+  assert.match(await readFile(join(artifactDir, "triage-report.md"), "utf8"), /claude stub output/);
 });
 
 test("OpenCodeAdapter smoke test with stub executable", async () => {
   const { root, artifactDir, workspacePath } = await createWorkspace();
   const stubPath = join(root, "opencode-stub.js");
   await createStub(stubPath, `#!/usr/bin/env node
-process.stdout.write("opencode stub output\\n");
+const args = process.argv.slice(2);
+const agentIndex = args.indexOf("--agent");
+if (agentIndex === -1 || args[agentIndex + 1] !== "build") {
+  throw new Error("expected build agent");
+}
+const permissions = process.env.OPENCODE_PERMISSION || "";
+if (!permissions.includes('"*":"deny"') || !permissions.includes('"read":"allow"') || !permissions.includes('"edit":"deny"') || !permissions.includes('"write":"deny"')) {
+  throw new Error("expected read-only permissions");
+}
+if (process.argv.includes("--model")) {
+  throw new Error("unexpected --model flag");
+}
+process.stdout.write(JSON.stringify({ type: "step_start", part: { type: "step-start" } }) + "\\n");
+process.stdout.write(JSON.stringify({ type: "text", part: { type: "text", text: "opencode stub output" } }) + "\\n");
+process.stdout.write(JSON.stringify({ type: "step_finish", part: { type: "step-finish" } }) + "\\n");
 `);
 
   const { events, result } = await collectEvents(
+    new OpenCodeAdapter(`${process.execPath} ${stubPath}`),
+    createRequest("opencode", { readOnly: true }),
+    workspacePath,
+    artifactDir,
+  );
+
+  assert.equal(result.status, "success");
+  assert.deepEqual(events.map((event) => event.type), ["started", "progress", "progress", "progress"]);
+  assert.match(await readFile(join(artifactDir, "triage-report.md"), "utf8"), /opencode stub output/);
+});
+
+test("OpenCodeAdapter passes provider-qualified model names through", async () => {
+  const { root, artifactDir, workspacePath } = await createWorkspace();
+  const stubPath = join(root, "opencode-model-stub.js");
+  await createStub(stubPath, `#!/usr/bin/env node
+const args = process.argv.slice(2);
+const modelIndex = args.indexOf("--model");
+if (modelIndex === -1 || args[modelIndex + 1] !== "opencode/big-pickle") {
+  throw new Error("expected provider-qualified --model");
+}
+process.stdout.write(JSON.stringify({ type: "text", part: { type: "text", text: "opencode model output" } }) + "\\n");
+`);
+
+  const { result } = await collectEvents(
+    new OpenCodeAdapter(`${process.execPath} ${stubPath}`),
+    createRequest("opencode", { provider: "opencode", model: "big-pickle" }),
+    workspacePath,
+    artifactDir,
+  );
+
+  assert.equal(result.status, "success");
+  assert.match(await readFile(join(artifactDir, "triage-report.md"), "utf8"), /opencode model output/);
+});
+
+test("OpenCodeAdapter surfaces structured errors without mislabeling them as permissions", async () => {
+  const { root, artifactDir, workspacePath } = await createWorkspace();
+  const stubPath = join(root, "opencode-error-stub.js");
+  await createStub(stubPath, `#!/usr/bin/env node
+process.stdout.write(JSON.stringify({
+  type: "error",
+  error: { data: { message: "Model not found: opencode/missing-model" } }
+}) + "\\n");
+process.exit(1);
+`);
+
+  const { result } = await collectEvents(
     new OpenCodeAdapter(`${process.execPath} ${stubPath}`),
     createRequest("opencode"),
     workspacePath,
     artifactDir,
   );
 
+  assert.equal(result.status, "failed");
+  assert.equal(result.error?.message, "Model not found: opencode/missing-model");
+});
+
+test("ClaudeAdapter fails when no final assistant text is produced", async () => {
+  const { root, artifactDir, workspacePath } = await createWorkspace();
+  const stubPath = join(root, "claude-empty-stub.js");
+  await createStub(stubPath, `#!/usr/bin/env node
+process.stdout.write(JSON.stringify({ type: "result", subtype: "success", result: "" }) + "\\n");
+`);
+
+  const { result } = await collectEvents(
+    new ClaudeAdapter(`${process.execPath} ${stubPath}`),
+    createRequest("claude"),
+    workspacePath,
+    artifactDir,
+  );
+
+  assert.equal(result.status, "failed");
+  assert.equal(result.artifacts.length, 0);
+});
+
+test("ClaudeAdapter captures plan-mode file output when no final assistant text is emitted", async () => {
+  const { root, artifactDir, workspacePath } = await createWorkspace();
+  const stubPath = join(root, "claude-plan-stub.js");
+  await createStub(stubPath, `#!/usr/bin/env node
+process.stdout.write(JSON.stringify({
+  type: "user",
+  tool_use_result: {
+    type: "create",
+    filePath: "/Users/test/.claude/plans/example-plan.md",
+    content: "# Example Plan\\n\\nExecutor claude handled task type plan."
+  }
+}) + "\\n");
+process.stdout.write(JSON.stringify({ type: "result", subtype: "success", result: "" }) + "\\n");
+`);
+
+  const { result } = await collectEvents(
+    new ClaudeAdapter(`${process.execPath} ${stubPath}`),
+    createRequest("claude", { readOnly: true }),
+    workspacePath,
+    artifactDir,
+  );
+
   assert.equal(result.status, "success");
-  assert.equal(events.at(-1)?.type, "completed");
+  assert.match(await readFile(join(artifactDir, "triage-report.md"), "utf8"), /Example Plan/);
 });