Skip to content

feat(agents,skills): opus/fable model tiers + skill efficiency pass#278

Merged
Lykhoyda merged 2 commits into
mainfrom
feat/agent-models-skill-efficiency
Jun 11, 2026
Merged

feat(agents,skills): opus/fable model tiers + skill efficiency pass#278
Lykhoyda merged 2 commits into
mainfrom
feat/agent-models-skill-efficiency

Conversation

@Lykhoyda

Copy link
Copy Markdown
Owner

Summary

Two commits: agent model upgrades, then a skill efficiency pass driven by a duplication/staleness audit.

Note: the Conductor-prefilled PR description referenced the creating-actions skill — that work already merged as #272 this morning; this body describes the actual branch content.

1. Agent models (5211903)

Agent Was Now
rn-tester sonnet opus
rn-code-explorer sonnet opus
rn-code-reviewer sonnet opus
rn-debugger opus opus
rn-code-architect opus fable

The architect gets the top tier (fable → claude-fable-5): it is the pipeline's single 1–2× deep-reasoning blueprint step, vs the others' 2–3× parallel fan-outs. Model-tier prose synced in using-rn-dev-agent (agent-map tables + decision tree) and the six docs-site agent pages.

2. Skill efficiency pass (1eb5e65)

The audit found the real inefficiencies were drifting duplication and stale claims, not just word counts:

  • rn-feature-development 5,076 → 3,962 words (−22%): Phase 8 was a ~1,800-word copy of the /proof-capture protocol that had already diverged from it (the command gained video-labeling the skill lacked; both carried a claim falsified by feat(skills): creating-actions — guided authoring of reusable Maestro actions #272). Phase 8 now defers to commands/proof-capture.md as the single protocol source and lists only the pipeline deltas (architect's flow table as source, persist-as-action via creating-actions Steps 3–6, cdp_run_action smoke-test, Deviations section). Eight repeated per-phase evaluator lines collapsed into one core principle. Description rewritten trigger-only per the CSO rule — a workflow-summarizing description makes the body get skipped.
  • using-rn-dev-agent 2,065 → 1,825 (−12%) — loads at the start of every RN conversation, so highest leverage: HELPERS_NOT_INJECTED recovery protocol moved to rn-debugging (its owner) with routing pointers; stale surface counts fixed (64→76 MCP tools, 16→14 commands).
  • rn-testing: M7 section slimmed to a 5-key table + creating-actions pointer (the full glossary lives there); heading preserved so existing citations resolve.
  • rn-best-practices / rn-setup: trigger-only descriptions (dropped the rot-prone 118-rule inventory; added concrete failure-phrase triggers).
  • Correctness: two files still claimed maestro_run has "no -e pass-through" (false since feat(skills): creating-actions — guided authoring of reusable Maestro actions #272 — now show params={KEY: "VALUE"}); broken citation in run-action.md ("Maestro Flow Standards" → real heading); smoke-test standardized on cdp_run_action (RunRecord + auto-promotion) with plain maestro_run reserved for the on-camera replay (auto-repair must not mutate a flow mid-recording); <test-app> placeholder defined.

Verification

  • Deterministic sweeps: all 8 skill frontmatters parse; zero stale-claim greps; all cross-references/citations resolve; M7 table escapes render.
  • Confined-subagent retrieval test on the restructured Phase 8 chain (skill → proof-capture → creating-actions): answered every execution question concretely from the new text, and surfaced 4 real defects — stale "Step 1.4" cross-references, the smoke-test tool contradiction, a Steps-range that skipped the diagram, the undefined <test-app> placeholder — all fixed in this PR before commit.

Wiring

  • Changeset: rn-dev-agent-plugin minor (covers both commits)
  • No bridge/source changes; docs/skills/agents only

🤖 Generated with Claude Code

Lykhoyda and others added 2 commits June 11, 2026 15:39
rn-tester, rn-code-explorer, rn-code-reviewer move sonnet → opus;
rn-debugger already opus; rn-code-architect moves opus → fable
(claude-fable-5 — the top tier, fitting the architect's role as the
single deep-reasoning blueprint step in the /rn-feature-dev pipeline,
where it runs 1-2× vs the explorers'/reviewers' 2-3× parallel fans).

Model-tier prose synced: using-rn-dev-agent agent-map tables +
decision tree, docs-site agent pages + index tables.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
… stale-claim fixes

- rn-feature-development (5,076→3,962 words, −22%): Phase 8 defers to
  commands/proof-capture.md as the single protocol source (was a ~1,800-word
  drifting duplicate) with pipeline deltas on top; 8 per-phase evaluator
  lines collapsed to one core principle; description rewritten trigger-only
  per the CSO rule (workflow-summarizing descriptions make the body get
  skipped).
- using-rn-dev-agent (2,065→1,825): HELPERS_NOT_INJECTED recovery moved to
  rn-debugging (its owner) with routing pointers; surface counts fixed
  (76 tools / 14 commands, were 64/16).
- rn-testing: M7 section slimmed to a 5-key table + creating-actions
  pointer; heading preserved for citations.
- rn-best-practices / rn-setup: trigger-only descriptions (dropped the
  rot-prone 118-rule inventory; added failure-phrase triggers).
- Correctness: maestro_run/cdp_run_action params claims updated for #272
  (two files said "no -e pass-through"); run-action.md citation fixed
  ("Maestro Flow Standards" → real heading); smoke-test standardized on
  cdp_run_action (RunRecord + promotion), plain maestro_run reserved for
  the on-camera replay; <test-app> placeholder defined.

Verified: frontmatter parses on all 8 skills; zero stale-claim greps; all
cross-references resolve; confined-subagent retrieval test answered the
full Phase 8 chain (skill → proof-capture → creating-actions) concretely —
its 4 findings (stale Step-1.4 refs, smoke-test contradiction, Steps-range
skipping the diagram, undefined placeholder) are fixed in this commit.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1eb5e65c81

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

</example>
tools: Glob, Grep, LS, Read
model: opus
model: fable

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Use a valid Claude Code model id for architect

When /rn-feature-dev reaches Phase 4 and spawns rn-code-architect, this frontmatter asks Claude Code for model: fable. I checked the current Claude Code model/subagent docs: supported aliases are default, best, sonnet, opus, haiku, sonnet[1m], opus[1m], and opusplan; non-alias selections need to be full model names. As written, users with this plugin will hit an invalid/unknown model selection instead of getting the architect blueprint, so this should use a supported alias or the full Fable model ID if Claude Code exposes one.

Useful? React with 👍 / 👎.

@Lykhoyda Lykhoyda merged commit 85a6b60 into main Jun 11, 2026
10 checks passed
@Lykhoyda Lykhoyda deleted the feat/agent-models-skill-efficiency branch June 11, 2026 14:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant