Skip to content

Commit ac66926

Browse files
authored
chore(ce-work-beta): remove beta Codex-delegation skill (#1013)
1 parent b5633aa commit ac66926

21 files changed

Lines changed: 534 additions & 1729 deletions

README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -105,7 +105,7 @@ The first pass tightens recent branch changes before review. The targeted pass i
105105

106106
After installing, run `/ce-setup` in any project. It checks repo-local config, reports optional tool capabilities, and helps keep machine-local CE settings safely gitignored.
107107

108-
The `compound-engineering` plugin currently ships 27 skills and 0 standalone agents. Specialist review, research, and workflow behavior lives inside the owning skills as skill-local prompt assets.
108+
The `compound-engineering` plugin currently ships 26 skills and 0 standalone agents. Specialist review, research, and workflow behavior lives inside the owning skills as skill-local prompt assets.
109109

110110
### Full Skill Inventory
111111

@@ -136,7 +136,6 @@ The `compound-engineering` plugin currently ships 27 skills and 0 standalone age
136136
| `/ce-polish` | Start a dev server and iterate on UX polish |
137137
| `/ce-proof` | Create, edit, and share Proof documents |
138138
| `/ce-dogfood-beta` | Diff-scoped browser QA of the active branch |
139-
| `/ce-work-beta` | Experimental execution workflow with Codex delegation mode |
140139
| `/lfg` | Full autonomous engineering workflow |
141140

142141
---

docs/plans/2026-06-26-001-chore-remove-ce-work-beta-plan.html

Lines changed: 449 additions & 0 deletions
Large diffs are not rendered by default.

docs/solutions/best-practices/codex-delegation-best-practices.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,16 +18,17 @@ tags:
1818
- batching
1919
- orchestration-cost
2020
- prompt-engineering
21-
- ce-work-beta
2221
---
2322

2423
# Codex Delegation Best Practices
2524

2625
## Context
2726

28-
Over six iterations of evaluation building Codex delegation into `ce-work-beta`, we collected quantitative data on the token economics of orchestrating work between Claude Code (the orchestrator) and Codex (the delegated executor). The core question: when does delegating plan units to Codex actually save Claude tokens, and what architectural patterns control the cost?
27+
> **Note:** This is a retrospective. The experimental delegation skill it studied (`ce-work-beta`) has since been removed from the plugin. The findings below are preserved as general guidance for designing external-model delegation in any orchestrator skill, not as documentation for a live feature.
2928
30-
The delegation model: `ce-work-beta` receives a plan with N implementation units, then decides whether to execute them directly (standard mode) or delegate them to Codex via `codex exec`. Delegation has a fixed orchestration overhead per batch (prompt file write, codex exec invocation, result classification, commit) of approximately 4-5k Claude tokens. Each unit of code Claude does not write saves roughly 3-5k tokens. The crossover depends on how many units are batched per delegation call.
29+
Over six iterations of evaluation building Codex delegation into an experimental `ce-work` delegation mode, we collected quantitative data on the token economics of orchestrating work between Claude Code (the orchestrator) and Codex (the delegated executor). The core question: when does delegating plan units to Codex actually save Claude tokens, and what architectural patterns control the cost?
30+
31+
The delegation model: the delegating skill receives a plan with N implementation units, then decides whether to execute them directly (standard mode) or delegate them to Codex via `codex exec`. Delegation has a fixed orchestration overhead per batch (prompt file write, codex exec invocation, result classification, commit) of approximately 4-5k Claude tokens. Each unit of code Claude does not write saves roughly 3-5k tokens. The crossover depends on how many units are batched per delegation call.
3132

3233
The evaluation spanned iterations 1-6, testing small (1-2 units), medium (4 units), large (7 units), and extra-large (10 units) plans in both delegation and standard modes, with real code implementation and test verification in isolated worktrees.
3334

docs/solutions/skill-design/ce-work-beta-promotion-checklist.md

Lines changed: 0 additions & 106 deletions
This file was deleted.

0 commit comments

Comments
 (0)