Skip to content

Commit 56327bb

Browse files
franklywatsonclaude
andcommitted
docs: update my-claw case study with current project state
Re-scouted my-claw: 32 test files, 492 tests (was 31/473), added Discord stack tests, renumbered test suites. Updated project description from "war room command center" to "autonomous, self-managing, multi-agent AI system" to match current README. Updated test table with current suite names and counts. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
1 parent ed68954 commit 56327bb

2 files changed

Lines changed: 9 additions & 8 deletions

File tree

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ The lineage: patterns were extracted from the reference project, organized into
3434
| [rig](https://github.com/franklywatson/claude-rig) | Baseline guardrails framework: L2 enforcement pipeline, L3 tool routing + scout agent, skill chain with phase transitions, CI guardrails. Configurable via `.harness.yaml`. | TypeScript |
3535
| [gstack](https://github.com/garrytan/gstack) | L2 skill framework with resolver pipeline, preamble system | TypeScript |
3636
| [superpowers](https://github.com/obra/superpowers) | L2 base skills (brainstorming, TDD, verification, review), automated worktree management | Markdown/JS |
37-
| [my-claw](docs/references/reference-my-claw-case-study.md) | L1 design rinsing reference: multi-agent war room whose architecture evolved through three phases of cross-domain design rinsing — YouTube demo to architecture, academic talk + codebases to agent design, agentic-patterns + compound engineering to development approach | Python |
37+
| [my-claw](docs/references/reference-my-claw-case-study.md) | L1 design rinsing reference: autonomous multi-agent system whose architecture evolved through three phases of cross-domain design rinsing — YouTube demo to architecture, academic talk + codebases to agent design, agentic-patterns + compound engineering to development approach | Python |
3838

3939
## The Pattern Pyramid
4040

@@ -100,7 +100,7 @@ This is a living pattern library. Contributions welcome:
100100
## Background and Further Reading
101101

102102
- [Reference Telegram Trading Bot Case Study](docs/references/reference-telegram-trading-bot-case-study.md) — the production system these patterns were extracted from
103-
- [Reference my-claw Case Study](docs/references/reference-my-claw-case-study.md) — design rinsing in practice: multi-agent war room architecture evolved through cross-domain extraction across three phases
103+
- [Reference my-claw Case Study](docs/references/reference-my-claw-case-study.md) — design rinsing in practice: autonomous multi-agent system architecture evolved through cross-domain extraction across three phases
104104
- [FAQ](docs/cross-cutting/faq.md) — deployment, operations, and other SDLC concerns beyond the patterns
105105
- [Further Reading](docs/references/further-reading.md) — articles, videos, and tools that informed this work
106106
- [Glossary](docs/cross-cutting/glossary.md) — terminology reference

docs/references/reference-my-claw-case-study.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,10 @@
11
# Reference my-claw Project — Design Rinsing in Practice
22

3-
**Project:** my-claw — Multi-agent war room command center with voice, Telegram, Discord, and WebSocket interfaces
3+
**Project:** my-claw — Autonomous, self-managing, multi-agent AI system with voice, Telegram, Discord, and WebSocket interfaces
44

55
**Tech Stack:** Python 3.11+, Pipecat (real-time frame-processing pipeline), litellm (provider-agnostic LLM gateway), FastAPI, SQLite
66

7-
**Scale:** 5-agent architecture with 3-tier routing, voice integration via Deepgram STT + Cartesia TTS, 31 test files with 473 test functions (unit + stack + Docker stack + browser stack), room-based isolation with 3 templates, 10 worker roles, behavioral constitution, trust tiers, memory system, heartbeat, and scheduling
7+
**Scale:** 5-agent architecture with 3-tier routing, voice integration via Deepgram STT + Cartesia TTS, 32 test files with 492 test functions (unit + stack + Docker stack + browser stack + Discord stack), room-based isolation with 3 templates, 10 worker roles, behavioral constitution, trust tiers, memory system, heartbeat, and scheduling
88

99
This case study demonstrates design rinsing — the structured practice of extracting distilled architectural understanding from external sources and translating it into a project's design. The my-claw project evolved through three distinct rinsing phases, each building on the last. That compounding — where each rinsing phase leveraged and extended the previous — is itself an example of [compound engineering](https://github.com/EveryInc/compound-engineering-plugin): each unit of work making subsequent units easier.
1010

@@ -178,14 +178,15 @@ The testing infrastructure demonstrates rinsing at the practice level — the tr
178178
| Trading Bot Pattern | my-claw Translation |
179179
|---|---|
180180
| StackTestUtils class | Per-test session management with real services |
181-
| Sequential test ordering | ST1-ST12 ordered by dependency (startup → auth → routing → voice → rooms) |
181+
| Sequential test ordering | ST1-ST11 ordered by dependency (startup → routing → voice → rooms → tools → trust → heartbeat) |
182182
| Real dependencies | Zero mocks in integration tests; real Deepgram, Cartesia, litellm APIs |
183183
| Full-loop assertions | Tests verify entire user journeys, not individual functions |
184-
| Docker stack tests | ST-D1-ST-D11 against Docker container |
184+
| Docker stack tests | ST-D1-ST-D10 against Docker container |
185185
| Browser stack tests | ST-B1-ST-B8 via Playwright against running container |
186+
| Discord stack tests | ST-DS1-ST-DS6 against real Discord bot |
186187
| Health endpoint test mode | Container readiness checks before domain tests |
187-
| Room isolation tests | 4 tests verifying isolated pipelines don't interfere |
188-
| Tool stack tests | 5 tests verifying delegation and tool execution |
188+
| Room isolation tests | ST-R1-ST-R4 verifying isolated pipelines don't interfere |
189+
| Tool stack tests | ST-T1-ST-T6 verifying delegation and tool execution with real LLM |
189190

190191
Test markers: `pytest -m "not integration"` for unit (no network), `pytest -m integration` for real API tests (auto-skip if no .env). Unit tests use no mocks — they test against the module interfaces directly.
191192

0 commit comments

Comments
 (0)