Commit 586940b
running-in-ci: verify external-tool behavior by run or clone, not just hand-test (#327)
## Problem
The bundled `running-in-ci` skill's Grounded Analysis section covers
behavioral claims by telling the bot to "run the command yourself" or
hedge — but that fallback doesn't obviously apply to external CLIs and
APIs that aren't installed in CI and aren't exercised by automated
tests. In
max-sixty/worktrunk#1907 (comment),
the bot read upstream Mintlify docs describing a `cmux list-workspaces
--json` flag, took them at face value, and committed a recipe that broke
for every reader: the installed cmux had no such flag.
## Solution
Adds a new `### Verifying external-tool behavior` subsection under
Grounded Analysis. It points at two concrete verification paths in order
of preference: install and run the tool, or clone its public repo and
grep the source. Deferring to a human with the tool installed is the
fallback only when both paths fail. The "don't make overconfident
claims" framing already lives in the preceding `### User-facing comments
require source evidence` subsection.
The bad/good example is drawn from the cmux incident: the good path now
shows cloning the upstream repo and checking the CLI parser, not asking
a human to confirm.
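
The order of preference described above (run the installed tool, otherwise search a cloned checkout, otherwise defer to a human) could be sketched roughly as follows. This is an illustration, not the skill's actual text. The function names are hypothetical. It assumes the tool prints its flags under `<subcommand> --help` and that the upstream repo has already been cloned to `repo_dir`.

```python
import shutil
import subprocess
from pathlib import Path
from typing import Optional


def flag_in_help(tool: str, subcommand: str, flag: str) -> Optional[bool]:
    """Path 1: ask the installed tool. Returns None when the tool is not on PATH."""
    if shutil.which(tool) is None:
        return None
    # Assumption: the tool lists its flags under `<subcommand> --help`.
    result = subprocess.run(
        [tool, subcommand, "--help"],
        capture_output=True,
        text=True,
        timeout=30,
    )
    return flag in (result.stdout + result.stderr)


def flag_in_source(repo_dir: Path, flag: str) -> bool:
    """Path 2: search a cloned checkout for the flag literal."""
    for path in repo_dir.rglob("*"):
        # Skip directories and git metadata.
        if not path.is_file() or ".git" in path.relative_to(repo_dir).parts:
            continue
        try:
            if flag in path.read_text(errors="ignore"):
                return True
        except OSError:
            continue
    return False


def verify_flag(
    tool: str, subcommand: str, flag: str, repo_dir: Optional[Path] = None
) -> str:
    """Return 'confirmed', 'absent', or 'unverified' (defer to a human)."""
    installed = flag_in_help(tool, subcommand, flag)
    if installed is not None:
        return "confirmed" if installed else "absent"
    if repo_dir is not None and repo_dir.is_dir():
        return "confirmed" if flag_in_source(repo_dir, flag) else "absent"
    return "unverified"
```

A caller might treat `"absent"` as a reason to drop the claim and `"unverified"` as the point to hedge or defer to someone with the tool installed. A plain text match in the source is weaker evidence than reading the CLI parser itself, since the string could appear only in docs or tests.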
## Testing
Skill text only. `pre-commit run` on the modified file passes (typos,
trim-whitespace, bang-backtick, end-of-files).
---
Closes #326 — automated triage
---------
Co-authored-by: tend-agent <270458913+tend-agent@users.noreply.github.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: Maximilian Roos <5635139+max-sixty@users.noreply.github.com>

1 parent 9bee033 commit 586940b
1 file changed
Lines changed: 34 additions & 0 deletions