docs: add "When to use" guide so users can evaluate whether ralph loops fit their task

Kasper Junge · claude · Kasper Junge · commit a0b8a79f7e32 · 2026-03-17T09:53:31.000+01:00
Users landing on the docs could see how ralphify works but had no guidance on
whether it was the right tool for their specific use case. The new page covers
what works well, what doesn't, loop vs. single conversation trade-offs, and how
to adapt borderline tasks.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
diff --git a/docs/when-to-use.md b/docs/when-to-use.md
@@ -0,0 +1,84 @@
+---
+description: When ralph loops are the right tool — what kinds of tasks work well, what doesn't, and how to tell if your task fits the pattern.
+---
+
+# When to use ralph loops
+
+Ralph loops are powerful, but they're not the right tool for everything. This page helps you decide whether a loop fits your task before you invest time setting one up.
+
+## The sweet spot
+
+Ralph loops work best when a task has these properties:
+
+**Decomposable into small, independent steps.** The loop does one thing per iteration. Tasks that naturally break into "do this, then this, then this" are ideal — implementing features from a TODO list, writing tests module by module, fixing lint errors one at a time.
+
+**Has a clear definition of "done" for each step.** Checks need something to validate. If you can express "this iteration succeeded" as a command that exits 0 or non-zero (tests pass, build succeeds, lint is clean), the self-healing loop works. If success requires human judgment ("does this look good?"), a loop can't self-correct.
+
+**Benefits from fresh context.** Long conversations degrade — the agent loses track of earlier instructions, fills up the context window, and starts making mistakes. If your task will take more than 15-20 minutes of agent work, a loop outperforms a single conversation because each iteration starts clean with the current state of the codebase.
+
+**Progress is visible in the codebase.** The agent's work must be observable on disk — files changed, tests added, docs written, commits made. The next iteration reads the codebase to understand what's been done. Tasks that produce output elsewhere (a Slack message, a deployment, an email) need a wrapper that records progress locally.
+
+## What works well
+
+| Task | Why it fits |
+|---|---|
+| **Implementing features from a spec** | Each feature is one iteration; tests validate correctness |
+| **Writing tests** | Each module is one iteration; coverage reports guide prioritization |
+| **Fixing lint / type errors** | Each fix is small and independently verifiable |
+| **Documentation improvements** | Each page is one iteration; `mkdocs build --strict` validates |
+| **Codebase migrations** (JS→TS, Python 2→3) | Each file is one iteration; the compiler (or, where there is none, the test suite) validates |
+| **Bug fixing** | Each bug is one iteration; regression tests verify the fix |
+| **Refactoring** | Each extraction/rename is one iteration; tests catch regressions |
+
+## What doesn't work well
+
+| Task | Why it doesn't fit |
+|---|---|
+| **Design decisions** | Requires human judgment about trade-offs — no check can validate "is this the right architecture?" |
+| **Tasks requiring multi-step reasoning across iterations** | Each iteration starts fresh — the agent can't "continue where it left off" from memory, only from what's on disk |
+| **One-shot tasks** | If the task takes 5 minutes and you won't repeat it, just chat with the agent — the loop setup overhead isn't worth it |
+| **Tasks with no automated validation** | Without checks, there's no self-healing — the agent may compound errors across iterations |
+| **Creative writing** | Prose quality is subjective; no check can validate "is this well-written?" |
+| **Interacting with external services** | API calls, deployments, and messages are hard to undo if the agent makes a mistake |
+
+## Loop vs. single conversation
+
+Use a **single conversation** when:
+
+- The task will take less than 10-15 minutes
+- You want to iterate interactively with the agent
+- The task requires back-and-forth discussion
+- You need to make subjective decisions along the way
+
+Use a **ralph loop** when:
+
+- The task involves many similar, independent steps
+- You want the agent to work autonomously without your attention
+- You have tests or checks that can validate correctness
+- The task would fill up a conversation's context window
+- You want to walk away and come back to completed work
+
+## Making a borderline task work
+
+Some tasks seem like they don't fit but can be adapted:
+
+**"There's no automated check for this."** Write one. Even a simple script that checks for obvious problems (file exists, no syntax errors, word count above threshold) catches the worst failures. You can always add a more thorough check later.
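+
+As a rough sketch of such a check, the Python script below reports a missing file, a Python syntax error, or a document under a word-count threshold. The names `find_problems` and `main` and the 50-word default are illustrative assumptions, not part of ralphify; how the script is registered as a check depends on your setup.
+
+```python
+import ast
+import sys
+from pathlib import Path
+
+
+def find_problems(path: Path, min_words: int = 50) -> list[str]:
+    """Return a list of obvious problems with one file (empty list means OK)."""
+    if not path.is_file():
+        return [f"{path}: file does not exist"]
+    text = path.read_text(encoding="utf-8")
+    if path.suffix == ".py":
+        # Python sources: only verify that the file parses.
+        try:
+            ast.parse(text)
+        except SyntaxError as exc:
+            return [f"{path}: syntax error on line {exc.lineno}"]
+        return []
+    # Everything else is treated as prose and checked for a minimum length.
+    words = len(text.split())
+    if words < min_words:
+        return [f"{path}: only {words} words, expected at least {min_words}"]
+    return []
+
+
+def main(argv: list[str]) -> int:
+    """Check every path in argv; return 0 if all pass, 1 otherwise."""
+    problems = [p for arg in argv for p in find_problems(Path(arg))]
+    for problem in problems:
+        print(problem, file=sys.stderr)
+    return 1 if problems else 0
+```
+
+Ending the script with `raise SystemExit(main(sys.argv[1:]))` turns the return value into the process exit code, so it can run as a command that exits 0 or non-zero.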
+
+**"The task requires multi-step reasoning."** Use a `PLAN.md` or `TODO.md` file as the coordination mechanism. Each iteration, the agent reads the plan and marks the steps it completed as done; the next iteration continues from there. The plan file *is* the agent's memory.
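+
+A minimal sketch of the plan-file idea, assuming `PLAN.md` uses Markdown task-list checkboxes (`- [ ]` and `- [x]`). That format and the helper names `next_open_step` and `mark_done` are assumptions made for illustration; in a real loop the agent edits the file itself, and helpers like these would mainly be useful in a check or wrapper script.
+
+```python
+import re
+from typing import Optional
+
+# Matches "- [ ] text" and "- [x] text"; the checkbox format is an assumption.
+TASK = re.compile(r"^\s*[-*] \[([ xX])\] (.+)$")
+
+
+def next_open_step(plan_text: str) -> Optional[str]:
+    """Return the first unchecked step, or None when every step is done."""
+    for line in plan_text.splitlines():
+        match = TASK.match(line)
+        if match and match.group(1) == " ":
+            return match.group(2).strip()
+    return None
+
+
+def mark_done(plan_text: str, step: str) -> str:
+    """Return the plan with the first unchecked line for `step` checked off."""
+    lines = plan_text.splitlines()
+    for i, line in enumerate(lines):
+        match = TASK.match(line)
+        if match and match.group(1) == " " and match.group(2).strip() == step:
+            lines[i] = line.replace("[ ]", "[x]", 1)
+            break
+    return "\n".join(lines) + "\n"
+```
+
+Because the state lives in the file, any iteration can call `next_open_step` on the current `PLAN.md` contents and learn where the previous one stopped, with no conversation history involved.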
+
+**"Each iteration depends on the previous one."** That's fine — the agent reads the codebase, which includes all previous iterations' commits. As long as progress is visible on disk, the fresh context model works. The agent doesn't need conversation memory when the code tells the story.
+
+**"I need to review the agent's work before it continues."** Use `-n 1` to run single iterations, review, then run again. Or use `--stop-on-error` with a check that passes only after your sign-off, for example one that tests for a file you manually create or delete between iterations.
+
+## How many iterations?
+
+- **Start with `-n 3`** to verify your setup works and the agent produces useful output
+- **Use `-n 10` to `-n 20`** for bounded tasks (a TODO list with known items)
+- **Run unlimited** (`ralph run` without `-n`) for open-ended improvement tasks with good checks — the checks prevent the agent from going off the rails
+- **Use `--stop-on-error`** when each iteration must succeed before the next one makes sense
+
+## Next steps
+
+- [Getting Started](getting-started.md) — set up your first loop
+- [Writing Prompts](writing-prompts.md) — patterns for effective autonomous loop prompts
+- [Cookbook](cookbook.md) — copy-pasteable setups for common workflows
diff --git a/mkdocs.yml b/mkdocs.yml
@@ -88,6 +88,7 @@ extra:
 nav:
   - Home: index.md
   - Guide:
+    - When to Use: when-to-use.md
     - Getting Started: getting-started.md
     - How it Works: how-it-works.md
     - Writing Prompts: writing-prompts.md