Add Mutagent: eval-driven prompt optimization by bene-mutagent · Pull Request #25 · ai-boost/awesome-harness-engineering

bene-mutagent · 2026-05-12T14:29:33Z

Adds Mutagent to the Evals & Verification section.

What it is: Open-source (MIT) Claude Code plugin + bare Agent Skills bundle that gives AI engineers an eval-driven loop for prompt and agent optimization: capture traces → curate datasets → score against rubrics → optimize prompts against measurable targets.

Why it fits: Existing Evals entries (DeepEval, Inspect AI, tau-bench, SWE-bench) cover scoring infrastructure and benchmarks; Mutagent is orthogonal — it closes the loop from existing-framework traces (Mastra/LangChain/LangGraph/Vercel AI SDK) to dataset curation, rubric scoring, and prompt optimization against measurable targets. Vendor-agnostic by principle: the pattern (trace → dataset → rubric → optimize) generalizes regardless of which adapter is wired.

Credibility: 9.6k npm downloads/30d on @mutagent/cli. Framework adapters for Mastra, LangChain, LangGraph, Vercel AI SDK. Runtime-agnostic (works in Claude Code, Cursor, Aider, Continue).

Source: https://github.com/mutagent-io/skills
Docs: https://docs.mutagent.io
Install: npm install -g @mutagent/cli && mutagent skills install

Happy to revise placement/wording if it's a better fit elsewhere.

Add Mutagent: eval-driven prompt optimization

bbf3d08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Mutagent: eval-driven prompt optimization#25

Add Mutagent: eval-driven prompt optimization#25
bene-mutagent wants to merge 1 commit into
ai-boost:mainfrom
bene-mutagent:add-mutagent

bene-mutagent commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bene-mutagent commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant