You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+97-82Lines changed: 97 additions & 82 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,20 +1,34 @@
1
1
<palign="center">
2
-
<imgsrc="docs/images/hero.png"alt="Bug Hunter — Adversarial Bug Finding for Coding Agents"width="720">
2
+
<imgsrc="docs/images/hero.png"alt="Bug Hunter — AI-powered adversarial code review and security vulnerability scanner for Claude Code, Cursor, Codex, Copilot, Windsurf, and Kiro"width="720">
3
3
</p>
4
4
5
5
<h1align="center">Bug Hunter</h1>
6
-
<palign="center"><strong>Three AI agents argue about your code. Only bugs that survive all three make the report.</strong></p>
6
+
<palign="center"><strong>AI code review that argues with itself — adversarial multi-agent bug finding, security scanning, and auto-fix for any coding agent.</strong></p>
Bug Hunter is an open-source AI code review and security vulnerability scanner that works as a skill/plugin for AI coding agents. Three AI agents — a Hunter, a Skeptic, and a Referee — independently analyze your code in an adversarial pipeline. The Hunter finds bugs. The Skeptic tries to disprove them. The Referee delivers the final verdict. Only bugs that survive all three stages make the report, eliminating the false positive overload that plagues other AI code review tools.
27
+
28
+
It then auto-fixes confirmed bugs with a safe canary rollout pipeline — git branching, test baselines, per-fix commits, automatic rollback on failure, and post-fix re-scanning.
29
+
30
+
---
31
+
18
32
## Install
19
33
20
34
```bash
@@ -29,158 +43,161 @@ bug-hunter install # auto-detects your IDE/agent
> **Requirements:** Node.js 18+ recommended. Core pipeline works without it.
39
53
>
40
-
> **Works with:** Claude Code, Cursor, Codex CLI, Windsurf, Kiro, Copilot, Opencode, [Pi](https://github.com/mariozechner/pi-coding-agent) — or any AI agent that can read files and run shell commands.
54
+
> **Compatible with:** Claude Code, Cursor, Codex CLI, Windsurf, Kiro, Copilot, Opencode, [Pi](https://github.com/mariozechner/pi-coding-agent) — or any AI agent that can read files and run shell commands.
3.**Hunter**— deep behavioral scan for logic errors, security vulnerabilities, race conditions
85
+
4.**Skeptic**— adversarial review that tries to *disprove* every finding with counter-evidence
86
+
5.**Referee**— independent final judge, re-reads code, delivers CVSS-scored verdicts
87
+
6.**Fixer**— applies canary-first patches with per-fix rollback safety
75
88
76
-
The adversarial debate between Hunter, Skeptic, and Referee eliminates false positives. The canary rollout with auto-revert prevents regressions from fixes.
89
+
Claims are verified against official library documentation via [Context Hub](https://github.com/andrewyng/context-hub) + [Context7](https://context7.com) before any agent asserts framework behavior.
77
90
78
91
<palign="center">
79
-
<imgsrc="docs/images/adversarial-debate.png"alt="Hunter vs Skeptic vs Referee"width="100%">
92
+
<imgsrc="docs/images/adversarial-debate.png"alt="Adversarial AI code review debate — Hunter finds bugs, Skeptic challenges with counter-evidence, Referee delivers independent verdict"width="100%">
80
93
</p>
81
94
82
-
| Agent |Earns Points For |Loses Points For |
83
-
|-------|-----------------|-----------------|
84
-
|**Hunter**|Reporting real bugs | False positives |
85
-
|**Skeptic**| Disproving false positives |Dismissing real bugs (2x penalty) |
86
-
|**Referee**| Accurate verdicts | Blind trust in either side |
95
+
| Agent |Rewarded For |Penalized For |
96
+
|-------|-------------|--------------|
97
+
|**Hunter**|Confirmed real bugs | False positives |
98
+
|**Skeptic**| Disproving false positives |Missing real bugs (2x penalty) |
99
+
|**Referee**| Accurate independent verdicts | Blind trust in either side |
87
100
88
101
---
89
102
90
-
## What It Finds
103
+
## Bugs and Vulnerabilities Detected
91
104
92
105
**Runtime behavioral bugs only** — not style, naming, or TODOs:
0 commit comments