Skip to content

Add AWB to 8.2 Benchmarks > Integrated Benchmarks#260

Open
xmpuspus wants to merge 1 commit into
codefuse-ai:mainfrom
xmpuspus:add-awb-benchmark
Open

Add AWB to 8.2 Benchmarks > Integrated Benchmarks#260
xmpuspus wants to merge 1 commit into
codefuse-ai:mainfrom
xmpuspus:add-awb-benchmark

Conversation

@xmpuspus
Copy link
Copy Markdown

Adds AWB (AI Workflow Benchmark) to section 8.2 Benchmarks > Integrated Benchmarks.

AWB is an open-source benchmark suite that evaluates AI coding workflows on 100 tasks across 8 categories (bug-fix, feature-addition, refactoring, code-review, debugging, multi-file, legacy-code, workflow) using real OSS repositories pinned at commit SHAs — not synthetic snippets.

Inserted chronologically after OmniCode [2026-02] at [2026-04].

AWB (AI Workflow Benchmark) evaluates AI coding workflows on 100 tasks
across 8 categories using real OSS repositories pinned at commit SHAs.
Scored across 7 capability dimensions; ships 9 adapters (Claude Code,
Cursor, Aider, Gemini CLI, Codex CLI, Windsurf, Copilot, Pi).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant