Show scoring breakdown in recommendation output#96

Merged
that-github-user merged 1 commit into main from issue-62-scoring-breakdown on Mar 28, 2026
@that-github-user

Owner

Summary

  • recommend() returns per-agent score breakdowns: testPoints, convergencePoints, diffSizePoints, total
  • Display shows scoring table after recommendation so users can understand and trust the pick
  • Updated EnsembleResult type with scores field
  • 2 new tests for score breakdown
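As a rough illustration of the summary above, the sketch below shows one way a per-agent breakdown could be computed and returned alongside the pick. Only the four score field names and the `{ recommended, scores }` return shape come from this PR; the `agentId` field, the `AgentSummary` input shape, and all point weights are assumptions made for the example, not the values thinktank actually uses.

```typescript
// Field names testPoints, convergencePoints, diffSizePoints, total are from the PR.
// agentId is an assumed identifier field.
interface ScoreBreakdown {
  agentId: number;
  testPoints: number;
  convergencePoints: number;
  diffSizePoints: number;
  total: number;
}

// Hypothetical input shape, for illustration only.
interface AgentSummary {
  id: number;
  testsPassed: boolean;
  convergence: number; // fraction between 0 and 1
  linesChanged: number;
}

interface Recommendation {
  recommended: number | null;
  scores: ScoreBreakdown[];
}

// All weights below are invented for this sketch.
function scoreAgent(a: AgentSummary): ScoreBreakdown {
  const testPoints = a.testsPassed ? 100 : 0;
  const convergencePoints = Math.round(a.convergence * 50);
  // Smaller diffs earn more points, never dropping below zero.
  const diffSizePoints = Math.max(0, 20 - Math.floor(a.linesChanged / 50));
  return {
    agentId: a.id,
    testPoints,
    convergencePoints,
    diffSizePoints,
    total: testPoints + convergencePoints + diffSizePoints,
  };
}

function recommend(agents: AgentSummary[]): Recommendation {
  const scores = agents.map(scoreAgent);
  // Pick the highest total; the earliest agent wins ties.
  const best = scores.reduce<ScoreBreakdown | null>(
    (top, s) => (top === null || s.total > top.total ? s : top),
    null,
  );
  return { recommended: best ? best.agentId : null, scores };
}
```

Returning every agent's breakdown, not only the winner's, is what lets the display explain why the other agents were not picked.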

Generated by thinktank Opus — 5 agents, 81% convergence (strong consensus), all pass, all changed same 5 files. Agent #1 recommended (+120/-21).

Change type

  • New feature

Related issue

Closes #62

How to test

npm test  # 82 tests pass
npx tsx src/cli.ts run "trivial task" -n 2
# Output should show Scoring section with per-agent breakdown
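The exact layout of the Scoring section is not shown in this PR. The sketch below is one plausible way to render it as an aligned text table; the function name `formatScoreTable`, the column titles, the `agentId` field, and the `*` marker for the recommended agent are all assumptions for illustration.

```typescript
// Score field names follow the PR summary; agentId is an assumed identifier.
interface ScoreBreakdown {
  agentId: number;
  testPoints: number;
  convergencePoints: number;
  diffSizePoints: number;
  total: number;
}

// Hypothetical renderer: one header row plus one row per agent.
function formatScoreTable(
  scores: ScoreBreakdown[],
  recommended: number | null,
): string {
  const header = ["Agent", "Tests", "Convergence", "Diff size", "Total"];
  const rows = scores.map((s) => [
    `#${s.agentId}${s.agentId === recommended ? " *" : ""}`,
    String(s.testPoints),
    String(s.convergencePoints),
    String(s.diffSizePoints),
    String(s.total),
  ]);
  // Each column is as wide as its longest cell, header included.
  const widths = header.map((h, i) =>
    Math.max(h.length, ...rows.map((r) => r[i].length)),
  );
  const line = (cells: string[]) =>
    cells.map((c, i) => c.padEnd(widths[i])).join("  ").trimEnd();
  return ["Scoring", line(header), ...rows.map(line)].join("\n");
}
```

Calling `console.log(formatScoreTable(scores, recommended))` after the recommendation line would print the table directly beneath the pick.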

Breaking changes

  • This PR introduces breaking changes

recommend() return type changed from number | null to { recommended: number | null; scores: ScoreBreakdown[] }. Internal only — no external consumers.
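For internal callers, the change means destructuring the result instead of using it as a bare number. The sketch below shows how a call site might adapt; `describePick` and the `agentId` field are hypothetical names introduced for this example, and only the two return types are taken from the description above.

```typescript
// Score field names follow the PR summary; agentId is an assumed identifier.
interface ScoreBreakdown {
  agentId: number;
  testPoints: number;
  convergencePoints: number;
  diffSizePoints: number;
  total: number;
}

// Old return type: number | null
// New return type, as stated in this PR:
interface RecommendResult {
  recommended: number | null;
  scores: ScoreBreakdown[];
}

// Hypothetical caller. Before the change it would have received the agent
// number directly; now it reads `recommended` and can also use `scores`.
function describePick(result: RecommendResult): string {
  const { recommended, scores } = result;
  if (recommended === null) {
    return "No agent recommended";
  }
  const winner = scores.find((s) => s.agentId === recommended);
  return winner
    ? `Agent #${recommended} recommended (total ${winner.total})`
    : `Agent #${recommended} recommended`;
}
```

The `null` check stays the same as before; it simply moves from the return value itself to its `recommended` property.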

🤖 Generated with thinktank (Opus)

recommend() now returns per-agent score breakdowns (testPoints,
convergencePoints, diffSizePoints, total). Display shows scoring table
after recommendation so users understand why an agent was picked.

Generated by thinktank Opus (5 agents, 81% convergence, all pass).

Closes #62

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@that-github-user

Owner Author

Self-review: 5/5 agents at 81% strong consensus. All changed same 5 files. recommend() return type updated, display shows scoring table, 2 new tests. 82 tests pass, CI green.

that-github-user merged commit 12d13a0 into main on Mar 28, 2026
4 checks passed
that-github-user deleted the issue-62-scoring-breakdown branch on March 28, 2026 19:48
Development

Successfully merging this pull request may close these issues.

Show scoring breakdown in recommendation output so users can trust the result