What's shipped and what's coming next.
Onboarding and customization
Blueprint construct - Per-repo CDK configuration (model, turns, budget, prompt overrides, egress, GitHub token)
Repo-level project config - Agent loads CLAUDE.md, .claude/rules/, .claude/settings.json, .mcp.json
Per-repo overrides - Model ID, max turns, max budget, system prompt overrides, poll interval, dedicated token
Planned capabilities, grouped by theme. Items are independent and may ship in any order.
Credentials and authorization
Capability
Description
Per-repo GitHub credentials
GitHub App per org/repo via AgentCore Token Vault. Auto-refresh for long sessions. Sets the pattern for GitLab, Jira, Slack integrations.
Principal-to-repo authorization
Map Cognito identities to allowed repository sets. Users can only trigger work on authorized repos.
Capability
Description
Tiered validation pipeline
Three post-agent tiers: tool validation (build/test/lint), code quality (DRY/SOLID/complexity), risk and blast radius analysis.
PR risk classification
Rule-based risk classifier at submission. Drives model selection, budget defaults, approval requirements.
Review feedback memory loop
Capture PR review comments via webhook, extract rules via LLM, persist as searchable memory.
PR outcome tracking
Track merge/reject via GitHub webhooks. Positive/negative signals feed evaluation and memory.
Evaluation pipeline
Failure categorization, memory effectiveness metrics (merge rate, revision cycles, CI pass rate).
Capability
Description
Trust-aware retrieval
Weight memories by freshness, source type, pattern consistency.
Temporal decay
Configurable per-entry TTL with faster decay for unverified content.
Anomaly detection
CloudWatch metrics on write patterns; alarms for burst writes or suspicious content.
Quarantine and rollback
Operator API for isolating suspicious entries and restoring pre-task snapshots.
Write-ahead validation
Route proposed memory writes through a guardian model.
Channels and integrations
Capability
Description
Multi-modal input
Accept images in task payload (screenshots, UI mockups, design specs).
Additional git providers
GitLab (and optionally Bitbucket). Same workflow, provider-specific API adapters.
Slack integration
Submit tasks, check status, receive notifications from Slack. Block Kit rendering.
Control panel
Web UI: task list, task detail with logs/traces, cancel, metrics dashboards, cost attribution.
Real-time event streaming
WebSocket API for live task updates. Replaces polling for CLI, control panel, Slack.
Capability
Description
Adaptive model router
Per-turn model selection by complexity. Cheaper models for reads, Opus for complex reasoning. ~30-40% cost reduction.
Alternative compute
ECS/Fargate or EKS via ComputeStrategy interface. For workloads exceeding AgentCore's 2 GB image limit or requiring GPU.
Environment pre-warming
Pre-build container layers per repo. Snapshot-on-schedule (rebuild on push). Cold start from minutes to seconds.
Capability
Description
Multi-user and teams
Team visibility, shared approval queues, team concurrency/cost budgets, memory isolation.
Agent swarm
Planner-worker architecture for complex multi-file tasks. DAG of subtasks, merge orchestrator, one consolidated PR.
Iterative feedback
Follow-up instructions to running tasks. Multiple users inject context. Per-prompt commit attribution.
Scheduled triggers
Cron-based task creation via EventBridge (dependency updates, nightly flaky test checks).
Capability
Description
CDK constructs library
Publish reusable constructs to Construct Hub with semver versioning.
Centralized policy framework
Unified Cedar-based framework with PolicyDecisionEvent audit schema. Three enforcement modes with observe-before-enforce rollout.
Formal verification
TLA+ specification of task state machine, concurrency, cancellation races, reconciler interleavings.
Design docs to keep in sync: ARCHITECTURE.md , ORCHESTRATOR.md , API_CONTRACT.md , INPUT_GATEWAY.md , REPO_ONBOARDING.md , MEMORY.md , OBSERVABILITY.md , COMPUTE.md , SECURITY.md , EVALUATION.md .