Skip to content

Commit f0c077f

Browse files
docs: add least-privilege deployment roles and deployment guide
Add DEPLOYMENT_ROLES.md with least-privilege IAM policies for the CloudFormation execution role (IaCRole-AgentCore and IaCRole-Fargate), derived from analysis of all CDK constructs and handler code. Add DEPLOYMENT_GUIDE.md covering deployment choices, step-by-step setup, scale-to-zero analysis, and complete AWS services inventory. Update COST_MODEL.md with accurate VPC endpoint count (11, not 7), additional always-on costs (WAF, dashboard, alarms, secrets), and scale-to-zero characteristics section. Baseline updated from ~$85-90 to ~$118-122/month. Update ARCHITECTURE.md cross-reference table with new documents. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent 574b205 commit f0c077f

4 files changed

Lines changed: 1034 additions & 2 deletions

File tree

docs/design/ARCHITECTURE.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -202,6 +202,9 @@ Each concept has a **source-of-truth document** and one or more documents that r
202202
| Adaptive model router | ROADMAP.md (Iter 5) | COST_MODEL.md |
203203
| Capability-based security | ROADMAP.md (Iter 5) | SECURITY.md |
204204
| Live session replay | ROADMAP.md (Iter 4) | API_CONTRACT.md |
205+
| Deployment roles and least-privilege IaC | DEPLOYMENT_ROLES.md | SECURITY.md, COST_MODEL.md, Deployment guide |
206+
| Scale-to-zero and idle cost analysis | Deployment guide (Scale-to-zero analysis) | COST_MODEL.md, NETWORK_ARCHITECTURE.md |
207+
| AWS services inventory | Deployment guide (AWS services inventory) | COMPUTE.md, NETWORK_ARCHITECTURE.md |
205208

206209
### Per-repo model selection
207210

docs/design/COST_MODEL.md

Lines changed: 14 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,12 +11,22 @@ These costs are incurred regardless of task volume:
1111
| Component | Estimated cost | Notes |
1212
|---|---|---|
1313
| NAT Gateway (1×) | ~$32/month | Fixed hourly cost + data processing. Single AZ (see [NETWORK_ARCHITECTURE.md](./NETWORK_ARCHITECTURE.md)). |
14-
| VPC Interface Endpoints () | ~$50/month | $0.01/hr per endpoint per AZ. |
14+
| VPC Interface Endpoints (11×, 2 AZs) | ~$77/month | $0.01/hr per endpoint per AZ. 7 base + 4 Fargate-specific (ECS, ECS Agent, ECS Telemetry, Step Functions). |
1515
| VPC Flow Logs | ~$3/month | CloudWatch ingestion. |
16+
| WAF v2 Web ACL | ~$5/month | Base monthly charge for web ACL + managed rule groups. |
17+
| CloudWatch Dashboard | ~$3/month | Per-dashboard charge. |
1618
| DynamoDB (on-demand, idle) | ~$0/month | Pay-per-request; no cost when idle. |
19+
| Secrets Manager | ~$0.40/secret/month | Per-secret monthly charge. |
20+
| CloudWatch Alarms | ~$0.10/alarm/month | Per standard alarm. |
1721
| CloudWatch Logs retention | ~$1–5/month | Depends on log volume. 90-day retention. |
1822
| API Gateway (idle) | ~$0/month | Pay-per-request. |
19-
| **Total baseline** | **~$85–90/month** | |
23+
| **Total baseline** | **~$118–122/month** | |
24+
25+
If deploying AgentCore only (no Fargate stack), the 4 Fargate-specific VPC endpoints can be removed, reducing the baseline to ~$90–95/month.
26+
27+
### Scale-to-zero characteristics
28+
29+
Most platform components are fully serverless and incur zero cost when idle: DynamoDB (PAY_PER_REQUEST), Lambda, API Gateway, Step Functions, ECS Fargate (cluster is free), AgentCore Runtime (per-session), Bedrock (per-token), and Cognito (free tier). The always-on cost floor (~$118/month) is dominated by VPC networking infrastructure (NAT Gateway + interface endpoints) which is required for private subnet connectivity to AWS services and GitHub. See the [Deployment guide](../guides/DEPLOYMENT_GUIDE.md) for the full scale-to-zero breakdown.
2030

2131
## Per-task variable costs
2232

@@ -85,6 +95,8 @@ For multi-user deployments, cost should be attributable to individual users and
8595

8696
## Reference
8797

98+
- [Deployment guide](../guides/DEPLOYMENT_GUIDE.md) — Deployment choices, scale-to-zero analysis, AWS services inventory.
99+
- [DEPLOYMENT_ROLES.md](./DEPLOYMENT_ROLES.md) — Least-privilege IAM policies for deployment.
88100
- [NETWORK_ARCHITECTURE.md](./NETWORK_ARCHITECTURE.md) — VPC infrastructure cost breakdown.
89101
- [ORCHESTRATOR.md](./ORCHESTRATOR.md) — Polling cost analysis.
90102
- [COMPUTE.md](./COMPUTE.md) — Compute option billing models.

0 commit comments

Comments
 (0)