Hermes Agent on Amazon Bedrock AgentCore

English | 中文

Deploy Hermes Agent on Amazon Bedrock AgentCore — per-user Firecracker microVMs with automatic scaling, native Bedrock Claude models, and multi-channel messaging.

Architecture

Telegram / Slack / Discord / Feishu          WeChat / Feishu (long-lived)
         │                                          │
    API Gateway                              ECS Fargate Gateway
         │                                          │
    Router Lambda ──→ AgentCore Runtime ←── AgentCore Proxy
                           │
                    ┌──────────────┐
                    │  main.py     │  AgentCore entrypoint
                    │  Hermes Agent│  40+ tools, skills, memory
                    │  Bedrock API │  Claude via SigV4 auth
                    └──────────────┘

Key Features

Per-user isolation — Firecracker microVMs, one per session
Serverless — No servers to manage, auto-scaling, pay-per-use
Multi-channel — Telegram, Slack, Discord, Feishu (Lark) via webhook; WeChat via ECS gateway
Native Bedrock — Claude models via SigV4 auth (no API keys needed)
Infrastructure as Code — 9 CDK stacks, four-phase deployment
Persistent state — S3-backed workspace for memory and sessions

How It Works

Hermes Agent runs unmodified inside AgentCore containers. The key integration is a monkey-patch in app/hermes/main.py that transparently replaces anthropic.Anthropic with anthropic.AnthropicBedrock, routing all API calls through Bedrock with SigV4 authentication. This means:

No Anthropic API key needed
All requests use AWS IAM credentials
Model access governed by Bedrock policies
Hermes Agent source code remains unchanged

Project Structure

├── app/hermes/              # Runtime application
│   ├── main.py              # AgentCore entrypoint (Anthropic → Bedrock monkey-patch)
│   ├── Dockerfile           # Multi-stage container build
│   ├── entrypoint.sh        # Workspace initialization
│   └── pyproject.toml       # Python dependencies
├── bridge/                  # AgentCore contract bridge
│   ├── contract.py          # HTTP server (/ping, /invocations)
│   ├── workspace_sync.py    # S3 ↔ SQLite sync
│   └── bedrock_provider.py  # Bedrock model configuration
├── gateway/                 # ECS Fargate gateway (Phase 4)
│   ├── main.py              # Gateway entry point
│   ├── agentcore_proxy.py   # AIAgent → AgentCore proxy (monkey-patch)
│   ├── weixin_file_patch.py # Auto-convert long text to .md files for WeChat
│   ├── healthcheck.py       # ECS health-check HTTP server
│   └── Dockerfile           # Gateway container image
├── lambda/                  # AWS Lambda functions
│   ├── router/              # Channel webhook → AgentCore dispatcher
│   ├── cron/                # Scheduled task execution
│   └── token_metrics/       # Usage tracking
├── stacks/                  # CDK stack definitions
│   ├── vpc_stack.py         # VPC, subnets, NAT Gateway
│   ├── security_stack.py    # KMS, Secrets Manager, Cognito
│   ├── guardrails_stack.py  # Bedrock Guardrails
│   ├── agentcore_stack.py   # IAM roles, S3 workspace bucket
│   ├── observability_stack.py   # CloudWatch dashboards & alarms
│   ├── router_stack.py      # API Gateway + Router Lambda
│   ├── gateway_stack.py     # ECS Fargate gateway (WeChat + Feishu)
│   ├── cron_stack.py        # Scheduled invocations
│   └── token_monitoring_stack.py # Token usage analytics
├── scripts/
│   └── deploy.sh            # Three-phase deployment orchestrator
├── docs/                    # Documentation
├── hermes-agent/            # Git submodule (hermes-agent source)
├── app.py                   # CDK entry point
├── cdk.json                 # CDK configuration
└── requirements.txt         # Python CDK dependencies

Prerequisites

AWS Account with Bedrock model access enabled (Claude Sonnet/Opus)
AWS CLI configured with credentials
Node.js >= 18 (for AWS CDK)
Python >= 3.10
Docker (for container builds)
AgentCore CLI: npm install -g @aws/agentcore

Deployment

Quick Start

# Clone
git clone https://github.com/aws-samples/sample-host-hermesagent-on-amazon-bedrock-agentcore.git
cd sample-host-hermesagent-on-amazon-bedrock-agentcore

# Setup Python environment
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Install CDK
npm install

# Deploy all three phases
./scripts/deploy.sh all

Phase-by-Phase

# Phase 1: Foundation (VPC, security, guardrails, IAM)
./scripts/deploy.sh phase1

# Phase 2: Build & deploy Hermes Agent container to AgentCore
./scripts/deploy.sh phase2

# Phase 3: Router Lambda, cron, monitoring
./scripts/deploy.sh phase3

# Phase 4 (optional): ECS Gateway for WeChat + Feishu long-lived connections
./scripts/deploy.sh phase4

Phase 4: ECS Gateway (Optional)

Phase 4 deploys an ECS Fargate gateway that runs the hermes-agent platform adapters (WeChat long-poll, Feishu WebSocket) in a persistent container. All AI inference is forwarded to AgentCore via AgentCoreProxyAgent — the gateway handles only platform protocols.

When you need Phase 4:

WeChat — requires persistent long-poll connection (via iLink Bot API)
Feishu WebSocket — lower latency than webhook mode

What it deploys:

ECR repository for the gateway container image
ECS Fargate cluster + service (single task, auto-restart)
VPC networking, security groups, IAM roles
Secrets Manager integration for platform credentials

Configure platform credentials after deployment:

# WeChat
aws secretsmanager put-secret-value --secret-id hermes/weixin/token --secret-string 'YOUR_TOKEN'

# Feishu
aws secretsmanager put-secret-value --secret-id hermes/feishu/app-id --secret-string 'YOUR_APP_ID'
aws secretsmanager put-secret-value --secret-id hermes/feishu/app-secret --secret-string 'YOUR_SECRET'

Features:

Long text auto-converted to .md files for WeChat delivery (configurable threshold via WEIXIN_FILE_THRESHOLD)
Conversation history forwarded to AgentCore for multi-turn context
Cold start retry with exponential backoff (5s → 10s → 20s)
Health check endpoint at http://localhost:8080/health

Invocation

AgentCore CLI

# Single message
agentcore invoke "Hello, who are you?" --stream --runtime hermes

# Multi-turn conversation
agentcore invoke "My name is Steven" --stream --runtime hermes --session-id s001
agentcore invoke "What's my name?" --stream --runtime hermes --session-id s001

Python SDK (boto3)

import boto3, json

RUNTIME_ARN = "arn:aws:bedrock-agentcore:us-west-2:ACCOUNT_ID:runtime/YOUR_RUNTIME_ID"
client = boto3.client("bedrock-agentcore", region_name="us-west-2")

response = client.invoke_agent_runtime(
    agentRuntimeArn=RUNTIME_ARN,
    payload=json.dumps({"prompt": "Hello!"}).encode("utf-8"),
)

result = response.get("response", "")
if hasattr(result, "read"):
    result = result.read().decode("utf-8")
print(result)

See docs/INVOKE_GUIDE.md for AWS CLI, JavaScript SDK, and HTTP API examples.

Channel Integration

Discord

Create Discord Application at Developer Portal
Set Interactions Endpoint URL to your API Gateway webhook endpoint
Register /ask slash command
Add users to DynamoDB allowlist

See docs/DISCORD_SETUP.md for step-by-step instructions.

Telegram / Slack

Configure webhooks after Phase 3 deployment:

./scripts/setup_telegram.sh
./scripts/setup_slack.sh

WeChat (Phase 4 — ECS Gateway)

WeChat uses the iLink Bot API (ilinkai.weixin.qq.com) for personal WeChat accounts. It requires a persistent long-poll connection and is only available via the Phase 4 ECS gateway.

Step 1: Obtain WeChat Token

The token is obtained through an interactive QR code login — not a static API key.

Run the hermes-agent gateway setup locally:

cd ~/hermes-agent
pip install -e ".[messaging]"
hermes gateway setup

Select Weixin when prompted. The setup will:

Request a QR code from iLink Bot API
Display the QR code in your terminal
Scan the QR code with your WeChat app and confirm on your phone
Return credentials: WEIXIN_ACCOUNT_ID and WEIXIN_TOKEN

The credentials are saved to ~/.hermes/.env. You can also find them in ~/.hermes/weixin/accounts/{account_id}.json.

Note: The token is a session token, not a permanent key. If the session expires, you will need to re-scan the QR code.

Step 2: Deploy and Configure

# Deploy Phase 4
./scripts/deploy.sh phase4

# Store credentials in Secrets Manager
aws secretsmanager put-secret-value \
  --secret-id hermes/weixin/token \
  --secret-string 'TOKEN_FROM_STEP_1'

aws secretsmanager put-secret-value \
  --secret-id hermes/weixin/account-id \
  --secret-string 'ACCOUNT_ID_FROM_STEP_1'

Optional Environment Variables

Variable	Default	Description
`WEIXIN_DM_POLICY`	`open`	DM authorization: `open`, `allowlist`, `disabled`, `pairing`
`WEIXIN_ALLOWED_USERS`	(empty)	Comma-separated allowed user IDs (for `allowlist` mode)
`WEIXIN_GROUP_POLICY`	`disabled`	Group chat policy: `open`, `allowlist`, `disabled`
`WEIXIN_FILE_THRESHOLD`	`2000`	Auto-convert text to `.md` file above this character count

Feishu (Lark)

Feishu supports two connection modes:

	Webhook (Phase 3)	WebSocket (Phase 4)
Connection	Feishu POST → API Gateway → Lambda	ECS container → Feishu WebSocket
Public URL required	Yes	No
Latency	Higher (Lambda cold start)	Lower (persistent connection)
Setup complexity	Event subscription URL config	Just App ID + Secret

Run the interactive setup script:

./scripts/setup_feishu.sh             # Interactive mode selection
./scripts/setup_feishu.sh webhook     # Webhook mode (Phase 3)
./scripts/setup_feishu.sh websocket   # WebSocket mode (Phase 4, recommended)

The script will guide you through: creating the app, storing credentials, verifying connectivity, configuring the chosen mode, and adding users to the allowlist.

See docs/FEISHU_SETUP.md for detailed manual instructions.

Configuration

Key settings in cdk.json:

Setting	Default	Description
`default_model_id`	`global.anthropic.claude-opus-4-6-v1`	Primary Bedrock model
`enable_guardrails`	`false`	Bedrock Guardrails toggle
`session_idle_timeout`	`1800`	Session idle timeout (seconds)
`daily_token_budget`	`2000000`	Daily token limit
`daily_cost_budget_usd`	`20`	Daily cost cap (USD)

Cost Estimate (10 active users)

Component	Monthly
AgentCore Runtime	$50–150
Bedrock Claude	$100–500
VPC + NAT	$30–45
Lambda + API GW + DynamoDB	$15–25
ECS Fargate Gateway (Phase 4)	$15–30
S3 + Secrets + CloudWatch	$10–20
Total	~$220–770

Documentation

Document	Description
ARCHITECTURE.md	System design and component interactions
DEPLOYMENT_GUIDE.md	Step-by-step setup and troubleshooting
INVOKE_GUIDE.md	All invocation methods (CLI, SDK, HTTP)
DISCORD_SETUP.md	Discord bot configuration
FEISHU_SETUP.md	Feishu (Lark) bot configuration
AGENTCORE_CONTRACT.md	HTTP contract protocol details

Reference

Based on the patterns from sample-host-openclaw-on-amazon-bedrock-agentcore.

License

This project is provided as a sample deployment guide. Hermes Agent is developed by Nous Research.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hermes Agent on Amazon Bedrock AgentCore

Architecture

Key Features

How It Works

Project Structure

Prerequisites

Deployment

Quick Start

Phase-by-Phase

Phase 4: ECS Gateway (Optional)

Invocation

AgentCore CLI

Python SDK (boto3)

Channel Integration

Discord

Telegram / Slack

WeChat (Phase 4 — ECS Gateway)

Step 1: Obtain WeChat Token

Step 2: Deploy and Configure

Optional Environment Variables

Feishu (Lark)

Configuration

Cost Estimate (10 active users)

Documentation

Reference

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.claude		.claude
agentcore		agentcore
app/hermes		app/hermes
bridge		bridge
docs		docs
gateway		gateway
lambda		lambda
scripts		scripts
stacks		stacks
tests		tests
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README_ZH.md		README_ZH.md
app.py		app.py
cdk.json		cdk.json
package.json		package.json
requirements.txt		requirements.txt

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Hermes Agent on Amazon Bedrock AgentCore

Architecture

Key Features

How It Works

Project Structure

Prerequisites

Deployment

Quick Start

Phase-by-Phase

Phase 4: ECS Gateway (Optional)

Invocation

AgentCore CLI

Python SDK (boto3)

Channel Integration

Discord

Telegram / Slack

WeChat (Phase 4 — ECS Gateway)

Step 1: Obtain WeChat Token

Step 2: Deploy and Configure

Optional Environment Variables

Feishu (Lark)

Configuration

Cost Estimate (10 active users)

Documentation

Reference

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages