Skip to content

Commit cc58d82

Browse files
committed
add roadmap
1 parent 559c8b3 commit cc58d82

File tree

2 files changed

+32
-1
lines changed

2 files changed

+32
-1
lines changed

CLAUDE.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# AgentLens
22

3-
Harness for running multi-session Claude Code experiments and capturing trajectories in ATIF format. Built for agent interpretability research.
3+
Harness for running multi-session Claude Code experiments and capturing trajectories in ATIF format. Built for AI alignment and interpretability research.
44

55
## Project structure
66

docs/guide/roadmap.md

Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,31 @@
1+
# Roadmap
2+
3+
AgentLens currently supports Claude Code via the Claude Agent SDK. Here's where we're headed.
4+
5+
## Planned
6+
7+
### Multi-agent support
8+
9+
Extend beyond Claude Code to support other agent frameworks and LLM providers — Codex, Devin, custom agents, and any tool-using LLM system. The core trajectory format (ATIF) and analysis tooling are already agent-agnostic; the main work is adding new adapters and runner integrations.
10+
11+
### Comparative analysis
12+
13+
Side-by-side trajectory comparison across agents, models, and prompt variants. Run the same experiment config against different agents and compare behavior, tool usage patterns, and outcomes.
14+
15+
### Richer intervention toolkit
16+
17+
Programmatic intervention pipelines for systematic counterfactual testing. Build on the existing edit & resample workflow with scriptable intervention sequences, batch processing, and automated hypothesis testing.
18+
19+
### Scoring & evaluation
20+
21+
Built-in trajectory scoring and automated evaluation metrics. Define custom scorers that run against captured trajectories to measure agent behavior quantitatively.
22+
23+
## Contributing
24+
25+
We welcome PRs and contributions! Whether it's bug fixes, new features, documentation improvements, or support for additional agent frameworks — all contributions are appreciated.
26+
27+
If you're interested in adding support for a new agent framework, the key integration points are:
28+
29+
- **Runner** (`src/harness/runner.py`) — session execution loop
30+
- **Adapter** (`src/harness/atif_adapter.py`) — message-to-ATIF step mapping
31+
- **Proxy** (`src/harness/proxy.py`) — API request capture (optional)

0 commit comments

Comments
 (0)