Skip to content

Commit a325348

Browse files
tbitcsoz-agent
andcommitted
docs: README AG2 section + CHANGELOG v0.3.11 + v0.3.10 entries
- README: Added AG2 Agent Shell section with install/usage examples - README: Updated Quick Start with agent commands - README: Updated VS Code Extension features (AG2, catalog, agent tab) - README: Updated 50+ CLI commands list with agent commands - CHANGELOG: v0.3.11 entry (AG2 shell, tools, tests, timeout fix) - CHANGELOG: v0.3.10 entry (no-placeholder-requirements) Co-Authored-By: Oz <oz-agent@warp.dev>
1 parent c60a924 commit a325348

2 files changed

Lines changed: 69 additions & 1 deletion

File tree

CHANGELOG.md

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,38 @@ All notable changes to this project will be documented in this file.
55
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
66
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
77

8+
## [0.3.11] — 2026-04-22
9+
10+
### Added
11+
- **AG2 agent shell** (`src/specsmith/agents/`) — Planner/Builder/Verifier agents over Ollama.
12+
New CLI commands: `specsmith agent run/plan/status/verify/improve/reports`.
13+
Uses AG2 v0.12.0 with native Ollama tool calling. Configurable per-project via `scaffold.yml`.
14+
- **Self-improvement workflow** (`agents/workflows/improve.py`) — `specsmith agent improve <task>`
15+
runs Plan→Build→Verify, produces structured ChangeReport at `.specsmith/agent-reports/`.
16+
- **AG2 tool surface** — 12 typed tools: filesystem (pathlib, no subprocess), shell, git, tests.
17+
Replaces the old `operations.py` concept.
18+
- **Phase 0–3 documentation**`docs/baseline-audit.md`, `docs/system-proof.md`.
19+
- **23 new agent tests** (`tests/test_agent.py`) — tool registry, tool handlers, system prompt,
20+
AgentRunner init, SessionState, meta-commands, Ollama integration (live).
21+
- **tests/conftest.py** — WinError 448 pytest cleanup fix for Windows.
22+
23+
### Changed
24+
- **Ollama timeout** — 120s → 600s for completion, 300s for streaming. Fixes frequent
25+
`[Provider error] timed out` in VS Code sessions.
26+
- **AgentConfig**`effective_utility_model` (defaults to primary), `effective_max_iterations`
27+
(0 = unlimited, maps to 999).
28+
- **AGENTS.md** — AG2 four-layer architecture, 12 project rules, updated file registry.
29+
- **pyproject.toml**`ag2[ollama]` optional dependency added.
30+
31+
---
32+
33+
## [0.3.10] — 2026-04-10
34+
35+
### Fixed
36+
- No-placeholder-requirements rule added to system prompt (#69).
37+
38+
---
39+
840
## [0.3.6] — 2026-04-09
941

1042
### Added

README.md

Lines changed: 37 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -121,12 +121,45 @@ specsmith epistemic-audit --project-dir ./my-project
121121
# Start the agentic REPL
122122
specsmith run --project-dir ./my-project
123123

124+
# AG2 agent shell — Planner/Builder/Verifier over Ollama
125+
specsmith agent status # check agent config + Ollama
126+
specsmith agent plan "add logging" # plan only (no execution)
127+
specsmith agent run "fix lint errors" # full Plan → Build → Verify
128+
specsmith agent improve "add tests" # self-improvement with reports
129+
specsmith agent verify # run Verifier on current state
130+
specsmith agent reports # list improvement reports
131+
124132
# Check current AEE workflow phase
125133
specsmith phase --project-dir ./my-project
126134
```
127135

128136
---
129137

138+
## AG2 Agent Shell — Local AI Agents over Ollama
139+
140+
specsmith includes an AG2-based agent shell with three specialized agents:
141+
142+
- **Planner** — inspects repo, generates execution plans with acceptance criteria
143+
- **Builder** — makes code/doc changes following the plan
144+
- **Verifier** — runs tests, accepts or rejects changes
145+
146+
All agents run locally on Ollama (default: `qwen2.5:14b`). Zero cloud cost.
147+
148+
```bash
149+
pip install "specsmith[ag2]" # install AG2 + Ollama support
150+
specsmith agent status # verify config + Ollama running
151+
specsmith agent run "fix lint errors" # Plan → Build → Verify pipeline
152+
specsmith agent improve "add tests for config.py" # self-improvement with reports
153+
```
154+
155+
The agent shell stores structured reports at `.specsmith/agent-reports/` with task ID,
156+
files changed, test results, verdict (ACCEPT/REJECT), and follow-up tasks.
157+
158+
Configurable per-project in `scaffold.yml` under `agents:` or via the VS Code
159+
Project Settings → Agent tab.
160+
161+
---
162+
130163
## VS Code Extension
131164

132165
The **specsmith AEE Workbench** VS Code extension is the flagship client:
@@ -146,7 +179,10 @@ The **specsmith AEE Workbench** VS Code extension is the flagship client:
146179
- **Execution profiles** — safe / standard / open / admin; custom allow/block command lists
147180
- **AEE phase indicator** — shows current phase with readiness %, Next Phase button, phase selector
148181
- **AI agent sessions** — independent process per project, JSONL bridge, chat with file injection
182+
- **AG2 agent shell** — Planner/Builder/Verifier agents over Ollama in Actions tab
183+
- **Agent tab** — per-project provider/model/context/iteration config (overrides global defaults)
149184
- **Live model listing** — Anthropic, OpenAI, Gemini, Mistral, local Ollama (GPU-aware)
185+
- **Ollama model catalog** — 16 models, 4 tiers, GPU-aware recommendations, filter by installed/available
150186
- **Ollama integration** — model manager (update/remove/update-all), version check, upgrade
151187
- **FPGA/HDL tool support** — vivado, gtkwave, vsg, ghdl, verilator, yosys, nextpnr, and 15 more
152188
- **Tool installer** — scan installed tools; one-click install via winget/brew/apt for missing tools
@@ -219,7 +255,7 @@ Supported tools: **Synthesis:** vivado, quartus, radiant, diamond, gowin.
219255

220256
**Workflow:** `phase show/set/next/list` `ledger add/list` `req list/add/gaps/trace`
221257

222-
**Agent:** `run` `agent providers/tools/skills`
258+
**Agent:** `run` `agent run/plan/status/verify/improve/reports` `agent providers/tools/skills`
223259

224260
**Ollama:** `ollama list/available/gpu/pull/suggest`
225261

0 commit comments

Comments
 (0)