Commit e052943

docs(v2): align exploratory outcome validation language
1 parent fe3c6de commit e052943

1 file changed

Lines changed: 12 additions & 11 deletions

docs/simulation-v2-architecture.md

@@ -957,22 +957,23 @@ Conversation themes (clustered):
 | **high** reasoning | ~4.5k | ~700 |
 | **high** conversation turn | ~2k | ~200 |

-**Total cost (5-mini: $0.15/$0.60 per 1M tokens, 15 timesteps):**
+**Total cost (gpt-5-mini: $0.30/$1.50 per 1M tokens, 15 timesteps):**

 | | 2k agents | 10k agents |
 |---|---|---|
-| **Current system** | ~$8 | ~$40 |
-| **low** | ~$18 | ~$90 |
-| **medium** | ~$30 | ~$150 |
-| **high** | ~$44 | ~$220 |
+| **Current system** | ~$40 | ~$200 |
+| **low** | ~$59 | ~$295 |
+| **medium** | ~$72 | ~$360 |
+| **high** | ~$99 | ~$495 |

 **Total cost (Sonnet-class: ~$3/$15 per 1M tokens, 15 timesteps):**

 | | 2k agents | 10k agents |
 |---|---|---|
-| **low** | ~$360 | ~$1,800 |
-| **medium** | ~$600 | ~$3,000 |
-| **high** | ~$900 | ~$4,500 |
+| **Current system** | ~$480 | ~$2,400 |
+| **low** | ~$700 | ~$3,500 |
+| **medium** | ~$860 | ~$4,300 |
+| **high** | ~$1,180 | ~$5,900 |

 **Wall time (1k RPM, conservative):**

@@ -983,7 +984,7 @@ Conversation themes (clustered):
 | **medium** | ~1.2 hrs | ~6 hrs |
 | **high** | ~1.5 hrs | ~8 hrs |

-**Default:** `medium`. Best cost/quality tradeoff. Names, narrative, temporal awareness, full memory, the one conversation that matters most (partner), aggregate mood. ~$150 for 10k agents on 5-mini. ~3.5x current cost for a fundamentally better simulation.
+**Default:** `medium`. Best cost/quality tradeoff. Names, narrative, temporal awareness, full memory, the one conversation that matters most (partner), aggregate mood. ~$360 for 10k agents on gpt-5-mini. ~1.8x current cost for a fundamentally better simulation.

 ---
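
The revised pricing can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only: `call_cost_usd` is a hypothetical helper, not something from the repository, and it assumes the two numeric columns in the per-call rows (~4.5k, ~700) are input and output tokens, since the table header falls outside this hunk. It does not try to reproduce the table totals, because the number of calls per agent per timestep is not visible here.

```python
def call_cost_usd(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Cost in USD of one LLM call, given prices per 1M tokens (hypothetical helper)."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000


# gpt-5-mini prices as stated in the updated doc: $0.30 input / $1.50 output per 1M tokens.
# Token counts come from the "high reasoning" row, ASSUMING the columns are input/output.
high_reasoning_call = call_cost_usd(4_500, 700, 0.30, 1.50)

# Ratio behind the "~1.8x current cost" claim: medium vs. current system,
# 10k agents on gpt-5-mini, using the table values from the diff.
medium_vs_current = 360 / 200

print(f"one high-reasoning call: ${high_reasoning_call:.4f}")
print(f"medium vs current: {medium_vs_current:.1f}x")
```

The ratio check confirms that the "~1.8x" wording in the new Default paragraph matches the updated table (the old "~3.5x" rounded down a ratio of 3.75 from the previous figures).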

@@ -1120,9 +1121,9 @@ These are the minimum changes needed to move every tenet to **Strong**. Listed i

 7. **Demographic consistency:** An agent with `digital_literacy: basic` should not describe a plan involving "fine-tuning open-source models." Elaborations should reflect the agent's actual capabilities and constraints.

-### Clustering Validation
+### Exploratory Outcome Validation

-8. **Cluster coherence:** For exploratory outcomes, verify clusters are semantically meaningful. Silhouette scores on embeddings. Human review of cluster labels vs representative samples. Bad clusters = too heterogeneous or too small.
+8. **Export completeness:** For exploratory outcomes, verify all agent elaborations are exported with correct agent_id, demographics, and timestep. Downstream analysis (clustering, thematic coding) is done by the agentic harness or manual DS workflows — not validated by the engine.

 ---
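
The new "Export completeness" item could be checked along the following lines. This is a minimal sketch under stated assumptions, not the engine's actual validation: the record layout (a dict with `agent_id`, `demographics`, `timestep`, `elaboration`), the function name `find_export_gaps`, and the expectation that every agent exports one elaboration per timestep are all hypothetical and would need to be aligned with the real export schema.

```python
# Field names are ASSUMED for illustration; the real export schema may differ.
REQUIRED_FIELDS = ("agent_id", "demographics", "timestep", "elaboration")


def find_export_gaps(records, expected_agent_ids, expected_timesteps):
    """Return (missing, malformed) for a batch of exported elaborations.

    missing:   sorted (agent_id, timestep) pairs with no well-formed record
    malformed: records with a required field absent or empty
    """
    seen = set()
    malformed = []
    for rec in records:
        # A field counts as empty if it is absent, None, "" or {} (timestep 0 is valid).
        if any(rec.get(field) in (None, "", {}) for field in REQUIRED_FIELDS):
            malformed.append(rec)
            continue
        seen.add((rec["agent_id"], rec["timestep"]))

    expected = {(a, t) for a in expected_agent_ids for t in expected_timesteps}
    return sorted(expected - seen), malformed


# Tiny usage example with made-up records.
records = [
    {"agent_id": "a1", "demographics": {"age": 34}, "timestep": 0,
     "elaboration": "Plans to wait and see."},
    {"agent_id": "a2", "demographics": {}, "timestep": 0,
     "elaboration": "Will switch providers."},  # empty demographics -> malformed
]
missing, malformed = find_export_gaps(records, ["a1", "a2"], [0])
print(missing, len(malformed))
```

Keeping the check to presence and identity of fields matches the scope described in the item: clustering and thematic coding happen downstream and are deliberately left out.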