Extropy v2 — Full Pipeline Architecture

This document specifies every change across the entire Extropy pipeline, from extropy spec through extropy results. It supersedes simulation-experience-gaps.md (which identified the problems). This document defines the solutions.

Design Principles

Households, not individuals. The atomic unit of sampling is a household. Partners are co-sampled. Kids are generated at sampling time. Economic fate is shared.
Social roles, not similarity. Network edges represent real relationships (partner, coworker, neighbor, friend) derived from agent attributes — not generic "acquaintance" edges from embedding distance.
First person, always. Agents think as "I" — not "You are Travis." The prompt reads like an internal monologue, not a briefing document.
Names carry culture. Every agent has a name. Names are demographically appropriate (SSA baby names + Census surname data, bundled CSVs, US-only for now). Partners, kids, and peers are named. Non-US populations use country-specific name data added later behind the same generate_name(gender, ethnicity, birth_decade, country="US") interface.
Time is felt. Agents know what day/week it is, how long ago they heard the news, whether things are getting better or worse.
Conversations are real. Agents talk to each other. Both sides are real agents (or NPC dependents). The conversation changes both participants.
Outcomes emerge, not imposed. For exploratory scenarios, outcomes are open-ended. Categories are discovered post-hoc by downstream DS tooling (agentic harness), not pre-defined dropdowns or built-in clustering.
Scenarios evolve. Events develop over time. New information arrives at specified timesteps. The world isn't frozen at t=0.
Deterministic where possible. Names, family, temporal framing, mood rendering, channel experience — all deterministic. LLM calls only for reasoning and conversations.
Fidelity is tunable. --fidelity low/medium/high controls prompt richness and conversation depth without changing the underlying data model.

Locked Decisions

Decisions confirmed before implementation. These override any conflicting detail elsewhere in this document.

Pre-Phase Decisions (all phases)

#	Decision	Resolution
1	Pipeline ordering	Network stays scenario-agnostic. Optional "scenario-conditioned network" mode is future work, not v2 core.
2	`talk_to` target identity	`agent_id` in schema, render names only in prompt.
3	Merged-pass default	2-pass is default. Merged pass behind `--merged-pass` flag. A/B test later.
4	Token budget for memory	Uncapped for now. Deterministic token cap with oldest-to-newest compression deferred.
5	Timeline merge semantics	Timeline entry overrides base event for that timestep.
6	DB schema for new artifacts	Define conversations/posts/action_history tables before Phase D.
7	Name data	Local SSA baby names + Census surnames, bundled CSVs (~500KB), US-only. Non-US via country-specific CSVs later behind same interface: `generate_name(gender, ethnicity, birth_decade, country="US")`.
8	Conformity/threshold mechanics	Soft prompt signal only (conformity self-awareness + peer opinions + mood rendering). No explicit ratios or hard numeric gates.
9	Backtesting ground-truth	Define one validation dataset schema before Phase G.

Phase-Specific Decisions

#	Decision	Resolution	Phase
10	Conversation turns	2 turns / 4 messages (medium), 3 turns / 6 messages (high). LLM told the limit so it wraps up.	D
11	Conversation state	Both agents keep their own independent final state. No conflict.	D
12	Household joint distributions	Pre-processed cross-tab CSVs from Census PUMS, not raw PUMS.	B
13	Neighbor edge inference	TBD at spec/extend phase — needs generalized attribute. Not blocking B core.	B
14	Fidelity flag location	`SimulationRunConfig` (runtime choice, not scenario-intrinsic).	F
15	Post-hoc clustering	Not building. Agentic harness does DS on exported data.	F
16	Repetition detection metric	Jaccard similarity on word-level trigrams, >70% threshold.	E
17	Channel experience	`experience_template` field on `ExposureChannel`. Scenario compiler generates it. Fallback: humanize channel name.	A
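
Decision 16 names the metric but not the inputs. A minimal sketch of Jaccard similarity on word-level trigrams, assuming the two texts compared are an agent's current and previous reasoning (that pairing, and the function names, are assumptions, not part of this spec):

```python
def word_trigrams(text: str) -> set[tuple[str, ...]]:
    """Return the set of consecutive 3-word windows in the text."""
    words = text.lower().split()
    return {tuple(words[i:i + 3]) for i in range(len(words) - 2)}


def trigram_jaccard(a: str, b: str) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B| over word trigrams."""
    ta, tb = word_trigrams(a), word_trigrams(b)
    if not ta or not tb:
        return 0.0  # texts shorter than 3 words have no trigrams
    return len(ta & tb) / len(ta | tb)


def is_repetitive(current: str, previous: str, threshold: float = 0.70) -> bool:
    """Flag repetition when similarity exceeds the >70% threshold."""
    return trigram_jaccard(current, previous) > threshold
```

Lowercasing and whitespace tokenization are simplifications; punctuation handling is left open.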

Pipeline Overview

extropy spec  →  extropy extend  →  extropy sample  →  extropy network  →  extropy persona  →  extropy scenario  →  extropy simulate  →  extropy results
                                         ↑                                                          ↑                      ↑
                                   HOUSEHOLD-BASED                                            SCENARIO TIMELINE      CONVERSATIONS
                                   SAMPLING (NEW)                                             + OUTCOME TRACKS       + COGNITIVE STATE
                                                                                              (NEW)                  (NEW)

PHASE 1: POPULATION CREATION

1.1 `extropy spec` — No Changes

Spec building (sufficiency check → attribute selection → hydration → binding) stays the same. The LLM discovers attributes relevant to the scenario and builds distributions. No structural changes.

One addition: the spec builder should recognize household-level attributes and tag them. Attributes such as household_income, marital_status, and household_size should carry scope: household in the attribute metadata, which tells the sampler they are shared within a household unit.

# New attribute metadata field
- name: household_income
  scope: household  # shared across household members
  type: int
  category: universal
  ...

- name: neuroticism
  scope: individual  # default, independent per person
  type: float
  category: personality
  ...
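
A rough sketch of how the sampler might consume the scope field, assuming attribute metadata is loaded into simple records. AttributeSpec and split_by_scope are hypothetical names used only for illustration:

```python
from dataclasses import dataclass


@dataclass
class AttributeSpec:
    name: str
    type: str
    category: str
    scope: str = "individual"  # default when the field is absent


def split_by_scope(attrs: list[AttributeSpec]) -> tuple[list[str], list[str]]:
    """Separate attributes sampled once per household from per-person ones."""
    household = [a.name for a in attrs if a.scope == "household"]
    individual = [a.name for a in attrs if a.scope != "household"]
    return household, individual


attrs = [
    AttributeSpec("household_income", "int", "universal", scope="household"),
    AttributeSpec("neuroticism", "float", "personality"),
]
household_attrs, individual_attrs = split_by_scope(attrs)
```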

1.2 `extropy extend` — Minor Changes

Same as spec, but with context from a base spec. The extension LLM should be aware that household-scoped attributes exist and avoid duplicating them across individual-level additions.

1.3 `extropy sample` — Major Rewrite

Current: Samples N individual agents sequentially. Each agent is independent. A married agent with household_size: 5 exists alone — no partner, no kids.

New: Samples N/H households, where H is the average household size. Each household produces 1-2 adult agents plus NPC dependents.

Household Sampling Flow

1. Determine household composition
   - Sample household_type: single | couple | single_parent | couple_with_kids | multi_generational
   - Source: Census household composition rates by state, age bracket, race_ethnicity
   - This determines how many adults and dependents

2. Sample primary adult (Adult 1)
   - All individual attributes sampled as today (age, gender, personality, occupation, etc.)
   - Household-scoped attributes sampled once and shared

3. If couple household → Sample partner (Adult 2)
   - Correlated demographics:
     - Same state, urban_rural (shared household)
     - Same household_income, household_size (shared)
     - Age: sampled from joint distribution conditioned on Adult 1's age
       (mean = Adult 1 age ± 2 years, std = 3 years, constrained by gender norms for the cohort)
     - race_ethnicity: correlated via intermarriage rates from census
       (same-race probability ~85%, varies by group)
     - education_level: assortative mating correlation (~0.6 from research)
     - religious_affiliation: high correlation (~0.7)
     - political_orientation: moderate correlation (~0.5)
   - Independent attributes:
     - Personality traits (neuroticism, extraversion, etc.) — independent
     - occupation_sector — constrained by local job market (same state) but otherwise independent
     - All attitudes and context-specific attributes — independent
   - Same-sex couples:
     - Census same-sex household rates by state (~1-3% of couple households)
     - When sampled as same-sex, both adults share gender
     - Name generation respects this

4. Generate NPC dependents (children, elderly parents)
   - Number of dependents = household_size - number_of_adults (kids plus any elderly dependents)
   - Kid ages: sampled from plausible range given parent age + life_stage
     - If parent is 43 (middle_aged_adult), kids likely 5-20
     - If parent is 28 (young_adult), kids likely 0-8
     - Spacing: 2-4 years between siblings (sampled)
   - Kid attributes (minimal, for context only):
     - first_name (from same cultural pool as parents)
     - age
     - gender (50/50)
     - school_status: derived from age (0-4: home, 5-17: in school, 18+: college/working)
     - grade: derived from age
   - Elderly dependents: if household_labor_dependency implies it
     - first_name, age (65+), relationship ("mother", "father-in-law")
   - NPCs stored as structured metadata on the parent agent, NOT as agents

5. Generate names for all household members
   - Data source: SSA baby name frequency data (by birth decade + gender) + Census surname data (by ethnicity)
   - First name: filter by birth_decade (derived from age) + gender + race_ethnicity → weighted random pick
   - Last name: filter by race_ethnicity → weighted random pick → shared across household
   - Partner may have different last name (probability based on era/demographics)
   - ~500KB of bundled CSV data (see Locked Decision 7). Zero API calls.
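
Step 5 can be sketched as a weighted pick behind the generate_name interface. The in-memory tables below are hypothetical stand-ins for the bundled CSVs, the frequencies are made up, and the rng parameter is an added assumption so the pick can be seeded. First names here are keyed by decade and gender only; the ethnicity filter on first names described above is left out for brevity.

```python
import random

# Hypothetical stand-ins for the bundled CSVs: (name, frequency) rows.
FIRST_NAMES = {
    ("US", "male", 1980): [("Travis", 120), ("Michael", 900), ("Jason", 600)],
    ("US", "female", 1980): [("Lisa", 300), ("Jennifer", 800), ("Amanda", 500)],
}
SURNAMES = {
    ("US", "white"): [("McAllister", 40), ("Smith", 900), ("Miller", 500)],
}


def birth_decade(age: int, current_year: int) -> int:
    """Derive the birth decade from age, e.g. 43 in 2025 -> 1980."""
    return (current_year - age) // 10 * 10


def _weighted_pick(rows, rng):
    names = [name for name, _ in rows]
    weights = [freq for _, freq in rows]
    return rng.choices(names, weights=weights, k=1)[0]


def generate_name(gender, ethnicity, birth_decade, country="US", rng=None):
    """Return (first_name, last_name) via frequency-weighted random picks."""
    if rng is None:
        rng = random.Random()
    first = _weighted_pick(FIRST_NAMES[(country, gender, birth_decade)], rng)
    last = _weighted_pick(SURNAMES[(country, ethnicity)], rng)
    return first, last
```

Passing a seeded random.Random keeps name generation reproducible across runs, in line with the "deterministic where possible" principle.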

Agent Record (New Schema)

{
  "_id": "agent_0020",
  "first_name": "Travis",
  "last_name": "McAllister",
  "household_id": "household_0010",
  "household_role": "adult_primary",
  "partner_id": "agent_0021",
  "dependents": [
    {"name": "Tyler", "age": 17, "gender": "male", "relationship": "son", "school_status": "high_school_senior"},
    {"name": "Kayla", "age": 14, "gender": "female", "relationship": "daughter", "school_status": "high_school_freshman"},
    {"name": "Mason", "age": 9, "gender": "male", "relationship": "son", "school_status": "4th_grade"}
  ],
  "age": 43,
  "gender": "male",
  "race_ethnicity": "white",
  "state": "south_carolina",
  "marital_status": "cohabiting",
  "household_size": 5,
  "household_income": 52000,
  ...all other attributes...
}
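
The dependents array can be derived deterministically once ages are sampled. A minimal sketch using the coarse age buckets from step 4 of the sampling flow; the finer labels in the record above (for example "high_school_senior") would need an extra mapping that is not shown. Dependent, dependent_count, and the bucket strings are illustrative names, not a fixed schema:

```python
from dataclasses import dataclass


def school_status(age: int) -> str:
    """Coarse status from age: 0-4 home, 5-17 in school, 18+ college/working."""
    if age <= 4:
        return "home"
    if age <= 17:
        return "in_school"
    return "college_or_working"


def dependent_count(household_size: int, number_of_adults: int) -> int:
    """Dependents are whoever is left after the adult agents are counted."""
    return max(household_size - number_of_adults, 0)


@dataclass
class Dependent:
    name: str
    age: int
    gender: str
    relationship: str

    @property
    def status(self) -> str:
        return school_status(self.age)


kids = [
    Dependent("Tyler", 17, "male", "son"),
    Dependent("Kayla", 14, "female", "daughter"),
    Dependent("Mason", 9, "male", "son"),
]
```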

What This Changes About Study DB Storage

The agents table stays the same (id → JSON attributes). But agents now have:

household_id for grouping
partner_id for cross-referencing
dependents as structured metadata
first_name, last_name as named attributes

The study DB also gets a new households table alongside the existing populations table:

CREATE TABLE households (
    id TEXT PRIMARY KEY,
    population_id TEXT,
    adult_ids JSON,        -- ["agent_0020", "agent_0021"]
    dependent_data JSON,   -- NPC details
    shared_attributes JSON -- household_income, state, etc.
);
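
A quick way to exercise this table from Python, using the standard sqlite3 module with JSON columns stored as serialized text. The population_id value is a placeholder, and serializing with json.dumps is an assumption about how the study DB writes these columns:

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE households (
        id TEXT PRIMARY KEY,
        population_id TEXT,
        adult_ids JSON,
        dependent_data JSON,
        shared_attributes JSON
    )
    """
)

conn.execute(
    "INSERT INTO households VALUES (?, ?, ?, ?, ?)",
    (
        "household_0010",
        "population_placeholder",  # hypothetical id
        json.dumps(["agent_0020", "agent_0021"]),
        json.dumps([{"name": "Tyler", "age": 17, "relationship": "son"}]),
        json.dumps({"household_income": 52000, "state": "south_carolina"}),
    ),
)

row = conn.execute(
    "SELECT adult_ids, shared_attributes FROM households WHERE id = ?",
    ("household_0010",),
).fetchone()
adult_ids = json.loads(row[0])
shared = json.loads(row[1])
```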

1.4 `extropy network` — Major Rewrite

Current: Compute pairwise similarity from all attributes → threshold → create edges → assign edge types from similarity buckets → calibrate to target metrics.

New: Create edges from social role rules first, then fill with similarity-based edges.

Social Role Edge Generation

Phase 1: Structural edges (rule-based from attributes; partner and household always created, the rest sampled)
  - partner: from partner_id field (always created, weight = 1.0)
  - household: all adults in same household_id (always created, weight = 0.9)
  - coworker: same occupation_sector + same state (probabilistic, ~5-10 per agent)
  - neighbor: same state + same urban_rural + age within 15 years (probabilistic, ~3-5 per agent)
  - congregation: same religious_affiliation + same state (if religious, ~3-5 per agent)
  - school_parent: both have kids in school age + same state + same urban_rural (~2-3 per agent)

Phase 2: Similarity-based edges (fills remaining degree budget)
  - Compute similarity as today, but only for agents not yet connected
  - These become "acquaintance" or "online_contact" edges
  - Lower weight than structural edges

Phase 3: Calibration
  - Same calibration loop as today (hit target avg_degree, clustering, modularity)
  - But structural edges are PROTECTED — calibration can only add/remove similarity edges
  - Structural edges are never pruned
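
The always-created edges and the protection rule can be sketched as follows. Emitting a single edge per adult pair (partner taking precedence over household) and pruning the lowest-weight similarity edges first are assumptions; the spec only requires that structural edges survive calibration.

```python
from itertools import combinations


def structural_household_edges(agents: list[dict]) -> list[dict]:
    """Create partner/household edges between adults sharing a household_id."""
    by_household: dict[str, list[dict]] = {}
    for agent in agents:
        by_household.setdefault(agent["household_id"], []).append(agent)

    edges = []
    for members in by_household.values():
        for a, b in combinations(members, 2):
            is_partner = a.get("partner_id") == b["_id"]
            edges.append({
                "source": a["_id"],
                "target": b["_id"],
                "edge_type": "partner" if is_partner else "household",
                "weight": 1.0 if is_partner else 0.9,
                "structural": True,
            })
    return edges


def prune_for_calibration(edges: list[dict], n_remove: int) -> list[dict]:
    """Drop up to n_remove edges, touching only non-structural ones."""
    removable = sorted(
        (e for e in edges if not e["structural"]), key=lambda e: e["weight"]
    )
    drop = {id(e) for e in removable[:n_remove]}
    return [e for e in edges if id(e) not in drop]
```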

Edge Schema (Enhanced)

{
  "source": "agent_0020",
  "target": "agent_0021",
  "edge_type": "partner",
  "weight": 1.0,
  "structural": true,
  "context": "household"
}

Edge types and their default weights:

Edge Type	Weight	Source	Bidirectional
partner	1.0	household_id + partner_id	yes
household	0.9	household_id	yes
coworker	0.6	occupation_sector + state	yes
close_friend	0.7	similarity top-k + age proximity	yes
neighbor	0.4	state + urban_rural + age proximity	yes
congregation	0.4	religious_affiliation + state	yes
school_parent	0.35	kids in school + state + urban_rural	yes
acquaintance	0.2	similarity-based	yes
online_contact	0.15	similarity-based, different state OK	yes

What The Scenario Can Override

The scenario YAML can declare relationship priority for this topic:

relationship_weights:
  partner: 1.0       # always highest
  coworker: 0.9      # ASI: coworkers are critical
  close_friend: 0.7
  neighbor: 0.3      # ASI: neighbors less relevant

For ULEZ, the weights would be different:

relationship_weights:
  partner: 1.0
  neighbor: 0.8      # ULEZ: neighbors directly affected
  coworker: 0.5      # ULEZ: less relevant unless both tradies

These weights are used in two places:

Conversation conflict resolution — when two people want to talk to the same agent, higher weight goes first
Peer opinion ordering in the prompt — higher weight peers appear first with full detail (name, demographics, statement), lower weight peers get summarized ("a few acquaintances also mentioned it")

The agent's OWN choice of who to talk to is LLM-driven — they see the list of available people and pick based on what they need. The weights handle tiebreaking and prompt ordering, not the agent's decision.
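
Peer opinion ordering could look roughly like this. The detail_slots cutoff and the fallback to the edge's default weight when the scenario does not list an edge type are both assumptions, as is the shape of the peer records:

```python
def order_peers(peers: list[dict], relationship_weights: dict[str, float],
                detail_slots: int = 3) -> tuple[list[dict], int]:
    """Sort peers by scenario weight; return (detailed peers, summarized count)."""

    def score(peer: dict) -> float:
        # Scenario weight wins; otherwise fall back to the default edge weight.
        return relationship_weights.get(peer["edge_type"], peer["edge_weight"])

    ranked = sorted(peers, key=score, reverse=True)
    detailed = ranked[:detail_slots]
    summarized_count = len(ranked) - len(detailed)
    return detailed, summarized_count


asi_weights = {"partner": 1.0, "coworker": 0.9, "close_friend": 0.7, "neighbor": 0.3}
peers = [
    {"name": "Marcus", "edge_type": "neighbor", "edge_weight": 0.4},
    {"name": "Darnell", "edge_type": "coworker", "edge_weight": 0.6},
    {"name": "Lisa", "edge_type": "partner", "edge_weight": 1.0},
    {"name": "an acquaintance", "edge_type": "acquaintance", "edge_weight": 0.2},
]
detailed, summarized = order_peers(peers, asi_weights, detail_slots=2)
```

With the ASI weights, Lisa and Darnell get full detail and the remaining two peers fall into the summarized tail.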

Auto-generation: extropy scenario generates these weights automatically — the compiler LLM knows that ASI is a workplace threat (coworker weight high), Netflix is a household product (family weight high), ULEZ is local policy (neighbor weight high). The user can override in the YAML after generation.

PHASE 2: SCENARIO COMPILATION

2.1 `extropy scenario` — Significant Extensions

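The scenario YAML gains a new top-level timeline section, enhanced outcomes, and an optional experience_template on each exposure channel. A day_phases section was considered and omitted (see "Day Phases" below).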

Scenario Timeline (New)

Current: Single event at t=0. Nothing changes.

New: Sequence of events at specified timesteps.

timeline:
  - timestep: 0
    event:
      type: news
      content: "AI systems demonstrate superhuman performance across all cognitive domains. Fortune 50 firms announce restructuring."
      source: "Major news outlets"
      credibility: 0.8
      ambiguity: 0.6
      emotional_valence: -0.7
  - timestep: 3
    event:
      type: news
      content: "Federal government announces emergency AI task force. No concrete policy yet. Unemployment claims spike 40%."
      source: "White House press briefing"
      credibility: 0.9
      ambiguity: 0.4
      emotional_valence: -0.5
  - timestep: 8
    event:
      type: news
      content: "First wave of layoffs hits. 200,000 jobs cut across tech and finance in the past month."
      source: "Bureau of Labor Statistics"
      credibility: 0.95
      ambiguity: 0.2
      emotional_valence: -0.8
  - timestep: 16
    event:
      type: policy_change
      content: "Government announces Emergency Workforce Transition Act: $500/person retraining voucher, extended unemployment to 52 weeks."
      source: "Congressional legislation"
      credibility: 0.9
      ambiguity: 0.3
      emotional_valence: -0.2

Each timeline event:

Has its own exposure rules (which channels, which agents, what probability)
Gets injected into agent prompts at the specified timestep as "new information this [week/day]"
Accumulates — by timestep 16, agents have the full history of developments
Can override or update the original event's parameters

For scenarios with no evolution (Netflix password sharing), the timeline is just the single t=0 event. No extra configuration needed.
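
A minimal sketch of the two timeline lookups the engine needs: the accumulated recap up to the current timestep, and the active event for a timestep. It reads Locked Decision 5 as "an entry at exactly timestep t replaces the base event for t"; whether an entry should persist into later timesteps is not specified, so that reading is an assumption.

```python
def recap(timeline: list[dict], t: int) -> list[dict]:
    """All timeline entries at or before timestep t, in chronological order."""
    ordered = sorted(timeline, key=lambda entry: entry["timestep"])
    return [entry for entry in ordered if entry["timestep"] <= t]


def event_at(base_event: dict, timeline: list[dict], t: int) -> dict:
    """Timeline entry for t if one exists, otherwise the base event."""
    for entry in timeline:
        if entry["timestep"] == t:
            return entry["event"]
    return base_event


base = {"type": "news", "content": "AI systems demonstrate superhuman performance."}
timeline = [
    {"timestep": 0, "event": base},
    {"timestep": 3, "event": {"type": "news", "content": "Emergency AI task force announced."}},
    {"timestep": 8, "event": {"type": "news", "content": "First wave of layoffs hits."}},
    {"timestep": 16, "event": {"type": "policy_change", "content": "Emergency Workforce Transition Act."}},
]
```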

Day Phases (OMITTED)

Decision: Day phase templates are not implemented. The improved flat prompt structure (first-person voice, temporal awareness, named peers, local mood, social feed) provides sufficient narrative context without explicit morning/work/evening phase segmentation. The complexity of condition-based phase selection doesn't justify the marginal improvement over the current prompt design.

Outcome Tracks (New)

Current: All outcomes are pre-defined categorical/boolean/float with fixed options.

New: Three tracks (known, exploratory, hybrid), inferred from schema.

Track 1: Known Outcomes — Options are pre-defined. Used when you know the decision space.

outcomes:
  suggested_outcomes:
    - name: action
      type: categorical
      options: [pay_own_account, stay_shared_with_fee, cancel, switch_service, workaround]
      description: "What the user does about Netflix account"
      required: true
    - name: sentiment
      type: float
      range: [-1.0, 1.0]
      description: "How they feel about the change"
      required: true

Engine behavior: Pass 2 extracts exact category from options. Results show distributions directly.

Track 2: Exploratory Outcomes — No pre-defined options. Used when the decision space is unknown.

outcomes:
  suggested_outcomes:
    - name: primary_response
      type: open_ended
      description: "What are you actually going to do? Be specific about concrete steps, resources, timeline, and obstacles."
      required: true
    - name: sentiment
      type: float
      range: [-1.0, 1.0]
      description: "Overall sentiment"
      required: true

Engine behavior: The agent writes a free-form response. No classification pass needed for this outcome. Categories are discovered post-hoc by the agentic harness working on exported data; extropy results does no built-in clustering (Locked Decision 15).

Track 3: Hybrid — Categorical bucket + open elaboration.

outcomes:
  suggested_outcomes:
    - name: primary_adaptation_strategy
      type: categorical
      options: [aggressive_upskilling, occupational_pivot, double_down, disengage, collective_resistance]
      description: "Broad strategy category"
      required: true
    - name: elaboration
      type: open_ended
      description: "Describe specifically what you're planning. Concrete steps, resources, timeline, obstacles."
      required: true
    - name: sentiment
      type: float
      range: [-1.0, 1.0]
      required: true

Engine behavior: Categorical extracted via classification. Elaboration captured as free text. Results show both: "54% aggressive upskilling" as the headline, with the raw elaborations exported for the agentic harness to group into the story.

Inference rule: If type: open_ended and no options → exploratory track. If type: categorical with options → known track. If both exist → hybrid. No explicit outcome_mode field needed.
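
The inference rule is small enough to sketch directly. Outcomes are assumed to be plain dicts shaped like the YAML above; returning "known" when no open-ended field is present (for example, float-only outcomes) is an assumption, since the rule does not cover that case.

```python
def infer_outcome_track(outcomes: list[dict]) -> str:
    """Classify a scenario's outcomes as known, exploratory, or hybrid."""
    has_known = any(
        o["type"] == "categorical" and o.get("options") for o in outcomes
    )
    has_open = any(
        o["type"] == "open_ended" and not o.get("options") for o in outcomes
    )
    if has_known and has_open:
        return "hybrid"
    if has_open:
        return "exploratory"
    return "known"  # also covers the unspecified float-only case


netflix = [
    {"name": "action", "type": "categorical", "options": ["cancel", "workaround"]},
    {"name": "sentiment", "type": "float"},
]
asi_open = [
    {"name": "primary_response", "type": "open_ended"},
    {"name": "sentiment", "type": "float"},
]
asi_hybrid = [
    {"name": "primary_adaptation_strategy", "type": "categorical",
     "options": ["aggressive_upskilling", "disengage"]},
    {"name": "elaboration", "type": "open_ended"},
]
```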

Channel Experience Templates (New, Optional)

experience_template is a field on ExposureChannel itself — not a separate top-level config. The scenario compiler generates it. The engine renders it into the prompt. Fallback: humanize the channel name if no template exists.

seed_exposure:
  channels:
    - name: mainstream_news_media
      description: "TV and online news coverage"
      reach: broadcast
      credibility_modifier: 1.1
      experience_template: "I saw a news segment about this on TV."
    - name: social_media_feeds
      description: "Posts appearing in social feeds"
      reach: broadcast
      credibility_modifier: 0.8
      experience_template: "I noticed some posts about this online."
    - name: email_notification
      description: "Direct email from the company"
      reach: targeted
      credibility_modifier: 1.2
      experience_template: "I got an email about this."

If experience_template is omitted → engine humanizes the channel name (e.g., "social_media_feeds" → "social media"). If provided → used directly in the prompt.
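
A sketch of the fallback, assuming channels arrive as dicts shaped like the YAML above. Note that plain underscore replacement turns "social_media_feeds" into "social media feeds"; producing the shorter "social media" from the example would need an extra trimming rule that this document does not specify.

```python
def render_channel_experience(channel: dict) -> str:
    """Use experience_template when present, else humanize the channel name."""
    template = channel.get("experience_template")
    if template:
        return template
    # Fallback: snake_case name -> space-separated words.
    return channel["name"].replace("_", " ")


with_template = {
    "name": "email_notification",
    "experience_template": "I got an email about this.",
}
without_template = {"name": "social_media_feeds"}
```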

2.2 ScenarioSpec Model Changes

class TimelineEvent(BaseModel):
    """A single event in the scenario timeline."""
    timestep: int
    event: Event
    exposure_rules: list[ExposureRule] | None = None  # If None, reuse seed_exposure rules
    description: str | None = None  # Human-readable context for this development

# DayPhase and DayPhaseConfig — OMITTED (see "Day Phases" section above)

# No ChannelVariant / ChannelExperience models and no top-level channel_experience map.
# Per Locked Decision 17, experience_template is a field on ExposureChannel.
class ExposureChannel(BaseModel):
    ...  # existing fields unchanged (name, description, reach, credibility_modifier)
    experience_template: str | None = None  # NEW: if None, engine humanizes the channel name

class ScenarioSpec(BaseModel):
    meta: ScenarioMeta
    event: Event                              # Initial event (t=0)
    timeline: list[TimelineEvent] | None = None  # NEW: subsequent developments
    seed_exposure: SeedExposure               # channels carry experience_template
    interaction: InteractionConfig
    spread: SpreadConfig
    outcomes: OutcomeConfig
    simulation: SimulationConfig
    # day_phases: OMITTED
    relationship_weights: dict[str, float] | None = None  # NEW

PHASE 3: SIMULATION

3.1 Persona Generation — Rewrite

Current: "You are a 43-year-old male..." + grouped attribute bullet points.

New: First-person identity with named family, household context, and economic reality.

I'm Travis McAllister. I'm 43, white, living in Greenville, South Carolina with my
partner Lisa (41, retail) and our three kids — Tyler (17, about to graduate high school),
Kayla (14), and Mason (9). I work full-time in the service industry. Our household income
is about $52,000. We've got maybe one month of savings.

My Mindset & Values
- Neuroticism: High
- Extraversion: Moderate
- Openness: Low
- Conscientiousness: Moderate
- Agreeableness: Moderate

My Attitudes & Concerns
- Institutional Trust: Low
- AI Threat Perception: High
- Economic Anxiety: Severe
- Technology Adoption: Low
...

Key changes:

First person ("I'm Travis" not "You are a 43-year-old")
Partner named and described (from partner agent's attributes)
Kids named with ages and school status (from dependent metadata)
Household economic context (shared income, savings)
All remaining attributes still listed (structured format, nothing filtered)

3.2 Timestep Loop — Redesigned

Current: Expose → Reason (Pass 1 + Pass 2) → Propagate → Aggregate → Stop check.

New: Four-phase timestep with conversations.

Phase 1: EXPOSURE + CONTEXT BUILD
  - Apply seed exposures for this timestep
  - Apply timeline events if any scheduled for this timestep
  - Propagate network exposures from previous timestep's sharers
  - Build reasoning context for each agent (persona, exposures, memory, peers, mood, temporal)

Phase 2: REASONING (parallel, all agents)
  - All aware agents reason in parallel
  - 2-pass reasoning by default (Pass 1, then Pass 2); single merged LLM call only behind --merged-pass (Locked Decision 3)
  - Output: internal monologue, sentiment, conviction, public statement,
    position, outcomes, elaboration, actions (talk_to, post)
  - State from this phase is PROVISIONAL — conversations can override

Phase 3: CONVERSATIONS (parallel across pairs, sequential within conflicts)
  - Engine collects all talk_to actions from Phase 2
  - Builds conversation queue:
    - Priority by edge weight (partner > close_friend > coworker > acquaintance)
    - Non-conflicting pairs run in parallel
    - Conflicting pairs (share an agent) run sequentially by priority
  - Each conversation: 2 turns at medium fidelity (4 messages back-and-forth), 3 turns at high fidelity (6 messages back-and-forth). The LLM knows the turn limit so it can wrap up.
    - Agent-agent: both sides are real agents, both states update independently (each agent keeps their own final state)
    - Agent-NPC: NPC side generated by LLM from NPC profile, only agent state updates
  - Conversation output: updated sentiment, conviction, position for EACH participant independently
  - This OVERRIDES Phase 2 provisional state for agents who conversed

Phase 4: STATE UPDATE + AGGREGATION
  - Social posts collected and stored (feeds into next timestep's public discourse)
  - Final state written for all agents (Phase 3 output if conversed, Phase 2 output if not)
  - Sharing decisions computed (mechanical, based on will_share + spread config)
  - Timestep summary computed (exposure rate, position distribution, sentiment stats)
  - Stopping conditions evaluated
  - Checkpoint if needed

3.3 Prompt Structure — Complete Rewrite

Current prompt:

[System: You ARE this person]
[Persona: attribute bullet points]
[Event: content block]
[Exposures: "Someone told you" x17]
[Memory: 3 summaries]
[Peer opinions: "A acquaintance says..."]
[Instructions: respond as JSON]

New prompt (medium fidelity, weekly timestep):

You are going to think as Travis McAllister. Everything below is from Travis's
perspective. Respond as Travis — first person, honest, unfiltered.

---

I'm Travis McAllister. I'm 43, white, living in Greenville, SC with my partner
Lisa (41, retail) and our three kids — Tyler (17, about to graduate), Kayla (14),
and Mason (9). I work full-time in services. Household income ~$52K. About one
month of savings.

[Full characteristics list]

---

It's Week 3 since AI systems were announced to have superhuman performance across
all cognitive domains.

What's happened so far:
- Week 1: AI systems publicly demonstrated superhuman performance. Fortune 50 firms
  announced restructuring with major layoffs. Federal guidance unclear.
- Week 3: Government announced emergency task force but no concrete policy.
  Unemployment claims spiked 40%.

This week:
- I saw this on CNN and it was all over my X feed
- 23 people in my circle brought it up this week, including my coworker Darnell
  and my neighbor Marcus
- This is the 3rd week in a row everyone around me is talking about this

What people around me are saying:
- My partner Lisa (41, retail): "We need to be realistic about our savings. I'm
  scared." — she's very worried and firm about cutting costs
- My coworker Darnell (36, services): "I signed up for some free AI tutorials.
  Can't hurt." — he's anxious but doing something about it
- My neighbor Marcus (42, construction): "My job's physical, maybe I'm okay?"
  — he's uncertain, trying to convince himself
- Most people I know are anxious. The mood hasn't improved since last week.
  If anything, the layoff numbers made it worse.

What I've been thinking:
- Week 1: "I'm terrified. One month of savings. Five mouths to feed. I need to
  cut spending NOW and find backup work." I was panicked and certain we were in trouble.
- Week 2: "Same as last week but heavier. Lisa's right — can't afford courses.
  Looking at gig apps. The anger is fading into something worse." Still anxious,
  getting more resigned.

I've been feeling panicked since this started. Last week the panic settled into
a heavy dread that hasn't lifted. I've been firm about survival-first since Week 1
and that hasn't wavered.

---

People in my life right now:
- Lisa (my partner, she's home with me)
- Tyler, Kayla, Mason (my kids, they're home)
- Darnell (my coworker, I'll see him at work)
- Marcus (my neighbor, I could text him)

Think honestly about how this week's developments land for someone in your exact
situation. What are you actually thinking? What do you feel? What are you going to do?

You can choose to:
- Talk to someone (pick 1-2 people from the list above)
- Post something on social media (or choose not to)

Respond as JSON:
{
  "internal_monologue": "Raw, honest stream of thought...",
  "sentiment": -0.8,
  "conviction": 75,
  "public_statement": "What I'd actually say out loud to people",
  "position": "your primary response to this situation",
  "elaboration": "Specifically what you're planning to do — concrete steps, resources, timeline, obstacles",
  "actions": [
    {"type": "talk_to", "who": "Lisa", "topic": "..."},
    {"type": "post", "platform": "x", "content": "..."}
  ]
}

3.4 Merged Pass 1 + Pass 2 (With Caveat)

Why we originally split into 2 passes: Single-pass role-play + classification caused 83% central tendency — agents hedged toward safe middle options because "be this person" and "pick from these categories" competed in the same generation.

Why merging might work now: The v2 merged pass is structured differently. The agent reasons freely FIRST (unconstrained monologue), then fills in structured fields AFTER, within the same generation. The classification is downstream of the reasoning rather than competing with it.

This is a hypothesis that needs testing. A/B test: run 200 agents with merged pass vs 2-pass, compare outcome distributions. If merged produces the same spread as 2-pass, keep it (saves 1 round trip per agent, ~50% fewer LLM calls). If it collapses to center, revert to 2-pass. The architecture supports both — the engine just needs a flag.
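
The document does not fix a metric for "same spread". One possible comparison, offered as an assumption rather than the agreed test: total variation distance between the two position distributions, plus the share of agents on the single most common option as a rough collapse indicator.

```python
from collections import Counter


def distribution(positions: list[str]) -> dict[str, float]:
    """Relative frequency of each position."""
    counts = Counter(positions)
    total = sum(counts.values())
    return {option: n / total for option, n in counts.items()}


def total_variation_distance(p: dict[str, float], q: dict[str, float]) -> float:
    """0.0 for identical distributions, 1.0 for fully disjoint ones."""
    options = set(p) | set(q)
    return 0.5 * sum(abs(p.get(o, 0.0) - q.get(o, 0.0)) for o in options)


def modal_share(positions: list[str]) -> float:
    """Fraction of agents on the most common option."""
    counts = Counter(positions)
    return counts.most_common(1)[0][1] / len(positions)


two_pass = ["cancel"] * 50 + ["pay_own_account"] * 50
merged = ["cancel"] * 25 + ["pay_own_account"] * 75
distance = total_variation_distance(distribution(two_pass), distribution(merged))
```

A small distance together with a similar modal share would support keeping the merged pass; the acceptance threshold is left open.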

Note on Pass 2 context: The current Pass 2 already receives the Pass 1 reasoning text (not completely disconnected). What it lacks is the full persona/exposures/peers — but the reasoning already reflects all of that. The real win from merging is latency reduction and cost savings, not fixing a context gap.

In merged mode, the single call produces:

Free-form reasoning (internal_monologue, elaboration)
Structured outputs (sentiment, conviction, position, outcomes)
Actions (talk_to, post)

For known outcome scenarios, position must match one of the defined options. The prompt includes the options explicitly:

"position": one of ["pay_own_account", "cancel", "switch_service", "workaround"]

For exploratory outcome scenarios, position is free-form:

"position": "describe your primary response in a few words"

Any categorization of free-form positions happens post-hoc in the agentic harness on exported data, not during simulation.

3.5 Conversation System

When an agent's actions include talk_to, the engine resolves it:

Who the agent talks to is LLM-driven — the agent sees their available contacts and picks who's relevant based on what THEY need right now. Travis might choose Darnell over Lisa tonight because he needs field-specific intel about automation in services, not a budget argument. The LLM makes this choice in context, so it's naturally topic-aware.

Conflict resolution (when two people want to talk to the same person) uses scenario-defined relationship_weights × structural edge weight. These weights are auto-generated by extropy scenario (the compiler knows ASI is a workplace threat → coworker weight high, Netflix is a household product → family weight high) and can be overridden in the YAML.

1. Collect all talk_to requests across all agents
2. Build pairs: (initiator, target, topic, priority_score)
   priority_score = structural_edge_weight × scenario_relationship_weight
3. Sort by priority_score descending
4. Identify conflicts (same agent appears in multiple pairs)
5. Non-conflicting pairs → parallel execution
6. Conflicting pairs → sequential execution, highest priority first
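
Steps 1-6 can be sketched as greedy packing into rounds: pairs inside a round share no agent and can run in parallel, while rounds run in order. The request tuple shape, the default weight table excerpt, and the 1.0 fallback for edge types missing from relationship_weights are assumptions.

```python
DEFAULT_EDGE_WEIGHTS = {"partner": 1.0, "household": 0.9, "close_friend": 0.7,
                        "coworker": 0.6, "neighbor": 0.4, "acquaintance": 0.2}


def priority(edge_type: str, relationship_weights: dict[str, float]) -> float:
    """priority_score = structural_edge_weight x scenario_relationship_weight."""
    return DEFAULT_EDGE_WEIGHTS[edge_type] * relationship_weights.get(edge_type, 1.0)


def schedule_conversations(requests, relationship_weights):
    """requests: (initiator, target, edge_type) tuples -> list of parallel rounds."""
    ranked = sorted(
        requests, key=lambda r: priority(r[2], relationship_weights), reverse=True
    )
    rounds: list[list[tuple]] = []
    for pair in ranked:
        for rnd in rounds:
            busy = {agent for p in rnd for agent in p[:2]}
            if pair[0] not in busy and pair[1] not in busy:
                rnd.append(pair)  # no shared agent: safe to run in parallel
                break
        else:
            rounds.append([pair])  # conflicts with every earlier round: runs later
    return rounds


asi_weights = {"partner": 1.0, "coworker": 0.9, "close_friend": 0.7, "neighbor": 0.3}
requests = [
    ("agent_0045", "agent_0020", "coworker"),
    ("agent_0020", "agent_0021", "partner"),
    ("agent_0050", "agent_0051", "neighbor"),
]
rounds = schedule_conversations(requests, asi_weights)
```

Because pairs are processed in descending priority, a conflicting lower-priority pair always lands in a later round than the pair it conflicts with.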

For each conversation pair (medium fidelity — 2 turns / 4 messages):
  Message 1: Initiator speaks (LLM call with initiator's context + topic)
  Message 2: Target responds (LLM call with target's context + initiator's statement)
  Message 3: Initiator replies (LLM call with updated context)
  Message 4: Target wraps up (LLM call — knows this is the last message)

For high fidelity — 3 turns / 6 messages:
  Messages 1-4: Same as above
  Message 5: Initiator follow-up
  Message 6: Target final response (knows this is the last message)

The LLM is told the turn limit upfront so it can wrap up naturally.

  Output for EACH participant (independent — both keep their own final state):
  {
    "response": "What they said",
    "updated_sentiment": float,
    "updated_conviction": int,
    "updated_position": "if changed",
    "internal_reaction": "What they actually thought during this conversation"
  }

Agent-NPC conversations: When Travis talks to Tyler (NPC kid), the engine generates Tyler's side using a system prompt built from Tyler's NPC profile:

You are Tyler McAllister, 17 years old, a high school senior living with your dad
Travis (43, services) and mom Lisa (41, retail) in Greenville, SC. You're about
to graduate. Respond naturally as a teenager would to your dad.

Tyler's response is LLM-generated but Tyler has no persistent state. Only Travis's state updates.

Cost: ~4 LLM calls per conversation at medium (2 turns), ~6 at high (3 turns). Fast model. At medium fidelity, only the top-1 edge (partner/closest) gets a conversation per timestep. At high fidelity, top 2-3 edges.

Conversation vs mechanical rules: For agents who HAD a conversation this timestep, the conversation output IS their final state. Bounded confidence, flip resistance, conviction decay do NOT apply on top. These mechanical rules exist to approximate social influence when no conversation happens. For agents who did NOT converse, the mechanical rules still apply as the "passive influence" layer — the effect of scrolling past posts, overhearing things, seeing opinions without engaging.
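
The precedence rule can be shown in a few lines. This is a sketch, not the engine's code: `finalize_state`, the dict-shaped state, and the `apply_mechanical_rules` callback are all hypothetical stand-ins for whatever the engine actually uses.

```python
from typing import Callable, Optional

# Hypothetical state shape, e.g. {"sentiment": -0.4, "conviction": 3, "position": "cancel"}.
AgentState = dict


def finalize_state(
    provisional: AgentState,
    conversation_output: Optional[AgentState],
    apply_mechanical_rules: Callable[[AgentState], AgentState],
) -> AgentState:
    """Pick the agent's end-of-timestep state."""
    if conversation_output is not None:
        # The agent conversed: the conversation output is final, with no
        # bounded confidence, flip resistance, or conviction decay on top.
        return conversation_output
    # No conversation: the passive-influence layer adjusts the provisional state.
    return apply_mechanical_rules(provisional)
```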

3.6 Aggregate Mood Rendering

Current: TimestepSummary computed but never shown to agents.

New: Aggregate mood from the agent's LOCAL NETWORK rendered as fuzzy vibes.

def render_local_mood(agent_id: str, adjacency: dict, agent_states: dict) -> str:
    """Render the mood of an agent's local network as natural language."""
    # adjacency maps agent_id -> list of (neighbor_id, edge_weight) pairs.
    # Agents with no edges and neighbors with no recorded state are skipped
    # instead of raising KeyError.
    neighbors = adjacency.get(agent_id, [])
    sentiments = [
        agent_states[n].sentiment
        for n, _ in neighbors
        if n in agent_states and agent_states[n].sentiment is not None
    ]

    if not sentiments:
        return ""

    avg = sum(sentiments) / len(sentiments)
    # Population variance of neighbor sentiment: low variance means agreement.
    variance = sum((s - avg) ** 2 for s in sentiments) / len(sentiments)

    # Mood label
    if avg > 0.6: mood = "optimistic"
    elif avg > 0.2: mood = "cautiously hopeful"
    elif avg > -0.2: mood = "uncertain and mixed"
    elif avg > -0.6: mood = "anxious and worried"
    else: mood = "deeply fearful"

    # Consensus
    if variance < 0.05: consensus = "Everyone seems to feel the same way."
    elif variance < 0.15: consensus = "Most people feel similarly."
    else: consensus = "Opinions are all over the place."

    return f"Most people I know seem {mood}. {consensus}"

No numbers. No pie charts. Just vibes — the way real humans sense the mood around them.

3.7 Temporal Awareness

Every prompt includes:

Current position in time: "It's Week 3 since the announcement."
Timeline recap: Bullet list of what's happened so far (from scenario timeline).
Memory timestamps: "Week 1: I thought... Week 2: I thought..."
Exposure duration: "This is the 3rd week in a row people are talking about this."
Emotional trajectory: "I've been anxious since Week 1. Last week the panic settled into dread."

All deterministic string formatting from existing data. Zero LLM calls.
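
A minimal sketch of that string formatting, covering the time position, memory timestamps, and exposure duration lines. The function names, parameters, and the `(timestep, thought)` memory shape are assumptions made for illustration, not the engine's actual interface.

```python
def ordinal(n: int) -> str:
    """Return 1 -> '1st', 2 -> '2nd', 11 -> '11th', and so on."""
    if 10 <= n % 100 <= 20:
        suffix = "th"
    else:
        suffix = {1: "st", 2: "nd", 3: "rd"}.get(n % 10, "th")
    return f"{n}{suffix}"


def render_temporal_context(
    timestep: int,
    unit: str,                       # e.g. "week" or "day"
    event_label: str,                # e.g. "the announcement"
    memories: list[tuple[int, str]], # (timestep, first-person thought)
    exposure_streak: int,            # consecutive timesteps with exposure
) -> str:
    label = unit.capitalize()
    lines = [f"It's {label} {timestep} since {event_label}."]
    # Memory timestamps, oldest first, in the order given.
    lines += [f"{label} {t}: {thought}" for t, thought in memories]
    if exposure_streak > 1:
        lines.append(
            f"This is the {ordinal(exposure_streak)} {unit} in a row "
            "people are talking about this."
        )
    return "\n".join(lines)
```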

3.8 Memory System

Current: 3-entry sliding window of 1-sentence summaries.

New: Full reasoning history, timestamped, with emotional context.

ALL reasoning entries kept (not capped at 3)
Each entry shows: timestep label, truncated reasoning (first 3-4 sentences of raw_reasoning), emotional state at the time
Conviction trajectory rendered: "I've been firm about this since Week 1" or "I started certain but I've been wavering"
If token budget is a concern: at --fidelity low, show the last 3 full traces; at medium and high, show all. (The consolidated beliefs originally planned for high were omitted along with 3.10e.)
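
One way the fidelity-dependent memory window and the 3-4 sentence truncation might look. This is a sketch under assumptions: the entry keys (`timestep`, `emotion`, `raw_reasoning`) and all function names are hypothetical, the sentence splitter is a naive regex, and high fidelity is treated like medium because consolidated beliefs (3.10e) are marked omitted.

```python
import re


def select_memory_entries(history: list[dict], fidelity: str) -> list[dict]:
    """History is always stored in full; fidelity only limits what is shown."""
    if fidelity == "low":
        return history[-3:]
    if fidelity in ("medium", "high"):
        return list(history)
    raise ValueError(f"unknown fidelity: {fidelity!r}")


def truncate_reasoning(raw_reasoning: str, max_sentences: int = 4) -> str:
    """Keep the first few sentences, splitting after '.', '!' or '?'."""
    sentences = re.split(r"(?<=[.!?])\s+", raw_reasoning.strip())
    return " ".join(sentences[:max_sentences])


def render_memory(history: list[dict], fidelity: str, unit: str = "Week") -> str:
    lines = []
    for entry in select_memory_entries(history, fidelity):
        text = truncate_reasoning(entry["raw_reasoning"])
        lines.append(f"{unit} {entry['timestep']} (feeling {entry['emotion']}): {text}")
    return "\n".join(lines)
```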

3.9 Social Posts + Public Discourse

When an agent's actions include post:

The post content is stored with the agent's ID, timestep, and platform
Next timestep, the engine aggregates all posts into a public discourse summary:
- "My X feed is a mix of panic and dark humor. Most posts are about job security."
- "People on Reddit are sharing workaround guides."
The summary is rendered into prompts as part of the social_media_exposures slot
Individual posts are NOT shown to other agents (too many) — only the aggregate mood of the platform

For network contacts specifically: if Darnell posted on X AND Darnell is in Travis's network, Travis might see "Darnell posted on X: '...'" as a named peer exposure. A named post carries more influence than the anonymous aggregate.

3.10 Cognitive Architecture (Tiered by Fidelity)

Split into independent subsystems. Each assessed for actual value vs implementation cost.

Tier 1: Build (trivial, high impact — string formatting from existing data)

3.10a: Emotional trajectory rendering. Map sentiment history to narrative: "I started panicked. By mid-week it settled into dread. It hasn't lifted." Deterministic lookup from sentiment values + trend. Gives the LLM emotional continuity between timesteps instead of starting fresh every time. Zero cost.
3.10b: Conviction self-awareness. "I've been firm about this since Week 1" or "I started certain but I've been getting less sure." Deterministic from conviction history. Enables commitment bias (consistent agents resist change) and openness (wavering agents are more receptive). Zero cost.

Tier 2: Build at high fidelity (medium effort, good value)

3.10c: Internal monologue vs external action (THINK vs SAY). Schema change — output includes both internal_monologue (raw, honest) and public_statement (socially filtered). Replaces the mechanical public/private split with agent-generated divergence. An agent with high agreeableness might have a large gap between what they think and what they say — that's interesting data. Schema change only, no new system.
3.10d: Repetition detection + forced deepening. If Jaccard similarity on word-level trigrams between consecutive reasonings > 70%, inject a prompt nudge: "You've been thinking the same thing for several days. Has anything actually changed? Are you starting to doubt your plan? Have you actually done anything about it?" Simple trigram comparison, no embeddings needed. Prevents the stale convergence we saw in the ASI run ("No change — save, learn AI, backup income" × 5 timesteps). Without this, agents converge to identical outputs and the sim produces meaningless duplicate reasoning.
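
The trigram comparison in 3.10d fits in a few lines. The sketch below assumes lowercase word tokenization with punctuation dropped, which the text does not specify; `word_trigrams`, `trigram_jaccard`, and `needs_deepening` are hypothetical names.

```python
import re


def word_trigrams(text: str) -> set[tuple[str, ...]]:
    """Set of consecutive 3-word windows, case-insensitive, punctuation dropped."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {tuple(words[i:i + 3]) for i in range(len(words) - 2)}


def trigram_jaccard(a: str, b: str) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B| over word-level trigrams."""
    ta, tb = word_trigrams(a), word_trigrams(b)
    if not ta and not tb:
        # Both texts are shorter than three words: nothing to compare.
        return 0.0
    return len(ta & tb) / len(ta | tb)


def needs_deepening(previous: str, current: str, threshold: float = 0.70) -> bool:
    """True when consecutive reasonings overlap by more than the threshold."""
    return trigram_jaccard(previous, current) > threshold
```

When `needs_deepening` returns true for an agent's last two reasonings, the engine would append the nudge text to that agent's next prompt.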

OMITTED: Marginal value

~~3.10e: Episodic vs semantic memory.~~ ~~After N timesteps of consistent reasoning on a theme, engine extracts a belief statement and adds to persistent "beliefs" field.~~ Omitted. The LLM already consolidates beliefs implicitly when reading its own full history. Making it explicit adds complexity for marginal gain. The full memory trace (uncapped, timestamped) provides sufficient context.

CUT: Not building

~~3.10f: Attention/focus weighting.~~ ~~What the agent is currently focused on determines which inputs are foregrounded in the prompt.~~ Cut. The LLM already does this natively — if the memory trace says "budget first," the model naturally attends to budget-related inputs. Artificially weighting prompt sections is trying to replicate what attention heads already do. Unnecessary complexity.

DEFERRED: Post-launch

3.10g: Spontaneous memory recall. Memories surface by RELEVANCE to current events, not recency. Requires embedding stored memories and comparing to current context. Small embedding model on short text, cheap per-call, but needs embedding infrastructure. Deferred because: for most scenarios with 10-15 timesteps, recency and relevance overlap heavily — the memory that's relevant IS usually recent. This matters more for long-running sims (50+ timesteps) where early memories might be contextually triggered. Build it when we actually have that use case.

Fidelity tier mapping:

Feature	low	medium	high
3.10a emotional trajectory	Yes	Yes	Yes
3.10b conviction self-awareness	Yes	Yes	Yes
3.10c THINK vs SAY	No	No	Yes
3.10d repetition detection	No	No	Yes
~~3.10e episodic/semantic memory~~	—	—	Omitted
~~3.10f attention weighting~~	—	—	Cut
3.10g spontaneous recall	No	No	Deferred

PHASE 4: RESULTS + ANALYSIS

4.1 `extropy results` — Enhanced

Current: Outcome distributions, segment breakdowns, timeline visualization.

New: Same, plus structured export of elaboration texts for exploratory outcomes (no built-in clustering) and a conversation analysis summary.

Known Outcome Results (Same as Current)

Position Distribution:
  pay_own_account: 43.2%
  cancel: 12.1%
  switch_service: 18.7%
  workaround: 15.3%
  stay_shared_with_fee: 10.7%

Segment: age_bracket
  18-25: cancel 22%, workaround 31%
  26-35: pay_own 48%, switch 23%
  36-50: pay_own 52%, stay_shared 18%
  50+: pay_own 61%, cancel 8%

Exploratory Outcome Results (New)

For type: open_ended outcomes, the raw elaboration texts are exported as structured data. No built-in clustering. Post-hoc analysis (clustering, thematic coding, embedding, etc.) is done by the agentic harness or manual data science workflows — not baked into extropy results.

extropy results exports:

1. All elaboration texts with agent_id, demographics, timestep
2. Summary statistics (word count distribution, unique response count)
3. Structured JSON/CSV ready for downstream DS tooling

The real value of exploratory outcomes — discovering categories nobody would have pre-defined — comes from the analysis layer, not the simulation engine.

Conversation Analysis (New)

Conversation Summary:
  Total conversations: 3,847 (across 12 timesteps)
  Average turns: 2.4
  State changes from conversations: 891 (23% of conversations changed someone's mind)

Most impactful conversations (by state change magnitude):
  1. Travis ↔ Lisa (Week 1): Travis shifted from "upskilling" to "survival-first" after budget discussion
  2. Graham ↔ Sandra (Week 1): Both solidified "raise prices 10%" after doing the math together
  ...

Conversation themes (clustered):
  - Financial planning (34%): Budget discussions, cost-cutting
  - Emotional support (28%): Reassurance, shared anxiety
  - Information sharing (22%): "Did you see...", "Have you heard..."
  - Disagreement (16%): Different coping strategies, arguments

FIDELITY TIERS

--fidelity low|medium|high controls what goes into prompts and whether conversations happen.

Feature	low	medium	high
Names & household data	Yes	Yes	Yes
Temporal awareness	Yes	Yes	Yes
Aggregate mood (local)	Yes	Yes	Yes
Named peer opinions	Top 5	Top 10 + consensus signal	All connected + demographics
Day phase templates	Yes (if defined)	Yes	Yes
Channel experience	Default templates	Scenario-defined variants	Full demographic adaptation
Memory	Last 3 full reasoning traces	All traces, timestamped	All traces, timestamped (consolidated beliefs omitted, see 3.10e)
Emotional trajectory	Label only	Label + trend	Full trajectory narrative
Conversations	None	Top 1 edge (partner/closest)	Top 2-3 edges
Internal monologue	Not separated	Not separated	Explicit THINK vs SAY
Repetition detection	No	No	Yes
Social posts	Stored, not rendered	Stored + aggregate	Stored + aggregate + named peer posts
Scenario timeline	Yes	Yes	Yes
Pass structure	Merged (test vs 2-pass)	Merged (test vs 2-pass)	Merged (test vs 2-pass)

Cost Estimates (Conservative)

Per-call token budgets:

	Input tokens	Output tokens
low reasoning	~2.5k	~500
medium reasoning	~3.5k	~600
medium conversation turn	~2k	~200
high reasoning	~4.5k	~700
high conversation turn	~2k	~200

Total cost (gpt-5-mini: $0.30/$1.50 per 1M tokens, 15 timesteps):

	2k agents	10k agents
Current system	~$40	~$200
low	~$59	~$295
medium	~$72	~$360
high	~$99	~$495

Total cost (Sonnet-class: ~$3/$15 per 1M tokens, 15 timesteps):

	2k agents	10k agents
Current system	~$480	~$2,400
low	~$700	~$3,500
medium	~$860	~$4,300
high	~$1,180	~$5,900

Wall time (1k RPM, conservative):

	2k agents	10k agents
Current system	~30 min	~2.5 hrs
low	~45 min	~4 hrs
medium	~1.2 hrs	~6 hrs
high	~1.5 hrs	~8 hrs

Default: medium. Best cost/quality tradeoff. Names, narrative, temporal awareness, full memory, the one conversation that matters most (partner), aggregate mood. ~$360 for 10k agents on gpt-5-mini. ~1.8x current cost for a fundamentally better simulation.

TENET COMPLIANCE: 12 TENETS OF HIGH-FIDELITY POPULATION SIMULATION

The 12 tenets below define what a high-fidelity simulation must satisfy. Tenets 1-4 are about who the agents are, 5-8 are about how they decide, 9-12 are about what shapes and connects them.

Assessment Against v2 Architecture

#	Tenet	Status	What v2 Has	Remaining Gap
1	Realistic distributions	Strong	Household-based correlated sampling, fat-tail distributions (lognormal/beta), conditional modifiers, grounded in census/research data	Need hard calibration targets for joint distributions (e.g., income × education × race correlation matrices)
2	Information asymmetry	Strong	Conditional exposure rules, channel-specific credibility, network propagation with decay, scenario timeline with staggered events	—
3	Social hierarchy & influence topology	Partial	Structural role edges, degree multipliers in network config, edge weight hierarchy	No explicit power-law degree enforcement; no hub/opinion-leader generation. Need scenario-dependent centrality targets
4	Behavioral heterogeneity	Partial	Big Five personality, risk tolerance, institutional trust, cognitive attributes vary per agent	Decision-policy heterogeneity relies entirely on LLM interpretation of persona. Need explicit behavioral parameters (conformity threshold, action inertia) as agent attributes
5	Temporal dynamics & decay	Strong	Conviction decay, temporal prompt awareness, emotional trajectory, memory history, scenario timeline with evolving events	Need intent→action accountability loop (surface prior action_intent, ask about follow-through)
6	Social contagion & network effects	Partial	Network propagation, share modifiers, conversation system, aggregate mood, peer opinions	No explicit threshold/complex contagion. Need per-agent conformity parameter + conformity-aware prompt phrasing
7	Friction & transaction costs	Weak	option_friction on outcomes, bounded confidence mechanics	Biggest gap. Need explicit intent→behavior pipeline: surface what agent planned vs what they actually did. Friction emerges from agent constraints but isn't tracked or measured
8	Bounded rationality & heuristics	Strong	LLM is inherently a bounded rationality engine. Persona attributes (education, digital literacy, neuroticism) shape heuristic use. Agents satisfice, anchor, exhibit status quo bias naturally	Could strengthen with explicit bias nudges in prompts for specific attributes
9	Environmental & contextual sensitivity	Partial	Scenario timeline handles exogenous shocks. Channel templates adapt to agent demographics	Need ambient macro context in every prompt (economic conditions, cultural moment). Need previous-timestep macro summary injection
10	Identity & group membership	Partial	race_ethnicity, political_orientation, religious_affiliation in persona. Social role edges create in-group connections	Need identity-threat framing: when the scenario threatens a group identity, persona rendering should explicitly flag it as identity-relevant
11	Preference interdependence	Partial	Aggregate mood rendering ("most people I know are doing X"), peer opinions, social posts. Bandwagon/FOMO effects emerge from context	Named peer opinions + local mood + macro summary provide social pressure without omniscient ratio framing
12	Macro-micro feedback loops	Partial	Micro→macro works (agent decisions → aggregate stats). Timeline handles exogenous macro shifts	No endogenous macro: agent behavior doesn't produce emergent macro variables that feed back. Need at minimum: inject previous timestep aggregates as ambient context

Concrete Fixes to Close Gaps

These are the minimum changes needed to move every tenet to Strong. Listed in implementation priority order.

Fix 1: Named + Temporal + Accountable Prompt Context

Files: reasoning.py, engine.py, state.py
Closes: Tenets 5 (temporal), 7 (friction), 9 (context)

Replace all anonymous peer phrasing ("Someone in your network told you") with named + relationship context ("My coworker Darnell told me")
Include day/week label in every prompt ("It's Week 3 since the announcement")
Include channel experience text with credibility framing ("I saw a CNN segment" not "mainstream_news_media")
Include local mood summary as vibes, not numbers ("Everyone around me seems scared")
Remove 3-entry memory cap. Surface full reasoning history, timestamped.
Surface prior action_intent each re-reasoning step. "Last week I said I'd sign up for a free AI course. Did I actually do it?" Forces the agent to reconcile intent with behavior. The friction between "I said I would" and "I actually did" emerges from the agent's own constraints (time, money, energy, competing priorities). This is the intent→behavior pipeline — not a separate outcome field, just prompt accountability.

Fix 2: Household-First Sampling

Files: sampler/core.py, models/population.py
Closes: Tenet 1 (realistic distributions: correlated household demographics)

Couple households sampled as pairs with correlated demographics (age, education, race via census joint distributions)
Shared economic fate attributes (household_income, savings, housing)
NPC dependent generation (kids with names, ages, school status)
household_id, partner_id, first_name, last_name, dependents on every agent
Hard calibration targets for joint distributions: use census PUMS microdata for income × education × race × age cross-tabulations. Validate sampled population against these targets.

Fix 3: Role-First Network + Influence Topology

Files: network/generator.py, network/config.py
Closes: Tenets 3 (hierarchy/topology), 10 (identity/group membership)

Structural edges first (partner, household, coworker, neighbor, congregation, school_parent) — these are deterministic from agent attributes
Similarity edges fill remaining degree budget
Calibration loop CANNOT delete structural edges
Power-law degree enforcement (scenario-dependent): Network config gets a degree_distribution_target field. For scenarios with opinion leaders (ASI, political), target a power-law with configurable exponent. For scenarios without (Netflix, product change), target a more uniform degree distribution. The scenario compiler generates this based on the event type.
Identity clustering: Agents sharing identity-relevant attributes (political_orientation, religious_affiliation, race_ethnicity) get higher intra-group edge density. When the scenario involves identity threat (Bud Light, political events), the network config increases in-group clustering for the relevant identity dimension. Implemented via targeted blocking attributes in the existing calibration system.

Fix 4: Contagion Thresholds + Macro Feedback

Files: engine.py, reasoning.py, models/simulation.py
Closes: Tenets 6 (contagion), 9 (environmental context), 11 (preference interdependence), 12 (macro-micro feedback)

Soft conformity/threshold behavior:

Add conformity as a standard personality attribute (0-1 scale, correlated with agreeableness). Sampled at population creation time.
Inject conformity self-awareness into prompt: "I tend to go along with what most people around me are doing" (high) or "I tend to form my own opinion regardless of what others think" (low). Mid-range agents get no explicit phrasing.
Social pressure is conveyed through existing mechanisms, not explicit ratios:
- Named peer opinions: "My coworker Darnell thinks X"
- Local mood rendering: "Most people around me seem worried"
- Macro summary: "The general mood is shifting toward X"
Rationale: People don't actually know "7 out of 10 contacts did X" — that's omniscient narrator framing. Real social pressure comes from specific conversations and vague impressions, which the peer opinion and mood systems already capture.
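
A sketch of the conformity self-awareness rendering. The 0.3 and 0.7 cut-offs are assumptions chosen for illustration (the text only says high, low, and mid-range), and `render_conformity` is a hypothetical name.

```python
def render_conformity(conformity: float, low: float = 0.3, high: float = 0.7) -> str:
    """Map a 0-1 conformity score to a first-person line, or nothing mid-range."""
    if not 0.0 <= conformity <= 1.0:
        raise ValueError("conformity must be on a 0-1 scale")
    if conformity >= high:
        return "I tend to go along with what most people around me are doing."
    if conformity <= low:
        return "I tend to form my own opinion regardless of what others think."
    # Mid-range agents get no explicit phrasing.
    return ""
```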

Macro state feedback:

After each timestep, compute macro summary from TimestepSummary data:
- Position distribution rendered as "Most people are choosing X. A growing minority is doing Y."
- Sentiment trend: "The general mood is getting worse / stabilizing / improving."
- Exposure saturation: "Almost everyone has heard about this now."
- Action momentum: "More and more people are taking action" / "Most people are still waiting"
Inject this into every agent's next-timestep prompt as ambient context, rendered as what the agent would sense from media/social feeds, not raw numbers.
This closes the macro→micro loop: agent decisions → aggregate stats → rendered as ambient context → influences next round of agent decisions.
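
The macro summary could be rendered roughly as follows. Everything here is a sketch under assumptions: the inputs are taken to be position shares and mean sentiment for the previous and current timestep plus an exposure rate, position labels are assumed to be pre-rendered phrases, the 0.05 change threshold and 0.9 saturation cut-off are invented for the example, and the wording is adapted slightly so labels read naturally.

```python
def render_macro_summary(
    prev_shares: dict[str, float],
    shares: dict[str, float],
    prev_sentiment: float,
    sentiment: float,
    exposure_rate: float,
    eps: float = 0.05,
) -> str:
    """Turn timestep aggregates into ambient, number-free prompt text."""
    parts = []

    # Position distribution: name the leading position.
    leader = max(shares, key=shares.get)
    parts.append(f"Most people are leaning toward {leader}.")

    # Growing minority: the non-leading position that gained the most share.
    growth = {p: s - prev_shares.get(p, 0.0) for p, s in shares.items() if p != leader}
    if growth:
        riser = max(growth, key=growth.get)
        if growth[riser] > eps:
            parts.append(f"A growing minority is leaning toward {riser}.")

    # Sentiment trend.
    delta = sentiment - prev_sentiment
    if delta > eps:
        trend = "improving"
    elif delta < -eps:
        trend = "getting worse"
    else:
        trend = "stabilizing"
    parts.append(f"The general mood is {trend}.")

    # Exposure saturation.
    if exposure_rate >= 0.9:
        parts.append("Almost everyone has heard about this now.")
    return " ".join(parts)
```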

Ambient scenario context:

ScenarioSpec gets an optional background_context field: "The US economy is in a mild recession. Unemployment was at 4.5% before the AI announcement. It's early spring."
Injected into every prompt as ambient framing. Helps agents reason about environmental context beyond just the focal event.

Fix 5: Backtesting Harness

Files: tests/, new extropy/validation/ module
Closes: Validation across all tenets + falsifiability + uncertainty quantification

Historical replay suites:

Curate ground-truth datasets for completed scenarios where actual outcomes are known:
- Netflix password crackdown (actual churn rate, account creation rate by demographic)
- Bud Light boycott (actual sales drop by region, recovery timeline)
- London ULEZ expansion (compliance rate, protest participation, price pass-through)
Run sim on these scenarios with matched population demographics
Compare sim output distributions to actual observed outcomes

Acceptance gates:

Per-tenet metrics:
- Tenet 1 (distributions): KL divergence between sampled population and census targets < threshold
- Tenet 3 (topology): Degree distribution fits target (power-law exponent within ±0.2)
- Tenet 6 (contagion): Adoption curve shape matches empirical S-curves (timing, steepness)
- Tenet 7 (friction): Intent-to-behavior conversion rate within 2x of observed real-world rates
Aggregate error: mean absolute error on outcome distribution vs ground truth < 15%
Uncertainty quantification: run same scenario with 5 different seeds, report mean ± std on all metrics. If std > 20% of mean on key outcomes, the sim is too noisy.
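
Two of these gates, the aggregate error and the seed-noise check, are simple enough to sketch. Assumptions: outcome distributions are dicts of shares on a 0-1 scale (so the 15% gate becomes 0.15), the noise check uses the sample standard deviation across seeds, and the function names are hypothetical.

```python
from statistics import mean, stdev


def outcome_mae(simulated: dict[str, float], observed: dict[str, float]) -> float:
    """Mean absolute error across outcome categories; missing categories count as 0."""
    categories = set(simulated) | set(observed)
    return mean([abs(simulated.get(c, 0.0) - observed.get(c, 0.0)) for c in categories])


def passes_error_gate(simulated: dict[str, float], observed: dict[str, float],
                      max_mae: float = 0.15) -> bool:
    return outcome_mae(simulated, observed) < max_mae


def is_too_noisy(metric_across_seeds: list[float], max_ratio: float = 0.20) -> bool:
    """True when the std across seeds exceeds max_ratio of the mean."""
    m = mean(metric_across_seeds)
    return stdev(metric_across_seeds) > max_ratio * abs(m)
```

`is_too_noisy` needs at least two seed runs, since the sample standard deviation is undefined for a single value.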

Reproducibility:

All runs seeded and deterministic given same seed + same model
Results include full provenance: model version, prompt templates, scenario YAML hash, population seed

VALIDATION & EVALUATION PLAN

Output Quality

A/B comparison: Run same population + scenario through old system and new system (medium fidelity). Blind human eval on 50 agent outputs: "Which reads more like a real person's reasoning?" Measure win rate.
Outcome distribution stability: Run same scenario at low/medium/high fidelity. Outcome distributions should be SIMILAR — fidelity controls richness of reasoning, not WHAT people decide. If distributions diverge significantly across tiers, the richer prompts are causing systematic bias.
Hallucination audit: Sample 100 agent outputs. Check every factual claim against the prompt context. Agents should not invent information not in their exposures/memory/peers. Richer prompts = more grounding = less hallucination expected, but verify.

Conversation Quality

In-character consistency: Sample 50 conversations. Both agents should stay in character (demographics, personality, relationship). Neither should suddenly become eloquent if their persona is low-education.
State change plausibility: For conversations that changed an agent's position, verify the change makes sense given what was said. "Lisa convinced Travis to focus on cash" should show Lisa making a compelling financial argument, not Travis randomly flipping.

Elaboration Quality

Scenario awareness: Elaborations should be contextually appropriate. ASI scenario should NOT produce "I'll take an online course" if the premise is superhuman AI. Netflix scenario should NOT produce existential crisis responses.
Demographic consistency: An agent with digital_literacy: basic should not describe a plan involving "fine-tuning open-source models." Elaborations should reflect the agent's actual capabilities and constraints.

Exploratory Outcome Validation

Export completeness: For exploratory outcomes, verify all agent elaborations are exported with correct agent_id, demographics, and timestep. Downstream analysis (clustering, thematic coding) is done by the agentic harness or manual DS workflows — not validated by the engine.

IMPLEMENTATION ORDER

Aligned with both the original issue priorities AND the tenet-closing fixes above.

Phase A: Prompt Overhaul — Named, Temporal, Accountable (~1.5 weeks)

Files: reasoning.py, persona.py, engine.py, state.py
Tenet fixes: 5 (temporal), 7 (friction/accountability), 8 (bounded rationality via better prompts)

Agent names from SSA + census data (bundled CSV, zero API calls)
First-person prompt voice ("I'm Travis" not "You are a 43-year-old")
Temporal awareness: timestep + unit in prompt, timestamped memory
Full memory: remove 3-entry cap, surface raw_reasoning with timestamps
Intent accountability: surface prior action_intent in re-reasoning prompt, in first person ("Last week I said I'd X. Did I?")
Named peer opinions (use agent names from population, relationship type from network edge)
Channel experience rendering (experiential text, not channel labels)
Local mood summary (vibes from local network, not global numbers)
Macro summary injection (previous timestep aggregates as ambient context)

Ship this alone. Every simulation immediately feels more human, and the accountability loop closes the intent→behavior gap.

Phase B: Household Sampling + Role Network (~2.5 weeks)

Files: sampler/core.py, models/population.py, network/generator.py, network/config.py
Tenet fixes: 1 (realistic joint distributions), 3 (influence topology), 10 (identity/group membership)

Household-based sampling (couples co-sampled, NPC dependents generated)
household_id, partner_id, first_name, last_name, dependents on agents
Joint distribution calibration targets from census PUMS data
Structural edge generation: partner, household, coworker, neighbor, congregation, school_parent
Similarity edges fill remaining degree budget
Structural edges protected from calibration pruning
Scenario-dependent degree distribution targets (power-law for opinion-leader scenarios, more uniform for product scenarios)
Identity clustering: in-group edge density boost for identity-relevant attributes
conformity as standard personality attribute (for threshold behavior in Phase C)

Phase C: Contagion + Scenario Timeline + Macro Feedback (~1.5 weeks)

Files: models/scenario.py, engine.py, reasoning.py
Tenet fixes: 6 (contagion thresholds), 9 (environmental context), 11 (preference interdependence), 12 (macro-micro feedback)

Scenario timeline: sequence of events at specified timesteps
Timeline injection into agent prompts as "what's happened since last time"
Named peer opinions + local mood convey social pressure without explicit ratios
Conformity-aware prompt rendering, in first person ("I tend to go along with what most people around me are doing" / "I tend to form my own opinion")
Ambient scenario context field (background_context in ScenarioSpec)
Macro state feedback: timestep aggregates rendered as ambient vibes in next prompt
Merged Pass 1 + Pass 2 (with A/B test against 2-pass to validate no central tendency regression)
Outcome track inference (known vs exploratory from schema)

Phase D: Conversations + Narrative Prompts (~2 weeks)

Files: engine.py, new conversation.py, reasoning.py
Prerequisite: DB schema for conversations/posts/action_history defined before starting.

Conversation resolution: priority queue using scenario relationship_weights × structural edge weight
Agent-agent conversations: 2 turns at medium fidelity (4 messages), 3 turns at high (6 messages). LLM told the turn limit upfront. Both agents keep their own independent final state.
Agent-NPC conversations (LLM-generated NPC responses from profile)
Phase 2 reasoning is provisional; Phase 3 conversation overrides
Mechanical rules (bounded confidence, flip resistance) only for agents who didn't converse
Day phase templates (optional, adapts to timestep unit)
Social posts + public discourse aggregation

Phase E: Cognitive Architecture — COMPLETE ✓

Files: reasoning.py, engine.py, text_utils.py

✅ Emotional trajectory rendering (all tiers — deterministic string formatting)
✅ Conviction self-awareness (all tiers — deterministic)
✅ THINK vs SAY separation (high fidelity — prompt-only, uses reasoning field)
✅ Repetition detection + deepening nudge (trigram Jaccard >70% threshold)
~~Episodic/semantic memory consolidation~~ — Omitted

Phase F: Fidelity Tiers + Results — COMPLETE ✓

Files: engine.py, reasoning.py, aggregation.py, CLI

✅ --fidelity low|medium|high flag on SimulationRunConfig
✅ Fidelity-gated feature inclusion (conversations, memory depth, peer limits 5/5/10)
✅ Exploratory outcome export (elaborations.csv with agent demographics + all outcomes)
✅ Conversation analysis (compute_most_impactful_conversations — ranks by sentiment+conviction delta)
✅ Social posts export (social_posts.json)

Phase G: Backtesting Harness — DEFERRED

Files: tests/, new extropy/validation/ module

Historical replay suites: Netflix, Bud Light, ULEZ (curate ground truth data)
Per-tenet acceptance metrics (distribution KL divergence, adoption curve shape, intent-to-behavior conversion)
Aggregate error gates (MAE < 15% on outcome distributions)
Uncertainty quantification (5-seed runs, report mean ± std)
Reproducibility: full provenance in results (model version, prompt hash, scenario hash, seed)

Total estimated: ~13 weeks for full v2.

Status: Phases A-F COMPLETE. Phase G (backtesting) is deferred — requires curating ground-truth datasets for historical scenarios.
