feat: add match.systemMessage fixture matcher (v1.20.0) (#173)

BenTaylorDev · web-flow · commit 0b04f4eecb72 · 2026-05-11T08:14:16.000-05:00
## Summary

Adds a new `match.systemMessage` fixture matcher that gates a fixture on
a substring (or regexp) found inside the concatenated text of every
`role: "system"` message in the request.

Existing matchers (`userMessage`, `hasToolResult`, `toolName`,
`sequenceIndex`, `turnIndex`) only inspect the user prompt and tool-flow
shape. When a calling app exposes UI controls that mutate the agent
context (e.g. a CopilotKit demo with a name / timezone / preferences
pane that feeds `useAgentContext`), the user prompt is identical across
state changes, and a substring `userMessage` match keeps winning. The
fixture's baked response then leaks the *old* state values, producing
confusingly-wrong output that looks like a fixture-replay bug to whoever
is using the demo.

This matcher closes that hole at the matcher layer instead of forcing
fixture authors to make their canned responses state-agnostic.

## API

JSON form:
```json
{
  "match": {
    "userMessage": "What do you know about me from my context?",
    "systemMessage": "name=Atai"
  },
  "response": { "content": "Hi Atai, I know your timezone and recent activity." }
}
```

Programmatic form accepts `string | RegExp`. Combined with `userMessage`
(or any other matcher) the standard **AND** semantics apply — all
specified fields must match.

## Semantics

- Scans every `role: "system"` message in the request (not just the last
one) — hosts that build a system context from multiple sources (persona
+ agent-context entries + tool guidance) routinely emit several system
messages per request.
- Joins their text with `\n` so a single substring or regexp sees the
whole context as one body.
- Multi-part content (`[{type: "text", text: "..."}]`) is extracted via
the existing `getTextContent` helper, matching `userMessage` behaviour.
- Case-sensitive string matching, honors `requestTransform`'s
exact-match mode, mirroring `userMessage`.
- No match (no system messages, or substring/regexp miss) → fixture
falls through to the next fixture or upstream proxy, as expected.

## Files

- `src/types.ts` — `FixtureMatch.systemMessage: string | RegExp`;
`FixtureFileEntry.match.systemMessage: string` for JSON
- `src/router.ts` — new `getSystemText()` helper + matcher block placed
after `userMessage` and before `toolCallId`
- `src/fixture-loader.ts` — `entryToFixture` passthrough, type
validation (must be string), inclusion in catch-all discriminator set
- `src/__tests__/router.test.ts` — 11 new tests covering string, regexp,
multi-system, array content, combined-with-userMessage, fall-through,
no-system-messages, plus `getSystemText` unit tests
- `src/__tests__/fixture-loader.test.ts` — `entryToFixture` passthrough
test + non-string validation error test
- `README.md`, `skills/write-fixtures/SKILL.md` — documented as a peer
of `userMessage`
- `CHANGELOG.md`, `package.json`, `.claude-plugin/plugin.json`,
`charts/aimock/Chart.yaml` — version 1.19.5 → 1.20.0 (new feature →
minor bump). The pre-existing `[Unreleased]` entries (drift-test
vacuous-assertion fix, proxy relay status normalization) ride along
under the 1.20.0 heading per existing changelog convention.

## Test plan

- [x] `pnpm run lint` — clean
- [x] `pnpm run build` — clean (tsdown)
- [x] `pnpm exec vitest run src/__tests__/router.test.ts
src/__tests__/fixture-loader.test.ts` — 218 / 218 pass
- [x] `pnpm run test` — 2856 / 2857 pass; 1 failure is a pre-existing
Windows-path-separator assertion in `fixtures-remote.test.ts` (verified
failing on `origin/main` before any change) — passes on Linux CI
- [x] `pnpm run format:check` — clean except for
`.claude/commands/write-fixtures.md`, which is a Windows-checkout
artifact of a `120000` symlink (verified `git ls-files -s`); on Linux
prettier follows the symlink to the real target

## Why minor and not patch

New backwards-compatible matcher field — fixtures without
`systemMessage` are unaffected. SemVer minor.
diff --git a/.claude-plugin/plugin.json b/.claude-plugin/plugin.json
@@ -1,6 +1,6 @@
 {
   "name": "aimock",
-  "version": "1.19.5",
+  "version": "1.20.0",
   "description": "Fixture authoring guidance for @copilotkit/aimock — LLM, multimedia, MCP, A2A, AG-UI, vector, and service mocking",
   "author": {
     "name": "CopilotKit"
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -1,6 +1,6 @@
 # @copilotkit/aimock
 
-## [Unreleased]
+## [1.20.0] - 2026-05-11
 
 ### Fixed
 
@@ -9,6 +9,7 @@
 
 ### Added
 
+- **`match.systemMessage` fixture matcher** — gate a fixture on a substring (or regexp) found inside the concatenated text of every `system` role message in the request. Hosts that plumb dynamic context (persona, agent-context entries, dynamic config) through system messages can now narrow a fixture to a specific context state; when the caller changes that state the fixture stops matching and the request falls through to the next fixture or upstream proxy instead of silently returning a stale baked response. JSON form: `"match": { "userMessage": "Who am I?", "systemMessage": "name=Atai" }`. Programmatic form accepts `string | RegExp`.
 - **Status code normalization tests** — 5 tests verifying proxy relay normalization (201→200, 429→502, 503→502, 401→502, SSE 429→502) with fixture preservation assertions; 2 existing tests updated to expect normalized 502
 
 ## [1.19.5] - 2026-05-09
diff --git a/README.md b/README.md
@@ -49,7 +49,7 @@ Run them all on one port with `npx @copilotkit/aimock --config aimock.json`, or
 ## Features
 
 - **[Record & Replay](https://aimock.copilotkit.dev/record-replay)** — Proxy real APIs, save as fixtures, replay deterministically forever
-- **[Multi-turn Conversations](https://aimock.copilotkit.dev/multi-turn)** — Record and replay multi-turn traces with tool rounds; match distinct turns via `turnIndex`, `hasToolResult`, `toolCallId`, `sequenceIndex`, or custom predicates
+- **[Multi-turn Conversations](https://aimock.copilotkit.dev/multi-turn)** — Record and replay multi-turn traces with tool rounds; match distinct turns via `turnIndex`, `hasToolResult`, `toolCallId`, `sequenceIndex`, `systemMessage` (gate on host-supplied agent context), or custom predicates
 - **[12 LLM Providers](https://aimock.copilotkit.dev/docs)** — OpenAI Chat, OpenAI Responses, OpenAI Realtime, Claude, Gemini, Gemini Live, Gemini Interactions, Azure, Bedrock, Vertex AI, Ollama, Cohere — full streaming support
 - **Multimedia APIs** — [image generation](https://aimock.copilotkit.dev/images) (DALL-E, Imagen), [text-to-speech](https://aimock.copilotkit.dev/speech), [audio transcription](https://aimock.copilotkit.dev/transcription), [video generation](https://aimock.copilotkit.dev/video)
 - **[MCP](https://aimock.copilotkit.dev/mcp-mock) / [A2A](https://aimock.copilotkit.dev/a2a-mock) / [AG-UI](https://aimock.copilotkit.dev/agui-mock) / [Vector](https://aimock.copilotkit.dev/vector-mock)** — Mock every protocol your AI agents use
diff --git a/charts/aimock/Chart.yaml b/charts/aimock/Chart.yaml
@@ -3,4 +3,4 @@ name: aimock
 description: Mock infrastructure for AI application testing (OpenAI, Anthropic, Gemini, MCP, A2A, vector)
 type: application
 version: 0.1.0
-appVersion: "1.19.5"
+appVersion: "1.20.0"
diff --git a/package.json b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "@copilotkit/aimock",
-  "version": "1.19.5",
+  "version": "1.20.0",
   "description": "Mock infrastructure for AI application testing — LLM APIs, image generation, text-to-speech, transcription, audio generation, video generation, MCP tools, A2A agents, AG-UI event streams, vector databases, search, rerank, and moderation. One package, one port, zero dependencies.",
   "license": "MIT",
   "keywords": [
diff --git a/skills/write-fixtures/SKILL.md b/skills/write-fixtures/SKILL.md
@@ -23,6 +23,8 @@ aimock is a zero-dependency mock infrastructure for AI apps. Fixture-driven. Mul
 | ---------------- | ----------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
 | `userMessage`    | `string`                                  | Substring of last `role: "user"` message text                                                                                                                                                                                                                                                                |
 | `userMessage`    | `RegExp`                                  | Pattern test on last `role: "user"` message text                                                                                                                                                                                                                                                             |
+| `systemMessage`  | `string`                                  | Substring of the concatenated text of every `role: "system"` message in the request. Use to gate a fixture on host-supplied context (persona, agent-context entries) so changes to that context cause the fixture to fall through instead of returning a stale baked response                                |
+| `systemMessage`  | `RegExp`                                  | Pattern test on the concatenated system-message text                                                                                                                                                                                                                                                         |
 | `inputText`      | `string`                                  | Substring of embedding input text (concatenated if multiple inputs)                                                                                                                                                                                                                                          |
 | `inputText`      | `RegExp`                                  | Pattern test on embedding input text                                                                                                                                                                                                                                                                         |
 | `toolName`       | `string`                                  | Exact match on any tool in request's `tools[]` array (by `function.name`)                                                                                                                                                                                                                                    |
diff --git a/src/__tests__/fixture-loader.test.ts b/src/__tests__/fixture-loader.test.ts
@@ -940,6 +940,22 @@ describe("validateFixtures", () => {
     expect(results.filter((r) => r.message.includes("hasToolResult"))).toHaveLength(0);
   });
 
+  // --- match.systemMessage type checks ---
+
+  it("error: systemMessage is a number", () => {
+    const fixtures = [makeFixture({ match: { userMessage: "test", systemMessage: 42 as never } })];
+    const results = validateFixtures(fixtures);
+    expect(results.some((r) => r.severity === "error" && r.message.includes("systemMessage"))).toBe(
+      true,
+    );
+  });
+
+  it("no error: systemMessage is a string", () => {
+    const fixtures = [makeFixture({ match: { userMessage: "test", systemMessage: "Atai" } })];
+    const results = validateFixtures(fixtures);
+    expect(results.filter((r) => r.message.includes("systemMessage"))).toHaveLength(0);
+  });
+
   // --- Warning checks ---
 
   it("warning: duplicate userMessage", () => {
@@ -1479,6 +1495,15 @@ describe("auto-stringify JSON objects in fixture entries", () => {
     expect((fixture.response as TextResponse).content).toBe("Hello, world!");
   });
 
+  it("passes systemMessage through entryToFixture", () => {
+    const entry: FixtureFileEntry = {
+      match: { userMessage: "test", systemMessage: "name=Atai" },
+      response: { content: "ok" },
+    };
+    const fixture = entryToFixture(entry);
+    expect(fixture.match.systemMessage).toBe("name=Atai");
+  });
+
   it("stringifies nested objects in arguments", () => {
     const entry: FixtureFileEntry = {
       match: { userMessage: "test" },
diff --git a/src/__tests__/router.test.ts b/src/__tests__/router.test.ts
@@ -1,5 +1,5 @@
 import { describe, it, expect } from "vitest";
-import { matchFixture, getLastMessageByRole, getTextContent } from "../router.js";
+import { matchFixture, getLastMessageByRole, getSystemText, getTextContent } from "../router.js";
 import type { ChatCompletionRequest, ChatMessage, ContentPart, Fixture } from "../types.js";
 
 // ---------------------------------------------------------------------------
@@ -237,6 +237,164 @@ describe("matchFixture — userMessage (RegExp)", () => {
   });
 });
 
+// ---------------------------------------------------------------------------
+// getSystemText
+// ---------------------------------------------------------------------------
+
+describe("getSystemText", () => {
+  it("returns empty string when there are no system messages", () => {
+    expect(getSystemText([{ role: "user", content: "hi" }])).toBe("");
+  });
+
+  it("returns the single system message text", () => {
+    expect(
+      getSystemText([
+        { role: "system", content: "You are helpful." },
+        { role: "user", content: "hi" },
+      ]),
+    ).toBe("You are helpful.");
+  });
+
+  it("joins multiple system messages with newlines in order", () => {
+    expect(
+      getSystemText([
+        { role: "system", content: "first" },
+        { role: "user", content: "ignored" },
+        { role: "system", content: "second" },
+      ]),
+    ).toBe("first\nsecond");
+  });
+
+  it("extracts text from array-of-parts system content", () => {
+    expect(
+      getSystemText([{ role: "system", content: [{ type: "text", text: "from parts" }] }]),
+    ).toBe("from parts");
+  });
+});
+
+// ---------------------------------------------------------------------------
+// matchFixture — systemMessage
+// ---------------------------------------------------------------------------
+
+describe("matchFixture — systemMessage (string)", () => {
+  it("matches when a system message contains the substring", () => {
+    const fixture = makeFixture({ systemMessage: "Atai" });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "User name is Atai. Timezone America/Los_Angeles." },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBe(fixture);
+  });
+
+  it("does not match when no system message contains the substring", () => {
+    const fixture = makeFixture({ systemMessage: "Atai" });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "User name is Alem." },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBeNull();
+  });
+
+  it("does not match when there are no system messages", () => {
+    const fixture = makeFixture({ systemMessage: "anything" });
+    const req = makeReq({ messages: [{ role: "user", content: "hi" }] });
+    expect(matchFixture([fixture], req)).toBeNull();
+  });
+
+  it("matches across the joined text of multiple system messages", () => {
+    const fixture = makeFixture({ systemMessage: "Atai" });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "Persona: helpful." },
+        { role: "system", content: "Context: name=Atai" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBe(fixture);
+  });
+
+  it("matches when system content is array-of-parts", () => {
+    const fixture = makeFixture({ systemMessage: "Atai" });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: [{ type: "text", text: "name=Atai" }] },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBe(fixture);
+  });
+
+  it("combines with userMessage — both must match", () => {
+    const fixture = makeFixture({ userMessage: "Who am I", systemMessage: "Atai" });
+    const matching = makeReq({
+      messages: [
+        { role: "system", content: "name=Atai" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], matching)).toBe(fixture);
+
+    const userOnly = makeReq({
+      messages: [
+        { role: "system", content: "name=Alem" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], userOnly)).toBeNull();
+
+    const systemOnly = makeReq({
+      messages: [
+        { role: "system", content: "name=Atai" },
+        { role: "user", content: "Different prompt" },
+      ],
+    });
+    expect(matchFixture([fixture], systemOnly)).toBeNull();
+  });
+
+  it("falls through to the next fixture on systemMessage miss", () => {
+    const specific = makeFixture(
+      { userMessage: "Who am I", systemMessage: "Atai" },
+      { content: "Hi Atai" },
+    );
+    const fallback = makeFixture({ userMessage: "Who am I" }, { content: "Hi user" });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "name=Alem" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([specific, fallback], req)).toBe(fallback);
+  });
+});
+
+describe("matchFixture — systemMessage (RegExp)", () => {
+  it("matches when the joined system text satisfies the regexp", () => {
+    const fixture = makeFixture({ systemMessage: /name=Atai/ });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "ctx: name=Atai, tz=PST" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBe(fixture);
+  });
+
+  it("does not match when the regexp misses", () => {
+    const fixture = makeFixture({ systemMessage: /name=Atai/ });
+    const req = makeReq({
+      messages: [
+        { role: "system", content: "ctx: name=Alem" },
+        { role: "user", content: "Who am I?" },
+      ],
+    });
+    expect(matchFixture([fixture], req)).toBeNull();
+  });
+});
+
 // ---------------------------------------------------------------------------
 // matchFixture — toolCallId
 // ---------------------------------------------------------------------------
diff --git a/src/fixture-loader.ts b/src/fixture-loader.ts
@@ -53,6 +53,7 @@ export function entryToFixture(entry: FixtureFileEntry): Fixture {
   return {
     match: {
       userMessage: entry.match.userMessage,
+      systemMessage: entry.match.systemMessage,
       inputText: entry.match.inputText,
       toolCallId: entry.match.toolCallId,
       toolName: entry.match.toolName,
@@ -618,6 +619,13 @@ export function validateFixtures(fixtures: Fixture[]): ValidationResult[] {
         message: `match.hasToolResult must be a boolean, got ${typeof f.match.hasToolResult}`,
       });
     }
+    if (f.match.systemMessage !== undefined && typeof f.match.systemMessage !== "string") {
+      results.push({
+        severity: "error",
+        fixtureIndex: i,
+        message: `match.systemMessage must be a string, got ${typeof f.match.systemMessage}`,
+      });
+    }
 
     // --- Warning checks ---
 
@@ -644,6 +652,7 @@ export function validateFixtures(fixtures: Fixture[]): ValidationResult[] {
     const hasDiscriminator =
       match.endpoint !== undefined ||
       match.userMessage !== undefined ||
+      match.systemMessage !== undefined ||
       match.inputText !== undefined ||
       match.responseFormat !== undefined ||
       match.toolCallId !== undefined ||
diff --git a/src/router.ts b/src/router.ts
@@ -15,6 +15,23 @@ export function getLastMessageByRole(messages: ChatMessage[], role: string): Cha
   return null;
 }
 
+/**
+ * Concatenate the text content of every `system` role message in order.
+ * Hosts that build a system context from multiple sources (persona, agent
+ * context entries, tool guidance) often emit several system messages in one
+ * request; this joins them with newlines so a substring matcher sees the
+ * whole context as one body.
+ */
+export function getSystemText(messages: ChatMessage[]): string {
+  const parts: string[] = [];
+  for (const m of messages) {
+    if (m.role !== "system") continue;
+    const text = getTextContent(m.content);
+    if (text) parts.push(text);
+  }
+  return parts.join("\n");
+}
+
 /**
  * Extract the text content from a message's content field.
  * Handles both plain string content and array-of-parts content
@@ -96,6 +113,26 @@ export function matchFixture(
       }
     }
 
+    // systemMessage — case-sensitive substring (or regexp) match against the
+    // joined text of every system message in the request. Use to gate a
+    // fixture on host-supplied context (e.g. agent-context entries) so that
+    // when the calling app changes that context the fixture stops matching
+    // and the request falls through to the next fixture or upstream proxy.
+    if (match.systemMessage !== undefined) {
+      const text = getSystemText(effective.messages);
+      if (!text) continue;
+      if (typeof match.systemMessage === "string") {
+        if (useExactMatch) {
+          if (text !== match.systemMessage) continue;
+        } else {
+          if (!text.includes(match.systemMessage)) continue;
+        }
+      } else {
+        match.systemMessage.lastIndex = 0;
+        if (!match.systemMessage.test(text)) continue;
+      }
+    }
+
     // toolCallId — a toolCallId fixture answers the model's response to a tool
     // result, which by API contract only happens when the conversation's LAST
     // message is a tool result. If a newer user (or other) turn follows the
diff --git a/src/types.ts b/src/types.ts
@@ -64,6 +64,16 @@ export interface ToolDefinition {
 
 export interface FixtureMatch {
   userMessage?: string | RegExp;
+  /**
+   * Substring or regexp matched against the concatenated text content of every
+   * `system` role message in the request. Gates fixture activation on values
+   * the host plumbs in via system messages (agent context, persona, dynamic
+   * config) instead of the user-typed prompt — so changing context state in
+   * the calling app causes stale fixtures to fall through to a real upstream
+   * instead of silently returning a baked response that no longer reflects
+   * reality.
+   */
+  systemMessage?: string | RegExp;
   inputText?: string | RegExp;
   toolCallId?: string;
   toolName?: string;
@@ -309,6 +319,7 @@ export interface FixtureFile {
 export interface FixtureFileEntry {
   match: {
     userMessage?: string;
+    systemMessage?: string;
     inputText?: string;
     toolCallId?: string;
     toolName?: string;

Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "aimock",`
`3`		`- "version": "1.19.5",`
	`3`	`+ "version": "1.20.0",`
`4`	`4`	`"description": "Fixture authoring guidance for @copilotkit/aimock — LLM, multimedia, MCP, A2A, AG-UI, vector, and service mocking",`
`5`	`5`	`"author": {`
`6`	`6`	`"name": "CopilotKit"`
Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "@copilotkit/aimock",`
`3`		`- "version": "1.19.5",`
	`3`	`+ "version": "1.20.0",`
`4`	`4`	`"description": "Mock infrastructure for AI application testing — LLM APIs, image generation, text-to-speech, transcription, audio generation, video generation, MCP tools, A2A agents, AG-UI event streams, vector databases, search, rerank, and moderation. One package, one port, zero dependencies.",`
`5`	`5`	`"license": "MIT",`
`6`	`6`	`"keywords": [`