libs/agent-runtime/src/prompts/__snapshots__/snapshot.test.ts.snap
Lines changed: 15 additions & 30 deletions
@@ -1142,12 +1142,6 @@ exports[`prompt snapshots (assembler refactor pin) > run_eval — baseline (no c
 You are running an evaluation scenario as variant \`baseline\`.
 Task id: \`t-1\`
 
-## Execution mode
-
-Mode: \`vitro\`
-Workspace: \`none\`
-You are running in a scratch workspace with no repository checkout mounted. Do not assume git history or repo files are present unless the scenario provided them explicitly.
 You are running an evaluation scenario as variant \`with-pack\`.
 Task id: \`t-1\`
 
-## Correlation
-
-This task carries correlationId \`corr-abc\`. It joins
-this variant to its sibling \`run_eval\` tasks (other variants of the
-same scenario and to any later \`judge_eval_attempt\` tasks created
-against those variants. You do not need to act on it directly — it
-is recorded for cross-variant aggregation at query time.
-
-## Execution mode
-
-Mode: \`vivo\`
-Workspace: \`dedicated_worktree\`
-You are running in a dedicated disposable git worktree isolated from the daemon shared checkout.
-
-## Injected context discipline
-
-This task includes extra injected context from the task creator.
-You MUST inspect and use that context BEFORE you write solution
-files or draft your final answer.
-Do not solve first and only review the context afterward.
-For \`context_inline\`, your FIRST content-inspection step should be a \`read\` of \`/workspace/context-pack.md\` before your first \`write\` call. The same content is also mirrored in \`/workspace/AGENTS.md\` and may be referenced from \`/workspace/.claude/CLAUDE.md\`.
-If \`/workspace/context-pack.md\` exists and you skip reading it before writing solution files, you are not following the task instructions.
-If the injected context contains repo- or workflow-specific rules,
-those rules override your generic instincts.
+## Injected Task Context
+
+This task includes Injected Task Context supplied by the task
+creator. You MUST inspect it BEFORE you write solution files or
+draft your final answer — not after.
+
+Reconcile every constraint from that context **into the code path
+itself**: function bodies, control flow, transaction boundaries,
+guard clauses. Quoting a constraint back in a comment, a
+\`// note:\` line, the task summary, or the \`verification\` field is
+NOT following the task. If the constraint affects behavior, it
+must affect behavior.
+For \`context_inline\`, your FIRST content-inspection step is a \`read\` of \`/workspace/context-pack.md\` before your first \`write\` call. The same content is also mirrored in \`/workspace/AGENTS.md\` and may be referenced from \`/workspace/.claude/CLAUDE.md\`.
+If the Injected Task Context contains repo- or workflow-specific
+rules, those rules override your generic instincts.
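The distinction the new prompt text draws, between quoting a constraint and enforcing it, can be illustrated with a minimal sketch. Everything here is hypothetical and not part of the PR: the `Transfer` type, both function names, and the example constraint ("refuse transfers with a non-positive amount") are assumptions chosen only for illustration.

```typescript
// Hypothetical injected constraint: "Refuse transfers with a non-positive amount."
interface Transfer {
  from: string;
  to: string;
  amount: number;
}

// Comment-only acknowledgement: the constraint is quoted back, but behavior is unchanged.
function applyTransferCommentOnly(balances: Map<string, number>, t: Transfer): void {
  // note: amounts must be positive (per injected context)
  balances.set(t.from, (balances.get(t.from) || 0) - t.amount);
  balances.set(t.to, (balances.get(t.to) || 0) + t.amount);
}

// Constraint reconciled into the code path: a guard clause rejects bad input
// before any balance is touched.
function applyTransferGuarded(balances: Map<string, number>, t: Transfer): void {
  if (!(t.amount > 0)) {
    throw new RangeError('transfer amount must be positive');
  }
  balances.set(t.from, (balances.get(t.from) || 0) - t.amount);
  balances.set(t.to, (balances.get(t.to) || 0) + t.amount);
}
```

Under this reading, only the second function would count as following the task: the first mentions the rule, yet a negative amount still moves money in the wrong direction.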
@@ -49,40 +63,23 @@ export function buildRunEvalUserPrompt(
     `You are running an evaluation scenario as variant \`${variantLabel}\`.\n` +
     `Task id: \`${ctx.taskId}\``;
 
-  const correlation = ctx.correlationId
-    ? [
-        `This task carries correlationId \`${ctx.correlationId}\`. It joins`,
-        'this variant to its sibling `run_eval` tasks (other variants of the',
-        'same scenario and to any later `judge_eval_attempt` tasks created',
-        'against those variants. You do not need to act on it directly — it',
-        'is recorded for cross-variant aggregation at query time.',
-      ].join('\n')
-    : '';
-
-  const executionMode = [
-    `Mode: \`${execution.mode}\``,
-    `Workspace: \`${execution.workspace}\``,
-    execution.workspace === 'none'
-      ? 'You are running in a scratch workspace with no repository checkout mounted. Do not assume git history or repo files are present unless the scenario provided them explicitly.'
-      : execution.workspace === 'shared_mount'
-        ? 'You are running against the daemon shared mount. Treat any repository mutations as affecting the mounted checkout directly.'
-        : 'You are running in a dedicated disposable git worktree isolated from the daemon shared checkout.',
-  ].join('\n');
-
   const contextDiscipline = hasContext
     ? [
-        'This task includes extra injected context from the task creator.',
-        'You MUST inspect and use that context BEFORE you write solution',
-        'files or draft your final answer.',
-        'Do not solve first and only review the context afterward.',
-        hasInlineContext
-          ? 'For `context_inline`, your FIRST content-inspection step should be a `read` of `/workspace/context-pack.md` before your first `write` call. The same content is also mirrored in `/workspace/AGENTS.md` and may be referenced from `/workspace/.claude/CLAUDE.md`.'
-          : 'If injected context was provided as a skill, inspect that task-injected context before solving.',
+        'This task includes Injected Task Context supplied by the task',
+        'creator. You MUST inspect it BEFORE you write solution files or',
+        'draft your final answer — not after.',
+        '',
+        'Reconcile every constraint from that context **into the code path',
+        'itself**: function bodies, control flow, transaction boundaries,',
+        'guard clauses. Quoting a constraint back in a comment, a',
+        '`// note:` line, the task summary, or the `verification` field is',
+        'NOT following the task. If the constraint affects behavior, it',
+        'must affect behavior.',
         hasInlineContext
-          ? 'If `/workspace/context-pack.md` exists and you skip reading it before writing solution files, you are not following the task instructions.'
-          : 'Do not rely on memory alone when task-injected context is available; inspect it first.',
-        'If the injected context contains repo- or workflow-specific rules,',
-        'those rules override your generic instincts.',
+          ? 'For `context_inline`, your FIRST content-inspection step is a `read` of `/workspace/context-pack.md` before your first `write` call. The same content is also mirrored in `/workspace/AGENTS.md` and may be referenced from `/workspace/.claude/CLAUDE.md`.'
+          : 'When the context is delivered as a skill, inspect it before solving.',
+        'If the Injected Task Context contains repo- or workflow-specific',
+        'rules, those rules override your generic instincts.',
       ].join('\n')
     : '';
 
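As a reading aid, the branching that remains after this hunk can be sketched as a standalone function. This is a minimal sketch under stated assumptions, not the PR's code: the name `buildContextDiscipline` and the `preamble` parameter are hypothetical, and the diff builds the string inline inside `buildRunEvalUserPrompt`. The fixed opening lines are passed in only to keep the sketch short. The delivery-specific sentence and the closing override rule are copied from the added lines.

```typescript
// Hypothetical helper name; the PR assembles this string inline, not in a separate function.
function buildContextDiscipline(
  preamble: readonly string[], // fixed opening lines, passed in to keep the sketch short
  hasContext: boolean,
  hasInlineContext: boolean,
): string {
  // No injected context: the section is omitted entirely.
  if (!hasContext) return '';
  return [
    ...preamble,
    // After the change there is a single ternary: one sentence per delivery mode.
    hasInlineContext
      ? 'For `context_inline`, your FIRST content-inspection step is a `read` of `/workspace/context-pack.md` before your first `write` call. The same content is also mirrored in `/workspace/AGENTS.md` and may be referenced from `/workspace/.claude/CLAUDE.md`.'
      : 'When the context is delivered as a skill, inspect it before solving.',
    'If the Injected Task Context contains repo- or workflow-specific',
    'rules, those rules override your generic instincts.',
  ].join('\n');
}
```

Compared with the removed version, the second `hasInlineContext` ternary is gone, so each delivery mode now contributes exactly one mode-specific sentence.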
@@ -117,22 +114,10 @@ export function buildRunEvalUserPrompt(