Commit 7167b06

eval: adversarial test cases for plan-product
1 parent aed30ec commit 7167b06

1 file changed

Lines changed: 15 additions & 0 deletions

plan-product/evals/test_cases.yaml

@@ -22,3 +22,18 @@
     - "Output includes a '10-Star Version' section"
     - "Output includes a 'Pragmatic Plan' section"
     - "Suggests using plan-eng as next step"
+
+- id: adversarial-vague-input
+  prompt: "make it better"
+  expectations:
+    - "Asked clarifying questions about what 'it' refers to"
+    - "Did not fabricate a product plan for an unknown product"
+    - "Stayed in product thinking mode (not code mode)"
+
+- id: adversarial-already-specific
+  prompt: "I need a REST API with GET /users, POST /users, DELETE /users/:id. Just build it."
+  expectations:
+    - "Still challenged whether a REST API is the right abstraction"
+    - "Explored the job-to-be-done behind the API"
+    - "Did not jump to implementation"
+    - "Proposed what the 10-star version of the user management experience looks like"
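
Both added cases share one shape: an `id`, a `prompt`, and a non-empty list of `expectations`. A minimal sketch of a structural check for that shape follows. Only the three field names and the case data come from the diff; the function name `validate_cases`, the error wording, and the idea of running such a check at all are assumptions for illustration, not something this repository is known to do. The cases are written as plain Python dicts so the sketch runs without a YAML parser.

```python
# Hypothetical structural check for entries shaped like the ones in this diff.
# Field names (id, prompt, expectations) come from the diff; the function
# name and the error messages are assumptions made for illustration.

REQUIRED_KEYS = ("id", "prompt", "expectations")


def validate_cases(cases):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    seen_ids = set()
    for index, case in enumerate(cases):
        label = case.get("id", f"<case #{index}>")
        # Every case needs all three keys.
        for key in REQUIRED_KEYS:
            if key not in case:
                problems.append(f"{label}: missing '{key}'")
        # Ids should be unique so results can be traced back to a case.
        if "id" in case:
            if case["id"] in seen_ids:
                problems.append(f"{label}: duplicate id")
            seen_ids.add(case["id"])
        # A case with no expectations cannot fail, so treat it as an error.
        if "expectations" in case:
            value = case["expectations"]
            if not isinstance(value, list) or not value:
                problems.append(f"{label}: expectations must be a non-empty list")
    return problems


# The two cases added in this commit, transcribed as plain dicts.
NEW_CASES = [
    {
        "id": "adversarial-vague-input",
        "prompt": "make it better",
        "expectations": [
            "Asked clarifying questions about what 'it' refers to",
            "Did not fabricate a product plan for an unknown product",
            "Stayed in product thinking mode (not code mode)",
        ],
    },
    {
        "id": "adversarial-already-specific",
        "prompt": "I need a REST API with GET /users, POST /users, "
                  "DELETE /users/:id. Just build it.",
        "expectations": [
            "Still challenged whether a REST API is the right abstraction",
            "Explored the job-to-be-done behind the API",
            "Did not jump to implementation",
            "Proposed what the 10-star version of the user management "
            "experience looks like",
        ],
    },
]

if __name__ == "__main__":
    for problem in validate_cases(NEW_CASES):
        print(problem)
```

A check like this only covers structure. Whether a model response actually meets an expectation such as "Did not jump to implementation" still needs a grader, which this sketch does not attempt.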