.github/workflows/agent-persona-explorer.md
10 additions & 4 deletions
@@ -73,7 +73,7 @@ Store all scenarios in cache memory.
## Phase 3: Test Agent Responses (15 minutes)
-**Token Budget Optimization**: Test a **representative subset of 6-8 scenarios** (not all scenarios) to reduce token consumption while maintaining quality insights.
+**Token Budget Optimization**: Test a **representative subset of 3-4 scenarios** (not all scenarios) to reduce token consumption and ensure budget remains for Phase 5 publishing.
For each selected scenario, invoke the "agentic-workflows" custom agent tool and:
@@ -99,6 +99,7 @@ For each selected scenario, invoke the "agentic-workflows" custom agent tool and
- You are ONLY testing the agent's responses, NOT creating actual workflows
- **Keep responses focused and concise** - summarize findings instead of verbose descriptions
- Aim for quality over quantity - fewer well-analyzed scenarios are better than many shallow ones
+- **If any tool call fails, record the error briefly and move on to the next scenario** - do NOT retry or get stuck
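
The record-and-move-on rule above can be sketched as a simple loop. This is a minimal illustration in Python, not part of the workflow itself: `run_scenarios`, `invoke_agent`, and `max_scenarios` are hypothetical names, the slice stands in for whatever selection picks a representative subset, and the 200-character error cap is an arbitrary choice.

```python
from typing import Callable


def run_scenarios(
    scenarios: list[str],
    invoke_agent: Callable[[str], str],  # hypothetical stand-in for the agent tool call
    max_scenarios: int = 4,  # upper end of the "3-4 scenarios" budget
) -> list[dict]:
    """Test a capped subset; record failures briefly and never retry."""
    results = []
    # Slicing is a placeholder for choosing a representative subset.
    for scenario in scenarios[:max_scenarios]:
        try:
            response = invoke_agent(scenario)
            results.append({"scenario": scenario, "ok": True, "response": response})
        except Exception as exc:
            # Any tool failure: keep a short note and continue with the next scenario.
            results.append({"scenario": scenario, "ok": False, "error": str(exc)[:200]})
    return results
```

Because a failed scenario still produces an entry, later phases can report partial results instead of stopping at the first error.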
## Phase 4: Analyze Results (4 minutes)
@@ -124,7 +125,9 @@ Review all captured responses and identify:
## Phase 5: Document and Publish Findings (1 minute)
-Create a GitHub discussion with a **concise** summary report. Use the `create discussion` safe-output to publish your findings.
+**MANDATORY OUTPUT**: Regardless of how many phases completed successfully, you MUST call either the `create discussion` or the `noop` safe-output tool before finishing. Failing to call a safe-output tool is the most common cause of workflow failures.
+
+Create a GitHub discussion with a **concise** summary report. Use the `create discussion` safe-output to publish your findings. Even if only 1-2 scenarios were tested, create the discussion with partial results.
**Discussion title**: "Agent Persona Exploration - [DATE]" (e.g., "Agent Persona Exploration - 2024-01-16")
@@ -221,15 +224,18 @@ Example:
## Success Criteria
Your effectiveness is measured by:
+- **Safe output**: ALWAYS call either `create discussion` or `noop` - this is the most critical requirement
Execute all phases systematically and maintain an objective, research-focused approach to understanding the agentic-workflows custom agent's capabilities and limitations.
-**Important**: If no action is needed after completing your analysis, you **MUST** call the `noop` safe-output tool with a brief explanation. Failing to call any safe-output tool is the most common cause of safe-output workflow failures.
+**CRITICAL**: You MUST call a safe-output tool before finishing. Choose one:
+1. Call `create discussion` to publish findings (preferred - even partial results are valuable)
+2. Call `noop` if you were completely unable to gather any data
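
The choice above reduces to a single branch that always yields exactly one safe-output call. A minimal sketch in Python, assuming a hypothetical helper `choose_safe_output` and a hypothetical `tested_count` input. The `noop` payload mirrors the JSON shape this file already uses; the `create_discussion` key and its `title` and `body` fields are illustrative assumptions, not the tool's confirmed schema.

```python
import json


def choose_safe_output(tested_count: int, summary: str, date: str) -> dict:
    """Always return exactly one safe-output payload, never none."""
    if tested_count > 0:
        # Preferred path: publish even partial results.
        # NOTE: the key name and fields here are assumptions for illustration.
        return {
            "create_discussion": {
                "title": f"Agent Persona Exploration - {date}",
                "body": summary,
            }
        }
    # Fallback: no data could be gathered at all.
    return {"noop": {"message": f"No action needed: {summary}"}}


print(json.dumps(choose_safe_output(0, "no scenarios could be tested", "2024-01-16")))
```

Making the function total (every input maps to some payload) is what rules out the failure mode of finishing without any safe-output call.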
```json
{"noop": {"message": "No action needed: [brief explanation of what was analyzed and why]"}}