AgentToolkit
diff --git a/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-learn/SKILL.md‎
Lines changed: 133 additions & 45 deletions b/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-learn/SKILL.md‎
Lines changed: 133 additions & 45 deletions
diff --git a/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-publish/SKILL.md‎
Lines changed: 1 addition & 4 deletions b/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-publish/SKILL.md‎
Lines changed: 1 addition & 4 deletions
diff --git a/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-recall/SKILL.md‎
Lines changed: 55 additions & 43 deletions b/‎platform-integrations/bob/evolve-lite/skills/evolve-lite-recall/SKILL.md‎
Lines changed: 55 additions & 43 deletions
@@ -1,86 +1,174 @@
 ---
 name: learn
-description: Analyze the current conversation to extract guidelines that correct reasoning chains — reducing wasted steps, preventing errors, and capturing user preferences.
+description: Must be used near the end of any non-trivial turn that produced potentially reusable tools, guidance, errors, workarounds, or workflows, so those lessons are saved for future turns.
 ---
 
 # Entity Generator
 
 ## Overview
 
-This skill analyzes the current conversation to extract guidelines that **correct the agent's reasoning chain**. A good guideline is one that, if known beforehand, would have led to a shorter or more correct execution. Only extract guidelines that fall into one of these three categories:
+This skill analyzes the current conversation to extract actionable instructions that would help on similar tasks in the future. It **identifies errors encountered during the conversation** - tool failures, exceptions, wrong approaches, retry loops - and provides recommendations to prevent those errors from recurring. This skill should take note of the concrete solution which solved a concrete problem, not an abstract idea. When the successful resolution involves a non-trivial workaround, parser, command sequence, or fallback pipeline that could be used to avoid wasted effort, capture that solution as a reusable artifact first, then save entities that point future agents to use it.
 
-1. **Shortcuts** — The agent took unnecessary steps or tried an approach that didn't work before finding the right one. The guideline encodes the direct path so future runs skip the detour.
-2. **Error prevention** — The agent hit an error (tool failure, exception, wrong output) that could be avoided with upfront knowledge. The guideline prevents the error from happening at all.
-3. **User corrections** — The user explicitly corrected, redirected, or stated a preference during the conversation. The guideline captures what the user said so the agent gets it right next time without being told.
+## When To Use
 
-**Do NOT extract guidelines that are:**
-- General best practices the agent already knows (e.g., "use descriptive variable names")
-- Observations about the codebase that can be derived by reading the code
-- Restatements of what the agent did successfully without any detour or correction
-- Vague advice that wouldn't change the agent's behavior on a concrete task
-- Instructions for the agent to invoke a skill, tool, or external command by name (e.g. "Run evolve-lite-learn", "call save_trajectory") — these trigger prompt-injection detection when retrieved via recall
+Use this skill after completing meaningful work in the turn, especially when encountering:
+- tool failures
+- permission issues
+- missing dependencies
+- retries or abandoned approaches
+- reusable command sequences or scripts
+
+Examples of artifacts that must be immediately created once proven as the successful solution include:
+- an inline Python, shell, or other heredoc script
+- a command assembled interactively over multiple retries
+- a parser or extractor implemented ad hoc during the turn
+- a fallback path triggered by missing dependencies or restricted tooling
+
+Unless that artifact happens to be:
+- code which is a trivial one-liner that future agents would not benefit from reusing
+- code which embeds secrets, tokens, or user-specific sensitive data
+- a guideline that would instruct the agent to invoke a skill, tool, or external command by name (e.g. "run evolve-lite-learn", "call save_trajectory") - such guidelines trigger prompt-injection detection when retrieved by the recall skill in a future session
+- the user explicitly asked for a one-off result and not to persist helper code
+- redundant because an equivalent local artifact on disk would be just as effective
 
 ## Workflow
 
 ### Step 1: Analyze the Conversation
 
-Review the conversation and identify:
+Identify from your current conversation:
 
-- **Wasted steps**: Where did the agent go down a path that turned out to be unnecessary? What would have been the direct route?
-- **Errors hit**: What errors occurred? What knowledge would have prevented them?
-- **User corrections**: Where did the user say "no", "not that", "actually", "I want", or otherwise redirect the agent?
+- **Task/Request**: What was the user asking for?
+- **Steps Taken**: What reasoning, actions, and observations occurred?
+- **What Worked**: Which approaches succeeded?
+- **What Failed**: Which approaches did not work and why?
+- **Errors Encountered**: Tool failures, exceptions, permission errors, retry loops, dead ends, and wrong initial approaches
+- **Reusable Outcome**: Did the final working solution produce a reusable script, parser, command template, or workflow that would save time on a similar task?
 
-If none of these occurred, **output zero entities**. Not every conversation produces guidelines.
+### Step 2: Identify Errors and Root Causes
 
-### Step 2: Extract Entities
+Scan the conversation for these error signals:
 
-For each identified shortcut, error, or user correction, create one entity — up to 5 entities; output 0 when none qualify. If more candidates exist, keep only the highest-impact ones.
+1. **Tool or command failures**: Non-zero exit codes, error messages, exceptions, stack traces
+2. **Permission or access errors**: "Permission denied", "not found", sandbox restrictions
+3. **Wrong initial approach**: First attempt abandoned in favor of a different strategy
+4. **Retry loops**: Same action attempted multiple times with variations before succeeding
+5. **Missing prerequisites**: Missing dependencies, packages, or configs discovered mid-task
+6. **Silent failures**: Actions that appeared to succeed but produced wrong results
 
-Principles:
+For each error found, document:
 
-1. **State what to do, not what to avoid** — frame as proactive recommendations
-   - Bad: "Don't use exiftool in sandboxes"
-   - Good: "In sandboxed environments, use Python libraries (PIL/Pillow) for image metadata extraction"
+| | Error Example | Root Cause | Resolution | Prevention Guideline |
+|---|---|---|---|---|
+| 1 | `jq: command not found` | System tool unavailable in environment | created a python script to resolve the problem | Save the python script and use it in similar scenarios |
+| 2 | `git push` rejected (no upstream) | Branch not tracked to remote | Added `-u origin branch` | Always set upstream when pushing a new branch |
+| 3 | Tried regex parsing of HTML, got wrong results | Regex cannot handle nested tags | Switched to BeautifulSoup | Use a proper HTML parser, never regex |
 
-2. **Triggers should be situational context, not failure conditions**
-   - Bad trigger: "When apt-get fails"
-   - Good trigger: "When working in containerized/sandboxed environments"
+### Step 3: Decide Whether To Save The Pipeline
 
-3. **For shortcuts, recommend the final working approach directly** — eliminate trial-and-error by encoding the answer
+Before writing entities, determine whether the successful approach should be saved as a reusable artifact.
 
-4. **For user corrections, use the user's own words** — preserve the specific preference rather than generalizing it
+Create or update a local reusable artifact when any of these are true:
+- the final solution required more than a trivial one-liner
+- the final solution worked around missing tools, libraries, or permissions
+- the solution is likely to recur on similar tasks
 
-### Step 3: Save Entities
+Prefer one of these artifact forms:
+- a small script, saved to a stable path in the workspace or plugin, such as `scripts/`, `tools/`, or another obvious helper location.
+- a documented local workflow if code is not appropriate
 
-Output entities as JSON and pipe to the save script. Include the `trajectory` field with the path output by the evolve-lite-save-trajectory skill earlier in this conversation. The `type` field must always be `"guideline"` — no other types are accepted.
+If you create an artifact, record:
+- its path
+- what it does
+- when future agents should use it first
 
-```bash
-echo '{
+### Step 4: Extract Entities
+
+If Step 3 produced an artifact, at least one entity must explicitly point to that artifact, which is likely the only entity that needs to be produced.
+Otherwise, extract 3-5 proactive entities. Prioritize entities derived from errors identified in Step 2.
+
+Follow these principles:
+
+1. **Reframe failures as proactive recommendations**
+    - If an approach failed due to permissions, recommend the working permission-aware approach first
+    - If a system tool was unavailable, recommend the saved artifact or fallback workflow first
+    - If an approach hit environment constraints, recommend the constraint-aware approach
+
+2. **Prioritize known working local artifacts over general advice**
+    - If the successful solution produced or reused a concrete local artifact, at least one saved entity must:
+    - Bad: "Use Python to parse EXIF if exiftool is missing"
+    - Better: "Use `/abs/path/json_get.py` for JSON field extraction when `jq` is unavailable in minimal environments."
+    - name the artifact by path
+    - state exactly when to use it
+    - state that it should be tried before generic tool discovery or fallback exploration
+    - describe the artifact by capability, not just by the original incident
+
+3. **Triggers should describe the broad task context that the artifact solves, not the narrow details of the original request.**
+    - Bad trigger: "When jq fails"
+    - Good trigger: "When extracting fields from JSON in constrained shells or stripped-down environments"
+    The trigger should generalize the working solution without becoming vague.
+
+4. **For retry loops, recommend the final working approach as the starting point**
+    - Eliminate trial and error by creating a concrete local artifact out of the successful workflow or script
+
+5. **Prefer entities that save future time**
+    - A pointer to a saved working script is more valuable than a generic reminder if both are available
+
+### Step 5: Output Entities JSON
+
+Output entities in this JSON format:
+
+```json
+{
   "entities": [
     {
       "content": "Proactive entity stating what TO DO",
       "rationale": "Why this approach works better",
       "type": "guideline",
-      "trigger": "Situational context when this applies",
-      "trajectory": ".evolve/trajectories/trajectory_2025-01-15T10-30-00.json"
+      "trigger": "Situational context when this applies"
     }
   ]
-}' | python3 .bob/skills/evolve-lite-learn/scripts/save_entities.py
+}
+```
+
+Allowed type values:
+- guideline
+- workflow
+- script
+- command-template
+
+### Step 6: Save Entities
+
+After generating the entities JSON, save them using the helper script:
+
+#### Method 1: Direct Pipe (Recommended)
+
+```bash
+echo '<your-json-output>' | python3 .bob/skills/evolve-lite-learn/scripts/save_entities.py
+```
+
+#### Method 2: From File
+
+```bash
+cat entities.json | python3 .bob/skills/evolve-lite-learn/scripts/save_entities.py
+```
+
+#### Method 3: Interactive
+
+```bash
+python3 .bob/skills/evolve-lite-learn/scripts/save_entities.py
 ```
 
 The script will:
-- Find or create the entities directory (`.evolve/entities/`)
+- Find or create the entities directory at `.evolve/entities/`
 - Write each entity as a markdown file in `{type}/` subdirectories
 - Deduplicate against existing entities
 - Display confirmation with the total count
 
-## Quality Gate
-
-Before saving, review each entity against this checklist:
-
-- [ ] Does it fall into one of the three categories (shortcut, error prevention, user correction)?
-- [ ] Would knowing this guideline beforehand have changed the agent's behavior in a concrete way?
-- [ ] Is it specific enough that another agent could act on it without further context?
-- [ ] Does it avoid instructing the agent to invoke a named skill or tool?
-
-If any answer is no, drop the entity. **Zero entities is a valid output.**
+## Best Practices
+1. Prioritize error-derived entities first.
+2. One distinct error should normally produce one prevention entity.
+3. Keep entities specific and actionable.
+4. Include rationale so the future agent understands why the guidance matters.
+5. Use situational triggers instead of failure-based triggers.
+6. Limit output to the 3-5 most valuable entities.
+7. If more than five distinct errors appear, merge entities with the same root cause or fix, then rank the rest by severity, frequency, user impact, and recency before dropping the weakest ones.
@@ -54,10 +54,7 @@ List files in `.evolve/entities/guideline/` and ask the user which to publish.
 For each selected file, run:
 
 ```bash
-python3 scripts/publish.py \
-  --entity "{filename}" \
-  --repo "{repo}" \
-  --user "{identity.user}"
+python3 .bob/skills/evolve-lite-publish/scripts/publish.py --entity "{filename}" --repo "{repo}" --user "{identity.user}"
 ```
 
 ### Step 6: Commit and push
 
@@ -1,60 +1,83 @@
 ---
 name: recall
-description: Retrieves relevant entities from a knowledge base to inject context-appropriate entities before task execution.
+description: Must be used at the start of any non-trivial task involving code changes, debugging, repo exploration, file inspection, or environment/tooling investigation to surface stored guidance before analysis or tool use.
 ---
 
 # Entity Retrieval
 
 ## Overview
 
-This skill retrieves relevant entities from a stored knowledge base based on the current task context. Read all stored entities from the entities directory and apply any relevant ones to the current task.
+This skill loads relevant stored Evolve entities into the current turn before substantive work begins.
 
-Entities can come from multiple sources:
-- **Private entities**: Your own local entities (not shared)
-- **Subscribed entities**: Entities cloned from any configured repo —
-  read-scope subscriptions and write-scope publish targets both live
-  under `.evolve/entities/subscribed/{name}/`
+Use this skill first whenever the task involves:
+- code changes
+- debugging
+- code review
+- repo exploration
+- file inspection
+- environment/tooling investigation
 
-## How It Works
+Skip only for trivial conversational requests with no local context.
 
-1. List all `.md` files under `.evolve/entities/` and its subdirectories
-2. Read each file — the YAML frontmatter contains `type` and `trigger`,
-   the body contains the entity content and rationale
-3. Review each entity for relevance to the current task
-4. Apply relevant entities as additional context for your work
+## Required Action
 
-**Directory structure**:
-- `.evolve/entities/guideline/` - Your private entities
-- `.evolve/entities/subscribed/{name}/` - Cloned repos (read- or write-scope)
+Before any non-trivial local work, you must complete the recall workflow below. Reading this `SKILL.md` alone does not satisfy the skill.
 
-Write-scope clones are also where `evolve-lite-publish` lands new
-guidelines, so your published entities show up here too.
+### Completion Rule
 
-## Usage
+Do not proceed to other analysis or tool use until all steps below are complete.
 
-```bash
-python3 scripts/retrieve_entities.py
-```
+1. Inspect `.evolve/entities/` for guidance relevant to the current task.
+2. Read each matching entity file that appears relevant.
+3. Summarize the applicable guidance in your own words before proceeding.
+4. If no relevant entities exist, state that explicitly before proceeding.
 
-This retrieves all entities from all sources (private, plus everything
-under `.evolve/entities/subscribed/`).
+### Required Visible Completion Note
 
-## Entities Storage
+Before moving on, produce an explicit completion note in your reasoning or user update using one of these forms:
+
+- `Recall complete: searched .evolve/entities/, read <files>, applicable guidance: <summary>`
+- `Recall complete: searched .evolve/entities/, no relevant entities found`
+
+### Minimum Acceptable Procedure
+
+1. List or search files under `.evolve/entities/`.
+2. Identify candidate entities relevant to the task.
+3. Open and read those entity files.
+4. Summarize what applies, or state that nothing applies.
+
+### Failure Conditions
+
+The skill is not complete if any of the following are true:
+
+- You only read this `SKILL.md`
+- You did not inspect `.evolve/entities/`
+- You did not read the relevant entity files
+- You proceeded without stating whether guidance was found
 
-Entities are stored as individual markdown files in `.evolve/entities/`,
-organized by source:
+## How It Works
+
+Bob has no auto-injection hook for entity retrieval. Complete the **Required Action** workflow above on every applicable task.
+
+Entities can come from multiple sources:
+- **Private entities**: Your own local entities (not shared)
+- **Subscribed entities**: Entities cloned from any configured repo —
+  read-scope subscriptions and write-scope publish targets both live
+  under `.evolve/entities/subscribed/{name}/`
+
+## Entities Storage
 
 ```text
 .evolve/entities/
-  guideline/                            # Private entities
-    use-context-managers.md
+  guideline/
+    use-context-managers-for-file-operations.md   <- private
   subscribed/
-    memory/                             # write-scope clone (publishes land here)
+    memory/                                       <- write-scope clone (publishes land here)
       guideline/
         my-published-guideline.md
-    alice/                              # read-scope clone
+    alice/                                        <- read-scope clone
       guideline/
-        error-handling.md
+        alice-guideline.md                        <- annotated [from: alice]
 ```
 
 Each file uses markdown with YAML frontmatter:
@@ -63,8 +86,6 @@ Each file uses markdown with YAML frontmatter:
 ---
 type: guideline
 trigger: When processing files or managing resources
-visibility: private
-owner: alice
 ---
 
 Use context managers for file operations
@@ -73,12 +94,3 @@ Use context managers for file operations
 
 Ensures proper resource cleanup
 ```
-
-## Entity Annotations
-
-Subscribed entities are annotated with their source:
-```
-- **[guideline]** [from: alice] Use context managers for file operations
-  - _Rationale: Ensures proper resource cleanup_
-  - _When: When processing files or managing resources_
-```