You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: clients/python/README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -140,7 +140,7 @@ All errors carry `.code` and `.details` (the structured payload from the server)
140
140
141
141
## Versioning
142
142
143
-
This package is **version-locked** to the npm package. `tangle-agent-eval==0.18.0` ↔ `@tangle-network/agent-eval@0.18.0`. The two ship from the same git tag in the same CI workflow; if either fails to publish, neither does. Mismatched versions are a build-time error.
143
+
This package is **version-locked** to the npm package. `tangle-agent-eval==0.19.0` ↔ `@tangle-network/agent-eval@0.19.0`. The two ship from the same git tag in the same CI workflow; if either fails to publish, neither does. Mismatched versions are a build-time error.
144
144
145
145
`wire_version` is separate. It bumps only on breaking schema changes. Package versions can differ across releases as long as `wire_version` is the same.
Copy file name to clipboardExpand all lines: docs/feature-guide.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -33,7 +33,7 @@ trying, and whether a change made them better or worse.
33
33
| “Human feedback should become reusable eval data.” |`FeedbackTrajectory`| Captures approvals, rejections, edits, choices, metrics, and policy blocks. |
34
34
| “Can this action run, or does it need approval?” |`evaluateActionPolicy`| Generic preflight for side effects, budgets, and required evidence. |
35
35
| “I need train/dev/test/holdout examples.” |`Dataset` plus feedback trajectory conversion | Stable splits and contamination control. |
36
-
| “Which prompt or signature wins?” |`PromptOptimizer`, `OptimizationLoop`, steering optimizers | Runs variants on scenarios and compares scores. |
36
+
| “Which prompt or signature wins?” |`runMultiShotOptimization`, steering optimizers | Runs variants on scenarios and compares scores. |
37
37
| “Improve a multi-turn agent over real task traces.” |`runMultiShotOptimization`| GEPA-style trajectory optimization with ASI and held-out promotion. |
38
38
| “Improve prompts, then code if prompts plateau.” |`runPromptEvolution`, composite mutator, code mutator | Bounded evolution with telemetry and lineage. |
39
39
| “Find why a regression happened.” | bisector, traces, run records | Narrows changes and preserves evidence. |
@@ -156,7 +156,7 @@ Store as `FeedbackTrajectory`, then derive:
156
156
| Feedback data |`FeedbackTrajectory`, stores, converters | Human/environment labels | Domain adapters live in downstream repos. |
157
157
| Action policy |`evaluateActionPolicy`| Approval/budget preflight | Blocks or labels actions before `act()`. |
0 commit comments