Skip to content

Nightly 2026-04-20 — 2 cycles, +15 goal weight, 1 dream finding closed#136

Merged
boshu2 merged 2 commits intomainfrom
nightly/2026-04-20
Apr 26, 2026
Merged

Nightly 2026-04-20 — 2 cycles, +15 goal weight, 1 dream finding closed#136
boshu2 merged 2 commits intomainfrom
nightly/2026-04-20

Conversation

@boshu2
Copy link
Copy Markdown
Owner

@boshu2 boshu2 commented Apr 24, 2026

Nightly summary

Bounded autonomous nightly pass on nightly/2026-04-20 branched from origin/main @ 6bbc0307.

Baseline Final Delta
Pass / fail 14 / 5 17 / 2 +3 pass
Weighted fitness 80 / 109 95 / 109 +15 weight
Percent 73.4% 87.2% +13.8 pp

Anchor: previous nightly branch origin/nightly/2026-04-17-v2 @ 4269c665 (no nightly git tags existed yet — this run creates the first).

Per-cycle summary

Cycle 1 — 6103c2e2 fix(provenance): accept structured source frontmatter in learning audit

Dream's INGEST stage flagged .agents/learnings/2026-04-19-orchestrator-compression-anti-pattern.md every iteration with yaml: cannot unmarshal !!map into string, because learningFrontmatter.Source was typed as string while the file uses a map (source: {session, evidence}). Swapped Source for a flexibleString wrapper whose UnmarshalYAML accepts scalar, map, or sequence nodes; structured values decode to a non-empty sentinel so the "missing source" predicate keeps working. Added a regression test (TestProvenanceAudit_StructuredSourceAccepted) covering all three shapes.

Cycle 2 — 8778fa97 fix(codex): convert remaining Skill() invocations to $skill notation

Closed the low-severity council finding under 2026-04-19-rpi-dag-hardening in .agents/rpi/next-work.jsonl. Rewrote all 11 remaining Skill(skill="…", args="…") calls in skills-codex/{design,implement,red-team,validation}/SKILL.md to the canonical $<name> form ($vibe recent, $test coverage --quick, $post-mortem <epic-id>, etc.). Prose examples inside skills-codex/shared/references/strict-delegation-contract.md left as documentation of the Claude-native shape. Regenerated codex hashes (manifest + per-skill markers); bash scripts/audit-codex-parity.sh and bash scripts/lint-codex-native.sh pass.

Findings

Finding Status
Dream provenance-audit YAML parse error on structured source: closed (cycle 1)
2026-04-19-rpi-dag-hardening: remaining Skill() notation in codex closed (cycle 2)
2026-04-19-rpi-dag-hardening: "Decompose skills/crank/SKILL.md under 248 lines" superseded — tests/skills/lint-skills.sh no longer enforces line count when references/ is present (crank has one)

No new findings opened, no auto-reverts, no quarantined goals.

Goal results

Still failing (both out of nightly scope):

  • [8] flywheel-compounding — σρ > δ not met in the measurement window; this is a knowledge-lifecycle metric that needs corpus activity, not a code fix.
  • [6] go-complexity-ceilinggocyclo binary not installed in this runtime environment; no code regression.

Moved from fail → pass:

  • [7] flywheel-proof — baseline failure was transient: sum.golang.org 503 DNS cache overflow during proof-run toolchain download.
  • [4] compile-freshness — baseline lacked .agents/defrag/latest.json; generated by a dry-run ao defrag during cycle selection (runtime state, not a committed artifact; nightly CI runs ao defrag before measuring, so this also reflects correct CI flow).
  • [4] compile-no-oscillation — same root cause as compile-freshness.

Final validation bundle

Check Result
ao autodev validate --file PROGRAM.md --json valid: true
go build ./... / go vet ./... clean
go test ./cmd/ao ./internal/autodev ./internal/provenance ./internal/overnight all pass
skills/heal-skill/scripts/heal.sh --strict All clean. No findings.
tests/docs/validate-links.sh 1528 links, 0 broken
scripts/audit-codex-parity.sh --skill {design,validation,implement,red-team} pass
scripts/lint-codex-native.sh 0 errors, 0 warnings
scripts/pre-push-gate.sh --fast 1 FAIL, 32 skipped — the sole failure is worktree-disposition, which the nightly pipeline documents as a known false positive on a nightly/* branch (the script expects the canonical root on main). All other checks pass.

Commit links

Anchor for tomorrow's audit

This branch will be tagged nightly/2026-04-20 after the PR is opened so the next nightly has a concrete fitness anchor to diff against.


Generated by Claude Code

claude added 2 commits April 24, 2026 17:49
The provenance audit's learningFrontmatter decoded source as a bare
string and failed with "cannot unmarshal !!map into string" whenever a
learning used a structured value — for example, the 2026-04-19
orchestrator-compression learning set source to a session+evidence map.
Dream's INGEST stage then listed the file under degraded every run and
never counted its source toward MissingSources.

Switch Source to a flexibleString wrapper whose UnmarshalYAML accepts
scalar, map, or sequence nodes. Structured values decode to a
non-empty sentinel so the existing "missing source" predicate still
works; scalar values are preserved verbatim.

Surfaced during the 2026-04-20 nightly dream run.
Codex mirror SKILL.md files still carried Claude-native
Skill(skill="...", args="...") calls in their executable DAG blocks,
which codex-skill-api does not recognize as delegated sub-skill
invocations. Rewrite all 11 occurrences in design, implement, red-team,
and validation to the canonical $<name> form (e.g. $vibe recent, $test
coverage --quick, $post-mortem <epic-id>). Prose occurrences inside
shared/references/strict-delegation-contract.md are left intact: they
document the Claude-native shape.

Regenerate skills-codex hashes so the manifest and per-skill generated
markers match the new content; parity audit and codex-native lint pass.

Addresses the low-severity council finding tracked under the
2026-04-19-rpi-dag-hardening epic in .agents/rpi/next-work.jsonl.
@boshu2 boshu2 added the nightly label Apr 24, 2026 — with Claude
@boshu2 boshu2 merged commit 66524af into main Apr 26, 2026
@boshu2 boshu2 deleted the nightly/2026-04-20 branch April 27, 2026 01:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants