[audit-workflows] Daily Audit — 2026-05-29: cache-memory git infra bug (2 failures) + 9-day codex routing issue RESOLVED #35800
Replies: 2 comments
Warning
The 💥 KA-POW! 🦸 The Smoke Test Agent BLAZED through here in a streak of light! ⚡ WHOOSH! 🌪️ Every gadget tested, every system GREEN! ✅ "Claude engine nominal, citizen!" 🚀 ...and with a final FWOOSH, the agent vanished into the CI clouds! 💨✨
Warning
Firewall blocked 6 domains
The following domains were blocked by the firewall during workflow execution:
network:
  allowed:
    - defaults
    - "accounts.google.com"
    - "android.clients.google.com"
    - "clients2.google.com"
    - "contentautofill.googleapis.com"
    - "safebrowsingohttpgateway.googleapis.com"
    - "www.google.com"
See Network Configuration for more information.
This discussion has been marked as outdated by Agentic Workflow Audit Agent. A newer discussion is available at Discussion #36002.
Overview
Audit of agentic workflow runs in the last ~24h. This was a partial mid-cycle window of 21 runs (17 success, 2 failure, 2 in-progress) — a smaller sample than a full day, so absence of a prior issue is not proof it is fixed.
The headline is mixed-but-positive: a single new infrastructure bug caused both of today's failures, while several long-standing issues cleared — most notably the 9-day-stuck codex alpha-model routing failure, now resolved.
Summary
🔴 Critical — NEW: cache-memory git setup bug (both failures)
Both failures today share one root cause in `setup_cache_memory_git.sh`:
The setup step logs a cache hit, then fails:
The restored cache contains the data files but no valid `.git`, yet cache-hit detection asserts a repo was found and runs git commands anyway. The same fault then re-fails the downstream Commit cache-memory changes step. For Chaos, this aborted the run before the agent even executed.
Fix (single script, restores 2 workflows): after restore, verify with `git -C "$GH_AW_CACHE_DIR" rev-parse --git-dir` (or test for `.git`); when files are present but the directory is not a valid git repo, run `git init` and re-attach the remote/branch instead of trusting the cache-hit flag. Add a regression guard for the "non-git files present after restore" branch.
✅ Resolved / Recovered this cycle
- `model: gpt-5.4` and succeeded. The one-line frontmatter edit recommended daily for 9 days was applied.
- `target=*` failures did NOT recur: Contribution Check, Smoke Claude, and Smoke Codex all passed today (they failed this way on 05-28). Partial window; not yet confirmed fixed.
🟡 Capability gaps (runs passed, coverage reduced)
Missing / broken tools
- serena `find_symbol`/`get_symbols_overview` return `MCP error 0 EOF` on first query: the Go LSP backend crashes after a successful `activate_project`. Symbol-search coverage is silently lost.
- `mcpscripts-gh` passthrough absent (only `github_pr_query/issue/discussion/go/make`); the test self-corrected with `github_pr_query`.
- `web-fetch` MCP tool not available in environment.
Action: for each tool, either provision it or update the smoke-test prompt to stop expecting it. The serena Go-LSP crash is the one most worth a dedicated reliability fix.
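For the serena crash, this audit's recommendation is to capture gopls stderr and check toolchain compatibility. The audit does not show how serena launches gopls, so the sketch below covers only the compatibility half: it records the `go` and `gopls` versions, plus anything they print to stderr, in a log that can be compared across runs. The function name and log path are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: records Go toolchain versions for later comparison.
# record_go_toolchain and the log location are illustrative names,
# not part of serena or the audited workflows.
set -u

record_go_toolchain() {
  local log="$1"
  local tool
  : > "$log"   # start with an empty log
  for tool in go gopls; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "== $tool version ==" >> "$log"
      # stderr is kept alongside stdout so a crash or mismatch message is not lost.
      "$tool" version >> "$log" 2>&1 || echo "$tool exited with status $?" >> "$log"
    else
      echo "== $tool: not found on PATH ==" >> "$log"
    fi
  done
}
```

Calling `record_go_toolchain /tmp/go-toolchain.log` before the first serena query would leave one header line per tool in the log, whether or not the tool is installed.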
⏸️ Carried / could not evaluate in this window
- `--max-turns` to ~50): Avenger did not run in this window, so this is unverified and the recommendation stays open.
📊 Trends (last ~12 days)
Success rate sits at 89.5% in today's partial window, comfortably inside the normal band and well above the 05-23 dip (41.6%). Failure counts are low (2) and both trace to one infrastructure bug rather than model/agent behavior, so the underlying agent reliability is healthy. The dashed 05-29 bar is a small mid-cycle sample — treat its absolute counts as partial.
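The 89.5% figure is consistent with counting completed runs only (17 successes out of 19 finished runs), although the audit does not state how the rate is computed. A minimal check of that arithmetic, with `success_rate` as a hypothetical helper name:

```shell
#!/usr/bin/env bash
# Assumption: in-progress runs are excluded from the denominator.
success_rate() {
  local success="$1" failure="$2"
  # LC_ALL=C keeps the decimal separator a dot regardless of locale.
  LC_ALL=C awk -v s="$success" -v f="$failure" \
    'BEGIN { printf "%.1f%%\n", 100 * s / (s + f) }'
}

success_rate 17 2   # prints 89.5%
```

Counting the 2 in-progress runs in the denominator would instead give 17/21, about 81.0%, which does not match the reported value.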
Daily cost tracks token volume closely and the 7-day moving average is flat-to-declining, with no runaway escalation; the previously rising Safe Output Optimizer cost has fallen back into line. Note that today's cost (~$7.18) is claude-engine only (estimated cost for copilot/codex/gemini is not emitted in `run_summary.json`), so it understates true spend and is not directly comparable to full-day historical figures.
Recommendations (priority order)
1. `setup_cache_memory_git.sh` to re-init when a restored cache has files but no valid `.git` (restores Chaos PR Bundle Fuzzer + Copilot PR Prompt Pattern Analysis).
2. `max-turns` next cycle it runs; keep the `max-turns: ~50` edit queued.
3. `EOF` crash in Smoke Claude (capture gopls stderr; check toolchain compatibility).
4. `web-fetch` (Smoke Codex) and `mcpscripts-gh` (Smoke Claude).
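The cache-memory fix (re-initialize when a restored cache has files but no valid `.git`) might look roughly like the sketch below. The real `setup_cache_memory_git.sh` is not shown in this audit, so the function name, the optional remote URL, and the branch default are all assumptions made for illustration. The check compares the output of `git rev-parse --git-dir` to `.git`, so that a cache directory sitting inside some other checkout is not mistaken for a repo of its own.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: ensure_cache_git_repo DIR [REMOTE_URL] [BRANCH]
# Prints "valid" if DIR is already the root of a git repo; otherwise
# initializes one in place (keeping restored files) and prints "reinitialized".
ensure_cache_git_repo() {
  local dir="$1"
  local remote_url="${2:-}"   # assumption: the caller knows the remote, if any
  local branch="${3:-main}"   # assumption: illustrative branch name

  mkdir -p "$dir"

  # At the root of a repo, `rev-parse --git-dir` prints ".git". Comparing the
  # output also rejects the case where DIR merely sits inside a parent repo.
  if [ "$(git -C "$dir" rev-parse --git-dir 2>/dev/null)" = ".git" ]; then
    echo "valid"
    return 0
  fi

  # Files may be present, but there is no usable repo: re-init in place
  # instead of trusting the cache-hit flag.
  git -C "$dir" init -q
  git -C "$dir" symbolic-ref HEAD "refs/heads/$branch"
  if [ -n "$remote_url" ]; then
    git -C "$dir" remote add origin "$remote_url"
  fi
  echo "reinitialized"
}
```

A regression guard could restore a directory that holds data files but no `.git`, call the helper, and assert that it reports `reinitialized` on the first call and `valid` on the second. In the workflow itself, the call would presumably be `ensure_cache_git_repo "$GH_AW_CACHE_DIR"` right after the cache restore step.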