fix(ci): prevent empty safeoutputs schema probes in smoke-claude workflow #5307

Merged
lpcox merged 2 commits into main from copilot/debug-smoke-claude-failure-again on Jun 20, 2026

Copilot AI commented Jun 19, 2026

Contributor
  • Reproduce the failing Smoke Claude run and inspect logs
  • Add a prompt guardrail preventing empty add_comment / add_labels calls
  • Regenerate smoke-claude.lock.yml for the source change
  • Add a focused smoke workflow test assertion for the guardrail
  • Run targeted tests and workflow compile/postprocess steps
  • Run CodeQL check

Copilot AI requested a review from lpcox June 19, 2026 22:51
@lpcox lpcox marked this pull request as ready for review June 19, 2026 23:20
Copilot AI review requested due to automatic review settings June 19, 2026 23:20
@github-actions

Contributor

✅ Coverage Check Passed

Overall Coverage

| Metric | Base | PR | Delta |
|---|---|---|---|
| Lines | 97.62% | 97.66% | 📈 +0.04% |
| Statements | 97.56% | 97.60% | 📈 +0.04% |
| Functions | 98.85% | 98.85% | ➡️ +0.00% |
| Branches | 93.24% | 93.27% | 📈 +0.03% |
📁 Per-file Coverage Changes (1 file)

| File | Lines (Before → After) | Statements (Before → After) |
|---|---|---|
| src/workdir-setup.ts | 92.7% → 94.5% (+1.82%) | 92.7% → 94.5% (+1.82%) |

Coverage comparison generated by scripts/ci/compare-coverage.ts

Copilot AI left a comment

Contributor

Pull request overview

This PR updates the Smoke Claude workflow prompt guidance (and its lock metadata) to prevent “schema probe” / empty-argument invocations of safeoutputs tools, and adds a focused Jest assertion to ensure the guardrail stays present in the source workflow.

Changes:

  • Add an explicit prompt instruction to never call add_comment / add_labels with empty arguments (avoid schema-probing).
  • Update the smoke workflow Jest test to assert the new instruction exists in smoke-claude.md.
  • Regenerate smoke-claude.lock.yml metadata to reflect the updated workflow source.

| File | Description |
|---|---|
| scripts/ci/smoke-claude-workflow.test.ts | Adds a test assertion to enforce the new “no empty safeoutputs calls” guardrail in the workflow source. |
| .github/workflows/smoke-claude.md | Adds the explicit instruction to avoid empty-argument add_comment / add_labels calls. |
| .github/workflows/smoke-claude.lock.yml | Updates compiled workflow metadata (body hash) corresponding to the source prompt change. |
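
The kind of check the new test performs can be sketched roughly as follows. This is an illustration only, not the contents of scripts/ci/smoke-claude-workflow.test.ts: the helper name `hasEmptyCallGuardrail`, the sample prompt wording, and the keyword heuristic are all assumptions, because the exact instruction text is not quoted in this conversation.

```typescript
// Hypothetical helper: returns true when some line of the workflow prompt
// names both tools and forbids calling them with empty arguments.
function hasEmptyCallGuardrail(prompt: string): boolean {
  return prompt.split("\n").some((line) => {
    const lower = line.toLowerCase();
    return (
      lower.includes("add_comment") &&
      lower.includes("add_labels") &&
      lower.includes("never") &&
      lower.includes("empty")
    );
  });
}

// Assumed sample wording; the real instruction in smoke-claude.md may differ.
const samplePrompt = [
  "## Output",
  "Never call add_comment or add_labels with empty arguments to probe their schema.",
].join("\n");

console.log(hasEmptyCallGuardrail(samplePrompt)); // prints true
console.log(hasEmptyCallGuardrail("Call add_comment with the summary.")); // prints false
```

In a Jest suite, a check like this would typically be fed the contents of the workflow source file and wrapped in an `expect(...)` assertion, so that removing the instruction from the prompt fails CI.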

Copilot's findings

  • Files reviewed: 3/3 changed files
  • Comments generated: 0

@lpcox

lpcox commented Jun 19, 2026

Collaborator

@copilot address the review feedback and fix the pr title and summary

@lpcox

lpcox commented Jun 20, 2026

Collaborator

@copilot address the review feedback and fix the pr title and summary

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

✨ The prophecy is fulfilled... Smoke Codex has completed its mystical journey. The stars align. 🌟

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Chroot tests passed! Smoke Chroot - All security and functionality tests succeeded.

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

🔑 Smoke Copilot PAT PAT auth validated. All systems operational. ✅

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Smoke Copilot BYOK completed. Copilot BYOK mode operational. 🔓

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Build Test Suite completed successfully!

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Smoke Copilot BYOK AOAI (Entra) reports failed. AOAI BYOK (Entra) mode investigation needed...

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

📡 Smoke OTel Tracing completed. All tracing scenarios validated. ✅

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Smoke Gemini completed. All facets verified. 💎

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Smoke Claude failed

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

📰 VERDICT: Smoke Copilot has concluded. All systems operational. This is a developing story. 🎤

@github-actions

Contributor

🚀 Security Guard has started processing this pull request

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Smoke Copilot BYOK AOAI (api-key) reports failed. AOAI BYOK (api-key) mode investigation needed...

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

🔌 Smoke Services — All services reachable! ✅

@github-actions

github-actions Bot commented Jun 20, 2026

Contributor

Contribution Check completed successfully!

@github-actions

Contributor

🔬 Smoke Test Results

PR: [WIP] Debug smoke Claude workflow failure
Author: @Copilot | Assignees: @lpcox, @Copilot

| Test | Result |
|---|---|
| ✅ GitHub MCP connectivity | PASS — PR list fetched |
| ✅ GitHub.com connectivity | PASS — HTTP 200 |
| ❌ File write/read test | FAIL — pre-step template vars not substituted |

Overall: FAIL (pre-computed data missing due to unresolved ${{ }} expressions)

📰 BREAKING: Report filed by Smoke Copilot
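
The failure mode reported above, where `${{ }}` expressions reach the agent without being substituted, can be caught with a simple scan before the data is used. The sketch below is a hypothetical illustration: the helper name `findUnresolvedExpressions` and the sample step and output names are assumptions and do not come from the workflow itself.

```typescript
// Hypothetical helper: lists any GitHub Actions style `${{ ... }}` expressions
// still present in text that should already have been rendered.
function findUnresolvedExpressions(text: string): string[] {
  // Non-greedy match so that adjacent expressions are reported separately.
  return text.match(/\$\{\{.*?\}\}/g) ?? [];
}

// Assumed sample input; the step and output names are illustrative only.
const rendered = "PR count: ${{ steps.precompute.outputs.pr_count }}, status: ok";
const leftovers = findUnresolvedExpressions(rendered);

if (leftovers.length > 0) {
  console.log(`unresolved: ${leftovers.join(", ")}`);
}
```

Failing early on a non-empty result would turn a confusing downstream test failure into a direct report of which expression was never resolved.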

@github-actions

Contributor

Smoke Test: Copilot PAT — PASS ✅

Test Result
GitHub MCP connectivity
GitHub.com HTTP (200)
File write/read

Auth mode: PAT (COPILOT_GITHUB_TOKEN)
PR: "[WIP] Debug smoke Claude workflow failure" — @Copilot (author), @lpcox (assignee)

🔑 PAT report filed by Smoke Copilot PAT

@github-actions

Contributor

Smoke Test: Copilot BYOK (Direct) Mode ✅ PASS

Results:

  • ✅ MCP connectivity: Retrieved 2 merged PRs
  • ✅ GitHub.com: HTTP 200
  • ✅ File I/O: Created and verified
  • ✅ BYOK inference: Direct mode working (agent → api-proxy → api.githubcopilot.com)

Running in direct BYOK mode via COPILOT_PROVIDER_API_KEY.

/cc @lpcox @Copilot

🔑 BYOK report filed by Smoke Copilot BYOK

@github-actions

Contributor

Smoke Test: API Proxy OpenTelemetry Tracing

| Scenario | Result | Detail |
|---|---|---|
| 1. Module Loading | ✅ | otel.js loads cleanly; exports: startRequestSpan, setTokenAttributes, setBudgetAttributes, endSpan, endSpanError, shutdown, isEnabled |
| 2. Test Suite | ✅ | 59 tests passed, 0 failed across 2 suites (otel.test.js, otel-fanout.test.js) |
| 3. Env Var Forwarding | ✅ | api-proxy-service-config.ts forwards GH_AW_OTLP_ENDPOINTS, OTEL_EXPORTER_OTLP_ENDPOINT, OTEL_EXPORTER_OTLP_HEADERS, GITHUB_AW_OTEL_TRACE_ID, GITHUB_AW_OTEL_PARENT_SPAN_ID, OTEL_SERVICE_NAME |
| 4. Token Tracker Integration | ✅ | onUsage callback present in token-tracker-http.js (line 324) as OTEL hook point |
| 5. OTEL Diagnostics | ✅ | isEnabled() returns true; graceful fallback to otel.jsonl file export when no OTLP endpoint configured |

All 5 scenarios pass. OTEL tracing integration is functioning correctly.

📡 OTel tracing validated by Smoke OTel Tracing

@github-actions

Contributor

Chroot Runtime Version Comparison

| Runtime | Host Version | Chroot Version | Match? |
|---|---|---|---|
| Python | Python 3.12.13 | Python 3.12.3 | ❌ |
| Node.js | v24.16.0 | v22.22.3 | ❌ |
| Go | go1.22.12 | go1.22.12 | ✅ |

Result: 1/3 tests passed — Go matches, but Python and Node.js versions differ between host and chroot environments.

Tested by Smoke Chroot
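
The 1/3 tally reported by Smoke Chroot can be reproduced from the listed versions with a strict string comparison. This is a minimal sketch under that assumption; the actual comparison rule used by the smoke test is not shown in this conversation, and the `RuntimeVersions` type and `summarize` function are hypothetical names.

```typescript
// One host/chroot version pair, as reported in the comparison.
interface RuntimeVersions {
  runtime: string;
  host: string;
  chroot: string;
}

// Versions copied from the Smoke Chroot report.
const rows: RuntimeVersions[] = [
  { runtime: "Python", host: "Python 3.12.13", chroot: "Python 3.12.3" },
  { runtime: "Node.js", host: "v24.16.0", chroot: "v22.22.3" },
  { runtime: "Go", host: "go1.22.12", chroot: "go1.22.12" },
];

// Assumption: a runtime "matches" only when both version strings are identical.
function summarize(entries: RuntimeVersions[]): string {
  const matched = entries.filter((e) => e.host === e.chroot).map((e) => e.runtime);
  return `${matched.length}/${entries.length} match (${matched.join(", ") || "none"})`;
}

console.log(summarize(rows)); // prints "1/3 match (Go)"
```

A looser rule, such as comparing only major and minor versions, would still report a Node.js mismatch (24.16 versus 22.22) but would treat the two Python 3.12 patch releases as equivalent.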

@github-actions

Contributor

Gemini Smoke Test Results

Overall status: PASS

Warning

Firewall blocked 1 domain

The following domain was blocked by the firewall during workflow execution:

  • localhost

To allow these domains, add them to the network.allowed list in your workflow frontmatter:

network:
  allowed:
    - defaults
    - "localhost"

See Network Configuration for more information.

💎 Faceted by Smoke Gemini

@github-actions

Contributor

[WIP] Debug smoke Claude workflow failure by Copilot

Merged PRs reviewed:

  • chore: upgrade gh-aw to v0.80.6 and recompile workflows
  • [WIP] Refactor entrypoint.sh by extracting initialization phases into functions

Checks:

  • GitHub title: ✅
  • Merged PR review: ✅
  • Discussion query: ✅
  • Temp file: ✅
  • Build: ✅

Overall: PASS

🔮 The oracle has spoken through Smoke Codex

@github-actions

Contributor

🏗️ Build Test Suite Results

| Ecosystem | Project | Build/Install | Tests | Status |
|---|---|---|---|---|
| Bun | elysia | | 1/1 passed | ✅ PASS |
| Bun | hono | | 1/1 passed | ✅ PASS |
| C++ | fmt | | N/A | ✅ PASS |
| C++ | json | | N/A | ✅ PASS |
| Deno | oak | N/A | 1/1 passed | ✅ PASS |
| Deno | std | N/A | 1/1 passed | ✅ PASS |
| .NET | hello-world | | N/A | ✅ PASS |
| .NET | json-parse | | N/A | ✅ PASS |
| Go | color | | 1/1 passed | ✅ PASS |
| Go | env | | 1/1 passed | ✅ PASS |
| Go | uuid | | 1/1 passed | ✅ PASS |
| Java | gson | | 1/1 passed | ✅ PASS |
| Java | caffeine | | 1/1 passed | ✅ PASS |
| Node.js | clsx | | passed | ✅ PASS |
| Node.js | execa | | passed | ✅ PASS |
| Node.js | p-limit | | passed | ✅ PASS |
| Rust | fd | | 1/1 passed | ✅ PASS |
| Rust | zoxide | | 1/1 passed | ✅ PASS |

Overall: 8/8 ecosystems passed — ✅ PASS

Generated by Build Test Suite for issue #5307 · 42.2 AIC · ⊞ 7.7K ·

@github-actions

Contributor

Smoke Test Results

  • Redis PING: ❌ (timeout, no PONG)
  • PostgreSQL pg_isready: ❌ (no response on port 5432)
  • PostgreSQL SELECT 1: ❌ (timeout, no output)

host.docker.internal resolves to 172.17.0.1 but service containers are unreachable.

Overall: FAIL

🔌 Service connectivity validated by Smoke Services

@lpcox lpcox changed the title from "[WIP] Debug smoke Claude workflow failure" to "fix(ci): prevent empty safeoutputs schema probes in smoke-claude workflow" on Jun 20, 2026

Copilot AI commented Jun 20, 2026

Contributor Author

> @copilot address the review feedback and fix the pr title and summary

Updated the PR metadata: title is now fix(ci): prevent empty safeoutputs schema probes in smoke-claude workflow and the summary checklist has been rewritten to reflect the actual changes. There were no actionable reviewer findings to address in code, so no new commit was needed (latest commit remains 5bf60bd).

Copilot finished work on behalf of lpcox June 20, 2026 03:17
@lpcox lpcox merged commit 39bcb8b into main Jun 20, 2026
90 of 95 checks passed
@lpcox lpcox deleted the copilot/debug-smoke-claude-failure-again branch June 20, 2026 15:40

3 participants