Optimize export-audit workflow token usage by precomputing audits and bounding verification#3258
Conversation
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
There was a problem hiding this comment.
Pull request overview
This PR optimizes the API Surface & Export Audit agentic workflow by moving more deterministic audit work into pre-agent steps, restructuring prompt data for cacheability, and bounding verification work.
Changes:
- Adds precomputed test-import and api-proxy export audit steps.
- Moves dynamic step-output data into a final “Pre-computed Data” prompt section.
- Regenerates the compiled workflow lock file.
Show a summary per file
| File | Description |
|---|---|
.github/workflows/export-audit.md |
Adds new audit steps and updates prompt phases, verification budget, and precomputed data layout. |
.github/workflows/export-audit.lock.yml |
Regenerated compiled workflow reflecting new steps, env wiring, and prompt template changes. |
Copilot's findings
Tip
Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
- Files reviewed: 2/2 changed files
- Comments generated: 3
This comment has been minimized.
This comment has been minimized.
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
|
@copilot fix this failed ci check https://github.com/github/gh-aw-firewall/actions/runs/25967614686/job/76333782113?pr=3258 |
Fixed in commit |
🔬 Smoke Test Results
Overall: FAIL — GitHub MCP returned 401; template variable substitution did not occur for pre-step outputs. PR author/assignees could not be retrieved due to MCP auth failure.
|
Claude Engine Smoke Test Results❌ Test 1 (GitHub API): FAIL — gh CLI authentication error (HTTP 401) Overall: 2/3 PASS
|
Smoke Test: Copilot BYOK (Offline) Mode
Running in BYOK offline mode ( Overall: FAIL — pre-step template variables (
|
|
[awf] Support Azure Copilot BYOK env routing in api-proxy and resolve gpt-5.4 via gpt-5 family aliases Warning Firewall blocked 1 domainThe following domain was blocked by the firewall during workflow execution:
network:
allowed:
- defaults
- "registry.npmjs.org"See Network Configuration for more information.
|
Smoke Test Results\n\n- GitHub MCP: ❌\n- Connectivity: ❌\n- File Writing: ✅\n- Bash Tool: ✅\n\nOverall: FAILWarning Firewall blocked 1 domainThe following domain was blocked by the firewall during workflow execution:
network:
allowed:
- defaults
- "localhost"See Network Configuration for more information.
|
🏗️ Build Test Suite Results
Overall: 8/8 ecosystems passed — ✅ PASS
|
Smoke Test Results — FAIL
Overall: FAIL —
|
🧪 Chroot Version Comparison Results
Overall: ❌ Not all tests passed — Python and Node.js versions differ between host and chroot environments.
|
API Surface & Export Auditwas consistently expensive (high effective token usage, low prefix-cache reuse) and occasionally spiked into outlier runs due to repeated in-agent bash verification loops. This update shifts deterministic audits into pre-agent steps, refactors prompt layout for cacheability, and adds explicit verification limits.Precompute deterministic Phase 4/5 audits
test_importsstep to collect test import-path findings.apip_exportsstep to collectapi-proxyprovider export consistency findings.Refactor prompt structure for prefix caching
${{ steps.*.outputs.* }}interpolations into a single## Pre-computed Datablock at the end of the prompt.Bound verification loops
## Verification Budgetsection:Regenerate compiled workflow
export-audit.mdintoexport-audit.lock.ymlso runtime prompt/env wiring reflects new step outputs and prompt layout.