CI: fail build when audit-harness citation markers leak into compiled output by marcin-kordas-hoc · Pull Request #1678 · handsontable/hyperformula

marcin-kordas-hoc · 2026-05-25T08:23:21Z

Summary

Adds a defensive post-build scan to .github/workflows/build.yml that fails the workflow if internal audit-harness markers leak into compiled JS output.

The markers ([V<n>]/[vrf_n] citation tags and the §AuditSources footer) are an internal convention used in spec drafts and agent prompts. They must never appear in shipped JS — if they do, something slipped from a comment/string literal in source into bundled output.

What changed

.github/workflows/build.yml: two new steps after Build (npm run bundle-all):
1. Verify no audit-harness markers leaked into build output — runs scripts/marker-scan.sh over dist, commonjs, es, typings, languages; exits 1 with line-numbered output on any hit.
2. Self-test marker-scan logic against synthetic fixtures — runs scripts/test-marker-scan.sh on one matrix slice (Node 22 + npm ci) to keep the live scan and fixture coverage aligned.
scripts/marker-scan.sh (73 lines): centralizes the grep scan; skips missing dirs, distinguishes "no match" (exit 0) from I/O errors (exit 2).
scripts/test-marker-scan.sh (160 lines): asserts scan behaviour against clean vs. dirty synthetic fixtures covering webpack dist, source maps, CommonJS, and ESM outputs.

Patterns

Pattern	Catches
`\[(V[0-9]+\|(vrf\|dec\|con\|que\|wrg\|crf)_[0-9]+)\]`	Citation markers: legacy `[V1]`/`[V12]` and the current prefixed form `[vrf_3]`, `[dec_1]`, `[con_…]`, `[que_…]`, `[wrg_…]`, `[crf_…]`
`§[[:space:]]*AuditSources`	Footer heading `§AuditSources` (or `§ AuditSources`)

Why

Tiny, self-contained guardrail. No new dependency on external repos or npm packages — bash only (scripts/marker-scan.sh + scripts/test-marker-scan.sh). Fires on the same matrix as the existing build (Node 20/22/24 across Linux/Windows/macOS), but since the step uses shell: bash it runs identically everywhere.

Test plan

Manually verified locally: synthetic dist/foo.js containing [V3]/[vrf_3] and §AuditSources triggers exit 1 with line-numbered output
Manually verified locally: clean JS in all output dirs exits 0
Manually verified locally: no output dirs present exits 0 (skip path)
YAML parse check passed
CI run on this PR is green against the real build output (no markers present today)

Note

Low Risk
Adds bash-only CI checks after the existing build; no runtime, auth, or application logic changes.

Overview
Adds a post-build CI gate so internal audit-harness tokens ([V<n>], prefixed tags like [vrf_3], and §AuditSources) cannot ship in compiled artifacts.

After npm run bundle-all, build.yml runs scripts/marker-scan.sh over dist, commonjs, es, typings, and languages, failing the job with line-numbered hits when grep finds a match. A second step on one matrix slice (Node 22 + npm ci) runs scripts/test-marker-scan.sh, which drives the same scan script against synthetic clean/dirty fixtures (bundles, source maps, CommonJS, ESM, typings) so workflow logic and tests stay aligned.

marker-scan.sh centralizes the regex scan, skips missing output dirs, treats “no matches” as success, and does not treat grep I/O errors as a clean pass.

^{Reviewed by Cursor Bugbot for commit 25a566a. Bugbot is set up for automated code reviews on this repo. Configure here.}

… output Adds a post-build scan step to `.github/workflows/build.yml` that greps `dist/`, `commonjs/`, and `es/` for two internal-only marker patterns: - `\[V[0-9]+\]` — audit-harness citation markers used in spec drafts - `§[[:space:]]*Sources` — section heading used in audit-harness footers Both are conventions from the audit-harness tooling and belong in internal docs/prompts only. If they ever appear in compiled JS it means a comment or string literal slipped through from a spec draft into shipped output — the scan fails the workflow with the offending file path and line number. The step runs after `npm run bundle-all` (which produces the three output directories) and skips gracefully if a directory is missing, so unrelated build failures aren't masked by this guardrail. Manual verification: - Synthesized `dist/foo.js` containing both markers — grep matched both lines and exited 1 with a clear message. - Repeated with clean JS — grep exited 0. - Repeated with no output dirs — step exited 0 (skip path).

netlify · 2026-05-25T08:23:26Z

✅ Deploy Preview for hyperformula-docs ready!

Name	Link
🔨 Latest commit	`803f11a`
🔍 Latest deploy log	https://app.netlify.com/projects/hyperformula-docs/deploys/6a1406fc85be2a000849e0c8
😎 Deploy Preview	https://deploy-preview-1678--hyperformula-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

netlify · 2026-05-25T08:23:26Z

✅ Deploy Preview for hyperformula-dev-docs ready!

Name	Link
🔨 Latest commit	`542062c`
🔍 Latest deploy log	https://app.netlify.com/projects/hyperformula-dev-docs/deploys/6a1ebdbcd971760007f41a7f
😎 Deploy Preview	https://deploy-preview-1678--hyperformula-dev-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

github-actions · 2026-05-25T08:38:53Z

Performance comparison of head (`542062c`) vs base (`508d78f`)

                                     testName |   base |   head | change
------------------------------------------------------------------------
                                      Sheet A | 504.92 | 516.94 | +2.38%
                                      Sheet B | 164.37 | 165.63 | +0.77%
                                      Sheet T | 144.27 | 141.17 | -2.15%
                                Column ranges | 529.91 | 527.35 | -0.48%
Sheet A:  change value, add/remove row/column |  16.98 |  17.96 | +5.77%
 Sheet B: change value, add/remove row/column | 136.06 | 149.01 | +9.52%
                   Column ranges - add column | 157.15 | 169.73 | +8.01%
                Column ranges - without batch | 489.64 | 484.82 | -0.98%
                        Column ranges - batch | 127.66 | 122.68 | -3.90%

Bugbot review #3296952334 flagged that the `if grep ...` form treats grep's exit code 2 (scan/IO error) identically to exit code 1 (no matches) — so a permission or read error on dist/, commonjs/, or es/ would silently green- light the step. Split the rc into 0/1/other and fail the step explicitly on any non-zero, non-1 result.

Validates the build.yml marker-scan step against synthetic fixtures: clean build, marker in dist/*.js, marker in dist/*.js.map (sourcesContent), marker in commonjs/*.js, marker in es/*.mjs. Wired as a single self-test step in build.yml that runs once per OS (node 22, ci install). Empirically confirmed (probed by planting a marker in src/index.ts and running `npm run bundle-all`) that source comments survive into: - commonjs/index.js and es/index.mjs (babel preserves comments) - dist/hyperformula{,.full}.js (webpack development build preserves comments) - dist/hyperformula.js.map (`sourcesContent` embeds full original source) All three surfaces are inside the existing `grep -rn dist commonjs es` scope, so the scan already covers source-maps. The new self-test pins this behavior so a future bundler/comment-stripping change cannot silently erode coverage.

marcin-kordas-hoc · 2026-05-25T15:16:56Z

Tier-2 hardening: integration test for marker scan + source-map coverage

Added scripts/test-marker-scan.sh (5 synthetic fixture cases) and wired it into build.yml as a single self-test step (node 22, ci install, runs once per OS, <1s).

Empirical answer to the SFDIPOT P0 question — do source-maps carry source comments?

Yes. Probed by planting // [V99] test marker — empirical sourcemap probe in src/index.ts and running npm run bundle-all. The marker survived into THREE distinct surfaces:

File	Hit count
`dist/hyperformula.js`	3
`dist/hyperformula.full.js`	3
`dist/hyperformula.js.map`	1 (inside `sourcesContent`)
`commonjs/index.js`	3
`es/index.mjs`	3

The existing grep -rn dist commonjs es step already catches all three because .map files are plain JSON and grep treats them as text. So the scope was already correct — but it was untested. This PR pins that coverage with a self-test that asserts each surface (clean / dist-js / dist-map-sourcesContent / commonjs / es) reaches the expected branch of the scan logic.

Also added an inline comment block in build.yml documenting the three surfaces, so future bundler/comment-stripping changes can't silently erode coverage without someone reading the rationale.

Verification (local):

scripts/test-marker-scan.sh — 5 passed, 0 failed
npm run bundle-all end-to-end — completes, publish-check OK
Real-build scan on clean source — rc=1 (no markers)

New head SHA: 108396574

…test cannot drift The verify step in build.yml previously inlined the audit-marker grep logic while scripts/test-marker-scan.sh kept its own duplicate copy. A workflow-only edit could silently desynchronize the live scan from the self-test fixtures that are supposed to guard it. Move the scan into scripts/marker-scan.sh as a single parameterized entry point (accepts paths as $@, exit 0=clean, 1=dirty, 2+=error). The workflow step now invokes `bash scripts/marker-scan.sh dist commonjs es`, and the self-test drives the SAME script against synthetic fixture roots.

…es output marker-scan grepped only the legacy [V<n>] form, so current markers ([vrf_1], [dec_3], ...) passed the gate; it also scanned only dist/commonjs/es, missing the typings/ and languages/ bundle outputs (both preserve source comments). Extend the grep to the lowercase prefix+_digits grammar, add typings+languages to the CI invocation and the self-test dir list, and add fixtures for both gaps.

… in build-output scan

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 542062c. Configure here.}

cursor · 2026-06-02T11:33:40Z

+  local root="$3"
+
+  local rc=0
+  run_marker_scan "$root" >/tmp/scan-out 2>&1 || rc=$?


Scan output file bypasses temp directory cleanup trap

Low Severity

assert_scan writes scan output to a hardcoded /tmp/scan-out path instead of using the already-allocated $TMP_ROOT directory. This file is not cleaned up by the trap 'rm -rf "$TMP_ROOT"' EXIT handler, which is inconsistent with the script's own temp-file management design. Using $TMP_ROOT/scan-out would keep all artifacts under the managed directory and ensure proper cleanup.

Additional Locations (1)

scripts/test-marker-scan.sh#L165-L166

^{Reviewed by Cursor Bugbot for commit 542062c. Configure here.}

codecov · 2026-06-03T02:47:14Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.16%. Comparing base (508d78f) to head (542062c).
⚠️ Report is 3 commits behind head on develop.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #1678   +/-   ##
========================================
  Coverage    97.16%   97.16%           
========================================
  Files          175      176    +1     
  Lines        15319    15322    +3     
  Branches      3356     3356           
========================================
+ Hits         14884    14887    +3     
  Misses         427      427           
  Partials         8        8

see 2 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

cursor Bot reviewed May 25, 2026

View reviewed changes

Comment thread .github/workflows/build.yml Outdated

marcin-kordas-hoc added 2 commits May 25, 2026 13:12

cursor Bot reviewed May 25, 2026

View reviewed changes

Comment thread .github/workflows/build.yml

marcin-kordas-hoc added 4 commits May 26, 2026 02:54

fix(ci): rename audit-harness footer marker §Sources to §AuditSources…

25a566a

… in build-output scan

docs(ci): fix stale marker references in marker-scan self-test header

542062c

cursor Bot reviewed Jun 2, 2026

View reviewed changes

marcin-kordas-hoc requested a review from sequba June 3, 2026 02:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI: fail build when audit-harness citation markers leak into compiled output#1678

CI: fail build when audit-harness citation markers leak into compiled output#1678
marcin-kordas-hoc wants to merge 7 commits into
developfrom
feat/ci-audit-strip-check

marcin-kordas-hoc commented May 25, 2026 •

edited

Loading

Uh oh!

netlify Bot commented May 25, 2026 •

edited

Loading

Uh oh!

netlify Bot commented May 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

github-actions Bot commented May 25, 2026 •

edited

Loading

Uh oh!

marcin-kordas-hoc commented May 25, 2026

Uh oh!

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Jun 2, 2026

Uh oh!

codecov Bot commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

marcin-kordas-hoc commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Patterns

Why

Test plan

Uh oh!

netlify Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for hyperformula-docs ready!

Uh oh!

netlify Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for hyperformula-dev-docs ready!

Uh oh!

Uh oh!

github-actions Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance comparison of head (542062c) vs base (508d78f)

Uh oh!

marcin-kordas-hoc commented May 25, 2026

Tier-2 hardening: integration test for marker scan + source-map coverage

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 2, 2026

Choose a reason for hiding this comment

Scan output file bypasses temp directory cleanup trap

Uh oh!

codecov Bot commented Jun 3, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

marcin-kordas-hoc commented May 25, 2026 •

edited

Loading

netlify Bot commented May 25, 2026 •

edited

Loading

netlify Bot commented May 25, 2026 •

edited

Loading

github-actions Bot commented May 25, 2026 •

edited

Loading

Performance comparison of head (`542062c`) vs base (`508d78f`)