
Commit b5e6196

Merge pull request #344 from raifdmueller/feat/evaluation-report-page
feat: publish evaluation report on the website
2 parents 8a54746 + 1d9d6a3 commit b5e6196

3 files changed

Lines changed: 397 additions & 0 deletions

docs/anchor-evaluations.adoc

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ We do not know whether this holds equally across models.
 A semantic anchor that works perfectly in Claude may activate a different or shallow framework in GPT, Gemini, or an open-source model.
 Without systematic evaluation, our catalog is a collection of untested assumptions.
 
+link:../evaluation-report.html[**View the latest evaluation results →**]
+
 This document describes how to build evaluations that answer three questions:
 
 . Does a given LLM *recognize* a semantic anchor?

scripts/render-docs.js

Lines changed: 7 additions & 0 deletions
@@ -93,6 +93,13 @@ renderFile(
   path.join(WEB_DOCS, 'spec-driven-workflow.de.html')
 )
 
+// Copy evaluation report (self-contained HTML)
+const evalReport = path.join(ROOT, 'evaluations/report.html')
+if (fs.existsSync(evalReport)) {
+  fs.copyFileSync(evalReport, path.join(WEB_PUBLIC, 'evaluation-report.html'))
+  console.log(`Copied: ${path.relative(ROOT, path.join(WEB_PUBLIC, 'evaluation-report.html'))}`)
+}
+
 // Copy assets referenced by workflow docs
 const workflowDiagram = path.join(ROOT, 'docs/workflow-diagram.png')
 if (fs.existsSync(workflowDiagram)) {
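
The hunk above repeats the same "copy if the source exists" pattern for the evaluation report and the workflow diagram. As a minimal sketch only, that pattern could be factored into a helper. `copyIfExists` is a hypothetical name that does not appear in this commit, and the temporary directory and the `public` folder below are stand-ins for the script's `ROOT` and `WEB_PUBLIC`, whose real values are not shown in the diff. Unlike the committed code, this sketch also creates the destination directory, which is an added assumption.

```javascript
const fs = require('fs')
const os = require('os')
const path = require('path')

// Hypothetical helper (not part of this commit): copy `src` to `dest` only
// when `src` exists. Creates the destination directory first (an assumption;
// the committed code relies on the directory already existing).
// Returns true if the file was copied, false if the source was missing.
function copyIfExists(src, dest) {
  if (!fs.existsSync(src)) return false
  fs.mkdirSync(path.dirname(dest), { recursive: true })
  fs.copyFileSync(src, dest)
  return true
}

// Demo in a throwaway temp directory standing in for the repository root.
const root = fs.mkdtempSync(path.join(os.tmpdir(), 'render-docs-'))
const report = path.join(root, 'evaluations', 'report.html')
fs.mkdirSync(path.dirname(report), { recursive: true })
fs.writeFileSync(report, '<!doctype html><title>Evaluation report</title>')

// 'public' is a placeholder for WEB_PUBLIC.
const target = path.join(root, 'public', 'evaluation-report.html')
const copied = copyIfExists(report, target)
const skipped = copyIfExists(path.join(root, 'missing.html'), path.join(root, 'out.html'))

console.log(`copied=${copied} skipped=${skipped}`)
```

Returning a boolean lets the caller decide whether to log the `Copied: ...` line, which keeps the helper free of output side effects.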
