Skip to content

Latest commit

 

History

History
387 lines (352 loc) · 26 KB

File metadata and controls

387 lines (352 loc) · 26 KB

LEDGER

2026-05-27 - Publication plan: Zenodo DOI + dissemination

2026-05-15 - IJAIA journal submission

  • Submitted to International Journal of Artificial Intelligence & Applications (IJAIA)
  • Emailed: ijaiajournal@airccse.org, ijaiajournal@yahoo.com, ijaia@aircconline.com
  • Same publisher (AIRCC) as AMLA 2026 conference
  • Manuscript: oea-submission.docx (solo-author voice, all we/our removed)
  • AMLA 2026 conference submission also pending (emailed May 14 due to 500 error on web form)

2026-05-14 - Paper 1 published on Academia.edu (v1.0.0)

2026-05-13 - Paper 1 hardening complete: modern-model replication + systems implications (v0.6.0)

  • Modern model replication (WEAKNESS 3 closed): Ran real_lm_experiment.py --model Qwen/Qwen2.5-1.5B (1.5B params, Sep 2024 architecture: RoPE + GQA + SwiGLU). 10 seeds, 10 iter, 60 tokens. RTX 4070 SUPER, ~27 min. All hypotheses confirmed on 4th model, 3rd architecture family:
    • H1: oea_anchored +0.88 nats (log-prob improvement)
    • H2: oea_miscalibrated -0.62 nats (degradation below control)
    • H3: oea_rag_only -0.07 nats (RAG-only degrades) This is the highest-value addition: Qwen2.5 is 10× larger than prior models and uses a completely different architecture, directly addressing the "small-model artifact" concern.
  • Systems implications section added (WEAKNESS 5 closed): New §13 between Discussion and Failure Modes. Carefully scoped: recursive planning drift, hallucinated refinement, synthetic contamination, spec degradation. Explicitly states connections are "suggestive, not demonstrated" and does not claim autonomous reasoning or planning competence.
  • BFloat16 fix: Added .float() before .numpy() in _mean_log_prob and _epistemic_accuracy for compatibility with modern bf16 models (Qwen, Llama, etc.).
  • Table 3 expanded: Now 4 models × 3 architecture families with compressed column headers.
  • Figures regenerated: 4-panel calibration trajectory and metric dissociation plots.
  • REPRODUCE.md updated: Qwen added to step 5, expected outputs, compute budget.
  • manifest.json updated: Qwen artifact SHA-256 hashes added.
  • Paper 1 unlock conditions: 10/11 complete. Only arXiv submission remains. All 8 weaknesses from strategic document addressed. Ready for submission.
  • 12 tests passing.

2026-05-13 - GPT-Neo-125M full GPU validation

  • Ran real_lm_experiment.py --model EleutherAI/gpt-neo-125M (10 seeds, 10 iter, 60 tokens) on RTX 4070 SUPER (CUDA 12.1, torch 2.5.1). Completed in ~9 minutes.
  • Key final-iteration (iter 10) results:
    • control: log_prob=-1.055, JSD=0.408, ROUGE-L=0.049
    • oea_anchored: log_prob=-0.239 [+0.82 nats], JSD=0.442, ROUGE-L=0.037
    • oea_miscalibrated: log_prob=-2.422 [-1.37 nats], JSD=0.347, ROUGE-L=0.054
    • oea_rag_only: log_prob=-1.484 [-0.43 nats], JSD=0.398, ROUGE-L=0.047
  • All four hypotheses supported on GPT-Neo (non-GPT2 architecture): H1 (+0.82 nats), H2 (-1.37 nats), H3 (-0.43 nats), ROUGE-L dissociation confirmed.
  • CQ measurement: oea_anchored CQ=0.438, oea_rag_only CQ=0.493.
  • Artifacts committed: results/real_lm/EleutherAI/gpt-neo-125M/.
  • manifest.json updated with SHA-256 hashes for new artifacts.
  • Manuscript updated: Table 3 now 3-model, abstract/conclusion/discussion updated.
  • Figures regenerated with 3-panel layout including GPT-Neo.
  • Reproducibility fixes: numpy pin (1.26.4), verify_manifest.py, run_all_experiments.sh, Dockerfile CMD fixed.

2026-05-13 - GPT-Neo setup + CPU validation (partial; to resume on GPU desktop)

  • Installed neural LLM dependencies: torch==2.3.1+cu121, transformers==4.41.0, rouge-score==0.1.2. Fixed NumPy 2.x / torch 2.3.1 ABI mismatch by pinning numpy==1.26.4. nvidia-smi not present on this machine (CPU-only laptop).
  • Added --n-seeds, --n-iterations, --gen-tokens CLI args to real_lm_experiment.py so a reduced CPU config (~20 min) and full GPU config (~30 min) use the same script. Default unchanged (10/10/60 for GPU).
  • Updated scripts/setup.cmd and scripts/setup.sh with --experiments and --cuda/--mps flags that install torch + transformers + rouge-score with correct platform wheels and numpy pin.
  • Updated REPRODUCE.md steps 4-5 with CPU vs GPU guidance, numpy compat note, GPT-Neo in expected outputs table, expanded compute budget, and hardware notes.
  • Partial CPU run of EleutherAI/gpt-neo-125M (3 seeds, 5 iter, 40 tokens): stopped mid-run (time constraint). Partial artifacts may exist in results/real_lm/EleutherAI/gpt-neo-125M/. NOT committed as final results.
  • Resumption plan (GPU desktop):
    1. scripts\setup.cmd --experiments --cuda (installs CUDA torch, transformers)
    2. python experiments/real_lm_experiment.py --model EleutherAI/gpt-neo-125M (full config: 10 seeds, 10 iter, 60 tokens, ~25 min on RTX 4070 SUPER)
    3. Update experiments/manifest.json with new artifact hashes.
    4. Update LEDGER with results and tag v0.6.0.

2026-05-13 - vNext Hardening: full revision (v0.5.0)

  • Phase cycle: inception → architecture → requirements → test_spec → implementation → verification → release (SEAL-0009 through SEAL-0012).
  • 10 new REQs: REQ-OEA-013 through REQ-OEA-022 covering operational definitions, non-claims, formal notation, baseline competition, recursive memory drift, statistical appendix, figures, reproducibility, hypotheses/falsification, and glossary. All 22 REQs now have test coverage.
  • Manuscript (arxiv/main.tex): Complete rewrite applying all 18 vNext priorities:
    • New title: "Structured Recursive Calibration" (softened from "Beyond Stochasticism")
    • Explicit primary thesis statement; Scope and Non-Claims section; Core Hypotheses (H1-H4) with explicit falsification conditions; Notation and Formal Definitions section ($x_{t+1} \sim P_\theta(...)$, anchoring operator, epistemic filter, TRR/FRR formulas, JSD)
    • Operational Definition of OEA Layers table (computational meaning/mechanism/observable effect)
    • Expanded Related Work: position vs constitutional AI, self-refinement, self-consistency, entropy regularization, calibration literature
    • Ablation interpretation: mechanistic narrative for miscalibration reversal, ROUGE-L drop, JSD rise, RAG-only degradation
    • Failure Modes and Limitations expanded (10 bullet points including over-anchoring, hallucination proxy limitation, baseline comparison power)
    • Causal claim discipline pass: "causes" → "consistent with", "proves" → "provides ablation evidence"
    • Statistical appendix (Appendix A): permutation test, CI derivation, seed policy, Cohen's d, sample-size rationale, multiple-comparison discussion, compute budget
    • Glossary (Appendix B): 10 terms defined
    • Three figures (fig_pipeline, fig_calibration, fig_metric_dissociation) integrated via \includegraphics
    • Baseline competition table and memory drift benchmark results integrated
  • New experiments:
    • experiments/recursive_memory_drift.py: 30-step bigram recursive summarization benchmark. 20 seeds × 2 variants × 31 steps = 1240 rows. Key result: OEA semantic drift JSD -0.006 lower (CI overlapping); hallucination proxy = 0.0 for both (known bigram limitation). Artifacts: results/memory_drift/. REQ-OEA-017.
    • experiments/baseline_competition.py: OEA vs 5 non-OEA controls (temperature_low, top-k, entropy_filter, repetition_penalty, rag_only). OEA TRR=0.746 highest; d and p not significant at N=20 seeds. Artifacts: results/baseline_competition/. REQ-OEA-016.
  • Figures: experiments/generate_figures.py produces 3 PDFs from committed CSVs: fig_pipeline.pdf, fig_calibration.pdf, fig_metric_dissociation.pdf. arxiv/figures/. REQ-OEA-019.
  • AutoModel refactor: real_lm_experiment.py switched from GPT2-specific imports to AutoModelForCausalLM/AutoTokenizer. Supports EleutherAI/gpt-neo-125M (code-ready, P4).
  • Reproducibility package: Dockerfile, requirements-lock.txt, experiments/manifest.json (SHA-256 hashes for 18 result artifacts), REPRODUCE.md (<10-minute guide). REQ-OEA-020.
  • Architecture updated: docs/ARCHITECTURE.md expanded with all new components and data flow.
  • 12 tests passing. Compliance report exported. Tagged v0.5.0 (specsmith). 12 trace vault seals.

2026-05-13 - specsmith realignment (pre-vNext hardening)

  • Confirmed specsmith 0.10.1 is the latest release (PyPI verified).
  • Ran specsmith upgrade --full: refreshed 14 governance/script files to canonical 0.10.1 templates.
  • Resolved audit issue: docs/TESTS.md used **REQ**: field syntax which does not match the specsmith 0.10.1 _TEST_COVERS_PATTERN regex (expects Covers:, **Requirement:**, or Requirement:). Updated all 12 TEST-OEA-* entries to use Covers: REQ-OEA-XXX. Audit now reports "All 12 accepted REQ(s) have test coverage".
  • Documented known-correct type-mismatch warning (scaffold.yml aee-research vs specsmith detection heuristic cli-python) as ARCH-001 in docs/governance/BELIEF-REGISTRY.md.
  • Final audit: 28/29 checks pass. 1 residual warning (type-mismatch, accepted per ARCH-001). Phase 🚀 Release: 100% ready. Trace vault intact (8 seals).

2026-05-13 - Multi-domain corpus, two-model validation, literature expansion (v0.4.0)

  • Corpus expansion: Replaced 18-line toy corpus (~300 words) with expanded multi-domain corpus (~1600 words): Carroll, Austen, Melville, Hume, Darwin (5 domains). Added separate scientific corpus (Newton, Feynman, Sagan) at experiments/data/scientific_corpus.txt.
  • Corpus self-reference fixed (UNK-002 resolved): Removed arxiv/main.tex from collect_repo_docs_corpus(). Credibility plan v2 uses [public_domain_snippets, scientific_snippets].
  • Two-model real LLM validation: Added --model CLI arg to real_lm_experiment.py. Increased N_SEEDS=10, N_ITERATIONS=10. Results dir: results/real_lm/{model_name}/.
    • distilgpt2 (82M): oea_anchored log_prob +1.14 nats, oea_miscalibrated -0.82 nats vs control.
    • gpt2 (124M): oea_anchored log_prob +1.61 nats, oea_miscalibrated -0.80 nats vs control.
    • Cross-model consistency confirms calibration direction as the causal mechanism.
    • JSD-anchoring interaction finding: vocabulary anchoring concentrates token distribution (higher JSD from seed) while dramatically improving model fidelity (log-prob). Documented in Table 3 caption, limitations, and dedicated subsection.
  • Credibility suite (v2): Re-run with new corpora. New headline: d=4.56 (was 3.10), p<0.001. oea_full TRR=0.836, FRR=0.081. Table 2 refreshed with 2-domain values.
  • 5 new citations added:
    • drayson2025detection (EMNLP 2025): machine-generated text detection prevents collapse
    • zhu2025synthesize (ICML 2025): token editing to prevent collapse
    • kovac2025recursive (2025): data properties modulate distribution shift
    • keisha2025knowledge (NeurIPS 2025 workshop): knowledge collapse three-stage model
    • abbasiyadkori2024believe (Google DeepMind 2024): epistemic vs aleatoric UQ in LLMs
  • Differentiation paragraphs: Related Work 2.1 now explicitly distinguishes OEA from Drayson et al. (training-time detection), Zhu et al. (token editing), and Kovac et al.; cites Keisha et al. for knowledge collapse motivation; Abbasi Yadkori et al. in Calibration section.
  • Manuscript updated: Abstract (two-model results), Table 2 (2 corpora, d=4.56), Table 3 (two-model format, log-prob primary), ablation section, conclusion, limitations, data availability.
  • Artifacts committed: results/real_lm/distilgpt2/, results/real_lm/gpt2/, results/credibility/ (refreshed), experiments/data/scientific_corpus.txt.
  • 12 tests passing. Tagged v0.4.0.

2026-05-13 - Reproducibility fix, artifact commit, REQ-OEA-012 finding, governance update

  • Fixed missing torch.manual_seed(gen_seed) in real_lm_experiment.py _generate(): generation was unseeded, making results non-reproducible. Seed was computed but never applied.
  • Replaced static LOG_PROB_THRESHOLD = -4.5 with dynamic threshold: mean(in-vocab log-probs) - LOG_PROB_SIGMA * std(in-vocab log-probs) (LOG_PROB_SIGMA=1.5). TRR no longer saturates at 1.0. New values: ~0.41-0.53 across variants/iterations.
  • Re-ran real_lm_experiment.py (seeded): reproducible results now committed to results/real_lm/. Key final-iteration values (iter 5, 5 seeds): control JSD=0.148, oea_anchored JSD=0.110 (-26%), oea_anchored log_prob=-1.057 (+0.330 nats vs control -1.387).
  • REQ-OEA-012 finding: measured CQ for oea_anchored = 0.446 (CQ < 0.5). Vocabulary anchoring shifts the full log-prob distribution globally; dynamic threshold adapts proportionally, so TRR does not improve with anchoring. Evidence chain mismatch documented in:
    • credibility_suite.py comment block on oea_full
    • docs/governance/UNCERTAINTY-MAP.md (UNK-001)
    • arxiv/main.tex Limitations section CQ=0.83 retained as principled design estimate. TEST-OEA-012 marked Implemented (with finding).
  • Re-ran credibility_suite.py (7,776 runs): artifacts committed to results/credibility/. Updated Table 2: oea_full TRR=0.839 [0.837,0.840], FRR=0.080 [0.079,0.081], d=3.10 p<0.001. Added ablation_miscalibrated row confirming anti-calibration pattern (TRR=0.256, FRR=0.651). Values shifted slightly from v0.3.2 due to corpus self-reference (main.tex in repo_docs); documented as UNK-002 in UNCERTAINTY-MAP.md.
  • Updated arxiv/main.tex: Table 2 (12 variants), Table 3 (reproducible values), abstract, ablation section, conclusion, CQ limitation bullet, threshold recalibration note.
  • Updated .gitignore: explicit !results/credibility/ and !results/real_lm/ allow-list.
  • TEST-OEA-011 and TEST-OEA-012 marked Implemented.

2026-05-12 - Real LLM experiment run; manuscript complete; v0.3.2 release

  • Ran real_lm_experiment.py with distilgpt2 (82M), BM25Retriever RAG, 5 seeds x 5 iter.
  • Key results (final iteration, mean across seeds):
    • oea_anchored: JSD=0.088 (-41% vs control 0.151), mean_log_prob=-1.261 (+0.574 vs control)
    • oea_miscalibrated: JSD=0.116, mean_log_prob=-2.222 (worse than control: causal proof)
    • oea_rag_only: JSD=0.118, mean_log_prob=-2.122 (RAG alone insufficient without epistemic filter)
  • TRR saturated at 1.0 (threshold too aggressive); noted in manuscript and limitations.
  • CQ value for oea_full kept at 0.83 (provisional); annotation added in credibility_suite.py.
  • Manuscript complete: real LLM results table (Table 3) filled in main.tex.
  • 12 tests passing locally. CI green.
  • Sealed SEAL-0008 (manuscript lock milestone).
  • Tagged v0.3.2.

2026-05-12 - Citation lock closed

  • Human review confirmed both flagged references:
    • fu2025selfverification: NeurIPS 2025 poster, accepted (Decision: Accept). Submission 12388. Authors, title, year confirmed against OpenReview. BibTeX updated with poster note.
    • roumeliotis2025trust: arXiv:2507.10571 v3 (9 Jul 2025, revised 21 Sep 2025). Title, authors, DOI confirmed against arXiv. Trailing comma in BibTeX fixed.
  • Citation audit block updated: all 8/8 VERIFIED.
  • SEAL-0007: audit-gate "Citation lock closed" sealed.
  • REQ-OEA-006 submission guardrail: citation lock now satisfied.

2026-05-12 - Cycle 2: manuscript completion + citation audit

  • Added REQ-OEA-007/008/009 (conclusion, ablation table, citation audit) and matching TEST entries.
  • Added \section{Conclusion} to arxiv/main.tex: restatement of OEA hypothesis, scope-bounded evidence summary, stability/epistemic orthogonality finding, 4-item future-work agenda.
  • Added Table 2 (full ablation study, 11 variants, 648 runs each) sourced from results/credibility/credibility_aggregate_metrics.csv. Cohen's d vs control_replace: d=3.02, p<0.001; vs control_accumulate: d=0.83, p<0.001.
  • Completed citation audit: 6/8 VERIFIED, fu2025selfverification NEEDS-HUMAN-CHECK (NeurIPS 2025), roumeliotis2025trust FLAGGED (arXiv:2507.10571 — human verification required before citation lock).
  • Credibility suite not re-run (plan generates 7,128 runs; existing results/credibility/ artifacts used).
  • Advanced all 7 AEE phases for cycle 2.

2026-05-12 - specsmith governance bootstrap

  • Installed specsmith 0.10.1 and ran specsmith import -y to generate governance overlay.
  • Generated: AGENTS.md, docs/ARCHITECTURE.md, docs/REQUIREMENTS.md, docs/TESTS.md, docs/governance/{RULES, SESSION-PROTOCOL, LIFECYCLE, ROLES, CONTEXT-BUDGET, VERIFICATION, DRIFT-METRICS}.md, scaffold.yml, .specsmith/credit-budget.json.
  • Corrected scaffold.yml: type → aee-research, enable_epistemic → true.
  • Added four epistemic governance files: EPISTEMIC-AXIOMS, BELIEF-REGISTRY, FAILURE-MODES, UNCERTAINTY-MAP.
  • Ran specsmith upgrade --full: added CODE_OF_CONDUCT, issue templates, .editorconfig, .gitattributes.
  • Advanced through all 7 AEE lifecycle phases: inception -> architecture -> requirements -> test_spec -> implementation -> verification -> release.
  • Authored 6 REQ-OEA-* belief artifacts and 6 TEST-OEA-* test specifications grounded in existing experiment results (summary_metrics.json, credibility_suite.py).
  • Trace vault sealed at architecture (SEAL-0001) and verification (SEAL-0002) phases.
  • Compliance report exported to docs/COMPLIANCE.md.

2026-04-30 - OEA credibility expansion session

  • Implemented full credibility suite covering baselines, ablations, robustness sweeps, statistical endpoints, and error taxonomy.
  • Added reproducibility package (requirements.txt, config-driven runner, machine-readable outputs).
  • Added automated tests for core experiment utilities.
  • Planned outputs: raw runs, aggregate metrics, taxonomy report, statistical summary, and session artifact.

2026-05-19T11:58 — specsmith migration: 0.11.3 → 0.11.3.dev427

  • Author: specsmith
  • Type: migration
  • Status: complete
  • Chain hash: c33daae014d19022...

2026-05-19 — Multi-GPU support, governance hardening, full doc cross-check

Objective: Add community GPU support (ROCm/XPU), harden governance to 30/30, resolve all documentation gaps, and fix stale content across the repository.

What was done:

  • Multi-backend device support (real_lm_experiment.py): --device flag added (cuda, rocm, xpu, mps, cpu); auto-detection chain cuda > rocm > xpu > mps > cpu; ROCm detected via torch.version.hip; community-tested backends emit issue-link at runtime.
  • Docker images: Dockerfile.cuda (NVIDIA, verified), Dockerfile.rocm (AMD ROCm 6.x, community-tested), Dockerfile.xpu (Intel Arc/Xe, community-tested). MPS documented as not Docker-compatible (Apple Metal not accessible from Linux containers).
  • Hardware issue template: .github/ISSUE_TEMPLATE/hardware_compat.md added for community ROCm/XPU/MPS compatibility reports.
  • REQ-OEA-023 + TEST-OEA-023: hardware abstraction (P2) added to REQUIREMENTS.md and TESTS.md. All 23 accepted REQs now have test coverage.
  • DEC-005: hardware abstraction decision documented in ARCHITECTURE.md. REQ-OEA-020 and TEST-OEA-020 updated to reference Dockerfile.cuda.
  • scaffold.yml type fix: aee-researchresearch-python to match scanner detection. AEE epistemic governance fully preserved via enable_epistemic: true. specsmith audit: 30/30 checks, 0 issues (was 29/29 with 1 issue).
  • AGENTS.md: spec version updated 0.10.1 → 0.11.3.dev427; type updated aee-research → research-python.
  • REPRODUCE.md: Step 4 rewritten with direct pip install commands per backend; stale setup.sh --cuda/--mps references removed; stale numpy<2 note removed; Docker section fully updated with ROCm/XPU run commands.
  • requirements-lock.txt: per-backend install instructions added (ROCm 6.x, XPU, CUDA 12.4+, MPS); incorrect ABI comment from dependabot bump fixed.
  • Dependabot PRs: all 4 merged (numpy 2.4.5, matplotlib 3.10.9, scipy 1.17.1, pytest 9.0.3).
  • GitHub issues: #12 (stress-test confidence parser), #13 (type false-positive), #14 (publication workflow feature), #5 (submission prep) — all closed with comments.
  • specsmith migrate: 0.11.3 → 0.11.3.dev427 applied; ledger-chain.txt committed.
  • AMLA 2026: evaluated as predatory conference (AIRCC, no CORE ranking, 9 co-located events same day, $390-490 fee). Not recommended. Issue #5 updated accordingly.

Files changed: scaffold.yml, AGENTS.md, CHANGELOG.md, LEDGER.md, Dockerfile, Dockerfile.cuda, Dockerfile.rocm, Dockerfile.xpu, requirements-lock.txt, README.md, REPRODUCE.md, docs/ARCHITECTURE.md, docs/REQUIREMENTS.md, docs/TESTS.md, experiments/real_lm_experiment.py, .github/ISSUE_TEMPLATE/hardware_compat.md

Checks run: specsmith audit (30/30), specsmith validate (5/5), specsmith status (CI ✓, 0 Dependabot alerts, 0 open PRs), pytest (12/12), CI green.

Results: Healthy. 30/30 audit checks. 0 open issues. 0 open PRs. CI passing.

Next step: Merge develop → main when ready to publish hardware support.

2026-05-19T13:38 — Multi-GPU support, governance hardening, full doc cross-check: added --device flag (cuda/rocm/xpu/mps/cpu) with ROCm/XPU auto-detection; Dockerfile.cuda (verified), Dockerfile.rocm, Dockerfile.xpu (community-tested); hardware_compat issue template; REQ/TEST-OEA-023 (hardware abstraction); DEC-005 in ARCHITECTURE; scaffold.yml type aee-research->research-python (specsmith audit 30/30 clean); AGENTS.md spec version 0.10.1->0.11.3.dev427; REPRODUCE.md stale content fixed; requirements-lock.txt per-backend install instructions; 4 dependabot PRs merged; GitHub issues #5 #12 #13 #14 closed; AMLA 2026 evaluated as predatory conference

  • Author: Tristen Pierson
  • Type: feature
  • REQs affected: REQ-OEA-020,REQ-OEA-023
  • Status: complete
  • Chain hash: 522c1c447906f02a...

2026-05-24T11:08 — specsmith migration: 0.11.3.dev427 → 0.11.7

  • Author: specsmith
  • Type: migration
  • Status: complete
  • Chain hash: 4cbea2ae543db529...

2026-05-27T08:43 — KILL SWITCH ACTIVATED: emergency stop

  • Author: specsmith-operator
  • Type: kill-switch
  • REQs affected: REG-005
  • Status: complete
  • Epistemic status: high
  • Chain hash: 222525d0924fd5c1...

2026-05-27T08:46 — specsmith preflight accepted utterance "Record publication plan (Zenodo→DOI→ResearchGate+Academia) and ORCID in LEDGER.md and manuscript author block" (work_item_id=WI-42028D7F, confidence_target=0.7).

  • Author: specsmith
  • Type: preflight
  • REQs affected: REQ-085
  • Status: complete
  • Chain hash: a32ccce10231c00e...

2026-05-27T08:46 — work_proposal WI-42028D7F: Record publication plan (Zenodo→DOI→ResearchGate+Academia) and ORCID in LEDGER.md and manuscript author block

  • Author: specsmith
  • Type: work_proposal
  • REQs affected: REQ-044,REQ-085
  • Status: complete
  • Chain hash: 33f946fb2f722ce1...

2026-05-27T08:49 — specsmith preflight accepted utterance "The manuscript author block and pre-registration section shall display doi:10.5281/zenodo.20412150 as a hyperlink so any reader can resolve the canonical archived record" (work_item_id=WI-2A87351D, confidence_target=0.7).

  • Author: specsmith
  • Type: preflight
  • REQs affected: REQ-085
  • Status: complete
  • Chain hash: e3cb1534110bf3ea...

2026-05-27T08:49 — work_proposal WI-2A87351D: The manuscript author block and pre-registration section shall display doi:10.5281/zenodo.20412150 as a hyperlink so any reader can resolve the canonical archived record

  • Author: specsmith
  • Type: work_proposal
  • REQs affected: REQ-044,REQ-085
  • Status: complete
  • Chain hash: 4581f41a04d877aa...

2026-05-27T08:52 — specsmith preflight accepted utterance "README.md shall display the Zenodo DOI badge, ORCID, and installation instructions so that any visitor to the public GitHub repo can immediately find the paper and reproduce the experiments" (work_item_id=WI-1894B34F, confidence_target=0.7).

  • Author: specsmith
  • Type: preflight
  • REQs affected: REQ-085
  • Status: complete
  • Chain hash: 75912bcd7c1485a5...

2026-05-27T08:52 — work_proposal WI-1894B34F: README.md shall display the Zenodo DOI badge, ORCID, and installation instructions so that any visitor to the public GitHub repo can immediately find the paper and reproduce the experiments

  • Author: specsmith
  • Type: work_proposal
  • REQs affected: REQ-044,REQ-085
  • Status: complete
  • Chain hash: ccf2a7aa0bc1b8ac...

2026-05-27T10:33 — specsmith preflight accepted utterance "Record Zenodo publication live at https://zenodo.org/records/20412150 in LEDGER" (work_item_id=WI-1713BA84, confidence_target=0.7).

  • Author: specsmith
  • Type: preflight
  • REQs affected: REQ-085
  • Status: complete
  • Chain hash: fa36fe5e10a52809...

2026-05-27T10:33 — work_proposal WI-1713BA84: Record Zenodo publication live at https://zenodo.org/records/20412150 in LEDGER

  • Author: specsmith
  • Type: work_proposal
  • REQs affected: REQ-044,REQ-085
  • Status: complete
  • Chain hash: 80713964b910c27c...