lucasbellesi
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 23 additions & 63 deletions b/‎CONTRIBUTING.md‎
Lines changed: 23 additions & 63 deletions
diff --git a/‎FULL_REPO_IMPROVEMENT_REVIEW.md‎
Lines changed: 224 additions & 0 deletions b/‎FULL_REPO_IMPROVEMENT_REVIEW.md‎
Lines changed: 224 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 13 additions & 3 deletions b/‎README.md‎
Lines changed: 13 additions & 3 deletions
@@ -14,56 +14,36 @@ Thank you for contributing to this repository.
 
 1. Fork the repository and create a branch for your change.
 2. Make focused updates (one topic per pull request when possible).
-3. Verify C++ code compiles with:
+3. Run the full repository check first:
 
-```bash
-g++ -std=c++17 -Wall -Wextra -pedantic <file>.cpp -o <output>
+```powershell
+./scripts/verify-repo.ps1
 ```
 
-You can also validate all C++ files with:
-
-- `./scripts/build-all.ps1` (PowerShell)
-- `bash ./scripts/build-all.sh` (Bash)
-
-Validate markdown links with:
-
-- `./scripts/check-links.ps1` (PowerShell)
-- `bash ./scripts/check-links.sh` (Bash)
-
-Validate README structure with:
-
-- `./scripts/check-readme-structure.ps1` (PowerShell)
-- `bash ./scripts/check-readme-structure.sh` (Bash)
-
-Validate module completeness with:
-
-- `./scripts/check-module-completeness.ps1` (PowerShell)
-- `bash ./scripts/check-module-completeness.sh` (Bash)
-
-Validate checkpoint completeness with:
-
-- `./scripts/check-checkpoint-completeness.ps1` (PowerShell)
-- `bash ./scripts/check-checkpoint-completeness.sh` (Bash)
-
-Validate cross-language parity with:
-
-- `./scripts/check-cross-language-parity.ps1` (PowerShell)
-- `bash ./scripts/check-cross-language-parity.sh` (Bash)
-
-Validate exercise parity with:
-
-- `./scripts/check-exercise-parity.ps1` (PowerShell)
-- `bash ./scripts/check-exercise-parity.sh` (Bash)
+```bash
+bash ./scripts/verify-repo.sh
+```
 
-Validate example output contracts with:
+For a faster loop while you are iterating, use a narrower command on the area you changed:
 
-- `./scripts/check-example-output-contracts.ps1` (PowerShell)
-- `bash ./scripts/check-example-output-contracts.sh` (Bash)
+- Links: `./scripts/check-links.ps1` or `bash ./scripts/check-links.sh`
+- Module README structure: `./scripts/check-readme-structure.ps1` or `bash ./scripts/check-readme-structure.sh`
+- Module completeness: `./scripts/check-module-completeness.ps1` or `bash ./scripts/check-module-completeness.sh`
+- Checkpoint completeness: `./scripts/check-checkpoint-completeness.ps1` or `bash ./scripts/check-checkpoint-completeness.sh`
+- Cross-language parity: `./scripts/check-cross-language-parity.ps1` or `bash ./scripts/check-cross-language-parity.sh`
+- Exercise parity: `./scripts/check-exercise-parity.ps1` or `bash ./scripts/check-exercise-parity.sh`
+- Example output contracts: `./scripts/check-example-output-contracts.ps1` or `bash ./scripts/check-example-output-contracts.sh`
+- Exercise output contracts: `./scripts/check-exercise-output-contracts.ps1` or `bash ./scripts/check-exercise-output-contracts.sh`
+- Education audit: `./scripts/audit-education-quality.ps1` or `bash ./scripts/audit-education-quality.sh`
+- Lint: `./scripts/lint.ps1` or `bash ./scripts/lint.sh`
+- Smoke checks: `./scripts/smoke-languages.ps1` or `bash ./scripts/smoke-languages.sh`
+- Compiled-language builds: `./scripts/build-all.ps1` or `bash ./scripts/build-all.sh`
 
-Validate exercise output contracts with:
+If you are changing one C++ file and want a local spot check before the full pass, you can still compile it directly with:
 
-- `./scripts/check-exercise-output-contracts.ps1` (PowerShell)
-- `bash ./scripts/check-exercise-output-contracts.sh` (Bash)
+```bash
+g++ -std=c++17 -Wall -Wextra -pedantic <file>.cpp -o <output>
+```
 
 For large changes, you can run one language at a time before the full check:
 
@@ -73,26 +53,6 @@ For large changes, you can run one language at a time before the full check:
 - `python scripts/automation.py check-exercise-output-contracts --language cpp`
 - `python scripts/automation.py check-exercise-output-contracts --language csharp`
 
-Run full repository checks with:
-
-- `./scripts/verify-repo.ps1` (PowerShell)
-- `bash ./scripts/verify-repo.sh` (Bash)
-
-Run multi-language smoke checks with:
-
-- `./scripts/smoke-languages.ps1` (PowerShell)
-- `bash ./scripts/smoke-languages.sh` (Bash)
-
-Run language lint checks with:
-
-- `./scripts/lint.ps1` (PowerShell)
-- `bash ./scripts/lint.sh` (Bash)
-
-Run the non-blocking education quality audit with:
-
-- `./scripts/audit-education-quality.ps1` (PowerShell)
-- `bash ./scripts/audit-education-quality.sh` (Bash)
-
 These smoke checks also compile standalone C# exercises by generating temporary validation projects during the check.
 TypeScript checks restore Node dependencies from `package-lock.json`, compile with `tsc`, and execute the emitted JavaScript with `node`.
 
 
@@ -0,0 +1,224 @@
+# Full Repo Improvement Review
+
+Generated: 2026-04-10
+
+## Summary
+
+This repository is already strong on structural consistency, parity coverage, and executable validation. A full local `python scripts/automation.py verify-repo` run passed during this review, including markdown links, README structure, module and checkpoint completeness, documentation sync, example comments, cross-language parity, output contracts, and compiled-language builds.
+
+The main opportunity is no longer "make the repo consistent enough to work." That baseline is in place. The next improvement tier is to reduce drift between docs and reality, raise the quality floor for learner-facing materials that currently pass lightweight checks, and reduce the maintenance burden created by duplicated repo rules across docs, workflow, and manifest-driven automation.
+
+## Baseline Evidence
+
+- `python scripts/automation.py verify-repo` passed locally on 2026-04-10.
+- `python scripts/automation.py audit-education-quality` reported:
+  - 120 module example entry files reviewed
+  - 30 files below the comment-ratio threshold
+  - 12 files oversized for their level norms
+  - 0 files missing observable output markers
+  - 0 files with boilerplate comment hits
+- The current automation surface is centered on `scripts/automation.py`, `scripts/automation_core/ops.py`, and `scripts/automation_manifest.json`.
+
+## Main Strengths
+
+- Cross-language parity is unusually complete for an educational repo: all active tracks reach module and checkpoint parity through `04-expert`.
+- The automation core is centralized enough to support repo-wide enforcement instead of ad hoc shell scripts.
+- Output contracts for examples and exercises materially reduce silent behavioral drift.
+- Module README structure is consistent enough that learners can move between tracks with low navigation friction.
+- The anti-pattern backlog is already pointing at high-value content expansion areas instead of random feature work.
+
+## Highest-Value Improvements
+
+1. Standardize learner guidance above the module level, especially for non-C++ tracks, so every level explains outcomes and completion criteria as clearly as C++.
+2. Remove documentation drift by tightening track README validation and reducing duplicated status text across the repo.
+3. Split dense expert examples into one primary teaching path plus secondary companion examples when a single file currently teaches multiple ideas at once.
+4. Rework file I/O entry examples so they better transfer into checkpoint tasks instead of hiding the workflow inside temp-file setup and cleanup.
+5. Make the template system actually parity-ready by covering TypeScript and the current example-commenting standard.
+6. Treat `scripts/automation_manifest.json` as the canonical metadata source and generate or shrink duplicated doc fragments around it.
+7. Raise automation coverage for learner-quality signals that currently rely on manual review.
+
+## Findings Register
+
+### F1. Track README scope text has already drifted from repo reality
+
+- Category: parity and consistency
+- Severity: medium
+- Learner/maintainer impact: learners get an outdated picture of checkpoint coverage, and maintainers already have doc drift in top-level discovery pages.
+- Evidence:
+  - `languages/csharp/README.md`
+  - `languages/go/README.md`
+  - `languages/python/README.md`
+  - Each still says the track includes "the first checkpoint layer" while also listing `4/4` projects and `4/4` assessments.
+  - `scripts/automation_core/ops.py` only validates selected marker lines and root status rows, so this mismatch passes current checks.
+- Affected scope: three track landing pages and the current doc-sync contract.
+- Recommended fix: define one canonical track-status block shape, validate the scope sentence itself, and eliminate freeform status prose where possible.
+- Estimated effort: small
+- Dependency/risk: low risk; mostly doc and validation work.
+
+### F2. Level-level learner guidance is much stronger in C++ than in the parity tracks
+
+- Category: learner-facing content
+- Severity: high
+- Learner/maintainer impact: parity exists structurally, but the learner experience is uneven because non-C++ levels give less guidance about outcomes, completion, and pacing.
+- Evidence:
+  - `languages/cpp/01-foundations/README.md`
+  - `languages/cpp/02-core/README.md`
+  - C++ level READMEs include `## Level Outcomes` and `## Done When`.
+  - `languages/go/02-core/README.md`
+  - `languages/csharp/03-advanced/README.md`
+  - `languages/python/04-expert/README.md`
+  - `languages/typescript/01-foundations/README.md`
+  - These parity tracks mostly stop at module order plus `## Study Tip`.
+- Affected scope: all non-C++ level READMEs.
+- Recommended fix: adopt a shared level README contract with outcomes, completion checklist, and next-step guidance across every track.
+- Estimated effort: medium
+- Dependency/risk: medium; this is content-heavy and should avoid making every track sound mechanically identical.
+
+### F3. Repo rules are duplicated across manifest, docs, and workflow, which increases maintenance cost
+
+- Category: structural/tooling
+- Severity: high
+- Learner/maintainer impact: updating validation policy or track status requires touching multiple files, which invites drift and slows maintenance.
+- Evidence:
+  - `README.md`
+  - `CONTRIBUTING.md`
+  - `.github/workflows/cpp-build.yml`
+  - `scripts/automation_manifest.json`
+  - `scripts/automation_core/ops.py`
+  - The repo duplicates validation commands, status summaries, and track metadata across docs, workflow, and automation config.
+- Affected scope: root docs, CI workflow, and automation metadata.
+- Recommended fix: move status and validation metadata behind one canonical source, then generate or drastically shorten duplicated documentation blocks.
+- Estimated effort: medium
+- Dependency/risk: medium; generated docs add complexity, but the current duplication is already creating drift.
+
+### F4. Current automation is intentionally permissive in the areas where quality drift is now happening
+
+- Category: structural/tooling
+- Severity: high
+- Learner/maintainer impact: important content regressions can slip through while the repo still reports a clean verification run.
+- Evidence:
+  - `scripts/automation_core/ops.py`
+  - `check_readme_structure` validates only module README heading order and metadata.
+  - `check_doc_sync` validates selected markers, not full semantic accuracy of track docs.
+  - `check_example_comments` only checks whether an example file has at least one comment.
+  - `audit_education_quality` is advisory and does not fail CI even when it flags low-comment or oversized examples.
+- Affected scope: track READMEs, level READMEs, checkpoint docs, and pedagogical quality enforcement.
+- Recommended fix: add stronger non-blocking and blocking checks for track README accuracy, level README completeness, and example complexity thresholds with an allowlist or waiver model.
+- Estimated effort: medium
+- Dependency/risk: medium; quality checks need calibration to avoid noisy CI.
+
+### F5. Several entry examples teach too many things at once for a "main example" file
+
+- Category: learner-facing content
+- Severity: high
+- Learner/maintainer impact: learners get a runnable file, but the concept boundary is blurrier than the module title suggests, especially in advanced and expert tracks.
+- Evidence:
+  - `build/reports/education-quality-report.md`
+  - The audit flags 30 low-comment examples and 12 oversized examples.
+  - `languages/csharp/04-expert/concurrency-basics/example/main.cs`
+  - One entry file mixes a locked shared-counter demonstration with a producer-consumer queue walkthrough.
+  - `languages/csharp/04-expert/performance-and-profiling-basics/example/main.cs`
+  - One entry file compares both string construction and list-capacity measurement.
+  - `languages/cpp/03-advanced/structs-and-classes/example/main.cpp`
+  - One entry file introduces both a plain data struct and a stateful bank-account class.
+- Affected scope: the examples already highlighted by the education audit, especially C# expert and selected C++ and TypeScript modules.
+- Recommended fix: keep one primary `main.*` scenario per module, move extra demonstrations into companion example files, and increase narration before each conceptual jump.
+- Estimated effort: medium to large
+- Dependency/risk: low risk technically, but parity updates must stay coordinated across tracks.
+
+### F6. File I/O entry examples are technically correct but weakly aligned with the checkpoint workflow they are supposed to prepare learners for
+
+- Category: learner-facing content
+- Severity: medium
+- Learner/maintainer impact: learners may understand the API calls but get less practice with the user-facing file workflow they need in projects and assessments.
+- Evidence:
+  - `languages/go/02-core/file-io-basics/example/main.go`
+  - `languages/typescript/02-core/file-io-basics/example/main.ts`
+  - Both primary examples create temporary directories/files, do the work, then clean them up.
+  - `languages/python/projects/02-core/README.md`
+  - `languages/csharp/projects/02-core/README.md`
+  - The capstone tasks instead center on user-supplied file paths and persistent report outputs.
+- Affected scope: at least the Go and TypeScript file I/O entry modules, and likely sibling implementations in the same concept family.
+- Recommended fix: make the main example follow the same mental model as the checkpoint task, then keep temp-file setup as a secondary deterministic example if needed.
+- Estimated effort: medium
+- Dependency/risk: low risk; output contracts may need updates.
+
+### F7. The repo's onboarding path is thorough but heavier than it needs to be for contributors
+
+- Category: structural/tooling
+- Severity: medium
+- Learner/maintainer impact: new contributors have to parse a long list of scripts in both `README.md` and `CONTRIBUTING.md`, even though `verify-repo` is the real default path.
+- Evidence:
+  - `README.md`
+  - `CONTRIBUTING.md`
+  - Both enumerate many individual validation scripts in full.
+  - `.github/workflows/cpp-build.yml`
+  - CI itself effectively runs the consolidated flows rather than presenting the learner with the entire matrix of commands.
+- Affected scope: contributor docs and first-time maintainer workflow.
+- Recommended fix: keep `verify-repo` as the default contributor command, then document the individual scripts as scoped troubleshooting tools instead of the primary path.
+- Estimated effort: small
+- Dependency/risk: low risk.
+
+### F8. The concept template is not yet aligned with the repo's current parity-first standards
+
+- Category: parity and consistency
+- Severity: medium
+- Learner/maintainer impact: new content can start from an outdated scaffold, which increases cleanup work and invites inconsistency.
+- Evidence:
+  - `templates/concept-template/README.md`
+  - The quick-run section includes C++, Python, Go, and C# examples but omits TypeScript.
+  - `templates/concept-template/example/main.cpp`
+  - The stub does not model the required header-comment pattern used by current parity and education-quality checks.
+- Affected scope: all future module creation work.
+- Recommended fix: upgrade the template to reflect the current five-track repo, current comment conventions, and the intended companion-example pattern where relevant.
+- Estimated effort: small
+- Dependency/risk: low risk.
+
+## Prioritized Roadmap
+
+### Near-Term
+
+- Fix the stale track-scope statements in `languages/csharp/README.md`, `languages/go/README.md`, and `languages/python/README.md`.
+- Simplify contributor onboarding in `README.md` and `CONTRIBUTING.md` so `verify-repo` is the default path and the long script lists become secondary references.
+- Upgrade `templates/concept-template/README.md` and `templates/concept-template/example/main.cpp` to match the current five-track, comment-first repo contract.
+- Add automation coverage for track README scope/status accuracy so this class of drift cannot recur silently.
+
+### Medium-Term
+
+- Define and roll out a shared level README contract across non-C++ tracks, including level outcomes and completion criteria.
+- Expand doc-sync checks to cover level-guide completeness and stronger checkpoint/track consistency rules.
+- Use `build/reports/education-quality-report.md` as the queue for targeted example cleanup, starting with the oversized and lowest-comment expert examples.
+- Rework `02-core/file-io-basics` entry examples so the primary walkthrough matches checkpoint-style file workflows.
+
+### Later
+
+- Decide whether repo status and validation metadata should be generated from `scripts/automation_manifest.json` or whether docs should be deliberately shortened to avoid duplication.
+- Add a second-layer pedagogy review surface that checks concept density, companion-example balance, and track adaptation quality rather than just structure and output.
+- Execute the anti-pattern backlog in batches once the level-guide and template improvements reduce future drift.
+
+## What Current Automation Does Not Catch Well
+
+- Semantic accuracy of freeform track overview text.
+- Whether level READMEs are equally helpful across languages.
+- Whether a module's main example teaches one idea cleanly or several ideas at once.
+- Whether checkpoint-prep examples match the mental model of the checkpoint tasks.
+- Whether companion examples are being used consistently to keep `main.*` focused.
+- Whether the contributor docs are optimized for first use rather than completeness.
+
+## Recommended Next PR Batches
+
+1. Documentation truthfulness and onboarding
+   - Fix stale track README scope text.
+   - Simplify top-level contributor instructions.
+   - Add doc-sync coverage for track overview accuracy.
+
+2. Level-guide parity
+   - Standardize non-C++ level READMEs around outcomes, done-when criteria, and next-step guidance.
+
+3. Template and automation contract refresh
+   - Update the concept template for TypeScript and header-comment expectations.
+   - Strengthen quality checks for level guides and example complexity.
+
+4. High-friction content cleanup
+   - Start with the examples already flagged by `build/reports/education-quality-report.md`.
+   - Prioritize C# expert examples and the `02-core/file-io-basics` family across tracks.
@@ -43,8 +43,8 @@ If you only want to learn one track, follow that track README first because lang
 
 1. Read [CONTRIBUTING.md](CONTRIBUTING.md).
 2. Install the repo prerequisites listed above.
-3. Run `./scripts/verify-repo.ps1` or `bash ./scripts/verify-repo.sh`.
-4. Run `./scripts/lint.ps1` or `bash ./scripts/lint.sh` before opening a pull request.
+3. Run `./scripts/verify-repo.ps1` or `bash ./scripts/verify-repo.sh` as the default pre-PR check.
+4. Use a narrower script only when you want a faster loop on one part of the repo.
 
 ## Language Status
 
@@ -148,7 +148,17 @@ Hand-written example code files under `languages/*/*/*/example/` should use inte
 
 ## Validation and CI
 
-Run checks locally:
+Use the full repository check by default:
+
+~~~powershell
+./scripts/verify-repo.ps1
+~~~
+
+~~~bash
+bash ./scripts/verify-repo.sh
+~~~
+
+Use narrower commands only when you want a faster loop on one area:
 
 ~~~powershell
 ./scripts/check-links.ps1