- Add ch03-evaluation elaboration.

NinjaRocks · NinjaRocks · commit e84017fbb374 · 2026-05-01T20:01:42.000+01:00
diff --git a/docs/verification-log.md b/docs/verification-log.md
@@ -0,0 +1,93 @@
+# Verification Log
+
+Pre-publication verification record for *Generative AI in .NET*. One entry per pass. The most-recent fully-green entry is the print sign-off gate.
+
+**Cadence:**
+
+- **Pre-print:** weekly for the four weeks before print drop.
+- **Post-print:** monthly for six months, then quarterly (drives errata releases).
+
+**What each entry covers:** package versions (Critical-5 list), code samples (build + smoke runs), URLs (chapter links + appendices), Anthropic API surface (model IDs in Appendix B + Chapter 4.2.4).
+
+**Sign-off gate before print:**
+
+- [ ] Most recent week is fully green.
+- [ ] No package on the watch list has a known breaking change pending.
+- [ ] Companion repo CI green on the latest commit.
+- [ ] Companion repo tagged `v1.0-print-ready` matching the manuscript version.
+
+---
+
+## Template
+
+Copy this block to start a new entry. Date format `YYYY-MM-DD`.
+
+```markdown
+## YYYY-MM-DD verification pass
+
+### Packages (Critical-5 list)
+- [ ] Microsoft.Extensions.AI -- vX.Y.Z, no change / changelog reviewed
+- [ ] Microsoft.Extensions.AI.Abstractions -- ...
+- [ ] Microsoft.Extensions.AI.OpenAI -- ...
+- [ ] Microsoft.Extensions.AI.Ollama -- ...
+- [ ] Microsoft.Extensions.AI.AzureAIInference -- ...
+- [ ] Microsoft.Agents.AI -- ...
+- [ ] Microsoft.Agents.AI.OpenAI -- ...
+- [ ] Microsoft.Agents.AI.Workflows -- ...
+- [ ] Microsoft.Agents.AI.AzureAI -- ...
+- [ ] ModelContextProtocol -- ...
+- [ ] ModelContextProtocol.Core -- ...
+- [ ] ModelContextProtocol.AspNetCore -- ...
+- [ ] Microsoft.McpServer.ProjectTemplates -- ...
+- [ ] Microsoft.Azure.Functions.Worker.Extensions.Mcp -- ...
+- [ ] Azure.AI.OpenAI -- ...
+- [ ] OpenAI -- ...
+- [ ] OllamaSharp -- ...
+- [ ] Anthropic.SDK -- ...
+
+### Code samples
+- [ ] CI matrix green on commit `<sha>`
+- [ ] Live-API smoke tests green (or skipped, with reason)
+
+### URLs
+- [ ] Anthropic / Claude documentation links resolve
+- [ ] Microsoft Learn links resolve
+- [ ] Azure documentation links resolve
+- [ ] NuGet package pages resolve
+
+### Anthropic API surface
+- [ ] Every model ID in `Appendix-B-Model-Quick-Reference.md` is callable
+- [ ] Every model ID in `Chapter-04.md` section 4.2.4 is callable
+- [ ] `Anthropic.SDK` API surface used in the chapter examples matches the latest stable
+
+### Issues found / actions taken
+- (none) | <description + commit ref + manuscript section touched>
+```
+
+---
+
+## 2026-04-30 -- Initial sweep (kickoff)
+
+**Status:** Partial -- snapshot only; subsequent weeks will be full passes.
+
+### Packages
+- [x] Critical-5 list re-verified against live NuGet feed; companion repo pinned to `Microsoft.Agents.AI 1.3` and `ModelContextProtocol 1.2`.
+- [x] All 37 sample projects build clean on these versions (companion-repo commits `0047e61` + `35e6fd3`).
+- [x] CI matrix green (run 25136332826).
+
+### Code samples
+- [x] All 37 samples build clean.
+- [ ] Live-API smoke tests -- not yet wired up (P2-1 in next-steps-plan).
+
+### URLs
+- [ ] Not run yet -- queue for the first scheduled weekly pass.
+
+### Anthropic API surface
+- [ ] Not run yet -- queue for the first scheduled weekly pass (P0-4 in next-steps-plan).
+
+### Issues found / actions taken
+- 15 placeholder samples ported to 1.x stable APIs (book-repo commit `09bb7d9` cleared the API-update-pending punch list).
+
+---
+
+*New entries go below this line, most recent first.*
diff --git a/samples/ch03-rag/03.2.7-evaluation/Bootstrap.cs b/samples/ch03-rag/03.2.7-evaluation/Bootstrap.cs
@@ -0,0 +1,36 @@
+using System.ComponentModel;
+using Microsoft.Extensions.AI;
+
+namespace RagEvaluation;
+
+internal sealed record QaPair(
+    [property: Description("A question whose answer is contained in the source passage.")] string Question,
+    [property: Description("The minimal correct answer drawn directly from the passage.")] string GroundTruth,
+    [property: Description("Echo back the source chunk id so the pair stays traceable.")] string SourceChunkId);
+
+internal static class Bootstrap
+{
+    public static async Task<IReadOnlyList<QaPair>> GenerateAsync(
+        IChatClient generator,
+        IEnumerable<(string Id, string Text)> chunks,
+        CancellationToken ct = default)
+    {
+        var pairs = new List<QaPair>();
+        foreach (var (id, text) in chunks)
+        {
+            var resp = await generator.GetResponseAsync<QaPair>(
+                $$"""
+                Read the passage and propose ONE question whose answer is contained
+                in the passage. Keep the question specific and the answer concise.
+                Set sourceChunkId to "{{id}}".
+
+                Passage:
+                {{text}}
+                """, cancellationToken: ct);
+
+            if (resp.TryGetResult(out var pair))
+                pairs.Add(pair);
+        }
+        return pairs;
+    }
+}
diff --git a/samples/ch03-rag/03.2.7-evaluation/README.md b/samples/ch03-rag/03.2.7-evaluation/README.md
@@ -6,6 +6,8 @@ A minimal LLM-as-judge harness that scores candidate answers on **faithfulness**
 
 The bundled `golden-dataset.json` is a 3-row demo. Replace it with your own questions/ground-truths and plug in your real RAG pipeline's answer in place of the `candidateAnswer` placeholder.
 
+`Bootstrap.cs` ships a small Q&A-pair generator that reads chunks of your indexed corpus and proposes one `QaPair` per chunk via the same `IChatClient` plumbing. Use it to seed a golden dataset from a corpus with no existing user logs, then human-review the result before checking it in.
+
 ## Run it
 
 ```bash