SemiAnalysisAI
diff --git a/.github/workflows/claude.yml b/.github/workflows/claude.yml
Lines changed: 2 additions & 2 deletions
diff --git a/.oxlintrc.json b/.oxlintrc.json
Lines changed: 1 addition & 0 deletions
diff --git a/AGENTS.md b/AGENTS.md
Lines changed: 2 additions & 1 deletion
diff --git a/docs/adding-entities.md b/docs/adding-entities.md
Lines changed: 99 additions & 0 deletions
diff --git a/docs/index.md b/docs/index.md
Lines changed: 1 addition & 1 deletion
diff --git a/docs/state-ownership.md b/docs/state-ownership.md
Lines changed: 33 additions & 32 deletions
diff --git a/package.json b/package.json
Lines changed: 3 additions & 3 deletions
@@ -278,7 +278,7 @@ jobs:
       - name: Run Claude Code
         id: claude
         if: ${{ always() }}
-        uses: anthropics/claude-code-action@fbda2eb1bdc90d319b8d853f5deb53bca199a7c1 # v1.0.140
+        uses: anthropics/claude-code-action@d5726de019ec4498aa667642bc3a80fca83aa102 # v1.0.148
         env:
           GH_TOKEN: ${{ secrets.PAT }}
           GITHUB_TOKEN: ${{ secrets.PAT }}
@@ -331,7 +331,7 @@ jobs:
           fetch-depth: 0
 
       - name: PR Review with Claude
-        uses: anthropics/claude-code-action@fbda2eb1bdc90d319b8d853f5deb53bca199a7c1 # v1.0.140
+        uses: anthropics/claude-code-action@d5726de019ec4498aa667642bc3a80fca83aa102 # v1.0.148
         with:
           anthropic_api_key: ${{ secrets.ANTHROPIC_API_KEY }}
           trigger_phrase: '@claude review'
 
@@ -40,6 +40,7 @@
     "unicorn/no-null": "off",
     "unicorn/no-useless-undefined": "off",
     "unicorn/numeric-separators-style": "off",
+    "unicorn/prefer-export-from": "off",
     "unicorn/prefer-global-this": "off",
     "unicorn/prefer-top-level-await": "off"
   }
 
@@ -148,6 +148,7 @@ Authoritative total / active parameter counts for every model in the dashboard.
 | DeepSeek-V4-Pro        | 1.6T  | 49B         | `deepseek-ai/DeepSeek-V4-Pro`       | HF model card                      |
 | Kimi-K2.5              | 1T    | 32B         | `moonshotai/Kimi-K2.5`              | HF model card                      |
 | Kimi-K2.6              | 1T    | 32B         | `moonshotai/Kimi-K2.6`              | HF model card                      |
+| Kimi-K2.7-Code         | 1T    | 32B         | `moonshotai/Kimi-K2.7-Code`         | HF model card                      |
 | Qwen3.5-397B-A17B      | 397B  | 17B         | `Qwen/Qwen3.5-397B-A17B`            | HF model card                      |
 | GLM-5                  | 744B  | 40B         | `zai-org/GLM-5`                     | HF model card                      |
 | GLM-5.1                | 744B  | 40B         | `zai-org/GLM-5.1-FP8`               | HF model card (same base as GLM-5) |
@@ -161,7 +162,7 @@ Authoritative total / active parameter counts for every model in the dashboard.
 - **GLM-5 ≠ 355B.** 355B is GLM-4.5. GLM-5 jumped to 744B / 40B active (256-expert MoE with DSA).
 - **MiniMax-M2.5/M2.7 ≠ 456B.** 456B is the older MiniMax-Text-01 / M1 (32 large experts). The M2 series is a different architecture: 230B / 10B active, 256 small experts.
 - **DeepSeek-R1 is 671B, not 685B.** HF metadata shows 685B because the bundled MTP head adds ~14B; the core MoE is 671B / 37B active.
-- **Kimi K2.5 and K2.6 are post-training refinements**, not new pre-trained sizes. Same 1T / 32B / 384-expert backbone as the original K2.
+- **Kimi K2.5, K2.6, and K2.7-Code are post-training refinements**, not new pre-trained sizes. Same 1T / 32B / 384-expert backbone as the original K2. K2.7-Code is a coding-focused refinement of the same backbone.
 
 ## Common Development Tasks
 
 
@@ -71,6 +71,105 @@ Present what you inferred and get confirmation + category in a single step. Incl
 
 Everything else (`MODEL_OPTIONS`, `DEFAULT_MODELS`, `EXPERIMENTAL_MODELS`, `DEPRECATED_MODELS`, `MODEL_PREFIX_MAPPING`, `getModelLabel()`) is derived automatically.
 
+**`packages/app/src/lib/compare-slug.ts`** (easy to miss — the /compare and /compare-per-dollar pages do NOT derive from `MODEL_CONFIG`):
+
+- `COMPARE_MODEL_SLUGS` — add an entry with `{ slug, displayName, dbKeys, label }`. `displayName` must match the `Model` enum value; `dbKeys` lists the DB buckets to query. Place it per the ordering comment (Chinese-lab flagships first, newer family member leads). Without this entry the model is absent from /compare, /compare-per-dollar, the sitemap, and their OG images.
+- `COMPARE_MODEL_ALIASES` — only if a family-level or older-version slug should 308 to the new entry.
+
+**`packages/app/src/lib/compare-ssr.ts`**:
+
+- `KNOWN_MODELS` — add the display name so `?g_model=` URL overrides validate on compare pages.
+
+**`packages/app/src/app/compare/page.tsx`** and **`packages/app/src/app/compare-per-dollar/page.tsx`**:
+
+- `DESCRIPTION` — these SEO meta strings hardcode a sample model list ("…, Qwen 3.5 397B-A17B, and more"). Add the new model if it should appear in the catalog blurb.
+
+**`packages/app/src/lib/model-architectures.ts`** (optional — powers the per-model architecture diagram on the inference tab):
+
+- `MODEL_ARCHITECTURES` — add a `[Model.X]` entry with verified config.json values. Omitted models simply render no diagram (`getModelArchitecture` returns `undefined`), so this is non-blocking but expected for parity with other models.
+
+`/about` needs no change — its model list derives from `DB_MODEL_TO_DISPLAY` and includes the new key automatically once `models.ts` is updated.
+
+---
+
+## Featuring a Day-0 Model
+
+When a new model launches and we want to give it the headline treatment, swap the **promotion surfaces** to it. This is separate from [Adding a New Model](#adding-a-new-model) above — the model must **already exist** (`Model.*` enum, `MODEL_CONFIG`, DB mapping) before it can be featured. The promotion surfaces are:
+
+- **Launch banner** — the dismissible bar at the top of the landing page
+- **Launch modal** — the "X is live" popup on the landing page
+- **Quick Comparisons preset** — the "X — First Look" card (first entry in `FAVORITE_PRESETS`)
+- **Default model** (optional) — the model the dashboard opens on (`g_model`)
+
+### The "retire old, new IDs" pattern
+
+Each launch **replaces** the previous day-0 model's surfaces rather than editing them in place. This is deliberate:
+
+- **New storage keys** (`inferencex-<slug>-{banner,modal}-dismissed`) so users who dismissed the _previous_ launch banner/modal still see the new one.
+- **Keep the old preset, hide it** (`hidden: true`) instead of deleting it — existing `?preset=<old-slug>-launch` links (old banners, modals, external shares, blog `DashboardCTA`s) must keep resolving.
+- **Generic testIds** (`launch-banner`, `launch-modal`) — launch-agnostic so Cypress selectors don't change every launch.
+
+> The current day-0 model is **whatever the single visible (`hidden` unset) `*-launch` preset points to** — detect it, don't assume. As of MiniMax M3 it was DeepSeek V4 Pro.
+
+### Derive the identifiers
+
+From the model name, derive (MiniMax M3 shown as the worked example):
+
+| Token     | Example            | Used in                                        |
+| --------- | ------------------ | ---------------------------------------------- |
+| `SLUG`    | `minimax-m3`       | preset id, nudge ids, storage keys, `?preset=` |
+| `SLUG_`   | `minimax_m3`       | analytics event names                          |
+| `ENUM`    | `Model.MiniMax_M3` | preset `config.model`                          |
+| `DISPLAY` | `MiniMax M3`       | all user-facing copy                           |
+| `G_MODEL` | `MiniMax-M3`       | `g_model` default (the `Model.*` string value) |
+
+### Then apply
+
+**`packages/app/src/components/favorites/favorite-presets.ts`**:
+
+1. On the outgoing visible `*-launch` preset, add `hidden: true` and update its comment (retired, kept for link compat — same pattern as the existing `dsv4-launch-nvidia` entry).
+2. Prepend a new visible preset as the **first** element of `FAVORITE_PRESETS`:
+   ```ts
+   {
+     id: 'SLUG-launch',
+     title: 'DISPLAY — First Look',
+     description:
+       'First benchmarks of DISPLAY across every available GPU. New configurations appear here as they come online.',
+     tags: ['<Vendor>', '<Version>', 'New'], // e.g. ['MiniMax', 'M3', 'New']
+     category: 'comparison',
+     wide: true,
+     config: {
+       model: ENUM,
+       sequence: Sequence.EightK_OneK,
+       precisions: ['fp4', 'fp4fp8', 'fp8'],
+       yAxisMetric: 'y_tpPerGpu',
+       hwFilter: ['h100', 'h200', 'b200', 'b300', 'gb200', 'gb300', 'mi300x', 'mi325x', 'mi355x'],
+     },
+   }
+   ```
+   Narrow `hwFilter` only for a restricted launch (e.g. NVIDIA-only). The broad filter + "as they come online" copy is the intended self-filling behavior even when data is still partial at launch.
+
+**`packages/app/src/lib/nudges/registry.tsx`** — rewrite the two launch nudges (only one banner + one modal exist at a time):
+
+- **Modal** (under "Landing modals"): `id: 'SLUG-launch-modal'`, `storageKey: 'inferencex-SLUG-modal-dismissed'`, `title: 'DISPLAY is live'`, day-zero `description`, `testId: 'launch-modal'`, `primaryAction.onClick` → `/inference?preset=SLUG-launch`, analytics `SLUG_modal_shown`/`_dismissed`/`_explored`.
+- **Banner** (under "Landing banner"): `id: 'SLUG-launch-banner'`, `storageKey: 'inferencex-SLUG-banner-dismissed'`, `title: 'DISPLAY benchmarks are live'`, `testId: 'launch-banner'`, `href`/`onLinkClick` → `/inference?preset=SLUG-launch`, keep the generic `launch_banner_*` analytics events but set `properties: { banner_id: 'SLUG-launch', preset_id: 'SLUG-launch' }`.
+
+**`packages/app/src/lib/url-state.ts`** _(only if making it the site default)_:
+
+- Set `PARAM_DEFAULTS.g_model` to `'G_MODEL'`. Most launches **leave this unchanged** — only change it for a true flagship (DeepSeek V4 Pro got it; MiniMax M3 did not).
+
+### Sync tests
+
+- **`packages/app/src/lib/nudges/registry.test.ts`** — update the **sorted** expected-ids array ("contains the expected set of migrated nudges") to the new `SLUG-launch-banner`/`SLUG-launch-modal` ids.
+- **`packages/app/cypress/e2e/nudge-system.cy.ts`** and **`navigation.cy.ts`** — replace the old `inferencex-<old-slug>-{modal,banner}-dismissed` storage keys with the new ones. TestId selectors stay generic (`launch-modal`, `launch-banner`); update any `it(...)` titles that name the old model.
+- **`packages/app/src/lib/url-state.test.ts`** _(only if the default changed)_ — two specs hardcode the default `g_model`; update both.
+
+> **Don't touch:** blog MDX `?g_model=…` / `?preset=<old-slug>-launch` links (historical, correct), `packages/constants/src/models.ts` DB-key maps, or the outgoing model's data-mapping / architecture entries — it still exists, it's just no longer the headline.
+
+### Verify
+
+`pnpm typecheck && pnpm lint && pnpm fmt && pnpm test:unit`, then `rg` for the old slug to confirm only the intentional hidden preset + blog links remain. Final gate: `pnpm test:e2e` and a manual `pnpm dev` check that the banner/modal/preset read `DISPLAY` and `/inference?preset=SLUG-launch` renders data.
+
 ---
 
 ## Adding a New GPU
 
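The "Derive the identifiers" table and the "retire old, new IDs" pattern added to `docs/adding-entities.md` above can be sketched as a small helper. This is only an illustration under stated assumptions: `deriveLaunchIds`, `findCurrentLaunchPreset`, `LaunchIds`, and `PresetLike` are hypothetical names that do not appear in the diff, and the slug rule (lowercase, hyphen-separated) is inferred from the single `MiniMax M3` → `minimax-m3` worked example. The id and storage-key formats are copied from the doc text.

```ts
// Hypothetical helper; none of these names exist in the repository.
interface LaunchIds {
  slug: string; // SLUG, e.g. 'minimax-m3'
  slugUnderscore: string; // SLUG_, e.g. 'minimax_m3'
  presetId: string; // 'SLUG-launch'
  modalId: string; // 'SLUG-launch-modal'
  bannerId: string; // 'SLUG-launch-banner'
  modalStorageKey: string; // 'inferencex-SLUG-modal-dismissed'
  bannerStorageKey: string; // 'inferencex-SLUG-banner-dismissed'
  presetHref: string; // '/inference?preset=SLUG-launch'
}

function deriveLaunchIds(display: string): LaunchIds {
  // Assumed slug rule: lowercase, collapse each run of non-alphanumerics
  // into one hyphen, then trim leading/trailing hyphens.
  const slug = display
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, '-')
    .replace(/^-+|-+$/g, '');
  const presetId = `${slug}-launch`;
  return {
    slug,
    slugUnderscore: slug.replace(/-/g, '_'),
    presetId,
    modalId: `${presetId}-modal`,
    bannerId: `${presetId}-banner`,
    modalStorageKey: `inferencex-${slug}-modal-dismissed`,
    bannerStorageKey: `inferencex-${slug}-banner-dismissed`,
    presetHref: `/inference?preset=${presetId}`,
  };
}

// Minimal stand-in for a preset entry; the real preset type has more fields.
interface PresetLike {
  id: string;
  hidden?: boolean;
}

// The current day-0 preset is the single visible preset whose id ends in
// '-launch'. Retired presets stay in the list with `hidden: true`.
function findCurrentLaunchPreset(presets: PresetLike[]): PresetLike {
  const visible = presets.filter((p) => p.id.endsWith('-launch') && !p.hidden);
  const [only] = visible;
  if (visible.length !== 1 || only === undefined) {
    throw new Error(`expected exactly one visible *-launch preset, found ${visible.length}`);
  }
  return only;
}
```

The slug is ultimately a naming decision rather than a mechanical transform (the retired `dsv4-launch-nvidia` entry uses an abbreviation, for instance), so a helper like this is a starting point to review, not a rule to enforce.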
@@ -10,7 +10,7 @@ Design rationale and non-obvious conventions. See [CLAUDE.md](../CLAUDE.md) for
 - [Pitfalls](./pitfalls.md) — Failure modes: token type consistency, schema evolution, empty objects, zoom loss, stale closures, disaggregated metrics, negative splines, date stamping, ref stability, cost inheritance
 - [GPU Specs](./gpu-specs.md) — Unit conventions, topology invariants, SVG layout rationale, hardware gotchas
 - [TCO Calculator](./tco-calculator.md) — Why interpolation, composite keys, cost matrix, token type bugs, badge logic, state design
-- [Adding Entities](./adding-entities.md) — Step-by-step checklists for adding new models, GPUs, precisions, sequences, frameworks (ingest + constants + frontend)
+- [Adding Entities](./adding-entities.md) — Step-by-step checklists for adding new models, GPUs, precisions, sequences, frameworks (ingest + constants + frontend), plus featuring a day-0 model (launch banner, modal, Quick Comparisons preset)
 - [Testing](./testing.md) — Requirements, quality standards, pre-commit checklist
 - [Data Transforms](./data-transforms.md) — Full pipeline from BenchmarkRow to RenderableGraph: type hierarchy, hardware key construction, derived metrics, memoization strategy
 - [State Ownership](./state-ownership.md) — Which context owns which state, availability filtering cascade, comparison date mechanics, URL param sync
 
@@ -73,7 +73,7 @@ Depends on: `GlobalFilterProvider` (reads all filter state and availability, inc
 
 - `selectedYAxisMetric` (`i_metric`), `selectedXAxisMetric` (`i_xmetric`), `selectedE2eXAxisMetric` (`i_e2e_xmetric`)
 - `scaleType` — `auto | linear | log` (`i_scale`)
-- `hideNonOptimal` (`i_optimal`), `hidePointLabels` (`i_nolabel`), `logScale` (`i_log`)
+- `hideNonOptimal` (`i_optimal`), `showPointLabels` (`i_label`), `logScale` (`i_log`)
 - `highContrast` (`i_hc`), `isLegendExpanded` (`i_legend`)
 - `useAdvancedLabels` (`i_advlabel`), `showGradientLabels` (`i_gradlabel`)
 - `colorShuffleSeed` — no URL param; ephemeral
@@ -260,34 +260,35 @@ Historical Trends and TCO Calculator share the inference tab's URL path (`/infer
 
 ### Full parameter list
 
-| Param           | Owner               | Default                          |
-| --------------- | ------------------- | -------------------------------- |
-| `g_model`       | GlobalFilterContext | `DeepSeek-R1-0528`               |
-| `g_rundate`     | GlobalFilterContext | `''`                             |
-| `g_runid`       | GlobalFilterContext | `''`                             |
-| `i_seq`         | GlobalFilterContext | `8k/1k`                          |
-| `i_prec`        | GlobalFilterContext | `fp4`                            |
-| `i_metric`      | InferenceProvider   | `y_tpPerGpu`                     |
-| `i_xmetric`     | InferenceProvider   | `p99_ttft`                       |
-| `i_e2e_xmetric` | InferenceProvider   | `''`                             |
-| `i_scale`       | InferenceProvider   | `auto`                           |
-| `i_gpus`        | InferenceProvider   | `''`                             |
-| `i_dates`       | InferenceProvider   | `''`                             |
-| `i_dstart`      | InferenceProvider   | `''`                             |
-| `i_dend`        | InferenceProvider   | `''`                             |
-| `i_optimal`     | InferenceProvider   | `''` (truthy = hide non-optimal) |
-| `i_nolabel`     | InferenceProvider   | `''`                             |
-| `i_hc`          | InferenceProvider   | `''`                             |
-| `i_log`         | InferenceProvider   | `''`                             |
-| `i_legend`      | InferenceProvider   | `''`                             |
-| `i_advlabel`    | InferenceProvider   | `''`                             |
-| `i_gradlabel`   | InferenceProvider   | `''`                             |
-| `e_rundate`     | EvaluationProvider  | `''`                             |
-| `e_bench`       | EvaluationProvider  | `''`                             |
-| `e_hc`          | EvaluationProvider  | `''`                             |
-| `e_labels`      | EvaluationProvider  | `''`                             |
-| `e_legend`      | EvaluationProvider  | `''`                             |
-| `r_range`       | ReliabilityProvider | `last-3-months`                  |
-| `r_pct`         | ReliabilityProvider | `''`                             |
-| `r_hc`          | ReliabilityProvider | `''`                             |
-| `r_legend`      | ReliabilityProvider | `''`                             |
+| Param           | Owner               | Default                           |
+| --------------- | ------------------- | --------------------------------- |
+| `g_model`       | GlobalFilterContext | `DeepSeek-R1-0528`                |
+| `g_rundate`     | GlobalFilterContext | `''`                              |
+| `g_runid`       | GlobalFilterContext | `''`                              |
+| `i_seq`         | GlobalFilterContext | `8k/1k`                           |
+| `i_prec`        | GlobalFilterContext | `fp4`                             |
+| `i_metric`      | InferenceProvider   | `y_tpPerGpu`                      |
+| `i_xmetric`     | InferenceProvider   | `p99_ttft`                        |
+| `i_e2e_xmetric` | InferenceProvider   | `''`                              |
+| `i_scale`       | InferenceProvider   | `auto`                            |
+| `i_gpus`        | InferenceProvider   | `''`                              |
+| `i_dates`       | InferenceProvider   | `''`                              |
+| `i_dstart`      | InferenceProvider   | `''`                              |
+| `i_dend`        | InferenceProvider   | `''`                              |
+| `i_optimal`     | InferenceProvider   | `''` (truthy = hide non-optimal)  |
+| `i_label`       | InferenceProvider   | `''` (truthy = show point labels) |
+| `i_nolabel`     | InferenceProvider   | `''` (legacy, read-only)          |
+| `i_hc`          | InferenceProvider   | `''`                              |
+| `i_log`         | InferenceProvider   | `''`                              |
+| `i_legend`      | InferenceProvider   | `''`                              |
+| `i_advlabel`    | InferenceProvider   | `''`                              |
+| `i_gradlabel`   | InferenceProvider   | `''`                              |
+| `e_rundate`     | EvaluationProvider  | `''`                              |
+| `e_bench`       | EvaluationProvider  | `''`                              |
+| `e_hc`          | EvaluationProvider  | `''`                              |
+| `e_labels`      | EvaluationProvider  | `''`                              |
+| `e_legend`      | EvaluationProvider  | `''`                              |
+| `r_range`       | ReliabilityProvider | `last-3-months`                   |
+| `r_pct`         | ReliabilityProvider | `''`                              |
+| `r_hc`          | ReliabilityProvider | `''`                              |
+| `r_legend`      | ReliabilityProvider | `''`                              |
@@ -44,14 +44,14 @@
     "audit-ci": "^7.1.0",
     "is-ci": "^4.1.0",
     "lefthook": "^2.1.9",
-    "oxfmt": "^0.54.0",
-    "oxlint": "^1.69.0",
+    "oxfmt": "^0.55.0",
+    "oxlint": "^1.70.0",
     "rimraf": "^6.1.3",
     "typescript": "^6.0.3"
   },
   "engines": {
     "node": ">=18.0.0",
     "pnpm": ">=10.0.0"
   },
-  "packageManager": "pnpm@11.5.2"
+  "packageManager": "pnpm@11.7.0"
 }