diff --git a/.agents/skills/workers-ai-docs/SKILL.md b/.agents/skills/workers-ai-docs/SKILL.md
Lines changed: 195 additions & 0 deletions
diff --git a/package.json b/package.json
Lines changed: 1 addition & 1 deletion
@@ -0,0 +1,195 @@
+# Workers AI Documentation
+
+Guidelines for maintaining Workers AI documentation, including model schemas, release notes, and changelog entries.
+
+## Key Workflows
+
+### 1. Model Schema Updates
+
+Run the API sync script:
+
+```bash
+node bin/fetch-ai-models.js
+```
+
+**Review changes carefully** - watch for RED FLAGS:
+
+- Pricing removed (major issue)
+- Schema structure changing, or schemas disappearing completely
+- Created_at dates going backwards
+- Internal/test naming (e.g., "Dumb Pipe")
+
+Stage only safe changes (feature additions, metadata updates). Exclude files with pricing or breaking schema changes.
+
+### 2. New Model Launch (End-to-End Checklist)
+
+When launching a new model on Workers AI, you need to touch **4 files**. Use this checklist:
+
+1. **Fetch model JSON** — Run `node bin/fetch-ai-models.js` to pull the model from the API into `src/content/workers-ai-models/`. The script fetches ALL models, so verify the specific new file was created.
+2. **Verify schema is populated** — The API can be flaky and return an empty `"schema": {}`. If the schema is empty, re-run the fetch script until it populates. Always inspect the JSON file before committing.
+3. **Create changelog entry** — `src/content/changelog/workers-ai/{YYYY-MM-DD}-{slug}.mdx`. The changelog URL is derived from the filename: `https://developers.cloudflare.com/changelog/post/{filename-without-extension}/`. If a URL has already been shared externally, the filename must match exactly.
+4. **Add release notes entry** — Add a new entry at the TOP of `src/content/release-notes/workers-ai.yaml`. Include a link to the model page and a cross-link to the changelog.
+5. **Add pricing row** — Add a row to the LLM pricing table in `src/content/docs/workers-ai/platform/pricing.mdx` (or the appropriate category table for non-LLM models).
+6. **Stage selectively** — The fetch script overwrites all model files. Only `git add` the new model file and your other changes. Exclude:
+   - Unrelated new models the API added
+   - Existing model files with minor upstream changes (do those in a separate PR)
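+
+Steps 2 and 3 above can be sanity-checked with a small script. The following is a minimal sketch, not part of the repository: the helper names `hasPopulatedSchema` and `changelogUrl` and the sample filename are hypothetical, and the check assumes the model JSON exposes a top-level `schema` object as described in step 2.
+
+```javascript
+// Hypothetical helpers for illustration only; they do not exist in the repo.
+
+// Step 2: returns true only when `schema` is a non-empty object.
+function hasPopulatedSchema(model) {
+  const schema = model ? model.schema : undefined;
+  return (
+    typeof schema === "object" &&
+    schema !== null &&
+    Object.keys(schema).length > 0
+  );
+}
+
+// Step 3: derives the public changelog URL from the .mdx filename.
+function changelogUrl(filename) {
+  const slug = filename.replace(/\.mdx$/, "");
+  return `https://developers.cloudflare.com/changelog/post/${slug}/`;
+}
+
+// An empty schema means the fetch script should be re-run.
+console.log(hasPopulatedSchema({ schema: {} }));
+// The sample filename below is made up.
+console.log(changelogUrl("2026-02-17-example-model.mdx"));
+```
+
+If a changelog URL has already been shared, comparing it with the output of `changelogUrl` is one way to confirm the filename matches before committing.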
+
+**New model release notes example:**
+
+```yaml
+- publish_date: "YYYY-MM-DD"
+  title: Model Name now available on Workers AI
+  description: |-
+    - [`@cf/vendor/model-name`](/workers-ai/models/model-name/) is now available on Workers AI! Description of the model. Read the [changelog](/changelog/post/YYYY-MM-DD-slug/) to get started.
+```
+
+### 3. Pricing Page
+
+**Location:** `src/content/docs/workers-ai/platform/pricing.mdx`
+
+The pricing page has separate tables for LLMs, embeddings, image, audio, and other models. Each row has three columns: Model, Price in Tokens, Price in Neurons.
+
+**Neuron conversion formula:** The billing rate is `$0.011 per 1,000 Neurons`, so `1 Neuron = $0.000011`. To convert:
+
+```
+neurons per M tokens = dollar_price_per_M_tokens / 0.000011
+```
+
+Round to the nearest whole number. For example: `$0.50 per M input tokens` → `45455 neurons per M input tokens`.
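+
+The conversion can also be scripted. This is a minimal sketch of the formula above, assuming the `$0.011 per 1,000 Neurons` rate quoted in this section; the function name `neuronsPerMillionTokens` is hypothetical.
+
+```javascript
+// Billing rate from this guide: $0.011 per 1,000 Neurons.
+const DOLLARS_PER_NEURON = 0.011 / 1000;
+
+// Hypothetical helper: converts a dollar price per million tokens
+// into Neurons per million tokens, rounded to the nearest whole number.
+function neuronsPerMillionTokens(dollarsPerMillionTokens) {
+  return Math.round(dollarsPerMillionTokens / DOLLARS_PER_NEURON);
+}
+
+console.log(neuronsPerMillionTokens(0.5)); // 45455
+```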
+
+**Row format:**
+
+```markdown
+| @cf/vendor/model-name | $X.XXX per M input tokens <br/> $X.XXX per M output tokens | XXXXX neurons per M input tokens <br/> XXXXX neurons per M output tokens |
+```
+
+### 4. Best Practice: Separate Concerns
+
+- **New model additions:** Stage only the relevant new model files
+- **API syncs:** Do in a separate PR from model additions
+- This makes reviews easier and reduces risk of mixing safe/unsafe changes
+
+### 5. Release Notes
+
+**Location:** `src/content/release-notes/workers-ai.yaml`
+
+**Format:**
+
+- Use `[Bug fix]` prefix for bug fixes
+- Use specific endpoint paths like `/v1/chat/completions` instead of generic terms
+- Use actual release date (not commit date)
+- Avoid internal implementation terms unless users understand them
+
+**Example:**
+
+```yaml
+- publish_date: "2026-02-17"
+  title: Chat Completions API support for gpt-oss models and tool calling improvements
+  description: |-
+    - [Bug fix] `/v1/chat/completions` now preserves original tool call IDs...
+```
+
+### 6. Changelog (Big Releases Only)
+
+**Location:** `src/content/changelog/workers-ai/{YYYY-MM-DD}-{slug}.mdx`
+
+Use for major model launches, significant features, or breaking changes. Not for routine bug fixes or minor updates (use release notes instead).
+
+### 7. Model Pages
+
+**Code components:** `src/components/models/code/`
+
+- Use descriptive endpoint names in documentation
+- Document all supported API formats for multi-format models (e.g., GPT-OSS supports Responses API, Workers AI Run, and Chat Completions)
+
+## Key Files
+
+| File/Directory                                     | Purpose                               |
+| -------------------------------------------------- | ------------------------------------- |
+| `src/content/workers-ai-models/*.json`             | Model schemas                         |
+| `src/content/release-notes/workers-ai.yaml`        | Release notes                         |
+| `src/content/changelog/workers-ai/*.mdx`           | Changelog entries (big releases only) |
+| `src/components/models/code/*.astro`               | Model page code examples              |
+| `src/pages/workers-ai/models/[name].astro`         | Model page template                   |
+| `src/content/docs/workers-ai/platform/pricing.mdx` | Pricing tables (per-model)            |
+| `bin/fetch-ai-models.js`                           | API sync script                       |
+
+## Common Patterns
+
+**API Endpoints:**
+
+- Responses API: `/ai/v1/responses`
+- Workers AI Run: `/ai/run` (dynamic format detection)
+- Chat Completions: `/v1/chat/completions`
+
+**Bug Fix Descriptions:**
+Explain what was broken and why it works now. Do not link to internal MRs.
+
+**Example:**
+
+> [Bug fix] `/v1/chat/completions` now preserves original tool call IDs from models instead of regenerating them. Previously, the endpoint was generating new IDs which broke multi-turn tool calling because AI SDK clients could not match tool results to their original calls.
+
+## Git Conventions
+
+### Branch Naming
+
+Use descriptive branch names that start with the product abbreviation:
+
+- **Format:** `wai-{description}` (wai = Workers AI)
+- **Model syncs:** `wai-models-sync`, `wai-update-models-2024-02`
+- **New models:** `wai-add-gpt-oss`, `wai-add-llama-4-scout`
+- **Bug fixes:** `wai-fix-tool-call-ids`, `wai-fix-streaming-finish-reason`
+- **Features:** `wai-add-chat-completions`, `wai-update-model-schemas`
+
+Keep names lowercase with dashes. Avoid generic names like `wai-fix-bugs` or `wai-updates`.
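+
+The naming rules above can be expressed as a quick check. This is an illustrative sketch only: `isValidBranchName` is a hypothetical helper, and the list of generic names contains just the two examples this guide calls out.
+
+```javascript
+// Hypothetical helper, not part of the repository.
+// Only the two generic names mentioned in this guide are rejected.
+const GENERIC_BRANCH_NAMES = new Set(["wai-fix-bugs", "wai-updates"]);
+
+function isValidBranchName(name) {
+  // `wai-` prefix, then lowercase alphanumeric words joined by single dashes.
+  const wellFormed = /^wai-[a-z0-9]+(-[a-z0-9]+)*$/.test(name);
+  return wellFormed && !GENERIC_BRANCH_NAMES.has(name);
+}
+
+console.log(isValidBranchName("wai-add-llama-4-scout")); // true
+console.log(isValidBranchName("WAI-Add-Model")); // false
+console.log(isValidBranchName("wai-updates")); // false
+```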
+
+### Commit Messages
+
+Use repository conventions:
+
+- **Format:** `[Product] description` or `type: description`
+- **Examples:**
+  - `[Workers AI] Update model schemas and add GPT-OSS Chat Completions`
+  - `[Workers AI] Fix tool call ID validation in streaming responses`
+  - `fix: correct content field schema to accept array types`
+
+Use imperative mood (add, fix, update, remove). Keep under 72 characters.
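+
+The format and length rules can be checked mechanically. This is a rough sketch, not an existing repository hook: `checkCommitSubject` is a hypothetical helper, "under 72 characters" is read as a length below 72, and imperative mood is not checked.
+
+```javascript
+// Hypothetical helper, not part of the repository.
+// Assumption: "under 72 characters" means subject.length < 72.
+const MAX_SUBJECT_LENGTH = 72;
+
+function checkCommitSubject(subject) {
+  const problems = [];
+  // Accept `[Product] description` or `type: description`.
+  const formatOk = /^(\[[^\]]+\] \S.*|[a-z]+: \S.*)$/.test(subject);
+  if (!formatOk) problems.push("format");
+  if (subject.length >= MAX_SUBJECT_LENGTH) problems.push("length");
+  return problems;
+}
+
+// Prints an empty array when the subject passes both checks.
+console.log(
+  checkCommitSubject("fix: correct content field schema to accept array types")
+);
+```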
+
+### Pull Requests
+
+**Title format:** `{Product}: {short description}`
+
+- Keep description under 50 characters
+- Use imperative mood
+- **Examples:**
+  - `Workers AI: add Chat Completions support for GPT-OSS models`
+  - `Workers AI: fix content field schema for multi-modal inputs`
+
+**PR Body structure:**
+
+Use headers to organize the PR body. Include:
+
+1. **Summary** - 1-2 sentences explaining WHY the PR exists
+2. **Changes** - Bullet points of what changed (group related changes)
+3. **Key Details** - Any important notes (e.g., excluded files, breaking changes)
+
+Keep bullet points concise (< 10 words). Use formatting for readability.
+
+**Example:**
+
+```markdown
+## Summary
+
+Syncs Workers AI models from API and adds Chat Completions support for GPT-OSS models.
+
+## Changes
+
+- Update 45 model files with content field schema fixes
+- Add Chat Completions API support for GPT-OSS-120B and GPT-OSS-20B
+- Fix tool call ID validation and streaming finish_reason issues
+- Update release notes with 2026-02-17 changelog entry
+
+## Key Details
+
+4 files excluded due to pricing/schema issues (flux-2-dev, flux-2-klein-4b, flux-2-klein-9b, smart-turn-v2)
+```
@@ -170,4 +170,4 @@
 			"onFail": "error"
 		}
 	}
-}
+}