TanStack
diff --git a/‎.changeset/gemini-veo-video-adapter.md‎
Lines changed: 0 additions & 42 deletions b/‎.changeset/gemini-veo-video-adapter.md‎
Lines changed: 0 additions & 42 deletions
diff --git a/‎.changeset/image-and-video-inputs.md‎
Lines changed: 0 additions & 30 deletions b/‎.changeset/image-and-video-inputs.md‎
Lines changed: 0 additions & 30 deletions
diff --git a/‎packages/ai-angular/CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions b/‎packages/ai-angular/CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎packages/ai-angular/package.json‎
Lines changed: 1 addition & 1 deletion b/‎packages/ai-angular/package.json‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎packages/ai-anthropic/CHANGELOG.md‎
Lines changed: 7 additions & 0 deletions b/‎packages/ai-anthropic/CHANGELOG.md‎
Lines changed: 7 additions & 0 deletions
diff --git a/‎packages/ai-anthropic/package.json‎
Lines changed: 1 addition & 1 deletion b/‎packages/ai-anthropic/package.json‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎packages/ai-client/CHANGELOG.md‎
Lines changed: 29 additions & 0 deletions b/‎packages/ai-client/CHANGELOG.md‎
Lines changed: 29 additions & 0 deletions
diff --git a/‎packages/ai-client/package.json‎
Lines changed: 1 addition & 1 deletion b/‎packages/ai-client/package.json‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎packages/ai-code-mode-skills/CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions b/‎packages/ai-code-mode-skills/CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎packages/ai-code-mode-skills/package.json‎
Lines changed: 1 addition & 1 deletion b/‎packages/ai-code-mode-skills/package.json‎
Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,13 @@
 # @tanstack/ai-angular
 
+## 0.1.4
+
+### Patch Changes
+
+- Updated dependencies [[`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f), [`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f)]:
+  - @tanstack/ai@0.32.0
+  - @tanstack/ai-client@0.18.0
+
 ## 0.1.3
 
 ### Patch Changes
 
@@ -1,6 +1,6 @@
 {
   "name": "@tanstack/ai-angular",
-  "version": "0.1.3",
+  "version": "0.1.4",
   "description": "Angular signals integration for TanStack AI streaming chat, structured outputs, and media generation.",
   "author": "",
   "license": "MIT",
 
@@ -1,5 +1,12 @@
 # @tanstack/ai-anthropic
 
+## 0.15.5
+
+### Patch Changes
+
+- Updated dependencies [[`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f), [`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f)]:
+  - @tanstack/ai@0.32.0
+
 ## 0.15.4
 
 ### Patch Changes
 
@@ -1,6 +1,6 @@
 {
   "name": "@tanstack/ai-anthropic",
-  "version": "0.15.4",
+  "version": "0.15.5",
   "description": "Anthropic Claude adapter for TanStack AI chat, tool calling, thinking, and structured outputs.",
   "author": "Tanner Linsley",
   "license": "MIT",
 
@@ -1,5 +1,34 @@
 # @tanstack/ai-client
 
+## 0.18.0
+
+### Minor Changes
+
+- [#624](https://github.com/TanStack/ai/pull/624) [`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f) - `generateImage()` and `generateVideo()` now accept a multimodal `prompt`: a plain string, or an ordered array of content parts (`TextPart` / `ImagePart` / `VideoPart` / `AudioPart`) for image-conditioned generation, image-to-image, multi-reference, image-to-video, and edit / inpaint flows. Part order is meaningful — "not like this _(image)_, more like this _(image)_" — and each media part may carry a `metadata.role` hint (`'reference' | 'mask' | 'control' | 'start_frame' | 'end_frame' | 'character'`) that adapters use to route to the provider-specific field, plus an informational `metadata.tag` label for your own bookkeeping. The accepted part types are narrowed per model at compile time via each adapter's input-modality map, so passing an image part to a text-only model is a type error (with a clear runtime throw as backstop).
+
+  Prompt text is always sent **verbatim** — the SDK never injects or rewrites in-prompt referencing markers. To reference inputs from your prompt, write the provider's own convention (fal Kling / Seedance `@Image1`, OpenAI / FLUX.2 `"image 1"` prose, Gemini content descriptions); see the image-generation docs for the per-provider table.
+
+  Provider behavior in this release:
+  - **OpenAI image** — Prompts with image parts route `gpt-image-2` / `gpt-image-1` / `gpt-image-1-mini` to `images.edit()` (up to 16 source images plus optional mask); `dall-e-2` routes to `images.edit()` with one source image; `dall-e-3` rejects image parts at compile time and at runtime.
+  - **OpenAI video** — Sora-2 / Sora-2-Pro accept a single image part as `input_reference`; passing more than one throws.
+  - **Gemini image** — Native models (`gemini-*-flash-image`, "nano-banana") map prompt parts 1:1 onto multimodal `contents`, preserving interleaved order. Imagen is text-only (compile-time + runtime rejection).
+  - **fal.ai** — Field names resolve per endpoint from a map generated from the fal SDK's endpoint types (362 endpoints with nonstandard fields, e.g. nano-banana edit → `image_urls`, Kling i2v start frame → `image_url`, Veo first-last-frame → `first_frame_url` / `last_frame_url`). Defaults for endpoints not in the map: single → `image_url`, multiple → `image_urls`; `role: 'mask'` → `mask_url`; `role: 'control'` → `control_image_url`; `role: 'reference'` / `'character'` → `reference_image_urls`; video `role: 'start_frame'` / `'end_frame'` → `start_image_url` / `end_image_url`. Per-model prompt modalities are derived at the type level from the SDK's endpoint input types. Regenerate the map after a fal SDK bump with `pnpm generate:fal-image-fields` (a unit test fails when it goes stale). In `FalImageProviderOptions` / `FalVideoProviderOptions`, media-conditioning fields the mappers can populate (`image_url`, `start_image_url`, `video_url`, `audio_url`, …) are demoted from required to optional — supply them as prompt parts, or keep passing them explicitly via `modelOptions`.
+  - **Grok** — New `grok-imagine-image` / `grok-imagine-image-quality` models. Prompts with image parts route to xAI's JSON `/v1/images/edits` endpoint (up to 3 source images, addressed by xAI in request order; the prompt is sent verbatim). `role: 'mask'` / `'control'` throw. Their `size` uses an `aspectRatio_resolution` template (`'16:9_2k'`, suffix optional) mirroring Gemini's native image models. `grok-2-image-1212` remains text-to-image only.
+  - **OpenRouter** — Prompt parts map 1:1 onto multimodal `text` / `image_url` chat content parts, preserving interleaved order, and are forwarded to the underlying image model. URL sources pass through verbatim (no fetching or re-encoding in your process); `data` sources become data URIs.
+  - **Anthropic** — Unchanged (no image generation API).
+
+  A new `resolveMediaPrompt()` utility (exported from `@tanstack/ai`) is the single downrev point from the canonical interleaved prompt shape to flattened text + per-modality part buckets, for adapter authors.
+
+  On the client side, `ImageGenerateInput.prompt` and `VideoGenerateInput.prompt` (`@tanstack/ai-client`, and the `useGenerateImage` / `useGenerateVideo` hooks built on them) are widened from `string` to the same `MediaPrompt` shape, so prompt parts can be sent from the browser through your server route to `generateImage()` / `generateVideo()`.
+
+  Closes [#618](https://github.com/TanStack/ai/issues/618).
+
+### Patch Changes
+
+- Updated dependencies [[`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f), [`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f)]:
+  - @tanstack/ai@0.32.0
+  - @tanstack/ai-event-client@0.6.3
+
 ## 0.17.3
 
 ### Patch Changes
 
@@ -1,6 +1,6 @@
 {
   "name": "@tanstack/ai-client",
-  "version": "0.17.3",
+  "version": "0.18.0",
   "description": "Framework-agnostic headless client for TanStack AI chat, realtime sessions, streaming transports, and media generations.",
   "author": "Tanner Linsley",
   "license": "MIT",
 
@@ -1,5 +1,13 @@
 # @tanstack/ai-code-mode-skills
 
+## 0.2.9
+
+### Patch Changes
+
+- Updated dependencies [[`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f), [`8fa6cc5`](https://github.com/TanStack/ai/commit/8fa6cc56c5f36e22885c98a511dcceb2bfc0da1f)]:
+  - @tanstack/ai@0.32.0
+  - @tanstack/ai-code-mode@0.2.9
+
 ## 0.2.8
 
 ### Patch Changes
 
@@ -1,6 +1,6 @@
 {
   "name": "@tanstack/ai-code-mode-skills",
-  "version": "0.2.8",
+  "version": "0.2.9",
   "description": "Persistent runtime skill library for TanStack AI Code Mode agents and sandboxed tool orchestration.",
   "author": "Tanner Linsley",
   "license": "MIT",
Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "@tanstack/ai-angular",`
`3`		`- "version": "0.1.3",`
	`3`	`+ "version": "0.1.4",`
`4`	`4`	`"description": "Angular signals integration for TanStack AI streaming chat, structured outputs, and media generation.",`
`5`	`5`	`"author": "",`
`6`	`6`	`"license": "MIT",`
Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "@tanstack/ai-anthropic",`
`3`		`- "version": "0.15.4",`
	`3`	`+ "version": "0.15.5",`
`4`	`4`	`"description": "Anthropic Claude adapter for TanStack AI chat, tool calling, thinking, and structured outputs.",`
`5`	`5`	`"author": "Tanner Linsley",`
`6`	`6`	`"license": "MIT",`
Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "@tanstack/ai-client",`
`3`		`- "version": "0.17.3",`
	`3`	`+ "version": "0.18.0",`
`4`	`4`	`"description": "Framework-agnostic headless client for TanStack AI chat, realtime sessions, streaming transports, and media generations.",`
`5`	`5`	`"author": "Tanner Linsley",`
`6`	`6`	`"license": "MIT",`
Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "@tanstack/ai-code-mode-skills",`
`3`		`- "version": "0.2.8",`
	`3`	`+ "version": "0.2.9",`
`4`	`4`	`"description": "Persistent runtime skill library for TanStack AI Code Mode agents and sandboxed tool orchestration.",`
`5`	`5`	`"author": "Tanner Linsley",`
`6`	`6`	`"license": "MIT",`