update changelog, release note and overview

aninibread · aninibread · commit cf956f4816fd · 2026-04-11T22:31:18.000-04:00
diff --git a/src/content/changelog/ai-search/2026-04-16-ai-search-namespace-binding.mdx b/src/content/changelog/ai-search/2026-04-16-ai-search-namespace-binding.mdx
@@ -1,53 +1,69 @@
 ---
-title: AI Search namespace binding
-description: Access AI Search instances with the new ai_search_namespaces binding for namespace isolation, dynamic instance management, and file uploads.
+title: New AI Search bindings, built-in storage, and cross-instance search
+description: Access AI Search from Workers with two new bindings, upload files directly to built-in storage, and search across multiple instances in a single call.
 products:
   - ai-search
 date: 2026-04-16
 ---
 
-The new `ai_search_namespaces` [Workers binding](/ai-search/api/search/workers-binding/) provides first-class access to [AI Search](/ai-search/) from your Worker. The binding follows the same pattern as `kv_namespaces` and `r2_buckets`, and supports dynamic instance management at runtime.
+[AI Search](/ai-search/) now ships with two new Workers bindings, built-in storage for every new instance, and cross-instance search.
 
-## What is new
+## New Workers bindings
 
-- **Namespace isolation**: Group instances into [namespaces](/ai-search/concepts/namespaces/) for environment or tenant separation
-- **Dynamic instance management**: Create, update, and delete instances at runtime without redeploying
-- **Items API**: Upload and manage individual documents through the [Items API](/ai-search/api/items/workers-binding/)
-- **Streaming**: Stream chat completion responses as Server-Sent Events
-- **Messages-based API**: Uses a `messages` array format consistent with chat completion APIs
+AI Search introduces two new Workers bindings for accessing AI Search from a [Cloudflare Worker](/workers/).
 
-## Example
-
-Add the binding to your Wrangler configuration and start querying:
+The `ai_search_namespaces` binding gives your Worker access to all instances within a [namespace](/ai-search/concepts/namespaces/). You can create, update, and delete instances at runtime without redeploying:
 
 ```ts
 // wrangler.jsonc
 // "ai_search_namespaces": [{ "binding": "AI_SEARCH", "namespace": "default" }]
 
-export type Env = {
-	AI_SEARCH: AiSearchNamespace;
-};
+// create a new instance at runtime
+const instance = await env.AI_SEARCH.create({ id: "my-instance" });
 
-export default {
-	async fetch(request, env): Promise<Response> {
-		const results = await env.AI_SEARCH.get("my-instance").search({
-			messages: [{ role: "user", content: "How do I configure caching?" }],
-		});
+// upload a file to built-in storage
+await instance.items.upload("guide.md", content);
 
-		return Response.json(results);
-	},
-} satisfies ExportedHandler<Env>;
+// search the instance
+const results = await env.AI_SEARCH.get("my-instance").search({
+	messages: [{ role: "user", content: "What is Cloudflare?" }],
+});
+```
+
+The `ai_search` binding binds directly to a single instance in the default namespace. Use this when you know which instance you need at deploy time:
+
+```ts
+// wrangler.jsonc
+// "ai_search": [{ "binding": "MY_SEARCH", "instance_name": "my-instance" }]
+
+const results = await env.MY_SEARCH.search({
+	messages: [{ role: "user", content: "What is Cloudflare?" }],
+});
 ```
 
-## Requirements
+Refer to [Namespaces](/ai-search/concepts/namespaces/) for details on the difference between the two bindings.
 
-| Package                     | Minimum version |
-| --------------------------- | --------------- |
-| `@cloudflare/workers-types` | `4.20260304.0`  |
-| `wrangler`                  | `4.68.1`        |
+## Built-in storage
 
-## Backwards compatibility
+New AI Search instances come with built-in storage and a built-in vector index, powered by [R2](/r2/) and [Vectorize](/vectorize/). You can upload files directly to an instance and they are indexed automatically. No need to set up an R2 bucket or Vectorize index yourself.
+
+Upload files using the [Items API](/ai-search/api/items/workers-binding/) or the **Items** tab in the dashboard. You can also connect an external data source like a website or R2 bucket alongside built-in storage.
+
+Refer to [Built-in storage](/ai-search/configuration/data-source/built-in-storage/) for details.
+
+## Cross-instance search
+
+You can now search across multiple instances in a single call using the namespace binding. Pass an array of instance IDs and get one ranked list back. Each chunk in the response includes an `instance_id` field identifying which instance it came from.
+
+```ts
+const results = await env.AI_SEARCH.search({
+	messages: [{ role: "user", content: "What is Cloudflare?" }],
+	ai_search_options: {
+		instance_ids: ["product-docs", "customer-abc123"],
+	},
+});
+```
 
-The previous `env.AI.autorag()` binding is deprecated but will continue to work indefinitely. Existing code does not need to be migrated.
+This is useful when you need to search across different data sources. For example, a support agent can search shared product docs and per-customer history in one call.
 
-For full documentation, refer to the [Workers binding guide](/ai-search/api/search/workers-binding/).
+Refer to [Namespace-level search](/ai-search/api/search/workers-binding/#namespace-level) for details.
diff --git a/src/content/changelog/ai-search/2026-04-16-hybrid-search-and-relevance-boosting.mdx b/src/content/changelog/ai-search/2026-04-16-hybrid-search-and-relevance-boosting.mdx
@@ -0,0 +1,49 @@
+---
+title: Hybrid search and relevance boosting
+description: Combine vector and keyword search in a single query, and boost results by metadata fields like timestamp or priority.
+products:
+  - ai-search
+date: 2026-04-16
+---
+
+[AI Search](/ai-search/) now supports hybrid search and relevance boosting, giving you more control over how results are found and ranked.
+
+## Hybrid search
+
+Hybrid search combines vector (semantic) search with BM25 keyword search in a single query. Vector search understands intent, but it can miss queries that depend on a specific term appearing exactly. For example, a query for "Go errors help" might return Java or Python docs because the embedding model prioritizes the broad context of "error handling" over the word "Go." Keyword search fills that gap by matching the exact terms in your query.
+
+When you enable hybrid search, both run in parallel and the results are fused into a single ranked list. You can configure the tokenizer (`porter` for natural language, `trigram` for code), keyword match mode (`and` for precision, `or` for recall), and fusion method (`rrf` or `max`) per instance:
+
+```ts
+const instance = await env.AI_SEARCH.create({
+	id: "my-instance",
+	index_method: { vector: true, keyword: true },
+	fusion_method: "rrf",
+	indexing_options: { keyword_tokenizer: "porter" },
+	retrieval_options: { keyword_match_mode: "and" },
+});
+```
+
+Refer to [Search modes](/ai-search/concepts/search-modes/) for an overview and [Hybrid search](/ai-search/configuration/indexing/hybrid-search/) for configuration details.
+
+## Relevance boosting
+
+Relevance boosting lets you nudge search rankings based on document metadata. For example, you can prioritize recent documents by boosting on `timestamp`, or surface high-priority content by boosting on a custom metadata field like `priority`.
+
+Configure up to 3 boost fields per instance or override them per request:
+
+```ts
+const results = await env.AI_SEARCH.get("my-instance").search({
+	messages: [{ role: "user", content: "deployment guide" }],
+	ai_search_options: {
+		retrieval: {
+			boost_by: [
+				{ field: "timestamp", direction: "desc" },
+				{ field: "priority", direction: "desc" },
+			],
+		},
+	},
+});
+```
+
+Refer to [Relevance boosting](/ai-search/configuration/retrieval/boosting/) for configuration details.
diff --git a/src/content/docs/ai-search/how-to/nlweb.mdx b/src/content/docs/ai-search/how-to/nlweb.mdx
@@ -5,8 +5,12 @@ sidebar:
   order: 6
 ---
 
+import { YouTube } from "~/components";
+
 Enable conversational search on your website with NLWeb and Cloudflare AI Search. This template crawls your site, indexes the content, and deploys NLWeb-standard endpoints to serve both people and AI agents.
 
+<YouTube id="Az6NKLjSZMM" />
+
 :::note
 This is a public preview ideal for experimentation. If you're interested in running this in production workflows, please contact us at nlweb@cloudflare.com.
 :::
@@ -26,9 +30,9 @@ You can deploy NLWeb on your website directly through the AI Search dashboard:
 2. Go to **Compute & AI** > **AI Search**.
 3. Select **Create**.
 4. Select **Website** as a data source.
-4. Follow the instructions to create an AI Search instance.
-5. Go to the **Settings** for the instance 
-6. Find **NLWeb Worker** and select "Enable AI Search for your website".
+5. Follow the instructions to create an AI Search instance.
+6. Go to the **Settings** for the instance
+7. Find **NLWeb Worker** and select "Enable AI Search for your website".
 
 Once complete, AI Search will deploy an NLWeb Worker for you that enables you to use the NLWeb API Endpoints.
 
@@ -37,16 +41,16 @@ Once complete, AI Search will deploy an NLWeb Worker for you that enables you to
 Choosing the NLWeb Website option extends a normal AI Search by tailoring it for content‑heavy websites and giving you everything that is required to adopt NLWeb as the standard for conversational search on your site. Specifically, the template provides:
 
 - **Website as a data source:** Uses [Website](/ai-search/configuration/data-source/website/) as data source option to crawl and ingest pages with the Rendered Sites option.
-- **Defaults for content-heavy websites:**  Applies tuned embedding and retrieval configurations ideal for publishing and content‑rich websites.
+- **Defaults for content-heavy websites:** Applies tuned embedding and retrieval configurations ideal for publishing and content‑rich websites.
 - **NLWeb Worker deployment:** Automatically spins up a Cloudflare Worker from the [NLWeb Worker template](https://github.com/cloudflare/templates).
 
 ## What the Worker includes
 
 Your deployed Worker provides two endpoints:
 
 - `/ask` — NLWeb’s standard conversational endpoint
-    - Powers the conversational UI at the root (`/`)
-    - Powers the embeddable preview widget (`/snippet.html`)
+  - Powers the conversational UI at the root (`/`)
+  - Powers the embeddable preview widget (`/snippet.html`)
 - `/mcp` — NLWeb’s MCP server endpoint for trusted AI agents
 
 These endpoints give both people and agents structured access to your content.
@@ -56,9 +60,11 @@ These endpoints give both people and agents structured access to your content.
 To integrate NLWeb search directly into your site you can:
 
 1. Find your deployed Worker in the [Cloudflare dashboard](https://dash.cloudflare.com/):
-  - Go to **Compute & AI** >  **AI Search**.
-  - Select **Connect**, then go to the **NLWeb** tab.
-  - Select **Go to Worker**.
+
+- Go to **Compute & AI** > **AI Search**.
+- Select **Connect**, then go to the **NLWeb** tab.
+- Select **Go to Worker**.
+
 2. Add a [custom domain](/workers/configuration/routing/custom-domains/) to your Worker (for example, ask.example.com)
 3. Use the `/ask` endpoint on your custom domain to power the search (for example, ask.example.com/ask)
 
@@ -87,7 +93,6 @@ You can also use the embeddable snippet to add a search UI directly into your we
 
 This lets you serve conversational AI search directly from your own domain, with control over how people and agents access your content.
 
-
 ## Modifying or updating the Worker
 
 You may want to customize your Worker, for example, to adjust the UI for the embeddable snippet. In those cases, we recommend calling the `/ask` endpoint for queries and building your own UI on top of it, however, you may also choose to modify the Worker's code for the embeddable UI.
@@ -104,7 +109,7 @@ To do so:
 2. Enter the name of your AI Search in the `RAG_ID` environment variable field.
 3. Click **Deploy**.
 4. Select the **GitHub/GitLab** icon on the Workers Dashboard.
-4. Clone the repository that is created for your Worker.
-5. Make your modifications, then commit and push changes to the repository to update your Worker.
+5. Clone the repository that is created for your Worker.
+6. Make your modifications, then commit and push changes to the repository to update your Worker.
 
-Now you can use this Worker as the new NLWeb endpoint for your website.
+Now you can use this Worker as the new NLWeb endpoint for your website.
diff --git a/src/content/docs/ai-search/index.mdx b/src/content/docs/ai-search/index.mdx
@@ -48,6 +48,10 @@ You can use AI Search for:
 	</LinkButton>
 </div>
 
+:::note[Latest update]
+New AI Search instances created after April 16, 2026 include [managed storage](/ai-search/configuration/data-source/built-in-storage/), vector index, and web crawling at no additional cost during the open beta. [View limits and pricing](/ai-search/platform/limits-pricing/).
+:::
+
 ---
 
 ## Features
diff --git a/src/content/release-notes/ai-search.yaml b/src/content/release-notes/ai-search.yaml
@@ -4,9 +4,25 @@ productName: AI Search
 productLink: "/ai-search/"
 entries:
   - publish_date: "2026-04-16"
-    title: AI Search namespace binding
+    title: Hybrid search
     description: |-
-      The new `ai_search_namespaces` [Workers binding](/ai-search/api/search/workers-binding/) provides first-class access to AI Search from your Worker. Group instances into [namespaces](/ai-search/concepts/namespaces/) for isolation, create and manage instances at runtime, and upload documents through the [Items API](/ai-search/api/items/workers-binding/). The previous `env.AI.autorag()` binding continues to work.
+      AI Search now supports [hybrid search](/ai-search/configuration/indexing/hybrid-search/), combining vector and BM25 keyword search in a single query. Configure the tokenizer, keyword match mode, and fusion method per instance. Refer to [Search modes](/ai-search/concepts/search-modes/) for an overview.
+  - publish_date: "2026-04-16"
+    title: Built-in storage
+    description: |-
+      New AI Search instances come with [built-in storage](/ai-search/configuration/data-source/built-in-storage/) and a vector index. Upload files directly to an instance using the Items API or the dashboard without setting up external infrastructure.
+  - publish_date: "2026-04-16"
+    title: Relevance boosting
+    description: |-
+      Boost search results by metadata fields like timestamp or priority using [relevance boosting](/ai-search/configuration/retrieval/boosting/). Configure up to 3 boost fields per instance or per request.
+  - publish_date: "2026-04-16"
+    title: Cross-instance search
+    description: |-
+      Search across multiple AI Search instances in a single call using [namespace-level search](/ai-search/api/search/workers-binding/#namespace-level). Results are merged and ranked, with each chunk identifying which instance it came from.
+  - publish_date: "2026-04-16"
+    title: New AI Search Workers bindings
+    description: |-
+      Two new [Workers bindings](/ai-search/api/search/workers-binding/) for AI Search. The `ai_search_namespaces` binding gives access to all instances within a [namespace](/ai-search/concepts/namespaces/) and supports dynamic instance management at runtime. The `ai_search` binding binds directly to a single instance for simpler use cases.
   - publish_date: "2026-04-01"
     title: Wrangler CLI support for AI Search
     description: |-