feast-dev
diff --git a/‎docs/SUMMARY.md‎
Lines changed: 2 additions & 0 deletions b/‎docs/SUMMARY.md‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎docs/getting-started/genai.md‎
Lines changed: 64 additions & 1 deletion b/‎docs/getting-started/genai.md‎
Lines changed: 64 additions & 1 deletion
diff --git a/‎docs/how-to-guides/feast-on-kubernetes.md‎
Lines changed: 4 additions & 0 deletions b/‎docs/how-to-guides/feast-on-kubernetes.md‎
Lines changed: 4 additions & 0 deletions
@@ -55,6 +55,7 @@
 * [Retrieval Augmented Generation (RAG) with Feast](tutorials/rag-with-docling.md)
 * [RAG Fine Tuning with Feast and Milvus](../examples/rag-retriever/README.md)
 * [MCP - AI Agent Example](../examples/mcp_feature_store/README.md)
+* [Feast-Powered AI Agent](../examples/agent_feature_store/README.md)
 
 ## How-to Guides
 
@@ -70,6 +71,7 @@
   * [Multi-Team Feature Store Setup](how-to-guides/federated-feature-store.md)
 * [Running Feast in production (e.g. on Kubernetes)](how-to-guides/running-feast-in-production.md)
 * [Feast on Kubernetes](how-to-guides/feast-on-kubernetes.md)
+* [Feast Production Deployment Topologies](how-to-guides/production-deployment-topologies.md)
 * [Online Server Performance Tuning](how-to-guides/online-server-performance-tuning.md)
 * [Customizing Feast](how-to-guides/customizing-feast/README.md)
   * [Adding a custom batch materialization engine](how-to-guides/customizing-feast/creating-a-custom-materialization-engine.md)
 
@@ -56,6 +56,53 @@ The transformation workflow typically involves:
 3. **Chunking**: Split documents into smaller, semantically meaningful chunks
 4. **Embedding Generation**: Convert text chunks into vector embeddings
 5. **Storage**: Store embeddings and metadata in Feast's feature store
+
+### DocEmbedder: End-to-End Document Ingestion Pipeline
+
+The `DocEmbedder` class provides an end-to-end pipeline for ingesting documents into Feast's online vector store. It handles chunking, embedding generation, and writing results -- all in a single step.
+
+#### Key Components
+
+* **`DocEmbedder`**: High-level orchestrator that runs the full pipeline: chunk → embed → schema transform → write to online store
+* **`BaseChunker` / `TextChunker`**: Pluggable chunking layer. `TextChunker` splits text by word count with configurable `chunk_size`, `chunk_overlap`, `min_chunk_size`, and `max_chunk_chars`
+* **`BaseEmbedder` / `MultiModalEmbedder`**: Pluggable embedding layer with modality routing. `MultiModalEmbedder` supports text (via sentence-transformers) and image (via CLIP) with lazy model loading
+* **`SchemaTransformFn`**: A user-defined function that transforms the chunked + embedded DataFrame into the format expected by the FeatureView schema
+
+#### Quick Example
+
+```python
+from feast import DocEmbedder
+import pandas as pd
+
+# Prepare your documents
+df = pd.DataFrame({
+    "id": ["doc1", "doc2"],
+    "text": ["First document content...", "Second document content..."],
+})
+
+# Create DocEmbedder -- automatically generates a FeatureView and applies the repo
+embedder = DocEmbedder(
+    repo_path="feature_repo/",
+    feature_view_name="text_feature_view",
+)
+
+# Embed and ingest documents in one step
+result = embedder.embed_documents(
+    documents=df,
+    id_column="id",
+    source_column="text",
+    column_mapping=("text", "text_embedding"),
+)
+```
+
+#### Features
+
+* **Auto-generates FeatureView**: Creates a Python file with Entity and FeatureView definitions compatible with `feast apply`
+* **Auto-applies repo**: Registers the generated FeatureView in the registry automatically
+* **Custom schema transform**: Provide your own `SchemaTransformFn` to control how chunked + embedded data maps to your FeatureView schema
+* **Extensible**: Subclass `BaseChunker` or `BaseEmbedder` to plug in your own chunking or embedding strategies
+
+For a complete walkthrough, see the [DocEmbedder tutorial notebook](../../examples/rag-retriever/rag_feast_docembedder.ipynb).
 ### Feature Transformation for LLMs
 
 Feast supports transformations that can be used to:
@@ -89,6 +136,17 @@ Implement semantic search by:
 2. Converting search queries to embeddings
 3. Finding semantically similar documents using vector search
 
+### AI Agents with Context and Memory
+
+Feast can serve as both the **context provider** and **persistent memory layer** for AI agents. Unlike stateless RAG pipelines, agents make autonomous decisions about which tools to call and can write state back to the feature store:
+
+1. **Structured context**: Retrieve customer profiles, account data, and other entity-keyed features
+2. **Knowledge retrieval**: Search vector embeddings for relevant documents
+3. **Persistent memory**: Store and recall per-entity interaction history (last topic, resolution, preferences) using `write_to_online_store`
+4. **Governed access**: All reads and writes are subject to the same RBAC, TTL, and audit policies as any other feature
+
+With MCP enabled, agents built with any framework (LangChain, LlamaIndex, CrewAI, AutoGen, or custom) can discover and call Feast tools dynamically. See the [Feast-Powered AI Agent example](../../examples/agent_feature_store/) and the blog post [Building AI Agents with Feast](https://feast.dev/blog/feast-agents-mcp/) for a complete walkthrough.
+
 ### Scaling with Spark Integration
 
 Feast integrates with Apache Spark to enable large-scale processing of unstructured data for GenAI applications:
@@ -167,20 +225,25 @@ The MCP integration uses the `fastapi_mcp` library to automatically transform yo
 The fastapi_mcp integration automatically exposes your Feast feature server's FastAPI endpoints as MCP tools. This means AI assistants can:
 
 * **Call `/get-online-features`** to retrieve features from the feature store
+* **Call `/retrieve-online-documents`** to perform vector similarity search
+* **Call `/write-to-online-store`** to persist agent state (memory, notes, interaction history)
 * **Use `/health`** to check server status  
 
-For a complete example, see the [MCP Feature Store Example](../../examples/mcp_feature_store/).
+For a basic MCP example, see the [MCP Feature Store Example](../../examples/mcp_feature_store/). For a full agent with persistent memory, see the [Feast-Powered AI Agent Example](../../examples/agent_feature_store/).
 
 ## Learn More
 
 For more detailed information and examples:
 
 * [Vector Database Reference](../reference/alpha-vector-database.md)
 * [RAG Tutorial with Docling](../tutorials/rag-with-docling.md)
+* [DocEmbedder Tutorial Notebook](../../examples/rag-retriever/rag_feast_docembedder.ipynb)
 * [RAG Fine Tuning with Feast and Milvus](../../examples/rag-retriever/README.md)
 * [Milvus Quickstart Example](https://github.com/feast-dev/feast/tree/master/examples/rag/milvus-quickstart.ipynb)
 * [Feast + Ray: Distributed Processing for RAG Applications](https://feast.dev/blog/feast-ray-distributed-processing/)
 * [MCP Feature Store Example](../../examples/mcp_feature_store/)
+* [Feast-Powered AI Agent Example (with Memory)](../../examples/agent_feature_store/)
+* [Blog: Building AI Agents with Feast](https://feast.dev/blog/feast-agents-mcp/)
 * [MCP Feature Server Reference](../reference/feature-servers/mcp-feature-server.md)
 * [Spark Data Source](../reference/data-sources/spark.md)
 * [Spark Offline Store](../reference/offline-stores/spark.md)
 
@@ -10,6 +10,10 @@ Kubernetes is a common target environment for running Feast in production. You c
 2. Run scheduled and ad-hoc jobs (e.g. materialization jobs) as Kubernetes Jobs.
 3. Operate Feast components using Kubernetes-native primitives.
 
+{% hint style="info" %}
+**Planning a production deployment?** See the [Feast Production Deployment Topologies](./production-deployment-topologies.md) guide for architecture diagrams, sample FeatureStore CRs, RBAC policies, infrastructure recommendations, and scaling best practices across Minimal, Standard, and Enterprise topologies.
+{% endhint %}
+
 ## Feast Operator
 
 To deploy Feast components on Kubernetes, use the included [feast-operator](../../infra/feast-operator).