diff --git a/experimental/README.md b/experimental/README.md
Lines changed: 29 additions & 12 deletions
diff --git a/experimental/databricks-agent-bricks/1-knowledge-assistants.md b/experimental/databricks-agent-bricks/1-knowledge-assistants.md
Lines changed: 49 additions & 158 deletions
@@ -68,12 +68,11 @@ See the root [README](../README.md) for details on the stable install path.
 
 ### 🚀 Development & Deployment
 - **databricks-bundles** - DABs for multi-environment deployments
-- **databricks-apps-python** - Python web apps (Dash, Streamlit, Flask) with foundation model integration
+- **databricks-apps-python** - Databricks apps. Prefers AppKit (TypeScript + React SDK) for new apps; falls back to Python frameworks (Dash, Streamlit, Gradio, Flask, FastAPI, Reflex) when Python is required
 - **databricks-python-sdk** - Python SDK, Connect, CLI, REST API
 - **databricks-config** - Profile authentication setup
 - **databricks-execution-compute** - Execute on Databricks compute
 - **databricks-lakebase-autoscale** - Autoscaling for Lakebase
-- **databricks-lakebase-provisioned** - Managed PostgreSQL for OLTP workloads
 
 ### 📚 Reference
 - **databricks-docs** - Documentation index via llms.txt
@@ -83,16 +82,34 @@ See the root [README](../README.md) for details on the stable install path.
 These skills are imported as a snapshot from
 [`databricks-solutions/ai-dev-kit/databricks-skills/`](https://github.com/databricks-solutions/ai-dev-kit/tree/main/databricks-skills).
 
-**Source SHA**: [`2228c3e`](https://github.com/databricks-solutions/ai-dev-kit/commit/2228c3e880fbadd871882a5f99628300dcb9f2f1)
-on the `add_appkit` branch (5 commits ahead of `origin/main` at the time
-of import). Divergence from public main is small but meaningful: the
-`databricks-app-python` → `databricks-apps-python` rename had not yet been
-merged upstream, and importing from the renamed version is what prevents a
-3rd skill name collision with d-a-s's own `databricks-apps`. A few other
-local commits touch `databricks-bundles/SKILL.md` (2 lines),
-`databricks-lakebase-provisioned/SKILL.md` (2 lines), and
-`databricks-apps-python/SKILL.md` (64 lines). The full set of local
-deltas is tracked by the import commit on this branch.
+**Source SHA**: [`9c7a5b3`](https://github.com/databricks-solutions/ai-dev-kit/commit/9c7a5b3a3bf187c2b19d0b777768ecb52dd2de22)
+on the `appkit-on-experimental` branch of `jamesbroadhead/ai-dev-kit` —
+the head of [a-d-k PR #533](https://github.com/databricks-solutions/ai-dev-kit/pull/533),
+which targets a-d-k's `experimental` branch. The source was one commit
+ahead of `origin/experimental` at import time. The divergence from
+`experimental` is the PR #533 change set:
+
+- `databricks-app-python` → `databricks-apps-python` rename (folder,
+  baselines, manifests, install scripts, cross-skill mentions). The
+  rename prevents a third skill-name collision with d-a-s's own
+  `databricks-apps` — alongside the two we already handle for
+  `databricks-jobs` and `databricks-model-serving`.
+- `databricks-apps-python/SKILL.md` leads with AppKit (TypeScript +
+  React SDK) as the recommended approach for new apps; Python
+  frameworks (Dash, Streamlit, Gradio, Flask, FastAPI, Reflex) are
+  demoted to an explicit alternative.
+- `install.sh` / `install.ps1` upstream changes wiring a-d-k to
+  install d-a-s skills via a single GitHub tree call (out of scope
+  for this snapshot, not imported here).
+
+**Note**: the `experimental` branch of a-d-k previously removed
+`databricks-lakebase-provisioned`, which is why it is not present in
+this import. `databricks-model-serving` and
+`databricks-spark-declarative-pipelines` are intentionally excluded
+from this snapshot — see TODOs #1b and #5 on the import PR.
+
+The full set of paths brought in is tracked by the import commit on
+this branch.
 
 **Transition phase (until `ai-dev-kit` skills are locked):**
 - Source of truth is **upstream `ai-dev-kit`**. New work and bug fixes go there.
 
@@ -1,183 +1,74 @@
-# Knowledge Assistants (KA)
+# Knowledge Assistants - Details
 
-Knowledge Assistants are document-based Q&A systems that use RAG (Retrieval-Augmented Generation) to answer questions from indexed documents.
+For commands, see [SKILL.md](SKILL.md).
 
-## What is a Knowledge Assistant?
+## Source Types
 
-A KA connects to documents stored in a Unity Catalog Volume and allows users to ask natural language questions. The system:
+Both shapes go inside the `--json` body alongside `display_name` and `description` — see SKILL.md for the full invocation.
 
-1. **Indexes** all documents in the volume (PDFs, text files, etc.)
-2. **Retrieves** relevant chunks when a question is asked
-3. **Generates** an answer using the retrieved context
+### Files (Volume)
 
-## When to Use
-
-Use a Knowledge Assistant when:
-- You have a collection of documents (policies, manuals, guides, reports)
-- Users need to find specific information without reading entire documents
-- You want to provide a conversational interface to documentation
-
-## Prerequisites
-
-Before creating a KA, you need documents in a Unity Catalog Volume:
-
-**Option 1: Use existing documents**
-- Upload PDFs/text files to a Volume manually or via SDK
-
-**Option 2: Generate synthetic documents**
-- Use the `databricks-unstructured-pdf-generation` skill to create realistic PDF documents
-- Each PDF gets a companion JSON file with question/guideline pairs for evaluation
-
-## Creating a Knowledge Assistant
-
-Use the `manage_ka` tool with `action="create_or_update"`:
-
-- `name`: "HR Policy Assistant"
-- `volume_path`: "/Volumes/my_catalog/my_schema/raw_data/hr_docs"
-- `description`: "Answers questions about HR policies and procedures"
-- `instructions`: "Be helpful and always cite the specific policy document when answering. If you're unsure, say so."
-
-The tool will:
-1. Create the KA with the specified volume as a knowledge source
-2. Scan the volume for JSON files with example questions (from PDF generation)
-3. Queue examples to be added once the endpoint is ready
-
-## Provisioning Timeline
-
-After creation, the KA endpoint needs to provision:
-
-| Status | Meaning | Duration |
-|--------|---------|----------|
-| `PROVISIONING` | Creating the endpoint | 2-5 minutes |
-| `ONLINE` | Ready to use | - |
-| `OFFLINE` | Not currently running | - |
-
-Use `manage_ka` with `action="get"` to check the status:
-
-- `tile_id`: "<the tile_id from create>"
-
-## Adding Example Questions
-
-Example questions help with:
-- **Evaluation**: Test if the KA answers correctly
-- **User onboarding**: Show users what to ask
-
-### Automatic (from PDF generation)
-
-If you used `generate_pdf_documents`, each PDF has a companion JSON with:
 ```json
 {
-  "question": "What is the company's remote work policy?",
-  "guideline": "Should mention the 3-day minimum in-office requirement"
+  "display_name": "...",
+  "description": "...",
+  "source_type": "files",
+  "files": {"path": "/Volumes/catalog/schema/volume/folder/"}
 }
 ```
 
-These are automatically added when `add_examples_from_volume=true` (default).
-
-### Manual
+Supported formats: PDF, TXT, MD, DOCX.
 
-Examples can also be specified in the `manage_ka` create_or_update call if needed.
+### Vector Search Index
 
-## Best Practices
+Use an existing index instead of auto-indexing:
 
-### Document Organization
-
-- **One volume per topic**: e.g., `/Volumes/catalog/schema/raw_data/hr_docs`, `/Volumes/catalog/schema/raw_data/tech_docs`
-- **Clear naming**: Name files descriptively so chunks are identifiable
-
-### Instructions
-
-Good instructions improve answer quality:
-
-```
-Be helpful and professional. When answering:
-1. Always cite the specific document and section
-2. If multiple documents are relevant, mention all of them
-3. If the information isn't in the documents, clearly say so
-4. Use bullet points for multi-part answers
+```json
+{
+  "display_name": "...",
+  "description": "...",
+  "source_type": "index",
+  "index": {
+    "index_name": "catalog.schema.my_index",
+    "text_col": "content",
+    "doc_uri_col": "source_url"
+  }
+}
 ```
 
-### Updating Content
-
-To update the indexed documents:
-1. Add/remove/modify files in the volume
-2. Call `manage_ka` with `action="create_or_update"`, the same name and `tile_id`
-3. The KA will re-index the updated content
-
-## Example Workflow
-
-1. **Generate PDF documents** using `databricks-unstructured-pdf-generation` skill:
-   - Creates PDFs in `/Volumes/catalog/schema/raw_data/pdf_documents`
-   - Creates JSON files with question/guideline pairs
-
-2. **Create the Knowledge Assistant**:
-   - `name`: "My Document Assistant"
-   - `volume_path`: "/Volumes/catalog/schema/raw_data/pdf_documents"
+## Updating Content
 
-3. **Wait for ONLINE status** (2-5 minutes)
+1. Add/modify/remove files in the Volume
+2. Re-sync: `databricks knowledge-assistants sync-knowledge-sources "knowledge-assistants/{ka_id}"`
 
-4. **Examples are automatically added** from the JSON files
-
-5. **Test the KA** in the Databricks UI
-
-## Using KA in Supervisor Agents
-
-Knowledge Assistants can be used as agents in a Supervisor Agent (formerly Multi-Agent Supervisor, MAS). Each KA has an associated model serving endpoint.
-
-### Finding the Endpoint Name
+## Troubleshooting
 
-Use `manage_ka` with `action="get"` to retrieve the KA details. The response includes:
-- `tile_id`: The unique identifier for the KA
-- `name`: The KA name (sanitized)
-- `endpoint_status`: Current status (ONLINE, PROVISIONING, etc.)
+**KA stays in CREATING:**
+- Wait up to 10 minutes
+- Check workspace quotas
+- Verify volume path exists
 
-The endpoint name follows this pattern: `ka-{tile_id}-endpoint`
+**Documents not indexed:**
+- Check file format (PDF, TXT, MD, DOCX)
+- Verify volume path (trailing slash matters)
+- Check file permissions
 
-### Finding a KA by Name
+**Poor answer quality:**
+- Ensure documents are well-structured
+- Break large documents into smaller files
+- Add clear headings and sections
 
-If you know the KA name but not the tile_id, use `manage_ka` with `action="find_by_name"`:
+## Evaluation Questions
 
-```python
-manage_ka(action="find_by_name", name="HR_Policy_Assistant")
-# Returns: {"found": True, "tile_id": "01abc...", "name": "HR_Policy_Assistant", "endpoint_name": "ka-01abc...-endpoint"}
-```
+When testing a KA, check if the volume or project contains a `pdf_eval_questions.json` file with test questions:
 
-### Example: Adding KA to Supervisor Agent
-
-```python
-# First, find the KA
-manage_ka(action="find_by_name", name="HR_Policy_Assistant")
-
-# Then use the tile_id in a Supervisor Agent
-manage_mas(
-    action="create_or_update",
-    name="Support_MAS",
-    agents=[
-        {
-            "name": "hr_agent",
-            "ka_tile_id": "<tile_id from find_by_name>",
-            "description": "Answers HR policy questions from the employee handbook"
-        }
-    ]
-)
+```json
+{
+  "api_errors_guide.pdf": {
+    "question": "What is the solution for error ERR-4521?",
+    "expected_fact": "Call /api/v2/auth/refresh with refresh_token before the 3600s TTL expires"
+  }
+}
 ```
 
-## Troubleshooting
-
-### Endpoint stays in PROVISIONING
-
-- Check workspace capacity and quotas
-- Verify the volume path is accessible
-- Wait up to 10 minutes before investigating further
-
-### Documents not indexed
-
-- Ensure files are in a supported format (PDF, TXT, MD)
-- Check file permissions in the volume
-- Verify the volume path is correct
-
-### Poor answer quality
-
-- Add more specific instructions
-- Ensure documents are well-structured
-- Consider breaking large documents into smaller files
+Use these questions to validate retrieval accuracy. See [databricks-unstructured-pdf-generation](../databricks-unstructured-pdf-generation/SKILL.md) for generating test PDFs with eval questions.
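
As a rough illustration of the "Evaluation Questions" section added in this diff, the Python sketch below parses JSON shaped like `pdf_eval_questions.json` and checks whether an answer contains the expected fact. The `ask` callable, the `fake_ask` stub, and the case-insensitive substring rule are all assumptions made for illustration; the diff does not specify how the KA is queried or how answers should be scored.

```python
import json
from typing import Callable, Dict

EvalSet = Dict[str, Dict[str, str]]


def load_eval_questions(text: str) -> EvalSet:
    """Parse JSON shaped like pdf_eval_questions.json:
    {filename: {"question": ..., "expected_fact": ...}}."""
    data = json.loads(text)
    for name, entry in data.items():
        missing = {"question", "expected_fact"} - set(entry)
        if missing:
            raise ValueError(f"{name}: missing keys {sorted(missing)}")
    return data


def run_eval(questions: EvalSet, ask: Callable[[str], str]) -> Dict[str, bool]:
    """Return {filename: passed}.

    `ask` is a hypothetical stand-in for whatever sends a question to the KA.
    """
    results = {}
    for name, entry in questions.items():
        answer = ask(entry["question"])
        # Assumption: a case-insensitive substring match counts as a pass.
        results[name] = entry["expected_fact"].lower() in answer.lower()
    return results


# Sample content copied from the example in the diff above.
SAMPLE = """
{
  "api_errors_guide.pdf": {
    "question": "What is the solution for error ERR-4521?",
    "expected_fact": "Call /api/v2/auth/refresh with refresh_token before the 3600s TTL expires"
  }
}
"""


def fake_ask(question: str) -> str:
    # Placeholder answer so the sketch runs without a workspace.
    return "Call /api/v2/auth/refresh with refresh_token before the 3600s TTL expires."


if __name__ == "__main__":
    print(run_eval(load_eval_questions(SAMPLE), fake_ask))
```

Exact substring matching is brittle against paraphrased answers, so a real check would likely need a looser comparison; the sketch only shows how the file's shape maps onto a pass/fail loop.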