databricks
diff --git a/‎README.md‎
Lines changed: 17 additions & 0 deletions b/‎README.md‎
Lines changed: 17 additions & 0 deletions
diff --git a/‎experimental/README.md‎
Lines changed: 82 additions & 0 deletions b/‎experimental/README.md‎
Lines changed: 82 additions & 0 deletions
diff --git a/‎experimental/databricks-agent-bricks/1-knowledge-assistants.md‎
Lines changed: 183 additions & 0 deletions b/‎experimental/databricks-agent-bricks/1-knowledge-assistants.md‎
Lines changed: 183 additions & 0 deletions
@@ -24,6 +24,23 @@ Run this command in chat:
 
 - **databricks-apps** - Build full-stack TypeScript apps on Databricks using AppKit
 
+See [`skills/`](./skills/) for the full list of supported skills.
+
+## Experimental Skills
+
+The [`experimental/`](./experimental/) directory contains additional skills
+imported from [databricks-solutions/ai-dev-kit](https://github.com/databricks-solutions/ai-dev-kit)
+on a **best-effort basis**.
+
+- Experimental skills are **not officially supported** — they may be used, but
+  do not follow the same review / quality bar as the stable skills under
+  [`skills/`](./skills/).
+- They are **not installed by default** by `databricks experimental aitools
+  skills install`. Pass `--experimental` to install all of them, or install a
+  specific one by name.
+- See [`experimental/README.md`](./experimental/README.md) for the full list
+  and caveats.
+
 ## Structure
 
 Each skill follows the [Agent Skills Specification](https://agentskills.io/specification):
 
@@ -0,0 +1,82 @@
+> ⚠️ **Experimental — best-effort, not officially supported**
+>
+> The skills in this directory are imported from
+> [databricks-solutions/ai-dev-kit](https://github.com/databricks-solutions/ai-dev-kit)
+> on a best-effort basis. They may be useful, but they are **not officially
+> supported** as part of `databricks-agent-skills`:
+>
+> - They do not follow the same review / quality bar as the skills in
+>   [`../skills/`](../skills/).
+> - They may be out of date relative to upstream `ai-dev-kit`.
+> - They may overlap or conflict with the stable skills (e.g.
+>   `databricks-jobs`, `databricks-model-serving` exist in both directories).
+> - They are not installed by `databricks experimental aitools skills install`
+>   by default — you have to opt in (see the root README).
+>
+> File issues against this directory in this repo; do not file issues against
+> `ai-dev-kit` for skills installed via `databricks-agent-skills`.
+
+---
+
+# Databricks Skills for Claude Code
+
+Skills that teach Claude Code how to work effectively with Databricks - providing patterns, best practices, and code examples that work with Databricks MCP tools.
+
+## Installation
+
+These experimental skills are **not** installed by default. To install them via the Databricks CLI:
+
+```bash
+# Install all experimental skills at once
+databricks experimental aitools skills install --experimental
+
+# Install a single experimental skill by name
+databricks experimental aitools skills install databricks-iceberg
+```
+
+See the root [README](../README.md) for details on the stable install path.
+
+## Available Skills
+
+### 🤖 AI & Agents
+- **databricks-ai-functions** - Built-in AI Functions (ai_classify, ai_extract, ai_summarize, ai_query, ai_forecast, ai_parse_document, and more) with SQL and PySpark patterns, function selection guidance, document processing pipelines, and custom RAG (parse → chunk → index → query)
+- **databricks-agent-bricks** - Knowledge Assistants, Genie Spaces, Supervisor Agents
+- **databricks-genie** - Genie Spaces: create, curate, and query via Conversation API
+- **databricks-model-serving** - Deploy MLflow models and AI agents to endpoints *(also available as stable skill)*
+- **databricks-mlflow-evaluation** - End-to-end agent evaluation workflow
+- **databricks-unstructured-pdf-generation** - Generate synthetic PDFs for RAG
+- **databricks-vector-search** - Vector similarity search for RAG and semantic search
+
+### 📊 Analytics & Dashboards
+- **databricks-aibi-dashboards** - Databricks AI/BI dashboards (with SQL validation workflow)
+- **databricks-metric-views** - Metric Views for governed metrics
+- **databricks-unity-catalog** - System tables for lineage, audit, billing
+
+### 🔧 Data Engineering
+- **databricks-dbsql** - Databricks SQL warehouse patterns
+- **databricks-iceberg** - Apache Iceberg tables (Managed/Foreign), UniForm, Iceberg REST Catalog, Iceberg Clients Interoperability
+- **databricks-spark-declarative-pipelines** - SDP (formerly DLT) in SQL/Python
+- **databricks-spark-structured-streaming** - Spark Structured Streaming patterns
+- **databricks-jobs** - Multi-task workflows, triggers, schedules *(also available as stable skill)*
+- **databricks-synthetic-data-gen** - Realistic test data with Faker
+- **databricks-zerobus-ingest** - Zerobus ingest patterns
+- **spark-python-data-source** - Python data sources for Spark
+
+### 🚀 Development & Deployment
+- **databricks-bundles** - DABs for multi-environment deployments
+- **databricks-apps-python** - Python web apps (Dash, Streamlit, Flask) with foundation model integration
+- **databricks-python-sdk** - Python SDK, Connect, CLI, REST API
+- **databricks-config** - Profile authentication setup
+- **databricks-execution-compute** - Execute on Databricks compute
+- **databricks-lakebase-autoscale** - Autoscaling for Lakebase
+- **databricks-lakebase-provisioned** - Managed PostgreSQL for OLTP workloads
+
+### 📚 Reference
+- **databricks-docs** - Documentation index via llms.txt
+
+## Provenance
+
+These skills are imported as a snapshot from
+[`databricks-solutions/ai-dev-kit/databricks-skills/`](https://github.com/databricks-solutions/ai-dev-kit/tree/main/databricks-skills).
+Upstream changes are not automatically synced — see the
+[contributing notes](../CONTRIBUTING.md) for the current sync process.
@@ -0,0 +1,183 @@
+# Knowledge Assistants (KA)
+
+Knowledge Assistants are document-based Q&A systems that use RAG (Retrieval-Augmented Generation) to answer questions from indexed documents.
+
+## What is a Knowledge Assistant?
+
+A KA connects to documents stored in a Unity Catalog Volume and allows users to ask natural language questions. The system:
+
+1. **Indexes** all documents in the volume (PDFs, text files, etc.)
+2. **Retrieves** relevant chunks when a question is asked
+3. **Generates** an answer using the retrieved context
+
+## When to Use
+
+Use a Knowledge Assistant when:
+- You have a collection of documents (policies, manuals, guides, reports)
+- Users need to find specific information without reading entire documents
+- You want to provide a conversational interface to documentation
+
+## Prerequisites
+
+Before creating a KA, you need documents in a Unity Catalog Volume:
+
+**Option 1: Use existing documents**
+- Upload PDFs/text files to a Volume manually or via SDK
+
+**Option 2: Generate synthetic documents**
+- Use the `databricks-unstructured-pdf-generation` skill to create realistic PDF documents
+- Each PDF gets a companion JSON file with question/guideline pairs for evaluation
+
+## Creating a Knowledge Assistant
+
+Use the `manage_ka` tool with `action="create_or_update"`:
+
+- `name`: "HR Policy Assistant"
+- `volume_path`: "/Volumes/my_catalog/my_schema/raw_data/hr_docs"
+- `description`: "Answers questions about HR policies and procedures"
+- `instructions`: "Be helpful and always cite the specific policy document when answering. If you're unsure, say so."
+
+The tool will:
+1. Create the KA with the specified volume as a knowledge source
+2. Scan the volume for JSON files with example questions (from PDF generation)
+3. Queue examples to be added once the endpoint is ready
+
+## Provisioning Timeline
+
+After creation, the KA endpoint needs to provision:
+
+| Status | Meaning | Duration |
+|--------|---------|----------|
+| `PROVISIONING` | Creating the endpoint | 2-5 minutes |
+| `ONLINE` | Ready to use | - |
+| `OFFLINE` | Not currently running | - |
+
+Use `manage_ka` with `action="get"` to check the status:
+
+- `tile_id`: "<the tile_id from create>"
+
+## Adding Example Questions
+
+Example questions help with:
+- **Evaluation**: Test if the KA answers correctly
+- **User onboarding**: Show users what to ask
+
+### Automatic (from PDF generation)
+
+If you used `generate_pdf_documents`, each PDF has a companion JSON with:
+```json
+{
+  "question": "What is the company's remote work policy?",
+  "guideline": "Should mention the 3-day minimum in-office requirement"
+}
+```
+
+These are automatically added when `add_examples_from_volume=true` (default).
+
+### Manual
+
+Examples can also be specified in the `manage_ka` create_or_update call if needed.
+
+## Best Practices
+
+### Document Organization
+
+- **One volume per topic**: e.g., `/Volumes/catalog/schema/raw_data/hr_docs`, `/Volumes/catalog/schema/raw_data/tech_docs`
+- **Clear naming**: Name files descriptively so chunks are identifiable
+
+### Instructions
+
+Good instructions improve answer quality:
+
+```
+Be helpful and professional. When answering:
+1. Always cite the specific document and section
+2. If multiple documents are relevant, mention all of them
+3. If the information isn't in the documents, clearly say so
+4. Use bullet points for multi-part answers
+```
+
+### Updating Content
+
+To update the indexed documents:
+1. Add/remove/modify files in the volume
+2. Call `manage_ka` with `action="create_or_update"`, the same name and `tile_id`
+3. The KA will re-index the updated content
+
+## Example Workflow
+
+1. **Generate PDF documents** using `databricks-unstructured-pdf-generation` skill:
+   - Creates PDFs in `/Volumes/catalog/schema/raw_data/pdf_documents`
+   - Creates JSON files with question/guideline pairs
+
+2. **Create the Knowledge Assistant**:
+   - `name`: "My Document Assistant"
+   - `volume_path`: "/Volumes/catalog/schema/raw_data/pdf_documents"
+
+3. **Wait for ONLINE status** (2-5 minutes)
+
+4. **Examples are automatically added** from the JSON files
+
+5. **Test the KA** in the Databricks UI
+
+## Using KA in Supervisor Agents
+
+Knowledge Assistants can be used as agents in a Supervisor Agent (formerly Multi-Agent Supervisor, MAS). Each KA has an associated model serving endpoint.
+
+### Finding the Endpoint Name
+
+Use `manage_ka` with `action="get"` to retrieve the KA details. The response includes:
+- `tile_id`: The unique identifier for the KA
+- `name`: The KA name (sanitized)
+- `endpoint_status`: Current status (ONLINE, PROVISIONING, etc.)
+
+The endpoint name follows this pattern: `ka-{tile_id}-endpoint`
+
+### Finding a KA by Name
+
+If you know the KA name but not the tile_id, use `manage_ka` with `action="find_by_name"`:
+
+```python
+manage_ka(action="find_by_name", name="HR_Policy_Assistant")
+# Returns: {"found": True, "tile_id": "01abc...", "name": "HR_Policy_Assistant", "endpoint_name": "ka-01abc...-endpoint"}
+```
+
+### Example: Adding KA to Supervisor Agent
+
+```python
+# First, find the KA
+manage_ka(action="find_by_name", name="HR_Policy_Assistant")
+
+# Then use the tile_id in a Supervisor Agent
+manage_mas(
+    action="create_or_update",
+    name="Support_MAS",
+    agents=[
+        {
+            "name": "hr_agent",
+            "ka_tile_id": "<tile_id from find_by_name>",
+            "description": "Answers HR policy questions from the employee handbook"
+        }
+    ]
+)
+```
+
+## Troubleshooting
+
+### Endpoint stays in PROVISIONING
+
+- Check workspace capacity and quotas
+- Verify the volume path is accessible
+- Wait up to 10 minutes before investigating further
+
+### Documents not indexed
+
+- Ensure files are in a supported format (PDF, TXT, MD)
+- Check file permissions in the volume
+- Verify the volume path is correct
+
+### Poor answer quality
+
+- Add more specific instructions
+- Ensure documents are well-structured
+- Consider breaking large documents into smaller files