Database Setup

Setup instructions for each supported database.

PostgreSQL

Create the database:
```
createdb contoso_inventory
```
Run the seed script:
```
uv run python scripts/seed_postgres.py
```

Azure SQL Database

Create an Azure SQL Database in Azure Portal
Configure firewall rules to allow your IP

Set environment variables in .env:

AZURE_SQL_SERVER=your-server.database.windows.net
AZURE_SQL_DATABASE=ContosoHR
AZURE_SQL_USERNAME=sqladmin
AZURE_SQL_PASSWORD=your-password

Run the seed script:
```
uv run python scripts/seed_azure_sql.py
```

Azure Synapse Analytics

Create an Azure Synapse workspace in Azure Portal
Configure the serverless SQL endpoint

Set environment variables in .env:

SYNAPSE_SERVER=your-workspace.sql.azuresynapse.net
SYNAPSE_DATABASE=HotelAnalytics
SYNAPSE_USERNAME=sqladmin
SYNAPSE_PASSWORD=your-password

Run the seed script:
```
uv run python scripts/seed_synapse.py
```

Azure Cosmos DB

Create a Cosmos DB account and database in Azure Portal

Set the connection string in .env:

COSMOS_ENDPOINT=https://your-account.documents.azure.com:443/
COSMOS_KEY=your-cosmos-key

Run the seed script:
```
uv run python scripts/seed_cosmos.py
```

Databricks

Create a SQL Warehouse in your Databricks workspace
Generate a Personal Access Token (Settings > Developer > Access tokens)

Configure environment variables:

DATABRICKS_HOST=https://your-workspace.azuredatabricks.net
DATABRICKS_TOKEN=dapi...
DATABRICKS_PATH=/sql/1.0/warehouses/your-warehouse-id
DATABRICKS_CATALOG=your_catalog
DATABRICKS_SCHEMA=your_schema

Run the seed script:

uv run python scripts/seed_databricks.py

Google BigQuery

Create a GCP project and enable BigQuery API
Create a service account with BigQuery permissions
Download the service account JSON key

Configure environment variables:

GOOGLE_CLOUD_PROJECT=your-project-id
BIGQUERY_DATASET=financial_data
GOOGLE_APPLICATION_CREDENTIALS=./credentials.json

Run the seed script:
```
uv run python scripts/seed_bigquery.py
```

Azure AD Authentication

Several datasources support Azure AD authentication using DefaultAzureCredential.

PostgreSQL

datasource:
  type: postgres
  host: "myserver.postgres.database.azure.com"
  database: "mydb"
  username: "myuser@myserver"

Azure SQL / Synapse

datasource:
  type: azure_sql
  server: "myserver.database.windows.net"
  database: "mydb"
  use_aad: true

Cosmos DB

datasource:
  type: cosmos
  endpoint: "https://myaccount.documents.azure.com:443/"
  database: "mydb"
  container: "mycontainer"
  use_aad: true

Note: For Cosmos DB with Azure AD, assign the appropriate RBAC role:

Cosmos DB Built-in Data Reader - read-only
Cosmos DB Built-in Data Contributor - read/write

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Database Setup

PostgreSQL

Azure SQL Database

Azure Synapse Analytics

Azure Cosmos DB

Databricks

Google BigQuery

Azure AD Authentication

PostgreSQL

Azure SQL / Synapse

Cosmos DB

FilesExpand file tree

DATABASE_SETUP.md

Latest commit

History

DATABASE_SETUP.md

File metadata and controls

Database Setup

PostgreSQL

Azure SQL Database

Azure Synapse Analytics

Azure Cosmos DB

Databricks

Google BigQuery

Azure AD Authentication

PostgreSQL

Azure SQL / Synapse

Cosmos DB