Managed models

Unlike normal tables where the user is responsible for managing the data within the table, some database engines have a concept of a table where the engine itself ensures that the data within the table is up to date. These tables are typically based on a query that reads from other tables within the database. Each time these other tables are updated, the database will ensure that the managed table reflects the changes without the user having to do anything special (such as issue a REFRESH command).

Under the hood, each supported database engine achieves this in a slightly different way but most of them have background processes that run and automatically keep the tables up to date, within the parameters you define when you create the table.

For supported engines, we expose this functionality through Managed models. This indicates to SQLMesh that the underlying database engine will ensure that the data remains up to date and all SQLMesh needs to do is maintain the schema.

Due to this, managed models would typically be built off an External Model rather than another SQLMesh model. Since SQLMesh already ensures that models it's tracking are kept up to date, the main benefit of managed models comes when they read from external tables that arent tracked by SQLMesh.

!!! warning "Not supported in Python models"

Python models do not support the `MANAGED` [model kind](./model_kinds.md) - use a SQL model isntead.

Difference from materialized views

The difference between an Managed model and a materialized view is down to semantics and in some engines there is no difference.

SQLMesh has support for materialized views already. However, depending on the engine, these are subject to some limitations, such as:

A Materialized View query can only be derived from a single base table
The Materialized View is not automatically maintained by the engine. To refresh the data, a REFRESH MATERIALIZED VIEW or equivalent command must be issued

Managed models are different in that:

The engine updates the table data automatically when a base table changes
When performing updates, the engine has a semantic understanding of the query and can decide if an incremental or full refresh should be applied
There is no need to issue manual REFRESH commands. The engine maintains the table transparently in a background process

Lifecycle in SQLMesh

Managed models follow the same lifecycle as other models:

Creating a Virtual Environment creates a pointer to the current model snapshot
Modifying the model causes a new snapshot to be created
Any upstream changes cause a new snapshot to be created
The model can be deployed and rolled back via the usual pointer swap mechanism
Once the TTL expires, model snapshots are cleaned up

However, there is usually extra vendor-imposed costs associated with Managed models. For example, Snowflake has additional costs for Dynamic Tables.

Therefore, we try to not create managed tables unnecessarily. For example, in forward-only plans we just create a normal table to preview the changes and only re-create the managed table on deployment to prod.

!!! warning Due to the use of normal tables for dev previews, it is possible to write a query that uses features that are available to normal tables in the target engine but not managed tables. This could result in a scenario where a plan works in a dev environment but fails when deployed to production.

We believe the cost savings are worth it, however please [reach out](https://tobikodata.com/slack) if this causes problems for you.

Supported Engines

SQLMesh supports managed models in the following database engines:

Engine	Implementatation
Snowflake	Dynamic Table

To define a managed model, you can use the MANAGED model Kind.

Snowflake

Managed Models are in Snowflake are implemented as Dynamic Tables.

Here is an example of a SQLMesh model that will result in a dynamic table being created:

MODEL (
  name db.events,
  kind MANAGED,
  physical_properties (
    warehouse = datalake,
    target_lag = '2 minutes',
    data_retention_time_in_days = 2
  )
);

SELECT
  event_date::DATE as event_date,
  event_payload::TEXT as payload
FROM raw_events

results in:

CREATE OR REPLACE DYNAMIC TABLE db.events
  WAREHOUSE = "datalake",
  TARGET_LAG = '2 minutes'
  DATA_RETENTION_TIME_IN_DAYS = 2
AS SELECT
  event_date::DATE as event_date,
  event_payload::TEXT as payload
FROM raw_events

!!! note

SQLMesh will not create intervals and run this model for each interval, so there is no need to add a WHERE clause with date filters like you would for a normal incremental model. How the data in this model is refreshed is completely up to Snowflake.

Table properties

Dynamic Tables have some properties that affect things like how often the data is refreshed by Snowflake, when the initial data is populated, how long data is retained for etc. The list of available properties is located in the Snowflake documentation.

In SQLMesh, these properties are set on the model definition.

The following Dynamic Table properties are set on the model physical_properties:

Snowflake Property	Required	Notes
target_lag	Y
warehouse	N	In Snowflake, this is a required property. However, if not specified, then SQLMesh will use the result of `select current_warehouse()`.
refresh_mode	N
initialize	N
data_retention_time_in_days	N
max_data_extension_time_in_days	N

The following Dynamic Table properties can be set directly on the model:

Snowflake Property	Required	Notes
cluster by	N	`clustered_by` is a standard model property, so set `clustered_by` on the model to add a `CLUSTER BY` clause to the Dynamic Table

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Managed models

Difference from materialized views

Lifecycle in SQLMesh

Supported Engines

Snowflake

Table properties

FilesExpand file tree

managed_models.md

Latest commit

History

managed_models.md

File metadata and controls

Managed models

Difference from materialized views

Lifecycle in SQLMesh

Supported Engines

Snowflake

Table properties