Namespace deletion

Overview

When a namespace loses access to the Knowledge Graph, all of its indexed data needs to be removed. This includes every node and edge in the graph tables, plus the SDLC and code indexing checkpoints that track indexing progress for that namespace.

Deletion is not immediate. A 30-day grace period gives operators time to reverse the decision before any data is removed. The scheduler detects deleted namespaces and queues them for future removal. The handler does the actual deletion once the grace period passes.

How detection works

The knowledge_graph_enabled_namespaces table in the datalake tracks which namespaces have access to the Knowledge Graph. When a namespace is disabled, Siphon's CDC pipeline sets _siphon_deleted = true on the corresponding row. The deletion scheduler picks this up on its next run.

The scheduler runs every 24 hours. Each run has two phases:

Phase 1: Record newly deleted namespaces. The scheduler loads its last checkpoint watermark and queries the datalake for any knowledge_graph_enabled_namespaces rows where _siphon_deleted = true within the watermark window. For each match, it inserts a row into the namespace_deletion_schedule table with scheduled_deletion_date set to 30 days from now. It then advances the checkpoint.

Phase 2: Dispatch due deletions. The scheduler queries namespace_deletion_schedule for entries where scheduled_deletion_date <= now(). For each one, it publishes a NamespaceDeletionRequest to NATS. If NATS rejects the publish as a duplicate (the previous request is still in-flight), the scheduler skips it and moves on.

-- Finding newly deleted namespaces
SELECT
    enabled.root_namespace_id AS namespace_id,
    CONCAT(toString(namespaces.organization_id), '/',
           toString(enabled.root_namespace_id), '/') AS traversal_path
FROM siphon_knowledge_graph_enabled_namespaces AS enabled
INNER JOIN siphon_namespaces AS namespaces
    ON enabled.root_namespace_id = namespaces.id
WHERE enabled._siphon_deleted = true
  AND enabled._siphon_replicated_at > {last_watermark}
  AND enabled._siphon_replicated_at <= {watermark}

How deletion works

The NamespaceDeletionHandler consumes NamespaceDeletionRequest messages from NATS. Each message contains a namespace_id and a traversal_path (formatted as <org_id>/<namespace_id>/).

The handler follows these steps:

Validate the traversal path. The path must match <org_id>/<namespace_id>/ where both segments are numeric. An empty or malformed path would cause startsWith(traversal_path, '') to match every row in every table, so the handler rejects anything that does not fit the expected format.
Check if the namespace is still deleted. Between scheduling and execution, an operator may have re-enabled the namespace. The handler queries the datalake to check the current state. If the namespace was re-enabled, the handler clears the schedule entry without touching any data and returns early.
Soft-delete graph data. For every namespaced node table and all configured edge tables, the handler runs an INSERT INTO ... SELECT that copies matching rows with _deleted = true and a fresh _version timestamp. The list of tables comes from the ontology at startup, so adding a new entity type or edge table to the ontology automatically includes it in namespace deletion. If any table fails, the handler stops and returns an error without proceeding to the next steps.
Soft-delete checkpoints. Once all graph data has been marked deleted, the handler removes the SDLC checkpoints (keyed by namespace position, e.g. ns.42.Project) and the code indexing checkpoints (keyed by traversal path prefix). This prevents stale checkpoints from interfering if the namespace is later re-enabled and re-indexed from scratch.
Mark deletion complete. The handler soft-deletes the namespace_deletion_schedule entry so the scheduler does not dispatch it again.

-- Soft-delete pattern for graph tables (generated per table from the ontology)
INSERT INTO {table} ({sort_key_columns}, _deleted, _version)
SELECT {sort_key_columns}, true, now64(6)
FROM {table}
WHERE startsWith(traversal_path, {traversal_path:String})
  AND _deleted = false

Relationship to row-level deletion

Namespace deletion is separate from the row-level soft-delete that flows through Siphon's CDC pipeline. When an individual row is deleted in the source PostgreSQL database, Siphon sets _siphon_deleted = true, and the SDLC indexer carries the _deleted flag through to the graph table during normal ETL. Namespace deletion removes an entire namespace and all of its data at once.

Data model

The namespace_deletion_schedule table tracks pending and completed deletions:

CREATE TABLE namespace_deletion_schedule (
    namespace_id Int64,
    traversal_path String,
    scheduled_deletion_date DateTime64(6, 'UTC'),
    _deleted Boolean DEFAULT false,
    _version DateTime64(6, 'UTC') DEFAULT now64(6)
) ENGINE = ReplacingMergeTree(_version)
ORDER BY (namespace_id, traversal_path);

The scheduler writes to this table when it detects a deleted namespace. The handler reads from it (indirectly, via the scheduler's dispatch) and soft-deletes the entry when deletion is complete or when the namespace was re-enabled.

Error handling

Table deletion is all-or-nothing. If any graph table fails to soft-delete, the handler returns an error and NATS redelivers the message on the next attempt. Checkpoints and the schedule entry are only cleaned up after every table has been processed, so a transient ClickHouse failure will not leave the namespace in a state where checkpoints are gone but graph data remains.

If the mark_deletion_complete step fails after data and checkpoints have been deleted, the message will be redelivered. The next run will re-execute the soft-deletes, which is safe because the queries only affect rows where _deleted = false.

Observability

The namespace deletion module emits two metrics under the indexer_namespace_deletion meter:

Metric	Type	Description
`indexer.namespace_deletion.table.duration`	Histogram	Duration of each table's soft-delete query
`indexer.namespace_deletion.table.errors`	Counter	Per-table deletion failures

The scheduler uses the shared scheduler meter documented in observability: scheduler.task.runs, scheduler.task.requests.published, and scheduler.task.requests.skipped.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Namespace deletion

Overview

How detection works

How deletion works

Relationship to row-level deletion

Data model

Error handling

Observability

FilesExpand file tree

namespace_deletion.md

Latest commit

History

namespace_deletion.md

File metadata and controls

Namespace deletion

Overview

How detection works

How deletion works

Relationship to row-level deletion

Data model

Error handling

Observability