Delta Live Tables (DLT)

Clone-Xs provides comprehensive management for Databricks Delta Live Tables pipelines — discover, clone, monitor, trigger, and integrate DLT pipeline lineage with Unity Catalog.

Field tooltips

Every field in the DLT clone modal (Clone Mode, New Pipeline Name, Workspace URL, PAT, Dry run) has an info icon — hover for a 1-line explanation, especially useful for the cross-workspace path where the target PAT permissions matter.

Overview

Feature	Description
Pipeline Discovery	Browse all DLT pipelines — name, state, health, creator
Pipeline Clone	Clone pipeline definitions to new pipelines (same workspace)
Trigger & Stop	Start pipeline runs (incremental or full refresh) and stop running pipelines
Event Monitoring	View pipeline event logs — errors, warnings, flow progress
Run History	Track pipeline update history with status and timing
Expectation Monitoring	Query DLT expectation results from system tables
Lineage Integration	Map DLT datasets to Unity Catalog tables
Health Dashboard	Aggregate pipeline state, health, and recent events

Quick start

Web UI

Navigate to Operations > Delta Live Tables in the main sidebar.

Three tabs:

Dashboard — stat cards (total, running, failed, idle, healthy, unhealthy), recent events
Pipelines — searchable list of all DLT pipelines, click to view details
Detail — pipeline config, actions (run, full refresh, stop, clone), dataset lineage, event log

API

# List all DLT pipelines
curl /api/dlt/pipelines

# Get pipeline details
curl /api/dlt/pipelines/{pipeline_id}

# Trigger a run
curl -X POST /api/dlt/pipelines/{pipeline_id}/trigger \
  -d '{"full_refresh": false}'

# Stop a pipeline
curl -X POST /api/dlt/pipelines/{pipeline_id}/stop

# Clone a pipeline
curl -X POST /api/dlt/pipelines/{pipeline_id}/clone \
  -d '{"new_name": "My Clone", "dry_run": false}'

# Get event log
curl /api/dlt/pipelines/{pipeline_id}/events?max_events=100

# Get run history
curl /api/dlt/pipelines/{pipeline_id}/updates

# Get DLT-to-UC lineage
curl /api/dlt/pipelines/{pipeline_id}/lineage

# Query expectations from system tables
curl /api/dlt/pipelines/{pipeline_id}/expectations?days=7

# Full dashboard
curl /api/dlt/dashboard

Pipeline clone

Clone a DLT pipeline definition (not data) to a new pipeline:

# Dry run — preview what will be cloned
curl -X POST /api/dlt/pipelines/{id}/clone \
  -d '{"new_name": "prod-pipeline-copy", "dry_run": true}'

# Execute clone
curl -X POST /api/dlt/pipelines/{id}/clone \
  -d '{"new_name": "prod-pipeline-copy"}'

The clone copies: catalog, target schema, libraries (notebooks), cluster config, continuous/serverless flags, configuration, and notifications. The new pipeline is created in development mode by default.

note

If the source pipeline has no notebook libraries (common with serverless/SQL-based DLT pipelines), Clone-Xs automatically creates a placeholder notebook at /Shared/clone-xs/dlt_placeholder_{name} in the destination workspace. Replace this placeholder with your actual pipeline code after cloning.

How it works

Source: src/dlt_management.py

When you'll reach for this: duplicating a pipeline for A/B testing a logic change, pre-staging a DR pipeline in another workspace, or producing a dev copy of a production pipeline. See Use Cases → Delta Live Tables for concrete scenarios.

Pipeline clone is an SDK-only operation — no data is copied, no SQL runs. DLT pipelines are metadata (a spec that tells Databricks how to build tables); the destination pipeline must be triggered separately to actually populate data.

Same-workspace clone:

client.pipelines.get(pipeline_id) → returns the full PipelineSpec (catalog, target schema, libraries, clusters, configuration, continuous / serverless flags, notifications).
Strip runtime-only fields: pipeline_id, creator_user_name.
Set development=True and the new name.
client.pipelines.create(name=<new_name>, spec=…) → returns the new pipeline_id.

Cross-workspace clone:

source_client.pipelines.get() on the source workspace.
Build a destination WorkspaceClient with get_client(host=dest_host, token=dest_token).
Library handling — walk the source spec's libraries list. Each entry is either notebook.path or file.path.
Placeholder path — if the source pipeline has zero libraries (common with serverless / SQL DLT), dest_client.workspace.import_() writes a minimal placeholder notebook to /Shared/clone-xs/dlt_placeholder_<name> so the pipelines.create() call below doesn't 400.
dest_client.pipelines.create(...) in the destination workspace. Cross-workspace clones always land in development mode regardless of source setting — you promote to production manually after review.

What doesn't transfer:

Notebook contents — only the path is in the spec. Copy the notebooks separately with the Databricks workspace CLI or client.workspace.export() → client.workspace.import_() before triggering the cloned pipeline.
Table data — DLT pipelines are definitions, not data. Use a separate catalog clone if you need to migrate the tables the pipeline produces.
Cluster IDs — any existing_cluster_id reference in the source spec will likely 404 on the destination. Clone-Xs warns but doesn't rewrite; update the destination pipeline to use a policy or new cluster config.

Cross-workspace clone

Clone a DLT pipeline to a different Databricks workspace:

# Dry run preview
curl -X POST /api/dlt/pipelines/{id}/clone-to-workspace \
  -d '{
    "new_name": "prod-pipeline-DR",
    "dest_host": "https://adb-xxx.azuredatabricks.net",
    "dest_token": "dapi_dest_token",
    "dry_run": true
  }'

# Execute cross-workspace clone
curl -X POST /api/dlt/pipelines/{id}/clone-to-workspace \
  -d '{
    "new_name": "prod-pipeline-DR",
    "dest_host": "https://adb-xxx.azuredatabricks.net",
    "dest_token": "dapi_dest_token"
  }'

The UI provides a clone modal with a Same Workspace / Different Workspace toggle. For cross-workspace, enter the destination workspace URL and PAT token.

note

Cross-workspace clones always create the new pipeline in development mode for safety. You can switch it to production mode in the destination workspace after review.

caution

Notebook references in the pipeline may not exist in the destination workspace. Copy the notebooks separately before running the cloned pipeline.
For serverless/SQL DLT pipelines with no notebooks, Clone-Xs creates a placeholder notebook automatically. Replace it with your actual code.

Expectation monitoring

DLT expectations (data quality rules defined in pipeline code) are tracked in system tables. Clone-Xs queries system.lakeflow.pipeline_events to surface:

Quality violations — expectations that failed
Flow progress — dataset transformation completion
Errors and warnings — pipeline-level issues

# Get expectation results for the last 7 days
curl /api/dlt/pipelines/{pipeline_id}/expectations?days=7

note

System tables (system.lakeflow.*) must be enabled in your Databricks workspace. If they are not available, expectation queries will return empty results.

Lineage integration

Clone-Xs maps DLT pipeline datasets to Unity Catalog tables by querying information_schema.tables in the pipeline's target schema:

curl /api/dlt/pipelines/{pipeline_id}/lineage

Response:

{
  "pipeline_id": "abc-123",
  "pipeline_name": "ETL Pipeline",
  "catalog": "prod",
  "target_schema": "bronze",
  "datasets": [
    {"fqn": "prod.bronze.raw_events", "type": "TABLE", "format": "DELTA"},
    {"fqn": "prod.bronze.raw_users", "type": "TABLE", "format": "DELTA"}
  ],
  "total_datasets": 2
}

This connects DLT pipeline lineage with Clone-Xs clone lineage — you can trace data from DLT source to cloned destination.

API reference

Method	Endpoint	Description
`GET`	`/api/dlt/pipelines`	List all DLT pipelines
`GET`	`/api/dlt/pipelines/{id}`	Get pipeline details + config
`POST`	`/api/dlt/pipelines/{id}/trigger`	Trigger pipeline run
`POST`	`/api/dlt/pipelines/{id}/stop`	Stop running pipeline
`POST`	`/api/dlt/pipelines/{id}/clone`	Clone pipeline (same workspace)
`POST`	`/api/dlt/pipelines/{id}/clone-to-workspace`	Clone pipeline (cross-workspace)
`GET`	`/api/dlt/pipelines/{id}/events`	Get event log
`GET`	`/api/dlt/pipelines/{id}/updates`	Get run history
`GET`	`/api/dlt/pipelines/{id}/lineage`	Map datasets to UC tables
`GET`	`/api/dlt/pipelines/{id}/expectations`	Query expectation results
`GET`	`/api/dlt/dashboard`	Full health dashboard

Next steps

Clone — clone Unity Catalog objects (tables, views, functions)
Observability — unified health dashboard including DLT status
Pipelines — Clone-Xs custom clone workflows (different from DLT)
Lakehouse Monitor — quality monitoring for Delta tables

Overview​

Quick start​

Web UI​

API​

Pipeline clone​

How it works​

Cross-workspace clone​

Expectation monitoring​

Lineage integration​

API reference​

Next steps​

Overview

Quick start

Web UI

API

Pipeline clone

How it works

Cross-workspace clone

Expectation monitoring

Lineage integration

API reference

Next steps