Skip to main content

DQ Automation

DQ Automation covers three pages — Active Jobs, Recon Schedules, and Auto-Remediation — that turn point-in-time DQ tools into recurring, hands-off operations.

Active Jobs (/data-quality/jobs)

Live status of all DQ-related Databricks jobs:

  • Rule-execution jobs (per-engine: DQX, GX, Recon)
  • Profiling jobs
  • Volume / freshness snapshot jobs
  • Schema-drift detector jobs

Each row shows job name, last-run status, duration, schedule, next-run timestamp, and quick actions:

  • Run now — trigger immediately
  • Pause / resume — disable without deleting
  • View runs — open the Databricks Jobs UI
  • Edit — modify schedule, scope, or notification
GET /data-quality/jobs
POST /data-quality/jobs/{id}/run
POST /data-quality/jobs/{id}/pause

Recon Schedules (/data-quality/recon-schedules)

Recurring Reconciliation runs. Each schedule specifies:

FieldNotes
NameHuman label
Source / target FQNTables to compare
Moderow-level / column-level / deep
CadenceCron expression
On-fail actionalert, incident, auto-remediate
Diff retentionDays to keep diff CSVs

Common pattern:

  • Hourly row-level on critical clones
  • Daily column-level on the same set
  • Weekly deep-diff for monthly compliance
GET  /data-quality/recon-schedules
POST /data-quality/recon-schedules
DELETE /data-quality/recon-schedules/{id}

Auto-Remediation (/data-quality/remediation)

Playbooks that automatically fix common DQ problems. Examples shipped:

  • Backfill from upstream — when a recent partition is empty, re-run the source job
  • Deduplicate — when an anomaly flags duplicates, run a MERGE to drop them
  • Re-clone — when reconciliation diverges, re-run the clone job
  • Rollback — when a critical rule fails post-clone, rollback the destination

Each playbook has:

  • Trigger — anomaly type, incident severity, rule failure
  • Action — Python or SQL block (run in a sandboxed Databricks job)
  • Approvalauto (run immediately) or manual (require human approve)
GET  /data-quality/remediation/playbooks
POST /data-quality/remediation/playbooks
POST /data-quality/remediation/playbooks/{id}/trigger # manual run
GET /data-quality/remediation/runs # execution history

Safety

  • Auto-run playbooks have a per-day rate limit (default 5) to prevent runaway loops
  • Every remediation writes to Audit Trail
  • Failed remediations promote to Incidents with severity critical