Data Quality - Quarantine

Applies To:

Pipeline Bundle

Configuration Scope:

Data Flow Spec

Databricks Docs:

NA

The Lakeflow Framework provides a quarantine feature that allows you to quarantine records that violate defined pipeline expectations.

There are multiple ways to handle quarantined records and these can be configured using the quarantineMode property in the Data Flow Spec. Available options are:

  • off: The quarantine feature is disabled

  • flag: The quarantined records are flagged in the target table

  • table: The quarantined records are stored in a separate quarantine table

If the quarantineMode property is set to table, the quarantineTargetDetails property can be set in the Data Flow Spec to define the details of the quarantine table, otherwise the quarantine table will be derived based of the main target table.

Configuration

Set as an attribute when creating your Data Flow Spec, refer to the Data Quality and Quarantine Configuration documentation for more information.