v.25.12 New Feature

Support reading DeltaLake CDF via deltaLake table function

Support reading DeltaLake CDF via the deltaLake table function with the settings delta_lake_snapshot_start_version and delta_lake_snapshot_end_version. CDF (Change Data Feed) is a Delta Lake feature that captures row-level changes (inserts, updates, and deletes) between versions of a Delta table so that they can be queried. It is enabled in DeltaLake via the table property delta.enableChangeDataFeed. The columns provided along with the data are _change_type, _commit_version, and _commit_timestamp. #90431 (Kseniia Sumarokova).
Added support for reading DeltaLake Change Data Feed (CDF) using the deltaLake table function with configurable snapshot version settings.

Why it matters

This feature lets users query row-level changes, such as inserts, updates, and deletes, between versions of a DeltaLake table. Change data can be analyzed directly in ClickHouse using DeltaLake's CDF capabilities.

How to use it

Use the deltaLake table function with the settings delta_lake_snapshot_start_version and delta_lake_snapshot_end_version to specify the range of table versions to read. Change Data Feed must be enabled on the DeltaLake table by setting the table property delta.enableChangeDataFeed to true. The result includes the special columns _change_type, _commit_version, and _commit_timestamp alongside the table data.
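
A minimal sketch of such a query follows. The bucket URL and the version numbers are hypothetical placeholders, and the Delta table is assumed to already have delta.enableChangeDataFeed set to true on the writer side (for example through Spark SQL, shown only as a comment). Only the function name, the two settings, and the three column names come from the release note; additional arguments such as credentials may be needed depending on where the table is stored.

```sql
-- Assumption: CDF was enabled on the Delta table beforehand, e.g. in Spark SQL:
--   ALTER TABLE events SET TBLPROPERTIES (delta.enableChangeDataFeed = true);

-- Count the changes of each type recorded per commit in the chosen version range.
-- The URL and the version numbers 3 and 7 are illustrative placeholders.
SELECT
    _commit_version,
    _change_type,
    count() AS changed_rows,
    min(_commit_timestamp) AS commit_time
FROM deltaLake('https://example-bucket.s3.amazonaws.com/events_delta/')
GROUP BY _commit_version, _change_type
ORDER BY _commit_version, _change_type
SETTINGS
    delta_lake_snapshot_start_version = 3,
    delta_lake_snapshot_end_version = 7;
```

Grouping by _commit_version and _change_type gives a quick overview of what changed in each commit before pulling the full changed rows.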