v.25.12New Feature
Support reading DeltaLake CDF via deltaLake table
Support reading DeltaLake CDF viadeltaLaketable function with settingsdelta_lake_snapshot_start_version,delta_lake_snapshot_end_version. The CDF (Change Data Feed, a feature that lets you automatically capture and query row-level data changes—such as inserts, updates, and deletes—between versions of a Delta table) is enabled in DeltaLake viadelta.enableChangeDataFeed. The columns provided along with the data are_change_type,_commit_version,_commit_timestamp. #90431 (Kseniia Sumarokova).
Why it matters
This feature enables users to automatically capture and query row-level changes—such as inserts, updates, and deletes—between versions of a DeltaLake table. It provides a convenient way to analyze change data directly in ClickHouse, leveraging DeltaLake's CDF capabilities.How to use it
Use thedeltaLake table function with the settings delta_lake_snapshot_start_version and delta_lake_snapshot_end_version to specify the range of versions to read. Ensure that Change Data Feed is enabled in DeltaLake by setting delta.enableChangeDataFeed. The result includes special columns: _change_type, _commit_version, and _commit_timestamp.