v.24.11New Feature

Add Table Functions for Iceberg, Delta Lake, and Hudi Clusters

Add iceberg[S3;HDFS;Azure]Cluster, deltaLakeCluster, hudiCluster table functions. #72045 (Mikhail Artemenko).
Introduces new table functions iceberg[S3;HDFS;Azure]Cluster, deltaLakeCluster, and hudiCluster to enable querying of Iceberg, Delta Lake, and Hudi datasets directly within ClickHouse.

Why it matters

These functions allow users to natively access and query data stored in popular open table formats like Iceberg, Delta Lake, and Hudi across various storage systems (S3, HDFS, Azure). This simplifies data integration and analytics workflows by enabling seamless interaction with external lakehouse data without manual data movement or external tools.

How to use it

Use the table functions by calling them in the FROM clause of your SQL queries. For example, to query an Iceberg dataset on S3, use SELECT * FROM icebergS3Cluster('your_parameters'). Similarly, use deltaLakeCluster and hudiCluster with appropriate connection parameters to access Delta Lake and Hudi datasets. Refer to the function signatures to provide necessary connection and table details.