v.22.8 New Feature

Added Parallel Distributed Insert Support for s3Cluster in ClickHouse

Added support for parallel distributed INSERT SELECT with the s3Cluster table function into tables with the Distributed and Replicated engines. #34670, #39107 (Nikita Mikhaylov).

Why it matters

When the source of an INSERT SELECT is the s3Cluster table function and the target is a Distributed or Replicated table, the insert can now run as a parallel distributed operation. The work of reading files from S3 and writing the rows can be spread across the nodes of the cluster rather than concentrated on a single node, which improves throughput and scalability for bulk loads from S3.

How to use it

Use the s3Cluster table function within an INSERT SELECT query targeting tables that use the Distributed or Replicated engines. This allows parallel execution of data insertion from S3 sources across the cluster.
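
A minimal sketch of such a query is shown here. All names are hypothetical: the cluster `my_cluster`, the target table `events_dist` (assumed to already exist with a Distributed engine), the bucket URL, and the column structure. It also assumes that the `parallel_distributed_insert_select` setting has to be enabled for the insert to be distributed; check the behavior on your ClickHouse version.

```sql
-- Hypothetical target: events_dist is assumed to be an existing table
-- with a Distributed engine defined over the cluster 'my_cluster'.
INSERT INTO events_dist
SELECT *
FROM s3Cluster(
    'my_cluster',                                        -- hypothetical cluster name
    'https://my-bucket.s3.amazonaws.com/events/*.csv',   -- hypothetical bucket and path glob
    'CSV',                                               -- format of the source files
    'id UInt64, ts DateTime, payload String'             -- hypothetical column structure
)
-- Assumption: this setting is what enables the parallel distributed path.
SETTINGS parallel_distributed_insert_select = 1;
```

The same query shape should apply when the target is a table with a Replicated engine; only the table name after INSERT INTO changes.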