v.21.9New Feature
Zero-copy Replication for ReplicatedMergeTree over HDFS Storage
Zero-copy replication forReplicatedMergeTreeoverHDFSstorage. #25918 (Zhichang Yu).
Why it matters
This feature addresses performance and storage overhead issues during replication by eliminating redundant data copying processes. It allows replication to be faster and more storage-efficient, especially beneficial for large datasets stored inHDFS.How to use it
To use zero-copy replication, configure yourReplicatedMergeTree tables to store data on HDFS and enable the zero-copy replication option. The replication mechanism will then reuse existing data files on HDFS during replication processes automatically.