v.24.8 Improvement

Add Replication Lag and Recovery Time Metrics

Add replication lag and recovery time metrics. #66703 (Miсhael Stetsyuk).
Added new metrics to monitor replication lag and recovery time in ClickHouse clusters.

Why it matters

This change exposes metrics for replication lag and for the time taken for recovery, giving direct insight into replication performance. With them, users can detect replication delays and related issues early, which makes it easier to monitor and troubleshoot data consistency across a cluster.

How to use it

The new replication lag and recovery time metrics are available through ClickHouse's existing system metrics interfaces. No additional configuration is needed: the metrics are collected automatically and can be read by monitoring tools.
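As a rough illustration of reading such metrics from a monitoring script, the Python sketch below queries the system.metrics and system.asynchronous_metrics tables over the ClickHouse HTTP interface. This note does not list the exact metric names or say which system table carries them, so the sketch filters by keyword instead of naming them. The URL, the keyword list, and the helper names (build_query, parse_tsv, fetch_metrics) are assumptions made for the example, and authentication is omitted.

```python
import urllib.request

# Assumption: a ClickHouse HTTP interface on the default port, no authentication.
CLICKHOUSE_URL = "http://localhost:8123/"


def build_query(keywords=("lag", "recovery", "replic")):
    """Build SQL listing metrics whose names contain any of the keywords.

    The keywords are a guess at how the new metrics might be named; they are
    interpolated as-is, so pass only trusted, hard-coded values.
    """
    condition = " OR ".join(f"metric ILIKE '%{kw}%'" for kw in keywords)
    return (
        "SELECT 'metrics' AS source, metric, toFloat64(value) AS value "
        f"FROM system.metrics WHERE {condition} "
        "UNION ALL "
        "SELECT 'asynchronous_metrics' AS source, metric, toFloat64(value) AS value "
        f"FROM system.asynchronous_metrics WHERE {condition} "
        "FORMAT TabSeparated"
    )


def parse_tsv(text):
    """Turn TabSeparated output into (source, metric, value) tuples."""
    rows = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines such as a trailing newline
        source, metric, value = line.split("\t")
        rows.append((source, metric, float(value)))
    return rows


def fetch_metrics(url=CLICKHOUSE_URL, timeout=5.0):
    """POST the query to the HTTP interface and return the parsed rows."""
    request = urllib.request.Request(url, data=build_query().encode("utf-8"))
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_tsv(response.read().decode("utf-8"))


# Example usage against a running server:
#     for source, metric, value in fetch_metrics():
#         print(source, metric, value)
```

If the keyword filter returns nothing relevant, the metrics may be exposed under different names or in another system table on your version; checking the documentation for the release you run is the safer route.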