v.23.5 Improvement

Don't Overestimate Processed Data Size for LowCardinality Columns with Shared Dictionaries

Don't overestimate the size of processed data for LowCardinality columns when they share dictionaries between blocks. This closes #49322. See also #48745. #49323 (Alexey Milovidov).
This improves the accuracy of processed-data size estimates for LowCardinality columns by correctly accounting for dictionaries shared between data blocks.

Why it matters

Before this change, the size of processed data was overestimated for LowCardinality columns that share dictionaries across multiple blocks. With the fix, the reported processed-data size is closer to the actual volume, which makes these metrics more useful for monitoring, resource planning, and query tuning.
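
The effect can be illustrated with a small arithmetic model. The sketch below is not ClickHouse's implementation; the `Block` class, its fields, and the byte sizes are hypothetical and chosen only to show how counting a shared dictionary once per block inflates the total compared with counting it once overall.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    # All fields are hypothetical, for illustration only.
    dictionary_id: int     # identifies the dictionary this block references
    dictionary_bytes: int  # size of that dictionary
    index_bytes: int       # size of this block's own index data


def naive_size(blocks):
    """Count the dictionary for every block, even when it is shared."""
    return sum(b.dictionary_bytes + b.index_bytes for b in blocks)


def shared_aware_size(blocks):
    """Count each distinct dictionary once, plus every block's indexes."""
    seen = set()
    total = 0
    for b in blocks:
        total += b.index_bytes
        if b.dictionary_id not in seen:
            seen.add(b.dictionary_id)
            total += b.dictionary_bytes
    return total


# Ten blocks that all reference the same 1,000,000-byte dictionary.
blocks = [
    Block(dictionary_id=1, dictionary_bytes=1_000_000, index_bytes=65_536)
    for _ in range(10)
]

print(naive_size(blocks))         # 10655360
print(shared_aware_size(blocks))  # 1655360
```

In this toy model the naive total is more than six times the shared-aware total, and the gap grows with the number of blocks that reuse the same dictionary.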

How to use it

This improvement applies automatically in ClickHouse. No setting needs to be enabled; size estimation for LowCardinality columns now accounts for shared dictionaries by default.