v.24.8 New Feature
Add rows_before_aggregation_at_least Statistic to Query Response
Add the rows_before_aggregation_at_least statistic to the query response when a new setting, rows_before_aggregation, is enabled. This statistic represents the number of rows read before aggregation. In a distributed query that uses a group by clause or the max aggregate function without a limit, rows_before_aggregation_at_least can reflect the number of rows hit by the query. #66084 (morning-color).
Why it matters
This feature helps users understand the volume of data processed before aggregation, especially in distributed queries using group by or max without a limit. It provides insight into query performance and resource usage by reporting the minimum number of rows scanned prior to aggregation.
How to use it
Enable the feature by setting rows_before_aggregation = 1 in your query or session settings. Once enabled, the query response will include the rows_before_aggregation_at_least statistic indicating the number of rows read before aggregation.
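A minimal sketch of both ways to enable the setting. The table name events and the columns user_id and duration are hypothetical placeholders. The use of FORMAT JSON is an assumption: it is a format that reports query statistics, and the exact key under which the new statistic appears in the output is not confirmed by this entry.

```sql
-- Option 1: enable the setting for a single query.
-- `events`, `user_id`, and `duration` are hypothetical names; use your own.
SELECT
    user_id,
    max(duration) AS max_duration
FROM events
GROUP BY user_id
SETTINGS rows_before_aggregation = 1
FORMAT JSON;  -- assumed: a format that reports query statistics

-- Option 2: enable the setting for the whole session,
-- then run aggregation queries as usual.
SET rows_before_aggregation = 1;

SELECT max(duration) AS max_duration
FROM events
FORMAT JSON;
```

The per-query form keeps the setting scoped to one statement, which is convenient when inspecting a single slow query. The session form suits interactive investigation across several queries.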