v.22.4Improvement
Use minmax index for ORC/Parquet files in Hive Engine
Use minmax index for orc/parquet file in Hive Engine. Related PR: https://github.com/ClickHouse/arrow/pull/10. #34631 (李扬).Why it matters
This feature addresses the problem of slow query performance when reading ORC and Parquet files through the Hive engine by leveragingminmax indexes. These indexes allow ClickHouse to skip irrelevant data blocks during query execution, reducing IO and speeding up query times.How to use it
Theminmax index on ORC and Parquet files is enabled automatically in the Hive engine when reading these file formats. Users do not need to change their queries; simply use the Hive engine to query ORC or Parquet files, and the index will optimize the data scanning process.