v.22.4 Improvement

Use minmax index for ORC/Parquet files in Hive Engine

Use minmax index for ORC/Parquet files in Hive Engine. Related PR: https://github.com/ClickHouse/arrow/pull/10. #34631 (李扬).
Added support for minmax indexes on ORC and Parquet files in the Hive engine, which improves query performance by letting ClickHouse skip data that cannot match a query's filter.

Why it matters

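Queries over ORC and Parquet files through the Hive engine can be slow when every data block has to be read and filtered. Both formats record minimum and maximum values per column in their metadata, and this feature uses those values as a minmax index. When a query's filter range does not overlap a block's [min, max] range, ClickHouse can skip that block without reading it, which reduces IO and shortens query times.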
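The skipping idea can be illustrated with a small Python sketch. This is a conceptual model only, not ClickHouse's implementation: the `Block` class, `build_block`, and `scan_range` are hypothetical names, and each `Block` stands in for a unit of file data (such as an ORC stripe or a Parquet row group) that carries min/max statistics.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Block:
    """Hypothetical stand-in for a chunk of rows with min/max statistics."""
    min_value: int
    max_value: int
    rows: List[int]


def build_block(rows: List[int]) -> Block:
    # Record the minimum and maximum once, when the block is written.
    return Block(min(rows), max(rows), list(rows))


def scan_range(blocks: List[Block], low: int, high: int) -> Tuple[List[int], int]:
    """Return the values in [low, high] and the number of blocks actually read."""
    matches: List[int] = []
    blocks_read = 0
    for block in blocks:
        # If the block's range does not overlap the filter range,
        # no row in it can match, so it is skipped without being read.
        if block.max_value < low or block.min_value > high:
            continue
        blocks_read += 1
        matches.extend(v for v in block.rows if low <= v <= high)
    return matches, blocks_read


blocks = [build_block(r) for r in ([1, 5, 9], [10, 14, 19], [20, 25, 29])]
matches, blocks_read = scan_range(blocks, 12, 15)
print(matches, blocks_read)  # [14] 1
```

In this toy data set only one of the three blocks overlaps the filter range 12 to 15, so two blocks are never read. The benefit depends on how the data is laid out: if values are spread evenly across all blocks, every block's range overlaps the filter and nothing can be skipped.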

How to use it

The Hive engine applies the minmax index automatically when it reads ORC or Parquet files, so existing queries do not need to change. Queries that filter on a column, for example with a range or equality condition in WHERE, benefit most, because those conditions can be compared against the stored minimum and maximum values to decide which data to skip.