v.23.3Improvement

Use Parquet format v2 by default with new version control setting

Use Parquet format v2 instead of v1 in output format by default. Add setting output_format_parquet_version to control parquet version, possible values 1.0, 2.4, 2.6, 2.latest (default). #46617 (Kruglov Pavel).
ClickHouse now uses Parquet format version 2 by default for the output format, replacing the previous default version 1.

Why it matters

This change improves compatibility and performance by adopting the newer Parquet format version 2, which supports more features and optimizations compared to version 1. It allows users to take advantage of enhanced Parquet capabilities for data export.

How to use it

Users can control the Parquet format version used in output by setting the output_format_parquet_version setting. Possible values are 1.0, 2.4, 2.6, and 2.latest (which is the new default). For example, to specify version 2.6 in a query, set the parameter as shown:

SET output_format_parquet_version = '2.6'
SELECT * FROM table_name FORMAT Parquet