v.23.3 Improvement
Use Parquet format v2 by default with new version control setting
Use Parquet format v2 instead of v1 in output format by default. Add setting output_format_parquet_version to control parquet version, possible values 1.0, 2.4, 2.6, 2.latest (default). #46617 (Kruglov Pavel).
Why it matters
Parquet output now uses format version 2 by default. Version 2 supports more features and optimizations than version 1, so exported data can take advantage of the newer Parquet capabilities. Consumers that can only read version 1 files can still be served by setting the version back to 1.0.
How to use it
Set output_format_parquet_version to choose the Parquet format version used for output. Possible values are 1.0, 2.4, 2.6, and 2.latest (the new default). For example, to write version 2.6:
SET output_format_parquet_version = '2.6';
SELECT * FROM table_name FORMAT Parquet;
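The setting can also be scoped to a single query rather than the whole session. The following is a minimal sketch, assuming the placeholder table table_name from the example above and a hypothetical downstream reader that only accepts version 1 files. It first checks the value currently in effect through the system.settings table, then overrides it for one query with a SETTINGS clause.

```sql
-- Inspect the value currently in effect for this session.
SELECT name, value, changed
FROM system.settings
WHERE name = 'output_format_parquet_version';

-- Override the version for this query only; the session default is untouched.
-- table_name is a placeholder, and '1.0' is chosen here purely for illustration.
SELECT *
FROM table_name
SETTINGS output_format_parquet_version = '1.0'
FORMAT Parquet;
```

A per-query override is convenient when most exports should stay on the default and only a few consumers need an older version.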