v.23.5Improvement
Parquet Writer Optimizes Row Group Size with OUTFILE Usage
Parquet writer now uses reasonable row group size when invoked through OUTFILE. #49325 (Michael Kolupaev).Why it matters
This feature addresses the issue of inefficient row group sizes in Parquet files created viaOUTFILE. By using a more appropriate row group size, it improves file compression and query performance when reading these files, providing better storage efficiency and faster analytics.How to use it
When exporting data to Parquet format usingOUTFILE, the writer will automatically apply the optimized row group size without any additional configuration from the user.