v.22.12Improvement

Support for Subcolumns of Nested Types from S3 Storage with Parquet, Arrow, and ORC Formats

Support reading of subcolumns of nested types from storage S3 and table function s3 with formats Parquet, Arrow and ORC. #43329 (chen).
Support for reading subcolumns of nested data types from S3 storage and the s3 table function when using Parquet, Arrow, and ORC formats.

Why it matters

This feature enables efficient extraction of nested subcolumns directly from external data sources stored in S3 or accessed via the s3 table function. It reduces the need to read entire nested columns when only specific subcolumns are required, improving query performance and resource usage.

How to use it

When reading data from S3 storage or via the s3 table function in Parquet, Arrow, or ORC formats, users can select nested subcolumns directly in their SELECT query. No additional configuration is required beyond specifying the subcolumns in the query projection.