v.22.12Improvement
Support for Subcolumns of Nested Types from S3 Storage with Parquet, Arrow, and ORC Formats
Support reading of subcolumns of nested types from storageS3and table functions3with formatsParquet,ArrowandORC. #43329 (chen).
Why it matters
This feature enables efficient extraction of nested subcolumns directly from external data sources stored in S3 or accessed via thes3 table function. It reduces the need to read entire nested columns when only specific subcolumns are required, improving query performance and resource usage.How to use it
When reading data fromS3 storage or via the s3 table function in Parquet, Arrow, or ORC formats, users can select nested subcolumns directly in their SELECT query. No additional configuration is required beyond specifying the subcolumns in the query projection.