v.22.6Improvement

Allow String Type in Arrow/Parquet/ORC Formats with New Settings

Allow to use String type instead of Binary in Arrow/Parquet/ORC formats. This PR introduces 3 new settings for it: output_format_arrow_string_as_string, output_format_parquet_string_as_string, output_format_orc_string_as_string. Default value for all settings is false. #37327 (Kruglov Pavel).
Introduces the ability to output String data types as native String types instead of Binary in Arrow, Parquet, and ORC formats.

Why it matters

This feature addresses the limitation where String data was exported as Binary type in popular columnar formats, which could cause compatibility or usability issues. By allowing exporting strings as actual String types, it improves interoperability and makes data consumption easier for downstream systems.

How to use it

Users can enable the feature by setting one or more of the following boolean settings to true:

- output_format_arrow_string_as_string
- output_format_parquet_string_as_string
- output_format_orc_string_as_string

These settings default to false. For example, to enable for Parquet output, you can run:

SET output_format_parquet_string_as_string = 1;