v.22.6Improvement
Allow String Type in Arrow/Parquet/ORC Formats with New Settings
Allow to use String type instead of Binary in Arrow/Parquet/ORC formats. This PR introduces 3 new settings for it:output_format_arrow_string_as_string,output_format_parquet_string_as_string,output_format_orc_string_as_string. Default value for all settings isfalse. #37327 (Kruglov Pavel).
Why it matters
This feature addresses the limitation whereString data was exported as Binary type in popular columnar formats, which could cause compatibility or usability issues. By allowing exporting strings as actual String types, it improves interoperability and makes data consumption easier for downstream systems.How to use it
Users can enable the feature by setting one or more of the following boolean settings totrue:-
output_format_arrow_string_as_string-
output_format_parquet_string_as_string-
output_format_orc_string_as_stringThese settings default to
false. For example, to enable for Parquet output, you can run:SET output_format_parquet_string_as_string = 1;