v.23.9Improvement
Added setting to control inclusion of remaining string in splitBy functions
Added a setting splitby_max_substrings_includes_remaining_string which controls if functions "splitBy*()" with argument "max_substring" > 0 include the remaining string (if any) in the result array (Python/Spark semantics) or not. The default behavior does not change. #54518 (Robert Schulze).Why it matters
This feature addresses the inconsistency in the behavior ofsplitBy* functions regarding the inclusion of the remaining substring when max_substring is greater than zero. It allows users to choose a split behavior that matches Python and Spark semantics, improving compatibility and flexibility in string splitting operations.How to use it
Set thesplitby_max_substrings_includes_remaining_string setting to true to enable inclusion of the remaining substring in the result array when using splitBy* functions with max_substring > 0. The default value is false, preserving the existing behavior.