v.23.9Improvement

Added setting to control inclusion of remaining string in splitBy functions

Added a setting splitby_max_substrings_includes_remaining_string which controls if functions "splitBy*()" with argument "max_substring" > 0 include the remaining string (if any) in the result array (Python/Spark semantics) or not. The default behavior does not change. #54518 (Robert Schulze).
Introduced a new setting splitby_max_substrings_includes_remaining_string to control whether splitBy* functions with a positive max_substring argument include the remaining string as the last element in the result array.

Why it matters

This feature addresses the inconsistency in the behavior of splitBy* functions regarding the inclusion of the remaining substring when max_substring is greater than zero. It allows users to choose a split behavior that matches Python and Spark semantics, improving compatibility and flexibility in string splitting operations.

How to use it

Set the splitby_max_substrings_includes_remaining_string setting to true to enable inclusion of the remaining substring in the result array when using splitBy* functions with max_substring > 0. The default value is false, preserving the existing behavior.