v.23.7 New Feature

Add arrayJaccardIndex function for Jaccard similarity computation

Adds the arrayJaccardIndex function, which computes the Jaccard similarity between two arrays in ClickHouse. #50076 (FFFFFFFHHHHHHH).

Why it matters

The Jaccard index measures how much two sets overlap: the number of elements they share divided by the number of distinct elements across both. With this function, users can measure the similarity of sets represented as arrays directly within ClickHouse, in a single function call and without external processing.

How to use it

Use the function by passing two arrays as arguments in a query, for example:

SELECT arrayJaccardIndex(array1, array2) FROM table;

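This returns a floating-point value representing the Jaccard similarity between the two input arrays. The value ranges from 0 (no elements in common) to 1 (both arrays contain the same set of elements).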
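
To make the returned value concrete, the Jaccard index of two sets is the size of their intersection divided by the size of their union. The following is a minimal Python sketch of that definition only. It is not ClickHouse's implementation. The helper name jaccard_index is hypothetical, and returning 0.0 for two empty inputs is an assumption of this sketch. ClickHouse's handling of empty arrays and duplicate elements may differ.

```python
def jaccard_index(left, right):
    """Return |A ∩ B| / |A ∪ B| over the distinct elements of the two inputs."""
    # Convert to sets so repeated elements count once (assumption of this sketch).
    a, b = set(left), set(right)
    union = a | b
    if not union:
        # Assumption of this sketch: two empty inputs yield 0.0 rather than an error.
        return 0.0
    return len(a & b) / len(union)


# One shared element (2) out of three distinct elements (1, 2, 3).
print(jaccard_index([1, 2], [2, 3]))
```

Here the arrays [1, 2] and [2, 3] share one element out of three distinct ones, so the index is one third.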