v.24.1New Feature

Added String Similarity Functions: Damerau-Levenshtein, Jaro, and Jaro-Winkler

Added string similarity functions dramerauLevenshteinDistance, jaroSimilarity and jaroWinklerSimilarity. #58531 (Robert Schulze).
Added new string similarity functions damerauLevenshteinDistance, jaroSimilarity, and jaroWinklerSimilarity to ClickHouse for advanced text comparison.

Why it matters

These functions provide methods to measure how similar two strings are, enabling fuzzy matching and approximate text matching directly within ClickHouse queries. This helps users perform more flexible and intelligent text analysis, such as spell checking, record linkage, and data deduplication.

How to use it

Use the new functions in your SQL queries to compare strings. For example, call damerauLevenshteinDistance(string1, string2) to get the edit distance, or jaroSimilarity(string1, string2) / jaroWinklerSimilarity(string1, string2) for similarity scores between 0 and 1.