v.25.4 Experimental Feature

Functions sparseGrams and sparseGramsHashes

Added the functions sparseGrams and sparseGramsHashes, each with an ASCII and a UTF8 version. Author: scanhex12. #78176 (Pervakov Grigorii). Do not use them: the implementation will change in upcoming versions.

Why it matters

These functions generate sparse n-gram representations of text (sparseGrams) and the hash values of those n-grams (sparseGramsHashes), giving ClickHouse users another building block for text processing and analysis.

How to use it

Call sparseGrams or sparseGramsHashes on a text column, choosing the ASCII or UTF8 version to match the input encoding. These functions are experimental, and their implementation may change in future ClickHouse versions.
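A minimal sketch of what such calls might look like, using string literals in place of a text column. The changelog entry does not spell out the argument list or the exact names of the UTF8 versions, so the single-argument form and the names sparseGramsUTF8 and sparseGramsHashesUTF8 are assumptions here. Check the function reference for your ClickHouse version before relying on them.

```sql
-- Sketch only: the single-argument call form is an assumption,
-- not something confirmed by this changelog entry.
SELECT
    sparseGrams('clickhouse')       AS grams,        -- sparse n-grams of the string
    sparseGramsHashes('clickhouse') AS gram_hashes;  -- hashes of those n-grams

-- Assumed names for the UTF8 versions, intended for non-ASCII input.
SELECT
    sparseGramsUTF8('кликхаус')       AS grams_utf8,
    sparseGramsHashesUTF8('кликхаус') AS gram_hashes_utf8;
```

Because the implementation is expected to change, results produced by these functions should not be persisted or compared across ClickHouse versions.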