v.22.2Experimental Feature

Functions for Text Classification: Language and Charset Detection

Functions for text classification: language and charset detection. See #23271. #33314 (Nikolay Degterinsky).
Introduces new functions for text classification in ClickHouse that enable language and charset detection.

Why it matters

These functions address the need to automatically identify the language and character set of text data, enhancing data processing and analysis by allowing users to better handle multilingual and multi-charset datasets.

How to use it

Users can apply the new functions directly in their SQL queries to detect the language and charset of text fields. For example, use the language detection function within a SELECT statement to classify text columns by language or charset.