[[analysis-smartcn]]
=== Smart Chinese Analysis Plugin

The Smart Chinese Analysis plugin integrates Lucene's Smart Chinese analysis
module into Elasticsearch.

It provides an analyzer for Chinese or mixed Chinese-English text. This
analyzer uses probabilistic knowledge to find the optimal word segmentation
for Simplified Chinese text. The text is first broken into sentences, then
each sentence is segmented into words.

[[analysis-smartcn-install]]
[float]
==== Installation

This plugin can be installed using the plugin manager:

[source,sh]
----------------------------------------------------------------
sudo bin/elasticsearch-plugin install analysis-smartcn
----------------------------------------------------------------
// NOTCONSOLE

The plugin must be installed on every node in the cluster, and each node must
be restarted after installation.

[[analysis-smartcn-remove]]
[float]
==== Removal

The plugin can be removed with the following command:

[source,sh]
----------------------------------------------------------------
sudo bin/elasticsearch-plugin remove analysis-smartcn
----------------------------------------------------------------
// NOTCONSOLE

The node must be stopped before removing the plugin.

[[analysis-smartcn-tokenizer]]
[float]
==== `smartcn` tokenizer and token filter

The plugin provides the `smartcn` analyzer and the `smartcn_tokenizer` tokenizer,
neither of which is configurable.

NOTE: The `smartcn_word` token filter and the `smartcn_sentence` tokenizer have been deprecated.
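
The analyzer and tokenizer named above can be tried out with the `_analyze`
API. The requests below are a minimal sketch, assuming the plugin is installed
on the node that receives them. The sample text is an arbitrary illustration
and is not taken from the plugin itself.

[source,js]
----------------------------------------------------------------
GET _analyze
{
  "analyzer": "smartcn",
  "text": "我喜欢搜索引擎"
}

GET _analyze
{
  "tokenizer": "smartcn_tokenizer",
  "text": "我喜欢搜索引擎"
}
----------------------------------------------------------------
// NOTCONSOLE

The first request runs the full `smartcn` analyzer, while the second runs only
the tokenizer, so comparing the two responses shows what the analyzer does
beyond word segmentation.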