mirror of
https://github.com/honeymoose/OpenSearch.git
synced 2025-02-17 10:25:15 +00:00
Docs: Fixed bad ref on cjk_width/bigram pages
This commit is contained in:
parent
5e40868f44
commit
e323e577e8
@ -3,7 +3,7 @@
|
|||||||
|
|
||||||
The `cjk_bigram` token filter forms bigrams out of the CJK
|
The `cjk_bigram` token filter forms bigrams out of the CJK
|
||||||
terms that are generated by the <<analysis-standard-tokenizer,`standard` tokenizer>>
|
terms that are generated by the <<analysis-standard-tokenizer,`standard` tokenizer>>
|
||||||
or the `icu_tokenizer` (see <<icu-analysis-plugin>>).
|
or the `icu_tokenizer` (see <<analysis-icu-plugin>>).
|
||||||
|
|
||||||
By default, when a CJK character has no adjacent characters to form a bigram,
|
By default, when a CJK character has no adjacent characters to form a bigram,
|
||||||
it is output in unigram form. If you always want to output both unigrams and
|
it is output in unigram form. If you always want to output both unigrams and
|
||||||
|
@ -7,6 +7,6 @@ The `cjk_width` token filter normalizes CJK width differences:
|
|||||||
* Folds halfwidth Katakana variants into the equivalent Kana
|
* Folds halfwidth Katakana variants into the equivalent Kana
|
||||||
|
|
||||||
NOTE: This token filter can be viewed as a subset of NFKC/NFKD
|
NOTE: This token filter can be viewed as a subset of NFKC/NFKD
|
||||||
Unicode normalization. See the <<icu-analysis-plugin>>
|
Unicode normalization. See the <<analysis-icu-plugin>>
|
||||||
for full normalization support.
|
for full normalization support.
|
||||||
|
|
||||||
|
Loading…
x
Reference in New Issue
Block a user