diff --git a/docs/plugins/analysis-kuromoji.asciidoc b/docs/plugins/analysis-kuromoji.asciidoc index 71bbdc231f4..4b1f3408882 100644 --- a/docs/plugins/analysis-kuromoji.asciidoc +++ b/docs/plugins/analysis-kuromoji.asciidoc @@ -33,9 +33,8 @@ characters, such as `o` and `f`. If a text contains full-width characters, the tokenizer can produce unexpected tokens. For example, the `kuromoji_tokenizer` tokenizer converts the text -`Culture of Japan` to the tokens `[ culture, o, f, japan ]` by -default. However, a user may expect the tokenizer to instead produce -`[ culture, of, japan ]`. +`Culture of Japan` to the tokens `[ culture, o, f, japan ]` +instead of `[ culture, of, japan ]`. To avoid this, add the <> to a custom analyzer based on the `kuromoji` analyzer. The