mirror of
https://github.com/honeymoose/OpenSearch.git
synced 2025-03-27 10:28:28 +00:00
[DOCS] Fixed some bad UTF8
This commit is contained in:
parent
1f497c6678
commit
4c34615686
@ -22,14 +22,14 @@ of these classes. |`[]` (Keep all characters)
|
||||
|=======================================================================
|
||||
|
||||
|
||||
`token_chars` accepts the following character classes:
|
||||
`token_chars` accepts the following character classes:
|
||||
|
||||
[horizontal]
|
||||
`letter`:: for example `a`, `b`, `ï` or `京`
|
||||
`digit`:: for example `3` or `7`
|
||||
`whitespace`:: for example `" "` or `"\n"`
|
||||
`whitespace`:: for example `" "` or `"\n"`
|
||||
`punctuation`:: for example `!` or `"`
|
||||
`symbol`:: for example `$` or `â`
|
||||
`symbol`:: for example `$` or `√`
|
||||
|
||||
[float]
|
||||
==== Example
|
||||
@ -65,16 +65,16 @@ of these classes. |`[]` (Keep all characters)
|
||||
==== `side` deprecated
|
||||
|
||||
There used to be a `side` parameter up to `0.90.1` but it is now deprecated. In
|
||||
order to emulate the behavior of `"side" : "BACK"` a
|
||||
<<analysis-reverse-tokenfilter,`reverse` token filter>> should be used together
|
||||
with the <<analysis-edgengram-tokenfilter,`edgeNGram` token filter>>. The
|
||||
`edgeNGram` filter must be enclosed in `reverse` filters like this:
|
||||
order to emulate the behavior of `"side" : "BACK"` a
|
||||
<<analysis-reverse-tokenfilter,`reverse` token filter>> should be used together
|
||||
with the <<analysis-edgengram-tokenfilter,`edgeNGram` token filter>>. The
|
||||
`edgeNGram` filter must be enclosed in `reverse` filters like this:
|
||||
|
||||
[source,js]
|
||||
--------------------------------------------------
|
||||
"filter" : ["reverse", "edgeNGram", "reverse"]
|
||||
--------------------------------------------------
|
||||
|
||||
which essentially reverses the token, builds front `EdgeNGrams` and reverses
|
||||
which essentially reverses the token, builds front `EdgeNGrams` and reverses
|
||||
the ngram again. This has the same effect as the previous `"side" : "BACK"` setting.
|
||||
|
||||
|
||||
|
@ -17,14 +17,14 @@ tokens, Elasticsearch will split on characters that don't belong to any
|
||||
of these classes. |`[]` (Keep all characters)
|
||||
|=======================================================================
|
||||
|
||||
`token_chars` accepts the following character classes:
|
||||
`token_chars` accepts the following character classes:
|
||||
|
||||
[horizontal]
|
||||
`letter`:: for example `a`, `b`, `ï` or `京`
|
||||
`digit`:: for example `3` or `7`
|
||||
`whitespace`:: for example `" "` or `"\n"`
|
||||
`whitespace`:: for example `" "` or `"\n"`
|
||||
`punctuation`:: for example `!` or `"`
|
||||
`symbol`:: for example `$` or `â`
|
||||
`symbol`:: for example `$` or `√`
|
||||
|
||||
[float]
|
||||
==== Example
|
||||
|
Loading…
x
Reference in New Issue
Block a user