mirror of
https://github.com/honeymoose/OpenSearch.git
synced 2025-02-09 06:25:07 +00:00
Referring to stop analyzer for stopword docs #329
This commit is contained in:
parent
22a96e6a18
commit
ba8e012be9
@ -49,14 +49,6 @@ index :
|
|||||||
stopwords : [test1, test2, test3]
|
stopwords : [test1, test2, test3]
|
||||||
--------------------------------------------------
|
--------------------------------------------------
|
||||||
|
|
||||||
[[analyzers-stopwords]]
|
|
||||||
=== Stopwords
|
|
||||||
|
|
||||||
The `stopwords` parameter can be used to provide a custom set of stopwords. As
|
|
||||||
certain analyzers use a default list of stopwords while others don't, please
|
|
||||||
check out the individual analyzer sections. In case you want the analyzer to
|
|
||||||
use no stopwords at all, simply provide `stopwords: _none_`
|
|
||||||
|
|
||||||
Below is a list of the built in analyzers.
|
Below is a list of the built in analyzers.
|
||||||
|
|
||||||
include::analyzers/standard-analyzer.asciidoc[]
|
include::analyzers/standard-analyzer.asciidoc[]
|
||||||
|
@ -11,7 +11,8 @@ following types are supported: `arabic`, `armenian`, `basque`,
|
|||||||
|
|
||||||
All analyzers support setting custom `stopwords` either internally in
|
All analyzers support setting custom `stopwords` either internally in
|
||||||
the config, or by using an external stopwords file by setting
|
the config, or by using an external stopwords file by setting
|
||||||
`stopwords_path`.
|
`stopwords_path`. Check <<analysis-stop-analyzer,Stop Analyzer>> for
|
||||||
|
more details.
|
||||||
|
|
||||||
The following analyzers support setting custom `stem_exclusion` list:
|
The following analyzers support setting custom `stem_exclusion` list:
|
||||||
`arabic`, `armenian`, `basque`, `brazilian`, `bulgarian`, `catalan`,
|
`arabic`, `armenian`, `basque`, `brazilian`, `bulgarian`, `catalan`,
|
||||||
|
@ -15,7 +15,8 @@ type:
|
|||||||
|`flags` |The regular expression flags.
|
|`flags` |The regular expression flags.
|
||||||
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
||||||
Defaults to an 'empty' stopword list coming[1.0.0.RC1, Previously
|
Defaults to an 'empty' stopword list coming[1.0.0.RC1, Previously
|
||||||
defaulted to the English stopwords list]
|
defaulted to the English stopwords list]. Check
|
||||||
|
<<analysis-stop-analyzer,Stop Analyzer>> for more details.
|
||||||
|===================================================================
|
|===================================================================
|
||||||
|
|
||||||
*IMPORTANT*: The regular expression should match the *token separators*,
|
*IMPORTANT*: The regular expression should match the *token separators*,
|
||||||
|
@ -41,8 +41,9 @@ filter>> and defaults to `English`. Note that not all the language
|
|||||||
analyzers have a default set of stopwords provided.
|
analyzers have a default set of stopwords provided.
|
||||||
|
|
||||||
The `stopwords` parameter can be used to provide stopwords for the
|
The `stopwords` parameter can be used to provide stopwords for the
|
||||||
languages that has no defaults, or to simply replace the default set
|
languages that have no defaults, or to simply replace the default set
|
||||||
with your custom list. A default set of stopwords for many of these
|
with your custom list. Check <<analysis-stop-analyzer,Stop Analyzer>>
|
||||||
|
for more details. A default set of stopwords for many of these
|
||||||
languages is available from for instance
|
languages is available from for instance
|
||||||
https://github.com/apache/lucene-solr/tree/trunk/lucene/analysis/common/src/resources/org/apache/lucene/analysis/[here]
|
https://github.com/apache/lucene-solr/tree/trunk/lucene/analysis/common/src/resources/org/apache/lucene/analysis/[here]
|
||||||
and
|
and
|
||||||
|
@ -19,7 +19,8 @@ type:
|
|||||||
|Setting |Description
|
|Setting |Description
|
||||||
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
||||||
Defaults to an 'empty' stopword list added[1.0.0.Beta1, Previously
|
Defaults to an 'empty' stopword list added[1.0.0.Beta1, Previously
|
||||||
defaulted to the English stopwords list]
|
defaulted to the English stopwords list]. Check
|
||||||
|
<<analysis-stop-analyzer,Stop Analyzer>> for more details.
|
||||||
|`max_token_length` |The maximum token length. If a token is seen that
|
|`max_token_length` |The maximum token length. If a token is seen that
|
||||||
exceeds this length then it is discarded. Defaults to `255`.
|
exceeds this length then it is discarded. Defaults to `255`.
|
||||||
|=======================================================================
|
|=======================================================================
|
||||||
|
@ -14,8 +14,9 @@ The following are settings that can be set for a `stop` analyzer type:
|
|||||||
|Setting |Description
|
|Setting |Description
|
||||||
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
|`stopwords` |A list of stopwords to initialize the stop filter with.
|
||||||
Defaults to the english stop words.
|
Defaults to the english stop words.
|
||||||
|
|
||||||
|`stopwords_path` |A path (either relative to `config` location, or
|
|`stopwords_path` |A path (either relative to `config` location, or
|
||||||
absolute) to a stopwords file configuration.
|
absolute) to a stopwords file configuration.
|
||||||
|=======================================================================
|
|=======================================================================
|
||||||
|
|
||||||
|
Use `stopwords: _none_` to explicitly specify an 'empty' stopword list.
|
||||||
|
|
||||||
|
Loading…
x
Reference in New Issue
Block a user