mirror of
https://github.com/honeymoose/OpenSearch.git
synced 2025-02-06 04:58:50 +00:00
4b77d0434a
In #30209 we deprecated the camel case `nGram` filter name in favour of `ngram` and did the same for `edgeNGram` and `edge_ngram` and we are removing those names in 8.0. This change disallows using the deprecated names for new indices created in 7.0 by throwing an error if these filters are used. Relates to #38911
51 lines
2.3 KiB
Plaintext

[float]
[[breaking_70_analysis_changes]]
=== Analysis changes

[float]
==== Limiting the number of tokens produced by _analyze

To safeguard against out of memory errors, the number of tokens that can be produced
using the `_analyze` endpoint has been limited to 10000. This default limit can be changed
for a particular index with the index setting `index.analyze.max_token_count`.
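As a minimal sketch, the per-index override could be expressed as a settings body like the one built here. Only the setting name `index.analyze.max_token_count` comes from the text; the helper name `analyze_limit_settings` and the value `20000` are illustrative assumptions, not recommendations.

```python
import json


def analyze_limit_settings(max_token_count):
    """Build an index settings body that overrides the _analyze token limit.

    Hypothetical helper for illustration; it only constructs the JSON body
    and does not talk to a cluster.
    """
    if max_token_count < 1:
        raise ValueError("max_token_count must be a positive integer")
    return {"index.analyze.max_token_count": max_token_count}


# 20000 is an arbitrary example value.
body = analyze_limit_settings(20000)
print(json.dumps(body, indent=2))
```

The resulting body would be supplied as index settings for the index whose limit should differ from the default.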

[float]
==== Limiting the length of an analyzed text during highlighting

Highlighting a text that was indexed without offsets or term vectors
requires analysis of this text in memory, in real time, during the search request.
For large texts, this analysis may take a substantial amount of time and memory.
To protect against this, the maximum number of characters that will be analyzed has been
limited to 1000000. This default limit can be changed
for a particular index with the index setting `index.highlight.max_analyzed_offset`.
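A client could compare text sizes against this limit before relying on highlighting. The sketch below is an assumption-laden illustration: the helper name is hypothetical, and it assumes the limit counts characters the way Python's `len` does, which may differ from the server's count for text containing characters outside the Basic Multilingual Plane.

```python
# Default taken from the text above; an index may override it.
DEFAULT_MAX_ANALYZED_OFFSET = 1_000_000


def exceeds_highlight_limit(text, limit=DEFAULT_MAX_ANALYZED_OFFSET):
    """Return True if `text` is longer than the highlight analysis limit.

    Hypothetical client-side check; character counting here is an
    approximation of whatever the server actually measures.
    """
    return len(text) > limit


# Example settings body raising the limit for one index (arbitrary value).
settings_override = {"index.highlight.max_analyzed_offset": 2_000_000}
```

Fields that routinely exceed the limit are candidates either for a higher per-index value or for being indexed with offsets or term vectors, so that no in-memory analysis is needed.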

[float]
==== `delimited_payload_filter` renaming

The `delimited_payload_filter` was deprecated and renamed to `delimited_payload` in 6.2.
Using the old name in indices created before 7.0 will issue deprecation warnings. Using the old
name in new indices created in 7.0 will throw an error. Use the new name `delimited_payload`
instead.
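For illustration, an analysis configuration that uses the new name might look like the following sketch. The filter name `my_payloads`, the analyzer name `payload_analyzer`, and the choice of the `whitespace` tokenizer are assumptions made for the example; only the `delimited_payload` type name comes from the text.

```python
import json

analysis_settings = {
    "analysis": {
        "filter": {
            # "my_payloads" is a hypothetical name; the type uses the new
            # `delimited_payload` name rather than `delimited_payload_filter`.
            "my_payloads": {
                "type": "delimited_payload",
                "delimiter": "|",
                "encoding": "float",
            }
        },
        "analyzer": {
            # Hypothetical analyzer that wires the filter into a chain.
            "payload_analyzer": {
                "type": "custom",
                "tokenizer": "whitespace",
                "filter": ["my_payloads"],
            }
        },
    }
}

print(json.dumps(analysis_settings, indent=2))
```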

[float]
==== `standard` filter has been removed

The `standard` token filter has been removed because it doesn't change anything in the stream.
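Because the filter was a no-op, existing analyzer definitions can simply drop it from their filter chains. The following is a rough sketch of such a cleanup; the helper name is hypothetical, and it assumes the `analysis` settings are available as a plain dictionary whose analyzers list their filters by name.

```python
def drop_standard_filter(analysis):
    """Return a copy of an analysis settings dict with the `standard`
    token filter removed from every analyzer's filter chain.

    Hypothetical helper; tokenizers named `standard` are left untouched.
    """
    cleaned = dict(analysis)
    analyzers = {}
    for name, definition in analysis.get("analyzer", {}).items():
        definition = dict(definition)  # shallow copy, input stays unchanged
        chain = definition.get("filter")
        if isinstance(chain, list):
            definition["filter"] = [f for f in chain if f != "standard"]
        analyzers[name] = definition
    if analyzers:
        cleaned["analyzer"] = analyzers
    return cleaned


old = {
    "analyzer": {
        "my_analyzer": {
            "type": "custom",
            "tokenizer": "standard",
            "filter": ["standard", "lowercase"],
        }
    }
}
new = drop_standard_filter(old)
print(new["analyzer"]["my_analyzer"]["filter"])
```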

[float]
==== Deprecated `standard_html_strip` analyzer

The `standard_html_strip` analyzer has been deprecated and should be replaced
with a combination of the `standard` tokenizer and the `html_strip` char_filter.
Indexes created using this analyzer will still be readable in Elasticsearch 7.0,
but it will not be possible to create new indexes using it.
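A replacement along these lines could be defined as a custom analyzer. In this sketch the analyzer name `my_html_strip` is hypothetical, and the `lowercase` token filter is an added assumption that the text does not mention; remove it if only the tokenizer and char_filter combination is wanted.

```python
import json

replacement_settings = {
    "analysis": {
        "analyzer": {
            # Hypothetical name for the custom replacement analyzer.
            "my_html_strip": {
                "type": "custom",
                "char_filter": ["html_strip"],  # strips HTML before tokenizing
                "tokenizer": "standard",
                # Assumption for illustration only, not stated in the text.
                "filter": ["lowercase"],
            }
        }
    }
}

print(json.dumps(replacement_settings, indent=2))
```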

[float]
==== The deprecated `nGram` and `edgeNGram` token filters cannot be used on new indices

The `nGram` and `edgeNGram` token filter names were deprecated in an earlier 6.x version.
Indexes created using these token filters will still be readable in Elasticsearch 7.0, but indexing
documents using those filter names will issue a deprecation warning. Using the deprecated names on
new indices starting with version 7.0.0 is prohibited and throws an error when indexing
or analyzing documents. The names should be replaced by `ngram` and `edge_ngram` respectively.
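The rename can be applied mechanically to existing index definitions before they are reused for new indices. The sketch below is one possible approach, not an official migration tool: the helper name is hypothetical, and it assumes the `analysis` settings are held as a plain dictionary.

```python
RENAMES = {"nGram": "ngram", "edgeNGram": "edge_ngram"}


def rename_deprecated_filters(analysis):
    """Return a copy of an analysis settings dict using the new filter names."""
    result = dict(analysis)
    custom = analysis.get("filter", {})

    # Custom filter definitions: rewrite a deprecated "type" value.
    filters = {}
    for name, definition in custom.items():
        definition = dict(definition)
        if definition.get("type") in RENAMES:
            definition["type"] = RENAMES[definition["type"]]
        filters[name] = definition
    if filters:
        result["filter"] = filters

    # Analyzer chains: rewrite direct references to the deprecated names,
    # unless the name refers to a user-defined filter of the same name.
    analyzers = {}
    for name, definition in analysis.get("analyzer", {}).items():
        definition = dict(definition)
        chain = definition.get("filter")
        if isinstance(chain, list):
            definition["filter"] = [
                f if f in custom else RENAMES.get(f, f) for f in chain
            ]
        analyzers[name] = definition
    if analyzers:
        result["analyzer"] = analyzers
    return result


old = {
    "filter": {"my_grams": {"type": "nGram", "min_gram": 2, "max_gram": 3}},
    "analyzer": {
        "my_analyzer": {
            "type": "custom",
            "tokenizer": "standard",
            "filter": ["lowercase", "edgeNGram", "my_grams"],
        }
    },
}
new = rename_deprecated_filters(old)
print(new)
```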