OpenSearch/docs/reference/analysis
Mayya Sharipova 148376c2c5
Add limits for ngram and shingle settings (#27211)
* Add limits for ngram and shingle settings (#27211)

Create index-level settings:
max_ngram_diff - maximum allowed difference between max_gram and min_gram in
NGramTokenFilter/NGramTokenizer. Default is 1.
max_shingle_diff - maximum allowed difference between max_shingle_size and
 min_shingle_size in ShingleTokenFilter.  Default is 3.

Throw an IllegalArgumentException when
trying to create NGramTokenFilter, NGramTokenizer, ShingleTokenFilter
where difference between max_size and min_size exceeds the settings value.

Closes #25887
2017-11-07 08:14:55 -05:00
..
analyzers Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238) 2017-10-05 13:25:05 +02:00
charfilters Add a shard filter search phase to pre-filter shards based on query rewriting (#25658) 2017-07-12 22:19:20 +02:00
tokenfilters Add limits for ngram and shingle settings (#27211) 2017-11-07 08:14:55 -05:00
tokenizers Add limits for ngram and shingle settings (#27211) 2017-11-07 08:14:55 -05:00
analyzers.asciidoc First pass at improving analyzer docs (#18269) 2016-05-11 14:17:56 +02:00
anatomy.asciidoc Correction of the names of numirals (#21531) 2016-11-25 14:30:49 +01:00
charfilters.asciidoc Hindu-Arabico-Latino Numerals (#22476) 2017-01-10 15:24:56 +01:00
normalizers.asciidoc Update experimental labels in the docs (#25727) 2017-07-18 14:06:22 +02:00
testing.asciidoc Remove wait_for_status=yellow from the docs 2016-07-15 16:02:07 -04:00
tokenfilters.asciidoc Add missing link for the WordDelimiterGraphFilter 2017-04-28 17:12:38 +02:00
tokenizers.asciidoc Rename simple pattern tokenizers (#25300) 2017-06-19 13:48:43 -07:00