OpenSearch/docs/reference/analysis
Nik Everett 4b9664beeb Mapping: Default position_offset_gap to 100
This is much more fiddly than you'd expect it to be because of the way
position_offset_gap is applied in StringFieldMapper. Instead of setting
the default to 100, it's simpler to make sure that all the analyzers default
to 100 and that StringFieldMapper doesn't override the default unless the
user specifies something different. The exception is an index created
before 2.1, which has to keep the old default of 0.

Also, position_offset_gap values less than 0 aren't allowed at all.
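
The precedence described above can be sketched roughly as follows. This is
an illustrative model only, not the StringFieldMapper code: the function
name effective_gap, its parameters, and the version tuples are all
hypothetical, and the sketch assumes an explicit mapping value wins over an
analyzer value, which in turn wins over the version-dependent default.

```python
from typing import Optional, Tuple

NEW_DEFAULT_GAP = 100    # default for indices created on or after 2.1
LEGACY_DEFAULT_GAP = 0   # default kept for indices created before 2.1


def effective_gap(index_created: Tuple[int, ...],
                  mapping_gap: Optional[int] = None,
                  analyzer_gap: Optional[int] = None) -> int:
    """Hypothetical helper: pick the position_offset_gap for a string field."""
    # Negative gaps are rejected wherever they are configured.
    for gap in (mapping_gap, analyzer_gap):
        if gap is not None and gap < 0:
            raise ValueError("position_offset_gap must not be negative")
    # A value set in the mapping overrides everything else.
    if mapping_gap is not None:
        return mapping_gap
    # Otherwise a value set on the (custom) analyzer is used.
    if analyzer_gap is not None:
        return analyzer_gap
    # Nothing configured: fall back to the version-dependent default.
    return LEGACY_DEFAULT_GAP if index_created < (2, 1) else NEW_DEFAULT_GAP


print(effective_gap((2, 1, 0)))                   # 100
print(effective_gap((1, 7, 1)))                   # 0
print(effective_gap((2, 1, 0), mapping_gap=10))   # 10
```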

New tests test that:
1. the new default doesn't match phrases across values with reasonably low
slop (5)
2. the new default doesn't match phrases across values with reasonably high
slop (50)
3. you can override the value and phrases work as you'd expect
4. if you leave the value undefined in the mapping and define it on a
custom analyzer, the value from the custom analyzer shines through
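
To see why the gap interacts with slop the way these tests expect, here is a
small simulation. It is a simplified model, not Lucene's phrase scorer: it
assumes whitespace tokenization, assumes the gap is added to the position
counter before each value after the first, and only handles a two-term
phrase in order. The names token_positions and phrase_matches are
hypothetical.

```python
def token_positions(values, gap):
    """Assign positions to whitespace tokens across a multi-valued field."""
    positions = {}
    pos = -1
    for i, value in enumerate(values):
        if i > 0:
            pos += gap          # gap applied between consecutive values
        for token in value.split():
            pos += 1            # each token advances the position by one
            positions.setdefault(token, []).append(pos)
    return positions


def phrase_matches(values, first, second, slop, gap=100):
    """True if `second` follows `first` with at most `slop` extra positions."""
    positions = token_positions(values, gap)
    return any(0 <= b - a - 1 <= slop
               for a in positions.get(first, [])
               for b in positions.get(second, []))


values = ["one", "two"]                                     # two separate values
print(phrase_matches(values, "one", "two", slop=5))         # False
print(phrase_matches(values, "one", "two", slop=50))        # False
print(phrase_matches(values, "one", "two", slop=0, gap=0))  # True
```

Under this model, "one" lands at position 0 and "two" at position 101, so
the phrase only matches across values once slop reaches 100. With the old
default of 0, the two values are treated as adjacent and the phrase matches
with no slop at all.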

Closes #7268
2015-08-25 14:21:50 -04:00
analyzers Mapping: Default position_offset_gap to 100 2015-08-25 14:21:50 -04:00
charfilters Docs: Update pattern-replace-charfilter.asciidoc 2015-01-29 20:24:08 +01:00
tokenfilters Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting 2015-07-14 18:14:09 +02:00
tokenizers Docs: Removed all the added/deprecated tags from 1.x 2014-09-26 21:04:42 +02:00
analyzers.asciidoc Referring to stop analyzer for stopword docs #329 2014-01-14 11:53:30 +01:00
charfilters.asciidoc Migrated documentation into the main repo 2013-08-29 01:24:34 +02:00
icu-plugin.asciidoc Docs: fixed ICU plugin documentation 2014-07-03 15:21:51 +02:00
tokenfilters.asciidoc Analysis: Add keep_types for filtering by token type 2014-08-15 09:28:12 -04:00
tokenizers.asciidoc Analysis: Add additional Analyzers, Tokenizers, and TokenFilters from Lucene 2014-07-03 05:47:49 -04:00