OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	bf72858ce8	[Docs] Restore section about multi-level parent/child relation in parent-join (#27392 ) This section was removed to hide this ability to new users. This change restores the section and adds a warning regarding the expected performance. Closes #27336	2017-11-16 11:29:16 +01:00
Martijn van Groningen	b4048b4e7f	Use CoveringQuery to select percolate candidate matches and extract all clauses from a conjunction query. When clauses from a conjunction are extracted the number of clauses is also stored in an internal doc values field (minimum_should_match field). This field is used by the CoveringQuery and allows the percolator to reduce the number of false positives when selecting candidate matches and in certain cases be absolutely sure that a conjunction candidate match will match and then skip MemoryIndex validation. This can greatly improve performance. Before this change only a single clause was extracted from a conjunction query. The percolator tried to extract the clause that was rarest (based on term length) so that fewer candidate queries would be selected in the first place. However, with this method there is still a very high chance that candidate query matches are false positives. This change also removes the influencing query extraction added via #26081 as this is no longer needed because now all conjunction clauses are extracted. https://www.elastic.co/guide/en/elasticsearch/reference/6.x/percolator.html#_influencing_query_extraction Closes #26307	2017-11-10 07:44:42 +01:00
Nicholas Knize	06ff92d237	Add ignore_malformed to geo_shape fields This commit adds ignore_malformed support to geo_shape field types to skip malformed geoJson fields. closes #23747	2017-11-09 17:59:05 -06:00
Holger Bartnick	aa03fb72b7	[Docs] Correct link target for datatype murmur3 (#27143 )	2017-10-30 09:31:55 +01:00
Martijn van Groningen	f1e944a675	docs: describe parent/child performances	2017-10-26 11:49:13 +02:00
markwalkom	2b864156ca	[Docs] Clarify mapping `index` option default (#27104 )	2017-10-25 12:42:29 +02:00
David Turner	559fc5a4de	Update numbers to reflect 4-byte UTF-8-encoded characters (#27083 ) You need 4 bytes for characters outside the BMP, which includes many emoji and a bunch of less-common writing characters too.	2017-10-24 09:50:47 +01:00
Adrien Grand	4e1ff8d086	Add documentation about disabling `_field_names`. (#26813 ) This field has significant index-time overhead. Closes #26779	2017-10-06 16:49:15 +02:00
Clinton Gormley	eb3ead6561	Update type-field.asciidoc Fixed asciidoc syntax on deprecated annotation	2017-10-06 11:57:27 +02:00
Christoph Büscher	6189c54c84	Reject the `index_options` parameter for numeric fields (#26668 ) Numeric fields no longer support the index_options parameter. This changes the parameter to be rejected in numeric field types after it was deprecated in 6.0. Closes #21475	2017-09-25 23:43:14 +02:00
Michael Basnight	f385e0cf26	Add bad_request to the rest-api-spec catch params (#26539 ) This adds another request to the catch params. It also makes sure that the generic request param does not allow 400 either.	2017-09-14 14:24:03 -05:00
Bernd	59600dfe2d	[Docs] Correct typo in removal_of_types.asciidoc (#26646 )	2017-09-14 15:34:07 +02:00
Daniel A. Ochoa	914416e9f4	[Docs] Update link in removal_of_types.asciidoc (#26614 ) Fix link to [parent-child relationship].	2017-09-14 10:11:03 +02:00
Jim Ferenczi	c709b8d6ac	Fix incomplete sentences in parent-join docs (#26623 ) * Fix incomplete sentences in parent-join docs Closes #26590	2017-09-13 16:09:00 +02:00
Martijn van Groningen	b391425da1	Added support to the percolate query to percolate multiple documents The percolator will add a `_percolator_document_slot` field to all percolator hits to indicate with what document it has matched. This number matches with the order in which the documents have been specified in the percolate query. Also improved the support for multiple percolate queries in a search request.	2017-09-08 17:28:39 +02:00
Martijn van Groningen	a4d5c6418e	percolator: Rename map_unmapped_fields_as_string setting to map_unmapped_fields_as_text The `index.percolator.map_unmapped_fields_as_text` is a better name, because unmapped fields are mapped to a text field with default settings and string is no longer a field type (it is either keyword or text).	2017-09-04 14:12:44 +02:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Martijn van Groningen	636e85e5b7	percolator: Hint what clauses are important in a conjunction query based on fields The percolator field mapper doesn't need to extract all terms and ranges from a bool query with must or filter clauses. To help the default extraction behavior, boost fields can be configured, so that fields that are known for not being selective enough can be ignored in favor of other fields, or clauses with specific fields can forcefully take precedence over other clauses. This can help select clauses for fields that don't match a lot of percolator queries over other clauses, thus improving performance of the percolate query. For example, a status-like field is something that should be configured as an ignore field. Queries on this field tend to match more documents, so if clauses for this field get selected as the best clause, that isn't very helpful for the candidate query that the percolate query generates to filter out percolator queries that are likely not going to match.	2017-08-11 15:32:01 +02:00
Martijn van Groningen	b88cfe2008	docs: Use stackexchange based example to make documentation easier to understand	2017-08-04 16:04:26 +02:00
Martijn van Groningen	ec7ac32772	docs: document work around for the percolator if query time text analysis is expensive.	2017-07-28 15:04:15 +02:00
Martijn van Groningen	7c3735bdc4	percolator: Store the QueryBuilder's Writable representation instead of its XContent representation. The Writable representation is less heavy to parse and that will benefit percolate performance and throughput. The query builder's binary format now has the same bwc guarantees as the xcontent format. Added a qa test that verifies that percolator queries written in older versions are still readable by the current version.	2017-07-28 12:24:10 +02:00
Martijn van Groningen	5cf56a846a	docs: Remove incorrect warning Closes #25935	2017-07-28 10:53:47 +02:00
Colin Goodheart-Smithe	f1f1725fcf	[DOCS] improve explanation of dynamic mapping setting (#25829 ) Closes #25825	2017-07-21 12:24:38 +01:00
Clinton Gormley	febb4bf7bc	Update removal_of_types.asciidoc Fixed `include_in_type` -> `include_type_name`	2017-07-20 19:18:51 +02:00
Clinton Gormley	f69decf509	NOCONSOLE -> NOTCONSOLE in removal-of-types	2017-07-19 14:06:04 +02:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large number of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range, i.e. `now-3d`. While the search can potentially hit hundreds of shards, the majority of the shards might yield 0 results since there is no document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
James Baiera	847378a43b	Add another parent value option to join documentation (#25609 ) Indexing a join field on a document requires a value of type "object" and two sub fields "name" and "parent". The "parent" field is only required on child documents, but the "name" field which denotes the name of the relation is always needed. Previously, only the short-hand version of the join field was documented. This adds documentation for the long-hand join field data, and explicitly points out that just specifying the name of the relation for the field value is a convenience shortcut.	2017-07-11 15:36:59 -04:00
Martijn van Groningen	d0f9f425bd	parent/child: Removed ParentJoinFieldSubFetchPhase	2017-07-06 13:15:02 +02:00
Adrien Grand	26de905f1e	Fix the documentation to state that the `_id` field is indexed. (#25540 )	2017-07-05 16:09:31 +02:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
Martijn van Groningen	9ce9c21b83	docs: added percolator script query limitation	2017-06-28 17:10:30 +02:00
Nathan Taylor	645bb9d0fb	Docs: Removed duplicated line in mapping docs	2017-06-21 10:47:19 +02:00
Jim Ferenczi	afada69ea9	[Docs] more fix for the parent-join docs	2017-06-16 12:49:16 +02:00
Jim Ferenczi	664193185e	[Docs] Fix cross reference for parent-join field	2017-06-16 11:53:16 +02:00
Jim Ferenczi	ccb3c9aae7	Add documentation for the new parent-join field (#25227 ) * Add documentation for the new parent-join field This commit adds the docs for the new parent-join field. It explains how to define, index and query this new field. Relates #20257	2017-06-16 11:13:23 +02:00
Russ Cam	f6821c41d8	Add half_float and scaled float (#22988 ) to numeric datatypes (cherry picked from commit 67ea06145a80d5ec52ba55d1f2e1e8287e1882b1)	2017-06-13 09:54:44 +10:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Jim Ferenczi	8250aa4267	Remove the postings highlighter and make unified the default highlighter choice (#25028 ) This change removes the `postings` highlighter. This highlighter has been removed from Lucene master (7.x) because it behaves exactly like the `unified` highlighter when index_options is set to `offsets`: https://issues.apache.org/jira/browse/LUCENE-7815 It also makes the `unified` highlighter the default choice for highlighting a field (if `type` is not provided). The strategy used internally by this highlighter remains the same as before: it checks `term_vectors` first, then `postings` and ultimately it re-analyzes the text. Finally, it rewrites the docs so that the options that the `unified` highlighter cannot handle are clearly marked as such. There are a few features that the `unified` highlighter is not able to handle, which is why the other highlighters (`plain` and `fvh`) are still available. I'll open separate issues for these features and we'll deprecate the `fvh` and `plain` highlighters when full support for these features has been added to the `unified`.	2017-06-09 14:09:57 +02:00
Andrey Groshev	e4fd8485ce	Made the same length of opening and closing lines (#23583 )	2017-06-09 00:50:43 -07:00
Jim Ferenczi	ad905924ae	update docs that claim that classic is the default similarity	2017-06-09 09:22:48 +02:00
Adrien Grand	ebf806d38f	Reorganize docs of global ordinals. (#24982 ) Currently global ordinals are documented under `fielddata`. It moves them to their own file since they also work with doc values and fielddata is on the way out. Closes #23101	2017-06-01 16:47:44 +02:00
markharwood	b7197f5e21	SignificantText aggregation - like significant_terms, but for text (#24432 ) * SignificantText aggregation - like significant_terms but doesn’t require fielddata=true, recommended used with `sampler` agg to limit expense of tokenizing docs and takes optional `filter_duplicate_text`:true setting to avoid stats skew from repeated sections of text in search results. Closes #23674	2017-05-24 13:46:43 +01:00
Adrien Grand	a72eaa8e0f	Identify documents by their `_id`. (#24460 ) Now that indices have a single type by default, we can move to the next step and identify documents using their `_id` rather than the `_uid`. One notable change in this commit is that I made deletions implicitly create types. This helps with the live version map in the case that documents are deleted before the first type is introduced. Otherwise there would be no way to differentiate `DELETE index/foo/1` followed by `PUT index/foo/1` from `DELETE index/bar/1` followed by `PUT index/foo/1`, even though those are different if versioning is involved.	2017-05-09 16:33:52 +02:00
Nicholas Knize	0c4eb0a029	Add new ip_range field type This commit adds support for indexing and searching a new ip_range field type. Both IPv4 and IPv6 formats are supported. Tests are updated and docs are added.	2017-05-05 09:43:42 -05:00
Nik Everett	a01f846226	CONSOLEify a few more docs Adds CONSOLE to cross-cluster-search docs but skips them for testing because we don't have a second cluster set up. This gets us the `VIEW IN CONSOLE` and `COPY AS CURL` links and makes sure that they are valid yaml (not json, technically) but doesn't get testing. Which is better than we had before. Adds CONSOLE to the dynamic templates docs and ingest-node docs. The ingest-node docs contain a ton of non-console snippets. We might want to convert them to full examples later, but that can be a separate thing. Relates to #18160	2017-05-04 21:01:14 -04:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Danilo Akamine	0adaf9fb4c	Drop `search_analyzer` parameter from keyword.asciidoc (#24221 ) `search_analyzer` isn't supported by `keyword` fields so this removes it from the documentation for them.	2017-04-25 12:49:50 -04:00
Nik Everett	e429d66956	CONSOLEify some more docs Relates to #18160	2017-04-24 16:08:19 -04:00
Fabien Baligand	4a45579506	token_count type : add an option to count tokens (fix #23227 ) (#24175 ) Add option "enable_position_increments" with default value true. If option is set to false, indexed value is the number of tokens (not position increments count)	2017-04-21 00:53:28 +02:00
Loek van Gool	e11d892562	Update field-names-field.asciidoc (#24178 ) fix typo in field name	2017-04-19 11:57:37 +02:00
Martijn van Groningen	3d9671a668	[PERCOLATOR] Allowing range queries with now ranges inside percolator queries. Previously, now ranges were forbidden, because the percolator query itself could get cached and then the percolator queries with now ranges that should no longer match would incorrectly continue to match. By disabling caching when the `percolator` is being used, the percolator can now correctly support range queries with now based ranges. I think this is the right tradeoff. The percolator query is likely to not be the same between search requests and disabling range queries with now ranges really prevented people from using the percolator for their use cases. Also fixed an issue that existed in the percolator fieldmapper: it was unable to find forbidden queries inside `dismax` queries. Closes #23859	2017-04-07 08:44:43 +02:00
Lee Hinman	b6b9ef8e26	[DOCS] Remove line about eager loading global ordinals Fielddata can no longer be configured to be loaded eagerly (it only accepts `true` and `false`), so this line is a little misleading because it talks about a procedure we can no longer do.	2017-04-03 12:56:21 -06:00
Nik Everett	653f50973a	CONSOLEify geo-shape docs `CONSOLE`ify geo-shape type and geo-shape query docs. Relates to #18160	2017-03-31 09:11:54 -04:00
Nik Everett	5f91241f57	CONSOLEify geo aggregation docs Turns the top example in each of the geo aggregation docs into a working example that can be opened in CONSOLE. Subsequent examples can all also be opened in console and will work after you've run the first example. All examples are tested as part of the build.	2017-03-30 21:28:52 -04:00
Ali Beyad	8359dd05c9	Adds boolean similarity to Elasticsearch (#23637 ) This commit adds the boolean similarity scoring from Lucene to Elasticsearch. The boolean similarity provides a means to specify that a field should not be scored with typical full-text ranking algorithms, but rather just whether the query terms match the document or not. Boolean similarity scores a query term equal to its query boost only. Boolean similarity is available as a default similarity option and thus a field can be specified to have boolean similarity by declaring in its mapping: "similarity": "boolean" Closes #6731	2017-03-28 10:17:23 -04:00
Martijn van Groningen	b116b8f0cb	[DOCS] Update the docs about the fact that global ordinals for _parent field are loaded eagerly instead of lazily by default. Relates to #8053	2017-03-22 10:39:39 +01:00
Lee Hinman	b3c27a7fdd	Disallow include_in_all for 6.0+ indices Since `_all` is now deprecated and cannot be set for new indices, we should also disallow any field that has the `include_in_all` parameter set. Resolves #22923	2017-02-07 19:31:51 -07:00
AlexNodex	fb8bdbc57a	Update typo in date (#22955 ) your example has yyy and it should be yyyy	2017-02-03 13:16:17 +01:00
Clinton Gormley	19ce039d2d	Update type-field.asciidoc Wildcard type names are not supported	2017-01-27 17:50:28 +01:00
Yannick Welsch	881993de3a	[Docs] Remove outdated info about enabling/disabling doc_values (#22694 )	2017-01-19 17:33:40 +01:00
Daniel Mitterdorfer	aece89d6a1	Make boolean conversion strict (#22200 ) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error.	2017-01-19 07:59:18 +01:00
Scott Somerville	372812da98	Allow an index to be partitioned with custom routing (#22274 ) This change makes it possible for custom routing values to go to a subset of shards rather than just a single shard. This enables the ability to utilize the spatial locality that custom routing can provide while mitigating the likelihood of ending up with an imbalanced cluster or suffering from a hot shard. This is ideal for large multi-tenant indices with custom routing that suffer from one or both of the following: - The big tenants cannot fit into a single shard or there are so many of them that they will likely end up on the same shard - Tenants often have a surge in write traffic and a single shard cannot process it fast enough Beyond that, this should also be useful for use cases where most queries are done under the context of a specific field (e.g. a category) since it gives a hint at how the data can be stored to minimize the number of shards to check per query. While a similar solution can be achieved with multiple concrete indices or aliases per value today, those approaches break down for high cardinality fields. A partitioned index enforces that mappings have routing required, that the partition size does not change when shrinking an index (the partitions will shrink proportionally), and rejects mappings that have parent/child relationships. Closes #21585	2017-01-18 08:51:23 +01:00
Alex	a0c83c4511	Minor doc changes to clarify mapping index param for string type (#22652 ) * Grammatical correction * Add note for legacy string mapping type * Update truncate token filter to not mention the keyword tokenizer The advice predates the existence of the keyword field Closes #22650	2017-01-17 16:43:11 +01:00
Lee Hinman	7a18bb50fc	Disable _all by default This change disables the _all meta field by default. Now that we have the "all-fields" method of query execution, we can save both indexing time and disk space by disabling it. _all can no longer be configured for indices created after 6.0. Relates to #20925 and #21341 Resolves #19784	2017-01-11 16:47:13 -07:00
Nik Everett	75d5b3d9eb	Fix parent_id example in docs And fix some indentation I noticed while looking up the query.	2017-01-10 10:01:31 -05:00
Clinton Gormley	cb7952e71d	Docs: Parent field is no longer indexed and should use parent_id instead of term query Closes #22517	2017-01-10 13:48:07 +01:00
Jason Veatch	20f90178fe	Docs: Detail on false/strict dynamic mapping setting (#22451 ) Reference: https://www.elastic.co/guide/en/elasticsearch/guide/master/dynamic-mapping.html	2017-01-05 14:36:18 -05:00
Adrien Grand	3f805d68cb	Add the ability to set an analyzer on keyword fields. (#21919 ) This adds a new `normalizer` property to `keyword` fields that pre-processes the field value prior to indexing, but without altering the `_source`. Note that only the normalization components that work on a per-character basis are applied, so for instance stemming filters will be ignored while lowercasing or ascii folding will be applied. Closes #18064	2016-12-30 09:36:10 +01:00
Adrien Grand	84edf36f11	Make `-0` compare less than `+0` consistently. (#22173 ) Our `float`/`double` fields generally assume that `-0` compares less than `+0`, except when bounds are exclusive: an exclusive lower bound on `-0` excludes `+0` and an exclusive upper bound on `+0` excludes `-0`. Closes #22167	2016-12-21 16:51:45 +01:00
Adrien Grand	9524c81af9	Document the `locale` option of the `date` field. (#22050 ) This also adds another level of protection against using the default locale. Relates to https://discuss.elastic.co/t/mapping-for-12h-date-format/68433/3.	2016-12-09 09:45:53 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Clinton Gormley	5555e85619	Document that the PUT mapping API with the _default_ type overwrites instead of merging Closes #8215	2016-11-26 12:43:56 +01:00
Clinton Gormley	a4e88bb64a	Fixed bad asciidoc in boolean mapping docs	2016-11-15 17:50:23 +00:00
Lee Hinman	96122aa518	Be strict when parsing values searching for booleans (#21555 ) This changes only the query parsing behavior to be strict when searching on boolean values. We continue to accept the variety of values during index time, but searches will only be parsed using `"true"` or `"false"`. Resolves #21545	2016-11-15 10:36:57 -07:00
Alexander Lin	0219a211d3	Allows multiple patterns to be specified for index templates (#21009 ) * Allows for an array of index template patterns to be provided to an index template, and rename the field from 'template' to 'index_pattern'. Closes #20690	2016-11-10 18:00:30 -05:00
LakumiNarayanan	5af6deb5b5	Fix typo in keyword.asciidoc (#21237 )	2016-11-01 10:15:12 -04:00
Lee Hinman	6a8bad8a06	[DOCS] Document all date formats (#21164 ) Resolves #21046	2016-10-31 09:15:36 -06:00
Jun Ohtani	a66c76eb44	Merge pull request #20704 from johtani/remove_request_params_in_analyze_api Removing request parameters in _analyze API	2016-10-27 17:43:18 +09:00
Colin Goodheart-Smithe	c1a9833445	Correct similarity default for 5.0 (#21144 )	2016-10-27 09:33:21 +01:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Jun Ohtani	370f0b885e	Removing request parameters in _analyze API Remove request params in _analyze API without index param Change rest-api-test using JSON Change docs using JSON Closes #20246	2016-10-07 16:23:24 +09:00
Anatolii Stepaniuk	f895abcf40	Fix grammar issues in some docs This commit fixes some grammar issues in various docs. Closes #20751 Closes #20752 Closes #20754 Closes #20755	2016-10-05 11:20:45 -04:00
Lee Hinman	3f77eacab1	Revert "Default `include_in_all` for numeric-like types to false" This reverts commit `6666892038`.	2016-09-28 07:07:46 -06:00
Clinton Gormley	e3b7b4f032	Reorganised docs for mapping safeguard settings	2016-09-22 14:58:17 +02:00
Martijn van Groningen	ad7c22198c	docs: describe more explicitly what happens when indexing queries that fetch terms	2016-09-22 10:00:11 +00:00
David Pilato	dfd1eebdd0	Remove mapper attachments plugin We now have in 5.0.0 `ingest-attachment` plugin. We can remove `mapper-attachments` plugin for 6.0. Closes #18837.	2016-09-19 09:01:16 +02:00
Nicholas Knize	598bab93ae	[DOC] Cleanup dangling references to deprecated geo parameters With the cut over to LatLonPoint the geohash, geohash_precision, lat_lon, and geohash_prefix parameters have been removed. This commit fixes the doc build by removing the remaining dangling references to these removed parameters.	2016-09-13 16:38:38 -05:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Lee Hinman	40b088d728	Rework documentation example for _all to be less ambiguous with numerics	2016-09-08 09:09:48 -06:00
Lee Hinman	6666892038	Default `include_in_all` for numeric-like types to false This includes: - All regular numeric types such as int, long, scaled-float, double, etc - IP addresses - Dates - Geopoints and Geoshapes Relates to #19784	2016-09-08 09:09:48 -06:00
Nik Everett	e03fb602cd	Add CONSOLE places where it is obviously missing These places already have other annotations like `// TEST` and `// TESTSETUP` so they are already in console format.	2016-09-06 10:48:19 -04:00
Nik Everett	9c3f6d58ac	Support downgrading keyword/text into string This changes Elasticsearch to automatically downgrade `text` and `keyword` fields into appropriate `string` fields when changing the mapping of indexes imported from 2.x. This allows users to use the modern, documented syntax against 2.x indexes. It also makes it clear that reindexing in order to recreate the index in 5.0 is required for any long lived indexes. This change is useful for the times when you can't (cluster is just starting, not stable enough for reindex) or shouldn't (index will only live 90 days or something).	2016-08-29 11:27:37 -04:00
Munish Goyal	81b815ff76	Correct grammar in parent field doc	2016-08-29 07:51:39 -04:00
Nik Everett	5b34bec92a	Add deprecation warnings to docs for geohash Relates to #20126	2016-08-23 13:43:35 -04:00
Lee Hinman	3298a4ed38	Revert "Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'" This reverts commit `514585290c`, reversing changes made to `8563c8d897`.	2016-08-23 09:24:33 -06:00
Nicholas Knize	8234fad9ca	Deprecate geohash parameters for geo_point parser This commit deprecates all geohash parameters in the geo_point field parser.	2016-08-23 09:19:21 -05:00
Simon Willnauer	d685847b73	Use `refresh=true` in mapping/fields examples (#20120 ) Fix field examples to make documents actually visible This commit adds refresh calls to field examples and removes non-working `_routing` and `_field_names` script access. Closes #20118	2016-08-23 13:32:14 +02:00
Lee Hinman	514585290c	Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'	2016-08-22 12:36:25 -06:00
Munish Goyal	f9c17dd976	Correct sentence (#20088 )	2016-08-22 16:20:14 +02:00
Jim Ferenczi	4bee565535	Fix docs stating that index.mapper.dynamic can be set for all nodes in the elasticsearch.yml file. This is not supported in 5.x (index settings cannot be set at the cluster level) and should be replaced with a template for all indices.	2016-08-22 10:20:43 +02:00
Lee Hinman	b6ec1ae6eb	Rework documentation example for _all to be less ambiguous with numerics	2016-08-19 16:44:38 -06:00
Lee Hinman	d7e516c0b4	Default `include_in_all` for numeric-like types to false This includes: - All regular numeric types such as int, long, scaled-float, double, etc - IP addresses - Dates - Geopoints and Geoshapes Relates to #19784	2016-08-19 15:50:38 -06:00
David Pilato	97dfa2ba40	Fix typo Reported at https://discuss.elastic.co/t/little-error-in-documentation-page-mapping-parameters-format/57424	2016-08-08 10:52:09 +02:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Adrien Grand	398d70b567	Add `scaled_float`. #19264 This is an attempt to revive #15939 motivated by elastic/beats#1941. Half-floats are a pretty bad option for storing percentages. They would likely require 2 bytes all the time while they don't need more than one byte. So this PR exposes a new `scaled_float` type that requires a `scaling_factor` and internally indexes `value*scaling_factor` in a long field. Compared to the original PR it exposes a lower-level API so that the trade-offs are clearer and avoids any reference to fixed precision that might imply that this type is more accurate (actually it is *less* accurate). In addition to being more space-efficient for some use-cases that beats is interested in, this is also faster than `half_float` unless we can improve the efficiency of decoding half-float bits (which is currently done using software) or until Java gets first-class support for half-floats.	2016-07-18 12:36:23 +02:00
Nik Everett	7aeea764ba	Remove wait_for_status=yellow from the docs It is no longer required after `687e2e12b3`.	2016-07-15 16:02:07 -04:00
Clinton Gormley	05271d58ca	Updated fielddata docs to make it easier for users with old mappings	2016-07-14 19:58:12 +02:00
Martijn van Groningen	ff5527f037	percolator: Forbid the usage of `range` queries with a range based on the current time If there are percolator queries containing `range` queries with ranges based on the current time then this can lead to incorrect results if the `percolate` query gets cached. These ranges are changing each time the `percolate` query gets executed and if this query gets cached then the results will be based on how the range was at the time when the `percolate` query got cached. The ExtractQueryTermsService has been renamed `QueryAnalyzer` and now only deals with analyzing the query (extracting terms and deciding if the entire query is a verified match). The `PercolatorFieldMapper` is responsible for adding the right fields based on the analysis the `QueryAnalyzer` has performed, because this is highly dependent on the field mappings. Also the `PercolatorFieldMapper` is responsible for creating the percolate query.	2016-07-08 14:20:56 +02:00
Britta Weber	f36c1b4e60	Update fielddata.asciidoc	2016-07-05 16:21:52 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Jim Ferenczi	6d2df0dc18	Fix docs example for the _id field, the field is not accessible in scripts	2016-06-29 15:25:51 +02:00
Robert Muir	6d52cec2a0	Merge pull request #19092 from rmuir/more_painless_docs cutover some docs to painless	2016-06-28 13:40:25 -04:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Robert Muir	6fc1a22977	cutover some docs to painless	2016-06-27 09:55:16 -04:00
Martijn van Groningen	0cae9ad30e	docs: removed obsolete information, percolator queries are no longer loaded into jvm heap memory.	2016-06-23 15:32:26 +02:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Adrien Grand	7d63f4b8db	Fix doc build.	2016-06-22 09:34:49 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Clinton Gormley	0160d91c2c	Removed docs for precision_step - no longer used	2016-06-21 15:19:12 +02:00
Adrien Grand	9ffb2ff6ba	Expose half-floats. #18887 They have been implemented in https://issues.apache.org/jira/browse/LUCENE-7289. Ranges are implemented so that the accuracy loss only occurs at index time, which means that if you are searching for values between A and B, the query will match exactly all documents whose value rounded to the closest half-float point is between A and B.	2016-06-16 09:46:39 +02:00
Jim Ferenczi	6d62f33702	Make doc_values accessible for _type `doc_values` for _type field are created but any attempt to load them throws an IAE. This PR re-enables `doc_values` loading for _type, it also enables `fielddata` loading for indices created between 2.0 and 2.1 since doc_values were disabled during that period. It also restores the old docs that gives example on how to sort or aggregate on _type field.	2016-05-25 18:56:13 +02:00
G. Richard Bellamy	cf54903580	Support full range of Java Long for epoch DateTime Remove the arbitrary limit on epoch_millis and epoch_seconds of 13 and 10 characters, respectively. Instead allow any character combination that can be converted to a Java Long. Update the docs to reflect this change.	2016-05-22 13:08:20 -07:00
Clinton Gormley	97a41ee973	First pass at improving analyzer docs (#18269 ) * Docs: First pass at improving analyzer docs I've rewritten the intro to analyzers plus the docs for all analyzers to provide working examples. I've also removed: * analyzer aliases (see #18244) * analyzer versions (see #18267) * snowball analyzer (see #8690) Next steps will be tokenizers, token filters, char filters * Fixed two typos	2016-05-11 14:17:56 +02:00
Clinton Gormley	3f594089c2	Renamed all AUTOSENSE snippets to CONSOLE (#18210 )	2016-05-09 15:42:23 +02:00
Clinton Gormley	b352a90454	Correct docs for dynamic mapping of fields Floating point numbers are added as `float`, and Strings are added as `text` with a `keyword` sub-field	2016-05-07 17:16:31 +02:00
Nik Everett	cb40b986d1	Allow leading `/` in AUTOSENSE path Relates to #18160	2016-05-06 09:26:19 -04:00
Clinton Gormley	c55df195c5	Fixed bad asciidoc	2016-05-06 09:25:58 +02:00
Nik Everett	f3b2ab822d	Another wait_for_yellow to the docs All in service of the snippets passing consistently.	2016-05-05 19:03:23 -04:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Adrien Grand	80dbe31d59	Add note about using ipv6 addresses in `query_string`.	2016-05-04 08:53:11 +02:00
Clinton Gormley	7c8397d99b	Update keyword.asciidoc `ignore_above` doesn't apply to analyzed `text` fields	2016-05-02 13:47:14 +02:00
Robin Joseph	e322903f2c	Fix typo in include-in-all.asciidoc (#18055 )	2016-04-29 18:03:22 +02:00
Shane Connelly	713c0df3a3	Merge pull request #17994 from eskibars/master Add new IPv6 types to docs where it's supported	2016-04-29 06:00:32 -07:00
Clinton Gormley	84a2b4e17e	Update id-field.asciidoc Clarified which queries support the `_id` field	2016-04-28 13:36:14 +02:00
Christoph Büscher	a2c3b5cae1	Update keyword.asciidoc	2016-04-27 12:10:19 +02:00
Shane Connelly	aff148f532	Add new IPv6 types to docs where it's supported	2016-04-26 11:38:49 -07:00
Martijn van Groningen	81449fc912	percolator: renamed `percolator` query to `percolate` query	2016-04-20 15:23:54 +02:00
Martijn van Groningen	40c22fc654	percolator: removed .percolator type instead a field of type `percolator` should be configured before indexing percolator queries * Added an extra `field` parameter to the `percolator` query to indicate what percolator field should be used. This must be an existing field in the mapping of type `percolator`. * The `.percolator` type is now forbidden. (just like any type that starts with a `.`) This only applies for new indices created on 5.0 and later. Indices created on previous versions the .percolator type is still allowed to exist. The new `percolator` field type isn't active in such indices and the `PercolatorQueryCache` knows how to load queries from these legacy indices. The `PercolatorQueryBuilder` will not enforce that the `field` parameter is of type `percolator`.	2016-04-19 11:20:31 +02:00
LeonardGC	0b8be7f894	Update field-mapping.asciidoc (#17670 )	2016-04-15 09:22:38 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Nik Everett	0f9804b0e2	reindex: gracefully handle when _source is disabled Closes #17666	2016-04-13 08:19:58 -04:00
Ibrahim Awwal	5121060e75	Fix typo in templates.asciidoc The doc mentions match_path in one place but the correct syntax is path_match which is mentioned everywhere else. Using the wrong string leads to errors because the mapping becomes too greedy, and matches things it shouldn't.	2016-04-06 16:40:20 -06:00
Sergii Golubev	8430b379d8	string.asciidoc: fix for `position_increment_gap` Remove outdated and duplicate description for the `position_increment_gap` parameter.	2016-04-05 16:23:42 -04:00
Adrien Grand	26a0fb37a4	Add examples of useful dynamic templates to the docs. #17413	2016-03-31 09:45:11 +02:00
Adrien Grand	fc47007e17	Add a soft limit on the mapping depth. #17400 This commit adds the new `index.mapping.depth.limit` setting which controls the maximum mapping depth that is allowed. It has a default value of 20.	2016-03-30 14:37:00 +02:00
Yanjun Huang	361adcf387	Add limit to total number of fields in mapping. #17357 This is to prevent mapping explosion when dynamic keys such as UUID are used as field names. index.mapping.total_fields.limit specifies the total number of fields an index can have. An exception will be thrown when the limit is reached. The default limit is 1000. Value 0 means no limit. This setting is runtime adjustable Closes #11443	2016-03-29 19:39:46 +02:00
Adrien Grand	b42f66c8ac	Document 5.0 mapping changes.	2016-03-22 16:22:58 +01:00
Clinton Gormley	2fa573bc58	Missing word in docs	2016-03-10 14:34:05 +01:00
Nicholas Knize	55635d5de1	update coerce and breaking changes documentation	2016-03-09 16:09:44 -06:00
Nicholas Knize	61f39e6c92	GeoPointV2 update docs and query builders This commit updates the documentation for GeoPointField by removing all references to the coerce and doc_values parameters. DocValues are enabled in lucene GeoPointField by default (required for boundary filtering). The QueryBuilders are updated to automatically normalize points (ignoring the coerce parameter) for any index created onOrAfter version 2.2.	2016-03-09 16:09:44 -06:00
Jim Ferenczi	927303e7a9	Change the field mapping index time boost into a query time boost. Index time boost will still be applied for indices created before 5.0.0.	2016-03-04 11:47:35 +01:00
Clinton Gormley	05e3cd6b97	Merge pull request #16878 from peschlowp/patch-8 Update index-options.asciidoc	2016-03-02 10:52:44 +01:00
Clinton Gormley	812f03a33f	Merge pull request #16842 from anhlqn/patch-1 Fix minor spelling	2016-02-29 01:32:42 +01:00
Clinton Gormley	00b9640208	Merge pull request #16672 from teuneboon/patch-1 Clarify text about date format range	2016-02-15 16:16:19 +01:00
Dongjoon Hyun	21ea552070	Fix typos in docs.	2016-02-09 02:07:32 -08:00
Adrien Grand	209860854d	Make the `index` property a boolean. With the split of `string` into `text` and `keyword`, the `index` property can only have two values and should be a boolean.	2016-01-27 09:06:00 +01:00
Clinton Gormley	6aa1a4930e	Added back deprecation notices for _ttl and _timestamp	2016-01-26 11:56:36 +01:00
Robert Muir	6e7e3a2274	Update lucene to r1725675 Adds DFI (divergence from independence) provider. Fixes test bugs passing invalid values for BM25 parameters.	2016-01-20 03:32:51 -05:00
Rachit Gupta	5b2ded5c96	Fix typo in doc values docs Closes #16067	2016-01-19 05:58:39 -05:00
Yannick Welsch	a1b8dd2de9	Add per-index setting to limit number of nested fields Closes #14983	2016-01-19 10:03:48 +01:00
Felipe Forbeck	9965c83ae4	Documented how to define custom mappings for all indexes and all types Closes #15557	2016-01-12 13:35:29 +01:00
Clinton Gormley	9773cca58e	Merge pull request #15870 from rjruizes/patch-1 fix nested multi-value query	2016-01-10 10:06:40 +01:00
Adrien Grand	67d233cecd	Remove warmers and the warmer API. Warmers are now barely useful and will be removed in 3.0. Note that this only removes the warmer API and query-based warmers. We still have warmers internally for eg. global ordinals. Close #15607	2016-01-07 09:57:07 +01:00
Imran Azad	8081c782ef	Documented search_quote_analyzer in mapping types and detailed how to disable stop words as a potential use case.	2016-01-06 10:40:51 +01:00
Jim Ferenczi	81fd2169cf	Renames "default" similarity into "classic". Replaces deprecated DefaultSimilarity by ClassicSimilarity. Fixes #15102	2015-12-21 16:22:53 +01:00
umeku	0ce88b5887	Fix inaccurate docs for nested datatype Closes #15436	2015-12-15 15:15:00 +01:00
Clinton Gormley	061446b25a	Merge pull request #15304 from cjohansen/patch-1 Fix typo	2015-12-15 10:57:38 +01:00
Clinton Gormley	83ee1fc903	Merge pull request #15400 from TheDude05/fix-match_pattern-docs Fix docs with `match_pattern` in dynamic templates	2015-12-14 14:18:59 +01:00
Nicholas Knize	5f3d807f61	Update geo_shape/query docs, fix TermStrategy defaults This commit adds the following: * SpatialStrategy documentation to the geo-shape reference docs. * Updates relation documentation to geo-shape-query reference docs. * Updates GeoShapeFieldMapper to set points_only to true if TERM strategy is used (to be consistent with documentation)	2015-12-11 17:14:22 -06:00
Andrew Williams	e7127c9f6f	Fix docs with `match_pattern` in dynamic templates	2015-12-11 14:03:54 -06:00
Jim Ferenczi	9ab168dbf6	Removes all the reference of the query in the docs	2015-12-11 20:07:57 +01:00
Ben Tse	3cede749f9	fixed minor typo	2015-12-03 23:53:48 -05:00
Clinton Gormley	72be42d742	Document that _index is a virtual field and only supports term queries Closes #15070 Closes #15081	2015-11-30 08:43:23 +01:00
Jason Tedor	b6da075505	Fix typo in TTL field docs Closes #14994	2015-11-24 22:57:35 -05:00
David Pilato	5b0e2823b1	Merge branch 'docs/mapper-attachments'	2015-11-23 12:14:31 +01:00
Clinton Gormley	2293c0d8c8	Update token-count.asciidoc Fix typo	2015-11-20 19:00:52 +01:00
Clinton Gormley	728cc5137a	Merge pull request #14738 from petmit/patch-1 Update error in documentation for multi-fields	2015-11-17 17:33:53 +01:00
Adrien Grand	35c0b50879	Reword some documentation to make it more obvious that doc values are a columnar representation of the data. Some users may already be familiar with column stores, so saying more explicitly that doc values are a columnar representation of the data may help them better and/or more quickly understand what doc values are about.	2015-11-09 23:32:47 +01:00
David Pilato	e993c6a862	Migrate mapper attachments plugin to asciidoc Followup for #14605	2015-11-09 15:35:06 +01:00
Clinton Gormley	c49aaa1284	Merge pull request #14608 from jimmyjones2/patch-1 Update all-field.asciidoc	2015-11-09 13:43:25 +01:00
Clinton Gormley	dc018cf622	Updated docs for 3.0.0-beta	2015-10-07 13:27:46 +02:00
xuzha	a77c68ba0e	Fix position-increment-gap doc example	2015-09-23 08:04:43 -07:00
Nik Everett	b205875c43	Merge pull request #13515 from elastic/docsfix Fix for mappings->_source example in docs	2015-09-11 11:02:55 -04:00
Shane Connelly	d86c1e8769	Fixes #13417	2015-09-11 07:34:14 -07:00
Nicholas Knize	e4e71d8a9a	add points_only option to GeoShapeFieldMapper for optimizing indexing performance on geo_shape indexes designed to store only points. Includes updated documentation and exception handling for ensuring index integrity on points only data.	2015-09-08 16:17:50 -05:00
Clinton Gormley	2c20658204	Docs: Added deprecation notice for _timestamp and _ttl	2015-09-07 21:16:19 +02:00
Nik Everett	da16dcf527	[docs] Fix docs for position_increment_gap Closes #13207	2015-08-31 14:05:55 -04:00
Nik Everett	9eb684da51	Default detect_noop to true detect_noop is pretty cheap and noop updates comparatively expensive so this feels like a sensible default. Also had to do some testing and documentation around how _ttl works with detect_noop. Closes #11282	2015-08-27 10:34:18 -04:00
xuzha	9bd4a7b72e	Fix doc build	2015-08-26 16:02:36 -07:00
xuzha	fb2be6d6a1	The name "position_offset_gap" is confusing because Lucene has three similar sounding things: * Analyzer#getPositionIncrementGap * Analyzer#getOffsetGap * IndexOptions.DOCS_AND_FREQS_AND_POSITIONS_AND_OFFSETS and * FieldType#storeTermVectorOffsets Rename position_offset_gap to position_increment_gap closes #13056	2015-08-26 14:56:35 -07:00
Nik Everett	4b9664beeb	Mapping: Default position_offset_gap to 100 This is much more fiddly than you'd expect it to be because of the way position_offset_gap is applied in StringFieldMapper. Instead of setting the default to 100 it's simpler to make sure that all the analyzers default to 100 and that StringFieldMapper doesn't override the default unless the user specifies something different. Unless the index was created before 2.1, in which case the old default of 0 has to take. Also position_offset_gap values less than 0 aren't allowed at all. New tests test that: 1. the new default doesn't match phrases across values with reasonably low slop (5) 2. the new default does match phrases across values with reasonably high slop (50) 3. you can override the value and phrases work as you'd expect 4. if you leave the value undefined in the mapping and define it on a custom analyzer the value from the custom analyzer shines through Closes #7268	2015-08-25 14:21:50 -04:00
Adrien Grand	a91b3fcbb9	Move the `murmur3` field to a plugin and fix defaults. This move the `murmur3` field to the `mapper-murmur3` plugin and fixes its defaults so that values will not be indexed by default, as the only purpose of this field is to speed up `cardinality` aggregations on high-cardinality string fields, which only requires doc values. I also removed the `rehash` option from the `cardinality` aggregation as it doesn't bring much value (rehashing is cheap) and allowed to remove the coupling between the `cardinality` aggregation and the `murmur3` field. Close #12874	2015-08-18 11:41:52 +02:00
Clinton Gormley	5df5ab0451	Docs: Another bad asciidoc link	2015-08-15 18:25:34 +02:00
Clinton Gormley	b67741f5f3	Docs: Another bad asciidoc link	2015-08-15 18:22:28 +02:00
Clinton Gormley	43936c5fcd	Docs: Removed the _size field include	2015-08-15 18:12:31 +02:00
Clinton Gormley	e143c6e460	Docs: Prepare plugin and integration docs for 2.0 * Centralised plugin docs in docs/plugins/ * Moved integrations into same docs * Moved community clients into the clients section of the docs * Removed docs/community Closes #11734 Closes #11724 Closes #11636 Closes #11635 Closes #11632 Closes #11630 Closes #12046 Closes #12438 Closes #12579	2015-08-15 18:02:43 +02:00
Clinton Gormley	c6c3a40cb6	Docs: Updated annotations for 2.0.0-beta1	2015-08-14 10:51:09 +02:00
Clinton Gormley	f8b9ede81f	Documented the update_all_types setting on PUT mapping Added docs to each mapping param to specify which ones can be updated when	2015-08-12 21:21:37 +02:00
Clinton Gormley	9da8822aed	Docs: Made multi-fields more prominent	2015-08-06 20:09:42 +02:00
Clinton Gormley	0eb2ab915d	Docs: Fixed date format default option	2015-08-06 19:05:09 +02:00
Clinton Gormley	08687dfa3d	Docs: Fixed typo on string datatype page	2015-08-06 18:59:37 +02:00
Clinton Gormley	52663071c0	Docs: Removed redundant docs from field datatypes page.	2015-08-06 18:52:54 +02:00
Clinton Gormley	7977979146	Docs: Reorganised the mapping home page	2015-08-06 18:44:07 +02:00
Clinton Gormley	ac2b8951c6	Docs: Mapping docs completely rewritten for 2.0	2015-08-06 17:24:51 +02:00
loopmachine	5de2044c5b	Update nested-type.asciidoc mapping example	2015-08-04 14:02:03 -04:00
Ryan Ernst	8cd03cce5e	Merge branch 'master' into fix/12329	2015-07-21 00:29:34 -07:00
Ryan Ernst	1c99626b84	Mappings: Remove ability to configure _index The `_index` field is now a completely virtual field thanks to #12027. It is no longer necessary to index the actual value of the index name. closes #12329	2015-07-20 23:54:35 -07:00
Clinton Gormley	c56ce0e242	Docs: Refactored the mapping meta-fields docs	2015-07-20 01:26:27 +02:00
Clinton Gormley	2b512f1f29	Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting	2015-07-14 18:14:09 +02:00
John Roesler	f86e8c33c1	Docfix: ignore_above uses string length, not utf-8 ignore_above is used to guard against the lucene limitation that a term cannot exceed 32766 bytes. However, the implementation just used the character count, which doesn't take into account the fact that some characters have multi-byte utf-8 encodings. This commit updates the docs to make this relationship clear. Closes #11563	2015-07-10 18:47:21 +02:00
Clinton Gormley	6c0badd0b3	Docs: Updated the source field docs to remove deprecation of includes/excludes Also provide warnings about why disabling source is probably something you don't want to do Closes #12141	2015-07-10 15:52:30 +02:00
Alexander Reelsen	b612cab96a	Dates: More strict parsing of ISO dates If you are using the default date or the named identifiers of dates, the current implementation was allowed to read a year with only one digit. In order to make this more strict, this fixes a year to be at least 4 digits. Same applies for month, day, hour, minute, seconds. Also the new default is `strictDateOptionalTime` for indices created with Elasticsearch 2.0 or newer. In addition a couple of not exposed date formats have been exposed, as they have been mentioned in the documentation. Closes #6158	2015-07-07 09:34:37 +02:00
Martijn van Groningen	53874bf5a6	aliases: Parse aliases at search time and never cache parsed alias filters The work around for resolving `now` doesn't need to be used for aliases, because alias filters are parsed at search time. However it can't be removed, because the percolator relies on it. Parent/child can be specified again in alias filters, this now works again because alias filters are parsed at search time. Parent/child will also use the late query parse work around, to make sure to do the final preparations when the search context is around. This allows the aliases api to validate the parent/child queries without failing because there is no search context. Closes #10485	2015-07-01 21:20:54 +02:00
Christoph Büscher	f5f73259e4	Docs: Update Joda URLs in documentation.	2015-06-26 10:23:02 +02:00
Christoph Büscher	ba9bbf7e66	Docs: Update date-format.asciidoc Joda documentation moved from http://joda-time.sourceforge.net/ to http://www.joda.org/joda-time/. Updated the links in the documentation accordingly.	2015-06-26 09:49:29 +02:00
Alexander Reelsen	23cf9af495	Dates: Be backwards compatible with pre 2.x indices In order to be backwards compatible, indices created before 2.x must support indexing of a unix timestamp and its configured date format. Indices created with 2.x must configure the `epoch_millis` date formatter in order to support this. Relates #10971	2015-06-25 17:21:29 +02:00
Clinton Gormley	3105b4edbe	Update core-types.asciidoc Added an anchor for multi-fields in mappings	2015-06-24 21:36:37 +02:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00
Ryan Ernst	12e7cbe92b	Mappings: Lockdown _timestamp This is a follow up to #8143 and #6730 for _timestamp. It removes support for `path`, as well as any field type settings, and enables docvalues for _timestamp, for 2.0. Users who need to adjust these settings can use a date field.	2015-06-22 10:21:03 -07:00
Alexander Reelsen	38ddc8159c	Dates: Allow for negative unix timestamps This fixes an issue to allow for negative unix timestamps. An own printer for epochs instead of just having a parser has been added. Added docs that only 10/13 length unix timestamps are supported Added docs in upgrade documentation Fixes #11478	2015-06-22 11:56:31 +02:00
Robin Clarke	f13c216aa2	More information about 'Copy field to'	2015-06-09 16:35:49 +02:00
Alexander Reelsen	01e8eaf181	Date Parsing: Add parsing for epoch and epoch in milliseconds This commit changes the date handling. First and foremost Elasticsearch does not try to convert every date to a unix timestamp first and then uses the configured date. This now allows for dates like `2015121212` to be parsed correctly. Instead it is now explicit by adding a `epoch_second` and `epoch_millis` date format. This also means, that the default date format now is `epoch_millis||dateOptionalTime` to remain backwards compatible. Closes #5328 Relates #10971	2015-06-03 18:07:47 +02:00
Martijn van Groningen	359d9ac0d0	docs: added missing ids	2015-05-29 22:45:01 +02:00
Martijn van Groningen	1cfb6a79f1	Parent/child: refactored _parent field mapper and parent/child queries * Cut the `has_child` and `has_parent` queries over to use Lucene's query time global ordinal join. The main benefit of this change is that parent/child queries can now efficiently execute if parent/child queries are wrapped in a bigger boolean query. If the rest of the query only hit a few documents both has_child and has_parent queries don't need to evaluate all parent or child documents any more. * Cut the `_parent` field over to use doc values. This significantly reduces the on heap memory footprint of parent/child, because the parent id values are never loaded into memory. Breaking changes: * The `type` option on the `_parent` field can only point to a parent type that doesn't exist yet, so this means that an existing type/mapping can't become a parent type any longer. * The `has_child` and `has_parent` queries can no longer be used in alias filters. All these changes, improvements and breaks in compatibility only apply for indices created with ES version 2.0 or higher. For indices created with ES <= 2.0 the older implementation is used. It is highly recommended to re-index all your indices with parent and child documents to benefit from all the improvements that come with this refactoring. The easiest way to achieve this is by using the scan and bulk apis using a simple script. Closes #6107 Closes #8134	2015-05-29 21:44:17 +02:00
Colin Goodheart-Smithe	35a58d874e	Scripting: Unify script and template requests across codebase This change unifies the way scripts and templates are specified for all instances in the codebase. It builds on the Script class added previously and adds request building and parsing support as well as the ability to transfer script objects between nodes. It also adds a Template class which aims to provide the same functionality for template APIs Closes #11091	2015-05-29 16:52:04 +01:00
Adrien Grand	461683ac58	Mappings: Remove the `compress`/`compress_threshold` options of the BinaryFieldMapper. This option is broken currently since it potentially interprets an incoming binary value as compressed while it just happens that the first bytes are the same as the LZF header.	2015-05-22 14:20:42 +02:00
Ryan Ernst	e29492ce94	Docs: Cleanup meta field docs Meta fields were locked down to not allow exotic options to the underlying field types in #8143. This change fixes the docs to no longer refer to the old settings. closes #10879	2015-05-07 11:26:49 -07:00
Adrien Grand	a0af88e996	Query DSL: Remove filter parsers. This commit makes queries and filters parsed the same way using the QueryParser abstraction. This allowed to remove duplicate code that we had for similar queries/filters such as `range`, `prefix` or `term`.	2015-05-07 20:14:34 +02:00
Ryan Ernst	7a7bd6086a	Mappings: Remove ability to disable _source field Current features (eg. update API) and future features (eg. reindex API) depend on _source. This change locks down the field so that it can no longer be disabled. It also removes legacy settings compress/compress_threshold. closes #8142 closes #10915	2015-05-05 22:04:18 -07:00
Ryan Ernst	d2b12e4fc2	Mappings: Remove docs for type level analyzer defaults These settings were removed in #9430.	2015-04-30 13:57:55 -07:00
Ryan Ernst	4ef9f3ca63	Mappings: Remove file based default mappings Using files that must be specified on each node is an anti-pattern from the API based goal of ES. This change removes the ability to specify the default mapping with a file on each node. closes #10620	2015-04-30 13:50:35 -07:00
Adrien Grand	6e076efdb9	Docs: Add documentation for the `doc_values` setting on the `boolean` field type. Close #10431	2015-04-29 15:59:24 +02:00
Clinton Gormley	7aa4c7e256	Docs: Removed a reference to index_name from the array mapping page	2015-04-29 15:12:31 +02:00
Ryan Ernst	bf09e58cb3	Mappings: Remove includes and excludes from _source Regardless of the outcome of #8142, we should at least enforce that when _source is enabled, it is sufficient to reindex. This change removes the excludes and includes settings, since these modify the source, causing us to lose the ability to reindex some fields. closes #10814	2015-04-28 15:03:51 -07:00
Clinton Gormley	2579cc31b1	Docs: Note that include_in_parent/root does not apply to geo-shape fields Closes #10653	2015-04-25 16:49:49 +02:00
Nicholas Knize	453217fd7a	[GEO] Prioritize tree_level and precision parameters over default distance_error_pct If a user explicitly defined the tree_level or precision parameter in a geo_shape mapping their specification was always overridden by the default_error_pct parameter (even though our docs say this parameter is a 'hint'). This led to unexpected accuracy problems in the results of a geo_shape filter. (example provided in issue #9691) This simple patch fixes the unexpected behavior by setting the default distance_error_pct parameter to zero when the tree_level or precision parameters are provided by the user. Under the covers the quadtree will now use the tree level defined by the user. The docs will be updated to alert the user to exercise caution with these parameters. Specifying a precision of "1m" for an index using large complex shapes can quickly lead to OOM issues. closes #9691	2015-04-21 14:42:10 -05:00
Clinton Gormley	abc7de96ae	Docs: Updated version annotations in master	2015-04-09 14:50:11 +02:00
Adrien Grand	fae124103a	Merge pull request #10420 from jpountz/feature/numeric_resolution Mappings: Bring back numeric_resolution. Close #10420	2015-04-09 12:28:33 +02:00
Clinton Gormley	a95b11ca61	Document `doc_values` for field type `ip` Closes #9809	2015-04-04 17:51:28 +02:00
Adrien Grand	c7115f8364	Mappings: Bring back numeric_resolution. We had an undocumented parameter called `numeric_resolution` which allows configuring how to deal with dates when provided as a number. The default is to handle them as milliseconds, but you can also opt in to e.g. seconds. Close #10072	2015-04-03 19:54:14 +02:00
Guillaume Dievart	adcb782423	Update core-types.asciidoc	2015-04-03 14:12:29 +02:00
Nicholas Knize	c2ec463cdb	[GEO] fix docs for geo_point "validate" option Documentation states false as the default for "validate", "validate_lon", and "validate_lat" leading to confusion as described in issue #9539. This simple fix corrects the documentation and communicates that these fields will be deprecated and removed in upcoming versions. closes #9539	2015-03-23 15:34:37 -05:00
David Pilato	0c8da6bb84	[doc] Link mapper-attachment type documentation to its repo As explained in elasticsearch/elasticsearch-mapper-attachments#101, we should have consistent documentation. The best option is to link the documentation in elasticsearch guide to the most recent README in the plugin repo. Closes #9756	2015-02-27 22:18:59 +01:00
Martijn van Groningen	daefb4c673	Docs: Document that the fielddata loading defaults to eager on the _parent field. Closes #9804	2015-02-22 23:15:59 +01:00
Clinton Gormley	20ece4acb5	Update core-types.asciidoc Provide an example of how to disable norms Closes #9641	2015-02-12 12:10:11 +01:00
Ryan Ernst	b3474f6b25	Mappings: Remove ability to set path for _id and _routing on 2.0+ indexes _id and _routing now no longer support the 'path' setting on indexes created with 2.0. Indexes created before 2.0 still support this setting for backcompat. closes #6730	2015-02-10 10:53:44 -08:00
Ryan Ernst	c6968883a7	Mappings: Remove support for new indexes using path setting in object/nested fields or index_name in any field Backcompat is still here for indexes created before 2.0. closes #6677	2015-02-05 12:44:43 -08:00
David Pilato	878e46d7f9	[Docs] fix missing space	2015-01-29 19:17:41 +01:00
Ryan Ernst	afcedb94ed	Mappings: Remove `index_analyzer` setting to simplify analyzer logic The `analyzer` setting is now the base setting, and `search_analyzer` is simply an override of the search time analyzer. When setting `search_analyzer`, `analyzer` must be set. closes #9371	2015-01-28 13:43:15 -08:00

541 Commits