OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	636e85e5b7	percolator: Hint what clauses are important in a conjunction query based on fields The percolator field mapper doesn't need to extract all terms and ranges from a bool query with must or filter clauses. To help the default extraction behavior, boost fields can be configured, so that fields that are known for not being selective enough can be ignored in favor of other fields, or clauses with specific fields can forcefully take precedence over other clauses. This can help select clauses for fields that don't match a lot of percolator queries over other clauses, thus improving performance of the percolate query. For example, a status-like field is something that should be configured as an ignore field. Queries on this field tend to match more documents, so if clauses for this field get selected as the best clause, that isn't very helpful for the candidate query that the percolate query generates to filter out percolator queries that are unlikely to match.	2017-08-11 15:32:01 +02:00
Martijn van Groningen	b88cfe2008	docs: Use stackexchange based example to make documentation easier to understand	2017-08-04 16:04:26 +02:00
Martijn van Groningen	ec7ac32772	docs: document work around for the percolator if query time text analysis is expensive.	2017-07-28 15:04:15 +02:00
Martijn van Groningen	7c3735bdc4	percolator: Store the QueryBuilder's Writable representation instead of its XContent representation. The Writable representation is less heavy to parse and that will benefit percolate performance and throughput. The query builder's binary format now has the same bwc guarantees as the xcontent format. Added a qa test that verifies that percolator queries written in older versions are still readable by the current version.	2017-07-28 12:24:10 +02:00
Martijn van Groningen	5cf56a846a	docs: Remove incorrect warning Closes #25935	2017-07-28 10:53:47 +02:00
Colin Goodheart-Smithe	f1f1725fcf	[DOCS] improve explanation of dynamic mapping setting (#25829 ) Closes #25825	2017-07-21 12:24:38 +01:00
Clinton Gormley	febb4bf7bc	Update removal_of_types.asciidoc Fixed `include_in_type` -> `include_type_name`	2017-07-20 19:18:51 +02:00
Clinton Gormley	f69decf509	NOCONSOLE -> NOTCONSOLE in removal-of-types	2017-07-19 14:06:04 +02:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large number of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices while filtering excludes all results outside a certain time range, e.g. `now-3d`. While the search can potentially hit hundreds of shards, the majority of the shards might yield 0 results since there is no document within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal, a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards, and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further, it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
James Baiera	847378a43b	Add another parent value option to join documentation (#25609 ) Indexing a join field on a document requires a value of type "object" and two sub fields "name" and "parent". The "parent" field is only required on child documents, but the "name" field which denotes the name of the relation is always needed. Previously, only the short-hand version of the join field was documented. This adds documentation for the long-hand join field data, and explicitly points out that just specifying the name of the relation for the field value is a convenience shortcut.	2017-07-11 15:36:59 -04:00
Martijn van Groningen	d0f9f425bd	parent/child: Removed ParentJoinFieldSubFetchPhase	2017-07-06 13:15:02 +02:00
Adrien Grand	26de905f1e	Fix the documentation to state that the `_id` field is indexed. (#25540 )	2017-07-05 16:09:31 +02:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
Martijn van Groningen	9ce9c21b83	docs: added percolator script query limitation	2017-06-28 17:10:30 +02:00
Nathan Taylor	645bb9d0fb	Docs: Removed duplicated line in mapping docs	2017-06-21 10:47:19 +02:00
Jim Ferenczi	afada69ea9	[Docs] more fix for the parent-join docs	2017-06-16 12:49:16 +02:00
Jim Ferenczi	664193185e	[Docs] Fix cross reference for parent-join field	2017-06-16 11:53:16 +02:00
Jim Ferenczi	ccb3c9aae7	Add documentation for the new parent-join field (#25227 ) * Add documentation for the new parent-join field This commit adds the docs for the new parent-join field. It explains how to define, index and query this new field. Relates #20257	2017-06-16 11:13:23 +02:00
Russ Cam	f6821c41d8	Add half_float and scaled float (#22988 ) to numeric datatypes (cherry picked from commit 67ea06145a80d5ec52ba55d1f2e1e8287e1882b1)	2017-06-13 09:54:44 +10:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Jim Ferenczi	8250aa4267	Remove the postings highlighter and make unified the default highlighter choice (#25028 ) This change removes the `postings` highlighter. This highlighter has been removed from Lucene master (7.x) because it behaves exactly like the `unified` highlighter when index_options is set to `offsets`: https://issues.apache.org/jira/browse/LUCENE-7815 It also makes the `unified` highlighter the default choice for highlighting a field (if `type` is not provided). The strategy used internally by this highlighter remains the same as before: it checks `term_vectors` first, then `postings`, and ultimately it re-analyzes the text. Finally, it rewrites the docs so that the options that the `unified` highlighter cannot handle are clearly marked as such. There are a few features that the `unified` highlighter is not able to handle, which is why the other highlighters (`plain` and `fvh`) are still available. I'll open separate issues for these features and we'll deprecate the `fvh` and `plain` highlighters when full support for these features has been added to the `unified`.	2017-06-09 14:09:57 +02:00
Andrey Groshev	e4fd8485ce	Made the same length of opening and closing lines (#23583 )	2017-06-09 00:50:43 -07:00
Jim Ferenczi	ad905924ae	update docs that claim that classic is the default similarity	2017-06-09 09:22:48 +02:00
Adrien Grand	ebf806d38f	Reorganize docs of global ordinals. (#24982 ) Currently global ordinals are documented under `fielddata`. It moves them to their own file since they also work with doc values and fielddata is on the way out. Closes #23101	2017-06-01 16:47:44 +02:00
markharwood	b7197f5e21	SignificantText aggregation - like significant_terms, but for text (#24432 ) * SignificantText aggregation - like significant_terms but doesn’t require fielddata=true, recommended for use with the `sampler` agg to limit the expense of tokenizing docs, and takes an optional `filter_duplicate_text`:true setting to avoid stats skew from repeated sections of text in search results. Closes #23674	2017-05-24 13:46:43 +01:00
Adrien Grand	a72eaa8e0f	Identify documents by their `_id`. (#24460 ) Now that indices have a single type by default, we can move to the next step and identify documents using their `_id` rather than the `_uid`. One notable change in this commit is that I made deletions implicitly create types. This helps with the live version map in the case that documents are deleted before the first type is introduced. Otherwise there would be no way to differentiate `DELETE index/foo/1` followed by `PUT index/foo/1` from `DELETE index/bar/1` followed by `PUT index/foo/1`, even though those are different if versioning is involved.	2017-05-09 16:33:52 +02:00
Nicholas Knize	0c4eb0a029	Add new ip_range field type This commit adds support for indexing and searching a new ip_range field type. Both IPv4 and IPv6 formats are supported. Tests are updated and docs are added.	2017-05-05 09:43:42 -05:00
Nik Everett	a01f846226	CONSOLEify a few more docs Adds CONSOLE to cross-cluster-search docs but skips them for testing because we don't have a second cluster set up. This gets us the `VIEW IN CONSOLE` and `COPY AS CURL` links and makes sure that they are valid yaml (not json, technically) but doesn't get testing. Which is better than we had before. Adds CONSOLE to the dynamic templates docs and ingest-node docs. The ingest-node docs contain a ton of non-console snippets. We might want to convert them to full examples later, but that can be a separate thing. Relates to #18160	2017-05-04 21:01:14 -04:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Danilo Akamine	0adaf9fb4c	Drop `search_analyzer` parameter from keyword.asciidoc (#24221 ) `search_analyzer` isn't supported by `keyword` fields so this removes it from the documentation for them.	2017-04-25 12:49:50 -04:00
Nik Everett	e429d66956	CONSOLEify some more docs Relates to #18160	2017-04-24 16:08:19 -04:00
Fabien Baligand	4a45579506	token_count type : add an option to count tokens (fix #23227 ) (#24175 ) Add option "enable_position_increments" with default value true. If option is set to false, indexed value is the number of tokens (not position increments count)	2017-04-21 00:53:28 +02:00
Loek van Gool	e11d892562	Update field-names-field.asciidoc (#24178 ) fix typo in field name	2017-04-19 11:57:37 +02:00
Martijn van Groningen	3d9671a668	[PERCOLATOR] Allowing range queries with now ranges inside percolator queries. Before, now ranges were forbidden, because the percolator query itself could get cached and then the percolator queries with now ranges that should no longer match would incorrectly continue to match. By disabling caching when the `percolator` is being used, the percolator can now correctly support range queries with now based ranges. I think this is the right tradeoff. The percolator query is likely to not be the same between search requests, and disabling range queries with now ranges really prevented people from using the percolator for their use cases. Also fixed an issue that existed in the percolator fieldmapper: it was unable to find forbidden queries inside `dismax` queries. Closes #23859	2017-04-07 08:44:43 +02:00
Lee Hinman	b6b9ef8e26	[DOCS] Remove line about eager loading global ordinals Fielddata can no longer be configured to be loaded eagerly (it only accepts `true` and `false`), so this line is a little misleading because it talks about a procedure we can no longer do.	2017-04-03 12:56:21 -06:00
Nik Everett	653f50973a	CONSOLEify geo-shape docs `CONSOLE`ify geo-shape type and geo-shape query docs. Relates to #18160	2017-03-31 09:11:54 -04:00
Nik Everett	5f91241f57	CONSOLEify geo aggregation docs Turns the top example in each of the geo aggregation docs into a working example that can be opened in CONSOLE. Subsequent examples can all also be opened in console and will work after you've run the first example. All examples are tested as part of the build.	2017-03-30 21:28:52 -04:00
Ali Beyad	8359dd05c9	Adds boolean similarity to Elasticsearch (#23637 ) This commit adds the boolean similarity scoring from Lucene to Elasticsearch. The boolean similarity provides a means to specify that a field should not be scored with typical full-text ranking algorithms, but rather just whether the query terms match the document or not. Boolean similarity scores a query term equal to its query boost only. Boolean similarity is available as a default similarity option and thus a field can be specified to have boolean similarity by declaring in its mapping: "similarity": "boolean" Closes #6731	2017-03-28 10:17:23 -04:00
Martijn van Groningen	b116b8f0cb	[DOCS] Update the docs about the fact that global ordinals for _parent field are loaded eagerly instead of lazily by default. Relates to #8053	2017-03-22 10:39:39 +01:00
Lee Hinman	b3c27a7fdd	Disallow include_in_all for 6.0+ indices Since `_all` is now deprecated and cannot be set for new indices, we should also disallow any field that has the `include_in_all` parameter set. Resolves #22923	2017-02-07 19:31:51 -07:00
AlexNodex	fb8bdbc57a	Update typo in date (#22955 ) your example has yyy and it should be yyyy	2017-02-03 13:16:17 +01:00
Clinton Gormley	19ce039d2d	Update type-field.asciidoc Wildcard type names are not supported	2017-01-27 17:50:28 +01:00
Yannick Welsch	881993de3a	[Docs] Remove outdated info about enabling/disabling doc_values (#22694 )	2017-01-19 17:33:40 +01:00
Daniel Mitterdorfer	aece89d6a1	Make boolean conversion strict (#22200 ) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error.	2017-01-19 07:59:18 +01:00
Scott Somerville	372812da98	Allow an index to be partitioned with custom routing (#22274 ) This change makes it possible for custom routing values to go to a subset of shards rather than just a single shard. This enables the ability to utilize the spatial locality that custom routing can provide while mitigating the likelihood of ending up with an imbalanced cluster or suffering from a hot shard. This is ideal for large multi-tenant indices with custom routing that suffer from one or both of the following: - The big tenants cannot fit into a single shard or there are so many of them that they will likely end up on the same shard - Tenants often have a surge in write traffic and a single shard cannot process it fast enough Beyond that, this should also be useful for use cases where most queries are done under the context of a specific field (e.g. a category) since it gives a hint at how the data can be stored to minimize the number of shards to check per query. While a similar solution can be achieved with multiple concrete indices or aliases per value today, those approaches break down for high cardinality fields. A partitioned index enforces that mappings have routing required, that the partition size does not change when shrinking an index (the partitions will shrink proportionally), and rejects mappings that have parent/child relationships. Closes #21585	2017-01-18 08:51:23 +01:00
Alex	a0c83c4511	Minor doc changes to clarify mapping index param for string type (#22652 ) * Grammatical correction * Add note for legacy string mapping type * Update truncate token filter to not mention the keyword tokenizer The advice predates the existence of the keyword field Closes #22650	2017-01-17 16:43:11 +01:00
Lee Hinman	7a18bb50fc	Disable _all by default This change disables the _all meta field by default. Now that we have the "all-fields" method of query execution, we can save both indexing time and disk space by disabling it. _all can no longer be configured for indices created after 6.0. Relates to #20925 and #21341 Resolves #19784	2017-01-11 16:47:13 -07:00
Nik Everett	75d5b3d9eb	Fix parent_id example in docs And fix some indentation I noticed while looking up the query.	2017-01-10 10:01:31 -05:00
Clinton Gormley	cb7952e71d	Docs: Parent field is no longer indexed and should use parent_id instead of term query Closes #22517	2017-01-10 13:48:07 +01:00
Jason Veatch	20f90178fe	Docs: Detail on false/strict dynamic mapping setting (#22451 ) Reference: https://www.elastic.co/guide/en/elasticsearch/guide/master/dynamic-mapping.html	2017-01-05 14:36:18 -05:00
Adrien Grand	3f805d68cb	Add the ability to set an analyzer on keyword fields. (#21919 ) This adds a new `normalizer` property to `keyword` fields that pre-processes the field value prior to indexing, but without altering the `_source`. Note that only the normalization components that work on a per-character basis are applied, so for instance stemming filters will be ignored while lowercasing or ascii folding will be applied. Closes #18064	2016-12-30 09:36:10 +01:00
Adrien Grand	84edf36f11	Make `-0` compare less than `+0` consistently. (#22173 ) Our `float`/`double` fields generally assume that `-0` compares less than `+0`, except when bounds are exclusive: an exclusive lower bound on `-0` excludes `+0` and an exclusive upper bound on `+0` excludes `-0`. Closes #22167	2016-12-21 16:51:45 +01:00
Adrien Grand	9524c81af9	Document the `locale` option of the `date` field. (#22050 ) This also adds another level of protection against using the default locale. Relates to https://discuss.elastic.co/t/mapping-for-12h-date-format/68433/3.	2016-12-09 09:45:53 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Clinton Gormley	5555e85619	Document that the PUT mapping API with the _default_ type overwrites instead of merging Closes #8215	2016-11-26 12:43:56 +01:00
Clinton Gormley	a4e88bb64a	Fixed bad asciidoc in boolean mapping docs	2016-11-15 17:50:23 +00:00
Lee Hinman	96122aa518	Be strict when parsing values searching for booleans (#21555 ) This changes only the query parsing behavior to be strict when searching on boolean values. We continue to accept the variety of values during index time, but searches will only be parsed using `"true"` or `"false"`. Resolves #21545	2016-11-15 10:36:57 -07:00
Alexander Lin	0219a211d3	Allows multiple patterns to be specified for index templates (#21009 ) * Allows for an array of index template patterns to be provided to an index template, and rename the field from 'template' to 'index_pattern'. Closes #20690	2016-11-10 18:00:30 -05:00
LakumiNarayanan	5af6deb5b5	Fix typo in keyword.asciidoc (#21237 )	2016-11-01 10:15:12 -04:00
Lee Hinman	6a8bad8a06	[DOCS] Document all date formats (#21164 ) Resolves #21046	2016-10-31 09:15:36 -06:00
Jun Ohtani	a66c76eb44	Merge pull request #20704 from johtani/remove_request_params_in_analyze_api Removing request parameters in _analyze API	2016-10-27 17:43:18 +09:00
Colin Goodheart-Smithe	c1a9833445	Correct similarity default for 5.0 (#21144 )	2016-10-27 09:33:21 +01:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Jun Ohtani	370f0b885e	Removing request parameters in _analyze API Remove request params in _analyze API without index param Change rest-api-test using JSON Change docs using JSON Closes #20246	2016-10-07 16:23:24 +09:00
Anatolii Stepaniuk	f895abcf40	Fix grammar issues in some docs This commit fixes some grammar issues in various docs. Closes #20751 Closes #20752 Closes #20754 Closes #20755	2016-10-05 11:20:45 -04:00
Lee Hinman	3f77eacab1	Revert "Default `include_in_all` for numeric-like types to false" This reverts commit `6666892038`.	2016-09-28 07:07:46 -06:00
Clinton Gormley	e3b7b4f032	Reorganised docs for mapping safeguard settings	2016-09-22 14:58:17 +02:00
Martijn van Groningen	ad7c22198c	docs: describe more explicitly what happens when indexing queries that fetch terms	2016-09-22 10:00:11 +00:00
David Pilato	dfd1eebdd0	Remove mapper attachments plugin We now have in 5.0.0 `ingest-attachment` plugin. We can remove `mapper-attachments` plugin for 6.0. Closes #18837.	2016-09-19 09:01:16 +02:00
Nicholas Knize	598bab93ae	[DOC] Cleanup dangling references to deprecated geo parameters With the cut over to LatLonPoint the geohash, geohash_precision, lat_lon, and geohash_prefix parameters have been removed. This commit fixes the doc build by removing the remaining dangling references to these removed parameters.	2016-09-13 16:38:38 -05:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Lee Hinman	40b088d728	Rework documentation example for _all to be less ambiguous with numerics	2016-09-08 09:09:48 -06:00
Lee Hinman	6666892038	Default `include_in_all` for numeric-like types to false This includes: - All regular numeric types such as int, long, scaled-float, double, etc - IP addresses - Dates - Geopoints and Geoshapes Relates to #19784	2016-09-08 09:09:48 -06:00
Nik Everett	e03fb602cd	Add CONSOLE places where it is obviously missing These places already have other annotations like `// TEST` and `// TESTSETUP` so they are already in console format.	2016-09-06 10:48:19 -04:00
Nik Everett	9c3f6d58ac	Support downgrading keyword/text into string This changes Elasticsearch to automatically downgrade `text` and `keyword` fields into appropriate `string` fields when changing the mapping of indexes imported from 2.x. This allows users to use the modern, documented syntax against 2.x indexes. It also makes it clear that reindexing in order to recreate the index in 5.0 is required for any long lived indexes. This change is useful for the times when you can't (cluster is just starting, not stable enough for reindex) or shouldn't (index will only live 90 days or something).	2016-08-29 11:27:37 -04:00
Munish Goyal	81b815ff76	Correct grammar in parent field doc	2016-08-29 07:51:39 -04:00
Nik Everett	5b34bec92a	Add deprecation warnings to docs for geohash Relates to #20126	2016-08-23 13:43:35 -04:00
Lee Hinman	3298a4ed38	Revert "Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'" This reverts commit `514585290c`, reversing changes made to `8563c8d897`.	2016-08-23 09:24:33 -06:00
Nicholas Knize	8234fad9ca	Deprecate geohash parameters for geo_point parser This commit deprecates all geohash parameters in the geo_point field parser.	2016-08-23 09:19:21 -05:00
Simon Willnauer	d685847b73	Use `refresh=true` in mapping/fields examples (#20120 ) Fix field examples to make documents actually visible This commit adds refresh calls to field examples and removes the non-working `_routing` and `_field_names` script access. Closes #20118	2016-08-23 13:32:14 +02:00
Lee Hinman	514585290c	Merge remote-tracking branch 'dakrone/exclude-numerics-from-all'	2016-08-22 12:36:25 -06:00
Munish Goyal	f9c17dd976	Correct sentence (#20088 )	2016-08-22 16:20:14 +02:00
Jim Ferenczi	4bee565535	Fix docs stating that index.mapper.dynamic can be set for all nodes in the elasticsearch.yml file. This is not supported in 5.x (index settings cannot be set at the cluster level) and should be replaced with a template for all indices.	2016-08-22 10:20:43 +02:00
Lee Hinman	b6ec1ae6eb	Rework documentation example for _all to be less ambiguous with numerics	2016-08-19 16:44:38 -06:00
Lee Hinman	d7e516c0b4	Default `include_in_all` for numeric-like types to false This includes: - All regular numeric types such as int, long, scaled-float, double, etc - IP addresses - Dates - Geopoints and Geoshapes Relates to #19784	2016-08-19 15:50:38 -06:00
David Pilato	97dfa2ba40	Fix typo Reported at https://discuss.elastic.co/t/little-error-in-documentation-page-mapping-parameters-format/57424	2016-08-08 10:52:09 +02:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Adrien Grand	398d70b567	Add `scaled_float`. #19264 This is an attempt to revive #15939 motivated by elastic/beats#1941. Half-floats are a pretty bad option for storing percentages. They would likely require 2 bytes all the time while they don't need more than one byte. So this PR exposes a new `scaled_float` type that requires a `scaling_factor` and internally indexes `value*scaling_factor` in a long field. Compared to the original PR it exposes a lower-level API so that the trade-offs are clearer and avoids any reference to fixed precision that might imply that this type is more accurate (actually it is *less* accurate). In addition to being more space-efficient for some use-cases that beats is interested in, this is also faster than `half_float` unless we can improve the efficiency of decoding half-float bits (which is currently done using software) or until Java gets first-class support for half-floats.	2016-07-18 12:36:23 +02:00
Nik Everett	7aeea764ba	Remove wait_for_status=yellow from the docs It is no longer required after `687e2e12b3`.	2016-07-15 16:02:07 -04:00
Clinton Gormley	05271d58ca	Updated fielddata docs to make it easier for users with old mappings	2016-07-14 19:58:12 +02:00
Martijn van Groningen	ff5527f037	percolator: Forbid the usage of `range` queries with a range based on the current time If there are percolator queries containing `range` queries with ranges based on the current time then this can lead to incorrect results if the `percolate` query gets cached. These ranges are changing each time the `percolate` query gets executed and if this query gets cached then the results will be based on how the range was at the time when the `percolate` query got cached. The ExtractQueryTermsService has been renamed `QueryAnalyzer` and now only deals with analyzing the query (extracting terms and deciding if the entire query is a verified match). The `PercolatorFieldMapper` is responsible for adding the right fields based on the analysis the `QueryAnalyzer` has performed, because this is highly dependent on the field mappings. Also the `PercolatorFieldMapper` is responsible for creating the percolate query.	2016-07-08 14:20:56 +02:00
Britta Weber	f36c1b4e60	Update fielddata.asciidoc	2016-07-05 16:21:52 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Jim Ferenczi	6d2df0dc18	Fix docs example for the _id field, the field is not accessible in scripts	2016-06-29 15:25:51 +02:00
Robert Muir	6d52cec2a0	Merge pull request #19092 from rmuir/more_painless_docs cutover some docs to painless	2016-06-28 13:40:25 -04:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Robert Muir	6fc1a22977	cutover some docs to painless	2016-06-27 09:55:16 -04:00
Martijn van Groningen	0cae9ad30e	docs: removed obsolete information, percolator queries are no longer loaded into jvm heap memory.	2016-06-23 15:32:26 +02:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Adrien Grand	7d63f4b8db	Fix doc build.	2016-06-22 09:34:49 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Clinton Gormley	0160d91c2c	Removed docs for precision_step - no longer used	2016-06-21 15:19:12 +02:00
Adrien Grand	9ffb2ff6ba	Expose half-floats. #18887 They have been implemented in https://issues.apache.org/jira/browse/LUCENE-7289. Ranges are implemented so that the accuracy loss only occurs at index time, which means that if you are searching for values between A and B, the query will match exactly all documents whose value rounded to the closest half-float point is between A and B.	2016-06-16 09:46:39 +02:00
Jim Ferenczi	6d62f33702	Make doc_values accessible for _type `doc_values` for _type field are created but any attempt to load them throws an IAE. This PR re-enables `doc_values` loading for _type, it also enables `fielddata` loading for indices created between 2.0 and 2.1 since doc_values were disabled during that period. It also restores the old docs that gives example on how to sort or aggregate on _type field.	2016-05-25 18:56:13 +02:00
G. Richard Bellamy	cf54903580	Support full range of Java Long for epoch DateTime Remove the arbitrary limit on epoch_millis and epoch_seconds of 13 and 10 characters, respectively. Instead allow any character combination that can be converted to a Java Long. Update the docs to reflect this change.	2016-05-22 13:08:20 -07:00
Clinton Gormley	97a41ee973	First pass at improving analyzer docs (#18269) * Docs: First pass at improving analyzer docs I've rewritten the intro to analyzers plus the docs for all analyzers to provide working examples. I've also removed: * analyzer aliases (see #18244) * analyzer versions (see #18267) * snowball analyzer (see #8690) Next steps will be tokenizers, token filters, char filters * Fixed two typos	2016-05-11 14:17:56 +02:00
Clinton Gormley	3f594089c2	Renamed all AUTOSENSE snippets to CONSOLE (#18210)	2016-05-09 15:42:23 +02:00
Clinton Gormley	b352a90454	Correct docs for dynamic mapping of fields Floating point numbers are added as `float`, and Strings are added as `text` with `keyword` sub-field	2016-05-07 17:16:31 +02:00
Nik Everett	cb40b986d1	Allow leading `/` in AUTOSENSE path Relates to #18160	2016-05-06 09:26:19 -04:00
Clinton Gormley	c55df195c5	Fixed bad asciidoc	2016-05-06 09:25:58 +02:00
Nik Everett	f3b2ab822d	Another wait_for_yellow to the docs All in service of the snippets passing consistently.	2016-05-05 19:03:23 -04:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Adrien Grand	80dbe31d59	Add note about using ipv6 addresses in `query_string`.	2016-05-04 08:53:11 +02:00
Clinton Gormley	7c8397d99b	Update keyword.asciidoc `ignore_above` doesn't apply to analyzed `text` fields	2016-05-02 13:47:14 +02:00
Robin Joseph	e322903f2c	Fix typo in include-in-all.asciidoc (#18055)	2016-04-29 18:03:22 +02:00
Shane Connelly	713c0df3a3	Merge pull request #17994 from eskibars/master Add new IPv6 types to docs where it's supported	2016-04-29 06:00:32 -07:00
Clinton Gormley	84a2b4e17e	Update id-field.asciidoc Clarified which queries support the `_id` field	2016-04-28 13:36:14 +02:00
Christoph Büscher	a2c3b5cae1	Update keyword.asciidoc	2016-04-27 12:10:19 +02:00
Shane Connelly	aff148f532	Add new IPv6 types to docs where it's supported	2016-04-26 11:38:49 -07:00
Martijn van Groningen	81449fc912	percolator: renamed `percolator` query to `percolate` query	2016-04-20 15:23:54 +02:00
Martijn van Groningen	40c22fc654	percolator: removed .percolator type instead a field of type `percolator` should be configured before indexing percolator queries * Added an extra `field` parameter to the `percolator` query to indicate what percolator field should be used. This must be an existing field in the mapping of type `percolator`. * The `.percolator` type is now forbidden. (just like any type that starts with a `.`) This only applies for new indices created on 5.0 and later. Indices created on previous versions the .percolator type is still allowed to exist. The new `percolator` field type isn't active in such indices and the `PercolatorQueryCache` knows how to load queries from these legacy indices. The `PercolatorQueryBuilder` will not enforce that the `field` parameter is of type `percolator`.	2016-04-19 11:20:31 +02:00
LeonardGC	0b8be7f894	Update field-mapping.asciidoc (#17670)	2016-04-15 09:22:38 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. - Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Nik Everett	0f9804b0e2	reindex: gracefully handle when _source is disabled Closes #17666	2016-04-13 08:19:58 -04:00
Ibrahim Awwal	5121060e75	Fix typo in templates.asciidoc The doc mentions match_path in one place but the correct syntax is path_match which is mentioned everywhere else. Using the wrong string leads to errors because the mapping becomes too greedy, and matches things it shouldn't.	2016-04-06 16:40:20 -06:00
Sergii Golubev	8430b379d8	string.asciidoc: fix for `position_increment_gap` Remove outdated and duplicate description for the `position_increment_gap` parameter.	2016-04-05 16:23:42 -04:00
Adrien Grand	26a0fb37a4	Add examples of useful dynamic templates to the docs. #17413	2016-03-31 09:45:11 +02:00
Adrien Grand	fc47007e17	Add a soft limit on the mapping depth. #17400 This commit adds the new `index.mapping.depth.limit` setting which controls the maximum mapping depth that is allowed. It has a default value of 20.	2016-03-30 14:37:00 +02:00
Yanjun Huang	361adcf387	Add limit to total number of fields in mapping. #17357 This is to prevent mapping explosion when dynamic keys such as UUID are used as field names. index.mapping.total_fields.limit specifies the total number of fields an index can have. An exception will be thrown when the limit is reached. The default limit is 1000. Value 0 means no limit. This setting is runtime adjustable Closes #11443	2016-03-29 19:39:46 +02:00
Adrien Grand	b42f66c8ac	Document 5.0 mapping changes.	2016-03-22 16:22:58 +01:00
Clinton Gormley	2fa573bc58	Missing word in docs	2016-03-10 14:34:05 +01:00
Nicholas Knize	55635d5de1	update coerce and breaking changes documentation	2016-03-09 16:09:44 -06:00
Nicholas Knize	61f39e6c92	GeoPointV2 update docs and query builders This commit updates the documentation for GeoPointField by removing all references to the coerce and doc_values parameters. DocValues are enabled in lucene GeoPointField by default (required for boundary filtering). The QueryBuilders are updated to automatically normalize points (ignoring the coerce parameter) for any index created onOrAfter version 2.2.	2016-03-09 16:09:44 -06:00
Jim Ferenczi	927303e7a9	Change the field mapping index time boost into a query time boost. Index time boost will still be applied for indices created before 5.0.0.	2016-03-04 11:47:35 +01:00
Clinton Gormley	05e3cd6b97	Merge pull request #16878 from peschlowp/patch-8 Update index-options.asciidoc	2016-03-02 10:52:44 +01:00
Clinton Gormley	812f03a33f	Merge pull request #16842 from anhlqn/patch-1 Fix minor spelling	2016-02-29 01:32:42 +01:00
Clinton Gormley	00b9640208	Merge pull request #16672 from teuneboon/patch-1 Clarify text about date format range	2016-02-15 16:16:19 +01:00
Dongjoon Hyun	21ea552070	Fix typos in docs.	2016-02-09 02:07:32 -08:00
Adrien Grand	209860854d	Make the `index` property a boolean. With the split of `string` into `text` and `keyword`, the `index` property can only have two values and should be a boolean.	2016-01-27 09:06:00 +01:00
Clinton Gormley	6aa1a4930e	Added back deprecation notices for _ttl and _timestamp	2016-01-26 11:56:36 +01:00
Robert Muir	6e7e3a2274	Update lucene to r1725675 Adds DFI (divergence from independence) provider. Fixes test bugs passing invalid values for BM25 parameters.	2016-01-20 03:32:51 -05:00
Rachit Gupta	5b2ded5c96	Fix typo in doc values docs Closes #16067	2016-01-19 05:58:39 -05:00
Yannick Welsch	a1b8dd2de9	Add per-index setting to limit number of nested fields Closes #14983	2016-01-19 10:03:48 +01:00
Felipe Forbeck	9965c83ae4	Documented how to define custom mappings for all indexes and all types Closes #15557	2016-01-12 13:35:29 +01:00
Clinton Gormley	9773cca58e	Merge pull request #15870 from rjruizes/patch-1 fix nested multi-value query	2016-01-10 10:06:40 +01:00
Adrien Grand	67d233cecd	Remove warmers and the warmer API. Warmers are now barely useful and will be removed in 3.0. Note that this only removes the warmer API and query-based warmers. We still have warmers internally for eg. global ordinals. Close #15607	2016-01-07 09:57:07 +01:00
Imran Azad	8081c782ef	Documented search_quote_analyzer in mapping types and detailed how to disable stop words as a potential use case.	2016-01-06 10:40:51 +01:00
Jim Ferenczi	81fd2169cf	Renames "default" similarity into "classic". Replaces deprecated DefaultSimilarity by ClassicSimilarity. Fixes #15102	2015-12-21 16:22:53 +01:00
umeku	0ce88b5887	Fix inaccurate docs for nested datatype Closes #15436	2015-12-15 15:15:00 +01:00

424 Commits