OpenSearch

Commit Graph

Author	SHA1	Message	Date
James Rodewig	922a80c3f4	[DOCS] Add collapsible sections to search API response (#55887 )	2020-05-04 16:57:10 -04:00
Dan Hermann	9892813842	[7.x] Delay warning about missing x-pack (#56142 ) * Delay warning about missing x-pack (#54265) Currently, when monitoring is enabled in a freshly-installed cluster, the non-master nodes log a warning message indicating that master may not have x-pack installed. The message is often printed even when the master does have x-pack installed but takes some time to setup the local exporter for monitoring. This commit adds the local exporter setting `wait_master.timeout` which defaults to 30 seconds. The setting configures the time that the non-master nodes should wait for master to setup monitoring. After the time elapses, they log a message to the user about possible missing x-pack installation on master. The logging of this warning was moved from `resolveBulk()` to `openBulk()` since `resolveBulk()` is called only on cluster updates and the message might not be logged until a new cluster update occurs. Closes #40898	2020-05-04 14:16:18 -05:00
Lisa Cawley	b816ab0c18	[DOCS] Synchs and links hyperparameter descriptions (#56131 )	2020-05-04 10:37:26 -07:00
Julie Tibshirani	6b5cf1b031	For constant_keyword, make sure exists query handles missing values. (#55757 ) It's possible for a constant_keyword to have a 'null' value before any documents are seen that contain a value for the field. In this case, no documents have a value for the field, and 'exists' queries should return no documents.	2020-05-04 09:41:52 -07:00
James Rodewig	4faf5a7916	[DOCS] Reformat `porter_stem` token filter (#56053 ) Makes the following changes to the `porter_stem` token filter docs: * Rewrites description and adds a Lucene link * Adds detailed analyze example * Adds an analyzer example	2020-05-04 10:39:17 -04:00
bellengao	722de7dd98	[Docs] Fix typo in match-bool-prefix-query doc (#56077 )	2020-05-04 14:19:23 +02:00
bellengao	40f99119ae	[Docs] Fix typo in getting-started-slm doc (#56075 )	2020-05-04 14:18:00 +02:00
markharwood	e197b6c45b	Analysis enhancement - add preserve_original setting in ngram-token-filter (#55432 ) (#56100 ) Authored-by: Amit Khandelwal <amitmbm87@gmail.com>	2020-05-04 11:31:28 +01:00
Christos Soulios	c65f828cb7	[7.x] Histogram field type support for ValueCount and Avg aggregations (#56099 ) Backports #55933 to 7.x Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 13:23:02 +03:00
Armin Braun	3a64ecb6bf	Allow Deleting Multiple Snapshots at Once (#55474 ) (#56083 ) * Allow Deleting Multiple Snapshots at Once (#55474) Adds deleting multiple snapshots in one go without significantly changing the mechanics of snapshot deletes otherwise. This change does not yet allow mixing snapshot delete and abort. Abort is still only allowed for a single snapshot delete by exact name.	2020-05-03 20:30:58 +02:00
William Brafford	d53c941c41	Make xpack.monitoring.enabled setting a no-op (#55617 ) (#56061 ) * Make xpack.monitoring.enabled setting a no-op This commit turns xpack.monitoring.enabled into a no-op. Mostly, this involved removing the setting from the setup for integration tests. Monitoring may introduce some complexity for test setup and teardown, so we should keep an eye out for turbulence and failures * Docs for making deprecated setting a no-op	2020-05-01 16:42:11 -04:00
David Turner	69f50fe79f	Improve same-shard allocation explanations (#56010 ) I see occasional confusion about the explanations emitted by the same-shard allocation decider, particularly amongst new users setting up a single-node cluster and trying to determine why their cluster has `yellow` health. For example: the shard cannot be allocated to the same node on which a copy of the shard already exists This is technically correct but it's quite a complicated sentence. Also, by starting with "the shard cannot be allocated" it makes it sound like this is the problem, whereas in fact this message is a good thing and users should typically focus their attention elsewhere. This commit simplifies the wording of these messages and makes them sound more positive, for example: a copy of this shard is already allocated to this node	2020-05-01 10:07:14 +01:00
Vanessa Bell	9b005f70ef	[Docs] Grammar improvements in disk-usage.asciidoc (#55620 )	2020-04-30 19:31:56 +02:00
Karel Minarik	81dcc9aef7	[DOCS] Add link to the Go client bulk helper (#55872 )	2020-04-30 13:24:00 -04:00
James Rodewig	61cf646f17	[DOCS] EQL: Add advantages to overview (#53452 ) (#56052 ) Adds a concise list of EQL advantages, based on the "EQL Advantages" section in the [EQL for the masses][0] blog post. The intent is to inform users how EQL could benefit at a high level. [0]: https://www.elastic.co/blog/eql-for-the-masses Co-Authored-By: Ross Wolf <31489089+rw-access@users.noreply.github.com>	2020-04-30 13:19:31 -04:00
Igor Motov	d8f9df771d	Expose agg usage in Feature Usage API (#55732 ) (#56048 ) Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746	2020-04-30 12:53:36 -04:00
James Rodewig	e4e02e133e	[DOCS] Remove approximate document counts example from term agg docs (#55442 ) Removes an example from the "Document counts are approximate" section of the terms agg documentation. As #52377 details, the example was no longer accurate in 7.x or 6.8. Document counts were more precise than the example presented. We've opened issue #56025 to discuss re-adding an example later. Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: AB Prashanth <panuradh@buffalo.edu>	2020-04-30 10:11:50 -04:00
William Brafford	273ff6a105	Make xpack.ilm.enabled setting a no-op (#55592 ) (#55980 ) * Make xpack.ilm.enabled setting a no-op * Add watcher setting to not use ILM * Update documentation for no-op setting * Remove NO_ILM ml index templates * Remove unneeded setting from test setup * Inline variable definitions for ML templates * Use identical parameter names in templates * New ILM/watcher setting falls back to old setting * Add fallback unit test for watcher/ilm setting	2020-04-30 09:50:18 -04:00
Lisa Cawley	006e00ed0a	[DOCS] Adds documentation for secondary authorization headers (#55365 ) (#55986 )	2020-04-29 16:29:38 -07:00
James Rodewig	65b47d20a6	[DOCS] Update attribute for multi arg footnotes (#55860 )	2020-04-29 10:25:36 -04:00
James Rodewig	1808a1f36b	[DOCS] EQL: Correct `cidrMatch` function heading (#55935 )	2020-04-29 10:02:06 -04:00
István Zoltán Szabó	337dc45f5b	[DOCS] Adds missing space and a relevant link to the slm execute API page (#55917 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-04-29 15:50:06 +02:00
James Rodewig	bbf68de446	[DOCS] Correct Lucene link in `kstem` token filter docs	2020-04-29 09:30:37 -04:00
Luca Cavanna	8b05027bf0	[DOCS] Clarify async search response flags (#55574 ) Relates to #55572	2020-04-29 15:22:05 +02:00
James Rodewig	767836c367	[DOCS] Reformat `kstem` token filter (#55823 ) Makes the following changes to the `kstem` token filter docs: * Rewrite description and adds a Lucene work * Adds detailed analyze example * Adds an analyzer example	2020-04-29 08:52:55 -04:00
Christos Soulios	02bf0c586a	[7.x] Histogram field type support for Sum aggregation (#55916 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285 Backports #55681 to 7.x	2020-04-29 15:06:12 +03:00
Henning Andersen	f679880b80	[DOCS] Create index name required (#55886 ) The name of the new index to create is required. Relates #45749	2020-04-29 13:35:49 +02:00
István Zoltán Szabó	e982cf4381	[DOCS] Makes the footnotes less verbose in configuring aggs page. (#55857 )	2020-04-29 09:52:29 +02:00
debadair	8a662c7e62	ILM update backports (#55902 ) * [DOCS] Rework conceptual info for ILM. (#52181) * [DOCS] Rework conceptual info for ILM. * Split the actions out of concepts. * Added xpack role to actions. Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Apply suggestions from code review * Edit actions for consistency and add action template. (#55632) * Edit actions for consistency and add action template. * Update docs/reference/ilm/actions/ilm-readonly.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Apply suggestions from code review	2020-04-28 16:38:01 -07:00
Lee Hinman	4315a55a1c	[7.x] Initial documentation for index templates V2 (#55755 ) (#55898 ) Backports the following commits to 7.x: - Initial documentation for index templates V2 (#55755)	2020-04-28 16:10:50 -06:00
Henning Andersen	cab7bcc156	Disk decider respect watermarks for single data node (#55805 ) (#55847 ) The disk decider had special handling for the single data node case, allowing any allocation (skipping watermark checks) for such clusters. This special handling can now be avoided via a setting.	2020-04-28 18:46:22 +02:00
Lee Hinman	777caf0725	[7.x] Add support for V2 index templates to /_cat/templates (#55829 ) (#55866 ) Backports the following commits to 7.x: - Add support for V2 index templates to /_cat/templates (#55829)	2020-04-28 10:14:19 -06:00
Larry Gregory	47d252424b	Backport: Deprecate the kibana reserved user (#54967 ) (#55822 )	2020-04-28 10:30:25 -04:00
James Rodewig	ddc7305ac9	[DOCS] Correct search API's timeout parm default (#55855 )	2020-04-28 09:44:50 -04:00
James Rodewig	386fb16409	[DOCS] SQL: Update link for supported regex in `RLIKE` docs (#55830 ) The`RLIKE` function docs points users to [Java’s Pattern class doc][0] for regular expression syntax. However, these docs include shorthand character classes, such as `[\d]`, `[\s]`, and `[\w]`. These character classes are not supported in Elasticsearch, which may confuse users. This updates the SQL `RLIKE` docs to refer to the ES [regular expression syntax docs][1], which only documents supported syntax. [0]: https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/regex/Pattern.html [1]: https://www.elastic.co/guide/en/elasticsearch/reference/master/regexp-syntax.html Relates to #55231	2020-04-28 09:25:51 -04:00
James Rodewig	452be22a4d	[DOCS] Warn about searching across all fields wt. `query_string` (#55853 ) Warn about potential performance impact when a large number of fields is used with query string query and no default field. Re-adds content from #35570. That content was erroneously removed in #45296. Co-authored-by: Peter Dyson <peter.dyson@geekpete.com>	2020-04-28 09:20:21 -04:00
Adrien Grand	58c3bb5ae1	Repurpose `ignore_throttled` to be only about frozen indices. (#55047 ) (#55852 ) This has no practical impact on users since frozen indices are the only throttled indices today. However this has an impact on upcoming features that would use search throttling. Filtering out throttled indices made sense a couple years ago, but as we're now improving support for slow requests with `_async_search` and exploring ways to reduce storage costs, this feature has most likely become a trap, that we'd like to not have with upcoming features that would use search throttling. Relates #54058	2020-04-28 14:31:54 +02:00
Amit Khandelwal	126e4acca8	Expose `preserve_original` in `edge_ngram` token filter (#55766 ) The Lucene `preserve_original` setting is currently not supported in the `edge_ngram` token filter. This change adds it with a default value of `false`. Closes #55767	2020-04-28 10:24:27 +02:00
István Zoltán Szabó	a5cf4712e5	[DOCS] Changes feature importance links to point to the new page (#55531 ) * [DOCS] Changes feature importance links to point to the new page. * [DOCS] Fixes line breaks.	2020-04-28 09:03:43 +02:00
James Rodewig	c16b1edae0	[DOCS] EQL: Fix whitespace in `stringContains` docs	2020-04-27 15:53:59 -04:00
James Rodewig	8df5cff9c1	[DOCS] Correct stemmer token filters anchor	2020-04-27 14:57:59 -04:00
James Rodewig	5b8a18c756	[DOCS] Correct stemmer token filter anchor	2020-04-27 14:51:51 -04:00
David Roberts	3ba44a5af8	[ML] Adding failed_category_count to model_size_stats (#55761 ) The failed_category_count statistic records the number of times categorization wanted to create a new category but couldn't because the job had reached its model_memory_limit. Backport of #55716	2020-04-25 10:36:49 +01:00
James Rodewig	c1b0548db0	[DOCS] Document EQL search REST API (#52384 )	2020-04-24 15:36:01 -04:00
James Rodewig	5981412bf7	[DOCS] EQL: Document `stringContains` function (#54968 )	2020-04-24 15:09:05 -04:00
James Rodewig	e4ebe55d04	[DOCS] EQL: Document `cidrMatch` function (#54216 ) (#55739 )	2020-04-24 14:01:11 -04:00
James Rodewig	e0a8adb5b2	[DOCS] Reformat `stemmer` token filter (#55693 ) Makes the following changes to the `stemmer` token filter docs: * Adds detailed analyze example * Rewrites parameter definitions * Adds custom analyzer example * Adds a `language` value for the `estonian` stemmer * Reorders the `language` values to show recommended algorithms first, followed by other values alphabetically	2020-04-24 11:25:01 -04:00
James Rodewig	96285b90c1	[DOCS] Add stemming concept docs (#55156 ) Adds conceptual documentation for stemming, including: * An overview of why stemming is helpful in search * Algorithmic vs. dictionary stemming * Token filters used to control stemming, such as `stemmer_override`, `keyword_marker`, and `conditional`	2020-04-24 11:01:28 -04:00
Christoph Büscher	f95a741ad3	[Docs] Fix fuzziness example in match-query.asciidoc (#55715 ) The example looks the same as in the previous section although it should use the "fuzziness" parameter. This seems to be okay on 6.8 and master and was probably only forgotten to port to 7.x branches.	2020-04-24 16:21:40 +02:00
Zachary Tong	715c90bf7d	Aggs must specify a `field` or `script` (or both) (#52226 ) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early)	2020-04-23 19:23:41 -04:00
James Rodewig	e74fdacabd	[DOCS] Add admonition for EQL exact matches on text fields (#53402 ) (#55670 ) Adds a important admonition to the EQL syntax page noting that the equal (`==`) operator should not be used to match `text` field values. Relates to #52709 and #53020	2020-04-23 10:59:50 -04:00
István Zoltán Szabó	5813dfdcc7	[7.x][DOCS] Adds ML related items to release highlights (#55652 )	2020-04-23 11:58:32 +02:00
Lisa Cawley	314ca78e31	[7.x][DOCS] Update example and nesting in get data frame analytics job stats API (#55612 )	2020-04-22 10:58:26 -07:00
James Rodewig	8d05d7dace	[DOCS] Add collapsible sections to 7.x breaking changes (#55334 ) Adds collapsible sections and new format to the 7.x breaking changes. Relates to #53229.	2020-04-22 10:56:38 -04:00
James Rodewig	6f9513915d	[DOCS] Add 'how to' doc about avoiding oversharding (#55480 ) Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>	2020-04-22 10:44:16 -04:00
James Rodewig	414f9c98f3	[DOCS] Document missing bulk API response parameters (#55414 ) Documents several parameters missing from the bulk API's response body docs. Also moves several response-related chunks of text to the response body section. Relates to #55237	2020-04-22 09:48:03 -04:00
David Roberts	2dc5586afe	[ML] Add effective max model memory limit to ML info (#55581 ) The ML info endpoint returns the max_model_memory_limit setting if one is configured. However, it is still possible to create a job that cannot run anywhere in the current cluster because no node in the cluster has enough memory to accommodate it. This change adds an extra piece of information, limits.effective_max_model_memory_limit, to the ML info response that returns the biggest model memory limit that could be run in the current cluster assuming no other jobs were running. The idea is that the ML UI will be able to warn users who try to create jobs with higher model memory limits that their jobs will not be able to start unless they add a bigger ML node to their cluster. Backport of #55529	2020-04-22 12:28:50 +01:00
David Roberts	da5aeb8be7	[ML] Return assigned node in start/open job/datafeed response (#55570 ) Adds a "node" field to the response from the following endpoints: 1. Open anomaly detection job 2. Start datafeed 3. Start data frame analytics job If the job or datafeed is assigned to a node immediately then this field will return the ID of that node. In the case where a job or datafeed is opened or started lazily the node field will contain an empty string. Clients that want to test whether a job or datafeed was opened or started lazily can therefore check for this. Backport of #55473	2020-04-22 12:06:53 +01:00
István Zoltán Szabó	0ce3406033	[DOCS] Provides further details on aggregations in datafeeds (#55462 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-04-22 08:54:52 +02:00
James Rodewig	777ffd5801	[DOCS] Add bulk API example with failures (#55412 ) Adds an example for bulk API requests that include failures. Also documents guidance on use the `filter_path` parameter to narrow the bulk API response for errors. Closes #55237	2020-04-21 16:22:23 -04:00
James Baiera	2a5f1f49a9	Add enrich metricset from 7.5 (#54791 ) (#55356 ) Co-authored-by: Julien Guay <guay_j@yahoo.fr>	2020-04-21 12:39:08 -04:00
James Rodewig	b9dfd12e7e	[DOCS] Remove 'Testing' chapter (#55270 ) (#55532 ) Removes the 'Testing' chapter from the Elasticsearch Reference guide. This chapter was originally written for so that users using the Java HLRC client could use the same test classes when testing Elasticsearch in their own applications. However, this is no longer the case or recommended. Closes #55257.	2020-04-21 10:29:58 -04:00
Paul Sanwald	0f7917b94b	add release notes for 7.5.2 (#51259 ) Adds release notes for 7.5.2	2020-04-21 08:19:46 -04:00
Benjamin Trent	24d41eb695	[ML] partitions model definitions into chunks (#55260 ) (#55484 ) This paves the data layer way so that exceptionally large models are partitioned across multiple documents. This change means that nodes before 7.8.0 will not be able to use trained inference models created on nodes on or after 7.8.0. I chose the definition document limit to be 100. This SHOULD be plenty for any large model. One of the largest models that I have created so far had the following stats: ~314MB of inflated JSON, ~66MB when compressed, ~177MB of heap. With the chunking sizes of `16 * 1024 * 1024` its compressed string could be partitioned to 5 documents. Supporting models 20 times this size (compressed) seems adequate for now.	2020-04-20 16:08:54 -04:00
David Turner	8e618fdf10	Adjust docs for voting config exclusions API (#55006 ) In #50836 we deprecated the existing voting config exclusions API and added a new one. This commit adjust the docs to match.	2020-04-20 19:47:33 +01:00
Lee Hinman	9eddd2bcc9	[7.x] Add prefer_v2_templates flag and index setting (#55411 ) (#55476 ) This commit adds a new querystring parameter on the following APIs: - Index - Update - Bulk - Create Index - Rollover These APIs now support a `?prefer_v2_templates=true\|false` flag. This flag changes the preference creation to use either V2 index templates or V1 templates. This flag defaults to `false` and will be changed to `true` for 8.0+ in subsequent work. Additionally, setting this flag internally sets the `index.prefer_v2_templates` index-level setting. This setting is used so that actions that automatically create a new index (things like rollover initiated by ILM) will inherit the preference from the original index. This setting is dynamic so that a transition from v1 to v2 templates can occur for long-running indices grouped by an alias performing periodic rollover. This also adds support for sending this parameter to the High Level Rest Client. Relates to #53101	2020-04-20 12:05:42 -06:00
jmceniery	99409e8c95	[DOCS] Remove Wikipedia link from `SUM_OF_SQUARES` SQL function docs (#52398 ) Removed the link to Wikipedia as the function is not calculating the sum of squares in this way. More can be found here at this issue: https://github.com/elastic/elasticsearch/issues/50416	2020-04-20 09:59:59 -04:00
Ben Skelker	74f55ec6fa	[DOCS] Add `ip_range` datatype to core datatypes range list (#55446 )	2020-04-20 08:55:09 -04:00
William Brafford	49e30b15a2	Deprecate disabling basic-license features (#54816 ) (#55405 ) We believe there's no longer a need to be able to disable basic-license features completely using the "xpack..enabled" settings. If users don't want to use those features, they simply don't need to use them. Having such features always available lets us build more complex features that assume basic-license features are present. This commit deprecates settings of the form "xpack..enabled" for basic-license features, excluding "security", which is a special case. It also removes deprecated settings from integration tests and unit tests where they're not directly relevant; e.g. monitoring and ILM are no longer disabled in many integration tests.	2020-04-17 15:04:17 -04:00
Andrei Dan	ef338ee3d4	ILM DOCS: mention forcemerge is best effort (#54794 ) (#55401 ) (cherry picked from commit 3fd05435c52dd265dbe1a40104e7dc7a335d50ae) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-17 15:42:23 +01:00
James Rodewig	f87a3f0c48	[DOCS] Document analysis/mapping response for cluster stats API (#55054 ) PR #51260 moved usage counts about mapping field types and analysis to the `_cluster/stats` API. This documents those stats in the response section of the cluster stats API docs.	2020-04-17 08:44:10 -04:00
Adrien Grand	0cb6a1f089	Document the index corruption bug that gets fixed via Lucene 8.5.1. (#55232 ) Using soft deletes on shrunk indices may cause corruption.	2020-04-17 13:37:37 +02:00
markharwood	7761b01a33	Remove normalizer support from wildcard field while we decide on approach for handling case insensitvity (#55294 ) (#55375 ) Closes #55288	2020-04-17 11:43:26 +01:00
Marios Trivyzas	f958e9abdc	SQL: Implement scripting inside aggs (#55241 ) (#55371 ) Implement the use of scalar functions inside aggregate functions. This allows for complex expressions inside aggregations, with or without GROUBY as well as with or without a HAVING clause. e.g.: ``` SELECT MAX(CASE WHEN a IS NULL then -1 ELSE abs(a * 10) + 1 END) AS max, b FROM test GROUP BY b HAVING MAX(CASE WHEN a IS NULL then -1 ELSE abs(a * 10) + 1 END) > 5 ``` Scalar functions are still not allowed for `KURTOSIS` and `SKEWNESS` as this is currently not implemented on the ElasticSearch side. Fixes: #29980 Fixes: #36865 Fixes: #37271 (cherry picked from commit 506d1beea7abb2b45de793bba2e349090a78f2f9)	2020-04-17 12:41:22 +02:00
Lisa Cawley	c7cf6e621d	[DOCS] Remove text fields from classification dependent variables (#54849 )	2020-04-16 13:40:28 -07:00
Lisa Cawley	cf5278f771	[DOCS] Add ml-cpp PRs to 7.7 release notes (#55264 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-04-16 11:28:34 -07:00
Julie Tibshirani	d7cded8d7a	Fix updating include_in_parent/include_in_root of nested field. (#55326 ) The main changes are: 1. Throw an error when updating `include_in_parent` or `include_in_root` attribute of nested field dynamically by the PUT mapping API. 2. Add a test for the change. Closes #53792 Co-authored-by: bellengao <gbl_long@163.com>	2020-04-16 11:17:12 -07:00
James Rodewig	f0b9be8b1b	[DOCS] Reformat `flatten_graph` token filter (#54268 ) * [DOCS] Reformat `flatten_graph` token filter Makes the following changes to the `flatten_graph` token filter docs: * Rewrites description and adds Lucene link * Adds detailed analyze example * Adds analyzer example	2020-04-16 08:35:08 -04:00
Bogdan Pintea	b88dd47de3	Docs: add the change log for 7.7 (#55019 ) * Add the change log for 7.7 Add the change log for 7.7 * Update rel. notes to latest state (BC5) Update the release notes to current state (i.e. BC5). * Update docs/reference/release-notes/7.7.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-15 15:25:08 -04:00
Lisa Cawley	f0b9578684	[DOCS] Removes transform performance note (#55177 )	2020-04-15 10:42:52 -07:00
James Rodewig	4f2ab96f38	[DOCS] EQL: Document `indexOf` function (#55071 )	2020-04-15 11:29:50 -04:00
James Rodewig	8d6f0f6a76	[DOCS] Document `max_concurrent_searches` default (#55116 )	2020-04-15 10:04:23 -04:00
Benjamin Trent	8ff2cbf1a3	[7.x] [ML] adding prediction_field_type to inference config (#55128 ) (#55230 ) * [ML] adding prediction_field_type to inference config (#55128) Data frame analytics dynamically determines the classification field type. This field type then dictates the encoded JSON that is written to Elasticsearch. Inference needs to know about this field type so that it may provide the EXACT SAME predicted values as analytics. Here is added a new field `prediction_field_type` which indicates the desired type. Options are: `string` (DEFAULT), `number`, `boolean` (where close_to(1.0) == true, false otherwise). Analytics provides the default `prediction_field_type` when the model is created from the process.	2020-04-15 09:45:22 -04:00
Jake Landis	85139fad7e	[7.x] Advise a simpler curator migration (#54457 ) (#55188 ) Advice for migrating from Curator should simply be to phase out curator managed indices, since curator will ignore ILM indices https://www.elastic.co/guide/en/elasticsearch/client/curator/5.7/ilm-and-curator.html#ilm-and-curator. Co-authored-by: Jay Greenberg <PhaedrusTheGreek@users.noreply.github.com>	2020-04-15 07:55:31 -05:00
Lisa Cawley	2910d01179	[DOCS] Removes unshared sections from ml-shared.asciidoc (#55192 )	2020-04-14 18:47:09 -07:00
Yang Wang	f49354b7d7	Add migration notes for deprecating local parameter of get field mapping API (#55194 ) This is a follow-up for #55099 to add migration notes about the deprecation of local parameter for get field mappings API.	2020-04-15 11:38:05 +10:00
Igor Motov	1754e50cbd	[7.x] Add analytics plugin usage stats to _xpack/usage (#54911 ) (#55162 ) Adds analytics plugin usage stats to _xpack/usage. Closes #54847	2020-04-14 17:03:14 -04:00
James Rodewig	12130843ca	[DOCS] Add maintenance releases to upgrade table (#55012 ) Updates the supported upgrade path table in [Upgrade Elasticsearch][0] to include a new row for maintenance releases. For example, this row covers upgrading from 7.6.0 to 7.6.2. The new table row only displays for releases greater than n.x.0. For example, the new row will display for the 7.7.1 release but not the 7.7.0 release. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/setup-upgrade.html	2020-04-14 11:28:55 -04:00
James Rodewig	3fbd8b371f	[DOCS] Use consistent line breaks in EQL function docs	2020-04-14 10:17:45 -04:00
Yannick Welsch	a610513ec7	Provide repository-level stats for searchable snapshots (#55051 ) Provides basic repository-level stats that will allow us to get some insight into how many requests are actually being made by the underlying SDK. Currently only tracks GET and LIST calls for S3 repositories. Most of the code is unfortunately boiler plate to add a new endpoint that will help us better understand some of the low-level dynamics of searchable snapshots.	2020-04-14 14:34:08 +02:00
lcawl	fcd96db006	[DOCS] Edits create data frame analytics job API (#54751 )	2020-04-13 10:43:52 -07:00
Nhat Nguyen	96bb1164f0	Support hierarchical task cancellation (#54757 ) With this change, when a task is canceled, the task manager will cancel not only its direct child tasks but all also its descendant tasks. Closes #50990	2020-04-13 12:35:21 -04:00
Igor Motov	51c6f69e02	[7.x] Add support for filters to T-Test aggregation (#54980 ) (#55066 ) Adds support for filters to T-Test aggregation. The filters can be used to select populations based on some criteria and use values from the same or different fields. Closes #53692	2020-04-13 12:28:58 -04:00
James Rodewig	57d6493e29	[DOCS] EQL: Document `string` function (#55086 )	2020-04-13 11:23:45 -04:00
Peter Dyson	f0b6cf4c11	[DOCS] Note where ILM policies are stored and backup caveats (#54859 )	2020-04-13 09:11:16 -06:00
Vishal Patel	16921ebbd8	[DOCS] Collapse nested objects in Explore API docs (#55067 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-04-13 09:27:03 -04:00
Ioannis Kakavas	7a8a66d9ae	[7.x] Fix ReloadSecureSettings API to consume password (#54771 ) (#55059 ) The secure_settings_password was never taken into consideration in the ReloadSecureSettings API. This commit fixes that and adds necessary REST layer testing. Doing so, it also: - Allows TestClusters to have a password protected keystore so that it can be set for tests. - Adds a parameter to the run task so that elastisearch can be run with a password protected keystore from source.	2020-04-13 09:50:55 +03:00
Yang Wang	862799956c	Deprecate local parameter for get field mapping request (#55014 ) (#55099 ) The usage of local parameter for GetFieldMappingRequest has been removed from the underlying transport action since v2.0. This PR deprecates the parameter from rest layer. It will be removed in next major version.	2020-04-12 13:48:47 +10:00
James Rodewig	2655dfa2fe	[DOCS] EQL: Reword field support for EQL functions (#55074 ) Changes boilerplate sentence of "If using a field as the argument, this parameter only supports..." to "...this parameter supports only...". The latter is a bit more clear and readable.	2020-04-10 15:33:29 -04:00
Jason Tedor	d1137ebdaa	Passthrough special characters in thread pool docs (#55080 ) Some of these characters are special to Asciidoctor and they ruin the rendering on this page. Instead, we use a macro to passthrough these characters without Asciidoctor applying any subtitutions to them. This commit then addresses some rendering issues in the thread pool docs. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-04-10 15:11:19 -04:00
Nik Everett	b99a50bcb9	value_count Aggregation optimization (backport of #54854 ) (#55076 ) We found some problems during the test. Data: 200Million docs, 1 shard, 0 replica hits \| avg \| sum \| value_count \| ----------- \| ------- \| ------- \| ----------- \| 20,000 \| .038s \| .033s \| .063s \| 200,000 \| .127s \| .125s \| .334s \| 2,000,000 \| .789s \| .729s \| 3.176s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| The performance of `avg`, `sum` and other is very close when performing statistics, but the performance of `value_count` has always been poor, even not on an order of magnitude. Based on some common-sense knowledge, we think that `value_count` and sum are similar operations, and the time consumed should be the same. Therefore, we have discussed the agg of `value_count`. The principle of counting in es is to traverse the field of each document. If the field is an ordinary value, the count value is increased by 1. If it is an array type, the count value is increased by n. However, the problem lies in traversing each document and taking out the field, which changes from disk to an object in the Java language. We summarize its current problems with Elasticsearch as: - Number cast to string overhead, and GC problems caused by a large number of strings - After the number type is converted to string, sorting and other unnecessary operations are performed Here is the proof of type conversion overhead. ``` // Java long to string source code, getChars is very time-consuming. public static String toString(long i) { int size = stringSize(i); if (COMPACT_STRINGS) { byte[] buf = new byte[size]; getChars(i, size, buf); return new String(buf, LATIN1); } else { byte[] buf = new byte[size * 2]; StringUTF16.getChars(i, size, buf); return new String(buf, UTF16); } } ``` test type \| average \| min \| max \| sum ------------ \| ------- \| ---- \| ----------- \| ------- double->long \| 32.2ns \| 28ns \| 0.024ms \| 3.22s long->double \| 31.9ns \| 28ns \| 0.036ms \| 3.19s long->String \| 163.8ns \| 93ns \| 1921 ms \| 16.3s particularly serious. Our optimization code is actually very simple. It is to manage different types separately, instead of uniformly converting to string unified processing. We added type identification in ValueCountAggregator, and made special treatment for number and geopoint types to cancel their type conversion. Because the string type is reduced and the string constant is reduced, the improvement effect is very obvious. hits \| avg \| sum \| value_count \| value_count \| value_count \| value_count \| value_count \| value_count \| \| \| \| double \| double \| keyword \| keyword \| geo_point \| geo_point \| \| \| \| before \| after \| before \| after \| before \| after \| ----------- \| ------- \| ------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| 20,000 \| 38s \| .033s \| .063s \| .026s \| .030s \| .030s \| .038s \| .015s \| 200,000 \| 127s \| .125s \| .334s \| .078s \| .116s \| .099s \| .278s \| .031s \| 2,000,000 \| 789s \| .729s \| 3.176s \| .439s \| .348s \| .386s \| 3.365s \| .178s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 2.700s \| 2.500s \| 2.600s \| 25.192s \| 1.278s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| 18.990s \| 19.000s \| 20.000s \| 168.971s \| 9.093s \| - The results are more in line with common sense. `value_count` is about the same as `avg`, `sum`, etc., or even lower than these. Previously, `value_count` was much larger than avg and sum, and it was not even an order of magnitude when the amount of data was large. - When calculating numeric types such as `double` and `long`, the performance is improved by about 8 to 9 times; when calculating the `geo_point` type, the performance is improved by 18 to 20 times.	2020-04-10 13:16:39 -04:00
James Rodewig	c440754784	[DOCS] EQL: Document `wildcard` function (#54086 )	2020-04-10 09:18:29 -04:00
oneoneonepig	356cc94889	[DOCS] Fix double quote typo in 7.0 breaking changes (#55040 )	2020-04-10 09:11:51 -04:00
Jason Tedor	9eeae59a83	Clarify available processors (#54907 ) The use of available processors, the terminology, and the settings around it have evolved over time. This commit cleans up some places in the codes and in the docs to adjust to the current terminology.	2020-04-10 08:48:27 -04:00
James Rodewig	51326432be	[DOCS] Add query reference docs template (#52292 )	2020-04-10 08:47:54 -04:00
James Rodewig	d5a609a2e5	[DOCS] Add token filter reference docs template (#52290 ) Creates a reusable template for token filter reference documentation. Contributors can make a copy of this template and customize it when documenting new token filters.	2020-04-10 08:45:10 -04:00
Marios Trivyzas	bf0cadb602	SQL: Implement DATETIME_PARSE function for parsing strings (#54960 ) (#55035 ) Implement DATETIME_PARSE(<datetime_str>, <pattern_str>) function which allows to parse a datetime string according to the specified pattern into a datetime object. The patterns allowed are those of java.time.format.DateTimeFormatter. Relates to #53714 (cherry picked from commit 3febcd8f3cdf9fdda4faf01f23a5f139f38b57e0)	2020-04-10 01:16:29 +02:00
Vishal Patel	51cb0c5c7b	[DOCS] Collapse nested objects in cluster reroute docs (#54851 )	2020-04-09 15:29:22 -04:00
István Zoltán Szabó	374f633b6e	[DOCS] Adds link points to the data frame analytics supported fields (#55004 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-04-09 11:27:57 -07:00
James Rodewig	c6cd8ca7c0	[DOCS] Update upgrade docs for 7.7 (#54978 )	2020-04-08 16:23:08 -04:00
James Rodewig	964cf565c9	[DOCS] EQL: Document `between` function (#54950 )	2020-04-08 13:49:15 -04:00
Théophile Helleboid - chtitux	a8aa36d427	[DOCS] Fix typo in SLM retention docs (#54797 )	2020-04-08 08:56:45 -04:00
Marios Trivyzas	6afd60b082	SQL: Implement DATETIME_FORMAT function for date/time formatting (#54832 ) (#54942 ) Implement DATETIME_FORMAT(<date/datetime/time>, ) function which allows for formatting a timestamp to the specified format. The patterns allowed as those of java.time.format.DateTimeFormatter. Related to #53714 (cherry picked from commit 72be0b54a9299e87e785469cdc9aafac2a48c046)	2020-04-08 13:45:47 +02:00
István Zoltán Szabó	3a3effedc2	[DOCS] Reworks some parts of EMM API docs (#54872 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-04-08 10:20:34 +02:00
Julie Tibshirani	475b210eec	Improve guidance on removing default mappings. (#54915 ) In 7.x, an index template will fail to apply if it contains a `_default_` mapping. Several users have expressed confusion over the fact that loading the template doesn't show any default mappings. This docs change clarifies that in order to see all mappings in the template, you must pass `include_type_name`.	2020-04-07 15:18:13 -07:00
James Rodewig	9569a8eb13	[DOCS] Add example to "avoid scripts" advice (#54719 ) Adds a detailed example to the "Avoid scripts" section of the "Tune for search speed" docs. The detail outlines how a script used to transform indexed data can be moved to ingest. The update also removes an outdated reference to supported script languages.	2020-04-07 15:25:10 -04:00
Jason Tedor	d1d478debf	Update docs to reflect node.processors (#54855 ) We namespaced the previous setting "processors" into "node.processors". This commit updates some of the documentation to reflect this.	2020-04-07 13:06:14 -04:00
Lisa Cawley	a7599031ae	[DOCS] Adds tranform node to list of default types (#54850 )	2020-04-07 08:49:05 -07:00
Ignacio Vera	076c199484	Add new point field. (#53804 ) (#54879 ) This commit adds a new point field that is able to index arbitrary pair of values (x/y) in the cartesian space. It only supports filtering using shape queries at the moment.	2020-04-07 15:28:50 +02:00
Tanguy Leroux	4d36917e52	Merge feature/searchable-snapshots branch into 7.x (#54803 ) (#54825 ) This is a backport of #54803 for 7.x. This pull request cherry picks the squashed commit from #54803 with the additional commits: 6f50c92 which adjusts master code to 7.x a114549 to mute a failing ILM test (#54818) 48cbca1 and 50186b2 that cleans up and fixes the previous test aae12bb that adds a missing feature flag (#54861) 6f330e3 that adds missing serialization bits (#54864) bf72c02 that adjust the version in YAML tests a51955f that adds some plumbing for the transport client used in integration tests Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Yannick Welsch <yannick@welsch.lu> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> Co-authored-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-07 13:28:53 +02:00
Ioannis Kakavas	3560c0cbf2	Remove `_xpack` from license API example (#54698 ) (#54763 ) Resolves #54662	2020-04-07 09:51:37 +03:00
Lisa Cawley	b3d5300968	[DOCS] Collapses sections in put snapshot lifecycle policy API (#54834 ) (#54840 )	2020-04-06 13:46:56 -07:00
James Rodewig	e9c3bfc8e5	[DOCS] Collapse nested objects in node stats API response (#54755 ) Replaces dot notation with collapsed nested object formatting per the [Elastic API reference template][0]. [0]:https://github.com/elastic/docs/blob/master/shared/api-ref-ex.asciidoc	2020-04-06 15:19:54 -04:00
James Rodewig	548ad03941	[DOCS] Collapse nested objects in cluster stats API response (#54739 ) Replaces dot notation with collapsed nested object formatting per the [Elastic API reference template][0]. [0]:https://github.com/elastic/docs/blob/master/shared/api-ref-ex.asciidoc	2020-04-06 13:11:46 -04:00
Igor Motov	2794572a35	[7.x] Add Student's t-test aggregation support (#54469 ) (#54737 ) Adds t_test metric aggregation that can perform paired and unpaired two-sample t-tests. In this PR support for filters in unpaired is still missing. It will be added in a follow-up PR. Relates to #53692	2020-04-06 11:36:47 -04:00
Nhat Nguyen	2fdbed7797	Broadcast cancellation to only nodes have outstanding child tasks (#54312 ) Today when canceling a task we broadcast ban/unban requests to all nodes in the cluster. This strategy does not scale well for hierarchical cancellation. With this change, we will track outstanding child requests and broadcast the cancellation to only nodes that have outstanding child tasks. This change also prevents a parent task from sending child requests once it got canceled. Relates #50990 Supersedes #51157 Co-authored-by: Igor Motov <igor@motovs.org> Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-04-06 11:11:29 -04:00
István Zoltán Szabó	7dc1ba4273	[DOCS] Updates transform prerequisites (#54804 )	2020-04-06 17:07:59 +02:00
Christoph Büscher	def519ea70	[Docs] Correct date rounding example for `range` query (#51524 ) Looking into #50237 I realized that two of the examples given in the documentation around date math rounding for range queries on date fields using `gt` and `lt` is slightly off by a nanosecond. This PR changes this to the bounds that are currently parsed using these parameters.	2020-04-06 17:05:45 +02:00
István Zoltán Szabó	4cba1e6368	[DOCS] Changes kibana_user to kibana_admin in DFA API prerequisites. (#54806 )	2020-04-06 15:46:18 +02:00
Jason Tedor	184c038f59	Add get autoscaling policy API (#54762 ) This commit adds the get autoscaling policy API.	2020-04-04 18:04:25 -04:00
James Baiera	705e46d5c1	[DOCS] Remove unused cat tasks request parameters (#54539 ) (#54741 )	2020-04-03 15:33:28 -04:00
Lisa Cawley	de91d2aeea	[DOCS] Collapse nested objects in CCR APIs (#54697 )	2020-04-03 12:04:33 -07:00
Bogdan Pintea	7cef89e084	ODBC: Document the new VarcharLimit and EarlyExecution params (#54632 ) * Document VarcharLimit and EarlyExecution params Add the documentation for the newly added VarcharLimit and EarlyExecution DSN attributes. * Remove obsolete VersionChecking param This param had been removed already along the #53082 work. * Update docs/reference/sql/endpoints/odbc/configuration.asciidoc fix typo Co-Authored-By: Stuart Cam <stuart@codebrain.co.uk> * Update docs/reference/sql/endpoints/odbc/configuration.asciidoc fix typo Co-Authored-By: Stuart Cam <stuart@codebrain.co.uk> (cherry picked from commit f38761631a12b38f7f075635f7ac61dc96656cd7)	2020-04-03 14:46:39 +02:00
markharwood	2da2305587	Backport of lowercase normalizer PR #53882 A pre-configured normalizer for lower-casing. Closes #53872	2020-04-03 11:43:40 +01:00
István Zoltán Szabó	d025b90cd1	[DOCS] Makes PUT inference API docs collapsible (#54653 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-04-03 09:48:53 +02:00
Lisa Cawley	11afead21e	[DOCS] Adds collapsible sections to rollup APIs (#54690 )	2020-04-02 17:51:16 -07:00
Lisa Cawley	b138dc4565	[DOCS] Add processing details to get transforms stats API (#54368 ) (#54595 )	2020-04-02 16:12:57 -07:00
lcawl	d3fa0ec730	[DOCS] Fixes typo in node settings	2020-04-02 16:01:54 -07:00
Lisa Cawley	98965116fe	[DOCS] Clarify ML and transform settings on coordinating nodes (#54676 )	2020-04-02 15:38:15 -07:00
Julie Tibshirani	5fb7602227	Disallow changing 'enabled' on the root mapper. (#54681 ) In #33933 we disallowed changing the `enabled` parameter in object mappings. However, the fix didn't cover the root object mapper. This PR adjusts the change to also include the root mapper and clarifies the error message.	2020-04-02 15:28:48 -07:00
David Turner	4e083cd97d	indices.recovery.max_bytes_per_sec may be per-node (#54633 ) The `indices.recovery.max_bytes_per_sec` recovery bandwidth limit can differ between nodes if it is not set dynamically, but today this is not obvious. This commit adds a paragraph to its documentation clarifying how to set different bandwidth limits on each node. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-02 18:15:41 +01:00
Benjamin Trent	4a1610265f	[7.x] [ML] add new inference_config field to trained model config (#54421 ) (#54647 ) * [ML] add new inference_config field to trained model config (#54421) A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. The inference processor can still override whatever is set as the default in the trained model config. * fixing for backport	2020-04-02 12:25:10 -04:00
Benjamin Trent	65233383f6	[7.x] [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121 ) (#54645 ) * [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121) Secondary authorization headers are to be used to facilitate Kibana spaces support + ML jobs/datafeeds. Now on PUT/Update/Preview datafeed, and PUT data frame analytics the secondary authorization is preferred over the primary (if provided). closes https://github.com/elastic/elasticsearch/issues/53801 * fixing for backport	2020-04-02 11:20:25 -04:00
qiye	de8e0200fe	[DOCS] Correct `shape` field release in 7.5 release highlights (#54631 ) The `shape` field was added in 7.4, not 7.3. This corrects a small error in the 7.5 release highlights.	2020-04-02 09:19:40 -04:00
Benjamin Trent	eb31be0e71	[7.x] [ML] add num_matches and preferred_to_categories to category defintion objects (#54214 ) (#54639 ) * [ML] add num_matches and preferred_to_categories to category defintion objects (#54214) This adds two new fields to category definitions. - `num_matches` indicating how many documents have been seen by this category - `preferred_to_categories` indicating which other categories this particular category supersedes when messages are categorized. These fields are only guaranteed to be up to date after a `_flush` or `_close` native change: https://github.com/elastic/ml-cpp/pull/1062 * adjusting for backport	2020-04-02 09:09:19 -04:00
Jason Tedor	54ecb009bb	Add delete autoscaling policy API (#54601 ) This commit adds an API for deleting autoscaling policies.	2020-04-02 09:05:12 -04:00
Aleh Zasypkin	161eac1942	[7.x] Switch to the most recent Kibana configuration format and SAML/OIDC endpoints. (#54624 )	2020-04-02 11:59:11 +02:00
István Zoltán Szabó	3a8e880fe6	[DOCS] Adds snippet comparing two indices to the painless examples (#54563 )	2020-04-02 08:46:25 +02:00
Russ Cam	110d9a7845	Correct description for mget API request URI index (#52305 ) This commit corrects the description for the request URI index for the Multi Get (mget) API. The index can only be a single index name (multiple or wildcard expressions not supported), and acts as the index to use when "ids" are specified, or a document in the "docs" array does not specify an index. (cherry picked from commit aa4926ed7f91dfbf7973a01b1e4682e91dda11a9)	2020-04-02 09:08:34 +10:00
lcawl	949636944c	[DOCS] Fixes shared attribute for feature importance	2020-04-01 14:52:08 -07:00
Jason Tedor	ccecb78c98	Rename the policy in put autoscaling policy docs The put autoscaling policy docs use a "hot" policy as an example. Instead, this commit changes the name of this policy to "my_autoscaling_policy".	2020-04-01 16:32:03 -04:00
Jason Tedor	bd6b383926	Include autoscaling APIs in API docs (#54603 ) This commit adds a top-level link to the autoscaling API reference page to the API docs. Additionally, we add a conditional guard on the API pages to only include them in development builds of the docs.	2020-04-01 15:44:13 -04:00
lcawl	2cd35bf696	[DOCS] Adds release highlights placeholder	2020-04-01 09:22:20 -07:00
Lisa Cawley	f5ccf939d9	[DOCS] Clarifies API key breaking change (#54522 )	2020-04-01 08:58:15 -07:00
James Rodewig	21abc311fd	[DOCS] Reformat `keyword_repeat` token filter (#54428 )	2020-04-01 11:56:05 -04:00
James Rodewig	4982b720ef	[DOCS] EQL: Document `length` function (#54225 )	2020-04-01 11:35:36 -04:00
James Rodewig	b43eb5ac32	[DOCS] EQL: Document `endsWith` function (#54521 )	2020-04-01 10:43:37 -04:00
Dan Hermann	11bfbd8bbf	Rename examples in ILM guide to avoid association with data streams (#54579 )	2020-04-01 09:05:20 -05:00
István Zoltán Szabó	27f88fcdac	[DOCS] Updates estimate model memory docs (#54574 )	2020-04-01 15:55:25 +02:00
James Rodewig	95622d8782	[DOCS] EQL: Document `startsWith` function (#54518 ) (#54578 )	2020-04-01 09:30:27 -04:00
James Rodewig	92d570d6f3	[DOCS] EQL: Add search/index speed tip for functions (#54346 ) (#54575 ) EQL functions are an easy way for users to transform indexed data at search time. However, using multiple functions can make queries difficult to write and slows search speeds. Users can circumvent this by indexing fields containing the transformed data, but that usually slows index speeds. This adds a related tip and example covering these tradeoffs.	2020-04-01 08:39:04 -04:00
bellengao	e9c201b446	[DOCS] Correct field name in date_nanos doc (#54556 )	2020-04-01 14:22:32 +02:00
Jason Tedor	f670ae0bc8	Introduce autoscaling policies (#54473 ) This commit is the first in a series of commits that introduces autoscaling policies, and APIs for working with them. For now, we introduce the basic infrastructure, and a single API for putting an autoscaling policy. We will follow in rapid succession with APIs for getting, and deleting autoscaling policies.	2020-04-01 08:12:26 -04:00
István Zoltán Szabó	325b8ec0ce	[DOCS] Adds data_counts object to the GET DFA stats API (#54498 )	2020-04-01 10:07:28 +02:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
James Rodewig	114894dd76	[DOCS] Add redirect for changed anchor ID (#54533 ) The anchor ID for the snapshot repository plugins section in the docs was recently changes from `_repository_plugins` to `snapshots-repository-plugins`. This adds a corresponding redirect so no links are broken.	2020-03-31 16:42:16 -04:00
James Rodewig	7401191019	[DOCS] Include 7.7.0 release notes (#54529 ) Includes the 7.7.0 release notes so they render in the HTML docs. Also removes a few legacy `coming[7.6.0]` tags.	2020-03-31 16:23:49 -04:00
Lisa Cawley	922ec8e961	[DOCS] Collapses nested objects in data frame analytics APIs (#54472 ) (#54526 )	2020-03-31 12:51:04 -07:00
Glen Smith	0d4a001ef2	[DOCS] Clarify cluster health status during rolling upgrade (#40757 ) Remove mention of the `yellow` and `red` starting health status from the rolling upgrade docs. Instead, we should emphasize that users wait for the node to recover with a health status of `green` rather than the starting status. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-03-31 14:13:05 -04:00
Dimitris Athanasiou	0b25e3b66c	[7.x][ML] Fix DF analytics explain API request in docs (#54510 ) (#54514 ) The explain API expects a data frame analytics config as its request. Backport of #54410	2020-03-31 18:56:52 +03:00
István Zoltán Szabó	a5497cd9e0	[DOCS] Adds HTTP response count example to Painless examples (#54412 )	2020-03-31 15:12:58 +02:00
István Zoltán Szabó	eeb23e9e73	[DOCS] Adds description of analysis_stats object and its properties to GET DFA stats API docs (#53881 ) Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com> Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-03-31 13:30:06 +02:00
James Rodewig	ce7539ce6c	[DOCS] Fix broken deprecated macros (#54466 ) Updates formatting for `deprecated` macros in the translog and synced flush docs. Previously, these macros were rendering literally.	2020-03-30 17:16:27 -04:00
Lisa Cawley	0fa1060ca4	[DOCS] Collapses content in machine learning APIs (#54234 ) (#54453 )	2020-03-30 11:06:33 -07:00
James Rodewig	ed1edb4964	[DOCS] Add missing word to keyword marker token filter docs	2020-03-30 10:52:14 -04:00
James Rodewig	21f362a2a8	[DOCS] Add a lowercase email example to keyword tokenizer docs (#53257 )	2020-03-30 09:06:04 -04:00
Benjamin Trent	374e76d7cd	[Transform] fixing naming in HLRC and _cat to match API content (#54300 ) (#54408 ) Fixing the naming of the HLRC values to match the ToXContent field names (i.e. the field names returned from an API call). Also fixes the names in the _cat API as well. closes #53946	2020-03-30 08:57:02 -04:00
István Zoltán Szabó	00eaa0ebe5	[DOCS] Changes scripted metric to filter aggs in transforms example (#54167 )	2020-03-30 09:51:07 +02:00
Jason Tedor	f0033783db	Deprecate node local storage setting (#54374 ) This setting is not documented and has dubious value since it means there can be nodes in the cluster (non-data and non-master nodes) that do not have persistent node IDs. This does not have any use cases so this commit removes the setting.	2020-03-28 14:36:41 -04:00
Gil Raphaelli	2984a54b7f	[DOCS] Fix typos in top metrics agg docs (#54299 )	2020-03-27 10:49:21 -04:00
AndyHunt66	2dd8946539	[DOCS] Remove redundant sentence in ingest processor docs (#54329 )	2020-03-27 08:25:09 -04:00
Christoph Büscher	f7ea794312	[Test] Don't expect specific scores in docs tests (#54297 ) The failing suggester documentation test was expecting specific scores in the test response, which is fragile implementation details that e.g. can change with different lucene versions and generally shouldn't be done in documentation test. Instead we usually replace the float values in the output response by the ones in the actual response. Closes #54257	2020-03-27 10:27:47 +01:00
Lisa Cawley	27cd5b343c	[DOCS] Augments cat transforms API (#53776 ) (#54232 )	2020-03-26 07:56:46 -07:00
Jason Tedor	d8f745736b	Clarify the remove keystore command can handle many (#54244 ) The remove keystore command can handle multiple settings. In a few places, we were not consistent about mentioning this. This commit addreses this, in the CLI help, and the docs.	2020-03-26 08:49:43 -04:00
Luca Cavanna	ff269160af	Async search: rename REST parameters (#54198 ) This commit renames wait_for_completion to wait_for_completion_timeout in submit async search and get async search. Also it renames clean_on_completion to keep_on_completion and turns around its behaviour. Closes #54069	2020-03-26 09:40:50 +01:00
István Zoltán Szabó	487b273286	[DOCS] Adds feature importance mapping subsection to inference processor docs (#54190 )	2020-03-26 09:26:50 +01:00
Jason Tedor	6af89e62d1	Allow keystore add-file to handle multiple settings (#54240 ) Today the keystore add-file command can only handle adding a single setting/file pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add-file keystore command to accept adding multiple settings in a single invocation.	2020-03-26 00:07:05 -04:00
Jason Tedor	4fbc0e9ab8	Complete keystore CLI options documentation (#54242 ) The documentation was missing the long option for the force option, and the short option for the stdin option. This commit addresses this by adding these to the documentation.	2020-03-25 23:53:34 -04:00
Jason Tedor	fe8257d981	Allow keystore add to handle multiple settings (#54229 ) Today the keystore add command can only handle adding a single setting/value pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add keystore command to accept adding multiple settings in a single invocation.	2020-03-25 22:58:20 -04:00
James Rodewig	30a32040d3	[DOCS] EQL: Document `substring` function (#53867 ) Adds documentation for the EQL `substring` function. Supporting changes: * Creates a new "EQL function reference" page * Updates the title of the "EQL syntax reference" page for consistency * Adds a brief "Functions" section to the EQL syntax docs * Updates EQL limitations docs to state that only array functions are unsupported	2020-03-25 12:23:59 -04:00
Nik Everett	16e4bd50e2	Add breaking change note for #53669	2020-03-25 09:31:14 -04:00
James Rodewig	74051d68a8	[DOCS] Reformat `keyword_marker` token filter (#54076 ) Makes the following changes to the `keyword_marker` token filter docs: * Rewrites description and adds Lucene link * Adds detailed analyze example * Rewrites parameter definitions * Adds custom analyzer and filter example	2020-03-25 09:26:06 -04:00
James Rodewig	2fdf6b2f96	[DOCS] Document missing data types for node stats API's response parameters (#53475 ) Documents missing data types for several response parameters returned by the node stats API. Also adds several missing human-readable parameters returned by the API.	2020-03-25 08:42:58 -04:00
Jason Tedor	381d7586e4	Introduce formal role for remote cluster client (#54138 ) This commit introduce a formal role for identifying nodes that are capable of making connections to remote clusters. Relates #53924	2020-03-24 21:59:43 -04:00
David Roberts	7667004b20	[ML] Add a model memory estimation endpoint for anomaly detection (#54129 ) A new endpoint for estimating anomaly detection job model memory requirements: POST _ml/anomaly_detectors/estimate_model_memory Backport of #53507	2020-03-24 22:55:11 +00:00
Jim Ferenczi	55f2e8bff0	[DOCS] Add 7.6.2 release notes (#53720 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: lcawl <lcawley@elastic.co>	2020-03-24 22:42:25 +01:00
markharwood	6a60f85bba	Wildcard field - add normalizer support (#53851 ) (#54109 ) Backport support for normalisation to wildcard field Closes #53603	2020-03-24 17:37:47 +00:00
Lisa Cawley	88b1b2f36f	[DOCS] Adds transform security privileges (#53908 )	2020-03-24 09:35:45 -07:00
Tim Brooks	caefa78513	Align remote info api with new settings (#54102 ) Currently the remote info api has added a number of possible fields (proxy, num_socket_connections, etc) that are available in proxy mode. These fields are not aligned with what the settings are named. This commit modifies this API to align with the settings.	2020-03-24 10:27:24 -06:00
Luca Cavanna	6b457abbd3	Async search: prevent users from overriding pre_filter_shard_size (#54088 ) Submit async search forces pre_filter_shard_size for the underlying search that it creates. With this commit we also prevent users from overriding such default as part of request validation.	2020-03-24 17:06:04 +01:00
Ahmet Arslan	52062565a9	[DOCS] Correct DFI docs regarding stop word removal (#53836 ) The documentation of DFI should recommend not to [remove stop words][1], since DFI is good at scoring queries that contain common terms: `the wall`, `the sun`, `the who`, etc. [1]:https://lucene.apache.org/core/8_1_1/core/org/apache/lucene/search/similarities/DFISimilarity.html	2020-03-24 10:48:42 -04:00
Karen Metts	9da589c5fd	[DOCS] Replace outdated Logstash monitoring link (#54032 ) Replaces a link to Logstash OSS-only content with a link to the general Logstash monitoring topic.	2020-03-24 10:03:31 -04:00
David Roberts	1421471556	[ML] Introduce a "starting" datafeed state for lazy jobs (#54065 ) It is possible for ML jobs to open lazily if the "allow_lazy_open" option in the job config is set to true. Such jobs wait in the "opening" state until a node has sufficient capacity to run them. This commit fixes the bug that prevented datafeeds for jobs lazily waiting assignment from being started. The state of such datafeeds is "starting", and they can be stopped by the stop datafeed API while in this state with or without force. Backport of #53918	2020-03-24 13:00:04 +00:00
Hendrik Muhs	7dcacf531f	[7.x][Transform][Rollup] add processing stats to record the ti… (#54027 ) add 2 additional stats: processing time and processing total which capture the time spent for processing results and how often it ran. The 2 new stats correspond to the existing indexing and search stats. Together with indexing and search this now allows the user to see the full picture, all 3 stages.	2020-03-24 09:22:02 +01:00
Jason Tedor	e3ca124537	Introduce autoscaling decisions (#53934 ) This is the first in a series of commits that will introduce the autoscaling deciders framework. This commit introduces the basic framework for representing autoscaling decisions.	2020-03-23 23:08:06 -04:00
Jim Ferenczi	9e3f7f4575	Add heuristics to compute pre_filter_shard_size when unspecified (#53873 ) (#54007 ) This commit changes the pre_filter_shard_size default from 128 to unspecified. This allows to apply heuristics based on the request and the target indices when deciding whether the can match phase should run or not. When unspecified, this pr runs the can match phase automatically if one of these conditions is met: * The request targets more than 128 shards. * The request contains read-only indices. * The primary sort of the query targets an indexed field. Users can opt-out from this behavior by setting the `pre_filter_shard_size` to a static value. Closes #39835	2020-03-24 02:05:15 +01:00
Julie Tibshirani	df7cfb3a5b	Remove the top-level 'mapping type' section. (#54035 ) It seemed confusing for users that our top-level mapping page still had a prominent section named 'Mapping Type'. This PR reworks the docs to remove this reference and adds a note about types removal (similar to the note we added to other APIs like put mapping).	2020-03-23 15:34:23 -07:00
James Rodewig	43199a8c82	[DOCS] Remove double space in WDG docs	2020-03-23 17:18:04 -04:00
James Rodewig	553d8a9ca9	[DOCS] Fix "letter case" typo Changes "lettercase" to "letter case" in the `uppercase` token filter docs.	2020-03-23 17:11:59 -04:00
Paweł Krześniak	c0534f4157	[DOCS] link fix (#53973 ) Fix bad link in top_metrics.	2020-03-23 14:20:54 -04:00
Luca Cavanna	932a7e3112	Backport of async search changes (#53976 ) * Get Async Search: omit _clusters section when empty (#53907) The _clusters section is omitted by the search API whenever no remote clusters are searched. Async search should do the same, but Get Async Search returns a deserialized response, hence a weird `_clusters` section with all values set to `0` gets returned instead. In fact the recreated Clusters object is not the same object as the EMPTY constant, yet it has the same content. This commit addresses this by changing the comparison in the `toXContent` method to not print out the section if the number of total clusters is `0`. * Async search: remove version from response (#53960) The goal of the version field was to quickly show when you can expect to find something new in the search response, compared to when nothing has changed. This can also be done by looking at the `_shards` section and `num_reduce_phases` returned with the search response. In fact when there has been one or more additional reduction of the results, you can expect new results in the search response. Otherwise, the `_shards` section could notify of additional failures of shards that have completed the query, but that is not a guarantee that their results will be exposed (only when the following partial reduction is performed their results will be available). That said this commit clarifies this in the docs and removes the version field from the async search response * Async Search: replicas to auto expand from 0 to 1 (#53964) This way single node clusters that are green don't go yellow once async search is used, while all the others still have one replica. * [DOCS] address timing issue in async search docs tests (#53910) The docs snippets for submit async search have proven difficult to test as it is not possible to guarantee that you get a response that is not final, even when providing `wait_for_completion=0`. In the docs we want to show though a proper long-running query, and its first response should be partial rather than final. With this commit we adapt the docs snippets to show a partial response, and replace under the hood all that's needed to make the snippets tests succeed when we get a final response. Also, increased the timeout so we always get a final response. Closes #53887 Closes #53891	2020-03-23 19:13:31 +01:00
Lisa Cawley	c4260ba3c7	[DOCS] Fixes formatting in transform overview (#53900 )	2020-03-23 10:21:54 -07:00
C.J. Jameson	5ac8b795ba	[DOCS] Clarify routing enforcement in docs (#53945 ) Removes a mention of the `_doc` mapping type that's no longer applicable now that mapping types are removed/deprecated.	2020-03-23 12:57:07 -04:00
Lisa Cawley	5f45780577	[DOCS] Adds data nanos transform limitation (#53826 )	2020-03-23 09:49:21 -07:00
Lisa Cawley	8b3ef4ac21	[DOCS] Add generated_dest_index to preview transform API (#53905 )	2020-03-23 09:43:44 -07:00
Marios Trivyzas	af03200ad6	SQL: Extend DATE_TRUNC to also operate on intervals(elastic - #46632 ) (#47720 ) (#53972 ) The function is extended to operate on intervals according to the PostgreSQL: https://www.postgresql.org/docs/9.1/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes : #46632 (cherry picked from commit 2dc79505825fa75e0711dcfa8e9c69e8028fc979) Co-authored-by: musteaf <gs_mustea@hotmail.com>	2020-03-23 15:05:16 +01:00
Dan Hermann	0528e2b06b	Document the deprecation of the 'auth.password' setting (#53822 )	2020-03-23 08:58:25 -05:00
István Zoltán Szabó	534e734ad1	[DOCS] Adds painless transform examples (#53274 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-03-23 10:39:47 +01:00
Przemyslaw Gomulka	015ad019d5	[docs] Known issue about joda patterns on 7.6 (#53957 )	2020-03-23 10:28:55 +01:00
Przemyslaw Gomulka	412e163cf6	[Doc] migration guide joda (#51986 ) The joda to java.time migration requires users to upgrade their mappings. We allow them to still use 6.x created indices with joda patterns in 7 but ask them to upgrade their patterns in 7.x. This migration guide is to help them understand how they could be affected and what needs to be changed in their mappings. closes #51614 closes #51236	2020-03-23 08:29:01 +01:00
Mark Vieira	0cfe6d90cc	Mute async-search test	2020-03-20 11:35:24 -07:00
Luca Belluccini	3937854444	[DOCS] Clarify upgrade paths (#53417 ) Unsupported upgrade paths: - `6.8 to 7.0`. Supported, but requires a full cluster restart: - `6.0–6.7 directly to 7.x`	2020-03-20 18:05:07 +00:00
Lisa Cawley	c2af015ee7	[DOCS] Updates list of transform aggs (#53820 )	2020-03-20 10:28:23 -07:00
Alan Woodward	a70016a6d6	TermsLookup fields should be marked as Required in docs (#53784 ) The terms-lookup section of our terms query docs currently state that the index, id and path fields are optional. They should be marked instead as required.	2020-03-20 15:39:41 +00:00
Luca Cavanna	d486bdefdd	[DOCS] correct async search note The sort optimization kicks in whenever results are sorted by field.	2020-03-20 15:58:19 +01:00
Luca Cavanna	03fca61fcb	[DOCS] add docs for async search (#53675 ) Relates to #49091 Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-03-20 14:46:38 +01:00
James Rodewig	6c57681446	[DOCS] Add redirects for missing data stream API docs (#53866 ) The following API spec files contain a link to a not-yet-created docs page for the data stream APIs: * [indices.create_data_stream.json][0] * [indices.delete_data_stream.jsonn][1] * [indices.get_data_streams.json][2] The Elaticsearch-JS client uses these spec files to create their docs. This created a broken link in the Elaticsearch-JS, which has broken the docs build. This PR adds a temporary redirect for the docs page. This redirect should be removed when the actual API docs are added. [0]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/indices.create_data_stream.json [1]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/indices.delete_data_stream.json [2]: https://github.com/elastic/elasticsearch/blob/master/rest-api-spec/src/main/resources/rest-api-spec/api/indices.get_data_streams.json Relates to #53558. CC @martijnvg	2020-03-20 09:41:47 -04:00
lgypro	be3090138e	[Docs] Fix typo in _analyze api docs (#53837 )	2020-03-20 12:02:40 +01:00
István Zoltán Szabó	53f7e31462	[DOCS] Fixes typo in start datafeed API docs. (#53811 )	2020-03-19 17:56:40 +01:00
István Zoltán Szabó	29583288b3	[DOCS] Adds performance considerations section to transforms overview (#53791 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-03-19 17:51:00 +01:00
Lisa Cawley	68f7036a4c	[DOCS] Adds example links to transform tutorial (#53640 )	2020-03-19 09:41:57 -07:00
István Zoltán Szabó	00203c35fe	[DOCS] Changes seconds to milliseconds since the Epoch in AD docs. (#53797 )	2020-03-19 15:42:58 +01:00
Dominic Page	b0884baf46	Geo shape query vs geo point backport (#53774 ) Backport to 7x Enable geo_shape query to work on geo_point fields for shapes: circle, polygon, multipolygon, rectangle see: #48928 Co-Authored-By: @iverase	2020-03-19 13:00:36 +01:00
James Rodewig	8f4a3eb07f	[DOCS] Add token graph concept docs (#53339 ) Adds conceptual docs for token graphs. These docs cover: * How a token graph is constructed from a token stream * How synonyms and multi-position tokens impact token graphs * How token graphs are used during search * Why some token filters produce invalid token graphs Also makes the following supporting changes: * Adds anchors to the 'Anatomy of an Analyzer' docs for cross-linking * Adds several SVGs for token graph diagrams	2020-03-19 07:43:18 -04:00
Leaf-Lin	b649b0d273	Update jvm options doc There's a typo with extra `	2020-03-19 12:00:01 +11:00
Benjamin Trent	415d73c27d	[Transform] renamed _cat/transform to _cat/transforms (#53743 ) (#53771 ) renaming _cat/transform to _cat/transforms for uniformity with the other _cat apis.	2020-03-18 19:54:03 -04:00
Lisa Cawley	268e512f0b	[DOCS] Add transform nodes (#53698 )	2020-03-18 15:26:37 -07:00
James Rodewig	0e2e06bd7e	[DOCS] Remove incorrect parms from put index template API docs (#53750 ) Removes the `flat_settings` and `timeout` query parameters from the JSON spec and asciidoc docs for the put index template API. These parameters are not supported by the API.	2020-03-18 14:36:27 -04:00
markharwood	598d4c1bf9	Formatting fix Bullet points were not rendered correctly	2020-03-18 15:36:52 +00:00
Lisa Cawley	c6e37c7662	[DOCS] Adds stub for cat transform API (#53737 )	2020-03-18 08:31:10 -07:00
James Rodewig	0732475cb9	[DOCS] Streamline `analyzer` mapping parm def (#51874 ) Simplifies the `analyzer` mapping parameter definition to remove duplicated analysis content and examples.	2020-03-18 09:43:59 -04:00
James Rodewig	e4b7af11ab	[DOCS] Remove `light_bengali` stemmer (#53697 ) Only the `bengali` stemmer is available in Lucene and surfaced through Elasticsearch. This removes the incorrect `light_bengali` link in our docs.	2020-03-18 08:34:27 -04:00
Dan Hermann	94ac979c66	Support array for all string ingest processors (#53694 )	2020-03-18 07:07:49 -05:00
Ralph Ursprung	474dfbf9c7	[Docs] Fix highlighting in match-query example (#52426 )	2020-03-18 11:53:14 +01:00
Tianlun Li	e7ae9ae596	Deprecate delaying state recovery for master nodes (#53646 ) It is useful to be able to delay state recovery until enough data nodes have joined the cluster, since this gives the shard allocator a decent opportunity to re-use as much existing data as possible. However we also have the option to delay state recovery until a certain number of master-eligible nodes have joined, and this is unnecessary: we require a majority of master-eligible nodes for state recovery, and there is no advantage in waiting for more. This commit deprecates the unnecessary settings in preparation for their removal. Relates #51806	2020-03-18 10:04:22 +00:00
Yash Joshi	85cc7a06c4	[Docs] Fix typo in range query (#53656 )	2020-03-18 10:18:08 +01:00
Karen Metts	3c5437894e	Remove link to old settings 7.x (#53639 )	2020-03-17 14:38:50 -04:00
David Turner	347b7594b7	Fix deprecation in history retention docs (#53655 ) This commit adjusts a `deprecation[...]` message in the docs since such messages must be on a single line. It also moves this message to the start of the description of the deprecated setting as is the case with other such messages.	2020-03-17 14:06:37 +00:00
Luca Cavanna	c3d2417448	Cumulative backport of async search changes (#53635 ) * Submit async search to work only with POST (#53368) Currently the submit async search API can be called using both GET and POST at REST, but given that it submits a call and creates internal state, POST should be the only allowed method. * Refine SearchProgressListener internal API (#53373) The following cumulative improvements have been made: - rename `onReduce` and `notifyReduce` to `onFinalReduce` and `notifyFinalReduce` - add unit test for `SearchShard` - on* methods in `SearchProgressListener` shouldn't need to be public as they should never be called directly, they only need to be overridden hence they can be made protected. They are actually called directly from a test which required some adapting, like making `AsyncSearchTask.Listener` class package private instead of private - Instead of overriding `getProgressListener` in `AsyncSearchTask`, as it feels weird to override a getter method, added a specific method that allows to retrieve the Listener directly without needing to cast it. Made the getter and setter for the listener final in the base class. - rename `SearchProgressListener#searchShards` methods to `buildSearchShards` and make it static given that it accesses no instance members - make `SearchShard` and `SearchShardTask` classes final * Move async search yaml tests to x-pack yaml test folder (#53537) The yaml tests for async search currently sit in its qa folder. There is no reason though for them to live in a separate folder as they don't require particular setup. This commit moves them to the main folder together with the other x-pack yaml tests so that they will be run by the client test runners too. * [DOCS] Add temporary redirect for async-search (#53454) The following API spec files contain a link to a not-yet-created async search docs page: * [async_search.delete.json][0] * [async_search.get.json][1] * [async_search.submit.json][2] The Elaticsearch-js client uses these spec files to create their docs. This created a broken link in the Elaticsearch-js docs, which has broken the docs build. This PR adds a temporary redirect for the docs page. This redirect should be removed when the actual API docs are added. [0]: https://github.com/elastic/elasticsearch/blob/master/x-pack/plugin/src/test/resources/rest-api-spec/api/async_search.delete.json [1]: https://github.com/elastic/elasticsearch/blob/master/x-pack/plugin/src/test/resources/rest-api-spec/api/async_search.get.json [2]: https://github.com/elastic/elasticsearch/blob/master/x-pack/plugin/src/test/resources/rest-api-spec/api/async_search.submit.json Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-03-17 00:08:17 +01:00
Ryan Earle	b4e203a735	[DOCS] Remove `force` as valid value for `version_type` (#53428 ) The `force` value for the `version_type` parameter was deprecated in 6.8. This removes the value from the parameter definition.	2020-03-16 16:50:58 -04:00
Tim Brooks	531cd5aaa4	Add documentation for remote cluster proxy mode (#52779 ) This is related to #49067.	2020-03-16 14:18:29 -06:00
Lisa Cawley	278e3fce50	[DOCS] Add anchors for scripted metric aggregations (#53618 )	2020-03-16 12:20:41 -07:00
Abhilash Bolla	508e5239b4	[DOCS] Remove unneeded anchor from ILM snippet (#53432 )	2020-03-16 15:19:50 -04:00
Nik Everett	f7482f794a	Improve top_metrics docs (#53521 ) (#53619 ) * Removes experimental. * Replaces `"v"` (for value) with `"m"` (for metric). * Move the note about tiebreaking into the list of limitations of the sort. * Explain how you ask for `metrics`. * Clean up some wording. * Link to the docs from `top_metrics`. Closes #51813	2020-03-16 13:47:43 -04:00
Andrei Stefan	91ca9c5c33	QL: constant_keyword support (#53241 ) (#53602 ) (cherry picked from commit d6cd4ce7849ba215407c8c5fa815c9b373fb8480)	2020-03-16 18:06:31 +02:00
James Rodewig	e1eebea846	[DOCS] Reformat `remove_duplicates` token filter (#53608 ) Makes the following changes to the `remove_duplicates` token filter docs: * Rewrites description and adds Lucene link * Adds detailed analyze example * Adds custom analyzer example	2020-03-16 11:37:06 -04:00
markharwood	2c74f3e22c	Backport of new wildcard field type (#53590 ) * New wildcard field optimised for wildcard queries (#49993) Indexes values using size 3 ngrams and also stores the full original as a binary doc value. Wildcard queries operate by using a cheap approximation query on the ngram field followed up by a more expensive verification query using an automaton on the binary doc values. Also supports aggregations and sorting.	2020-03-16 15:07:13 +00:00
Mayya Sharipova	a906f8a0e4	Highlighters skip ignored keyword values (#53408 ) (#53604 ) Keyword field values with length more than ignore_above are not indexed. But highlighters still were retrieving these values from _source and were trying to highlight them. This sometimes lead to errors if a field length exceeded max_analyzed_offset. But also this is an overall wrong behaviour to attempt to highlight something that was ignored during indexing. This PR checks if a keyword value was ignored because of its length, and if yes, skips highlighting it. Backport: #53408 Closes #43800	2020-03-16 11:06:25 -04:00
Benjamin Trent	4e43ede735	[ML] renaming inference processor field field_mappings to new name field_map (#53433 ) (#53502 ) This renames the `inference` processor configuration field `field_mappings` to `field_map`. `field_mappings` is now deprecated.	2020-03-13 15:40:57 -04:00
Tom Veasey	690099553c	[7.x][ML] Adds the class_assignment_objective parameter to classification (#53552 ) Adds a new parameter for classification that enables choosing whether to assign labels to maximise accuracy or to maximise the minimum class recall. Fixes #52427.	2020-03-13 17:35:51 +00:00
Lisa Cawley	b1d589f276	[DOCS] Adds operations_behind to transform stats (#53518 )	2020-03-13 09:34:49 -07:00
James Rodewig	1e31031b73	[DOCS] Clarify `max_shingle_size` parm def (#53480 ) Rewrites the `search_as_you_type` field datatype's `max_shingle_size` mapping parameter to improve clarity and better communicate trade-offs regarding index size. Relates to [elastic/kibana#55161][0]. Closes #51774. [0]: https://github.com/elastic/kibana/pull/55161#discussion_r368107177	2020-03-13 04:08:44 -04:00
Nik Everett	9dcd64c110	Preserve metric types in top_metrics (backport of #53288 ) (#53440 ) This changes the `top_metrics` aggregation to return metrics in their original type. Since it only supports numerics, that means that dates, longs, and doubles will come back as stored, with their appropriate formatter applied.	2020-03-12 17:17:09 -04:00
Jim Ferenczi	97621e7f65	Removes old Lucene's experimental flag from analyzer documentations (#53217 ) This change removes the Lucene's experimental flag from the documentations of the following tokenizer/filters: * Simple Pattern Split Tokenizer * Simple Pattern tokenizer * Flatten Graph Token Filter * Word Delimiter Graph Token Filter The flag is still present in Lucene codebase but we're fully supporting these tokenizers/filters in ES for a long time now so the docs flag is misleading. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-03-12 21:18:19 +01:00
Jason Tedor	5b08ea84c9	Add deprecation check for listener thread pool (#53438 ) This commit adds a deprecation check for the listener thread pool settings as these will be removed in 8.0.0.	2020-03-12 14:32:41 -04:00
István Zoltán Szabó	969164cc47	[DOCS] Adds a warning about reindexing docs with the same ID to the PUT DFA docs. (#53490 )	2020-03-12 18:01:43 +01:00
James Rodewig	af987fb2d4	[DOCS] Reduce content reuse in enrich docs (#53460 ) Restructures the 'Update an enrich policy' section to: * Migrate the content to the section. It was previously stored in the Put Enrich Policy API docs. * Remove the warning tag admonition from the section content. * Replace a reused section earlier in the "Set up an enrich processor" page with a link. No substantive changes were made to the content.	2020-03-12 05:57:23 -04:00
Benjamin Trent	89668c5ea0	[ML][Inference] adds new default_field_map field to trained models (#53294 ) (#53419 ) Adds a new `default_field_map` field to trained model config objects. This allows the model creator to supply field map if it knows that there should be some map for inference to work directly against the training data. The use case internally is having analytics jobs supply a field mapping for multi-field fields. This allows us to use the model "out of the box" on data where we trained on `foo.keyword` but the `_source` only references `foo`.	2020-03-11 13:49:39 -04:00
Ignacio Vera	562a9eff33	remove sneaked placeholder from histogram docs (#53391 ) (#53405 )	2020-03-11 14:04:22 +01:00
James Rodewig	933a9c6fca	[DOCS] Reformat `word_delimiter` token filter (#53387 ) Makes the following changes to the `word_delimiter` token filter docs: * Adds a warning admonition recommending the `word_delimiter_graph` filter instead. This warning includes a link to the deprecated Lucene `WordDelimiterFilter`. * Updates the description * Adds detailed analyze snippet * Adds custom analyzer and custom filter snippets * Reorganizes and updates parameter documentation	2020-03-11 09:03:57 -04:00
Dimitris Athanasiou	0fd0516d0d	[7.x][ML] Rename data frame analytics maximum_number_trees to max_trees (#53300 ) (#53390 ) Deprecates `maximum_number_trees` parameter of classification and regression and replaces it with `max_trees`. Backport of #53300	2020-03-11 12:45:27 +02:00
James Rodewig	a9dd7773d2	[DOCS] Use keyword tokenizer in word delimiter graph examples (#53384 ) In a tip admonition, we recommend using the `keyword` tokenizer with the `word_delimiter_graph` token filter. However, we only use the `whitespace` tokenizer in the example snippets. This updates those snippets to use the `keyword` tokenizer instead. Also corrects several spacing issues for arrays in these docs.	2020-03-11 04:46:33 -04:00
James Rodewig	166b5a92f6	[DOCS] Correct anchor in word delimiter graph token filter docs	2020-03-10 10:32:42 -04:00
James Rodewig	d503bf9a45	[DOCS] Make token graph diagrams consistent (#53331 ) Updates the SVG for a token graph to make the layout consistent with other graphs. This means moving the text directly above the edge lines. Previously, the text was above the edge line.	2020-03-10 06:53:28 -04:00
James Rodewig	5e3df18d56	[DOCS] Adds Beats tip to EQL search docs (#53292 ) Adds a tip admonition to the basic example in the EQL search docs. This tip lets users know they can set up a Beat to automatically index data in ES, rather than manually indexing using the bulk or index APIs.	2020-03-10 05:16:18 -04:00
James Rodewig	641eb69520	[DOCS] Document `nodes` cluster stats (#52813 ) Documents the `nodes` response parameters returned by the `_cluster/stats` API. Also adds collapsible attributes for the `indices` and `nodes` sections.	2020-03-10 05:05:05 -04:00
Jason Tedor	1860c57147	Deprecate the listener thread pool (#53266 ) The listener thread pool is being removed from use in the server codebase. This commit deprecates configuring the listener thread pool.	2020-03-09 16:56:01 -04:00
Julie Tibshirani	c33afea9fb	Small corrections to stored_fields docs. (#53247 ) * Fix a reference to the 'field' option. * Remove claim about detecting script fields. * Specify that object fields will just be ignored.	2020-03-09 10:59:17 -07:00
Romain Gonord	00657901ec	[DOCS] Fix "Wait for Snapshot" link in ILM actions reference (#53269 ) Corrects an anchor for the `Wait for Snapshot` action, which previously linked to the `Delete` action.	2020-03-09 08:40:36 -04:00
Anton Dollmaier	35c8226419	[DOCS] Fix parameter formatting for GeoHash grid agg docs (#53032 ) Adds missing colon (`:`) to the parameter definition list.	2020-03-09 08:16:40 -04:00
James Rodewig	28cb4a167d	[DOCS] Reformat `word_delimiter_graph` token filter (#53170 ) (#53272 ) Makes the following changes to the `word_delimiter_graph` token filter docs: * Updates the Lucene experimental admonition. * Updates description * Adds analyze snippet * Adds custom analyzer and custom filter snippets * Reorganizes and updates parameter list * Expands and updates section re: differences between `word_delimiter` and `word_delimiter_graph`	2020-03-09 06:45:44 -04:00
István Zoltán Szabó	aafc0409a9	[DOCS] Makes the description clearer on how to use aggregations in an anomaly detection job (#53103 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-03-09 09:49:59 +01:00
Lisa Cawley	341417613e	[7.x][DOCS] Adds common definitions for security settings (#51017 ) (#53242 ) Co-Authored-By: Tim Vernum <tim@adjective.org>	2020-03-06 16:28:54 -08:00
Gordon Brown	ff9b8bda63	Implement hidden aliases (#52547 ) This commit introduces hidden aliases. These are similar to hidden indices, in that they are not visible by default, unless explicitly specified by name or by indicating that hidden indices/aliases are desired. The new alias property, `is_hidden` is implemented similarly to `is_write_index`, except that it must be consistent across all indices with a given alias - that is, all indices with a given alias must specify the alias as either hidden, or all specify it as non-hidden, either explicitly or by omitting the `is_hidden` property.	2020-03-06 16:02:38 -07:00
Adam Canady	a88d0c7ca3	[Docs] Correct examples for * and + in regexp-syntax.asciidoc (#53210 )	2020-03-06 17:17:32 +01:00
István Zoltán Szabó	bf3dcd4229	[DOCS] Adds deleting flag to the GET job stats API docs (#53223 )	2020-03-06 16:04:20 +01:00
James Rodewig	9bb9f63364	[DOCS] Note that `trim` filter doesn't change offsets (#53220 ) The [word delimiter graph token filter docs][0] note that the `trim` filter changes the length of tokens without changing their offsets. This explicitly mentions that in the `trim` filter docs. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/analysis-word-delimiter-graph-tokenfilter.html	2020-03-06 07:32:35 -05:00
James Rodewig	4bc6d2dbec	[DOCS] Correct link for Lucene StopFilter	2020-03-05 14:52:25 -05:00
Mayya Sharipova	7e2a9f58ee	script_score query errors on negative scores (#53133 ) 7.5 and 7.6 had a regression that allowed for script_score queries to have negative scores. We have corrected this regression in #52478. This is an addition to #52478 that adds a test and release notes.	2020-03-05 14:23:39 -05:00
David Turner	c2627aa22f	Clarify futher the order for a rolling upgrade (#52964 ) Expands the "master-ineligible then master-eligible" sentence into a list and specifies that within these subsets the order doesn't matter.	2020-03-05 15:28:59 +00:00
István Zoltán Szabó	58ce56f6c8	[DOCS] Makes the naming convention of the DFA response objects coherent (#53172 )	2020-03-05 16:26:57 +01:00
István Zoltán Szabó	48707ec55a	[DOCS] Expands GET DFA stat API docs with response objects. (#53107 )	2020-03-05 15:31:55 +01:00
Nik Everett	28df7ae5ed	Support multiple metrics in `top_metrics` agg (backport of #52965 ) (#53163 ) This adds support for returning multiple metrics to the `top_metrics` agg. It looks like: ``` POST /test/_search?filter_path=aggregations { "aggs": { "tm": { "top_metrics": { "metrics": [ {"field": "v"}, {"field": "m"} ], "sort": {"s": "desc"} } } } } ```	2020-03-05 08:12:01 -05:00
James Rodewig	e46bb54c7b	[DOCS] Document `any` keyword in EQL syntax (#52821 ) (#53157 ) Adds documentation for the `any` keyword to the EQL syntax docs. Includes: * Definition of an event category and its relationship to the event category field. * Example matching all event categories using `any` keyword * Example using `any` with `where true`	2020-03-05 05:02:47 -05:00
James Rodewig	801e50203e	[DOCS] Add missing doc type to EQL search results	2020-03-04 10:26:11 -05:00
Yannick Welsch	d1e7951e00	[DOCS] Add 7.6.1. release notes (#52874 ) Adds the release notes for 7.6.1.	2020-03-04 15:47:54 +01:00
James Rodewig	e3d3c3400c	[DOCS] Update EQL default event category and timestamp values (#53102 ) Updates the documented default `event_category_field` and `timestamp_field` values for the EQL search API. Also updates related guidance in the EQL requirement docs. Relates to #53073.	2020-03-04 09:17:37 -05:00
James Rodewig	0c4bf64095	[DOCS] Fix several Asciidoctor double arrow replacements (#52827 ) Per the [Asciidoctor docs][0], Asciidoctor replaces the following syntax with double arrows in the rendered HTML: * => renders as ⇒ * <= renders as ⇐ This escapes several unintended replacements, such as in the Painless docs. Where appropriate, it also replaces some double arrow instances with single arrows for consistency. [0]: https://asciidoctor.org/docs/user-manual/#replacements	2020-03-04 08:43:19 -05:00
James Rodewig	2b59f8ac34	[DOCS] Correct `hits.total.relation` response parm def (#52847 ) Fixes a partially completed definition for the `hits.total.relation` response parameter in the search API docs.	2020-03-04 08:23:34 -05:00
Aleksandr Maus	b47bffba24	EQL: consistent naming for event type vs event category (#53073 ) (#53090 ) Related to https://github.com/elastic/elasticsearch/issues/52941	2020-03-04 08:02:38 -05:00
Lisa Cawley	892f0d5848	[DOCS] Adds link in datafeed indices_options (#53067 )	2020-03-03 10:31:32 -08:00
Lisa Cawley	bb4bcbc54c	[DOCS] Adds ignore_throttled to index conventions (#53063 )	2020-03-03 10:23:56 -08:00
James Rodewig	cf87724ff6	[DOCS] Reformat `stop` token filter (#53059 ) Makes the following changes to the `stop` token filter docs: * Updates description * Adds a link to the related Lucene filter * Adds detailed analyze snippet * Updates custom analyzer and custom filter snippets * Adds a list of predefined stop words by language Co-authored-by: ScottieL <36999642+ScottieL@users.noreply.github.com>	2020-03-03 13:22:52 -05:00
István Zoltán Szabó	6cece3a709	[DOCS] Adds response body documentation to GET inference API (#53050 )	2020-03-03 16:26:40 +01:00
Adrien Grand	cb868d2f5e	Introduce a `constant_keyword` field. (#49713 ) (#53024 ) This field is a specialization of the `keyword` field for the case when all documents have the same value. It typically performs more efficiently than keywords at query time by figuring out whether all or none of the documents match at rewrite time, like `term` queries on `_index`. The name is up for discussion. I liked including `keyword` in it, so that we still have room for a `singleton_numeric` in the future. However I'm unsure whether to call it `singleton`, `constant` or something else, any opinions? For this field there is a choice between 1. accepting values in `_source` when they are equal to the value configured in mappings, but rejecting mapping updates 2. rejecting values in `_source` but then allowing updates to the value that is configured in the mapping This commit implements option 1, so that it is possible to reindex from/to an index that has the field mapped as a keyword with no changes to the source. Backport of #49713	2020-03-03 16:01:47 +01:00
James Rodewig	bcb68c860c	[DOCS] Reorganize EQL requirements page	2020-03-03 07:02:30 -05:00
James Rodewig	5cffa14f45	[DOCS] Fix typo in EQL docs	2020-03-02 16:09:03 -05:00
Costin Leau	712e0c05cd	EQL: Add implicit ordering on timestamp (#53004 ) QL: Move Sort base class from SQL to QL (cherry picked from commit 798015b7bbd565e9c4222724614baeb432c7c2b3)	2020-03-02 22:41:36 +02:00
Orhan Toy	ad2f630795	[DOCS] Fix formatting of simulate ingest pipeline API docs (#52754 )	2020-03-02 11:46:27 -05:00
István Zoltán Szabó	a267849def	[DOCS] Adds transform-settings page and its subpage to redirects (#52944 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-03-02 16:36:40 +01:00
Lisa Cawley	4fbe1b0550	[DOCS] Adds cat anomaly detectors API (#52866 ) (#52970 )	2020-03-02 07:28:55 -08:00
Hendrik Muhs	a328a8eaf1	[7.x][Transform] implement node.transform to control where to… (#52998 ) implement transform node attributes to disable transform on certain nodes and test which nodes are allowed to do remote connections closes #52200 closes #50033 closes #48734 backport #52712	2020-03-02 16:10:57 +01:00
James Rodewig	db64029919	[7.x] [DOCS] Add parameter examples to EQL search tutorial (#52953 ) Makes the following updates to the EQL search tutorial: * Adds an API response to the basic tutorial * Adds an example using the `event_type_field` parm * Adds an example using the `timestamp_field`parm * Adds an example using the `query` parm * Updates example dataset to support more EQL query variety	2020-03-02 10:08:03 -05:00
Aleksandr Maus	89ed857c79	EQL: Change request parameter query to filter and rule to query (#52971 ) (#53006 ) Related to https://github.com/elastic/elasticsearch/issues/52911	2020-03-02 09:26:23 -05:00
James Rodewig	d336faa0b0	[DOCS] Reformat trim token filter docs (#51649 ) Makes the following changes to the `trim` token filter docs: * Updates description * Adds a link to the related Lucene filter * Adds tip about removing whitespace using tokenizers * Adds detailed analyze snippets * Adds custom analyzer snippet	2020-03-02 07:48:23 -05:00
James Rodewig	f5bccad847	[DOCS] Correct guidance for `index_options` mapping parm (#52899 ) Adds a warning admonition stating that the `index_options` mapping parameter is intended only for `text` fields. Removes an outdated statement regarding default values for numeric and other datatypes.	2020-03-02 07:39:35 -05:00
rhymes	7eb4c07f1f	[DOCS] Fix typo in index and search analysis docs (#52988 )	2020-03-02 07:25:01 -05:00
Dimitris Athanasiou	85b4e45093	[7.x]ML] Parse and report memory usage for DF Analytics (#52778 ) (#52980 ) Adds reporting of memory usage for data frame analytics jobs. This commit introduces a new index pattern `.ml-stats-*` whose first concrete index will be `.ml-stats-000001`. This index serves to store instrumentation information for those jobs. Backport of #52778 and #52958	2020-02-29 13:03:40 +02:00
Rory Hunter	b1be7dcd2d	Document how to change GC logging behaviour (#52879 ) Closes #43990. Describe how to change the default GC settings without changing the default `jvm.options`. Give examples using `jvm.options.d`, and `ES_JAVA_OPTS` with Docker.	2020-02-28 21:27:45 +00:00
Martijn van Groningen	6aa9aaa2c6	Add validation for dynamic templates (#52890 ) Backport of #51233 to the seven dot x branch. Tries to load a `Mapper` instance for the mapping snippet of a dynamic template. This should catch things like using an analyzer that is undefined or mapping attributes that are unused. This is best effort: * If `{{name}}` placeholder is used in the mapping snippet then validation is skipped. * If `match_mapping_type` is not specified then validation is performed for all mapping types. If parsing succeeds with a single mapping type then this the dynamic mapping is considered valid. If is detected that a dynamic template mapping snippet is invalid at mapping update time then the mapping update is failed for indices created on 8.0.0-alpha1 and later. For indices created on prior version a deprecation warning is omitted instead. In 7.x clusters the mapping update will never fail in case of an invalid dynamic template mapping snippet and a deprecation warning will always be omitted. Closes #17411 Closes #24419 Co-authored-by: Adrien Grand <jpountz@gmail.com>	2020-02-28 10:35:04 +01:00
Nik Everett	1d1956ee93	Add size support to `top_metrics` (backport of #52662 ) (#52914 ) This adds support for returning the top "n" metrics instead of just the very top. Relates to #51813	2020-02-27 16:12:52 -05:00
Benjamin Trent	eac38e9847	[ML] Add indices_options to datafeed config and update (#52793 ) (#52905 ) This adds a new configurable field called `indices_options`. This allows users to create or update the indices_options used when a datafeed reads from an index. This is necessary for the following use cases: - Reading from frozen indices - Allowing certain indices in multiple index patterns to not exist yet These index options are available on datafeed creation and update. Users may specify them as URL parameters or within the configuration object. closes https://github.com/elastic/elasticsearch/issues/48056	2020-02-27 13:43:25 -05:00
Nattachai Suteerapongpan	14f847cc8f	[DOCS] Fix typo in task management API docs (#52881 )	2020-02-27 11:31:11 -05:00
Josh Devins	68ba571f70	Adds recall@k metric to rank eval API (#52889 ) This change adds the recall@k metric and refactors precision@k to match the new metric. Recall@k is an important metric to use for learning to rank (LTR) use-cases. Candidate generation or first ranking phase ranking functions are often optimized for high recall, in order to generate as many relevant candidates in the top-k as possible for a second phase of ranking. Adding this metric allows tuning that base query for LTR. See: https://github.com/elastic/elasticsearch/issues/51676 Backports: https://github.com/elastic/elasticsearch/pull/52577	2020-02-27 16:04:24 +01:00
István Zoltán Szabó	8785f57dfe	[DOCS] Reformats cat DFA API docs. (#52885 )	2020-02-27 14:21:52 +01:00
István Zoltán Szabó	4a33352a94	[DOCS] Adds cat trained model API documentation (#52824 )	2020-02-27 12:54:11 +01:00
Costin Leau	40bc06f6ad	EQL: Hook engine to Elasticsearch (#52828 ) Add query execution and return actual results returned from Elasticsearch inside the tests (cherry picked from commit 3e039282bf991af87604a6d4f8eada19d5e33842)	2020-02-27 11:22:22 +02:00
David Turner	69b78f7f8a	"Adding nodes" instructions only work on localhost (#52677 ) The introductory sections of the reference manual contains some simplified instructions for adding a node to the cluster. Unfortunately they are a little too simplified and only really work for clusters running on `localhost`. If you try and follow these instructions for a distributed cluster then the new node will, confusingly, auto-bootstrap itself into a distinct one-node cluster. Multiple nodes running on localhost is a valid config, of course, but we should spell out that these instructions are really only for experimentation and that it takes a bit more work to add nodes to a distributed cluster. This commit does so. Also, the "important config" instructions for discovery say that you MUST set `discovery.seed_hosts` whereas in fact it is fine to ignore this setting and use a dynamic discovery mechanism instead. This commit weakens this statement and links to the docs for dynamic discovery mechanisms. Finally, this section is also overloaded with some technical details that are not important for this context and are adequately covered elsewhere, and completely fails to note that the default discovery port is 9300. This commit addresses this.	2020-02-27 09:18:37 +00:00
James Rodewig	f5253d20f7	[DOCS] Update term vectors snippet to prevent CI failure (#52819 ) Adds the `?refresh=wait_for` query argument to an index API snippet in the term vectors API docs. This should ensure the document is indexed and available before a subsequent term vectors API request executes. Fixes #52814.	2020-02-26 12:41:40 -05:00
Lisa Cawley	b788ec7157	[DOCS] Adds cat datafeeds API (#52738 )	2020-02-26 09:28:57 -08:00
Bogdan Pintea	304e1e69b8	remove references to the SQL API from ODBC config (#52765 ) Remove reference to an "SQL API" which could suggest that one needs to treat this in a special way when configuring the ODBC driver. (cherry picked from commit 451c341e0193b542409e8891ec2a31e62529a5e7)	2020-02-26 13:39:54 +01:00
István Zoltán Szabó	f57422bbfd	[DOCS] Adds cat data frame analytics API (#52764 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-02-26 11:10:42 +01:00
Lisa Cawley	05f1cd74a6	[DOCS] Fixes monitoring links (#52790 )	2020-02-25 18:08:23 -08:00
Lisa Cawley	924f0bd243	[DOCS] Updates custom rules example (#52731 )	2020-02-25 09:32:52 -08:00
Andrei Stefan	51c6aefa55	SQL: Use calendar_interval of 1d for HISTOGRAMs with 1 DAY intervals (#52749 ) (#52771 ) (cherry picked from commit 556f5fa33be88570c4f8550cb8f784323d26a707)	2020-02-25 18:44:02 +02:00
Pius	563f033511	Update ilm-settings.asciidoc (#51577 )	2020-02-25 10:18:55 -05:00
bellengao	d2db16e046	[DOCS] Correct policy name in ILM docs example (#52354 ) Updates an example snippet to use a consistent policy name.	2020-02-25 09:36:22 -05:00
David Pilato	6c6ab8fa47	[DOS] Fix typo in CSV processor docs (#52649 ) Corrects an example array in a snippet of the CSV processor docs.	2020-02-25 08:48:50 -05:00
bellengao	49f37989c4	[DOCS] Fix typo in ingest node docs (#52671 )	2020-02-25 07:57:52 -05:00
David Roberts	cf122d13b8	[ML] Use event.timezone in file_structure_finder ingest pipeline (#52720 ) This is because beat.timezone was renamed to event.timezone in elastic/beats#9458	2020-02-25 12:33:53 +00:00
James Rodewig	9b05f6a668	[DOCS] Add admonition for app using cat APIs (#52727 ) Adds an explicit "important" admonition discouraging apps from using cat APIs. cat APIs are intended for human consumption via the command line or Kibana console only. They are not intended for consumption by applications.	2020-02-25 07:20:33 -05:00
James Rodewig	1a14ae4e1b	[DOCS] Document `include_in_*` nested mapping parms (#52648 ) Adds documentation for the `include_in_parent` and `include_in_root` mapping parameters for the `nested` mapping datatype.	2020-02-25 07:13:49 -05:00
Adrien Grand	5f81906fcf	Discourage from opting in for the `niofs` store. (#52638 ) Indices open with the `niofs` store type load much more data on-heap than indices open with the `mmapfs` store type. This limitation is now documented and examples have been updated to show how to update settings to use the `mmapfs` store type rather than `niofs`.	2020-02-25 08:54:11 +01:00
Adrien Grand	9b0ddc1c03	Clarify the resiliency trade-off of disabling replicas to speed up indexing. (#52714 ) We should be more explicit about the downsides of disabling replicas and explain that users should be ready to re-do the entire load in case of issues mid-way.	2020-02-25 08:54:10 +01:00
Adrien Grand	5ce66b8b3c	Document how CCR may be used to speed up indexing. (#52717 ) One architecture that we have recommended to several users to speed up indexing involved using CCR to prevent searching from stealing resources from indexing.	2020-02-25 08:54:10 +01:00
Bob Blank	28d4b71947	Clarified http.max_content_length description (#52329 ) Adding "greater than" based on discussion with @jasontedor for clarity.	2020-02-24 21:01:14 -05:00
Andrei Stefan	ed6b10bc03	SQL: use a calendar interval for histograms over 1 month intervals (#52586 ) (#52715 ) (cherry picked from commit 928b11a34ec92d90d082abdf4fa09f7ce1d7c0c4)	2020-02-25 01:41:51 +02:00
Julie Tibshirani	ba0401ecfd	Correct the name of the search timeout parameter. (#52733 ) The request body parameter is called 'timeout', not 'search_timeout'.	2020-02-24 14:59:06 -08:00
lcawl	c6e35b460e	[DOCS] Adds anchor for custom rules	2020-02-24 11:39:15 -08:00
Mayya Sharipova	034b1c0ba3	Correct boost calculation in script_score query (#52478 ) (#52724 ) Before boost in script_score query was wrongly applied only to the subquery. This commit makes sure that the boost is applied to the whole score that comes out of script. Closes #48465	2020-02-24 13:48:21 -05:00
James Rodewig	5e48811585	[DOCS] Document CCS-supported APIs (#52708 ) Explicitly notes the Elasticsearch API endpoints that support CCS. This should deter users from attempting to use CCS with other API endpoints, such as `GET <index>/_doc/<_id>`.	2020-02-24 09:59:08 -05:00
Ignacio Vera	ba9d3c6389	Add support for multipoint shape queries (#52564 ) (#52705 )	2020-02-24 13:46:51 +01:00
James Rodewig	98bcf06bae	[DOCS] Correct multi search API docs (#52523 ) * Adds an example request to the top of the page. * Relocates several parameters erroneously listed under "Request body" to the appropriate "Query parameters" section. * Updates the "Request body" section to better document the NDJSON structure of msearch requests.	2020-02-24 07:43:10 -05:00
Marios Trivyzas	c03f51f68f	[Docs] Clarify default value for `allow_no_indices` (#52635 ) (#52697 ) Add default value to each one of the usages of `allow_no_indices` since it differs between different APIs. Relates to: #52534 (cherry picked from commit 2eb986488ac326d6da6ab8ad0203a94e08684a36)	2020-02-24 11:57:32 +01:00
Benjamin Trent	afd90647c9	[ML] Adds feature importance to option to inference processor (#52218 ) (#52666 ) This adds machine learning model feature importance calculations to the inference processor. The new flag in the configuration matches the analytics parameter name: `num_top_feature_importance_values` Example: ``` "inference": { "field_mappings": {}, "model_id": "my_model", "inference_config": { "regression": { "num_top_feature_importance_values": 3 } } } ``` This will write to the document as follows: ``` "inference" : { "feature_importance" : { "FlightTimeMin" : -76.90955548511226, "FlightDelayType" : 114.13514762158526, "DistanceMiles" : 13.731580450792187 }, "predicted_value" : 108.33165831875137, "model_id" : "my_model" } ``` This is done through calculating the [SHAP values](https://arxiv.org/abs/1802.03888). It requires that models have populated `number_samples` for each tree node. This is not available to models that were created before 7.7. Additionally, if the inference config is requesting feature_importance, and not all nodes have been upgraded yet, it will not allow the pipeline to be created. This is to safe-guard in a mixed-version environment where only some ingest nodes have been upgraded. NOTE: the algorithm is a Java port of the one laid out in ml-cpp: https://github.com/elastic/ml-cpp/blob/master/lib/maths/CTreeShapFeatureImportance.cc usability blocked by: https://github.com/elastic/ml-cpp/pull/991	2020-02-21 18:42:31 -05:00
Mayya Sharipova	3840a763d8	Correct release notes for 7.5 (#52660 ) Remove a mention to a feature that was not merged, as its corresponding PR was closed.	2020-02-21 14:59:46 -05:00
Nik Richers	101bca86d2	[DOCS] Switch to standard ESS trial links (#52552 ) Switches ESS trial sign-up links over to a standard attribute. This provides better metrics for how effective these links are.	2020-02-21 12:07:10 -05:00
Lisa Cawley	4ff78e8a00	[7.x][DOCS] Adds X-Pack usage API (#52592 )	2020-02-21 06:57:11 -08:00
James Rodewig	068181b0b6	[DOCS] Add missing `indices` parms returned by `_nodes/stats` (#52055 ) Adds several human-readable `indices` parameters returned by the `_nodes/stats` API.	2020-02-21 08:15:59 -05:00
Andrei Stefan	7fe2843a9e	SQL: specify command to run the CLI on a remote machine without Elasticsearch (#52626 ) (cherry picked from commit 477b0eda8322c5dcb6861bd262bfeec17ff133fe)	2020-02-21 13:29:58 +02:00
James Rodewig	80b77e92d4	[7.x] [DOCS] Re-add redirects for API relocation (#52628 ) Re-adds several redirects removed with #50510. These redirects were related to the relocation of several API docs to new pages under the 'REST APIs' chapter. We've since decided to only remove such redirects with major releases.	2020-02-21 05:32:10 -05:00
István Zoltán Szabó	1d895118dd	[DOCS] Links transforms in aggregation docs (#52563 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-02-21 08:23:34 +01:00
Ignacio Vera	107f00a4ec	Add support for multipoint geoshape queries (#52133 ) (#52553 ) Currently multi-point queries are not supported when indexing your data using BKD-backed geoshape strategy. This commit removes this limitation.	2020-02-21 07:45:53 +01:00
Yannick Welsch	d76358c875	Deprecate fixed_auto_queue_size thread pool type (#52399 ) Relates #52280	2020-02-20 11:11:06 +01:00
Russ Cam	62da077beb	Specify name on enrich.get_policy as list type (#50217 ) This commit updates the enrich.get_policy API to specify name as a list, in line with other URL parts that accept a comma-separated list of values. In addition, update the get enrich policy API docs to align the URL part name in the documentation with the name used in the REST API specs. (cherry picked from commit 94f6f946ef283dc93040e052b4676c5bc37f4bde)	2020-02-20 11:39:28 +10:00
Lee Hinman	b11dbb2205	Correct SLM retention timezone documentation (#52533 ) This erroneously said that retention is run in the master node's timezone, however, it is actually run in UTC.	2020-02-19 13:46:43 -07:00
Valentin Crettaz	a68fafd64b	[DOCS] Clarify that "now" cannot be used in `date_range` at index time (#52446 ) `date_range` fields do not accept `"now"` as a value of either bounds at indexing time. This corrects an error in the range data type mapping docs.	2020-02-19 12:40:58 -05:00
Bogdan Pintea	db8b306085	SQL: update ODBC docs, cover Cloud ID, latest params (#52291 ) * Refresh snapshots with latest look Add new snapshots with the connection editor to reflect the latest UI. * Document the effect of the late added params Add details about the Cloud ID setting, as well as those on the Misc tab. (cherry picked from commit afa67625e847e99a22264f5dd6fa0daa37786c6f)	2020-02-19 17:42:28 +01:00
James Rodewig	43376c6e06	[DOCS] Document how CCS handles cluster-level settings (#49941 ) Updates the cross-cluster search (CCS) documentation to note how cluster-level settings are applied. When `ccs_minimize_roundtrips` is `true`, each cluster applies its own cluster-level settings to the request. When `ccs_minimize_roundtrips` is `false`, cluster-level settings for the local cluster is used. This includes shard limit settings, such as `action.search.shard_count.limit`, `pre_filter_shard_size`, and `max_concurrent_shard_requests`. If these limits are set too low, the request could be rejected.	2020-02-19 09:21:57 -05:00
debadair	969cdfaaa4	[DOCS] Clean up links from SQL client app pages. (#52442 ) * [DOCS] Clean up links from SQL client app pages. * Linked to client apps from prereqs.	2020-02-18 12:42:20 -08:00
Lisa Cawley	123b3c6f55	[DOCS] Clarifies description of num_top_feature_importance_values (#52246 ) Co-Authored-By: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com>	2020-02-18 08:50:21 -08:00
James Rodewig	9128106b4c	[DOCS] Remove 'analyzed string' references (#51946 ) The `string` field datatype was replaced by the `text` and `keyword` field datatypes in [5.0][0]. This removes several outdated references to 'analyzed string' fields. [0]:https://www.elastic.co/guide/en/elasticsearch/reference/5.0/breaking_50_mapping_changes.html#_string_fields_replaced_by_textkeyword_fields	2020-02-14 12:34:37 -05:00
Andrei Stefan	4eea9c20ee	SQL: document the use of a filter on _routing (#52355 ) * Fix "Description"s for various sections in the functions pages. * Added a TIP for searching using a routing key. * Other small polishings (cherry picked from commit 9fad0b1ac4409a42c435ed040f41cbaea18930a3)	2020-02-14 19:00:26 +02:00
Lisa Cawley	e77e49e956	[DOCS] Adds machine learning highlights (#52334 )	2020-02-14 08:51:55 -08:00
Nik Everett	146def8caa	Implement top_metrics agg (#51155 ) (#52366 ) The `top_metrics` agg is kind of like `top_hits` but it only works on doc values so it should be faster. At this point it is fairly limited in that it only supports a single, numeric sort and a single, numeric metric. And it only fetches the "very topest" document worth of metric. We plan to support returning a configurable number of top metrics, requesting more than one metric and more than one sort. And, eventually, non-numeric sorts and metrics. The trick is doing those things fairly efficiently. Co-Authored by: Zachary Tong <zach@elastic.co>	2020-02-14 11:19:11 -05:00
bellengao	cabc1769e2	[DOC] Remove definition typo in update alias API docs (#52184 ) Removes an erroneously duplicated definition heading from the update alias API reference docs.	2020-02-14 08:31:21 -05:00
Igor Motov	a66988281f	Add histogram field type support to boxplot aggs (#52265 ) Add support for the histogram field type to boxplot aggs. Closes #52233 Relates to #33112	2020-02-13 18:09:26 -05:00
debadair	291713f284	[DOCS] Fixed typo in jump link. (#52302 )	2020-02-12 17:53:00 -08:00
Ryan Ernst	12e378b3ac	Fix incorrect date nanos docs example (#52249 ) The example of how to access the nano value of a date_nanos field has been broken since it was created. This commit fixes it to use the correct scripting methods. closes #51931	2020-02-12 15:55:41 -08:00
Marios Trivyzas	dac720d7a1	Add a cluster setting to disallow expensive queries (#51385 ) (#52279 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050 (cherry picked from commit a8b39ed842c7770bd9275958c9f747502fd9a3ea)	2020-02-12 22:56:14 +01:00
Lisa Cawley	40b58e612d	[DOCS] Fixes, sorts ML tagged regions (#52283 )	2020-02-12 13:52:34 -08:00
Marios Trivyzas	d9fd6fc90c	SQL: [Docs] Fix typo Add missing closing "`" Follows: `c2e0552537`	2020-02-12 21:50:57 +01:00
James Rodewig	ca34817659	[DOCS] Add EQL limitations page (#52001 ) Documents limitations for EQL in Elasticsearch.	2020-02-12 08:45:43 -05:00
James Rodewig	20453d3ac8	[DOCS] Add basic EQL search tutorial docs (#51574 ) I plan to add additional sections to this page with future PRs: * Specify timestamp and event type fields * Specify a join key field * Filter using query DSL * Paginate a large response See #51057.	2020-02-12 08:42:09 -05:00
James Rodewig	3f151d1d75	[DOCS] Add redirects, update JSON spec to fix docs build (#51747 ) Docs build [#11556][0] broke due to several outdated or incorrect links in the JSON REST spec. This fixes those links where possible and adds redirects. [0]: https://elasticsearch-ci.elastic.co/job/elastic+docs+master+build/11556/	2020-02-12 08:30:59 -05:00
Marios Trivyzas	c2e0552537	SQL: [Docs] Add limitation for sorting on aggs (#52210 ) Add a section to point out that when ordering by an aggregate only plain aggregate functions are allowed, no scalars/operators can be used on top of them. Fixes: #52204 (cherry picked from commit 78a1185549ff7f3229fd2d036567eb2a4f2cf230)	2020-02-12 12:56:06 +01:00
James Rodewig	6fe8f1649b	[DOCS] Include docs on permanently unreleased branches only (#51743 ) Adds the ability to display docs on permanently unreleased branches, such as `master` and `7.x`. Also updates how the autoscaling and EQL docs are included. Currently, these feature-flag docs would display on any unreleased branches that contain the changes, such as 7.7.	2020-02-11 11:24:13 -05:00
Igor Motov	667e1a5225	Add Boxplot Aggregation (#52174 ) Adds a `boxplot` aggregation that calculates min, max, medium and the first and the third quartiles of the given data set. Closes #33112	2020-02-11 09:38:17 -05:00
David Turner	00b9098250	Ignore timeouts with single-node discovery (#52159 ) Today we use `cluster.join.timeout` to prevent nodes from waiting indefinitely if joining a faulty master that is too slow to respond, and `cluster.publish.timeout` to allow a faulty master to detect that it is unable to publish its cluster state updates in a timely fashion. If these timeouts occur then the node restarts the discovery process in an attempt to find a healthier master. In the special case of `discovery.type: single-node` there is no point in looking for another healthier master since the single node in the cluster is all we've got. This commit suppresses these timeouts and instead lets the node wait for joins and publications to succeed no matter how long this might take.	2020-02-11 14:15:01 +00:00
David Roberts	4c88996cd7	[DOCS] Correct important note for xpack.transform.enabled (#52194 ) Because transforms get assigned to an arbitrary data node it is important that the transforms plugin is enabled on every data node.	2020-02-11 13:02:10 +00:00
Yang Wang	16ba59e9d1	Expose more authentication info to ingest pipeline (#51305 ) (#52119 ) The changes add more granularity for identiying the data ingestion user. The ingest pipeline can now be configure to record authentication realm and type. It can also record API key name and ID when one is in use. This improves traceability when data are being ingested from multiple agents and will become more relevant with the incoming support of required pipelines (#46847) Resolves: #49106	2020-02-11 23:05:01 +11:00
Andrei Stefan	2f1631d9d0	Telemetry data initial implementation (#51715 ) (#52175 ) (cherry picked from commit f1d1cceacaacf226fcd2459f34689843b822fe4b)	2020-02-11 09:15:47 +02:00
Lisa Cawley	c4525f8cca	[DOCS] Adds ml-cpp PRs to release notes (#52158 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-02-10 18:06:01 -08:00
Lee Hinman	37a2e9bac6	[7.x] Allow forcemerge in the hot phase for ILM policies (#520… (#52083 ) * Allow forcemerge in the hot phase for ILM policies This commit changes the `forcemerge` action to also be allowed in the `hot` phase for policies. The forcemerge will occur after a rollover, and allows users to take advantage of higher disk speeds for performing the force merge (on a separate node type, for example). On caveat with this is that a `forcemerge` in the `hot` phase MUST be accompanied by a `rollover` action. ILM validates policies to ensure this is the case. Resolves #43165 * Use anyMatch instead of findAny in validation * Make randomTimeseriesLifecyclePolicy single-pass	2020-02-10 08:54:49 -07:00
David Roberts	1cefafdd14	[ML] Add new categorization stats to model_size_stats (#52009 ) This change adds support for the following new model_size_stats fields: - categorized_doc_count - total_category_count - frequent_category_count - rare_category_count - dead_category_count - categorization_status Backport of #51879	2020-02-10 09:10:50 +00:00
Jason Tedor	c4c0db6f21	Introduce jvm.options.d for customizing JVM options (#51882 ) This commit introduces the ability to override JVM options by adding custom JVM options files to a jvm.options.d directory. This simplifies administration of Elasticsearch by not requiring administrators to keep the root jvm.options file in sync with changes that we make to the root jvm.options file. Instead, they are not expected to modify this file but instead supply their own in jvm.options.d. In Docker installations, this means they can bind mount this directory in. In future versions of Elasticsearch, we can consider removing the root jvm.options file (instead, providing all options there as system JVM options).	2020-02-08 18:50:14 -05:00
Zachary Tong	c8f0fe135d	Update release notes for BC5	2020-02-07 16:41:51 -05:00
debadair	2588022b81	[DOCS] Fixed typo. (#52071 )	2020-02-07 11:04:56 -08:00
James Rodewig	db22fb6e1c	Revert "[DOCS] Include docs on permanently unreleased branches only (#51743 )" This reverts commit `9d09796815`.	2020-02-07 12:14:44 -05:00
David Kyle	8f10a7c6ca	[ML] Make Ensemble feature names optional (#51996 ) The featureNames field is requisite in individual models but is not required by the Ensemble.	2020-02-07 10:08:37 +00:00
Armin Braun	91e938ead8	Add Trace Logging of REST Requests (#51684 ) (#52015 ) Being able to trace log all REST requests to a node would make debugging a number of issues a lot easier.	2020-02-07 09:03:20 +01:00
Jason Tedor	25daf5f1e1	Add autoscaling API skelton (#51564 ) The main purpose of this commit is to add a single autoscaling REST endpoint skeleton, for the purpose of starting to build out the build and testing infrastructure that will surround it. For example, rather than commiting a fully-functioning autoscaling API, we introduce here the skeleton so that we can start wiring up the build and testing infrastructure, establish security roles/permissions, an so on. This way, in a forthcoming PR that introduces actual functionality, that PR will be smaller and have less distractions around that sort of infrastructure.	2020-02-06 21:55:01 -05:00
James Rodewig	9d09796815	[DOCS] Include docs on permanently unreleased branches only (#51743 ) Adds the ability to display docs on permanently unreleased branches, such as `master` and `7.x`. Also updates how the autoscaling and EQL docs are included. Currently, these feature-flag docs would display on any unreleased branches that contain the changes, such as 7.7.	2020-02-06 14:47:06 -05:00
Rory Hunter	cdb9862495	Clarify use of ES_JAVA_OPTS and Docker (#51984 ) Backport of #51867. Tweak the documentation around configuring the heap size when using Docker, to state that: - using `ES_JAVA_OPTS` is the preferred method - Any `ES_JAVA_OPTS` overrides the defaults in `jvm.options` - It's possible to bind-mount a custom `jvm.options`	2020-02-06 10:00:14 +00:00
Lisa Cawley	4c2dcf2bde	[DOCS] Adds curl explanation to getting started content (#51963 )	2020-02-05 19:02:21 -08:00
Przemko Robakowski	6332de40b4	Add empty_value parameter to CSV processor (#51567 ) (#51966 ) * Add empty_value parameter to CSV processor This change adds `empty_value` parameter to the CSV processor. This value is used to fill empty fields. Fields will be skipped if this parameter is ommited. This behavior is the same for both quoted and unquoted fields. * docs updated * Fix compilation problem Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-05 23:35:52 +01:00
Lisa Cawley	ea1d9e0803	[DOCS] Defines data frame transform stats API objects (#44197 )	2020-02-05 12:30:55 -08:00
Lisa Cawley	b93ebc29c1	[DOCS] Augments update license API (#51903 )	2020-02-05 11:08:11 -08:00
Lee Hinman	f8c8a10f05	Add documentation about ILM forcemerge with best_compression (#51893 ) This adds the option to the parameter list and a warning about the index being unavailable during the close and open operations. Relates to #49974	2020-02-05 09:37:41 -07:00
James Rodewig	b70cbc97aa	[DOCS] Add EQL syntax page (#51821 ) Adds documentation for basic EQL syntax. Joins, sequences, and other syntax to be added as its supported in future development. Co-Authored-By: Ross Wolf <31489089+rw-access@users.noreply.github.com>	2020-02-05 08:14:07 -05:00
David Kyle	289d4f4f4d	[ML] Remove stray field from inference docs (#51870 ) model_info_field is not a valid option	2020-02-05 10:50:51 +00:00
Adrien Grand	ad9d2f1922	Move analysis/mappings stats to cluster-stats. (#51875 ) Closes #51138	2020-02-05 11:02:25 +01:00
debadair	c0156cbb5d	Backporting updates to ILM org, overview, & GS (#51898 ) * [DOCS] Align with ILM API docs (#48705) * [DOCS] Reconciled with Snapshot/Restore reorg * [DOCS] Split off ILM overview to a separate topic. (#51287) * [DOCS} Split off overview to a separate topic. * [DOCS] Incorporated feedback from @jrodewig. * [DOCS] Edit ILM GS tutorial (#51513) * [DOCS] Edit ILM GS tutorial * [DOCS] Incorporated review feedback from @andreidan. * [DOCS] Removed test link & fixed anchor & title. * Update docs/reference/ilm/getting-started-ilm.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Fixed glossary merge error. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-02-04 16:45:18 -08:00
Florian Kelbert	43a7aadd46	[DOCS] Remove unneeded comma from CSV processor example (#51859 )	2020-02-04 09:26:20 -05:00
baifan	60a53f2897	[DOCS] Fix `disk.used_percent` typo in `_cat/nodes` docs (#51854 ) Corrects an example for the `disk.used_percent` parameter in `_cat/nodes` API.	2020-02-04 09:15:56 -05:00
Grzegorz Banasiak	87b126bbfc	[DOCS] Fix index_prefixes link in 'faster prefix queries' docs (#51833 ) Fixes a link in 'faster prefix queries' which incorrectly redirects to index_phrases mapping parameter description instead of index_prefixes.	2020-02-04 08:40:18 -05:00
William Brafford	ba2810f23d	Use standard format for reload settings API (#51560 ) (#51828 ) * Use standard format for reload settings API The reload-secure-settings API page was not reorganized for the standard API format, so this commit is reorganizing the page and adding some links to the page in related documentation. * Fix broken links * Reorder examples to correctly check API response * Note that only certain settings are reloadable * [DOCS] Edits layout * [DOCS] Removes unnecessary callouts Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-02-03 18:07:26 -05:00
Dan Hermann	4083eae0b7	[7.x] Secure password for monitoring HTTP exporter (#51775 ) Adds a secure and reloadable SECURE_AUTH_PASSWORD setting to allow keystore entries in the form "xpack.monitoring.exporters.*.auth.secure_password" to securely supply passwords for monitoring HTTP exporters. Also deprecates the insecure `AUTH_PASSWORD` setting.	2020-02-03 07:42:30 -06:00
James Rodewig	1545c2ab26	[DOCS] Document node stats response meta (#51263 ) Documents several metadata-related parameters returned by the `GET _nodes/stats` API.	2020-02-03 08:33:57 -05:00
Darren LaCasse	480e9238a4	[DOCS] Remove extra word (#51757 )	2020-01-31 10:30:06 -08:00
Zachary Tong	3147453600	Update Release Notes for BC4	2020-01-31 11:41:44 -05:00
Christoph Büscher	86f3b47299	Make `date_range` query rounding consistent with `date` (#50237 ) (#51741 ) Currently the rounding used in range queries can behave differently for `date` and `date_range` as explained in #50009. The behaviour on `date` fields is the one we document in https://www.elastic.co/guide/en/elasticsearch/reference/current/query-dsl-range-query.html#range-query-date-math-rounding. This change adapts the rounding behaviour for RangeType.DATE so it uses the same logic as the `date` for the `date_range` type. Backport of #50237	2020-01-31 15:35:05 +01:00
István Zoltán Szabó	dfc9f2330c	[DOCS] Adds PUT inference API docs (#51231 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com> Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-01-31 13:13:34 +01:00
István Zoltán Szabó	9600ab4f57	[DOCS] Adds recommendation on dedicated master-eligible nodes (#51674 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-01-31 12:56:58 +01:00
Sven Schliesing	dc99acee66	[Docs] Fix typo in node-tool.asciidoc (#51667 )	2020-01-31 10:36:21 +01:00
Yang Wang	77b00fc0c0	Add warnings for invalid realm order config (#51195 ) (#51515 ) The changes are to help users prepare for migration to next major release (v8.0.0) regarding to the break change of realm order config. Warnings are added for when: * A realm does not have an order config * Multiple realms have the same order config The warning messages are added to both deprecation API and loggings. The main reasons for doing this are: 1) there is currently no automatic relay between the two; 2) deprecation API is under basic and we need logging for OSS.	2020-01-31 12:32:37 +11:00
Lisa Cawley	1a40ebfa67	[DOCS] Adds missing testenv attribute (#51719 )	2020-01-30 16:15:17 -08:00
Lee Hinman	b9faa0733d	[7.x] Rename ILM history index enablement setting (#51698 ) (#51705 ) * Rename ILM history index enablement setting The previous setting was `index.lifecycle.history_index_enabled`, this commit changes it to `indices.lifecycle.history_index_enabled` to indicate this is not an index-level setting (it's node level).	2020-01-30 15:27:44 -07:00
James Rodewig	36b2663e98	[DOCS] Add attribute for Lucene analysis links (#51687 ) Adds a `lucene-analysis-docs` attribute for the Lucene `/analysis/` javadocs directory. This should prevent typos and keep the docs DRY.	2020-01-30 11:24:01 -05:00
James Rodewig	4fcf5a9de4	[DOCS] Rewrite analysis intro (#51184 ) * [DOCS] Rewrite analysis intro. Move index/search analysis content. * Rewrites 'Text analysis' page intro as high-level definition. Adds guidance on when users should configure text analysis * Rewrites and splits index/search analysis content: * Conceptual content -> 'Index and search analysis' under 'Concepts' * Task-based content -> 'Specify an analyzer' under 'Configure...' * Adds detailed examples for when to use the same index/search analyzer and when not. * Adds new example snippets for specifying search analyzers * clarifications * Add toc. Decrement headings. * Reword 'When to configure' section * Remove sentence from tip	2020-01-30 09:32:16 -05:00
Marios Trivyzas	f373020349	SQL: Fix ORDER BY YEAR() function (#51562 ) Previously, if YEAR() was used as and ORDER BY argument without being wrapped with another scalar (e.g. YEAR(birth_date) + 10), no script ordering was used but instead the underlying field (e.g. birth_date) was used instead as a performance optimisation. This works correctly if YEAR() is the only ORDER BY arg but if further args are used as tie breakers for the ordering wrong results are produced. This is because 2 rows with the different birth_date but on the same year are not tied as the underlying ordering is on birth_date and not on the YEAR(birth_date), and the following ORDER BY args are ignored. Remove this optimisation for YEAR() to avoid incorrect results in such cases. As a consequence another bug is revealed: scalar functions on top of nested fields produce scripted sorting/filtering which is not yet supported. In such cases no error was thrown but instead all values for such nested fields were null and were passed to the script implementing the sorting/filtering, producing incorrect results. Detect such cases and throw a validation exception. Fixes: #51224 (cherry picked from commit f41efd6753dc3650a7eabb3e07b02b3b32c5704c)	2020-01-30 15:29:36 +01:00
Nhat Nguyen	f0fad5b622	Deprecate translog retention settings (#51588 ) (#51638 ) This change deprecates the translog retention settings as they are effectively ignored since 7.4. Relates #50775 Relates #45473	2020-01-30 09:03:10 -05:00
Henning Andersen	ca8601373a	[DOCS] Task management API experimental status issue (#51634 ) Add issue reference to documentation. Relates #51628	2020-01-30 14:15:47 +01:00
Peter Dyson	b5a2ee5be2	[DOCS] Fix minor typo affecting formatting (#51655 )	2020-01-29 23:44:09 -08:00
Lisa Cawley	28f2f3dd02	[DOCS] Minor fixes in transform documentation (#51633 )	2020-01-29 16:58:18 -08:00
Lisa Cawley	fdf74f6ae4	[DOCS] Removes beta qualifiers from transform documentation (#51553 )	2020-01-29 08:41:54 -08:00
Lisa Cawley	3f4156e95a	[DOCS] Adds release highlight for transforms (#51555 )	2020-01-29 08:35:02 -08:00
Albert Zaharovits	90285ee907	Deprecate timeout.tcp_read AD/LDAP realm setting (#47305 ) The timeout.tcp_read AD/LDAP realm setting, despite the low-level allusion, controls the time interval the realms wait for a response for a query (search or bind). If the connection to the server is synchronous (un-pooled) the response timeout is analogous to the tcp read timeout. But the tcp read timeout is irrelevant in the common case of a pooled connection (when a Bind DN is specified). The timeout.tcp_read qualifier is hereby deprecated in favor of timeout.response. In addition, the default value for both timeout.tcp_read and timeout.response is that of timeout.ldap_search, instead of the 5s (but the default for timeout.ldap_search is still 5s). The timeout.ldap_search defines the server-controlled timeout of a search request. There is no practical use case to have a smaller tcp_read timeout compared to ldap_search (in this case the request would time-out on the client but continue to be processed on the server). The proposed change aims to simplify configuration so that the more common configuration change, adjusting timeout.ldap_search up, has the expected result (no timeout during searches) without any additional modifications. Closes #46028	2020-01-29 10:48:26 +02:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
James Rodewig	139305ffc8	[DOCS] Document `indices` cluster stats (#50527 ) Documents the header and `indices` response parameters returned by the `_cluster/stats` API. Co-Authored-By: David Turner <david.turner@elastic.co>	2020-01-28 11:00:00 -05:00
Yannick Welsch	fa212fe60b	Stricter checks of setup and teardown in docs tests (#51430 ) Adds extra checks due to 7.x backport	2020-01-28 16:52:23 +01:00
Yannick Welsch	f6686345c9	Avoid unnecessary setup and teardown in docs tests (#51430 ) The docs tests have recently been running much slower than before (see #49753). The gist here is that with ILM/SLM we do a lot of unnecessary setup / teardown work on each test. Compounded with the slightly slower cluster state storage mechanism, this causes the tests to run much slower. In particular, on RAMDisk, docs:check is taking ES 7.4: 6:55 minutes ES master: 16:09 minutes ES with this commit: 6:52 minutes on SSD, docs:check is taking ES 7.4: ??? minutes ES master: 32:20 minutes ES with this commit: 11:21 minutes	2020-01-28 16:52:23 +01:00
James Rodewig	70e4ae3381	[DOCS] Reformat unique token filter docs (#50748 ) * Updates the description * Adds analyze, custom analyzer, and custom filter snippets * Adds parameter documentation	2020-01-28 10:42:25 -05:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492 ) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but by parsing CSV in the ingest pipeline it makes the Kibana file upload functionality more easily interchangable with Filebeat such that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123 ) (#51510 ) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading, which will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores When reading a keystore, a user is only prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase Passphrase can be set to be empty when changing/setting it for an existing keystore Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool. There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. * Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file for gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail. It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
James Rodewig	65f49d0bba	[DOCS] Add top-level EQL docs page. Adds EQL requirements page. (#51334 ) * Creates a top-level page for EQL in the ES reference. This page contains a high-level introduction and will include a nav for other EQL docs pages as they're built. * Creates a requirements page. This page outlines the fields needed to use EQL in ES.	2020-01-27 16:04:47 -05:00
James Rodewig	23b65390ab	[DOCS] Add response snippets to 'Testing analyzers' page (#51427 ) Adds response snippets to the `POST _analyze` snippets in the 'Testing analyzers' page. Co-authored-by: Emmanuel DEMEY <demey.emmanuel@gmail.com>	2020-01-27 08:41:44 -05:00
David Turner	49bde5d286	Remove DEBUG-level default logging from actions (#51459 ) In `2bb31fe` (v0.6.0!) we added DEBUG-level logging to the default config of action loggers "for easier debugging". This change to the default config lives on to this day. It does not obviously make debugging any easier any more, but it does result in a good deal of log noise sometimes. This commit removes this special case from the default config. Closes #51198	2020-01-27 10:50:10 +00:00
Lisa Cawley	931b22349f	[DOCS] Adds http to elasticsearch-certutil command reference (#51188 )	2020-01-24 09:59:07 -08:00
Mayya Sharipova	a29deecbda	Revert "Make it clear this is boost at index time (#51390 )" This reverts commit `3d5238bd95`.	2020-01-24 11:05:42 -05:00

... 7 8 9 10 11 ...

7406 Commits