OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lisa Cawley	3f4156e95a	[DOCS] Adds release highlight for transforms (#51555 )	2020-01-29 08:35:02 -08:00
Albert Zaharovits	90285ee907	Deprecate timeout.tcp_read AD/LDAP realm setting (#47305 ) The timeout.tcp_read AD/LDAP realm setting, despite the low-level allusion, controls the time interval the realms wait for a response for a query (search or bind). If the connection to the server is synchronous (un-pooled) the response timeout is analogous to the tcp read timeout. But the tcp read timeout is irrelevant in the common case of a pooled connection (when a Bind DN is specified). The timeout.tcp_read qualifier is hereby deprecated in favor of timeout.response. In addition, the default value for both timeout.tcp_read and timeout.response is that of timeout.ldap_search, instead of the 5s (but the default for timeout.ldap_search is still 5s). The timeout.ldap_search defines the server-controlled timeout of a search request. There is no practical use case to have a smaller tcp_read timeout compared to ldap_search (in this case the request would time-out on the client but continue to be processed on the server). The proposed change aims to simplify configuration so that the more common configuration change, adjusting timeout.ldap_search up, has the expected result (no timeout during searches) without any additional modifications. Closes #46028	2020-01-29 10:48:26 +02:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
James Rodewig	139305ffc8	[DOCS] Document `indices` cluster stats (#50527 ) Documents the header and `indices` response parameters returned by the `_cluster/stats` API. Co-Authored-By: David Turner <david.turner@elastic.co>	2020-01-28 11:00:00 -05:00
Yannick Welsch	fa212fe60b	Stricter checks of setup and teardown in docs tests (#51430 ) Adds extra checks due to 7.x backport	2020-01-28 16:52:23 +01:00
Yannick Welsch	f6686345c9	Avoid unnecessary setup and teardown in docs tests (#51430 ) The docs tests have recently been running much slower than before (see #49753). The gist here is that with ILM/SLM we do a lot of unnecessary setup / teardown work on each test. Compounded with the slightly slower cluster state storage mechanism, this causes the tests to run much slower. In particular, on RAMDisk, docs:check is taking ES 7.4: 6:55 minutes ES master: 16:09 minutes ES with this commit: 6:52 minutes on SSD, docs:check is taking ES 7.4: ??? minutes ES master: 32:20 minutes ES with this commit: 11:21 minutes	2020-01-28 16:52:23 +01:00
James Rodewig	70e4ae3381	[DOCS] Reformat unique token filter docs (#50748 ) * Updates the description * Adds analyze, custom analyzer, and custom filter snippets * Adds parameter documentation	2020-01-28 10:42:25 -05:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492 ) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but by parsing CSV in the ingest pipeline it makes the Kibana file upload functionality more easily interchangable with Filebeat such that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123 ) (#51510 ) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading, which will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores When reading a keystore, a user is only prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase Passphrase can be set to be empty when changing/setting it for an existing keystore Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool. There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. * Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file for gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail. It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
James Rodewig	65f49d0bba	[DOCS] Add top-level EQL docs page. Adds EQL requirements page. (#51334 ) * Creates a top-level page for EQL in the ES reference. This page contains a high-level introduction and will include a nav for other EQL docs pages as they're built. * Creates a requirements page. This page outlines the fields needed to use EQL in ES.	2020-01-27 16:04:47 -05:00
James Rodewig	23b65390ab	[DOCS] Add response snippets to 'Testing analyzers' page (#51427 ) Adds response snippets to the `POST _analyze` snippets in the 'Testing analyzers' page. Co-authored-by: Emmanuel DEMEY <demey.emmanuel@gmail.com>	2020-01-27 08:41:44 -05:00
David Turner	49bde5d286	Remove DEBUG-level default logging from actions (#51459 ) In `2bb31fe` (v0.6.0!) we added DEBUG-level logging to the default config of action loggers "for easier debugging". This change to the default config lives on to this day. It does not obviously make debugging any easier any more, but it does result in a good deal of log noise sometimes. This commit removes this special case from the default config. Closes #51198	2020-01-27 10:50:10 +00:00
Lisa Cawley	931b22349f	[DOCS] Adds http to elasticsearch-certutil command reference (#51188 )	2020-01-24 09:59:07 -08:00
Mayya Sharipova	a29deecbda	Revert "Make it clear this is boost at index time (#51390 )" This reverts commit `3d5238bd95`.	2020-01-24 11:05:42 -05:00
Jonas F. Henriksen	3d5238bd95	Make it clear this is boost at index time (#51390 ) The way it was originally written, it sounds like we are boosting at query time. Of course, the effect is at query time, but the point here is that boosting is done at index time	2020-01-24 10:37:07 -05:00
Benjamin Trent	76660a5a4f	[7.x] [ML][Inference] add tags url param to GET (#51330 ) (#51404 ) * [ML][Inference] add tags url param to GET (#51330) Adds a new URL parameter, `tags` to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those who contain all the provided tags.	2020-01-24 08:26:58 -05:00
István Zoltán Szabó	8bdf654cc7	[DOCS] Refines description. (#51400 )	2020-01-24 13:34:25 +01:00
David Turner	40e7a826fc	Allow decimal max_task_wait_time in docs (#51352 ) The regex for the response to `GET _cat/health?v` in `getting-started.asciidoc` requires `max_task_wait_time` to match `(-\|\\d+(micros\|ms\|s))`, which doesn't match times such as `3.9ms` that contain a decimal point. This commit adjusts the regex to match times formatted like this too. Fixes #47537	2020-01-24 08:59:43 +00:00
Lisa Cawley	ec47698f7c	[DOCS] Updates categorization examples with wizard screenshots (#51133 )	2020-01-22 11:28:17 -08:00
Zachary Tong	83647101ef	Update Release notes for BC2	2020-01-22 14:05:59 -05:00
Lisa Cawley	4590d4156a	[DOCS] Clarify interval, frequency, and bucket span in ML APIs and example (#51280 )	2020-01-22 08:15:46 -08:00
Igor Motov	08e9c673e5	Fix leftover mentions of method parameter in Percentile Aggs (#51272 ) The method parameter is not used in the percentile aggs, instead the method is determined by the presence of `hdr` or `tdigest` objects. Relates to #8324	2020-01-22 10:03:35 -05:00
David Kyle	ca4b90a001	[ML] Calculate results and snapshot retention using latest bucket timestamps (#51061 ) (#51301 ) The retention period is calculated relative to the last bucket result or snapshot time rather than wall clock	2020-01-22 14:52:33 +00:00
Zachary Tong	b38fdf9f94	Add release highlights for 7.6 (#51070 ) Add release highlights for 7.6 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-01-22 09:42:57 -05:00
Russ Cam	86a50a24f3	[Docs] Including leading slash in range query doc example URLs (#51277 )	2020-01-22 09:42:18 +01:00
Stuart Tettemer	41c15b438d	Scripting: Add char position of script errors (#51069 ) (#51266 ) Add the character position of a scripting error to error responses. The contents of the `position` field are experimental and subject to change. Currently, `offset` refers to the character location where the error was encountered, `start` and `end` define a range of characters that contain the error. eg. ``` { "error": { "root_cause": [ { "type": "script_exception", "reason": "runtime error", "script_stack": [ "y = x;", " ^---- HERE" ], "script": "def x = new ArrayList(); Map y = x;", "lang": "painless", "position": { "offset": 33, "start": 29, "end": 35 } } ``` Refs: #50993	2020-01-21 13:45:59 -07:00
István Zoltán Szabó	30d1587ad5	[DOCS] Fixes indentation in inference processor code snippet (#51252 )	2020-01-21 16:22:16 +01:00
Lisa Cawley	79cf0894fa	[DOCS] Adds ML PRs to release notes (#51234 )	2020-01-20 11:08:23 -08:00
Jason Tedor	e20459c202	Exclude autoscaling docs from release docs (#51190 ) Since autoscaling is currently only under development, this commit causes the autoscaling docs to be excluded any time that release docs are being built.	2020-01-20 10:52:47 -05:00
Andrei Stefan	2908b7e5fc	SQL: add support for passing query parameters in REST API calls (#51029 ) (#51222 ) * REST PreparedStatement-like query parameters are now supported in the form of an array of non-object, non-array values where ES SQL parser will try to infer the data type of the value being passed as parameter. (cherry picked from commit 45b8bf619aecb1c03d7bc0cf06928dcc36005a66)	2020-01-20 16:40:19 +02:00
István Zoltán Szabó	424b4ed4ea	[DOCS] Expands the documentation of Node Query Cache (#51105 ) Co-authored-by: debadair <debadair@elastic.co>	2020-01-20 11:13:29 +01:00
Jess	4b31ad1c0c	[Docs] Small edits to Ranking Evaluation API docs (#51116 ) Small updates to grammar, syntax, and unclear wordings.	2020-01-20 10:30:23 +01:00
István Zoltán Szabó	e40580c24d	[DOCS] Removes CCS limitation item from Transforms limitations. (#51151 )	2020-01-20 09:44:11 +01:00
Jason Tedor	9ce4d2b901	Initial autoscaling commit (#51161 ) This commit merely adds the skeleton for the autoscaling project, adding the basics to include the autoscaling module in the default distribution, opt-in to code formatting, and a placeholder for the docs.	2020-01-17 15:31:12 -05:00
István Zoltán Szabó	83c92cf7eb	[DOCS] Adds text about data types to the categorization docs (#51145 )	2020-01-17 10:00:20 -08:00
lcawl	fee1a9528c	[DOCS] Removes duplicate title	2020-01-17 09:49:42 -08:00
Jay Modi	107989df3e	Introduce hidden indices (#51164 ) This change introduces a new feature for indices so that they can be hidden from wildcard expansion. The feature is referred to as hidden indices. An index can be marked hidden through the use of an index setting, `index.hidden`, at creation time. One primary use case for this feature is to have a construct that fits indices that are created by the stack that contain data used for display to the user and/or intended for querying by the user. The desire to keep them hidden is to avoid confusing users when searching all of the data they have indexed and getting results returned from indices created by the system. Hidden indices have the following properties: * API calls for all indices (empty indices array, _all, or ) will not return hidden indices by default. Wildcard expansion will not return hidden indices by default unless the wildcard pattern begins with a `.`. This behavior is similar to shell expansion of wildcards. * REST API calls can enable the expansion of wildcards to hidden indices with the `expand_wildcards` parameter. To expand wildcards to hidden indices, use the value `hidden` in conjunction with `open` and/or `closed`. * Creation of a hidden index will ignore global index templates. A global index template is one with a match-all pattern. * Index templates can make an index hidden, with the exception of a global index template. * Accessing a hidden index directly requires no additional parameters. Backport of #50452	2020-01-17 10:09:01 -07:00
Dimitris Athanasiou	b70ebdeb96	[7.x][ML] DF Analytics _explain API should skip object fields (#51115 ) (#51147 ) Object fields cannot be used as features. At the moment _explain API includes them and even worse it allows it does not error when an object field is excluded. This creates the expectation to the user that all children fields will also be excluded while it's not the case. This commit omits object fields from the _explain API and also adds an error if an object field is included or excluded. Backport of #51115	2020-01-17 14:02:59 +02:00
James Rodewig	2353fe47fc	[DOCS] Adds placeholder for 7.5.2 release notes (#51124 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-16 14:42:24 -05:00
James Rodewig	b6bf64b969	[DOCS] Collapse node stats response sections (#51063 ) elastic/docs#1687 added support for the `[%collapsible]` Asciidoc attribute, which creates collapsible sections in the HTML output. This PR makes two related changes to the nodes stats API documentation: * Makes the response parameter sections collapsible. This allows users to more easily navigate the page without long walls of text. * Reorders the response parameter sections to match the default order returned by the API. Relates to #47524.	2020-01-16 13:19:29 -05:00
James Rodewig	7ef906fde8	[DOCS] Add tutorials section to analysis topic (#50809 ) Adds a 'Configure text analysis' page to house tutorial content for the analysis topic. Also relocates the following pages as children as this new page: * 'Test an analyzer' * 'Configuring built-in analyzers' * 'Create a custom analyzer' I plan to add a tutorial for specifying index-time and search-time analyzers to this section as part of a future PR.	2020-01-16 13:12:06 -05:00
James Rodewig	ef26763ca9	[DOCS] Add concepts section to analysis topic (#50801 ) This helps the topic better match the structure of our machine learning docs, e.g. https://www.elastic.co/guide/en/machine-learning/7.5/ml-concepts.html This PR only includes the 'Anatomy of an analyzer' page as a 'Concepts' child page, but I plan to add other concepts, such as 'Index time vs. search time', with later PRs.	2020-01-16 13:00:39 -05:00
James Rodewig	1edaf2b101	[DOCS] Retitle analysis reference pages (#51071 ) * Changes titles to sentence case. * Appends pages with 'reference' to differentiate their content from conceptual overviews. * Moves the 'Normalizers' page to end of the Analysis topic pages.	2020-01-16 12:30:51 -05:00
James Rodewig	d590150ca2	[DOCS] Fix indent issue in similarity snippet (#51107 ) Updates snippet to consistently use 2-space indentation. The snippet previously used a mix of tab/5-space and 2-space indents. Co-authored-by: Peter Johnson <wiz@wiz.co.nz> Co-authored-by: Peter Johnson <peter@geocode.earth>	2020-01-16 11:00:15 -05:00
James Rodewig	1211772f6a	[DOCS] Use same index in Cluster Allocation Explain docs (#50936 ) Updates several example snippets in the Cluster Allocation Explain API docs to consistently use the `my_index` index. Previously, the snippets switches from `my_index` to `idx`, which could confuse users. Co-authored-by: Emmanuel DEMEY <demey.emmanuel@gmail.com> Co-authored-by: Emmanuel DEMEY <demey.emmanuel@gmail.com>	2020-01-16 09:15:26 -05:00
Ted Timmons	b345c7ff31	[Docs] Fix short alias for 'unassigned.for' (#51059 ) The short alias for `unassigned.for` is `uf`, not 'ua'.	2020-01-16 12:10:30 +01:00
PND	1d391f7113	[Docs] Fix example output of edge n-gram token filter. (#51085 )	2020-01-16 11:34:00 +01:00
Martijn van Groningen	02dfd71efa	Backport: Add pipeline name to ingest metadata (#51050 ) Backport: #50467 This commit adds the name of the current pipeline to ingest metadata. This pipeline name is accessible under the following key: '_ingest.pipeline'. Example usage in pipeline: PUT /_ingest/pipeline/2 { "processors": [ { "set": { "field": "pipeline_name", "value": "{{_ingest.pipeline}}" } } ] } Closes #42106	2020-01-16 10:50:47 +01:00
Adrien Grand	45d7bdcfd7	Add analysis components and mapping types to the usage API. (#51062 ) Knowing about used analysis components and mapping types would be incredibly useful in order to know which ones may be deprecated or should get more love. Some field types also act as a proxy to know about feature usage of some APIs like the `percolator` or `completion` fields types for percolation and the completion suggester, respectively.	2020-01-16 09:56:41 +01:00
Zachary Tong	a62c9e4e69	Fix 7.6 release notes file name	2020-01-15 15:16:48 -05:00
Zachary Tong	8f48c8d312	Add 7.6.0 release notes	2020-01-15 14:10:37 -05:00
Christoph Büscher	d291f189a8	Fix hardcoded version replacement in put-dfanalytics.asciidoc #51053 The version replacement for the code snippet should replace 7.6 with the current version, but doesn't match because of a missing whitespace. Closes #51052	2020-01-15 18:09:37 +01:00
taku333	65af0a0f0a	[DOCS] Add 7.5.1 link to release notes overview (#51022 )	2020-01-15 11:53:26 -05:00
Lee Hinman	b14a949fa9	Add blurb about ILM-injected unfollow action (#51009 ) These injected actions are harmless and safe to ignore for non-CCR indices. Resolves #50548	2020-01-15 09:46:57 -07:00
Robin Clarke	5ea18cb4af	[Docs] Fix sub-heading in start-stop-ilm.asciidoc (#51045 ) Removed superfluous `=`.	2020-01-15 16:16:22 +01:00
Przemysław Witek	b4a631277a	Add missing docs for new evaluation metrics (#50967 ) (#51041 )	2020-01-15 15:53:42 +01:00
István Zoltán Szabó	b570f417c2	[DOCS] Describes the relationship of the time-related settings in anomaly detection docs (#50959 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-01-15 08:46:04 +01:00
Tim Vernum	e41c0b1224	Deprecating kibana_user and kibana_dashboard_only_user roles (#50963 ) This change adds a new `kibana_admin` role, and deprecates the old `kibana_user` and`kibana_dashboard_only_user`roles. The deprecation is implemented via a new reserved metadata attribute, which can be consumed from the API and also triggers deprecation logging when used (by a user authenticating to Elasticsearch). Some docs have been updated to avoid references to these deprecated roles. Backport of: #46456 Co-authored-by: Larry Gregory <lgregorydev@gmail.com>	2020-01-15 11:07:19 +11:00
Yannick Welsch	4b0581f182	Remove custom metadata tool (#50813 ) Adds a command-line tool to remove broken custom metadata from the cluster state. Relates to #48701	2020-01-14 23:08:33 +01:00
James Rodewig	a290762df1	[DOCS] Document `breakers`, `script`, and `discovery` node stats (#50509 ) Documents the `breakers`, `script`, and `discovery` parameters returned by the `_nodes/stats` API.	2020-01-14 16:51:50 -05:00
Christoph Büscher	2f13751bad	Deprecate and remove camel-case nGram and edgeNGram tokenizers (#50862 ) (#50991 ) We deprecated and removed the camel-case versions of the nGram and edgeNGram filters a while ago and we should do the same with the nGram and edgeNGram tokenizers. This PR deprecates the use of these names in favour of ngram and edge_ngram in 7. Usage will be disallowed on new indices starting with 8 then.	2020-01-14 21:42:34 +01:00
lcawl	6848dee84b	[DOCS] Fixes typo in keystore command	2020-01-14 11:57:02 -08:00
Tal Levy	9ee2e11181	[7.x] Adds support for geo-bounds filtering in geogrid aggregations (#50996 ) * Adds support for geo-bounds filtering in geogrid aggregations (#50002) It is fairly common to filter the geo point candidates in geohash_grid and geotile_grid aggregations according to some viewable bounding box. This change introduces the option of specifying this filter directly in the tiling aggregation. This is even more relevant to `geo_shape` where the bounds will restrict the shape to be within the bounds this optional `bounds` parameter is parsed in an equivalent fashion to the bounds specified in the geo_bounding_box query.	2020-01-14 11:18:46 -08:00
Dimitris Athanasiou	1d8cb3c741	[7.x][ML] Add num_top_feature_importance_values param to regression and classi… (#50914 ) (#50976 ) Adds a new parameter to regression and classification that enables computation of importance for the top most important features. The computation of the importance is based on SHAP (SHapley Additive exPlanations) method. Backport of #50914	2020-01-14 16:46:09 +02:00
James Rodewig	f028ab08d1	[DOCS] Use `s` parameter in cat API overview example (#50616 ) Updates a snippet to use the `s` query string parameter rather than piping the output to a separate `sort` command. This ensures the snippet is tested and available in clients other than curl (Kibana console, etc.). Issue was originally raised by @hackaholic in #40926.	2020-01-14 08:22:07 -05:00
Yannick Welsch	22ba759e1f	Move metadata storage to Lucene (#50928 ) * Move metadata storage to Lucene (#50907) Today we split the on-disk cluster metadata across many files: one file for the metadata of each index, plus one file for the global metadata and another for the manifest. Most metadata updates only touch a few of these files, but some must write them all. If a node holds a large number of indices then it's possible its disks are not fast enough to process a complete metadata update before timing out. In severe cases affecting master-eligible nodes this can prevent an election from succeeding. This commit uses Lucene as a metadata storage for the cluster state, and is a squashed version of the following PRs that were targeting a feature branch: * Introduce Lucene-based metadata persistence (#48733) This commit introduces `LucenePersistedState` which master-eligible nodes can use to persist the cluster metadata in a Lucene index rather than in many separate files. Relates #48701 * Remove per-index metadata without assigned shards (#49234) Today on master-eligible nodes we maintain per-index metadata files for every index. However, we also keep this metadata in the `LucenePersistedState`, and only use the per-index metadata files for importing dangling indices. However there is no point in importing a dangling index without any shard data, so we do not need to maintain these extra files any more. This commit removes per-index metadata files from nodes which do not hold any shards of those indices. Relates #48701 * Use Lucene exclusively for metadata storage (#50144) This moves metadata persistence to Lucene for all node types. It also reenables BWC and adds an interoperability layer for upgrades from prior versions. This commit disables a number of tests related to dangling indices and command-line tools. Those will be addressed in follow-ups. Relates #48701 * Add command-line tool support for Lucene-based metadata storage (#50179) Adds command-line tool support (unsafe-bootstrap, detach-cluster, repurpose, & shard commands) for the Lucene-based metadata storage. Relates #48701 * Use single directory for metadata (#50639) Earlier PRs for #48701 introduced a separate directory for the cluster state. This is not needed though, and introduces an additional unnecessary cognitive burden to the users. Co-Authored-By: David Turner <david.turner@elastic.co> * Add async dangling indices support (#50642) Adds support for writing out dangling indices in an asynchronous way. Also provides an option to avoid writing out dangling indices at all. Relates #48701 * Fold node metadata into new node storage (#50741) Moves node metadata to uses the new storage mechanism (see #48701) as the authoritative source. * Write CS asynchronously on data-only nodes (#50782) Writes cluster states out asynchronously on data-only nodes. The main reason for writing out the cluster state at all is so that the data-only nodes can snap into a cluster, that they can do a bit of bootstrap validation and so that the shard recovery tools work. Cluster states that are written asynchronously have their voting configuration adapted to a non existing configuration so that these nodes cannot mistakenly become master even if their node role is changed back and forth. Relates #48701 * Remove persistent cluster settings tool (#50694) Adds the elasticsearch-node remove-settings tool to remove persistent settings from the on disk cluster state in case where it contains incompatible settings that prevent the cluster from forming. Relates #48701 * Make cluster state writer resilient to disk issues (#50805) Adds handling to make the cluster state writer resilient to disk issues. Relates to #48701 * Omit writing global metadata if no change (#50901) Uses the same optimization for the new cluster state storage layer as the old one, writing global metadata only when changed. Avoids writing out the global metadata if none of the persistent fields changed. Speeds up server:integTest by ~10%. Relates #48701 * DanglingIndicesIT should ensure node removed first (#50896) These tests occasionally failed because the deletion was submitted before the restarting node was removed from the cluster, causing the deletion not to be fully acked. This commit fixes this by checking the restarting node has been removed from the cluster. Co-authored-by: David Turner <david.turner@elastic.co> * fix tests Co-authored-by: David Turner <david.turner@elastic.co>	2020-01-14 09:35:43 +01:00
Nhat Nguyen	f0924e6d5b	Remove outdated requirement of CCR (#50859 ) With retention leases, users do not need to set index.soft_deletes.retention.operations. This change removes it from the requirements of CCR	2020-01-13 20:00:23 -05:00
Nhat Nguyen	fb32a55dd5	Deprecate synced flush (#50835 ) A normal flush has the same effect as a synced flush on Elasticsearch 7.6 or later. It's deprecated in 7.6 and will be removed in 8.0. Relates #50776	2020-01-13 19:54:38 -05:00
Przemko Robakowski	a18736b46d	[7.x] ILM action to wait for SLM policy execution (#50454 ) (#50943 ) * ILM action to wait for SLM policy execution (#50454) This change add new ILM action to wait for SLM policy execution to ensure that index has snapshot before deletion. Closes #45067 * Fix flaky TimeSeriesLifecycleActionsIT#testWaitForSnapshot test This change adds some randomness and cleanup step to TimeSeriesLifecycleActionsIT#testWaitForSnapshot and testWaitForSnapshotSlmExecutedBefore tests in attempt to make them stable. Reletes to #50781 * Formatting changes * Longer timeout * Fix Map.of in Java8 * Unused import removed	2020-01-14 01:34:33 +01:00
Peter Dyson	4cb525d8d3	[DOCS] Array of index patterns is also valid source indices with transform (#50777 )	2020-01-13 15:46:45 -08:00
Lee Hinman	91689e793d	[7.x] Refresh cached phase policy definition if possible on ne… (#50941 ) * Refresh cached phase policy definition if possible on new policy There are some cases when updating a policy does not change the structure in a significant way. In these cases, we can reread the policy definition for any indices using the updated policy. This commit adds this refreshing to the `TransportPutLifecycleAction` to allow this. It allows us to do things like change the configuration values for a particular step, even when on that step (for example, changing the rollover criteria while on the `check-rollover-ready` step). There are more cases where the phase definition can be reread that just the ones checked here (for example, removing an action that has already been passed), and those will be added in subsequent work. Relates to #48431	2020-01-13 14:31:41 -07:00
Lisa Cawley	a82ddfb182	[DOCS] Adds elasticsearch-keystore command reference (#50872 )	2020-01-13 13:08:21 -08:00
Nhat Nguyen	05f97d5e1b	Revert "Deprecate synced flush (#50835 )" This reverts commit `1a32d7142a`.	2020-01-13 11:41:03 -05:00
Nhat Nguyen	1a32d7142a	Deprecate synced flush (#50835 ) A normal flush has the same effect as a synced flush on Elasticsearch 7.6 or later. It's deprecated in 7.6 and will be removed in 8.0. Relates #50776	2020-01-13 10:58:29 -05:00
Ioannis Kakavas	ba37e3c4a0	Disable DiagnosticTrustManager in FIPS 140 (#49888 ) This commit changes the default behavior for xpack.security.ssl.diagnose.trust when running in a FIPS 140 JVM. More specifically, when xpack.security.fips_mode.enabled is true: - If xpack.security.ssl.diagnose.trust is not explicitly set, the default value of it becomes false and a log message is printed on info level, notifying of the fact that the TLS/SSL diagnostic messages are not enabled when in a FIPS 140 JVM. - If xpack.security.ssl.diagnose.trust is explicitly set, the value of it is honored, even in FIPS mode. This is relevant only for 7.x where we support Java 8 in which SunJSSE can still be used as a FIPS 140 provider for TLS. SunJSSE in FIPS mode, disallows the use of other TrustManager implementations than the one shipped with SunJSSE.	2020-01-13 17:04:23 +02:00
junmuz	6718ce0f62	[DOCS] Correct typo in `ignore_malformed` mapping parm docs (#50780 )	2020-01-13 09:49:53 -05:00
James Rodewig	4629a9714c	[DOCS] Fix time_zone example in range query docs (#50830 ) One of the example snippets in the range query docs was missing a required 'T' in the `date` format. This adds the required 'T'.	2020-01-10 08:24:48 -05:00
debadair	83d961391b	[DOCS] Move snapshot-restore out of modules. (#49618 ) (#50829 ) * [DOCS] Move snapshot-restore docs out of modules. * [DOCS] Incorporates comments from @jrodewig. * [DOCS] Fix snippet tests	2020-01-09 16:55:46 -08:00
Matt Braymer-Hayes	344c21813b	[DOCS] Fix typo in refresh API docs (#50759 )	2020-01-09 14:41:19 -05:00
Lisa Cawley	ef1c14ad01	[DOCS] Update license expiry links (#50812 )	2020-01-09 11:28:43 -08:00
Nik Everett	1d8e51f89d	Support offset in composite aggs (#50609 ) (#50808 ) Adds support for the `offset` parameter to the `date_histogram` source of composite aggs. The `offset` parameter is supported by the normal `date_histogram` aggregation and is useful for folks that need to measure things from, say, 6am one day to 6am the next day. This is implemented by creating a new `Rounding` that knows how to handle offsets and delegates to other rounding implementations. That implementation doesn't fully implement the `Rounding` contract, namely `nextRoundingValue`. That method isn't used by composite aggs so I can't be sure that any implementation that I add will be correct. I propose to leave it throwing `UnsupportedOperationException` until I need it. Closes #48757	2020-01-09 14:11:24 -05:00
lcawl	8a5de4f56f	[DOCS] Clarify detector_index property in ML APIs (#50723 )	2020-01-09 08:34:34 -08:00
Benjamin Trent	3e014d39c2	[Transform] fail to start/put on missing pipeline (#50701 ) (#50795 ) If a pipeline referenced by a transform does not exist, we should not allow the transform to be created. We do allow the pipeline existence check to be skipped with defer_validations, but if the pipeline still does not exist on `_start`, the pipeline will fail to start. relates: #50135	2020-01-09 10:33:22 -05:00
István Zoltán Szabó	4f150e4961	[7.x][DOCS] Moves analysis resources to PUT DFA API docs (#50793 )	2020-01-09 16:21:35 +01:00
István Zoltán Szabó	71afeec7d0	Revert "[DOCS] Moves analysis resources to PUT DFA API docs (#50704 )" This reverts commit `4e1107d5d7`.	2020-01-09 14:31:35 +01:00
István Zoltán Szabó	4e1107d5d7	[DOCS] Moves analysis resources to PUT DFA API docs (#50704 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-01-09 14:13:37 +01:00
István Zoltán Szabó	acd73dda1c	[DOCS] Improves find_file_structure documentation (#50743 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-01-09 11:20:29 +01:00
István Zoltán Szabó	0ac6786f41	[DOCS] Forms role and privilege requirements as bulleted lists in DFA API docs (#50732 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-09 10:45:18 +01:00
István Zoltán Szabó	8a1bb440e2	[DOCS] Clarifies model_size_stats.total_xxx_field_count objects and removes notes in GET job stats API docs. (#50728 )	2020-01-09 09:45:37 +01:00
István Zoltán Szabó	d7bb5d7531	[DOCS] Improves description for forecast_stats (#50729 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-09 09:35:47 +01:00
James Rodewig	78c9eee5ea	[DOCS] Add section ID to analysis overview page	2020-01-08 14:43:41 -06:00
James Rodewig	9d1567b13b	[DOCS] Add overview page to analysis topic (#50515 ) Adds a 'text analysis overview' page to the analysis topic docs. The goals of this page are: * Concisely summarize the analysis process while avoiding in-depth concepts, tutorials, or API examples * Explain why analysis is important, largely through highlighting problems with full-text searches missing analysis * Highlight how analysis can be used to improve search results	2020-01-08 12:54:00 -06:00
István Zoltán Szabó	0444da944e	[DOCS] Adds DFA resources as deleted page to redirects. (#50756 )	2020-01-08 19:01:16 +01:00
James Rodewig	f87e61ec30	[DOCS] Add default index-time analyzer example (#50501 ) The Analysis docs mention including a default analyzer in the index settings. However, no example snippet is included. This adds an example snippet that users can easily copy and adjust.	2020-01-08 11:07:49 -06:00
blueSky1825821	5ff6eafb4b	[Docs] Update similarity.asciidoc (#50719 ) DFRSimilarity -> DFR similarity	2020-01-08 17:48:26 +01:00
Adrien Grand	31158ab3d5	Add per-field metadata. (#50333 ) This PR adds per-field metadata that can be set in the mappings and is later returned by the field capabilities API. This metadata is completely opaque to Elasticsearch but may be used by tools that index data in Elasticsearch to communicate metadata about fields with tools that then search this data. A typical example that has been requested in the past is the ability to attach a unit to a numeric field. In order to not bloat the cluster state, Elasticsearch requires that this metadata be small: - keys can't be longer than 20 chars, - values can only be numbers or strings of no more than 50 chars - no inner arrays or objects, - the metadata can't have more than 5 keys in total. Given that metadata is opaque to Elasticsearch, field capabilities don't try to do anything smart when merging metadata about multiple indices, the union of all field metadatas is returned. Here is how the meta might look like in mappings: ```json { "properties": { "latency": { "type": "long", "meta": { "unit": "ms" } } } } ``` And then in the field capabilities response: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms" ] } } } } ``` When there are no conflicts, values are arrays of size 1, but when there are conflicts, Elasticsearch includes all unique values in this array, without giving ways to know which index has which metadata value: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms", "ns" ] } } } } ``` Closes #33267	2020-01-08 16:21:18 +01:00
James Rodewig	d3094f9d23	[DOCS] Fix typo in mapping date format docs	2020-01-08 07:55:51 -06:00
Christoph Büscher	d8c907d648	Remove _reload_search_analyzer experimental status (#50696 ) Removing the experimental status in the docs and the rest specs.	2020-01-08 10:35:19 +01:00
James Rodewig	de6b62f789	[DOCS] Fuzzy wildcard not supported in `query_string` (#50466 ) The `query_string` does not support mixing wildcards with fuzziness. This adds a related warning to the `query_string` docs.	2020-01-07 12:54:50 -06:00
James Rodewig	20eba1e410	[DOCS] Reformat reverse token filter docs (#50672 ) * Updates the description and adds a Lucene link * Adds analyze and custom analyzer snippets	2020-01-07 11:01:55 -06:00
James Rodewig	8009b07ccb	[DOCS] Reformat truncate token filter docs (#50687 ) * Updates the description and adds a Lucene link * Adds analyze, custom analyzer, and custom filter snippets * Adds parameter documentation	2020-01-07 10:33:57 -06:00
arkel-s	d5f4790f90	[DOCS] Add example format for `date_optional_time` (#50458 ) Adds an example format for `date_optional_time` to the `format` mapping parameter docs. Closes #50457	2020-01-07 10:13:34 -06:00
James Rodewig	0753915eed	[DOCS] Update SQL REST API pages for new structure (#50690 ) #43007 restructured the SQL REST API docs so they display across several pages. This updates up a reference that assumes a single page in the "Paginating through a large response" section. It also reformats a tip for the Kibana console. Closes #50688	2020-01-07 09:27:34 -06:00
James Rodewig	074866256b	[DOCS] Remove unneeded redirects (#50510 ) The docs/reference/redirects.asciidoc file stores a list of relocated or deleted pages for the Elasticsearch Reference documentation. This prunes several older redirects that are no longer needed.	2020-01-06 09:11:48 -06:00
James Rodewig	1299dda437	[DOCS] Warn about using `geo_centroid` as sub-agg to `geohash_grid` (#50038 ) If `geo_point fields` are multi-valued, using `geo_centroid` as a sub-agg to `geohash_grid` could result in centroids outside of bucket boundaries. This adds a related warning to the geo_centroid agg docs.	2020-01-06 07:47:54 -06:00
Nhat Nguyen	b71490b06b	Deprecate indices without soft-deletes (#50502 ) (#50634 ) Soft-deletes will be enabled for all indices in 8.0. Hence, we should deprecate new indices without soft-deletes in 7.x. Backport of #50502	2020-01-06 08:44:30 -05:00
Lisa Cawley	62969c35cd	[DOCS] Adds missing timing_stats descriptions (#50574 )	2020-01-03 09:14:09 -08:00
Orhan Toy	44827e577e	[DOCS] Fix missing quote in script-score-query.asciidoc (#50590 )	2020-01-03 16:15:45 +01:00
István Zoltán Szabó	0bcbddecf8	[DOCS] Fine-tunes training_percent definition. (#50601 )	2020-01-03 14:51:03 +01:00
James Rodewig	e6a469cc74	[DOCS] Reformat uppercase token filter docs (#50555 ) * Updates the description and adds a Lucene link * Adds analyze and custom analyzer snippets	2020-01-03 08:39:08 -05:00
Dimitris Athanasiou	ca0828ba07	[7.x][ML] Implement force deleting a data frame analytics job (#50553 ) (#50589 ) Adds a `force` parameter to the delete data frame analytics request. When `force` is `true`, the action force-stops the jobs and then proceeds to the deletion. This can be used in order to delete a non-stopped job with a single request. Closes #48124 Backport of #50553	2020-01-03 13:46:02 +02:00
Alan Woodward	8b362c657b	Add fuzzy intervals source (#49762 ) This intervals source will return terms that are similar to an input term, up to an edit distance defined by fuzziness, similar to FuzzyQuery. Closes #49595	2020-01-03 09:59:19 +00:00
István Zoltán Szabó	a34b3f133c	[DOCS] Specifies the possible data types of classification dependent_variable (#50582 )	2020-01-03 10:42:56 +01:00
Lee Hinman	0d78aa2708	Don't dump a stacktrace for invalid patterns when executing elasticsearch-croneval (#49744 ) (#50578 ) Co-authored-by: bellengao <gbl_long@163.com>	2020-01-02 16:57:51 -07:00
Lisa Cawley	81a9cff16f	[7.x][DOCS] Remove redundant results from ML APIs (#50565 )	2020-01-02 11:23:26 -08:00
David Turner	b209e7b6ff	Remove included docs (#50557 ) In #50499 we accidentally duplicated the docs for the `?local` parameter to the `GET _cat/nodes` API. This commit removes the duplicate docs.	2020-01-02 17:19:42 +00:00
Nik Everett	55107ce8ae	Docs: Refine note about `after_key` (#50475 ) * Docs: Refine note about `after_key` I was curious about composite aggregations, specifically I wanted to know how to write a composite aggregation that had all of its buckets filtered out so you had to use the `after_key`. Then I saw that we've declared composite aggregations not to work with pipelines in #44180. So I'm not sure you can do that any more. Which makes the note about `after_key` inaccurate. This rejiggers that section of the docs a little so it is more obvious that you send the `after_key` back to us. And so it is more obvious that you should only use the `after_key` that we give you rather than try to work it out for yourself. * Apply suggestions from code review Co-Authored-By: James Rodewig <james.rodewig@elastic.co> Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-01-02 10:03:23 -05:00
Oleg	7539fbb30f	Deprecate the 'local' parameter of /_cat/nodes (#50499 ) The cat nodes API performs a `ClusterStateAction` then a `NodesInfoAction`. Today it accepts the `?local` parameter and passes this to the `ClusterStateAction` but this parameter has no effect on the `NodesInfoAction`. This is surprising, because `GET _cat/nodes?local` looks like it might be a completely local call but in fact it still depends on every node in the cluster. This commit deprecates the `?local` parameter on this API so that it can be removed in 8.0. Relates #50088	2020-01-02 14:53:56 +00:00
Lisa Cawley	ab5a69d1e2	[7.x][DOCS] Move machine learning results definitions into APIs (#50543 )	2019-12-31 13:21:17 -08:00
Lisa Cawley	f8eef43fc6	[7.x][DOCS] Move model snapshot resource definitions into APIs (#50540 )	2019-12-31 10:53:05 -08:00
Lisa Cawley	3fb4f1b5bf	[DOCS] Moves job count resource definitions into API (#50529 )	2019-12-30 14:55:36 -08:00
Gilad Gal	9fdfb075bb	Deleted 'a' before plural 'messages' Deleted 'a' before plural 'messages'	2019-12-30 21:25:15 +02:00
Lisa Cawley	4b829db593	[7.x][DOCS] Move datafeed resource definitions into APIs (#50516 )	2019-12-30 09:35:16 -08:00
Lisa Cawley	72840c0cb2	[7.x][DOCS] Move anomaly detection job resource definitions into APIs (#50490 )	2019-12-27 13:30:26 -08:00
James Rodewig	7a14607a25	[DOCS] Abbreviate token filter titles (#50511 )	2019-12-27 11:01:52 -05:00
James Rodewig	3f7f31b6b0	[DOCS] Fix search request body links (#50500 ) PR #44238 changed several links related to the Elasticsearch search request body API. This updates several places still using outdated links or anchors. This will ultimately let us remove some redirects related to those link changes.	2019-12-26 14:31:09 -05:00
James Rodewig	ef467cc6f5	[DOCS] Remove unneeded redirects (#50476 ) The docs/reference/redirects.asciidoc file stores a list of relocated or deleted pages for the Elasticsearch Reference documentation. This prunes several older redirects that are no longer needed and don't require work to fix broken links in other repositories.	2019-12-26 08:29:28 -05:00
James Rodewig	1cec87c0e6	[DOCS] Document `transport` and `http` node stats (#50473 ) Documents the `transport` and `http` parameters returned by the `_nodes/stats` API.	2019-12-26 07:43:14 -05:00
Lisa Cawley	d479e0563a	[7.x][DOCS] Augments ML shared definitions (#50487 )	2019-12-24 10:22:05 -08:00
Orhan Toy	6a3d1a077e	[DOCS] Fixes "enables you to" typos (#50225 )	2019-12-23 14:39:14 -05:00
James Rodewig	694b119f0a	[DOCS] Percentile aggs are non-deterministic (#50468 ) Percentile aggregations are non-deterministic. A percentile aggregation can produce different results even when using the same data. Based on [this discuss post][0], the non-deterministic property stems from processes in Lucene that can affect the order in which docs are provided to the aggregation. This adds a warning stating that the aggregation is non-deterministic and what that means. [0]: https://discuss.elastic.co/t/different-results-for-same-query/111757	2019-12-23 13:13:34 -05:00
Nik Everett	01293ebad5	Fix docs typos (#50365 ) (#50464 ) Fixes a few typos in the docs. Co-authored-by: Xiang Dai <764524258@qq.com>	2019-12-23 12:38:17 -05:00
Igor Motov	339d10c16f	Geo: Switch generated GeoJson type names to camel case (#50400 ) Switches generated GeoJson type names to camel case to conform to the standard. Closes #49568	2019-12-20 15:37:22 -05:00
James Rodewig	27ae9a1435	[DOCS] Remove outdated file scripts refererence (#50437 ) File scripts were removed in 6.0 with #24627. This removes an outdated file scripts reference from the conditional clauses section of the search templates docs.	2019-12-20 14:53:40 -05:00
Lee Hinman	c3c9ccf61f	[7.x] Add ILM histore store index (#50287 ) (#50345 ) * Add ILM histore store index (#50287) * Add ILM histore store index This commit adds an ILM history store that tracks the lifecycle execution state as an index progresses through its ILM policy. ILM history documents store output similar to what the ILM explain API returns. An example document with ALL fields (not all documents will have all fields) would look like: ```json { "@timestamp": 1203012389, "policy": "my-ilm-policy", "index": "index-2019.1.1-000023", "index_age":123120, "success": true, "state": { "phase": "warm", "action": "allocate", "step": "ERROR", "failed_step": "update-settings", "is_auto-retryable_error": true, "creation_date": 12389012039, "phase_time": 12908389120, "action_time": 1283901209, "step_time": 123904107140, "phase_definition": "{\"policy\":\"ilm-history-ilm-policy\",\"phase_definition\":{\"min_age\":\"0ms\",\"actions\":{\"rollover\":{\"max_size\":\"50gb\",\"max_age\":\"30d\"}}},\"version\":1,\"modified_date_in_millis\":1576517253463}", "step_info": "{... etc step info here as json ...}" }, "error_details": "java.lang.RuntimeException: etc\n\tcaused by:etc etc etc full stacktrace" } ``` These documents go into the `ilm-history-1-00000N` index to provide an audit trail of the operations ILM has performed. This history storage is enabled by default but can be disabled by setting `index.lifecycle.history_index_enabled` to `false.` Resolves #49180 * Make ILMHistoryStore.putAsync truly async (#50403) This moves the `putAsync` method in `ILMHistoryStore` never to block. Previously due to the way that the `BulkProcessor` works, it was possible for `BulkProcessor#add` to block executing a bulk request. This was bad as we may be adding things to the history store in cluster state update threads. This also moves the index creation to be done prior to the bulk request execution, rather than being checked every time an operation was added to the queue. This lessens the chance of the index being created, then deleted (by some external force), and then recreated via a bulk indexing request. Resolves #50353	2019-12-20 12:33:36 -07:00
Jack Conradson	b81e072504	Document use of context in put stored script (#50446 ) This documents how to test compile a stored script against a specific context when using PUT/POST.	2019-12-20 10:53:43 -08:00
Lisa Cawley	2106a7b02a	[7.x][DOCS] Updates ML links (#50387 ) (#50409 )	2019-12-20 10:01:19 -08:00
Florian Kelbert	af8bed13d3	[DOCS] Fix typo in bucket sum aggregation docs (#50431 )	2019-12-20 08:48:25 -05:00
Jim Ferenczi	2acafd4b15	Optimize composite aggregation based on index sorting (#48399 ) (#50272 ) Co-authored-by: Daniel Huang <danielhuang@tencent.com> This is a spinoff of #48130 that generalizes the proposal to allow early termination with the composite aggregation when leading sources match a prefix or the entire index sort specification. In such case the composite aggregation can use the index sort natural order to early terminate the collection when it reaches a composite key that is greater than the bottom of the queue. The optimization is also applicable when a query other than match_all is provided. However the optimization is deactivated for sources that match the index sort in the following cases: * Multi-valued source, in such case early termination is not possible. * missing_bucket is set to true	2019-12-20 12:32:37 +01:00
Tim Brooks	cb73fb0f9b	Backport remote proxy mode stats and naming (#50402 ) * Update remote cluster stats to support simple mode (#49961) Remote cluster stats API currently only returns useful information if the strategy in use is the SNIFF mode. This PR modifies the API to provide relevant information if the user is in the SIMPLE mode. This information is the configured addresses, max socket connections, and open socket connections. * Send hostname in SNI header in simple remote mode (#50247) Currently an intermediate proxy must route conncctions to the appropriate remote cluster when using simple mode. This commit offers a additional mechanism for the proxy to route the connections by including the hostname in the TLS SNI header. * Rename the remote connection mode simple to proxy (#50291) This commit renames the simple connection mode to the proxy connection mode for remote cluster connections. In order to do this, the mode specific settings which we namespaced by their mode (ex: sniff.seed and proxy.addresses) have been reverted. * Modify proxy mode to support a single address (#50391) Currently, the remote proxy connection mode uses a list setting for the proxy address. This commit modifies this so that the setting is proxy_address and only supports a single remote proxy address.	2019-12-19 18:02:48 -07:00
Stuart Tettemer	2e76865290	[DOCS] Deterministic scripted queries are cached (#50408 ) (#50411 ) Backport Refs: #49321	2019-12-19 16:30:34 -07:00
István Zoltán Szabó	501ab83471	[DOCS] Adds inference processor documentation (#50204 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-12-19 12:21:04 +01:00
Igor Motov	c77ca98928	Geo: Switch generated WKT to upper case (#50285 ) Switches generated WKT to upper case to conform to the standard recommendation. Relates #49568	2019-12-18 17:29:08 -05:00
James Rodewig	73ca2e5826	[DOCS] Document `thread_pool` node stats (#50330 )	2019-12-18 17:02:00 -05:00
lcawl	246d926412	[DOCS] Fixes security links	2019-12-18 11:52:00 -08:00
James Rodewig	498532a12d	[DOCS] Clarify frozen indices are read-only (#50318 ) The freeze index API docs state that frozen indices are blocked for write operations. While this implies frozen indices are read-only, it does not explicitly use the term "read-only", which is found in other docs, such as the force merge docs. This adds the "ready-only" term to the freeze index API docs as well as other clarification.	2019-12-18 12:18:38 -05:00
Christoph Büscher	bf63f24209	[Docs] Remove `intervals` filter rule from allowed top-level rules (#50320 ) The `filter` rule is not allowed on the top-level of the query, so removing it from the list of allowed rules. Where it can be nested inside other rules, those rules already mention it.	2019-12-18 17:37:27 +01:00
Adrien Grand	ec11c04bd5	[DOCS] Add 7.5.1 release notes (#50196 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-12-18 11:09:53 -05:00
Kevin Woblick	cbffd127d5	[DOCS] Add warning about Docker port exposure (#50169 ) Docker bypasses the Uncomplicated Firewall (UFW) on Linux by editing the `iptables` config directly, which leads to the exposure of port 9200, even if you blocked it via UFW. This adds a warning along with work-arounds to the docs. Signed-off-by: Kovah <mail@kovah.de>	2019-12-18 09:16:36 -05:00
István Zoltán Szabó	5759a263cb	[DOCS] Adds GET, GET stats and DELETE inference APIs (#50224 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2019-12-18 09:18:56 +01:00
Lisa Cawley	30d66828ae	[DOCS] Move transform resource definitions into APIs (#50108 )	2019-12-17 12:31:31 -08:00
James Rodewig	726c35dfd0	[DOCS] Add identifier mapping tip to numeric and keyword datatype docs (#49933 ) Users often mistakenly map numeric IDs to numeric datatypes. However, this is often slow for the `term` and other term-level queries. The "Tune for search speed" docs includes advice for mapping numeric IDs to `keyword` fields. However, this tip is not included in the `numeric` or `keyword` field datatype doc pages. This rewords the tip in the "Tune for search speed" docs, relocates it to the `numeric` field docs, and reuses it using tagged regions.	2019-12-17 09:34:32 -05:00
James Rodewig	99fdea50dd	[DOCS] Correct cat snapshots API request example (#50274 ) For 7.x and earlier branches, `_cat/repositories` API requests require a repository name. This removes an erroneous request example without a repository name added with `a8e0275`.	2019-12-17 09:23:38 -05:00
Ignacio Vera	3717c733ff	"CONTAINS" support for BKD-backed geo_shape and shape fields (#50141 ) (#50213 ) Lucene 8.4 added support for "CONTAINS", therefore in this commit those changes are integrated in Elasticsearch. This commit contains as well a bug fix when querying with a geometry collection with "DISJOINT" relation.	2019-12-16 09:17:51 +01:00
Henning Andersen	a0044df78a	Reindex source types disregarded in 7.x (#49580 ) Clarify that types in source index are disregarded.	2019-12-15 17:30:28 +01:00
James Rodewig	bbb872a022	Revert "[DOCS] Add `index-extra-title-page.html` for direct HTML migration (#50189 )" This reverts commit `79c40728f7`.	2019-12-13 12:58:11 -05:00
James Rodewig	79c40728f7	[DOCS] Add `index-extra-title-page.html` for direct HTML migration (#50189 )	2019-12-13 12:47:13 -05:00
James Rodewig	cd04021961	[DOCS] Reformat token count limit filter docs (#49835 )	2019-12-13 08:44:39 -05:00
István Zoltán Szabó	8f36bfa37f	[7.x][DOCS] Changes hyperparam optimization section ID (#50173 )	2019-12-13 12:22:50 +01:00
István Zoltán Szabó	7611b3c9be	[7.x][DOCS] Moves data frame analytics job resource definitions into APIs (#50165 ) * [7.x][DOCS] Moves data frame analytics job resource definitions into APIs.	2019-12-13 11:48:21 +01:00
James Rodewig	249334f38d	[DOCS] Document JVM node stats (#49500 ) * [DOCS] Document JVM node stats Documents the `jvm` parameters returned by the `_nodes/stats` API. Co-Authored-By: James Baiera <james.baiera@gmail.com>	2019-12-12 15:42:18 -05:00
Lisa Cawley	2d19d0c166	[DOCS] Updates transform screenshots and text (#50059 )	2019-12-12 09:30:56 -08:00
James Rodewig	1186a5dc09	[DOCS] Reformat lowercase token filter docs (#49935 )	2019-12-12 09:50:12 -05:00
James Rodewig	364eb2d34c	[DOCS] Correct percentile rank agg example response (#50052 ) The example snippets in the percentile rank agg docs use a test dataset named `latency`, which is generated from docs/gradle.build. At some point the dataset and example snippets were updated, but the text surrounding the snippets was not. This means the text and the example snippets shown no longer match up. This corrects that by changing the snippets using /TESTRESPONSE magic comments.	2019-12-12 09:06:41 -05:00
Przemko Robakowski	4619834b97	[7.x] CSV ingest processor (#49509 ) (#50083 ) * CSV ingest processor (#49509) This change adds new ingest processor that breaks line from CSV file into separate fields. By default it conforms to RFC 4180 but can be tweaked. Closes #49113	2019-12-11 23:06:05 +01:00
Wilder Pereira	8ff809af2d	[DOCS] Replace interval notation with plain English in match query docs (#47334 ) As we discussed in #36371, interval notation is confusing to some users. This makes the intention clearer by just explaining inclusivity and exclusivity in the docs.	2019-12-11 09:58:28 -05:00
Patryk Krawaczyński	df558aa0ca	[DOCS] Document `index.queries.cache.enabled` as a static setting (#49886 )	2019-12-10 14:24:03 -05:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
Peter Johnson	1a6e5bf220	[Docs] Fix typo in function-score-query.asciidoc (#50030 )	2019-12-10 17:33:03 +01:00
Lisa Cawley	15f27d8c54	[DOCS] Removes realm type security setting (#50001 )	2019-12-10 08:09:07 -08:00
James Rodewig	3f5678ca79	[DOCS] Remove shadow replica reference (#50029 ) Removes a reference to shadow replicas from the cat shards API docs and a comment in cluster/routing/UnassignedInfo.java. Shadow replicas were removed with #23906.	2019-12-10 09:30:51 -05:00
Dimitris Athanasiou	8891f4db88	[7.x][ML] Introduce randomize_seed setting for regression and classification (#49990 ) (#50023 ) This adds a new `randomize_seed` for regression and classification. When not explicitly set, the seed is randomly generated. One can reuse the seed in a similar job in order to ensure the same docs are picked for training. Backport of #49990	2019-12-10 15:29:19 +02:00
James Rodewig	33594380c7	[DOCS] Skip synced flush docs tests (#49986 ) The current snippets in the synced flush docs can cause conflicts with other background syncs, such as the global checkpoint sync or retention lease sync, in the docs tests. This skips tests for those snippets to avoid conflicts.	2019-12-09 13:17:38 -05:00
Artur Carvalho	d073bccaad	[Docs] Fix typo in getting-started.asciidoc (#49985 )	2019-12-09 16:24:30 +01:00
James Rodewig	1918a21baf	[DOCS] Correct inline shape snippets in shape query docs (#49921 ) In the shape query docs, the index mapping snippet uses the "geometry" shape field mapping. However, the doc index snippet uses the "location" property. This changes the "location" property to "geometry". It also adds a comment containing the search result snippet. This should prevent similar issues in the future.	2019-12-09 08:47:59 -05:00
Ryan Ernst	e66cfc4369	Fix incorrect use of multiline NOTE in rpm docs (#49962 ) This was a copy/paste error from #49893. This commit converts the NOTE to use inline style instead of one needing closing linebreak.	2019-12-06 17:43:51 -08:00
Ryan Ernst	d29f04209b	Disable repo configuration for rpm based systems (#49893 ) This commit changes the recommended repository file for rpm based systems to be disabled by default. This is a safer practice so upgrades of the system do no accidentally upgrade elasticsearch itself. closes #30660	2019-12-06 15:56:18 -08:00
Przemko Robakowski	d7083a84f4	Allow list of IPs in geoip ingest processor (#49573 ) (#49947 ) * Allow list of IPs in geoip ingest processor This change lets you use array of IPs in addition to string in geoip processor source field. It will set array containing geoip data for each element in source, unless first_only parameter option is enabled, then only first found will be returned. Closes #46193	2019-12-07 00:19:09 +01:00
István Zoltán Szabó	63d3933787	[DOCS] Fixes classification evaluation example response. (#49905 )	2019-12-06 13:25:40 +01:00
István Zoltán Szabó	bb91291273	[DOCS] Fixes attribute in transforms overview. (#49898 )	2019-12-06 10:24:29 +01:00
Hendrik Muhs	b17cfc93e3	[Transform][DOCS]rewrite client ip example to use continuous transform (#49822 ) adapt the transform example for suspicious client ips to use continuous transform	2019-12-06 08:20:48 +01:00
Orhan Toy	1641fcd488	[DOCS] Minor typo fixes in reindex.asciidoc (#49863 )	2019-12-05 20:25:11 +01:00
István Zoltán Szabó	f4b3bb7d6b	[DOCS] Adds an example of preprocessing actions to the PUT DFA API docs (#49831 )	2019-12-05 14:16:38 +01:00
István Zoltán Szabó	04e99ff1ee	[DOCS] Fixes typo in the ML anomaly detection time functions docs. (#49834 )	2019-12-05 09:58:30 +01:00
James Rodewig	42f902977d	[DOCS] Document `minimum_should_match` defaults for `bool` query (#48865 ) Adds documentation for the `minimum_should_match` parameter to the `bool` query docs. Includes docs for the default values: - `1` if the `bool` query includes at least one `should` clause and no `must` or `filter` clauses - `0` otherwise	2019-12-04 12:45:38 -05:00
James Rodewig	87a73b6bdf	[DOCS] Reformat length token filter docs (#49805 ) * Adds a title abbreviation * Updates the description and adds a Lucene link * Reformats the parameters section * Adds analyze, custom analyzer, and custom filter snippets Relates to #44726.	2019-12-04 09:59:08 -05:00
Alexander Reelsen	6e751f5536	Docs: Fix & test more grok processor documentation (#49447 ) The documentation contained a small error, as bytes and duration was not properly converted to a number and thus remained a string. The documentation is now also properly tested by providing a full blown simulate pipeline example.	2019-12-03 11:55:49 +01:00
Colin Goodheart-Smithe	0592b3c726	Removes PR that was not in 7.5.0 release	2019-12-03 10:20:05 +00:00
cachedout	c4cc90be1c	Recommend Metricbeat for 7.x (#49758 ) * Recommend Metricbeat for 7.x * Fix typo in link to configuring-metricbeat * [DOCS] Fixes build error and some terminology * Add to local exporter page per review feedback	2019-12-02 21:31:47 +00:00
James Rodewig	f1fd41cb53	[DOCS] Document CCR compatibility requirements (#49776 ) * Creates a prerequisites section in the cross-cluster replication (CCR) overview. * Adds concise definitions for local and remote cluster in a CCR context. * Documents that the ES version of the local cluster must be the same or a newer compatible version as the remote cluster.	2019-12-02 15:53:00 -05:00
Paul Sanwald	ebc13ca498	re-categorize things that appeared in multiple area labels (#49777 )	2019-12-02 15:03:38 -05:00
Hendrik Muhs	a5dc6e062e	Document issue 49730 in release notes for 7.5.0 (#49733 ) document low severity issue about transform audit index potentially disappearing during rolling upgrade See #49730 for details	2019-12-02 20:54:36 +01:00
lcawl	96f14fcfbd	[DOCS] Removes coming tags	2019-12-02 08:17:10 -08:00
Jim Ferenczi	e6dc5bf9c2	Add release highlights for 7.5.0 (#49320 )	2019-12-02 07:45:12 -08:00
Andrei Stefan	e2982b2110	SQL: handle NULL arithmetic operations with INTERVALs (#49633 ) (cherry picked from commit ce727615c08cf5ae422feb77f69ea24fb53cd9d1)	2019-12-02 17:31:05 +02:00
James Rodewig	ade72b97b7	[DOCS] Reformat keep types and keep words token filter docs (#49604 ) * Adds title abbreviations * Updates the descriptions and adds Lucene links * Reformats parameter definitions * Adds analyze and custom analyzer snippets * Adds explanations of token types to keep types token filter and tokenizer docs	2019-12-02 09:40:50 -05:00
David Turner	86a40f6d8b	Drop snapshot instructions for autobootstrap fix (#49755 ) The "Restore any snapshots as required" step is a trap: it's somewhere between tricky and impossible to restore multiple clusters into a single one. Also add a note about configuring discovery during a rolling upgrade to proscribe any rare cases where you might accidentally autobootstrap during the upgrade.	2019-12-02 14:33:42 +00:00
James Rodewig	3d44c1163a	[DOCS] Explicitly document enrich `target_field` includes `match_field` (#49407 ) When the enrich processor appends enrich data to an incoming document, it adds a `target_field` to contain the enrich data. This `target_field` contains both the `match_field` AND `enrich_fields` specified in the enrich policy. Previously, this was reflected in the documented example but not explicitly stated. This adds several explicit statements to the docs.	2019-12-02 09:13:24 -05:00
Henning Andersen	5adb33ec17	Deprecate sorting in reindex (#49458 ) (#49738 ) Reindex sort never gave a guarantee about the order of documents being indexed into the destination, though it could give a sense of locality of source data. It prevents us from doing resilient reindex and other optimizations and it has therefore been deprecated. Related to #47567	2019-12-01 19:24:27 +01:00
Henning Andersen	1d745f1e5c	Revert "Deprecate sorting in reindex (#49458 )" This reverts commit `27d45c9f1f`.	2019-11-29 22:08:19 +01:00
Mayya Sharipova	7cf170830c	Optimize sort on numeric long and date fields. (#49732 ) This rewrites long sort as a `DistanceFeatureQuery`, which can efficiently skip non-competitive blocks and segments of documents. Depending on the dataset, the speedups can be 2 - 10 times. The optimization can be disabled with setting the system property `es.search.rewrite_sort` to `false`. Optimization is skipped when an index has 50% or more data with the same value. Optimization is done through: 1. Rewriting sort as `DistanceFeatureQuery` which can efficiently skip non-competitive blocks and segments of documents. 2. Sorting segments according to the primary numeric sort field(#44021) This allows to skip non-competitive segments. 3. Using collector manager. When we optimize sort, we sort segments by their min/max value. As a collector expects to have segments in order, we can not use a single collector for sorted segments. We use collectorManager, where for every segment a dedicated collector will be created. 4. Using Lucene's shared TopFieldCollector manager This collector manager is able to exchange minimum competitive score between collectors, which allows us to efficiently skip the whole segments that don't contain competitive scores. 5. When index is force merged to a single segment, #48533 interleaving old and new segments allows for this optimization as well, as blocks with non-competitive docs can be skipped. Backport for #48804 Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>	2019-11-29 15:37:40 -05:00
Henning Andersen	27d45c9f1f	Deprecate sorting in reindex (#49458 ) Reindex sort never gave a guarantee about the order of documents being indexed into the destination, though it could give a sense of locality of source data. It prevents us from doing resilient reindex and other optimizations and it has therefore been deprecated. Related to #47567	2019-11-29 21:35:11 +01:00
Tugberk Ugurlu	dcb9d5177c	[Docs] Fix typo in templates.asciidoc (#49726 )	2019-11-29 18:43:13 +01:00
Dimitris Athanasiou	4edb2e7bb6	[7.x][ML] Add optional source filtering during data frame reindexing (#49690 ) (#49718 ) This adds a `_source` setting under the `source` setting of a data frame analytics config. The new `_source` is reusing the structure of a `FetchSourceContext` like `analyzed_fields` does. Specifying includes and excludes for source allows selecting which fields will get reindexed and will be available in the destination index. Closes #49531 Backport of #49690	2019-11-29 16:10:44 +02:00
Tim Vernum	e6f530c167	Improved diagnostics for TLS trust failures (#49669 ) - Improves HTTP client hostname verification failure messages - Adds "DiagnosticTrustManager" which logs certificate information when trust cannot be established (hostname failure, CA path failure, etc) These diagnostic messages are designed so that many common TLS problems can be diagnosed based solely (or primarily) on the elasticsearch logs. These diagnostics can be disabled by setting xpack.security.ssl.diagnose.trust: false Backport of: #48911	2019-11-29 15:01:20 +11:00
Tim Vernum	31f13e839c	Correct the documentation for create_doc privilege (#49354 ) The documentation was added in #47584 but those docs did not reflect the up-to-date behavior of the feature. Backport of: #47784	2019-11-29 12:59:16 +11:00
Marios Trivyzas	d5842aebab	[Docs] Enhance rolling upgrade guide (#49686 ) Add a couple of pointers for the user to check the overall cluster health and the version of ES running on every node. Fixes: #49670 (cherry picked from commit 8ca11f54cd839f41632c556601e94da67e91a3d1)	2019-11-28 17:02:36 +01:00
Ignacio Vera	326fe7566e	New Histogram field mapper that supports percentiles aggregations. (#48580 ) (#49683 ) This commit adds a new histogram field mapper that consists in a pre-aggregated format of numerical data to be used in percentiles aggregations.	2019-11-28 15:06:26 +01:00
Jim Ferenczi	d6445fae4b	Add a cluster setting to disallow loading fielddata on _id field (#49166 ) This change adds a dynamic cluster setting named `indices.id_field_data.enabled`. When set to `false` any attempt to load the fielddata for the `_id` field will fail with an exception. The default value in this change is set to `false` in order to prevent fielddata usage on this field for future versions but it will be set to `true` when backporting to 7x. When the setting is set to true (manually or by default in 7x) the loading will also issue a deprecation warning since we want to disallow fielddata entirely when https://github.com/elastic/elasticsearch/issues/26472 is implemented. Closes #43599	2019-11-28 09:35:28 +01:00
Ryan Ernst	f288696040	Remove legacy referene to file scripts (#49339 ) This commit removes outdated documentation about a path setting for file scripts which no longer exist. closes #45827	2019-11-27 10:42:33 -08:00
Ryan Ernst	297efa8324	Add JAVA_HOME env override location to docs (#49565 ) This commit clarifies how to override JAVA_HOME from the bundled jdk for deb and rpm installs, which each have their own file that is sourced upon service startup. closes #49068	2019-11-27 10:40:30 -08:00
Martijn van Groningen	0a42395dfa	Backport: add templating support to pipeline processor (#49643 ) Backport of #49030 This commit adds templating support to the pipeline processor's `name` option. Closes #39955	2019-11-27 15:53:40 +01:00
Xiang Dai	15342c4dd2	[DOCS] Clarify how to update max memory size in bootstrap checks (#48975 )	2019-11-27 09:40:00 -05:00
bellengao	2e0b079a16	[DOCS] Correct the request path for flush API docs (#49615 )	2019-11-27 09:27:28 -05:00
Yannick Welsch	5ffb615b37	[DOCS] Correct request path for synced flush API docs (#49631 ) Fixes an incorrect request path added with #46634	2019-11-27 08:44:51 -05:00
glerb	d1ed2ae25b	[Docs] Correct typo in log file name (#49620 )	2019-11-27 14:37:47 +01:00
Dimitrios Liappis	4b6915ea41	Clarify gid used by docker image process and bind-mount method (#49632 ) Fix reference about the uid:gid that Elasticsearch runs as inside the Docker container and add a packaging test to ensure that bind mounting a data dir with a random uid and gid:0 works as expected. Backport of #49529 Closes #47929	2019-11-27 13:42:54 +02:00
Martijn van Groningen	09c4269097	Add templating support to enrich processor (#49093 ) Adds support for templating to `field` and `target_field` options.	2019-11-27 08:53:11 +01:00
Martijn van Groningen	90850f4ea0	Backport: Introduce on_failure_pipeline ingest metadata inside on_failure block (#49596 ) Backport of #49076 In case an exception occurs inside a pipeline processor, the pipeline stack is kept around as header in the exception. Then in the on_failure processor the id of the pipeline the exception occurred is made accessible via the `on_failure_pipeline` ingest metadata. Closes #44920	2019-11-27 07:52:08 +01:00
lcawl	777431265b	[DOCS] Fixes typo in ML resources	2019-11-26 10:28:59 -08:00
lcawl	a42003b95b	[DOCS] Fixes data type formatting	2019-11-26 08:22:50 -08:00
Marios Trivyzas	3c69d4d0bd	SQL: Add TRUNC alias for TRUNCATE (#49571 ) Add TRUNC as alias to already implemented TRUNCATE numeric function which is the flavour supported by Oracle and PostgreSQL. Relates to: #41195 (cherry picked from commit f2aa7f0779bc5cce40cc0c1f5e5cf1a5bb7d84f0)	2019-11-26 12:32:54 +01:00
Christoph Büscher	a4208e44f7	[Docs] Correct `max_doc_freq` default value (#49536 ) The default is set to Integer.MAX_VALUE but is reported to be `0` in the docs. With the current implementation a value of 0 would mean all terms are filtered out, which is the opposite of "unbounded". Closes #49520	2019-11-26 10:47:05 +01:00
Tim Vernum	9cb1ace1c2	Expand docs on TLSv1 breaking change (#49352 ) The breaking changes cover the removal of TLSv1 from the default protocols, but assume that users who need to retain TLSv1 support will understand all the places where they may used it. This has proven not to be true, as it is easy to be unaware that (for example) an LDAP server is using TLSv1. This change explicitly lists all the places where TLS protocols may need to be configured. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> Co-Authored-By: Pius <pius@elastic.co>	2019-11-26 16:34:55 +11:00
James Rodewig	2fd58bb845	[DOCS] Add missing "_type" to delimited payload token filter docs	2019-11-25 16:16:05 -05:00
Lisa Cawley	26beb486c7	[DOCS] Fixes security links (#49563 )	2019-11-25 13:02:26 -08:00
James Rodewig	c40449ac22	[DOCS] Reformat delimited payload token filter docs (#49380 ) * Adds a title abbreviation * Relocates the older name deprecation warning * Updates the description and adds a Lucene link * Adds a note to explain payloads and how to store them * Adds analyze and custom analyzer snippets * Adds a 'Return stored payloads' example	2019-11-25 15:40:05 -05:00
James Rodewig	99476db2d0	[DOCS] Remove individual task retrieval from cat/tasks API (#49550 )	2019-11-25 10:32:39 -05:00
Kelly Campbell	df5afa797e	[DOCS] Correct GET path in cat tasks API docs (#49494 ) Previously, the request example included `GET _cat/_tasks`. However, the resource should be `tasks`, not `_tasks`.	2019-11-25 09:37:59 -05:00
David Roberts	62811c2272	[ML] Add default categorization analyzer definition to ML info (#49545 ) The categorization job wizard in the ML UI will use this information when showing the effect of the chosen categorization analyzer on a sample of input.	2019-11-25 13:39:16 +00:00
Dimitris Athanasiou	d21df9eba9	[ML][DOCS] Anomaly detection job retention days settings do not require restart (#49546 )	2019-11-25 14:19:10 +01:00
debadair	2ec047db04	[DOCS] Rename auditing topic. Closes #49012 (#49013 ) * [DOCS] Rename auditing topic. Closes #49012 * Fixed file name, fixed settings link. * Add link to settings	2019-11-22 14:16:58 -08:00
James Rodewig	d06c71eb82	[DOCS] Fix edge n-gram tokenizer nav Adds a missing float tag to the edge n-gram tokenizer docs. This tag ensures the edge n-gram tokenizer docs display on the same page.	2019-11-22 15:54:07 -05:00
Dimitris Athanasiou	8eaee7cbdc	[7.x][ML] Explain data frame analytics API (#49455 ) (#49504 ) This commit replaces the _estimate_memory_usage API with a new API, the _explain API. The API consolidates information that is useful before creating a data frame analytics job. It includes: - memory estimation - field selection explanation Memory estimation is moved here from what was previously calculated in the _estimate_memory_usage API. Field selection is a new feature that explains to the user whether each available field was selected to be included or not in the analysis. In the case it was not included, it also explains the reason why. Backport of #49455	2019-11-22 22:06:10 +02:00
Jason Tedor	71bcfbf1e3	Replace required pipeline with final pipeline (#49470 ) This commit enhances the required pipeline functionality by changing it so that default/request pipelines can also be executed, but the required pipeline is always executed last. This gives users the flexibility to execute their own indexing pipelines, but also ensure that any required pipelines are also executed. Since such pipelines are executed last, we change the name of required pipelines to final pipelines.	2019-11-22 14:37:36 -05:00
Lisa Cawley	ca895d3ad5	[DOCS] Merge rollup config details into API (#49412 )	2019-11-22 08:39:49 -08:00
James Rodewig	562607d3f5	[DOCS] Reformat n-gram token filter docs (#49438 ) Reformats the edge n-gram and n-gram token filter docs. Changes include: * Adds title abbreviations * Updates the descriptions and adds Lucene links * Reformats parameter definitions * Adds analyze and custom analyzer snippets * Adds notes explaining differences between the edge n-gram and n-gram filters Additional changes: * Switches titles to use "n-gram" throughout. * Fixes a typo in the edge n-gram tokenizer docs * Adds an explicit anchor for the `index.max_ngram_diff` setting	2019-11-22 10:38:50 -05:00
István Zoltán Szabó	35cc0e0948	[DOCS] Removes the default size definition of thread pool types (#49442 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-11-22 11:20:11 +01:00
Florian Kelbert	d444c334d7	Modify example for pinned query (#49481 ) I do not see any reason to advertise phones of specific companies.	2019-11-22 11:03:04 +01:00
István Zoltán Szabó	c13fce60a8	[DOCS] Removes data frame leftovers from transforms overview (#49434 )	2019-11-22 10:20:15 +01:00
James Rodewig	0fa3b887b7	[DOCS] Document several missing thread pools (#48543 ) Adds documentation for the following thread pools: - fetch_shard_started - fetch_shard_store - flush - force_merge - management Closes #48524 Co-Authored-By: Jay Modi <jaymode@users.noreply.github.com>	2019-11-21 13:12:56 -05:00
Hendrik Muhs	779b4bd92b	update the name of the audit index (#49432 ) small update to the name of the audit index changed in 7.5	2019-11-21 16:15:53 +01:00
James Rodewig	f264808a6a	[DOCS] Replace cross-cluster search PNG images with SVGs (#49395 )	2019-11-21 09:06:33 -05:00
James Rodewig	03600e4e12	[DOCS] Document `script_score` float precision limit (#49402 ) All document scores are positive 32-bit floating point numbers. However, this wasn't previously documented. This can result in surprising behavior, such as precision loss, for users when customizing scores using the function score query. This commit updates an existing admonition in the function score query docs to document the 32-bits precision limit. It also updates the search API reference docs to note that `_score` is a 32-bit float.	2019-11-21 08:54:49 -05:00
weizijun	3eb577f6c8	Document all shard allocation filtering attributes (#46992 ) This commit adds coverage to the docs for some missing built-in shard allocation attributes.	2019-11-21 08:30:30 -05:00
Peter Johnson	3221827a4b	[Docs] Correct typo in match-query.asciidoc (#49082 )	2019-11-21 11:31:01 +01:00
Lisa Cawley	0f15736687	[DOCS] Reformat rollup API docs (#49397 )	2019-11-20 10:46:16 -08:00
Bogdan Pintea	8c2ab8bb72	SQL:Docs: add the PIVOT clause to SELECT section (#49129 ) The PR adds the documentation on the PIVOT clause. (cherry picked from commit a55b36065e6496c44b6e3191296931d477a8e5f5)	2019-11-20 18:21:06 +01:00
Lisa Cawley	a27e0fe10d	[DOCS] Reformat ILM API docs (#49348 )	2019-11-20 08:24:46 -08:00
Mayya Sharipova	e3da60c23d	Increase the number of vector dims to 2048 (#46895 )	2019-11-20 07:47:33 -05:00
Christoph Büscher	4ffa050735	Allow custom characters in token_chars of ngram tokenizers (#49250 ) Currently the `token_chars` setting in both `edgeNGram` and `ngram` tokenizers only allows for a list of predefined character classes, which might not fit every use case. For example, including underscore "_" in a token would currently require the `punctuation` class which comes with a lot of other characters. This change adds an additional "custom" option to the `token_chars` setting, which requires an additional `custom_token_chars` setting to be present and which will be interpreted as a set of characters to inlcude into a token. Closes #25894	2019-11-20 10:37:12 +01:00
Mathew Davis	92a1faf545	Fixing a typo in the stop SLM api request header.	2019-11-19 23:06:49 -07:00
Lisa Cawley	2b9fb7ebe2	[DOCS] Merges security overview pages (#49342 )	2019-11-19 16:19:02 -08:00
Benjamin Trent	d068818b16	[ML][Inference] document new settings (#49309 ) (#49336 ) * [ML][Inference] document new settings * [DOCS] Minor edits	2019-11-19 16:43:19 -05:00
James Rodewig	62a3154d0e	[DOCS] [7.x] Add high-level docs for enrich processor and policies (#49194 ) (#49331 )	2019-11-19 16:38:13 -05:00
Lisa Cawley	75f1f612c2	[DOCS] Merges duplicate pages for Active Directory realms (#49205 )	2019-11-19 13:18:01 -08:00
Lisa Cawley	c4c8a7a43c	[DOCS] Merges duplicate pages for PKI realms (#49206 )	2019-11-19 10:51:09 -08:00
Lisa Cawley	62bbe419d3	[DOCS] Removes Beats security page (#49276 )	2019-11-19 09:15:30 -08:00
Lisa Cawley	97cdfd2848	[DOCS] Clarify ML job closure prerequisites (#49265 )	2019-11-19 08:36:50 -08:00
James Rodewig	a26916cc23	[DOCS] Reformat elision token filter docs (#49262 )	2019-11-19 10:55:22 -05:00
James Rodewig	8639ddab5e	[DOCS] Reformat fingerprint token filter docs (#49311 )	2019-11-19 10:55:21 -05:00
jimczi	cb5169ae37	update release notes for 7.5.0 after respin	2019-11-19 16:24:04 +01:00
Marios Trivyzas	fd1bb4a33a	SQL: Fix issue with mins & hours for DATEDIFF (#49252 ) Previously, DATEDIFF for minutes and hours was doing a rounding calculation using all the time fields (secs, msecs/micros/nanos). Instead it should first truncate the 2 dates to the respective field (mins or hours) zeroing out all the more detailed time fields and then make the subtraction. (cherry picked from commit 124cd18e20429e19d52fd8dc383827ea5132d428)	2019-11-19 14:25:28 +01:00
Lisa Cawley	abd4a70b10	[DOCS] Merges duplicate pages for Kerberos realms (#49207 )	2019-11-18 15:23:06 -08:00
Lisa Cawley	b4f82c9cdb	[DOCS] Merges duplicate pages for LDAP realms (#49203 )	2019-11-18 14:09:24 -08:00
Julie Tibshirani	81a9d98a47	Remove the 'experimental' marking from vector fields. (#49120 ) We wrapped up the API changes we wanted to make, and vector fields can now be considered GA.	2019-11-18 12:42:46 -08:00
Lisa Cawley	b0054eecd6	[DOCS] Merges duplicate pages for file realms (#49200 )	2019-11-18 12:02:18 -08:00
Lisa Cawley	48f53efd9a	[DOCS] Merges duplicate pages for SAML realms (#49209 )	2019-11-18 10:09:29 -08:00
Lisa Cawley	b0b5fcc4f6	[DOCS] Removes closed security PRs from release notes (#49256 )	2019-11-18 09:19:11 -08:00
gpaimla	7d20b50f45	Implement Lucene EstonianAnalyzer, Stemmer (#49149 ) This PR adds a new analyzer and stemmer for the Estonian language. Closes #48895	2019-11-18 17:24:21 +01:00
Antoine Garcia	288217e82b	[Docs] Specify field types not supporting doc values (#49041 ) The `string` type (with option `analyzed`) has been replaced by `text` after `6.0`, also the `annonated_text` field do not support doc values and should be mentioned.	2019-11-18 16:38:31 +01:00
Yannick Welsch	af797a77a1	Auto-expand indices according to allocation filtering rules (#48974 ) Honours allocation filtering rules when auto-expanding indices.	2019-11-18 12:01:56 +01:00
Rory Hunter	e84e21174b	Support `_FILE` suffixed env vars in Docker entrypoint (#49182 ) Backport of #47573. Closes #43603. Allow environment variables to be passed to ES in a Docker container via a file, by setting an environment variable with the `_FILE` suffix that points to the file with the intended value of the env var.	2019-11-18 08:22:35 +00:00
Lisa Cawley	09a9ec4d23	[DOCS] Merges duplicate pages for native realms (#49198 )	2019-11-15 15:35:53 -08:00
Lisa Cawley	de8107e350	[DOCS] Adds ml-cpp PRs to release notes (#49185 )	2019-11-15 09:36:39 -08:00
Lisa Cawley	eca93fcc5f	[DOCS] Adds machine learning node type and filters (#49121 )	2019-11-15 08:31:59 -08:00
Christos Soulios	d9f0245b10	[7.x] Implement stats aggregation for string terms (#49097 ) Backport of #47468 to 7.x This PR adds a new metric aggregation called string_stats that operates on string terms of a document and returns the following: min_length: The length of the shortest term max_length: The length of the longest term avg_length: The average length of all terms distribution: The probability distribution of all characters appearing in all terms entropy: The total Shannon entropy value calculated for all terms This aggregation has been implemented as an analytics plugin.	2019-11-15 14:36:21 +02:00
SylvainJuge	e8f49cdee0	[DOCS] minor fix to documentation: http.host can't default to itself (#48135 ) fix minor typos on http.host and transport.host default values. 7.x backport of https://github.com/elastic/elasticsearch/pull/48135	2019-11-14 18:16:38 +01:00
James Rodewig	e1726fff56	[DOCS] Reformat update license API docs (#48967 ) Makes a few changes to better align the update license API docs with the [API reference template][0]. Changes: * Replaces POST with PUT in several snippet examples. While both are valid, PUT is a bit more RESTful. * Removes leading slashes (/) from all snippets. * Relocates and retitles the 'Authorization' section to 'Prerequisites'. * Replaces explicit titles with the appropriate API reference template attributes. * Replaces unneeded `[float]` tags with explicit anchors. Closes #35341 [0]: https://github.com/elastic/docs/blob/master/shared/api-ref-ex.asciidoc	2019-11-14 08:00:42 -05:00
James Rodewig	095c34359f	[DOCS] Note limitations of `max_gram` parm in `edge_ngram` tokenizer for index analyzers (#49007 ) The `edge_ngram` tokenizer limits tokens to the `max_gram` character length. Autocomplete searches for terms longer than this limit return no results. To prevent this, you can use the `truncate` token filter to truncate tokens to the `max_gram` character length. However, this could return irrelevant results. This commit adds some advisory text to make users aware of this limitation and outline the tradeoffs for each approach. Closes #48956.	2019-11-13 14:28:12 -05:00
James Rodewig	838af15d29	[DOCS] Reformat compound word token filters (#49006 ) * Separates the compound token filters doc pages into separate token filter pages: * Dictionary decompounder token filter * Hyphenation decompounder token filter * Adds analyze API examples for each compound token filter * Adds a redirect for the removed compound token filters page Co-Authored-By: debadair <debadair@elastic.co>	2019-11-13 09:36:52 -05:00
István Zoltán Szabó	b55022b59f	[DOCS] Adds test clause to the code snippets in the cluster restart page (#49023 )	2019-11-13 14:36:44 +01:00
Julie Tibshirani	37fa3fb4ff	Ensure parameters are updated when merging flattened mappings. (#48971 ) (#49014 ) This PR makes the following two fixes around updating flattened fields: * Make sure that the new value for ignore_above is immediately taken into affect. Previously we recorded the new value but did not use it when parsing documents. * Allow depth_limit to be updated dynamically. It seems plausible that a user might want to tweak this setting as they encounter more data.	2019-11-12 21:50:39 -05:00
David Roberts	698ebd3d0a	[TEST] Mute docs snippet test in close-job.asciidoc (#49000 ) Due to https://github.com/elastic/elasticsearch/pull/48583#issuecomment-552991325	2019-11-12 17:34:27 +00:00
Orhan Toy	561351d2fc	[Docs] Fix _count HTTP method (#48979 )	2019-11-12 15:45:26 +01:00
István Zoltán Szabó	fc145575c4	[DOCS] Creates a cluster restart documentation page (#48583 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-11-12 14:50:53 +01:00
James Rodewig	42e92616f6	[DOCS] Document indices response parameters for node stats API (#47525 )	2019-11-12 08:35:35 -05:00
jimczi	0e82b5f59b	add release notes for 7.5.0	2019-11-12 09:59:14 +01:00
Benjamin Trent	46ab1db54f	[7.x] [ML] Add new geo_results.(actual_point\|typical_point) fields for `lat_long` results (#47050 ) (#48958 ) * [ML] Add new geo_results.(actual_point\|typical_point) fields for `lat_long` results (#47050) [ML] Add new geo_results.(actual_point\|typical_point) fields for `lat_long` results (#47050) Related PR: https://github.com/elastic/ml-cpp/pull/809 * adjusting bwc version	2019-11-11 15:43:03 -05:00
István Zoltán Szabó	c2f52015d3	[DOCS] Removes best practice about fields that are highly correlated to the dependent variable. (#48935 )	2019-11-11 16:01:21 +01:00
István Zoltán Szabó	91888959e8	[DOCS] Extends analyzed_fields description in PUT DFA API docs. (#48307 )	2019-11-11 15:55:12 +01:00
Patrick Maynard	4b85498617	[DOCS] Fix typo in search type docs (#48868 )	2019-11-11 09:38:48 -05:00
James Rodewig	dd92830801	[DOCS] Reformat condition token filter (#48775 )	2019-11-11 08:49:44 -05:00
Arne Welzel	f642baa9fb	[DOCS] Remove extra "when" (#48926 )	2019-11-11 10:11:02 +01:00
Yannick Welsch	87862868c6	Allow realtime get to read from translog (#48843 ) The realtime GET API currently has erratic performance in case where a document is accessed that has just been indexed but not refreshed yet, as the implementation will currently force an internal refresh in that case. Refreshing can be an expensive operation, and also will block the thread that executes the GET operation, blocking other GETs to be processed. In case of frequent access of recently indexed documents, this can lead to a refresh storm and terrible GET performance. While older versions of Elasticsearch (2.x and older) did not trigger refreshes and instead opted to read from the translog in case of realtime GET API or update API, this was removed in 5.0 (#20102) to avoid inconsistencies between values that were returned from the translog and those returned by the index. This was partially reverted in 6.3 (#29264) to allow _update and upsert to read from the translog again as it was easier to guarantee consistency for these, and also brought back more predictable performance characteristics of this API. Calls to the realtime GET API, however, would still always do a refresh if necessary to return consistent results. This means that users that were calling realtime GET APIs to coordinate updates on client side (realtime GET + CAS for conditional index of updated doc) would still see very erratic performance. This PR (together with #48707) resolves the inconsistencies between reading from translog and index. In particular it fixes the inconsistencies that happen when requesting stored fields, which were not available when reading from translog. In case where stored fields are requested, this PR will reparse the _source from the translog and derive the stored fields to be returned. With this, it changes the realtime GET API to allow reading from the translog again, avoid refresh storms and blocking the GET threadpool, and provide overall much better and predictable performance for this API.	2019-11-09 17:47:50 +01:00
Julian Simioni	5e4501eb3f	[Docs] Consolidate single example into a single line (#48904 ) The first example of splitting rules for the `word_delimiter` token filter was spread across two bullet points. This makes it look like they are two separate splitting rules.	2019-11-08 15:12:45 -05:00
Yannick Welsch	af887be3e5	Hide orphaned tasks from follower stats (#48901 ) CCR follower stats can return information for persistent tasks that are in the process of being cleaned up. This is problematic for tests where CCR follower indices have been deleted, but their persistent follower task is only cleaned up asynchronously afterwards. If one of the following tests then accesses the follower stats, it might still get the stats for that follower task. In addition, some tests were not cleaning up their auto-follow patterns, leaving orphaned patterns behind. Other tests cleaned up their auto-follow patterns. As always the same name was used, it just depended on the test execution order whether this led to a failure or not. This commit fixes the offensive tests, and will also automatically remove auto-follow-patterns at the end of tests, like we do for many other features. Closes #48700	2019-11-08 13:56:53 +01:00
bellengao	bdc7057d58	[DOCS] Correct typo in split index API docs (#48894 )	2019-11-07 15:27:27 -05:00
bellengao	293902c6a5	[DOCS] Fix shard type in CCR overview doc (#48882 ) Closes #48875	2019-11-07 10:09:45 -05:00
Tanguy Leroux	552381d7f9	Add mention to Pause Auto-Follower API in Upgrade Clusters docs (#48764 ) Relates #46665	2019-11-06 09:48:44 -05:00

... 4 5 6 7 8 ...

6820 Commits