OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lisa Cawley	1a40ebfa67	[DOCS] Adds missing testenv attribute (#51719 )	2020-01-30 16:15:17 -08:00
Lee Hinman	b9faa0733d	[7.x] Rename ILM history index enablement setting (#51698 ) (#51705 ) * Rename ILM history index enablement setting The previous setting was `index.lifecycle.history_index_enabled`, this commit changes it to `indices.lifecycle.history_index_enabled` to indicate this is not an index-level setting (it's node level).	2020-01-30 15:27:44 -07:00
James Rodewig	36b2663e98	[DOCS] Add attribute for Lucene analysis links (#51687 ) Adds a `lucene-analysis-docs` attribute for the Lucene `/analysis/` javadocs directory. This should prevent typos and keep the docs DRY.	2020-01-30 11:24:01 -05:00
James Rodewig	4fcf5a9de4	[DOCS] Rewrite analysis intro (#51184 ) * [DOCS] Rewrite analysis intro. Move index/search analysis content. * Rewrites 'Text analysis' page intro as high-level definition. Adds guidance on when users should configure text analysis * Rewrites and splits index/search analysis content: * Conceptual content -> 'Index and search analysis' under 'Concepts' * Task-based content -> 'Specify an analyzer' under 'Configure...' * Adds detailed examples for when to use the same index/search analyzer and when not. * Adds new example snippets for specifying search analyzers * clarifications * Add toc. Decrement headings. * Reword 'When to configure' section * Remove sentence from tip	2020-01-30 09:32:16 -05:00
Marios Trivyzas	f373020349	SQL: Fix ORDER BY YEAR() function (#51562 ) Previously, if YEAR() was used as an ORDER BY argument without being wrapped in another scalar (e.g. YEAR(birth_date) + 10), no script ordering was used; the underlying field (e.g. birth_date) was used instead as a performance optimisation. This works correctly if YEAR() is the only ORDER BY arg, but if further args are used as tie breakers for the ordering, wrong results are produced. This is because 2 rows with different birth_date values in the same year are not tied, as the underlying ordering is on birth_date and not on YEAR(birth_date), so the following ORDER BY args are ignored. Remove this optimisation for YEAR() to avoid incorrect results in such cases. As a consequence another bug is revealed: scalar functions on top of nested fields produce scripted sorting/filtering which is not yet supported. In such cases no error was thrown but instead all values for such nested fields were null and were passed to the script implementing the sorting/filtering, producing incorrect results. Detect such cases and throw a validation exception. Fixes: #51224 (cherry picked from commit f41efd6753dc3650a7eabb3e07b02b3b32c5704c)	2020-01-30 15:29:36 +01:00
Nhat Nguyen	f0fad5b622	Deprecate translog retention settings (#51588 ) (#51638 ) This change deprecates the translog retention settings as they are effectively ignored since 7.4. Relates #50775 Relates #45473	2020-01-30 09:03:10 -05:00
Henning Andersen	ca8601373a	[DOCS] Task management API experimental status issue (#51634 ) Add issue reference to documentation. Relates #51628	2020-01-30 14:15:47 +01:00
Peter Dyson	b5a2ee5be2	[DOCS] Fix minor typo affecting formatting (#51655 )	2020-01-29 23:44:09 -08:00
Lisa Cawley	28f2f3dd02	[DOCS] Minor fixes in transform documentation (#51633 )	2020-01-29 16:58:18 -08:00
Lisa Cawley	fdf74f6ae4	[DOCS] Removes beta qualifiers from transform documentation (#51553 )	2020-01-29 08:41:54 -08:00
Lisa Cawley	3f4156e95a	[DOCS] Adds release highlight for transforms (#51555 )	2020-01-29 08:35:02 -08:00
Albert Zaharovits	90285ee907	Deprecate timeout.tcp_read AD/LDAP realm setting (#47305 ) The timeout.tcp_read AD/LDAP realm setting, despite the low-level allusion, controls the time interval the realms wait for a response for a query (search or bind). If the connection to the server is synchronous (un-pooled) the response timeout is analogous to the tcp read timeout. But the tcp read timeout is irrelevant in the common case of a pooled connection (when a Bind DN is specified). The timeout.tcp_read qualifier is hereby deprecated in favor of timeout.response. In addition, the default value for both timeout.tcp_read and timeout.response is that of timeout.ldap_search, instead of the 5s (but the default for timeout.ldap_search is still 5s). The timeout.ldap_search defines the server-controlled timeout of a search request. There is no practical use case to have a smaller tcp_read timeout compared to ldap_search (in this case the request would time-out on the client but continue to be processed on the server). The proposed change aims to simplify configuration so that the more common configuration change, adjusting timeout.ldap_search up, has the expected result (no timeout during searches) without any additional modifications. Closes #46028	2020-01-29 10:48:26 +02:00
Gordon Brown	89c2834b24	Deprecate creation of dot-prefixed index names except for hidden and system indices (#49959 ) This commit deprecates the creation of dot-prefixed index names (e.g. .watches) unless they are either 1) a hidden index, or 2) registered by a plugin that extends SystemIndexPlugin. This is the first step towards more thorough protections for system indices. This commit also modifies several plugins which use dot-prefixed indices to register indices they own as system indices, and adds a plugin to register .tasks as a system index.	2020-01-28 10:01:16 -07:00
James Rodewig	139305ffc8	[DOCS] Document `indices` cluster stats (#50527 ) Documents the header and `indices` response parameters returned by the `_cluster/stats` API. Co-Authored-By: David Turner <david.turner@elastic.co>	2020-01-28 11:00:00 -05:00
Yannick Welsch	fa212fe60b	Stricter checks of setup and teardown in docs tests (#51430 ) Adds extra checks due to 7.x backport	2020-01-28 16:52:23 +01:00
Yannick Welsch	f6686345c9	Avoid unnecessary setup and teardown in docs tests (#51430 ) The docs tests have recently been running much slower than before (see #49753). The gist here is that with ILM/SLM we do a lot of unnecessary setup / teardown work on each test. Compounded with the slightly slower cluster state storage mechanism, this causes the tests to run much slower. In particular, on RAMDisk, docs:check is taking ES 7.4: 6:55 minutes ES master: 16:09 minutes ES with this commit: 6:52 minutes on SSD, docs:check is taking ES 7.4: ??? minutes ES master: 32:20 minutes ES with this commit: 11:21 minutes	2020-01-28 16:52:23 +01:00
James Rodewig	70e4ae3381	[DOCS] Reformat unique token filter docs (#50748 ) * Updates the description * Adds analyze, custom analyzer, and custom filter snippets * Adds parameter documentation	2020-01-28 10:42:25 -05:00
David Roberts	550254ec7f	[ML] Use CSV ingest processor in find_file_structure ingest pipeline (#51492 ) Changes the find_file_structure response to include a CSV ingest processor in the ingest pipeline it suggests. Previously the Kibana file upload functionality parsed CSV in the browser, but parsing CSV in the ingest pipeline makes the Kibana file upload functionality more easily interchangeable with Filebeat, so that the configurations it creates can more easily be used to import data with the same structure repeatedly in production.	2020-01-28 14:38:43 +00:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123 ) (#51510 ) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores. When reading a keystore, a user is prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase. Passphrase can be set to be empty when changing/setting it for an existing keystore. Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands, where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior, auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool. 
There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. 
This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. 
* Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file for what gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail. 
It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
James Rodewig	65f49d0bba	[DOCS] Add top-level EQL docs page. Adds EQL requirements page. (#51334 ) * Creates a top-level page for EQL in the ES reference. This page contains a high-level introduction and will include a nav for other EQL docs pages as they're built. * Creates a requirements page. This page outlines the fields needed to use EQL in ES.	2020-01-27 16:04:47 -05:00
James Rodewig	23b65390ab	[DOCS] Add response snippets to 'Testing analyzers' page (#51427 ) Adds response snippets to the `POST _analyze` snippets in the 'Testing analyzers' page. Co-authored-by: Emmanuel DEMEY <demey.emmanuel@gmail.com>	2020-01-27 08:41:44 -05:00
David Turner	49bde5d286	Remove DEBUG-level default logging from actions (#51459 ) In `2bb31fe` (v0.6.0!) we added DEBUG-level logging to the default config of action loggers "for easier debugging". This change to the default config lives on to this day. It does not obviously make debugging any easier any more, but it does result in a good deal of log noise sometimes. This commit removes this special case from the default config. Closes #51198	2020-01-27 10:50:10 +00:00
Lisa Cawley	931b22349f	[DOCS] Adds http to elasticsearch-certutil command reference (#51188 )	2020-01-24 09:59:07 -08:00
Mayya Sharipova	a29deecbda	Revert "Make it clear this is boost at index time (#51390 )" This reverts commit `3d5238bd95`.	2020-01-24 11:05:42 -05:00
Jonas F. Henriksen	3d5238bd95	Make it clear this is boost at index time (#51390 ) The way it was originally written, it sounds like we are boosting at query time. Of course, the effect is at query time, but the point here is that boosting is done at index time	2020-01-24 10:37:07 -05:00
Benjamin Trent	76660a5a4f	[7.x] [ML][Inference] add tags url param to GET (#51330 ) (#51404 ) * [ML][Inference] add tags url param to GET (#51330) Adds a new URL parameter, `tags`, to the GET _ml/inference/<model_id> endpoint. This parameter allows the list of models to be further reduced to those that contain all the provided tags.	2020-01-24 08:26:58 -05:00
István Zoltán Szabó	8bdf654cc7	[DOCS] Refines description. (#51400 )	2020-01-24 13:34:25 +01:00
David Turner	40e7a826fc	Allow decimal max_task_wait_time in docs (#51352 ) The regex for the response to `GET _cat/health?v` in `getting-started.asciidoc` requires `max_task_wait_time` to match `(-\|\\d+(micros\|ms\|s))`, which doesn't match times such as `3.9ms` that contain a decimal point. This commit adjusts the regex to match times formatted like this too. Fixes #47537	2020-01-24 08:59:43 +00:00
Lisa Cawley	ec47698f7c	[DOCS] Updates categorization examples with wizard screenshots (#51133 )	2020-01-22 11:28:17 -08:00
Zachary Tong	83647101ef	Update Release notes for BC2	2020-01-22 14:05:59 -05:00
Lisa Cawley	4590d4156a	[DOCS] Clarify interval, frequency, and bucket span in ML APIs and example (#51280 )	2020-01-22 08:15:46 -08:00
Igor Motov	08e9c673e5	Fix leftover mentions of method parameter in Percentile Aggs (#51272 ) The method parameter is not used in the percentile aggs, instead the method is determined by the presence of `hdr` or `tdigest` objects. Relates to #8324	2020-01-22 10:03:35 -05:00
David Kyle	ca4b90a001	[ML] Calculate results and snapshot retention using latest bucket timestamps (#51061 ) (#51301 ) The retention period is calculated relative to the last bucket result or snapshot time rather than wall clock	2020-01-22 14:52:33 +00:00
Zachary Tong	b38fdf9f94	Add release highlights for 7.6 (#51070 ) Add release highlights for 7.6 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-01-22 09:42:57 -05:00
Russ Cam	86a50a24f3	[Docs] Including leading slash in range query doc example URLs (#51277 )	2020-01-22 09:42:18 +01:00
Stuart Tettemer	41c15b438d	Scripting: Add char position of script errors (#51069 ) (#51266 ) Add the character position of a scripting error to error responses. The contents of the `position` field are experimental and subject to change. Currently, `offset` refers to the character location where the error was encountered, `start` and `end` define a range of characters that contain the error. eg. ``` { "error": { "root_cause": [ { "type": "script_exception", "reason": "runtime error", "script_stack": [ "y = x;", " ^---- HERE" ], "script": "def x = new ArrayList(); Map y = x;", "lang": "painless", "position": { "offset": 33, "start": 29, "end": 35 } } ``` Refs: #50993	2020-01-21 13:45:59 -07:00
István Zoltán Szabó	30d1587ad5	[DOCS] Fixes indentation in inference processor code snippet (#51252 )	2020-01-21 16:22:16 +01:00
Lisa Cawley	79cf0894fa	[DOCS] Adds ML PRs to release notes (#51234 )	2020-01-20 11:08:23 -08:00
Jason Tedor	e20459c202	Exclude autoscaling docs from release docs (#51190 ) Since autoscaling is currently only under development, this commit causes the autoscaling docs to be excluded any time that release docs are being built.	2020-01-20 10:52:47 -05:00
Andrei Stefan	2908b7e5fc	SQL: add support for passing query parameters in REST API calls (#51029 ) (#51222 ) * REST PreparedStatement-like query parameters are now supported in the form of an array of non-object, non-array values where ES SQL parser will try to infer the data type of the value being passed as parameter. (cherry picked from commit 45b8bf619aecb1c03d7bc0cf06928dcc36005a66)	2020-01-20 16:40:19 +02:00
István Zoltán Szabó	424b4ed4ea	[DOCS] Expands the documentation of Node Query Cache (#51105 ) Co-authored-by: debadair <debadair@elastic.co>	2020-01-20 11:13:29 +01:00
Jess	4b31ad1c0c	[Docs] Small edits to Ranking Evaluation API docs (#51116 ) Small updates to grammar, syntax, and unclear wordings.	2020-01-20 10:30:23 +01:00
István Zoltán Szabó	e40580c24d	[DOCS] Removes CCS limitation item from Transforms limitations. (#51151 )	2020-01-20 09:44:11 +01:00
Jason Tedor	9ce4d2b901	Initial autoscaling commit (#51161 ) This commit merely adds the skeleton for the autoscaling project, adding the basics to include the autoscaling module in the default distribution, opt-in to code formatting, and a placeholder for the docs.	2020-01-17 15:31:12 -05:00
István Zoltán Szabó	83c92cf7eb	[DOCS] Adds text about data types to the categorization docs (#51145 )	2020-01-17 10:00:20 -08:00
lcawl	fee1a9528c	[DOCS] Removes duplicate title	2020-01-17 09:49:42 -08:00
Jay Modi	107989df3e	Introduce hidden indices (#51164 ) This change introduces a new feature for indices so that they can be hidden from wildcard expansion. The feature is referred to as hidden indices. An index can be marked hidden through the use of an index setting, `index.hidden`, at creation time. One primary use case for this feature is to have a construct that fits indices that are created by the stack that contain data used for display to the user and/or intended for querying by the user. The desire to keep them hidden is to avoid confusing users when searching all of the data they have indexed and getting results returned from indices created by the system. Hidden indices have the following properties: * API calls for all indices (empty indices array, _all, or ) will not return hidden indices by default. Wildcard expansion will not return hidden indices by default unless the wildcard pattern begins with a `.`. This behavior is similar to shell expansion of wildcards. * REST API calls can enable the expansion of wildcards to hidden indices with the `expand_wildcards` parameter. To expand wildcards to hidden indices, use the value `hidden` in conjunction with `open` and/or `closed`. * Creation of a hidden index will ignore global index templates. A global index template is one with a match-all pattern. * Index templates can make an index hidden, with the exception of a global index template. * Accessing a hidden index directly requires no additional parameters. Backport of #50452	2020-01-17 10:09:01 -07:00
Dimitris Athanasiou	b70ebdeb96	[7.x][ML] DF Analytics _explain API should skip object fields (#51115 ) (#51147 ) Object fields cannot be used as features. At the moment the _explain API includes them and, even worse, does not error when an object field is excluded. This gives the user the expectation that all child fields will also be excluded, which is not the case. This commit omits object fields from the _explain API and also adds an error if an object field is included or excluded. Backport of #51115	2020-01-17 14:02:59 +02:00
James Rodewig	2353fe47fc	[DOCS] Adds placeholder for 7.5.2 release notes (#51124 ) Co-Authored-By: Lisa Cawley <lcawley@elastic.co>	2020-01-16 14:42:24 -05:00
James Rodewig	b6bf64b969	[DOCS] Collapse node stats response sections (#51063 ) elastic/docs#1687 added support for the `[%collapsible]` Asciidoc attribute, which creates collapsible sections in the HTML output. This PR makes two related changes to the nodes stats API documentation: * Makes the response parameter sections collapsible. This allows users to more easily navigate the page without long walls of text. * Reorders the response parameter sections to match the default order returned by the API. Relates to #47524.	2020-01-16 13:19:29 -05:00
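
Note on commit 40e7a826fc (Allow decimal max_task_wait_time in docs): the message quotes the old pattern but not the adjusted one. The Python sketch below shows the kind of change described. OLD_PATTERN is the quoted pattern with the asciidoc escaping removed; NEW_PATTERN and the matches() helper are assumptions made for illustration, not the regex actually committed.

```python
import re

# Pattern quoted in the commit message, unescaped: whole numbers only.
OLD_PATTERN = re.compile(r"(-|\d+(micros|ms|s))")

# Hypothetical adjusted pattern (an assumption, not the committed regex):
# allows an optional fractional part between the digits and the unit.
NEW_PATTERN = re.compile(r"(-|\d+(\.\d+)?(micros|ms|s))")


def matches(pattern, value):
    """Return True if the whole value matches the pattern."""
    return pattern.fullmatch(value) is not None


if __name__ == "__main__":
    # Compare both patterns on a placeholder, an integer time, and a decimal time.
    for value in ["-", "12ms", "3.9ms"]:
        print(value, matches(OLD_PATTERN, value), matches(NEW_PATTERN, value))
```

Under these assumptions, a value such as `3.9ms` is rejected by the old pattern and accepted by the new one, while `-` and `12ms` match both.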


6580 Commits