OpenSearch

Commit Graph

Author	SHA1	Message	Date
Marios Trivyzas	f958e9abdc	SQL: Implement scripting inside aggs (#55241 ) (#55371 ) Implement the use of scalar functions inside aggregate functions. This allows for complex expressions inside aggregations, with or without GROUP BY as well as with or without a HAVING clause. e.g.: ``` SELECT MAX(CASE WHEN a IS NULL then -1 ELSE abs(a * 10) + 1 END) AS max, b FROM test GROUP BY b HAVING MAX(CASE WHEN a IS NULL then -1 ELSE abs(a * 10) + 1 END) > 5 ``` Scalar functions are still not allowed for `KURTOSIS` and `SKEWNESS` as this is currently not implemented on the Elasticsearch side. Fixes: #29980 Fixes: #36865 Fixes: #37271 (cherry picked from commit 506d1beea7abb2b45de793bba2e349090a78f2f9)	2020-04-17 12:41:22 +02:00
Lisa Cawley	c7cf6e621d	[DOCS] Remove text fields from classification dependent variables (#54849 )	2020-04-16 13:40:28 -07:00
Lisa Cawley	cf5278f771	[DOCS] Add ml-cpp PRs to 7.7 release notes (#55264 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-04-16 11:28:34 -07:00
Julie Tibshirani	d7cded8d7a	Fix updating include_in_parent/include_in_root of nested field. (#55326 ) The main changes are: 1. Throw an error when updating `include_in_parent` or `include_in_root` attribute of nested field dynamically by the PUT mapping API. 2. Add a test for the change. Closes #53792 Co-authored-by: bellengao <gbl_long@163.com>	2020-04-16 11:17:12 -07:00
James Rodewig	f0b9be8b1b	[DOCS] Reformat `flatten_graph` token filter (#54268 ) * [DOCS] Reformat `flatten_graph` token filter Makes the following changes to the `flatten_graph` token filter docs: * Rewrites description and adds Lucene link * Adds detailed analyze example * Adds analyzer example	2020-04-16 08:35:08 -04:00
Bogdan Pintea	b88dd47de3	Docs: add the change log for 7.7 (#55019 ) * Add the change log for 7.7 Add the change log for 7.7 * Update rel. notes to latest state (BC5) Update the release notes to current state (i.e. BC5). * Update docs/reference/release-notes/7.7.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-15 15:25:08 -04:00
Lisa Cawley	f0b9578684	[DOCS] Removes transform performance note (#55177 )	2020-04-15 10:42:52 -07:00
James Rodewig	4f2ab96f38	[DOCS] EQL: Document `indexOf` function (#55071 )	2020-04-15 11:29:50 -04:00
James Rodewig	8d6f0f6a76	[DOCS] Document `max_concurrent_searches` default (#55116 )	2020-04-15 10:04:23 -04:00
Benjamin Trent	8ff2cbf1a3	[7.x] [ML] adding prediction_field_type to inference config (#55128 ) (#55230 ) * [ML] adding prediction_field_type to inference config (#55128) Data frame analytics dynamically determines the classification field type. This field type then dictates the encoded JSON that is written to Elasticsearch. Inference needs to know about this field type so that it may provide the EXACT SAME predicted values as analytics. Here is added a new field `prediction_field_type` which indicates the desired type. Options are: `string` (DEFAULT), `number`, `boolean` (where close_to(1.0) == true, false otherwise). Analytics provides the default `prediction_field_type` when the model is created from the process.	2020-04-15 09:45:22 -04:00
Jake Landis	85139fad7e	[7.x] Advise a simpler curator migration (#54457 ) (#55188 ) Advice for migrating from Curator should simply be to phase out curator managed indices, since curator will ignore ILM indices https://www.elastic.co/guide/en/elasticsearch/client/curator/5.7/ilm-and-curator.html#ilm-and-curator. Co-authored-by: Jay Greenberg <PhaedrusTheGreek@users.noreply.github.com>	2020-04-15 07:55:31 -05:00
Lisa Cawley	2910d01179	[DOCS] Removes unshared sections from ml-shared.asciidoc (#55192 )	2020-04-14 18:47:09 -07:00
Yang Wang	f49354b7d7	Add migration notes for deprecating local parameter of get field mapping API (#55194 ) This is a follow-up for #55099 to add migration notes about the deprecation of local parameter for get field mappings API.	2020-04-15 11:38:05 +10:00
Igor Motov	1754e50cbd	[7.x] Add analytics plugin usage stats to _xpack/usage (#54911 ) (#55162 ) Adds analytics plugin usage stats to _xpack/usage. Closes #54847	2020-04-14 17:03:14 -04:00
James Rodewig	12130843ca	[DOCS] Add maintenance releases to upgrade table (#55012 ) Updates the supported upgrade path table in [Upgrade Elasticsearch][0] to include a new row for maintenance releases. For example, this row covers upgrading from 7.6.0 to 7.6.2. The new table row only displays for releases greater than n.x.0. For example, the new row will display for the 7.7.1 release but not the 7.7.0 release. [0]: https://www.elastic.co/guide/en/elasticsearch/reference/master/setup-upgrade.html	2020-04-14 11:28:55 -04:00
James Rodewig	3fbd8b371f	[DOCS] Use consistent line breaks in EQL function docs	2020-04-14 10:17:45 -04:00
Yannick Welsch	a610513ec7	Provide repository-level stats for searchable snapshots (#55051 ) Provides basic repository-level stats that will allow us to get some insight into how many requests are actually being made by the underlying SDK. Currently only tracks GET and LIST calls for S3 repositories. Most of the code is unfortunately boilerplate to add a new endpoint that will help us better understand some of the low-level dynamics of searchable snapshots.	2020-04-14 14:34:08 +02:00
lcawl	fcd96db006	[DOCS] Edits create data frame analytics job API (#54751 )	2020-04-13 10:43:52 -07:00
Nhat Nguyen	96bb1164f0	Support hierarchical task cancellation (#54757 ) With this change, when a task is canceled, the task manager will cancel not only its direct child tasks but also all of its descendant tasks. Closes #50990	2020-04-13 12:35:21 -04:00
Igor Motov	51c6f69e02	[7.x] Add support for filters to T-Test aggregation (#54980 ) (#55066 ) Adds support for filters to T-Test aggregation. The filters can be used to select populations based on some criteria and use values from the same or different fields. Closes #53692	2020-04-13 12:28:58 -04:00
James Rodewig	57d6493e29	[DOCS] EQL: Document `string` function (#55086 )	2020-04-13 11:23:45 -04:00
Peter Dyson	f0b6cf4c11	[DOCS] Note where ILM policies are stored and backup caveats (#54859 )	2020-04-13 09:11:16 -06:00
Vishal Patel	16921ebbd8	[DOCS] Collapse nested objects in Explore API docs (#55067 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-04-13 09:27:03 -04:00
Ioannis Kakavas	7a8a66d9ae	[7.x] Fix ReloadSecureSettings API to consume password (#54771 ) (#55059 ) The secure_settings_password was never taken into consideration in the ReloadSecureSettings API. This commit fixes that and adds necessary REST layer testing. Doing so, it also: - Allows TestClusters to have a password protected keystore so that it can be set for tests. - Adds a parameter to the run task so that Elasticsearch can be run with a password protected keystore from source.	2020-04-13 09:50:55 +03:00
Yang Wang	862799956c	Deprecate local parameter for get field mapping request (#55014 ) (#55099 ) The usage of local parameter for GetFieldMappingRequest has been removed from the underlying transport action since v2.0. This PR deprecates the parameter from rest layer. It will be removed in next major version.	2020-04-12 13:48:47 +10:00
James Rodewig	2655dfa2fe	[DOCS] EQL: Reword field support for EQL functions (#55074 ) Changes boilerplate sentence of "If using a field as the argument, this parameter only supports..." to "...this parameter supports only...". The latter is a bit more clear and readable.	2020-04-10 15:33:29 -04:00
Jason Tedor	d1137ebdaa	Passthrough special characters in thread pool docs (#55080 ) Some of these characters are special to Asciidoctor and they ruin the rendering on this page. Instead, we use a macro to passthrough these characters without Asciidoctor applying any substitutions to them. This commit then addresses some rendering issues in the thread pool docs. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-04-10 15:11:19 -04:00
Nik Everett	b99a50bcb9	value_count Aggregation optimization (backport of #54854 ) (#55076 ) We found some problems during the test. Data: 200 million docs, 1 shard, 0 replicas.

| hits | avg | sum | value_count |
| ----------- | ------- | ------- | ----------- |
| 20,000 | .038s | .033s | .063s |
| 200,000 | .127s | .125s | .334s |
| 2,000,000 | .789s | .729s | 3.176s |
| 20,000,000 | 4.200s | 3.239s | 22.787s |
| 200,000,000 | 21.000s | 22.000s | 154.917s |

The performance of `avg`, `sum`, and the other metrics is very close, but the performance of `value_count` has always been poor, not even within the same order of magnitude. Based on common sense, we think that `value_count` and `sum` are similar operations and should take about the same time. We therefore looked into the `value_count` aggregation. Counting in Elasticsearch works by traversing the field of each document. If the field holds an ordinary value, the count is increased by 1. If it holds an array, the count is increased by n. The problem lies in traversing each document and taking out the field, which turns data on disk into an object in the Java language. We summarize the current problems in Elasticsearch as:

- Number-to-string cast overhead, and GC problems caused by a large number of strings
- After the number is converted to a string, sorting and other unnecessary operations are performed

Here is the proof of the type conversion overhead.

```
// Java long to string source code, getChars is very time-consuming.
public static String toString(long i) {
    int size = stringSize(i);
    if (COMPACT_STRINGS) {
        byte[] buf = new byte[size];
        getChars(i, size, buf);
        return new String(buf, LATIN1);
    } else {
        byte[] buf = new byte[size * 2];
        StringUTF16.getChars(i, size, buf);
        return new String(buf, UTF16);
    }
}
```

| test type | average | min | max | sum |
| ------------ | ------- | ---- | ----------- | ------- |
| double->long | 32.2ns | 28ns | 0.024ms | 3.22s |
| long->double | 31.9ns | 28ns | 0.036ms | 3.19s |
| long->String | 163.8ns | 93ns | 1921 ms | 16.3s |

The long->String conversion is particularly serious. Our optimization is actually very simple: handle the different types separately instead of uniformly converting everything to a string. We added type identification in ValueCountAggregator, and made special treatment for number and geopoint types to cancel their type conversion. Because fewer strings and string constants are created, the improvement is very obvious.

| hits | avg | sum | value_count double (before) | value_count double (after) | value_count keyword (before) | value_count keyword (after) | value_count geo_point (before) | value_count geo_point (after) |
| ----------- | ------- | ------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- |
| 20,000 | .038s | .033s | .063s | .026s | .030s | .030s | .038s | .015s |
| 200,000 | .127s | .125s | .334s | .078s | .116s | .099s | .278s | .031s |
| 2,000,000 | .789s | .729s | 3.176s | .439s | .348s | .386s | 3.365s | .178s |
| 20,000,000 | 4.200s | 3.239s | 22.787s | 2.700s | 2.500s | 2.600s | 25.192s | 1.278s |
| 200,000,000 | 21.000s | 22.000s | 154.917s | 18.990s | 19.000s | 20.000s | 168.971s | 9.093s |

- The results are more in line with common sense. `value_count` now takes about the same time as `avg`, `sum`, etc., or even less. Previously, `value_count` took much longer than `avg` and `sum`, and was not even within the same order of magnitude when the amount of data was large.
- When calculating numeric types such as `double` and `long`, the performance is improved by about 8 to 9 times; when calculating the `geo_point` type, the performance is improved by 18 to 20 times.	2020-04-10 13:16:39 -04:00
James Rodewig	c440754784	[DOCS] EQL: Document `wildcard` function (#54086 )	2020-04-10 09:18:29 -04:00
oneoneonepig	356cc94889	[DOCS] Fix double quote typo in 7.0 breaking changes (#55040 )	2020-04-10 09:11:51 -04:00
Jason Tedor	9eeae59a83	Clarify available processors (#54907 ) The use of available processors, the terminology, and the settings around it have evolved over time. This commit cleans up some places in the codes and in the docs to adjust to the current terminology.	2020-04-10 08:48:27 -04:00
James Rodewig	51326432be	[DOCS] Add query reference docs template (#52292 )	2020-04-10 08:47:54 -04:00
James Rodewig	d5a609a2e5	[DOCS] Add token filter reference docs template (#52290 ) Creates a reusable template for token filter reference documentation. Contributors can make a copy of this template and customize it when documenting new token filters.	2020-04-10 08:45:10 -04:00
Marios Trivyzas	bf0cadb602	SQL: Implement DATETIME_PARSE function for parsing strings (#54960 ) (#55035 ) Implement DATETIME_PARSE(<datetime_str>, <pattern_str>) function which parses a datetime string according to the specified pattern into a datetime object. The patterns allowed are those of java.time.format.DateTimeFormatter. Relates to #53714 (cherry picked from commit 3febcd8f3cdf9fdda4faf01f23a5f139f38b57e0)	2020-04-10 01:16:29 +02:00
Vishal Patel	51cb0c5c7b	[DOCS] Collapse nested objects in cluster reroute docs (#54851 )	2020-04-09 15:29:22 -04:00
István Zoltán Szabó	374f633b6e	[DOCS] Adds link points to the data frame analytics supported fields (#55004 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-04-09 11:27:57 -07:00
James Rodewig	c6cd8ca7c0	[DOCS] Update upgrade docs for 7.7 (#54978 )	2020-04-08 16:23:08 -04:00
James Rodewig	964cf565c9	[DOCS] EQL: Document `between` function (#54950 )	2020-04-08 13:49:15 -04:00
Théophile Helleboid - chtitux	a8aa36d427	[DOCS] Fix typo in SLM retention docs (#54797 )	2020-04-08 08:56:45 -04:00
Marios Trivyzas	6afd60b082	SQL: Implement DATETIME_FORMAT function for date/time formatting (#54832 ) (#54942 ) Implement DATETIME_FORMAT(<date/datetime/time>, ) function which allows for formatting a timestamp to the specified format. The patterns allowed are those of java.time.format.DateTimeFormatter. Related to #53714 (cherry picked from commit 72be0b54a9299e87e785469cdc9aafac2a48c046)	2020-04-08 13:45:47 +02:00
István Zoltán Szabó	3a3effedc2	[DOCS] Reworks some parts of EMM API docs (#54872 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-04-08 10:20:34 +02:00
Julie Tibshirani	475b210eec	Improve guidance on removing default mappings. (#54915 ) In 7.x, an index template will fail to apply if it contains a `_default_` mapping. Several users have expressed confusion over the fact that loading the template doesn't show any default mappings. This docs change clarifies that in order to see all mappings in the template, you must pass `include_type_name`.	2020-04-07 15:18:13 -07:00
James Rodewig	9569a8eb13	[DOCS] Add example to "avoid scripts" advice (#54719 ) Adds a detailed example to the "Avoid scripts" section of the "Tune for search speed" docs. The detail outlines how a script used to transform indexed data can be moved to ingest. The update also removes an outdated reference to supported script languages.	2020-04-07 15:25:10 -04:00
Jason Tedor	d1d478debf	Update docs to reflect node.processors (#54855 ) We namespaced the previous setting "processors" into "node.processors". This commit updates some of the documentation to reflect this.	2020-04-07 13:06:14 -04:00
Lisa Cawley	a7599031ae	[DOCS] Adds transform node to list of default types (#54850 )	2020-04-07 08:49:05 -07:00
Ignacio Vera	076c199484	Add new point field. (#53804 ) (#54879 ) This commit adds a new point field that is able to index arbitrary pair of values (x/y) in the cartesian space. It only supports filtering using shape queries at the moment.	2020-04-07 15:28:50 +02:00
Tanguy Leroux	4d36917e52	Merge feature/searchable-snapshots branch into 7.x (#54803 ) (#54825 ) This is a backport of #54803 for 7.x. This pull request cherry picks the squashed commit from #54803 with the additional commits: 6f50c92 which adjusts master code to 7.x a114549 to mute a failing ILM test (#54818) 48cbca1 and 50186b2 that cleans up and fixes the previous test aae12bb that adds a missing feature flag (#54861) 6f330e3 that adds missing serialization bits (#54864) bf72c02 that adjust the version in YAML tests a51955f that adds some plumbing for the transport client used in integration tests Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Yannick Welsch <yannick@welsch.lu> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> Co-authored-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-07 13:28:53 +02:00
Ioannis Kakavas	3560c0cbf2	Remove `_xpack` from license API example (#54698 ) (#54763 ) Resolves #54662	2020-04-07 09:51:37 +03:00
Lisa Cawley	b3d5300968	[DOCS] Collapses sections in put snapshot lifecycle policy API (#54834 ) (#54840 )	2020-04-06 13:46:56 -07:00
James Rodewig	e9c3bfc8e5	[DOCS] Collapse nested objects in node stats API response (#54755 ) Replaces dot notation with collapsed nested object formatting per the [Elastic API reference template][0]. [0]:https://github.com/elastic/docs/blob/master/shared/api-ref-ex.asciidoc	2020-04-06 15:19:54 -04:00
James Rodewig	548ad03941	[DOCS] Collapse nested objects in cluster stats API response (#54739 ) Replaces dot notation with collapsed nested object formatting per the [Elastic API reference template][0]. [0]:https://github.com/elastic/docs/blob/master/shared/api-ref-ex.asciidoc	2020-04-06 13:11:46 -04:00
Igor Motov	2794572a35	[7.x] Add Student's t-test aggregation support (#54469 ) (#54737 ) Adds t_test metric aggregation that can perform paired and unpaired two-sample t-tests. In this PR support for filters in unpaired is still missing. It will be added in a follow-up PR. Relates to #53692	2020-04-06 11:36:47 -04:00
Nhat Nguyen	2fdbed7797	Broadcast cancellation to only nodes that have outstanding child tasks (#54312 ) Today when canceling a task we broadcast ban/unban requests to all nodes in the cluster. This strategy does not scale well for hierarchical cancellation. With this change, we will track outstanding child requests and broadcast the cancellation to only nodes that have outstanding child tasks. This change also prevents a parent task from sending child requests once it has been canceled. Relates #50990 Supersedes #51157 Co-authored-by: Igor Motov <igor@motovs.org> Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-04-06 11:11:29 -04:00
István Zoltán Szabó	7dc1ba4273	[DOCS] Updates transform prerequisites (#54804 )	2020-04-06 17:07:59 +02:00
Christoph Büscher	def519ea70	[Docs] Correct date rounding example for `range` query (#51524 ) Looking into #50237 I realized that two of the examples given in the documentation around date math rounding for range queries on date fields using `gt` and `lt` are slightly off by a nanosecond. This PR changes this to the bounds that are currently parsed using these parameters.	2020-04-06 17:05:45 +02:00
István Zoltán Szabó	4cba1e6368	[DOCS] Changes kibana_user to kibana_admin in DFA API prerequisites. (#54806 )	2020-04-06 15:46:18 +02:00
Jason Tedor	184c038f59	Add get autoscaling policy API (#54762 ) This commit adds the get autoscaling policy API.	2020-04-04 18:04:25 -04:00
James Baiera	705e46d5c1	[DOCS] Remove unused cat tasks request parameters (#54539 ) (#54741 )	2020-04-03 15:33:28 -04:00
Lisa Cawley	de91d2aeea	[DOCS] Collapse nested objects in CCR APIs (#54697 )	2020-04-03 12:04:33 -07:00
Bogdan Pintea	7cef89e084	ODBC: Document the new VarcharLimit and EarlyExecution params (#54632 ) * Document VarcharLimit and EarlyExecution params Add the documentation for the newly added VarcharLimit and EarlyExecution DSN attributes. * Remove obsolete VersionChecking param This param had been removed already along the #53082 work. * Update docs/reference/sql/endpoints/odbc/configuration.asciidoc fix typo Co-Authored-By: Stuart Cam <stuart@codebrain.co.uk> * Update docs/reference/sql/endpoints/odbc/configuration.asciidoc fix typo Co-Authored-By: Stuart Cam <stuart@codebrain.co.uk> (cherry picked from commit f38761631a12b38f7f075635f7ac61dc96656cd7)	2020-04-03 14:46:39 +02:00
markharwood	2da2305587	Backport of lowercase normalizer PR #53882 A pre-configured normalizer for lower-casing. Closes #53872	2020-04-03 11:43:40 +01:00
István Zoltán Szabó	d025b90cd1	[DOCS] Makes PUT inference API docs collapsible (#54653 ) Co-authored-by: lcawl <lcawley@elastic.co>	2020-04-03 09:48:53 +02:00
Lisa Cawley	11afead21e	[DOCS] Adds collapsible sections to rollup APIs (#54690 )	2020-04-02 17:51:16 -07:00
Lisa Cawley	b138dc4565	[DOCS] Add processing details to get transforms stats API (#54368 ) (#54595 )	2020-04-02 16:12:57 -07:00
lcawl	d3fa0ec730	[DOCS] Fixes typo in node settings	2020-04-02 16:01:54 -07:00
Lisa Cawley	98965116fe	[DOCS] Clarify ML and transform settings on coordinating nodes (#54676 )	2020-04-02 15:38:15 -07:00
Julie Tibshirani	5fb7602227	Disallow changing 'enabled' on the root mapper. (#54681 ) In #33933 we disallowed changing the `enabled` parameter in object mappings. However, the fix didn't cover the root object mapper. This PR adjusts the change to also include the root mapper and clarifies the error message.	2020-04-02 15:28:48 -07:00
David Turner	4e083cd97d	indices.recovery.max_bytes_per_sec may be per-node (#54633 ) The `indices.recovery.max_bytes_per_sec` recovery bandwidth limit can differ between nodes if it is not set dynamically, but today this is not obvious. This commit adds a paragraph to its documentation clarifying how to set different bandwidth limits on each node. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-02 18:15:41 +01:00
Benjamin Trent	4a1610265f	[7.x] [ML] add new inference_config field to trained model config (#54421 ) (#54647 ) * [ML] add new inference_config field to trained model config (#54421) A new field called `inference_config` is now added to the trained model config object. This new field allows for default inference settings from analytics or some external model builder. The inference processor can still override whatever is set as the default in the trained model config. * fixing for backport	2020-04-02 12:25:10 -04:00
Benjamin Trent	65233383f6	[7.x] [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121 ) (#54645 ) * [ML] prefer secondary authorization header for data[feed\|frame] authz (#54121) Secondary authorization headers are to be used to facilitate Kibana spaces support + ML jobs/datafeeds. Now on PUT/Update/Preview datafeed, and PUT data frame analytics the secondary authorization is preferred over the primary (if provided). closes https://github.com/elastic/elasticsearch/issues/53801 * fixing for backport	2020-04-02 11:20:25 -04:00
qiye	de8e0200fe	[DOCS] Correct `shape` field release in 7.5 release highlights (#54631 ) The `shape` field was added in 7.4, not 7.3. This corrects a small error in the 7.5 release highlights.	2020-04-02 09:19:40 -04:00
Benjamin Trent	eb31be0e71	[7.x] [ML] add num_matches and preferred_to_categories to category definition objects (#54214 ) (#54639 ) * [ML] add num_matches and preferred_to_categories to category definition objects (#54214) This adds two new fields to category definitions. - `num_matches` indicating how many documents have been seen by this category - `preferred_to_categories` indicating which other categories this particular category supersedes when messages are categorized. These fields are only guaranteed to be up to date after a `_flush` or `_close` native change: https://github.com/elastic/ml-cpp/pull/1062 * adjusting for backport	2020-04-02 09:09:19 -04:00
Jason Tedor	54ecb009bb	Add delete autoscaling policy API (#54601 ) This commit adds an API for deleting autoscaling policies.	2020-04-02 09:05:12 -04:00
Aleh Zasypkin	161eac1942	[7.x] Switch to the most recent Kibana configuration format and SAML/OIDC endpoints. (#54624 )	2020-04-02 11:59:11 +02:00
István Zoltán Szabó	3a8e880fe6	[DOCS] Adds snippet comparing two indices to the painless examples (#54563 )	2020-04-02 08:46:25 +02:00
Russ Cam	110d9a7845	Correct description for mget API request URI index (#52305 ) This commit corrects the description for the request URI index for the Multi Get (mget) API. The index can only be a single index name (multiple or wildcard expressions not supported), and acts as the index to use when "ids" are specified, or a document in the "docs" array does not specify an index. (cherry picked from commit aa4926ed7f91dfbf7973a01b1e4682e91dda11a9)	2020-04-02 09:08:34 +10:00
lcawl	949636944c	[DOCS] Fixes shared attribute for feature importance	2020-04-01 14:52:08 -07:00
Jason Tedor	ccecb78c98	Rename the policy in put autoscaling policy docs The put autoscaling policy docs use a "hot" policy as an example. Instead, this commit changes the name of this policy to "my_autoscaling_policy".	2020-04-01 16:32:03 -04:00
Jason Tedor	bd6b383926	Include autoscaling APIs in API docs (#54603 ) This commit adds a top-level link to the autoscaling API reference page to the API docs. Additionally, we add a conditional guard on the API pages to only include them in development builds of the docs.	2020-04-01 15:44:13 -04:00
lcawl	2cd35bf696	[DOCS] Adds release highlights placeholder	2020-04-01 09:22:20 -07:00
Lisa Cawley	f5ccf939d9	[DOCS] Clarifies API key breaking change (#54522 )	2020-04-01 08:58:15 -07:00
James Rodewig	21abc311fd	[DOCS] Reformat `keyword_repeat` token filter (#54428 )	2020-04-01 11:56:05 -04:00
James Rodewig	4982b720ef	[DOCS] EQL: Document `length` function (#54225 )	2020-04-01 11:35:36 -04:00
James Rodewig	b43eb5ac32	[DOCS] EQL: Document `endsWith` function (#54521 )	2020-04-01 10:43:37 -04:00
Dan Hermann	11bfbd8bbf	Rename examples in ILM guide to avoid association with data streams (#54579 )	2020-04-01 09:05:20 -05:00
István Zoltán Szabó	27f88fcdac	[DOCS] Updates estimate model memory docs (#54574 )	2020-04-01 15:55:25 +02:00
James Rodewig	95622d8782	[DOCS] EQL: Document `startsWith` function (#54518 ) (#54578 )	2020-04-01 09:30:27 -04:00
James Rodewig	92d570d6f3	[DOCS] EQL: Add search/index speed tip for functions (#54346 ) (#54575 ) EQL functions are an easy way for users to transform indexed data at search time. However, using multiple functions can make queries difficult to write and slows search speeds. Users can circumvent this by indexing fields containing the transformed data, but that usually slows index speeds. This adds a related tip and example covering these tradeoffs.	2020-04-01 08:39:04 -04:00
bellengao	e9c201b446	[DOCS] Correct field name in date_nanos doc (#54556 )	2020-04-01 14:22:32 +02:00
Jason Tedor	f670ae0bc8	Introduce autoscaling policies (#54473 ) This commit is the first in a series of commits that introduces autoscaling policies, and APIs for working with them. For now, we introduce the basic infrastructure, and a single API for putting an autoscaling policy. We will follow in rapid succession with APIs for getting, and deleting autoscaling policies.	2020-04-01 08:12:26 -04:00
István Zoltán Szabó	325b8ec0ce	[DOCS] Adds data_counts object to the GET DFA stats API (#54498 )	2020-04-01 10:07:28 +02:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
James Rodewig	114894dd76	[DOCS] Add redirect for changed anchor ID (#54533 ) The anchor ID for the snapshot repository plugins section in the docs was recently changed from `_repository_plugins` to `snapshots-repository-plugins`. This adds a corresponding redirect so no links are broken.	2020-03-31 16:42:16 -04:00
James Rodewig	7401191019	[DOCS] Include 7.7.0 release notes (#54529 ) Includes the 7.7.0 release notes so they render in the HTML docs. Also removes a few legacy `coming[7.6.0]` tags.	2020-03-31 16:23:49 -04:00
Lisa Cawley	922ec8e961	[DOCS] Collapses nested objects in data frame analytics APIs (#54472 ) (#54526 )	2020-03-31 12:51:04 -07:00
Glen Smith	0d4a001ef2	[DOCS] Clarify cluster health status during rolling upgrade (#40757 ) Remove mention of the `yellow` and `red` starting health status from the rolling upgrade docs. Instead, we should emphasize that users wait for the node to recover with a health status of `green` rather than the starting status. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-03-31 14:13:05 -04:00
Dimitris Athanasiou	0b25e3b66c	[7.x][ML] Fix DF analytics explain API request in docs (#54510 ) (#54514 ) The explain API expects a data frame analytics config as its request. Backport of #54410	2020-03-31 18:56:52 +03:00
István Zoltán Szabó	a5497cd9e0	[DOCS] Adds HTTP response count example to Painless examples (#54412 )	2020-03-31 15:12:58 +02:00
István Zoltán Szabó	eeb23e9e73	[DOCS] Adds description of analysis_stats object and its properties to GET DFA stats API docs (#53881 ) Co-authored-by: Valeriy Khakhutskyy <1292899+valeriy42@users.noreply.github.com> Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-03-31 13:30:06 +02:00
James Rodewig	ce7539ce6c	[DOCS] Fix broken deprecated macros (#54466 ) Updates formatting for `deprecated` macros in the translog and synced flush docs. Previously, these macros were rendering literally.	2020-03-30 17:16:27 -04:00
Lisa Cawley	0fa1060ca4	[DOCS] Collapses content in machine learning APIs (#54234 ) (#54453 )	2020-03-30 11:06:33 -07:00
James Rodewig	ed1edb4964	[DOCS] Add missing word to keyword marker token filter docs	2020-03-30 10:52:14 -04:00
James Rodewig	21f362a2a8	[DOCS] Add a lowercase email example to keyword tokenizer docs (#53257 )	2020-03-30 09:06:04 -04:00
Benjamin Trent	374e76d7cd	[Transform] fixing naming in HLRC and _cat to match API content (#54300 ) (#54408 ) Fixing the naming of the HLRC values to match the ToXContent field names (i.e. the field names returned from an API call). Also fixes the names in the _cat API as well. closes #53946	2020-03-30 08:57:02 -04:00
István Zoltán Szabó	00eaa0ebe5	[DOCS] Changes scripted metric to filter aggs in transforms example (#54167 )	2020-03-30 09:51:07 +02:00
Jason Tedor	f0033783db	Deprecate node local storage setting (#54374 ) This setting is not documented and has dubious value since it means there can be nodes in the cluster (non-data and non-master nodes) that do not have persistent node IDs. This does not have any use cases so this commit removes the setting.	2020-03-28 14:36:41 -04:00
Gil Raphaelli	2984a54b7f	[DOCS] Fix typos in top metrics agg docs (#54299 )	2020-03-27 10:49:21 -04:00
AndyHunt66	2dd8946539	[DOCS] Remove redundant sentence in ingest processor docs (#54329 )	2020-03-27 08:25:09 -04:00
Christoph Büscher	f7ea794312	[Test] Don't expect specific scores in docs tests (#54297 ) The failing suggester documentation test was expecting specific scores in the test response. Scores are a fragile implementation detail that can change, e.g., with different Lucene versions, and generally shouldn't be asserted in documentation tests. Instead we usually replace the float values in the output response by the ones in the actual response. Closes #54257	2020-03-27 10:27:47 +01:00
Lisa Cawley	27cd5b343c	[DOCS] Augments cat transforms API (#53776 ) (#54232 )	2020-03-26 07:56:46 -07:00
Jason Tedor	d8f745736b	Clarify the remove keystore command can handle many (#54244 ) The remove keystore command can handle multiple settings. In a few places, we were not consistent about mentioning this. This commit addresses this, in the CLI help, and the docs.	2020-03-26 08:49:43 -04:00
Luca Cavanna	ff269160af	Async search: rename REST parameters (#54198 ) This commit renames wait_for_completion to wait_for_completion_timeout in submit async search and get async search. Also it renames clean_on_completion to keep_on_completion and turns around its behaviour. Closes #54069	2020-03-26 09:40:50 +01:00
István Zoltán Szabó	487b273286	[DOCS] Adds feature importance mapping subsection to inference processor docs (#54190 )	2020-03-26 09:26:50 +01:00
Jason Tedor	6af89e62d1	Allow keystore add-file to handle multiple settings (#54240 ) Today the keystore add-file command can only handle adding a single setting/file pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add-file keystore command to accept adding multiple settings in a single invocation.	2020-03-26 00:07:05 -04:00
Jason Tedor	4fbc0e9ab8	Complete keystore CLI options documentation (#54242 ) The documentation was missing the long option for the force option, and the short option for the stdin option. This commit addresses this by adding these to the documentation.	2020-03-25 23:53:34 -04:00
Jason Tedor	fe8257d981	Allow keystore add to handle multiple settings (#54229 ) Today the keystore add command can only handle adding a single setting/value pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add keystore command to accept adding multiple settings in a single invocation.	2020-03-25 22:58:20 -04:00
James Rodewig	30a32040d3	[DOCS] EQL: Document `substring` function (#53867 ) Adds documentation for the EQL `substring` function. Supporting changes: * Creates a new "EQL function reference" page * Updates the title of the "EQL syntax reference" page for consistency * Adds a brief "Functions" section to the EQL syntax docs * Updates EQL limitations docs to state that only array functions are unsupported	2020-03-25 12:23:59 -04:00
Nik Everett	16e4bd50e2	Add breaking change note for #53669	2020-03-25 09:31:14 -04:00
James Rodewig	74051d68a8	[DOCS] Reformat `keyword_marker` token filter (#54076 ) Makes the following changes to the `keyword_marker` token filter docs: * Rewrites description and adds Lucene link * Adds detailed analyze example * Rewrites parameter definitions * Adds custom analyzer and filter example	2020-03-25 09:26:06 -04:00
James Rodewig	2fdf6b2f96	[DOCS] Document missing data types for node stats API's response parameters (#53475 ) Documents missing data types for several response parameters returned by the node stats API. Also adds several missing human-readable parameters returned by the API.	2020-03-25 08:42:58 -04:00
Jason Tedor	381d7586e4	Introduce formal role for remote cluster client (#54138 ) This commit introduces a formal role for identifying nodes that are capable of making connections to remote clusters. Relates #53924	2020-03-24 21:59:43 -04:00
David Roberts	7667004b20	[ML] Add a model memory estimation endpoint for anomaly detection (#54129 ) A new endpoint for estimating anomaly detection job model memory requirements: POST _ml/anomaly_detectors/estimate_model_memory Backport of #53507	2020-03-24 22:55:11 +00:00
Jim Ferenczi	55f2e8bff0	[DOCS] Add 7.6.2 release notes (#53720 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: lcawl <lcawley@elastic.co>	2020-03-24 22:42:25 +01:00
markharwood	6a60f85bba	Wildcard field - add normalizer support (#53851 ) (#54109 ) Backport support for normalisation to wildcard field Closes #53603	2020-03-24 17:37:47 +00:00
Lisa Cawley	88b1b2f36f	[DOCS] Adds transform security privileges (#53908 )	2020-03-24 09:35:45 -07:00
Tim Brooks	caefa78513	Align remote info api with new settings (#54102 ) Currently the remote info api has added a number of possible fields (proxy, num_socket_connections, etc) that are available in proxy mode. These fields are not aligned with what the settings are named. This commit modifies this API to align with the settings.	2020-03-24 10:27:24 -06:00
Luca Cavanna	6b457abbd3	Async search: prevent users from overriding pre_filter_shard_size (#54088 ) Submit async search forces pre_filter_shard_size for the underlying search that it creates. With this commit we also prevent users from overriding such default as part of request validation.	2020-03-24 17:06:04 +01:00
Ahmet Arslan	52062565a9	[DOCS] Correct DFI docs regarding stop word removal (#53836 ) The documentation of DFI should recommend not to [remove stop words][1], since DFI is good at scoring queries that contain common terms: `the wall`, `the sun`, `the who`, etc. [1]:https://lucene.apache.org/core/8_1_1/core/org/apache/lucene/search/similarities/DFISimilarity.html	2020-03-24 10:48:42 -04:00
Karen Metts	9da589c5fd	[DOCS] Replace outdated Logstash monitoring link (#54032 ) Replaces a link to Logstash OSS-only content with a link to the general Logstash monitoring topic.	2020-03-24 10:03:31 -04:00
David Roberts	1421471556	[ML] Introduce a "starting" datafeed state for lazy jobs (#54065 ) It is possible for ML jobs to open lazily if the "allow_lazy_open" option in the job config is set to true. Such jobs wait in the "opening" state until a node has sufficient capacity to run them. This commit fixes the bug that prevented datafeeds for jobs lazily waiting assignment from being started. The state of such datafeeds is "starting", and they can be stopped by the stop datafeed API while in this state with or without force. Backport of #53918	2020-03-24 13:00:04 +00:00
Hendrik Muhs	7dcacf531f	[7.x][Transform][Rollup] add processing stats to record the ti… (#54027 ) add 2 additional stats: processing time and processing total which capture the time spent for processing results and how often it ran. The 2 new stats correspond to the existing indexing and search stats. Together with indexing and search this now allows the user to see the full picture, all 3 stages.	2020-03-24 09:22:02 +01:00
Jason Tedor	e3ca124537	Introduce autoscaling decisions (#53934 ) This is the first in a series of commits that will introduce the autoscaling deciders framework. This commit introduces the basic framework for representing autoscaling decisions.	2020-03-23 23:08:06 -04:00
Jim Ferenczi	9e3f7f4575	Add heuristics to compute pre_filter_shard_size when unspecified (#53873 ) (#54007 ) This commit changes the pre_filter_shard_size default from 128 to unspecified. This allows heuristics based on the request and the target indices to be applied when deciding whether the can match phase should run or not. When unspecified, this PR runs the can match phase automatically if one of these conditions is met: * The request targets more than 128 shards. * The request contains read-only indices. * The primary sort of the query targets an indexed field. Users can opt out of this behavior by setting the `pre_filter_shard_size` to a static value. Closes #39835	2020-03-24 02:05:15 +01:00
Julie Tibshirani	df7cfb3a5b	Remove the top-level 'mapping type' section. (#54035 ) It seemed confusing for users that our top-level mapping page still had a prominent section named 'Mapping Type'. This PR reworks the docs to remove this reference and adds a note about types removal (similar to the note we added to other APIs like put mapping).	2020-03-23 15:34:23 -07:00
James Rodewig	43199a8c82	[DOCS] Remove double space in WDG docs	2020-03-23 17:18:04 -04:00
James Rodewig	553d8a9ca9	[DOCS] Fix "letter case" typo Changes "lettercase" to "letter case" in the `uppercase` token filter docs.	2020-03-23 17:11:59 -04:00
Paweł Krześniak	c0534f4157	[DOCS] link fix (#53973 ) Fix bad link in top_metrics.	2020-03-23 14:20:54 -04:00
Luca Cavanna	932a7e3112	Backport of async search changes (#53976 ) * Get Async Search: omit _clusters section when empty (#53907) The _clusters section is omitted by the search API whenever no remote clusters are searched. Async search should do the same, but Get Async Search returns a deserialized response, hence a weird `_clusters` section with all values set to `0` gets returned instead. In fact the recreated Clusters object is not the same object as the EMPTY constant, yet it has the same content. This commit addresses this by changing the comparison in the `toXContent` method to not print out the section if the number of total clusters is `0`. * Async search: remove version from response (#53960) The goal of the version field was to quickly show when you can expect to find something new in the search response, compared to when nothing has changed. This can also be done by looking at the `_shards` section and `num_reduce_phases` returned with the search response. In fact when there has been one or more additional reduction of the results, you can expect new results in the search response. Otherwise, the `_shards` section could notify of additional failures of shards that have completed the query, but that is not a guarantee that their results will be exposed (only when the following partial reduction is performed their results will be available). That said this commit clarifies this in the docs and removes the version field from the async search response * Async Search: replicas to auto expand from 0 to 1 (#53964) This way single node clusters that are green don't go yellow once async search is used, while all the others still have one replica. * [DOCS] address timing issue in async search docs tests (#53910) The docs snippets for submit async search have proven difficult to test as it is not possible to guarantee that you get a response that is not final, even when providing `wait_for_completion=0`. In the docs, though, we want to show a proper long-running query, and its first response should be partial rather than final. With this commit we adapt the docs snippets to show a partial response, and replace under the hood all that's needed to make the snippets tests succeed when we get a final response. Also, increased the timeout so we always get a final response. Closes #53887 Closes #53891	2020-03-23 19:13:31 +01:00
Lisa Cawley	c4260ba3c7	[DOCS] Fixes formatting in transform overview (#53900 )	2020-03-23 10:21:54 -07:00
C.J. Jameson	5ac8b795ba	[DOCS] Clarify routing enforcement in docs (#53945 ) Removes a mention of the `_doc` mapping type that's no longer applicable now that mapping types are removed/deprecated.	2020-03-23 12:57:07 -04:00
Lisa Cawley	5f45780577	[DOCS] Adds data nanos transform limitation (#53826 )	2020-03-23 09:49:21 -07:00
Lisa Cawley	8b3ef4ac21	[DOCS] Add generated_dest_index to preview transform API (#53905 )	2020-03-23 09:43:44 -07:00
Marios Trivyzas	af03200ad6	SQL: Extend DATE_TRUNC to also operate on intervals(elastic - #46632 ) (#47720 ) (#53972 ) The function is extended to operate on intervals according to the PostgreSQL: https://www.postgresql.org/docs/9.1/functions-datetime.html#FUNCTIONS-DATETIME-TRUNC Closes : #46632 (cherry picked from commit 2dc79505825fa75e0711dcfa8e9c69e8028fc979) Co-authored-by: musteaf <gs_mustea@hotmail.com>	2020-03-23 15:05:16 +01:00
Dan Hermann	0528e2b06b	Document the deprecation of the 'auth.password' setting (#53822 )	2020-03-23 08:58:25 -05:00
István Zoltán Szabó	534e734ad1	[DOCS] Adds painless transform examples (#53274 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-03-23 10:39:47 +01:00
Przemyslaw Gomulka	015ad019d5	[docs] Known issue about joda patterns on 7.6 (#53957 )	2020-03-23 10:28:55 +01:00
Przemyslaw Gomulka	412e163cf6	[Doc] migration guide joda (#51986 ) The joda to java.time migration requires users to upgrade their mappings. We allow them to still use 6.x created indices with joda patterns in 7 but ask them to upgrade their patterns in 7.x. This migration guide is to help them understand how they could be affected and what needs to be changed in their mappings. closes #51614 closes #51236	2020-03-23 08:29:01 +01:00
Mark Vieira	0cfe6d90cc	Mute async-search test	2020-03-20 11:35:24 -07:00
Luca Belluccini	3937854444	[DOCS] Clarify upgrade paths (#53417 ) Unsupported upgrade paths: - `6.8 to 7.0`. Supported, but requires a full cluster restart: - `6.0–6.7 directly to 7.x`	2020-03-20 18:05:07 +00:00
Lisa Cawley	c2af015ee7	[DOCS] Updates list of transform aggs (#53820 )	2020-03-20 10:28:23 -07:00

7033 Commits