OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	181ee3ae0b	[ML] specifying missing_field_value value and using it instead of empty_string (#53108 ) (#53165 ) For analytics, we need a consistent way of indicating when a value is missing. Inheriting from anomaly detection, analysis sent `""` when a field is missing. This works fine with numbers, but the underlying analytics process actually treats `""` as a category in categorical values. Consequently, you end up with this situation in the resulting model ``` { "frequency_encoding" : { "field" : "RainToday", "feature_name" : "RainToday_frequency", "frequency_map" : { "" : 0.009844409027270245, "No" : 0.6472019970785184, "Yes" : 0.6472019970785184 } } } ``` For inference this is a problem, because inference will treat missing values as `null`. And thus not include them on the infer call against the model. This PR takes advantage of our new `missing_field_value` option and supplies `\0` as the value.	2020-03-05 09:50:52 -05:00
István Zoltán Szabó	48707ec55a	[DOCS] Expands GET DFA stat API docs with response objects. (#53107 )	2020-03-05 15:31:55 +01:00
Aleksandr Maus	2dc872f052	EQL: Add HLRC for EQL stats (#53043 ) (#53148 )	2020-03-05 09:20:38 -05:00
Adrien Grand	360ac1997f	Fix test failures with the new `constant_keyword` field. (#53153 ) This test failed because YAML tests randomly install an index template that updates the default number of shards to 2. Closes #53131	2020-03-05 14:29:13 +01:00
Marios Trivyzas	487d442760	Implement Exitable DirectoryReader (#52822 ) (#53162 ) Implement an Exitable DirectoryReader that wraps the original DirectoryReader so that when a search task is cancelled the DirectoryReaders also stop their work fast. This is usuful for expensive operations like wilcard/prefix queries where the DirectoryReaders can spend lots of time and consume resources, as previously their work wouldn't stop even though the original search task was cancelled (e.g. because of timeout or dropped client connection). (cherry picked from commit 67acaf61f33bc5f54e26541514d07e375c202e03)	2020-03-05 14:17:31 +01:00
Nik Everett	28df7ae5ed	Support multiple metrics in `top_metrics` agg (backport of #52965 ) (#53163 ) This adds support for returning multiple metrics to the `top_metrics` agg. It looks like: ``` POST /test/_search?filter_path=aggregations { "aggs": { "tm": { "top_metrics": { "metrics": [ {"field": "v"}, {"field": "m"} ], "sort": {"s": "desc"} } } } } ```	2020-03-05 08:12:01 -05:00
David Roberts	01504df876	[TEST] Force close failed job before skipping test (#53128 ) The assumption added in #52631 skips a problematic test if it fails to create the required conditions for the scenario it is supposed to be testing. (This happens very rarely.) However, before skipping the test it needs to remove the failed job it has created because the standard test cleanup code treats failed jobs as fatal errors. Closes #52608	2020-03-05 10:52:41 +00:00
Armin Braun	204c366a4e	Upgrade GCS SDK to 1.104.0 (#52839 ) (#53152 ) Upgrading the GCS SDK to the most recent version. Adjusting (i.e. improving) the REST mock accordingly. This should significantly boost performance by pulling in https://github.com/googleapis/java-core/issues/86 in some cases.	2020-03-05 11:18:18 +01:00
Alan Woodward	3cd4b97618	Remove UnknownNamedObjectException (#53105 ) This was originally thrown from NamedXContentRegistry#parseNamedObject() but that method now throws a NamedObjectNotFoundException, so this is unused.	2020-03-05 10:06:59 +00:00
James Rodewig	e46bb54c7b	[DOCS] Document `any` keyword in EQL syntax (#52821 ) (#53157 ) Adds documentation for the `any` keyword to the EQL syntax docs. Includes: * Definition of an event category and its relationship to the event category field. * Example matching all event categories using `any` keyword * Example using `any` with `where true`	2020-03-05 05:02:47 -05:00
Ignacio Vera	058113aa42	upgrade to lucene-snapshot-fa75139efea (#53150 ) (#53151 )	2020-03-05 10:04:05 +01:00
Tanguy Leroux	52d4807f8d	Mute GoogleCloudStorageBlobStoreRepositoryTests on jdk8 (#53119 ) Tests in GoogleCloudStorageBlobStoreRepositoryTests are known to be flaky on JDK 8 (#51446, #52430 ) and we suspect a JDK bug (https://bugs.openjdk.java.net/browse/JDK-8180754) that triggers some assertion on the server side logic that emulates the Google Cloud Storage service. Sadly we were not able to reproduce the failures, even when using the same OS (Debian 9, Ubuntu 16.04) and JDK (Oracle Corporation 1.8.0_241 [Java HotSpot(TM) 64-Bit Server VM 25.241-b07]) of almost all the test failures on CI. While we spent some time fixing code (#51933, #52431) to circumvent the JDK bug they are still flaky on JDK-8. This commit mute these tests for JDK-8 only. Close ##52906	2020-03-05 09:18:05 +01:00
Lisa Cawley	859c6441b3	[DOCS] Adds PKI delegation.enabled example (#53030 )	2020-03-04 14:59:45 -08:00
Nik Everett	302980e0c4	Remove some ceremony in agg parsing (#53078 ) (#53117 ) With #50871 aggrgations should now be parsed directly by an `ObjectParser` or `ConstructingObjectParser` without the need for the ceremonial `parse` method. This removes 9 of those `parse` methods and parses the aggregation directly from their `ObjectParser`.	2020-03-04 13:06:41 -05:00
Ross Wolf	a5e82d7fd6	EQL: Add explicit 'any where ...' handling (#52526 )	2020-03-04 10:11:03 -07:00
Tim Brooks	f68917160e	Fix RemoteConnectionManager size() method (#52823 ) Currently the remote connection manager will delegate the size() call to the underlying cluster connection manager. This introduces the possibility that call will return 1 before the nodeConnection method has been triggered to add the connection to the remote connection list. This can cause issues, as the ensureConnected method checks the connection managers size and executes synchronously if the size is > 0. This leads to a potential cluster not connected exception while we are still waiting for the connection opened callback to be triggered. This commit fixes this issue by using the remote connection manager's size to report the connection manager's size. Fixes #52029.	2020-03-04 09:53:22 -07:00
Yannick Welsch	8ab74fea58	[7.x] Add 7.6.2 as version (#53114 )	2020-03-04 10:39:09 -06:00
Jake Landis	f08ed1f69a	[7.x] add 6.8.8 as version (#53021 )	2020-03-04 10:38:07 -06:00
Alan Woodward	dfebbbf862	BoolQueryBuilder uses ObjectParser (#52880 ) This commit removes the hand-rolled x-content parsing logic from BoolQueryBuilder and instead uses an ObjectParser to handle parsing. It also removes the long-deprecated (since version 6) disable_coord parameter.	2020-03-04 15:48:38 +00:00
Mark Vieira	812d981d4d	Update to reflect version of SLES used in CI Signed-off-by: Mark Vieira <portugee@gmail.com>	2020-03-04 07:38:28 -08:00
Nik Everett	609c61f75c	Formalize usage stats for analytics (backport of #52966 ) (#53077 ) This moves the usage statistics gathering from the `AnalyticsPlugin` into an `AnalyicsUsage`, removing the static state. It also checks the license level when parsing all analytics aggregations. This is how we were checking them before but we did it in an easy to forget way. This way is slightly simpler, I think.	2020-03-04 10:29:11 -05:00
James Rodewig	801e50203e	[DOCS] Add missing doc type to EQL search results	2020-03-04 10:26:11 -05:00
Martijn van Groningen	3fa5395ac8	Use correct issue number: #52453	2020-03-04 16:17:55 +01:00
Martijn van Groningen	2e325e24cb	Mute testMonitorClusterHealth test (#53109 ) Relates to #36782	2020-03-04 16:08:19 +01:00
Yannick Welsch	d1e7951e00	[DOCS] Add 7.6.1. release notes (#52874 ) Adds the release notes for 7.6.1.	2020-03-04 15:47:54 +01:00
Martijn van Groningen	b77f6746d1	unmute watcher single node test case relates to #36782	2020-03-04 15:25:17 +01:00
James Rodewig	e3d3c3400c	[DOCS] Update EQL default event category and timestamp values (#53102 ) Updates the documented default `event_category_field` and `timestamp_field` values for the EQL search API. Also updates related guidance in the EQL requirement docs. Relates to #53073.	2020-03-04 09:17:37 -05:00
James Rodewig	0c4bf64095	[DOCS] Fix several Asciidoctor double arrow replacements (#52827 ) Per the [Asciidoctor docs][0], Asciidoctor replaces the following syntax with double arrows in the rendered HTML: * => renders as ⇒ * <= renders as ⇐ This escapes several unintended replacements, such as in the Painless docs. Where appropriate, it also replaces some double arrow instances with single arrows for consistency. [0]: https://asciidoctor.org/docs/user-manual/#replacements	2020-03-04 08:43:19 -05:00
James Rodewig	2b59f8ac34	[DOCS] Correct `hits.total.relation` response parm def (#52847 ) Fixes a partially completed definition for the `hits.total.relation` response parameter in the search API docs.	2020-03-04 08:23:34 -05:00
Aleksandr Maus	b47bffba24	EQL: consistent naming for event type vs event category (#53073 ) (#53090 ) Related to https://github.com/elastic/elasticsearch/issues/52941	2020-03-04 08:02:38 -05:00
Marios Trivyzas	e180e2738a	SQL: [Tests] Add tests for optimization of aliased expressions (#53048 ) Add a unit test to verify that the optimization of expression (e.g. COALESCE) is applied to all instances of the expression: SELECT, WHERE, GROUP BY and HAVING. Relates to #35270 (cherry picked from commit 2ceedc7f2019fad92cd86679af1a9c6fa594aa8d)	2020-03-04 11:48:06 +01:00
Marios Trivyzas	1d5c842700	SQL: Fix column size for IP data type (#53056 ) Set size/displaySize to 45 which is the maximum string for an IP (v6), since IPs are returned as strings. Fixes: #52762 (cherry picked from commit 815f01747a4d54a274ca248af6fc08e5ea0728c1)	2020-03-04 10:36:44 +01:00
Mark Vieira	4b528d97ad	Consolidate duplication of BWC testing task setup in script plugin (#53079 ) (cherry picked from commit 33fc8e7ebfac8d47a5f9f026b3836bb47bea141a)	2020-03-03 14:43:02 -08:00
Zachary Tong	3fcf598b92	Reduce deprecation log noise from DateIntervalWrapper (#52655 ) Converts the deprecations to `deprecatedAndMaybeLog` to reduce the number of times we log deprecations, since some of these could be called at a high frequency (due to unconverted queries, aggs, etc)	2020-03-03 17:08:10 -05:00
Jay Modi	c610e0893d	Introduce system index APIs for Kibana (#53035 ) This commit introduces a module for Kibana that exposes REST APIs that will be used by Kibana for access to its system indices. These APIs are wrapped versions of the existing REST endpoints. A new setting is also introduced since the Kibana system indices' names are allowed to be changed by a user in case multiple instances of Kibana use the same instance of Elasticsearch. Additionally, the ThreadContext has been extended to indicate that the use of system indices may be allowed in a request. This will be built upon in the future for the protection of system indices. Backport of #52385	2020-03-03 14:11:36 -07:00
Nik Everett	7339427af5	Remove some deprecation warnings parsing aggs (backport of #53026 ) (#53072 ) With #50871 aggrgations should now be parsed directly by an `ObjectParser` or `ConstructingObjectParser` without the need for the ceremonial `parse` method. This removes 10 of those `parse` methods and parses the aggregation directly from their `ObjectParser`.	2020-03-03 15:27:49 -05:00
Lisa Cawley	892f0d5848	[DOCS] Adds link in datafeed indices_options (#53067 )	2020-03-03 10:31:32 -08:00
Lisa Cawley	bb4bcbc54c	[DOCS] Adds ignore_throttled to index conventions (#53063 )	2020-03-03 10:23:56 -08:00
James Rodewig	cf87724ff6	[DOCS] Reformat `stop` token filter (#53059 ) Makes the following changes to the `stop` token filter docs: * Updates description * Adds a link to the related Lucene filter * Adds detailed analyze snippet * Updates custom analyzer and custom filter snippets * Adds a list of predefined stop words by language Co-authored-by: ScottieL <36999642+ScottieL@users.noreply.github.com>	2020-03-03 13:22:52 -05:00
Andrei Stefan	9ad9ad7a6b	SQL: update SqlNodeSubclassTests list of min-two-parameters functions list (#53045 ) (#53058 ) (cherry picked from commit c741e49d9f5e7b78c1a78e1af97eb19354fe6864)	2020-03-03 19:37:37 +02:00
William Brafford	d3a8ac66c6	Handle special chars in JAVA_HOME in elasticsearch-service.bat (#52676 ) (#53057 ) * Handle special chars in JAVA_HOME in elasticsearch-service.bat (#52676) * Test case for windows service where JAVA_HOME path contains spaces (#53028) Co-authored-by: Muhammad Shaheer Akram <41253927+shaheerakr@users.noreply.github.com>	2020-03-03 12:01:54 -05:00
István Zoltán Szabó	6cece3a709	[DOCS] Adds response body documentation to GET inference API (#53050 )	2020-03-03 16:26:40 +01:00
Luca Cavanna	8a05b670ca	Address MinAndMax generics warnings (#52642 ) `MinAndMax` encapsulates min and max values for a shard. It uses generics to make sure that the values are of the same type and are also comparable. Though there are warnings whenever this class is currently used, which are addressed with this commit. Relates to #49092	2020-03-03 16:08:10 +01:00
Adrien Grand	cb868d2f5e	Introduce a `constant_keyword` field. (#49713 ) (#53024 ) This field is a specialization of the `keyword` field for the case when all documents have the same value. It typically performs more efficiently than keywords at query time by figuring out whether all or none of the documents match at rewrite time, like `term` queries on `_index`. The name is up for discussion. I liked including `keyword` in it, so that we still have room for a `singleton_numeric` in the future. However I'm unsure whether to call it `singleton`, `constant` or something else, any opinions? For this field there is a choice between 1. accepting values in `_source` when they are equal to the value configured in mappings, but rejecting mapping updates 2. rejecting values in `_source` but then allowing updates to the value that is configured in the mapping This commit implements option 1, so that it is possible to reindex from/to an index that has the field mapped as a keyword with no changes to the source. Backport of #49713	2020-03-03 16:01:47 +01:00
James Rodewig	bcb68c860c	[DOCS] Reorganize EQL requirements page	2020-03-03 07:02:30 -05:00
Yang Wang	70814daa86	Allow _rollup_search with read privilege (#52043 ) (#53047 ) Currently _rollup_search requires manage privilege to access. It should really be a read only operation. This PR changes the requirement to be read indices privilege. Resolves: #50245	2020-03-03 22:29:54 +11:00
Alan Woodward	3759063d34	Allow specifying an exclusive set of fields on ObjectParser (#52893 ) ObjectParser allows you to declare a set of required fields, such that at least one of the set must appear in an xcontent object for it to be valid. This commit adds the similar concept of a set of exclusive fields, such that at most one of the set must be present. It also enables required fields on ConstructingObjectParser, and re-implements PercolateQueryBuilder.fromXContent() to use object parsing as an example of how this works.	2020-03-03 10:56:20 +00:00
Martijn van Groningen	510db25dd0	Simplify watcher indexing listener.(#53046 ) Backport: #52627 Add watcher to trigger server after index operation has succeeded, instead of adding a watch to trigger service before the actual index operation has performed on the shard level. This logic is simpler to reason about in the case that a failure does occur during the execution of an index operation on the shard level. Relates to #52453, but I think doesn't fix it, but makes it easier to debug.	2020-03-03 11:01:57 +01:00
Hendrik Muhs	844f350774	[Transform] restructure transform yaml tests (#52956 ) restructure transform yaml tests to run cleanup in teardown phase relates #52428	2020-03-03 10:31:22 +01:00
Hendrik Muhs	d9258e210e	[Transform] fix sporadic race condition in TransformUsageIT (#52946 ) relax the test for trigger count fixes #52931	2020-03-03 10:27:36 +01:00

1 2 3 4 5 ...

50353 Commits All Branches Search

50353 Commits

All Branches