OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ioannis Kakavas	6c90727166	Fix custom policy in plugins in FIPS 140 (#52046 ) (#57049 ) Our FIPS 140 testing depends on setting the appropriate java policy in order to configure the JVM in FIPS mode. Some tests ( discovery-ec2 and ccr qa ) also needed to set a custom policy file to grant a specific permission, which overwrote the FIPS related policy and tests would fail. This change ensures that when a custom policy needs to be set in these tests, the permissions that are necessary for FIPS are also set. Resolves: #51685, #52034	2020-05-21 19:26:56 +03:00
Benjamin Trent	f00dfb2d5f	[ML] adds WKT support in filestructurefinder (#57014 ) (#57032 ) Field mapping detection is done via grok patterns. This commit adds well-known text (WKT) formatted geometry detection. If everything is a `POINT`, then a `geo_point` mapping is preferred. Otherwise, if all the fields are WKT geometries a `geo_shape` mapping is preferred. This does NOT detect other types of formatted geometries (geohash, comma delimited points, etc.) closes https://github.com/elastic/elasticsearch/issues/56967	2020-05-21 08:22:51 -04:00
markharwood	eb8cb31d46	Update Lucene version to 8.6.0-snapshot-9d6c738ffce (#57024 ) Same version as master	2020-05-21 11:28:16 +01:00
James Rodewig	37e2bb7057	[DOCS] Add watcher multi-doc index ex (#52040 ) (#57011 ) Adds an example snippet for creating a `_doc` payload field with the Watcher `index` action. Co-authored-by: Luiz Guilherme Pais dos Santos <luiz.santos@elastic.co>	2020-05-20 16:57:45 -04:00
Brandon Morelli	ec41d36c62	docs: update links to beats security docs (#56875 ) (#56953 )	2020-05-20 11:28:39 -07:00
Bogdan Pintea	ec4a6aa1c6	SQL: JDBC: fix temporary directory locked test errors in Windows (#56917 ) * Fix temp dir locked errors The tests involving a temporary directory (containing the JDBC JAR) fail on Windows because the directory can't be deleted while it is still in use. This commit forces a premature closing of the JAR file, which mitigates the failure by giving the JVM more time to collect any open FDs. (Calling System.gc() in the tests is another working alternative fix.) The stream-based JAR access is taken care of by disabling the cache usage (cherry picked from commit 04f97333a015404a68e8f19223f33aadeb396687)	2020-05-20 19:46:57 +02:00
Florian Kelbert	edada6bc39	[Docs] Insert missing colon (#56980 )	2020-05-20 15:49:17 +02:00
Benjamin Trent	ee4ce8ecec	Fix geotile_grid group_by field mapping (#56939 ) (#56990 ) The original implementation utilized `bbox` as the index mapping type. This would not work as it would have to be `envelope`. But, given that `envelope` and `polygon` are tessellated in the same way, we choose to use `polygon` as the geo_shape type. This makes it easier to support in other places in the stack (a la kibana maps)	2020-05-20 08:22:13 -04:00
Alan Woodward	18bfbeda29	Move merge compatibility logic from MappedFieldType to FieldMapper (#56915 ) Merging logic is currently split between FieldMapper, with its merge() method, and MappedFieldType, which checks for merging compatibility. The compatibility checks are called from a third class, MappingMergeValidator. This makes it difficult to reason about what is or is not compatible in updates, and even what is in fact updateable - we have a number of tests that check compatibility on changes in mapping configuration that are not in fact possible. This commit refactors the compatibility logic so that it all sits on FieldMapper, and makes it called at merge time. It adds a new FieldMapperTestCase base class that FieldMapper tests can extend, and moves the compatibility testing machinery from FieldTypeTestCase to here. Relates to #56814	2020-05-20 09:43:13 +01:00
Marios Trivyzas	644ae49817	SQL: Fix behaviour of COUNT(DISTINCT <literal>) (#56869 ) (#56932 ) Previously `COUNT(DISTINCT <literal>)` was returning the same result as `COUNT(<literal>)` which is not correct as it should always return 1 if there is at least one matching row (bucket if there is a GROUP BY), or 0 otherwise. (cherry picked from commit 7f7d7562d43034907f432d39d0d66f490d78f4a8)	2020-05-19 11:19:06 +02:00
Yannick Welsch	f296c08021	Increase timeout for assertLongBusy in AutoFollowIT (#56910 ) Closes #56891	2020-05-18 16:20:46 +02:00
Benjamin Trent	297f864884	[ML] relax throttling on expired data cleanup (#56711 ) (#56895 ) Throttling nightly cleanup as much as we do has been overcautious. Nightly cleanup should be more lenient in its throttling. We still keep the same batch size, but now the requests per second scale with the number of data nodes. If we have more than 5 data nodes, we don't throttle at all. Additionally, the API now has `requests_per_second` and `timeout` set. So users calling the API directly can set the throttling. This commit also adds a new setting `xpack.ml.nightly_maintenance_requests_per_second`. This will allow users to adjust throttling of the nightly maintenance.	2020-05-18 08:46:42 -04:00
David Kyle	0fac152188	Mute AsyncSearchActionIT (#56897 ) For #56765	2020-05-18 13:36:33 +01:00
Ioannis Kakavas	bb852ab2e7	Cause is tracked in #49094 (#56887 )	2020-05-18 15:03:38 +03:00
David Kyle	52a329fa12	Mute sql.client.VersionTests suite (#56883 ) For #56882	2020-05-18 10:15:30 +01:00
Bogdan Pintea	de7dd6154e	Fix range of version number generation in test (#56849 ) The version number component can't equal or exceed the revision multiplier. This fixes the VersionTests unit test. (cherry picked from commit 7d2331a2818ae20024c5c3617cd4433f90e9c098)	2020-05-16 08:59:45 +02:00
Andrei Stefan	4d47d63f55	SQL: implement SUM, MIN, MAX, AVG over literals (#56786 ) (#56850 ) * Adds support for MIN, MAX, AVG, SUM aggregates acting on literals. SELECT SUM(1) FROM index and SELECT SUM(1), AVG(2) work both on indices and as local execution. (cherry picked from commit efb72907c0391612c4a2b6256e327060b4167912)	2020-05-16 02:13:55 +03:00
Jake Landis	813609b47c	Ensure that .watcher-history-11* template is installed prior to use (#56734 ) WatcherIndexTemplateRegistry as of https://github.com/elastic/elasticsearch/pull/52962 requires all nodes to be on 7.7.0 before it allows the version 11 index template to be installed. While in a mixed cluster, nothing prevents Watcher from running on the new host before all of the nodes are on 7.7.0. This will result in the .watcher-history-11* index without the proper mappings. Without the proper mapping a single document (for a large watch) can exceed the default 1000 field limit and cause errors to show in the logs. This commit ensures the same logic for writing to the index is applied as for installing the template. In a mixed cluster, the `10` index template will continue to be written. Only once all of the nodes are on 7.7.0+ will the `11` index template be installed and used. closes #56732	2020-05-15 16:29:04 -05:00
Dimitris Athanasiou	54d3cc74ec	[7.x][ML] Ensure class is represented when its cardinality is low (#56783 ) (#56829 ) In DF analytics classification, it is possible to use no samples of a class if its cardinality is too low. This commit fixes this by ensuring the target sample count can never be zero. Backport of #56783	2020-05-15 20:52:06 +03:00
Bogdan Pintea	14ad733bd1	SQL: JDBC: fix access to the Manifest for non-entry JAR URLs (#56797 ) (#56839 ) * JDBC: fix access to the Manifest for non-entry JAR The JDBC driver will attempt to read its version from the Manifest file embedded into its JAR. The URL pointing to the JAR can be provided in a few ways. So far, accessing the Manifest was attempted by getting a URLConnection out of the URL and then getting an input stream out of this connection. For file JAR URLs, this only works however if the URL points to the driver as a JAR file entry (i.e. <sub-url>!/jdbc-driver.jar!/). If that's not the case, the JarURLConnection will throw an IOException. This commit fixes that: in case the URL points to a JAR entry (jar:file:<path>/jdbc-driver.jar!/), the manifest is read directly with JarURLConnection#getManifest(). (cherry picked from commit 2175b7b01cf5fcf3ab2bb21404a9bd454a8df3f0) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-05-15 19:35:54 +02:00
James Baiera	4809db3ff9	EnrichProcessorFactory should not throw NPE if missing metadata (#55977 ) (#56793 ) In some cases the Enrich processor factory may be called before it is ready to create processors. While these calls are usually made in error, the response from the Enrich processor is an NPE which is almost always an unhelpful error when debugging an issue.	2020-05-15 12:02:13 -04:00
Ioannis Kakavas	239ada1669	Test adjustments for FIPS 140 (#56526 ) This change aims to fix our setup in CI so that we can run 7.x in FIPS 140 mode. The major issue that we have in 7.x and did not have in master is that we can't use the diagnostic trust manager in FIPS mode in Java 8 with SunJSSE in FIPS approved mode as it explicitly disallows the wrapping of X509TrustManager. Previous attempts like #56427 and #52211 focused on disabling the setting in all of our tests when creating a Settings object or on setting fips_mode.enabled accordingly (which implicitly disables the diagnostic trust manager). The attempts weren't future proof though as nothing would forbid someone to add new tests without setting the necessary setting and forcing this would be very inconvenient for any other case ( see #56427 (comment) for the full argumentation). This change introduces a runtime check in SSLService that overrides the configuration value of xpack.security.ssl.diagnose.trust and disables the diagnostic trust manager when we are running in Java 8 and the SunJSSE provider is set in FIPS mode.	2020-05-15 18:10:45 +03:00
Benjamin Trent	f71c305090	[7.x] [Transform] add support for terms agg in transforms (#56696 ) (#56809 ) * [Transform] add support for terms agg in transforms (#56696) This adds support for `terms` and `rare_terms` aggs in transforms. The default behavior is that the results are collapsed in the following manner: `<AGG_NAME>.<BUCKET_NAME>.<SUBAGGS...>...` Or if no sub aggs exist `<AGG_NAME>.<BUCKET_NAME>.<_doc_count>` The mapping is also defined as `flattened` by default. This is to avoid field explosion while still providing (limited) search and aggregation capabilities.	2020-05-15 08:08:43 -04:00
David Roberts	270a23e422	[TEST] Fix log tail mocking in native process unit tests (#56804 ) This is a followup to #56632. Tests that had to be changed to mock the C++ log handler more accurately need to be more careful about when that stream ends, as ending of that stream is used to detect crashes in the production system. Fixes #56796	2020-05-15 12:46:37 +01:00
Alan Woodward	d33d13f2be	Simplify generics on Mapper.Builder (#56747 ) Mapper.Builder currently has some complex generics on it to allow fluent builder construction. However, the second parameter, a return type from the build() method, is unnecessary, as we can use covariant return types. This commit removes this second generic parameter.	2020-05-15 12:14:49 +01:00
David Turner	27a090232e	Suppress Kerberos tests on JDK15 (#56767 ) Somewhat convoluted AwaitsFix for #56507 that only applies on JDK15.	2020-05-15 07:41:04 +01:00
Yang Wang	c66e7ecbfe	Fix test failure of file role store auto-reload (#56398 ) (#56802 ) Ensure assertion is only performed when we can be sure that the desired changes are picked up by the file watcher.	2020-05-15 15:10:45 +10:00
Ryan Ernst	9fb80d3827	Move publishing configuration to a separate plugin (#56727 ) This is another part of the breakup of the massive BuildPlugin. This PR moves the code for configuring publications to a separate plugin. Most of the time these publications are jar files, but this also supports the zip publication we have for integ tests.	2020-05-14 20:23:07 -07:00
Tal Levy	5e90ff32f7	Add Normalize Pipeline Aggregation (#56399 ) (#56792 ) This aggregation will perform normalizations of metrics for a given series of data in the form of bucket values. The aggregation supports the following normalizations - rescale 0-1 - rescale 0-100 - percentage of sum - mean normalization - z-score normalization - softmax normalization The normalization to use is specified in the normalize agg's `normalizer` field. For example: ``` { "normalize": { "buckets_path": <>, "normalizer": "percent" } } ```	2020-05-14 17:40:15 -07:00
Mark Vieira	0fd756d511	Enforce strict license distribution requirements (#56642 )	2020-05-14 13:57:56 -07:00
Jake Landis	a22aabcc15	[7.x] Reduce chance for test failure due to schedule (#56633 ) (#56695 ) If CI is running tests at exactly 0 or 5 minutes past the hour the ack-watch docs tests may fail with a 409 error if the ack test happens to run at the exact time that the schedule watch is running. This commit changes the public documentation (and the test) for the ack to a Feb 29th at noon schedule. Neither the docs nor the tests really care about the schedule date, and this one is chosen since it is a valid date, but one that is extremely unlikely to cause issues.	2020-05-14 15:52:04 -05:00
Costin Leau	6f4af43405	EQL: Skip execution for filters with empty results (#56718 ) Optimize away events queries and joins/sequence that cannot match any results without having to query the backend. (cherry picked from commit 69c8ef8cfefd8fc6dcb6d1a566bfcd537068e3e4)	2020-05-14 22:38:23 +03:00
Mark Tozzi	b718193a01	Clean up DocValuesIndexFieldData (#56372 ) (#56684 )	2020-05-14 12:42:37 -04:00
Dimitris Athanasiou	ac5902624c	[7.x][ML] Improve error upon DF analytics mappings conflict (#56700 ) (#56776 ) Adds the conflicting types and an example of an index which specifies them in order to make it easier for the user to understand the conflict. Backport of #56700	2020-05-14 19:16:10 +03:00
Jim Ferenczi	fb5e6329b7	Stop/Start async search maintenance service in tests (#56673 ) This change ensures that the maintenance service that is responsible for deleting expired responses is stopped between each test. This is needed since we check that no search contexts are in flight after each test method. Fixes #55988	2020-05-14 15:13:01 +02:00
David Turner	bec6821fe6	AwaitsFix for #56755	2020-05-14 11:46:05 +01:00
Alexander Reelsen	3a263d91f6	Ensure watcher email action message ids are always unique (#56574 ) If an email action is used in a foreach loop, message ids could have been duplicated, which then get rejected by the mail server. This commit introduces an additional static counter in the email action in order to ensure that every message id is unique.	2020-05-14 10:36:00 +02:00
Przemysław Witek	98fbd85290	[7.x] Add scope-related fields to Annotation (#56417 ) (#56681 )	2020-05-14 10:23:13 +02:00
Andrei Stefan	ddf4e47e86	EQL: fix QueryFolderOkTests (#56714 ) (#56728 ) (cherry picked from commit 8b21ccd0eac3b3d0fbd090152b3dff6ae5217b52)	2020-05-14 10:58:25 +03:00
David Roberts	3051c37f92	[ML] Tail the C++ logging pipe before connecting other pipes (#56701 ) Prior to this change the named pipes that connect the ML C++ processes to the Elasticsearch JVM were all opened before any of them were read from or written to. This created a problem, where if the C++ process logged more messages between opening the log pipe and opening the last pipe to be connected than there was space for in the named pipe's buffer then the C++ process would block. This would mean it never got as far as opening the last named pipe, so the JVM would never get as far as reading from the log pipe, hence a deadlock. This change alters the connection order so that the JVM starts reading from the logging pipe immediately after opening it so that if the C++ process logs messages while opening the other named pipes they are captured in a timely manner and there is no danger of a deadlock. Backport of #56632	2020-05-14 07:10:30 +01:00
Aleksandr Maus	87a10806ab	EQL: Fix cidrMatch function fails to match when used in scripts (#56246 ) (#56735 ) Addresses https://github.com/elastic/elasticsearch/issues/55709	2020-05-13 22:41:24 -04:00
Nik Everett	b98b260048	Merge significant_terms into the terms package (backport of #56699 ) (#56715 ) This merges the code for the `significant_terms` agg into the package for the code for the `terms` agg. They are super entangled already, this mostly just admits that to ourselves. Precondition for the terms work in #56487	2020-05-13 17:36:21 -04:00
Ross Wolf	61e2cf89b5	EQL: Add number function (#55084 ) * EQL: Add number function * EQL: Fix the locale used for number for deterministic functionality * EQL: Add more ToNumber tests * EQL: Add more number ToNumberProcessor unit tests * EQL: Remove unnecessary overrides, fix processor methods * EQL: Remove additional unnecessary overrides * EQL: Lint fixes for ToNumber * EQL: ToNumber renames from PR feedback * EQL: Remove NumberFormat locale handling * EQL: Removed NumberFormat from ToNumber * EQL: Add number function tests * EQL: ToNumberProcessorTests formatting * EQL: Remove newline in ToNumberProcessorTests * EQL: Add number(..., null) test * EQL: Create expression.function.scalar.math package * EQL: Remove painless whitespace for ToNumber.asScript * EQL: Add Long support	2020-05-13 14:09:06 -06:00
Costin Leau	9f1ecd52eb	EQL: Introduce support for sequences (#56300 ) Initial support for EQL sequences The current algorithm is focused on correctness and does not contain any optimization which is left for the future. The current implementation uses a state machine approach which moves ascending and runs each query one after the other working on computing sequences as the data comes in. For each result, the key and its timestamp are being extracted which are then used for matching/building a sequence. (cherry picked from commit 4f3e18c894a1841d333022361ad9d1fdf1477dc3)	2020-05-13 15:42:31 +03:00
Ignacio Vera	b4521d5183	upgrade to Lucene 8.6.0 snapshot (#56661 )	2020-05-13 14:25:16 +02:00
Marios Trivyzas	cbbbd499bf	SQL/EQL: Add support for scalars within LIKE/RLIKE (#56495 ) (#56674 ) - Add support for scalar functions on the field of SQL's LIKE/RLIKE - Add support for scalar functions on the field of EQL's match/matchLite Closes: #55058 (cherry picked from commit 51c14e2dbb7fb29004a23369c449d425b3ac8fe2)	2020-05-13 13:40:24 +02:00
Luca Cavanna	30e9a1b8c7	Improve error handling when decoding async execution ids (#56285 ) When decoding async execution ids, exceptions thrown from the decode method itself were not caught, leading to cryptic errors like "Input byte array has incorrect ending byte at 68" being returned. With this commit we return "invalid id: [abcdef]". Added test coverage for a couple of these scenarios and also added tests for equals/hashcode methods.	2020-05-13 12:26:17 +02:00
Marios Trivyzas	e781193cf9	SQL: Fix JDBC url pattern in docs and error message (#56612 ) The docs pattern url was using `*` which means zero or many instead of `?` which means zero or one. The pattern url returned in error messages was not in sync with the one in the docs. Fixes: #56476 (cherry picked from commit 1a5945c3962cdda21482f4b0b3e0ca508534c2c4)	2020-05-13 12:13:58 +02:00
David Turner	c10b4ae15a	Support cloning of searchable snapshot indices (#56595 ) Today you can convert a searchable snapshot index back into a regular index by restoring the underlying snapshot, but this is somewhat wasteful if the shards are already in cache since it copies the whole index from the repository again. Instead, we can make use of the locally-cached data by using the clone API to copy the contents of the cache into the layout expected by a regular shard. This commit marks the searchable snapshot's private index settings as `NotCopyableOnResize` so that they are removed by resize operations such as cloning. Cloning a regular index typically hard-links the underlying files rather than copying them, but this is tricky to support in the case of a searchable snapshot so this commit takes the simpler approach of always copying the underlying files.	2020-05-13 11:05:14 +01:00
Ioannis Kakavas	cc119c3853	Expose idp.metadata.http.refresh for SAML realm (#56354 ) (#56593 ) This setting was not returned in the SamlRealmSettings#getSettings so it was not possible for users to set this in the realm config in our configuration.	2020-05-13 11:51:18 +03:00
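
The normalizations named in commit 5e90ff32f7 (Add Normalize Pipeline Aggregation) can be illustrated with a small sketch. This is not the code from that commit: the function names below are hypothetical, and the formulas are the usual textbook definitions, which the commit message does not spell out. It also assumes the bucket values are not all identical, since several of the formulas divide by the value range or the standard deviation.

```python
import math
from statistics import mean, pstdev


def rescale(values, upper=1.0):
    """Min-max rescaling to [0, upper]; upper=100 gives the 0-100 variant."""
    lo, hi = min(values), max(values)
    return [upper * (v - lo) / (hi - lo) for v in values]


def percent_of_sum(values):
    """Each value as a fraction of the total."""
    total = sum(values)
    return [v / total for v in values]


def mean_normalize(values):
    """Distance from the mean, scaled by the value range."""
    mu, lo, hi = mean(values), min(values), max(values)
    return [(v - mu) / (hi - lo) for v in values]


def z_score(values):
    """Distance from the mean in population standard deviations (assumed)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


def softmax(values):
    """Exponentiate and normalize; subtracting the max avoids overflow."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical bucket values standing in for a metric series.
buckets = [10.0, 20.0, 30.0, 40.0]
print(rescale(buckets))
print(rescale(buckets, upper=100.0))
print(percent_of_sum(buckets))
print(mean_normalize(buckets))
print(z_score(buckets))
print(softmax(buckets))
```

Each function takes the full list of bucket values because every one of these normalizations depends on a statistic of the whole series (minimum, maximum, sum, mean, or standard deviation), which is why the feature is described as a pipeline aggregation over bucket values rather than a per-document computation.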


5500 Commits