OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Turner	33d8738c68	Fix RareClusterStateIT on MacOS (#40203 ) Today RareClusterStateIT#testAssignmentWithJustAddedNodes fails on my Mac because it waits for the default connection timeout of 30 seconds to connect to a fake node with IP address 0.0.0.0. This connection attempt fails much more quickly on Linux so the test passes. This commit fixes this by reducing the connection timeout for this test.	2019-03-19 17:33:21 +00:00
Nhat Nguyen	a13b4bc8c5	Always fail engine if delete operation fails (#40117 ) Unlike index operations which can fail at the document level to analyzing errors, delete operations should never fail at the document level whether soft-deletes is enabled or not. With this change, we will always fail the engine if we fail to apply a delete operation to Lucene. Closes #33256	2019-03-19 13:09:23 -04:00
Nhat Nguyen	d58864745c	Dump recovery if fail to get doc count with preference (#40168 ) With this change, we will dump the recovery state if we fail to get doc count for a given index with a preference in rolling upgrade tests. We should have more information to look into why the provided preference is not valid. I also unmuted `testRelocationWithConcurrentIndexing` in this change. Relates #34950	2019-03-19 13:09:04 -04:00
Gordon Brown	85bb5a7f46	Only count some fields types for deprecation check (#40166 ) Some field types are not used for queries which use auto-expansion, in particular, `binary`, `geo_point`, and `geo_shape`. This was causing the count returned by the deprecation check and the count returned by the query-time deprecation warning to be misaligned for indices with fields of those types, with the count returned by the deprecation check being larger.	2019-03-19 10:52:35 -06:00
Mark Vovchuk	a054a9866e	Update managing-roles.asciidoc	2019-03-19 08:21:29 -07:00
Luca Cavanna	4c9d7df887	Disable bwc tests Relates to #40177	2019-03-19 14:43:39 +01:00
Luca Cavanna	d14e79e849	Serialize top-level pipeline aggs as part of InternalAggregations (#40177 ) We currently convert pipeline aggregators to their corresponding InternalAggregation instance as part of the final reduction phase. They arrive to the coordinating node as part of QuerySearchResult objects fom the shards and, despite we may incrementally reduce aggs (hence we may have some non-final reduce and the final one later) all the reduction phases happen on the same node. With CCS minimizing roundtrips though, each cluster performs its own non-final reduction, and then serializes the results back to the CCS coordinating node which will perform the final coordination. This breaks the assumptions made up until now around reductions happening all on the same node. With #40101 we have made sure that top-level pipeline aggs are not reduced as part of the non-final reduction. The next step is to make sure that they don't get lost, meaning that each coordinating node needs to send them back to the CCS coordinating node as part of the top-level `InternalAggregations` object. Closes #40059	2019-03-19 14:43:39 +01:00
Luca Cavanna	803ec46331	Skip sibling pipeline aggregators reduction during non-final reduce (#40101 ) Today a coordinating node forces a final reduction of sibling pipeline aggregators whenever reducing aggs, unless it is reducing aggs incrementally. This works well for incremental reduction of aggs, but breaks CCS when minimizing roundtrips as each cluster ends up reducing its own pipeline aggregators locally while that should only be done by the CCS coordinating node later. This causes issues as after their reduction, pipeline aggs cannot be further reduced, which is what happens with CCS causing errors like "java.lang.UnsupportedOperationException: Not supported" being returned. Each coordinating node should rather honour the reduce context flag that indicates whether we are executing a final reduce or not. If not, it should leave the sibling pipeline aggregations alone. Note that his bug affects only pipeline aggs that don't have a parent in the aggs tree, while all the others work well. Relates to #40059 but does not fix it yet, as the CCS coordinating node also needs to be adapted to recreate sibling pipeline aggregators from the request.	2019-03-19 14:43:39 +01:00
Luca Cavanna	83f12a3d9c	CCS: skip empty search hits when minimizing round-trips (#40098 ) When minimizing round-trips, each cluster returns its own independent search response. In case sort by field and/or field collapsing were requested, when one cluster has no results to return, the information about the field that sorting was based on (SortField array) as well as the field (and the values) that collapsing was performed on are missing in the search response. That causes problems as we can't build the proper `TopDocs` instance which would need to be either `TopFieldDocs` or `CollapseTopFieldDocs`. The merge routine expects that all the top docs are of the same exact type which can't be guaranteed. Given that the problematic results are empty, hence have no impact on the final results, we can simply skip them. Relates to #32125 Closes #40067	2019-03-19 14:43:39 +01:00
Luca Cavanna	a11f1c88b3	[DOCS] add details on version compatibility and remote gateway selection (#40056 ) This commit clarifies how the gateway selection works when configuring remote clusters for CCR or CCS. Specifically, it clarifies compatibility between different versions which is a very common question.	2019-03-19 14:43:38 +01:00
Alex Doerr	740020dfe8	Clarify version compatibility in snapshot/restore docs (#39329 )	2019-03-19 14:43:38 +01:00
Luca Cavanna	9c38fa6468	[TEST] Update TransportSearchActionTests#testShouldMinimizeRoundtrips Relates to #40044 Closes #40051	2019-03-19 14:43:38 +01:00
Luca Cavanna	07bfb4c7f7	CCS: Disable minimizing round-trips when dfs is requested (#40044 ) When using DFS_QUERY_THEN_FETCH search type, the dfs phase is run and its results are used in the query phase to make scoring accurate. When using CCS, depending on whether the DFS phase runs in the CCS coordinating node (like if all shards were local) or in each remote cluster (when minimizing round-trips), scoring will differ. This commit disables minimizing round-trips whenever DFS is requested, as it is not currently possible to ensure that scoring is accurate in that case. Relates to #32125	2019-03-19 14:43:38 +01:00
David Kyle	387648065d	[ML] Data Frame HLRC start & stop APIs (#40197 )	2019-03-19 13:30:01 +00:00
Nhat Nguyen	8dc6862b17	Unmute and trace testPendingRefreshWithIntervalChange Tracked at #39565	2019-03-19 09:07:54 -04:00
Henning Andersen	dde41cc2dd	Node repurpose tool (#39403 ) When a node is repurposed to master/no-data or no-master/no-data, v7.x will not start (see #37748 and #37347). The `elasticsearch repurpose` tool can fix this by cleaning up the problematic data.	2019-03-19 11:52:02 +01:00
Alexander Reelsen	c46dd6ad08	Replace java mail with jakarta mail (#40088 ) The eclipse foundation has taken over the javax mail dependency, which resulted in a naming change of the dependency.	2019-03-19 09:56:44 +01:00
Yannick Welsch	1d8b5fc658	Fail command-line client's auto-URL detection with helpful message (#40151 ) The setup-passwords tool gives cryptic messages in case where custom discovery providers are used (see #33580). As the URL auto-detection logic should be seen as best effort, this commit improves the exception message to make it clearer what needs to be done to fix the issue. Relates #33580	2019-03-19 09:04:14 +01:00
Alpar Torok	cb29d3d8d0	Fix build-tools unit test on windows (#39986 )	2019-03-19 08:07:33 +02:00
Jason Tedor	f88e4181ca	Enable reading auto-follow patterns from x-content (#40130 ) This named writable was never registered, so it means that we could not read auto-follow patterns that were registered in the cluster state. This causes them to be lost on restarts, a bad bug. This commit addresses this by registering this named writable, and we add a basic CCR restart test to ensure that CCR keeps functioning properly when the follower is restarted.	2019-03-18 21:48:44 -04:00
Lisa Cawley	d093205b6a	[DOCS] Fixes edit_url attributes that were externalized as text strings (#40161 )	2019-03-18 14:32:02 -07:00
jaymode	afaa7e60f9	Mute FullClusterRestartIT#testWatcher This test has failed due to incorrect versions on the watches, see	2019-03-18 15:08:32 -06:00
Dimitris Athanasiou	95f660d577	Mute NoMasterNodeIT.testNoMasterActionsWriteMasterBlock test (#39689 ) Relates #39688	2019-03-18 15:04:26 -06:00
Henning Andersen	0b214c1bfb	Linearizability checker memory reduction (#40149 ) The cache used in linearizability checker now uses approximately 6x less memory by changing the cache from a set of (bits, state) tuples into a map from bits -> { state }. Each combination of states is kept once only, building on the assumption that the number of state permutations is small compared to the number of bits permutations. For those histories that are difficult to check we will have many bits combinations that use the same state permutations. We end up now using approximately 15 bytes per entry compared to 101 bytes before, ie. a 6x improvement, allowing us to linearizability check significantly longer histories. Re-enabled linearizability checker in CoordinatorTests, hoping above ensures we no longer run out of memory. Resolves #39437	2019-03-18 21:16:59 +01:00
Gordon Brown	c8a4a7fc9d	Remove Migration Upgrade and Assistance APIs (#40075 ) The Migration Assistance API has been functionally replaced by the Deprecation Info API, and the Migration Upgrade API is not used for the transition from ES 6.x to 7.x, and does not need to be kept around to repair indices that were not properly upgraded before upgrading the cluster, as was the case in 6.	2019-03-18 13:46:56 -06:00
Nhat Nguyen	38e9522218	Remove wait for cluster state step in peer recovery (#40004 ) We introduced WAIT_CLUSTERSTATE action in #19287 (5.0), but then stopped using it since #25692 (6.0). This change removes that action and related code in 7.x and 8.0. Relates #19287 Relates #25692	2019-03-18 15:17:21 -04:00
Nhat Nguyen	9ba0bdf528	Dump cluster state if ensureGreen timed out in QA tests (#40133 ) When the method ensureGreen in QA tests is timed out, it does not provide enough info for us to investigate why the testing index is not green yet. With this change, we will dump the cluster state if ensureGreen timed out. Relates #32027	2019-03-18 15:17:21 -04:00
Nhat Nguyen	d720a64b9e	Ensure sendBatch not called recursively (#39988 ) This PR introduces AsyncRecoveryTarget which executes remote calls of peer recovery asynchronously. In this change, we also add a new assertion to ensure that method sendBatch, which sends a batch of history operations in phase2, is never called recursively on the same thread. This new assertion will also be used in method sendFileChunks.	2019-03-18 15:17:21 -04:00
Lisa Cawley	7325b2a3a7	[DOCS] Moves monitoring settings (#40167 )	2019-03-18 11:13:26 -07:00
Andrey Ershov	42602478b8	Unmute, fix, refactor and zen2ify NetworkDisruptionIT (#38351 ) This commit unmutes NetworkDisruptionIT. It makes changes necessary for Zen2 - avoids usage of autoMinMasterNodes and selects cluster size, such that there is no need to call AddVotingExclusion. This test also introduces refactors a single method prepareDistruptedCluster to be used by both test methods. Unfortunately, NetworkDisruption is broken and the testNetworkPartitionRemovalRestoresConnections "is fixed" by introducing assertBusy - #38348. Relates #36205 Relates #38348 (cherry picked from commit 97707c7f892636e5b75c3df546b067414acb27cd)	2019-03-18 16:39:43 +01:00
Andrey Ershov	d35d9d1886	cd ES_HOME in elasticsearch-env (#39937 ) This commit adds cd $ES_HOME to elasticsearch-env and removes it from elasticsearch. This way, both elasticsearch and elasticsearch-cli are executed with the working directory set to $ES_HOME. The need for the fix arose from the following bug: 1. Explicitly set path.data to relative to ES_HOME path in elasticsearch.yml. 2. Run elasticsearch from any directory. Elasticsearch is able to correctly start. 3. Stop elasticsearch. 4. Run elasticsearch-node unsafe-bootstrap, not from ES_HOME directory. It will fail with an exception. This commit fixes the issue and adds a new test. This PR fixes the issue and adds a new test. Also tests >=100 are renamed because alphabetic order does not work for them. (cherry picked from commit 2ffc29306ff7366efc598e7b4dd2ce528895cd3a with fixes by #40083 and #40118)	2019-03-18 16:26:27 +01:00
Jason Tedor	5be12e0999	Safe publication of AutoFollowCoordinator (#40153 ) We were leaking a reference to an AutoFollowCoordinator during construction, violating safe publication according to the JLS specification. This commit addresses this by waiting to register AutoFollowCoordinator with the ClusterApplierService after the AutoFollowCoordinator is fully constructed. We also remove ourselves as a listener when stopping.	2019-03-18 10:13:41 -04:00
Andrei Stefan	791814bb47	SQL: fix incorrect ordering of groupings (GROUP BY) based on orderings (ORDER BY) (#40087 ) * Take into consideration aliases that can be used as aggregates and in the ORDER BY element so that the groupings are re-ordered inside the composite aggregation according to the ORDER BY ordering. (cherry picked from commit 110c0b90b9cf2e9344ab3f412cfa8f8cd94ad71f)	2019-03-18 15:37:45 +02:00
Costin Leau	076a68007c	SQL: Add multi_value_field_leniency inside FieldHitExtractor (#40113 ) For cases where fields can have multi values, allow the behavior to be customized through a dedicated configuration field. By default this will be enabled on the drivers so that existing datasets work instead of throwing an exception. For regular SQL usage, the behavior is false so that the user is aware of the underlying data. Fix #39700 (cherry picked from commit 2b351571961f172fd59290ee079126bbd081ceaf)	2019-03-18 14:56:03 +02:00
Jason Tedor	b8ad337234	Stop auto-followers on shutdown (#40124 ) When shutting down a node, auto-followers will keep trying to run. This is happening even as transport services and other components are being closed. In some cases, this can lead to a stack overflow as we rapidly try to check the license state of the remote cluster, can not because the transport service is shutdown, and then immeidately retry again. This can happen faster than the shutdown, and we die with stack overflow. This commit adds a stop command to auto-followers so that this retry loop occurs at most once on shutdown.	2019-03-18 07:25:31 -04:00
Jim Ferenczi	eb540125ea	Fix IndexSearcherWrapper visibility (#39071 ) (#40145 ) This change adds a wrapper for IndexSearcher that makes IndexSearcher#search(List, Weight, Collector) visible by sub-classes. The wrapper is used by the ContextIndexSearcher to call this protected method on a searcher created by a plugin. This ensures that an override of the protected method in an IndexSearcherWrapper plugin is called when a search is executed. Closes #30758	2019-03-18 11:33:54 +01:00
David Turner	8a3f87bdcd	Note that GET /_cluster/state is unstable (#40104 ) The `GET /_cluster/state` API returns an internal representation of the cluster state that does change from version to version. It's useful for debugging, but it is not intended for regular use by clients. This change adjusts the documentation of `GET /_cluster/state` to clarify that this API yields an internal representation that should not be expected to remain stable between versions. Relates #40061, #40016	2019-03-18 09:28:00 +00:00
Jim Ferenczi	5b73a1bc7d	Add an option to force the numeric type of a field sort (#38095 ) (#40084 ) This change adds an option to the `FieldSortBuilder` that allows to transform the type of a numeric field into another. Possible values for this option are `long` that transforms the source field into an integer and `double` that transforms the source field into a floating point. This new option is useful for cross-index search when the sort field is mapped differently on some indices. For instance if a field is mapped as a floating point in one index and as an integer in another it is possible to align the type for both indices using the `numeric_type` option: ``` { "sort": { "field": "my_field", "numeric_type": "double" <1> } } ``` <1> Ensure that values for this field are transformed to a floating point if needed.	2019-03-18 09:32:45 +01:00
Daniel Mitterdorfer	95024798c0	Document monitoring node stats collection timeout (#39846 ) (#40140 ) With this commit we document the setting `xpack.monitoring.collection.node.stats.timeout` that has been missing so far in the docs. Supersedes #31043	2019-03-18 08:59:02 +01:00
Ioannis Kakavas	607d05f0b8	Enable QA tests to run with FIPS nodes (#40105 ) This commit enables full-cluster-restart and rolling-upgrade tests to run with nodes using a JVM in fips approved only node by using PEM key material instead of a JKS for the transport layer in that case.	2019-03-18 09:00:31 +02:00
Ioannis Kakavas	3b9a884f92	Throw an exception when unable to read Certificate (#40092 ) With SUN security provider, a CertificateException is thrown when attempting to parse a Certificate from a PEM file on disk with `sun.security.provider.X509Provider#parseX509orPKCS7Cert` When using the BouncyCastle Security provider (as we do in fips tests) the parsing happens in CertificateFactory#engineGenerateCertificates which doesn't throw an exception but returns an empty list. In order to have a consistent behavior, this change makes it so that we throw a CertificateException when attempting to read a PEM file from disk and failing to do so in either Security Provider Resolves: #39580	2019-03-18 08:46:49 +02:00
Albert Zaharovits	124de8d938	Un-hardcode SecurityIndexManager to handle generic indices (#40064 ) `SecurityIndexManager` is hardcoded to handle only the `.security`-`.security-7` alias-index pair. This commit removes the hardcoded bits, so that the `SecurityIndexManager` can be reused for other indices, such as the planned security tokens index (`.security-tokens-7`).	2019-03-17 14:46:16 +02:00
Albert Zaharovits	1b75ee0bd7	AuditTrail correctly handle ReplicatedWriteRequest (#39925 ) This fix deduplicates index names in `BulkShardRequests` and only audits the specific resolved index for every comprising `BulkItemRequest`.	2019-03-17 13:05:26 +02:00
Ioannis Kakavas	a77e3d1ad8	Adjust ldap timeout for idp fixture (#40102 ) This change adjusts the LDAP connection timeout for retrieving attributes while performing the SAML IT to 5 seconds, from 5 ms that it previously was. Resolves: #40025	2019-03-17 11:46:52 +02:00
David Roberts	64028f3d8f	Mute JobResultsProviderIT.testMultipleSimultaneousJobCreations Due to https://github.com/elastic/elasticsearch/issues/40134	2019-03-17 07:50:08 +00:00
Jason Tedor	2df0405d97	Reenable BWC tests after removing cluster state size (#40127 ) This commit reenables the BWC tests after removing cluster state size and backporting that work.	2019-03-16 18:47:15 -04:00
Benjamin Trent	28729eb54c	[ML] fixing sort order (#40119 ) (#40123 )	2019-03-16 17:14:07 -05:00
Jason Tedor	0824eceacf	Add log message for auto-follower timeout When an auto-follower coordinator times out waiting for the remote cluster state, we do not log any indication of this. While this is expected behavior in quiet deployments, it is still useful to see this information for tracing the behavior of the auto-follow coordinator. This commit adds a trace log message indicating that the timeout.	2019-03-16 10:46:20 -04:00
Jason Tedor	86d1d03c37	Remove cluster state size (#40109 ) This commit removes the cluster state size field from the cluster state response, and drops the backwards compatibility layer added in 6.7.0 to continue to support this field. As calculation of this field was expensive and had dubious value, we have elected to remove this field.	2019-03-15 17:16:25 -04:00
Igor Motov	a019af7690	SQL: Refactor Literals serialization method (#40058 ) Since other classes besides intervals can be serialized as part of the Cursor, the getNamedWritables method should be moved from Intervals to a more generic class Literals. Relates to #39973	2019-03-15 14:30:28 -04:00

1 2 3 4 5 ...

45075 Commits All Branches Search

45075 Commits

All Branches