OpenSearch

Commit Graph

Author	SHA1	Message	Date
Henning Andersen	0b214c1bfb	Linearizability checker memory reduction (#40149 ) The cache used in linearizability checker now uses approximately 6x less memory by changing the cache from a set of (bits, state) tuples into a map from bits -> { state }. Each combination of states is kept once only, building on the assumption that the number of state permutations is small compared to the number of bits permutations. For those histories that are difficult to check we will have many bits combinations that use the same state permutations. We end up now using approximately 15 bytes per entry compared to 101 bytes before, ie. a 6x improvement, allowing us to linearizability check significantly longer histories. Re-enabled linearizability checker in CoordinatorTests, hoping above ensures we no longer run out of memory. Resolves #39437	2019-03-18 21:16:59 +01:00
Gordon Brown	c8a4a7fc9d	Remove Migration Upgrade and Assistance APIs (#40075 ) The Migration Assistance API has been functionally replaced by the Deprecation Info API, and the Migration Upgrade API is not used for the transition from ES 6.x to 7.x, and does not need to be kept around to repair indices that were not properly upgraded before upgrading the cluster, as was the case in 6.	2019-03-18 13:46:56 -06:00
Nhat Nguyen	38e9522218	Remove wait for cluster state step in peer recovery (#40004 ) We introduced WAIT_CLUSTERSTATE action in #19287 (5.0), but then stopped using it since #25692 (6.0). This change removes that action and related code in 7.x and 8.0. Relates #19287 Relates #25692	2019-03-18 15:17:21 -04:00
Nhat Nguyen	9ba0bdf528	Dump cluster state if ensureGreen timed out in QA tests (#40133 ) When the method ensureGreen in QA tests is timed out, it does not provide enough info for us to investigate why the testing index is not green yet. With this change, we will dump the cluster state if ensureGreen timed out. Relates #32027	2019-03-18 15:17:21 -04:00
Nhat Nguyen	d720a64b9e	Ensure sendBatch not called recursively (#39988 ) This PR introduces AsyncRecoveryTarget which executes remote calls of peer recovery asynchronously. In this change, we also add a new assertion to ensure that method sendBatch, which sends a batch of history operations in phase2, is never called recursively on the same thread. This new assertion will also be used in method sendFileChunks.	2019-03-18 15:17:21 -04:00
Lisa Cawley	7325b2a3a7	[DOCS] Moves monitoring settings (#40167 )	2019-03-18 11:13:26 -07:00
Andrey Ershov	42602478b8	Unmute, fix, refactor and zen2ify NetworkDisruptionIT (#38351 ) This commit unmutes NetworkDisruptionIT. It makes changes necessary for Zen2 - avoids usage of autoMinMasterNodes and selects cluster size, such that there is no need to call AddVotingExclusion. This test also introduces refactors a single method prepareDistruptedCluster to be used by both test methods. Unfortunately, NetworkDisruption is broken and the testNetworkPartitionRemovalRestoresConnections "is fixed" by introducing assertBusy - #38348. Relates #36205 Relates #38348 (cherry picked from commit 97707c7f892636e5b75c3df546b067414acb27cd)	2019-03-18 16:39:43 +01:00
Andrey Ershov	d35d9d1886	cd ES_HOME in elasticsearch-env (#39937 ) This commit adds cd $ES_HOME to elasticsearch-env and removes it from elasticsearch. This way, both elasticsearch and elasticsearch-cli are executed with the working directory set to $ES_HOME. The need for the fix arose from the following bug: 1. Explicitly set path.data to relative to ES_HOME path in elasticsearch.yml. 2. Run elasticsearch from any directory. Elasticsearch is able to correctly start. 3. Stop elasticsearch. 4. Run elasticsearch-node unsafe-bootstrap, not from ES_HOME directory. It will fail with an exception. This commit fixes the issue and adds a new test. This PR fixes the issue and adds a new test. Also tests >=100 are renamed because alphabetic order does not work for them. (cherry picked from commit 2ffc29306ff7366efc598e7b4dd2ce528895cd3a with fixes by #40083 and #40118)	2019-03-18 16:26:27 +01:00
Jason Tedor	5be12e0999	Safe publication of AutoFollowCoordinator (#40153 ) We were leaking a reference to an AutoFollowCoordinator during construction, violating safe publication according to the JLS specification. This commit addresses this by waiting to register AutoFollowCoordinator with the ClusterApplierService after the AutoFollowCoordinator is fully constructed. We also remove ourselves as a listener when stopping.	2019-03-18 10:13:41 -04:00
Andrei Stefan	791814bb47	SQL: fix incorrect ordering of groupings (GROUP BY) based on orderings (ORDER BY) (#40087 ) * Take into consideration aliases that can be used as aggregates and in the ORDER BY element so that the groupings are re-ordered inside the composite aggregation according to the ORDER BY ordering. (cherry picked from commit 110c0b90b9cf2e9344ab3f412cfa8f8cd94ad71f)	2019-03-18 15:37:45 +02:00
Costin Leau	076a68007c	SQL: Add multi_value_field_leniency inside FieldHitExtractor (#40113 ) For cases where fields can have multi values, allow the behavior to be customized through a dedicated configuration field. By default this will be enabled on the drivers so that existing datasets work instead of throwing an exception. For regular SQL usage, the behavior is false so that the user is aware of the underlying data. Fix #39700 (cherry picked from commit 2b351571961f172fd59290ee079126bbd081ceaf)	2019-03-18 14:56:03 +02:00
Jason Tedor	b8ad337234	Stop auto-followers on shutdown (#40124 ) When shutting down a node, auto-followers will keep trying to run. This is happening even as transport services and other components are being closed. In some cases, this can lead to a stack overflow as we rapidly try to check the license state of the remote cluster, can not because the transport service is shutdown, and then immeidately retry again. This can happen faster than the shutdown, and we die with stack overflow. This commit adds a stop command to auto-followers so that this retry loop occurs at most once on shutdown.	2019-03-18 07:25:31 -04:00
Jim Ferenczi	eb540125ea	Fix IndexSearcherWrapper visibility (#39071 ) (#40145 ) This change adds a wrapper for IndexSearcher that makes IndexSearcher#search(List, Weight, Collector) visible by sub-classes. The wrapper is used by the ContextIndexSearcher to call this protected method on a searcher created by a plugin. This ensures that an override of the protected method in an IndexSearcherWrapper plugin is called when a search is executed. Closes #30758	2019-03-18 11:33:54 +01:00
David Turner	8a3f87bdcd	Note that GET /_cluster/state is unstable (#40104 ) The `GET /_cluster/state` API returns an internal representation of the cluster state that does change from version to version. It's useful for debugging, but it is not intended for regular use by clients. This change adjusts the documentation of `GET /_cluster/state` to clarify that this API yields an internal representation that should not be expected to remain stable between versions. Relates #40061, #40016	2019-03-18 09:28:00 +00:00
Jim Ferenczi	5b73a1bc7d	Add an option to force the numeric type of a field sort (#38095 ) (#40084 ) This change adds an option to the `FieldSortBuilder` that allows to transform the type of a numeric field into another. Possible values for this option are `long` that transforms the source field into an integer and `double` that transforms the source field into a floating point. This new option is useful for cross-index search when the sort field is mapped differently on some indices. For instance if a field is mapped as a floating point in one index and as an integer in another it is possible to align the type for both indices using the `numeric_type` option: ``` { "sort": { "field": "my_field", "numeric_type": "double" <1> } } ``` <1> Ensure that values for this field are transformed to a floating point if needed.	2019-03-18 09:32:45 +01:00
Daniel Mitterdorfer	95024798c0	Document monitoring node stats collection timeout (#39846 ) (#40140 ) With this commit we document the setting `xpack.monitoring.collection.node.stats.timeout` that has been missing so far in the docs. Supersedes #31043	2019-03-18 08:59:02 +01:00
Ioannis Kakavas	607d05f0b8	Enable QA tests to run with FIPS nodes (#40105 ) This commit enables full-cluster-restart and rolling-upgrade tests to run with nodes using a JVM in fips approved only node by using PEM key material instead of a JKS for the transport layer in that case.	2019-03-18 09:00:31 +02:00
Ioannis Kakavas	3b9a884f92	Throw an exception when unable to read Certificate (#40092 ) With SUN security provider, a CertificateException is thrown when attempting to parse a Certificate from a PEM file on disk with `sun.security.provider.X509Provider#parseX509orPKCS7Cert` When using the BouncyCastle Security provider (as we do in fips tests) the parsing happens in CertificateFactory#engineGenerateCertificates which doesn't throw an exception but returns an empty list. In order to have a consistent behavior, this change makes it so that we throw a CertificateException when attempting to read a PEM file from disk and failing to do so in either Security Provider Resolves: #39580	2019-03-18 08:46:49 +02:00
Albert Zaharovits	124de8d938	Un-hardcode SecurityIndexManager to handle generic indices (#40064 ) `SecurityIndexManager` is hardcoded to handle only the `.security`-`.security-7` alias-index pair. This commit removes the hardcoded bits, so that the `SecurityIndexManager` can be reused for other indices, such as the planned security tokens index (`.security-tokens-7`).	2019-03-17 14:46:16 +02:00
Albert Zaharovits	1b75ee0bd7	AuditTrail correctly handle ReplicatedWriteRequest (#39925 ) This fix deduplicates index names in `BulkShardRequests` and only audits the specific resolved index for every comprising `BulkItemRequest`.	2019-03-17 13:05:26 +02:00
Ioannis Kakavas	a77e3d1ad8	Adjust ldap timeout for idp fixture (#40102 ) This change adjusts the LDAP connection timeout for retrieving attributes while performing the SAML IT to 5 seconds, from 5 ms that it previously was. Resolves: #40025	2019-03-17 11:46:52 +02:00
David Roberts	64028f3d8f	Mute JobResultsProviderIT.testMultipleSimultaneousJobCreations Due to https://github.com/elastic/elasticsearch/issues/40134	2019-03-17 07:50:08 +00:00
Jason Tedor	2df0405d97	Reenable BWC tests after removing cluster state size (#40127 ) This commit reenables the BWC tests after removing cluster state size and backporting that work.	2019-03-16 18:47:15 -04:00
Benjamin Trent	28729eb54c	[ML] fixing sort order (#40119 ) (#40123 )	2019-03-16 17:14:07 -05:00
Jason Tedor	0824eceacf	Add log message for auto-follower timeout When an auto-follower coordinator times out waiting for the remote cluster state, we do not log any indication of this. While this is expected behavior in quiet deployments, it is still useful to see this information for tracing the behavior of the auto-follow coordinator. This commit adds a trace log message indicating that the timeout.	2019-03-16 10:46:20 -04:00
Jason Tedor	86d1d03c37	Remove cluster state size (#40109 ) This commit removes the cluster state size field from the cluster state response, and drops the backwards compatibility layer added in 6.7.0 to continue to support this field. As calculation of this field was expensive and had dubious value, we have elected to remove this field.	2019-03-15 17:16:25 -04:00
Igor Motov	a019af7690	SQL: Refactor Literals serialization method (#40058 ) Since other classes besides intervals can be serialized as part of the Cursor, the getNamedWritables method should be moved from Intervals to a more generic class Literals. Relates to #39973	2019-03-15 14:30:28 -04:00
Tim Brooks	0b50a670a4	Remove transport name from tcp channel (#40074 ) Currently, we maintain a transport name ("mock-nio", "nio", "netty") that is passed to a `TcpTransportChannel` when a request is received. The value of this name is to associate with the task when we register a task with the task manager. However, it is only possible to run ES with one transport, so having an implementation specific name is unnecessary. This commit removes the name and replaces it with the generic "transport".	2019-03-15 12:04:13 -06:00
Julie Tibshirani	6ffa8a040d	Document the limitation around field aliases and percolator. (#40073 ) Currently if a field alias is updated, any percolator queries that contain the alias will still refer to its old target. This PR documents the issue while we look into addressing it. Relates to #37212.	2019-03-15 10:54:09 -07:00
Zachary Tong	c72feedd74	Do not allow Sampler to allocate more than maxDoc size, better CB accounting (#39381 ) The `sampler` agg creates a BestDocsDeferringCollector, which internally initializes a priority queue of size `shardSize`. This queue is populated with empty `Object` sentinels, which is roughly 16b per object. Similarly, the Diversified samplers create a DiversifiedTopDocsCollectors which internally track PQ slots with ScoreDocKeys, weighing in around 28kb If the user sets a very abusive `shard_size`, this could easily OOM a node or cluster since these PQ are allocated up-front without any checks. This commit makes sure that when we create the collector, it cannot be larger than the maxDoc so that we don't accidentally blow up the node. We ensure the size is not greater than the overall index maxDoc. A similar treatment is done for `maxDocsPerValue` parameter of the diversified samplers For good measure, this also adds in some CB accounting to try and track memory usage. Finally, a redundant array creation is removed to reduce a bit of temporary memory.	2019-03-15 13:19:55 -04:00
MK Swanson	4d2ceb5ed5	[DOCS] Edited intro sentence for clarity. Closes #39865 (#40108 )	2019-03-15 13:17:34 -04:00
Jack Conradson	dcaabdfce8	Add Painless cast tests for long and Long (#40007 )	2019-03-15 09:37:26 -07:00
Yannick Welsch	c74111ff8e	Reduce logging noise when stepping down as master before state recovery (#39950 ) Reduces the logging noise from the state recovery component when there are duelling elections. Relates to #32006	2019-03-15 17:24:03 +01:00
David Kyle	4eb3683d65	Mute CcrRetentionLeaseIT tests (#40090 )	2019-03-15 15:05:47 +00:00
Lisa Cawley	c92476f591	[DOCS] Replaces CCS terms with attributes (#40076 )	2019-03-15 07:57:51 -07:00
David Kyle	09809bc91b	[ML] Avoid assertions on empty Optional in DF usage test (#40043 ) Refactor the usage class to make testing simpler	2019-03-15 12:18:29 +00:00
Henning Andersen	95e61d4bb1	Blob Store compress default false (#40054 ) Fixed documentation to comply with code (compress=false is default until 8.0).	2019-03-15 12:31:03 +01:00
David Roberts	8d01b11918	[ML] Fix race condition when creating multiple jobs (#40049 ) If multiple jobs are created together and the anomaly results index does not exist then some of the jobs could fail to update the mappings of the results index. This lead them to fail to write their results correctly later. Although this scenario sounds rare, it is exactly what happens if the user creates their first jobs using the Nginx module in the ML UI. This change fixes the problem by updating the mappings of the results index if it is found to exist during a creation attempt. Fixes #38785	2019-03-15 10:18:03 +00:00
David Kyle	78a9754318	Mute test NetworkDisruptionIT.testJobRelocation Relates to #39858	2019-03-15 10:06:31 +00:00
Costin Leau	3960374a6f	SQL: Introduce MAD (MedianAbsoluteDeviation) aggregation (#40048 ) Add Median Absolute Deviation aggregation Fix #39597 (cherry picked from commit 4f09613942a9249d06c74da64ad7e6f362e97f56)	2019-03-15 11:45:15 +02:00
David Turner	0d152a54f8	Await all pending activity in testConnectAndDisconnect (#40037 ) We call `ensureConnections()` to undo the effects of a disruption. However, it is possible that one or more targets are currently CONNECTING and have been since the disruption was active, and that the connection attempt was thwarted by a concurrent disruption to the connection. If so, we cannot simply add our listener to the queue because it will be notified when this CONNECTING activity completes even though it was disrupted. We must therefore wait for all the current activity to finish and then go through and reconnect to any missing nodes. Closes #40030.	2019-03-15 08:08:57 +00:00
Ryan Ernst	8f09c77777	Add no-jdk distributions (#39882 ) This commit adds a variant for every official distribution that omits the bundled jdk. The "no-jdk" naming is conveyed through the package classifier, alongside the platform. Package tests are also added for each new distribution.	2019-03-15 00:55:57 -07:00
David Turner	a323132503	Create retention leases file during recovery (#39359 ) Today we load the shard history retention leases from disk whenever opening the engine, and treat a missing file as an empty set of leases. However in some cases this is inappropriate: we might be restoring from a snapshot (if the target index already exists then there may be leases on disk) or force-allocating a stale primary, and in neither case does it make sense to restore the retention leases from disk. With this change we write an empty retention leases file during recovery, except for the following cases: - During peer recovery the on-disk leases may be accurate and could be needed if the recovery target is made into a primary. - During recovery from an existing store, as long as we are not force-allocating a stale primary. Relates #37165	2019-03-15 07:49:49 +00:00
David Turner	8d2184b315	Fix up committed configuration on fake Zen1 nodes (#40065 ) Today we test Zen1/Zen2 compatibility by running 7.x nodes with a "fake" Zen1 implementation. However this is not a truly faithful test because these nodes do known how to properly deserialize a 7.x cluster state, voting configurations and all, whereas a real Zen1 node is in 6.7 and ignores the coordination metadata. We only ever apply a cluster state that's been committed, which in Zen2 involves setting the last-committed configuration to equal the last-accepted configuration. Zen1 knows nothing about this adjustment, so it is possible for these to differ. This breaks the assertion that the cluster states are equal on all nodes after integration tests. This commit fixes this by implementing this adjustment in Zen1 before applying a cluster state. Fixes #40055.	2019-03-15 07:44:31 +00:00
Ioannis Kakavas	35aaf04c8c	Handle empty input in AddStringKeyStoreCommand (#39490 ) This change ensures that we do not make assumptions about the length of the input that we can read from the stdin. It still consumes only one line, as the previous implementation	2019-03-15 09:38:22 +02:00
Lisa Cawley	e050d50759	[DOCS] Removes X-Pack settings section (#39870 )	2019-03-14 14:37:37 -07:00
Jack Conradson	b57af6c401	Add a Painless Context REST API (#39382 ) This PR adds an internal REST API for querying context information about Painless whitelists. Commands include the following: GET /_scripts/painless/_context -- retrieves a list of contexts GET /_scripts/painless/_context?context=%name% retrieves all available information about the API for this specific context	2019-03-14 12:42:12 -07:00
Ioannis Kakavas	ae0ff05c90	Adjust testGetSslCertificates to run in FIPS (#40046 ) As discovered in #40041, when parsing certificates from files, the SUN Security Provider normalizes DNs from parsed certificates by adding spaces between RDNs, while the BouncyCastle one (which we use in FIPS tests) does not. We could proceed to normalize the DNs in the same manner in this test by using i.e. the Unbound LDAP SDK but since the goal of this test is to validate that we do get to read these exact certificates from our trust sources and not to validate subject DNs, this commit changes the test to check the serial number instead Resolves: #40041	2019-03-14 20:25:47 +02:00
Tamara Braun	e2b60c7141	Fix not Recognizing Disabled Object Mapper (#39862 ) * Fixes not finding disabled object mapper when using dotted field name notation * Closes #39456	2019-03-14 10:57:00 -07:00
Lisa Cawley	52b0b9cb55	[DOCS] Add reindex steps for internal indices (#40019 )	2019-03-14 10:30:13 -07:00

... 6 7 8 9 10 ...

45352 Commits All Branches Search

45352 Commits

All Branches