OpenSearch

Commit Graph

Author	SHA1	Message	Date
Yannick Welsch	a0ae1cc987	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 23:13:12 +01:00
Yannick Welsch	03d0ea91ef	Zen2: Rename tombstones to exclusions (#36226 ) Renames the withdrawal / tombstones APIs to voting configuration exclusions.	2018-12-05 23:12:28 +01:00
Andrey Ershov	5d6602120f	[Zen2] Hide not recovered state (#36224 ) This commit hides ClusterStates that have a STATE_NOT_RECOVERED_BLOCK from ClusterStateAppliers. This is needed, because some appliers, such as IngestService, rely on the fact, that cluster states with STATE_NOT_RECOVERED_BLOCK won't contain anything useful. Once the state is recovered it's fully available for the appliers. This commit also switches many of the remaining tests that require state persistence/recovery from Zen1 to Zen2.	2018-12-05 23:11:20 +01:00
jaymode	2d03eeb632	Test: mute testDataNodeRestartWithBusyMasterDuringSnapshot This commit mutes DedicatedClusterSnapshotRestoreIT# testDataNodeRestartWithBusyMasterDuringSnapshot which fails reproducibly. See #36283	2018-12-05 14:06:38 -07:00
Jay Modi	dba3f4ec9f	add version 6.5.3 (#36268 )	2018-12-05 12:15:09 -07:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Mark Tozzi	ce7b1886b8	Unit test for very large percentile aggs (#36122 ) Closes #19528 * Port a test Colin wrote for the TDigest library to validate TDigests storing over 2*MAXINT values. This appears to have been fixed in version 3.2 of TDigest, which Elasticsearch has been using for some time, so no changes were necessary to resolve this issue.	2018-12-05 10:56:48 -05:00
Andrey Ershov	e8289167ee	Fix FAIL_ON_METHOD in MetaDataStateFormatTests fail method should be computed before eval, not inside it	2018-12-05 14:59:41 +01:00
Yannick Welsch	b20497560c	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 14:06:38 +01:00
Yannick Welsch	0b9efff5cb	Zen2: Persist cluster states the old way on non-master-eligible nodes (#36247 ) The shard deletion logic (triggered by IndicesStore), which also leads to index metadata deletion on non-master-eligible data nodes, currently races against the new cluster state persistence logic triggered by accepting cluster states. One thread is writing the index metadata while another one is deleting the index metadata, leading to exceptions and assertions tripping (see below). The solution proposed by this PR is to move the cluster state persistence of non-master-eligible nodes back to the cluster applier service, just as it used to be for Zen1. This ensures that the index metadata deletion logic, which is triggered by the shard deletion logic, runs on the same thread on which we persist the cluster state.	2018-12-05 14:04:45 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
Alpar Torok	60e45cd81d	Testing conventions task part 2 (#36107 ) Closes #35435 - make it easier to add additional testing tasks with the proper configuration and add some where they were missing. - mute or fix failing tests - add a check as part of testing conventions to find classes not included in any testing task.	2018-12-05 14:20:01 +02:00
Martijn van Groningen	11935cd480	Replace Streamable w/ Writeable in BaseTasksResponse and subclasses (#36176 ) This commit replaces usages of Streamable with Writeable for the BaseTasksResponse / TransportTasksAction classes and subclasses of these classes. Note that where possible response fields were made final. Relates to #34389	2018-12-05 13:14:10 +01:00
Yannick Welsch	42457b5960	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-05 11:39:38 +01:00
Adrien Grand	4c2e0d76f3	Fix test failures when using the `termvectors` endpoint with a type name.	2018-12-05 11:28:12 +01:00
Tim Brooks	8bde608979	Register CcrRepository based on settings update (#36086 ) This commit adds an empty CcrRepository snapshot/restore repository. When a new cluster is registered in the remote cluster settings, a new CcrRepository is registered for that cluster. This is implemented using a new concept of "internal repositories". RepositoryPlugin now allows implementations to return factories for "internal repositories". The "internal repositories" are different from normal repositories in that they cannot be registered through the external repository api. Additionally, "internal repositories" are local to a node and are not stored in the cluster state. The repository will be unregistered if the remote cluster is removed.	2018-12-04 14:36:50 -07:00
Yannick Welsch	70c361ea5a	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 21:26:11 +01:00
Armin Braun	3c54b413ad	SNAPSHOT: Repo Creation out of ClusterStateTask (#36157 ) * Move `createRepository` call out of cluster state tasks * Now only `RepositoriesService#applyClusterState` manipulates `this.repositories` * Closes #9488	2018-12-04 20:53:51 +01:00
Adrien Grand	d24b40f688	Make typeless APIs usable with indices whose type name is different from `_doc` (#35790 ) This commit makes `document`, `update`, `explain`, `termvectors` and `mapping` typeless APIs work on indices that have a type whose name is not `_doc`. Unfortunately, this needs to be a bit of a hack since I didn't want calls with random type names to see documents with the type name that the user had chosen upon type creation. The `explain` and `termvectors` do not support being called without a type for now so the test is just using `_doc` as a type for now, we will need to fix tests later but this shouldn't require further changes server-side since passing `_doc` as a type name is what typeless APIs do internally anyway. Relates #35190	2018-12-04 19:22:17 +01:00
Yannick Welsch	9026f98aca	Remove trace logging from CoordinatorTests	2018-12-04 18:53:32 +01:00
Adrien Grand	0df08dd458	Set Lucene version upon index creation. (#36038 ) It is important that all shards of a given index have the same `indexCreatedVersionMajor` to Lucene, or eg. merging those shards is going to be considered illegal. At the moment, we use the latest Lucene version when creating a shard, which could cause shards to have different created versions eg. in case of forced allocation. This commit makes sure to reuse the appropriate Lucene version in order to avoid such issues. Closes #33826	2018-12-04 17:53:20 +01:00
Nhat Nguyen	b59deb573e	Always set soft-deletes field of IndexWriterConfig (#36196 ) Today we configure the soft-deletes field iff soft-deletes enabled. Although this choice was correct, it prevents an engine with soft-deletes disabled from opening a Lucene index with soft-deletes. Moreover, this change should not have any side-effect if a Lucene index does not have any soft-deletes. Relates #36141	2018-12-04 11:15:34 -05:00
Andrey Ershov	35e3d77e2c	[Zen2] Implement state recovery (#36013 ) This commit implements proper metadata recovery for Zen2. GatewayService is responsible for the recovery. In Zen1 GatewayService creates an instance of Gateway, that is used to reach out to other cluster nodes, get their state and calculate the most up-to-date state based on versions. After that Gateway performs upgrade and archival of ClusterSettings and closes bad indices. Then recovered state is passed to GatewayService.GatewayRecoveryListener that mixes up current state and restored state, removes state not recovered block, creates the routing table and performs re-routing. In Zen2 we should perform this kind of logic on cluster startup, except mixing state (because there is nothing to mix) and opening routing table. This commit refactors out all `ClusterUpdate` functions in a separate class `ClusterStateUpdaters`, which is used by `Gateway` and `GatewayService` in case of Zen1, and by `GatewayMetaState` and `GatewayService` in case of Zen2. This commit also switches all integration tests that are already using Zen2 from InMemoryPersistedState to GatewayMetaState.	2018-12-04 14:45:45 +01:00
Tanguy Leroux	5d684ca473	Combine the execution of an exclusive replica operation with primary term update (#36116 ) This commit changes how an operation which requires all index shard operations permits is executed when a primary term update is required: the operation and the update are combined so that the operation is executed after the primary term update under the same blocking operation. Closes #35850 Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2018-12-04 14:39:31 +01:00
Yannick Welsch	80ee7943c9	Merge remote-tracking branch 'elastic/master' into zen2	2018-12-04 09:37:09 +01:00
David Turner	034c7655b7	[Zen2] Reduce cluster scope in NodeDisconnectIT (#36168 ) This test suite can stop all the shared master-eligible nodes, which breaks the cluster since any non-shared master-eligible nodes are stopped first in the reset process between tests. Since this test suite can leave the cluster in this somewhat broken state, it seems best that it uses a new cluster for each test.	2018-12-04 07:48:56 +00:00
Andy Bristol	01b8f99c17	[test] mute RemoveCorruptedShardDataCommandIT	2018-12-03 16:07:16 -08:00
João Barbosa	d27aa72b17	Added soft limit to open scroll contexts #25244 (#36009 ) This change adds a soft limit to open scroll contexts that can be controlled with the dynamic cluster setting `search.max_open_scroll_context` (defaults to 500).	2018-12-03 19:57:10 +01:00
Julie Tibshirani	59ee8b5c69	Remove the deprecated _termvector endpoint. (#36131 )	2018-12-03 10:22:42 -08:00
David Turner	c01aecb4b1	[Zen2] Do not probe non-master nodes back (#36160 ) Today if a node `A` sends a peers request to another node `B` then `B` will react by sending a peers request back to `A`. However if `A` is not master-eligible then this reaction is pointless and fails with an exception saying `non-master-eligible node found`, adding noise to the logs. This change suppresses this response to non-master-eligible nodes.	2018-12-03 17:19:18 +00:00
Armin Braun	433a506d06	SNAPSHOT: Improve Resilience SnapshotShardService (#36113 ) * Resolve the index in the snapshotting thread * Added test for routing table - snapshot state mismatch	2018-12-03 16:39:29 +01:00
Nhat Nguyen	9c1c46a02f	TEST: Adjust min_retained_seq_no expectation min_retained_seq_no is non-negative, however, if the number of retained operations is greater than 0, then the expectation may be negative.	2018-12-03 08:59:13 -05:00
Luca Cavanna	b5cae0af58	Enforce max_buckets limit only in the final reduction phase (#36152 ) Given that we check the max buckets limit on each shard when collecting the buckets, and that non final reduction cannot add buckets (see #35921), there is no point in counting and checking the number of buckets as part of non final reduction phases. Such check is still needed though in the final reduction phases to make sure that the number of returned buckets is not above the allowed threshold. Relates somehow to #32125 as we will make use of non final reduction phases in CCS alternate execution mode and that increases the chance that this check trips for nothing when reducing aggs in each remote cluster.	2018-12-03 13:55:18 +01:00
Boaz Leskes	36ddca7d0c	Disable merges in testReuseInFileBasedPeerRecovery The test assumes lucene files don't change. Closes #35772	2018-12-03 13:45:19 +01:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354 ) When building a query Lucene distinguishes two cases, queries that require to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in es that require this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context.. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
Armin Braun	328d022ddd	MINOR: Some Cleanups around Store (#36139 ) * Moved method `canOpenIndex` is only used in tests -> moved to test CP * Simplify `org.elasticsearch.index.store.Store#renameTempFilesSafe` * Delete some dead methods	2018-12-03 11:21:42 +01:00
Armin Braun	f763037b03	MINOR: BlobstoreRepository Cleanups (#36140 ) * Removed redundant private getter * Removed unused `version` field	2018-12-03 11:11:10 +01:00
Armin Braun	9c49aacbcf	MINOR: Remove Dead Code in QueryCache (#36147 )	2018-12-03 10:02:35 +01:00
Martijn van Groningen	43773a32a4	Replace Streamable w/ Writeable in BaseTasksRequest and subclasses (#35854 ) * Replace Streamable w/ Writeable in BaseTasksRequest and subclasses This commit replaces usages of Streamable with Writeable for the BaseTasksRequest / TransportTasksAction classes and subclasses of these classes. Relates to #34389	2018-12-03 08:04:29 +01:00
Armin Braun	9c0a429709	TESTS: Fix IndexStatsIT#testFilterCacheStats (#36143 ) * Test randomly failed because of background merges * Fixed by force merging down to a single segment * Closes #32506	2018-12-03 06:16:12 +01:00
David Turner	8bb1952975	Fix NodeJoinTests again (#36133 ) In #36033 we removed a catch block because we thought we were preventing exceptions by avoiding concurrent elections, missing the obvious fact that some joins are supposed to be failing. As a quick fix the catch was reinstated in `3a5dab6d8e` but this change adds finesse by only catching exceptions from the joins that we expect to fail. It also inlines an always-false parameter to `initialState()`.	2018-12-01 09:54:01 +00:00
David Turner	9cc416bc46	Weaken assertion in PeerFinder It can be inactive with no leader if it's handling an incoming PeersRequest before being activated for the first time.	2018-12-01 07:20:19 +00:00
David Turner	3a5dab6d8e	Reinstate catch removed in error in #36033	2018-12-01 07:10:19 +00:00
Julie Tibshirani	0e1ddfd825	Deprecate types in document delete requests. (#36087 ) * Make sure to use _doc as a type name in the CRUD HLRC tests. * Deprecate types in document delete requests.	2018-11-30 15:11:29 -08:00
Julie Tibshirani	98b290637d	Deprecate the _termvector endpoint. (#36098 )	2018-11-30 13:11:58 -08:00
Nik Everett	df56f0734e	Tasks: Retry if task can't be written (#35054 ) Adds about a minute worth of backoffs and retries to saving task results so it is much more likely that a busy cluster won't lose task results. This isn't an ideal solution to losing task results, but it is an incremental improvement. If all of the retries fail when still log the task result, but that is far from ideal. Closes #33764	2018-11-30 16:06:58 -05:00
Luca Cavanna	0ebc17743a	Histogram aggs: add empty buckets only in the final reduce step (#35921 ) Empty buckets don't need to be added when performing an incremental reduction step, they can be added later in the final reduction step. This will allow us to later remove the max buckets limit when performing non final reduction.	2018-11-30 20:33:09 +01:00
Tim Brooks	ea7ea51050	Make `TcpTransport#openConnection` fully async (#36095 ) This is a follow-up to #35144. That commit made the underlying connection opening process in TcpTransport asynchronous. However the method still blocked on the process being complete before returning. This commit moves the blocking to the ConnectionManager level. This is another step towards the top-level TransportService api being async.	2018-11-30 11:30:42 -07:00
Armin Braun	986bf52d1f	[Zen2] Allow Setting a List of Bootstrap Nodes to Wait for (#35847 )	2018-11-30 18:53:08 +01:00
Tim Brooks	da100c5479	Remove `Lifecycle` from `ConnectionManager` (#36092 ) Prior to #35441 `ConnectionManager` had a `Lifecycle` object to support the ping runnable. After that commit, the connection amanger only needs the existing `AtomicBoolean` to indicate if it is running.	2018-11-30 09:04:32 -07:00
Luca Cavanna	43ea498f2f	[TEST] Reduce number of buckets created in InternalDateHistogramTests New that we test with min_doc_count set to 0 as well, we may end up generating a lot more buckets. This commit adjusts the min bound and max bound, as well as the offset for each randomly generated agg instance so that we don't end up hitting the 10.000 max buckets limit. Relates to #36064	2018-11-30 14:16:32 +01:00
Luca Cavanna	e3eb05b14b	[TEST] Increase InternalDateHistogramTests coverage (#36064 ) In this test we were randomizing different values but minDocCount was hardcoded to 1. It's important to test other values, especially `0` as it's the default. The test needed some adapting in the way buckets are randomly generated: all aggs need to share the same interval, minDocCount and emptyBucketInfo. Also assertions need to take into account that more (or less) buckets are expected depending on minDocCount.	2018-11-30 11:21:46 +01:00
Adrien Grand	fa3d365ee8	Fix CompositeBytesReference#slice to not throw AIOOBE with legal offsets. (#35955 ) CompositeBytesReference#slice has two bugs: - One that makes it fail if the reference is empty and an empty slice is created, this is #35950 and is fixed by special-casing empty-slices. - One performance bug that makes it always create a composite slice when creating a slice that ends on a boundary, this is fixed by computing `limit` as the index of the sub reference that holds the last element rather than the next element after the slice. Closes #35950	2018-11-30 10:32:46 +01:00
Jake Landis	f8636e58f9	Support content type `application/x-ndjson` in DeprecationRestHandler (#36025 ) org.elasticsearch.rest.RestController#hasContentType checks to see if the RestHandler supports the `application/x-ndjson` Content-Type. DeprecationRestHandler is a wrapper around the real RestHandler, and prior to this change would always return `false` due to the interface's default supportsContentStream(). This prevents API's that use multi-line JSON from properly being deprecated resulting in an HTTP 406 error. This change ensures that the DeprecationRestHandler honors the supportsContentStream() of the wrapped RestHandler. Relates to #35958	2018-11-29 11:45:45 -06:00
Jim Ferenczi	8a7f3f75f3	Add support for rest_total_hits_as_int (#36051 ) The support for rest_total_hits_as_int has already been merged to 6x in #35848 so this change adds this new option to master. The plan was to add this new option as part of #35848 but we've decided to wait a few days before merging this breaking change so this commit just handles the new option as a noop exactly like 6x for now. This will allow users to migrate to this parameter before #35848 is merged. Relates #33028	2018-11-29 18:36:16 +01:00
Tim Brooks	c305f9dc03	Make keepalive pings bidirectional and optimizable (#35441 ) This is related to #34405 and a follow-up to #34753. It makes a number of changes to our current keepalive pings. The ping interval configuration is moved to the ConnectionProfile. The server channel now responds to pings. This makes the keepalive pings bidirectional. On the client-side, the pings can now be optimized away. What this means is that if the channel has received a message or sent a message since the last pinging round, the ping is not sent for this round.	2018-11-29 08:55:53 -07:00
Jim Ferenczi	ecd29089a8	Cache the score of the parent document in the nested agg (#36019 ) The nested agg can defer the collection of children if it is nested under another aggregation. In such case accessing the score in the children aggregation throws an error because the scorer has already advanced to the next parent. This change fixes this error by caching the score of the parent in the nested aggregation. Children aggregations that work on nested documents will be able to access the _score. Also note that the _score in this case is always the parent's score, there is no way to retrieve the score of a nested docs in aggregations. Closes #35985 Closes #34555	2018-11-29 14:35:25 +01:00
Armin Braun	48dc6c3442	[Zen2] Implement Tombstone REST APIs (#36007 ) * [Zen2] Implement Tombstone REST APIs * Adds REST API for withdrawing votes and clearing vote withdrawls * Tests added to Netty4 module since we need a real Network impl. for Http endpoints	2018-11-29 14:34:10 +01:00
David Turner	7f257187af	[Zen2] Update default for USE_ZEN2 to true (#35998 ) Today the default for USE_ZEN2 is false and it is overridden in many places. By defaulting it to true we can be sure that the only places in which Zen2 does not work are those in which it is explicitly set to false.	2018-11-29 12:18:35 +00:00
David Turner	277ccba3bd	[Zen2] fix NodeJoinTests#testConcurrentJoining() (#36033 ) Today we sometimes create a setup in which the node is a quorum on its own, which allows it to win a pre-voting round and schedule an election essentially at will, causing it to discard all the joins it just received and fail the test. This change excludes this case, preventing stray elections from ruining things.	2018-11-29 12:11:08 +00:00
David Turner	87408b04d4	[Zen2] Only elect master-eligible nodes (#35996 ) Today any node can win an election. However, the whole point of master-eligibility is that master-ineligible nodes should not be elected as the leader; furthermore master-ineligible nodes do not have any outgoing STATE channels so cannot publish cluster states, so their leadership is ineffective and disruptive. This change ensures that the elected leader is master-eligible by preventing master-ineligible nodes from scheduling an election.	2018-11-29 12:10:43 +00:00
Alan Woodward	a646f85a99	Ensure TokenFilters only produce single tokens when parsing synonyms (#34331 ) A number of tokenfilters can produce multiple tokens at the same position. This is a problem when using token chains to parse synonym files, as the SynonymMap requires that there are no stacked tokens in its input. This commit ensures that when used to parse synonyms, these tokenfilters either produce a single version of their input token, or that they throw an error when mappings are generated. In indexes created in elasticsearch 6.x deprecation warnings are emitted in place of the error. * asciifolding and cjk_bigram produce only the folded or bigrammed token * decompounders, synonyms and keyword_repeat are skipped * n-grams, word-delimiter-filter, multiplexer, fingerprint and phonetic throw errors Fixes #34298	2018-11-29 10:35:38 +00:00
Tanguy Leroux	0967620641	ActiveShardCount should not fail when closing the index (#35936 ) The ActiveShardCount is used by cluster state observers to wait for a given number of shards to be active before returning to the caller. The current implementation does not work when an index is closed while an observer is waiting on shards to be active. In this case, a NPE is thrown and the observer is never notified that the shards won't become active. This commit fixes the ActiveShardCount.enoughShardsActive() so that it does not fail when an index is closed, similarly to what is done when an index is deleted.	2018-11-29 09:08:30 +01:00
Alpar Torok	e0a678f0c4	Remove version.qualified from MainResponse (#35412 ) The fully qualified version will be returned as `version.number`	2018-11-29 08:41:39 +02:00
Ryan Ernst	afd42df15f	Core: Deguice RepositoriesService (#36016 ) This commit moves the RepositoriesService to be created outside of guice.	2018-11-28 21:20:44 -08:00
Jim Ferenczi	9ca3a06475	Remove custom QueryBuilder#analyzeGraphPhrase (#35983 ) Now that https://issues.apache.org/jira/browse/LUCENE-8479 is fixed we can remove the custom implementation of QueryBuilder#analyzeGraphPhrase in the match QueryBuilder.	2018-11-28 20:15:27 +01:00
Luca Cavanna	4b85769d24	Increase InternalHistogramTests coverage (#36004 ) In `InternalHistogramTests` we were randomizing different values but `minDocCount` was hardcoded to `1`. It's important to test other values, especially `0` as it's the default. To make this possible, the test needed some adapting in the way buckets are randomly generated: all aggs need to share the same `interval`, `minDocCount` and `emptyBucketInfo`. Also assertions need to take into account that more (or less) buckets are expected depending on `minDocCount`. This was originated by #35921 and its need to test adding empty buckets as part of the reduce phase. Also relates to #26856 as one more key comparison needed to use `Double.compare` to properly handle `NaN` values, which was triggered by the increased test coverage.	2018-11-28 20:06:40 +01:00
Nik Everett	0588dad80b	Tasks: Only require task permissions (#35667 ) Right now using the `GET /_tasks/<taskid>` API and causing a task to opt in to saving its result after being completed requires permissions on the `.tasks` index. When we built this we thought that that was fine, but we've since moved towards not leaking details like "persisting task results after the task is completed is done by saving them into an index named `.tasks`." A more modern way of doing this would be to save the tasks into the index "under the hood" and to have APIs to manage the saved tasks. This is the first step down that road: it drops the requirement to have permissions to interact with the `.tasks` index when fetching task statuses and when persisting statuses beyond the lifetime of the task. In particular, this moves the concept of the "origin" of an action into a more prominent place in the Elasticsearch server. The origin of an action is ignored by the server, but the security plugin uses the origin to make requests on behalf of a user in such a way that the user need not have permissions to perform these actions. It can be made to be fairly precise. More specifically, we can create an internal user just for the tasks API that just has permission to interact with the `.tasks` index. This change doesn't do that, instead, it uses the ubiquitus "xpack" user which has most permissions because it is simpler. Adding the tasks user is something I'd like to get to in a follow up change. Instead, the majority of this change is about moving the "origin" concept from the security portion of x-pack into the server. This should allow any code to use the origin. To keep the change managable I've also opted to deprecate rather than remove the "origin" helpers in the security code. Removing them is almost entirely mechanical and I'd like to that in a follow up as well. Relates to #35573	2018-11-28 09:28:27 -05:00
Christoph Büscher	51a7dc54ec	Fix custom AUTO issue with Fuzziness#toXContent (#35807 ) Currently when a Fuzziness instance with custom AUTO distance values gets written to XContent, the customized lower and upper distance values are ommited and can consequently not be parsed back. This changes this to write the String including the optional custom values when writing to XContent and fixes the tests that should have caught this in the first place, e.g. by adding the custom low and high distance values to the equality check.	2018-11-28 15:07:11 +01:00
Andrey Ershov	0b45fb98b9	[Zen2] Generate coordinationMetaData with different configs (#35991 ) This PR fixes test failure, which is caused by equal randomly generated lastAcceptedConfiguration and lastCommittedConfguration.	2018-11-28 14:49:07 +01:00
Yannick Welsch	5f0c036183	Disable testDeleteCreateInOneBulk on Zen2 This test needs adaptation to run with Zen2	2018-11-28 13:34:48 +01:00
Christoph Büscher	2f547bac65	Remove deprecated methods from QueryStringQueryBuilder (#35912 ) This change removes the deprecated useDisMax() and useAllFields() methods from the QueryStringQueryBuilder and related tests. The disMax parameter has already been a no-op since 6.0 and also the useAllFields has been deprecated since 6.0 and there is a direct replacement via defaultField.	2018-11-28 11:09:03 +01:00
Jeff Hajewski	49087f16f5	Adds deprecation logging to ScriptDocValues#getValues. (#34279 ) `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. Closes #22919	2018-11-27 14:30:13 -05:00
Julie Tibshirani	25c416b12d	Deprecate types in search and multi search templates. (#35669 ) This PR adds deprecation warnings to the relevant `RestAction` classes, plus tests in `RestActionTests`. No updates to REST tests, the Java HLRC, or documentation were necessary, since they didn't make use of types.	2018-11-27 10:19:19 -08:00
Simon Willnauer	ad1f0dccd4	Validate metdata on `_msearch` (#35938 ) MultiSearchRequests issues through `_msearch` now validate all keys in the metadata section. Previously unknown keys were ignored while now an exception is thrown. Closes #35869	2018-11-27 17:08:24 +01:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Christophe Bismuth	adc0b560c0	Raise a 404 exception when document source is not found (#33384 ) (#34083 ) This pull request makes the `RestGetSourceAction` return a `ResourceNotFoundException` with a proper JSON response when source or document itself is missing (see issue #33384). Here is below a sample JSON output: ``` { "error": { "root_cause": [ { "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" } ], "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" }, "status": 404 } ```	2018-11-27 10:35:45 -05:00
Andrey Ershov	0e283f9670	[Zen2] PersistedState interface implementation (#35819 ) Today GatewayMetaState is capable of atomically storing MetaData to disk. We've also moved fields that are needed to be persisted in Zen2 from ClusterState to ClusterState.MetaData.CoordinationMetaData. This commit implements PersistedState interface. version and currentTerm are persisted as a part of Manifest. GatewayMetaState now implements both ClusterStateApplier and PersistedState interfaces. We started with two descendants Zen1GatewayMetaState and Zen2GatewayMetaState, but it turned out to be not easy to glue it. GatewayMetaState now constructs previousClusterState (including MetaData) and previousManifest inside the constructor so that all PersistedState methods are usable as soon as GatewayMetaState instance is constructed. Also, loadMetaData is renamed to getMetaData, because it just returns previousClusterState.metaData(). Sadly, we don't have access to localNode (obtained from TransportService in the constructor, so getLastAcceptedState should be called, after setLocalNode method is invoked. Currently, when deciding whether to write IndexMetaData to disk, we're comparing current IndexMetaData version and received IndexMetaData version. This is not safe in Zen2 if the term has changed. So updateClusterState now accepts incremental write method parameter. When it's set to false, we always write IndexMetaData to disk. Things that are not covered by GatewayMetaStateTests are covered by GatewayMetaStatePersistedStateTests. This commit also adds an option to use GatewayMetaState instead of InMemoryPersistedState in TestZenDiscovery. However, by default InMemoryPersistedState is used and only one test in PersistedStateIT used GatewayMetaState. In order to use it for other tests, proper state recovery should be implemented.	2018-11-27 15:04:52 +01:00
Martijn van Groningen	447e5d212a	Changed versions in serialization code after backporting #35535	2018-11-27 08:00:06 +01:00
Gordon Brown	119835decd	Always enforce cluster-wide shard limit (#34892 ) This removes the option to run a cluster without enforcing the cluster-wide shard limit, making strict enforcement the default and only behavior. The limit can still be adjusted as desired using the cluster settings API.	2018-11-26 17:05:12 -07:00
Igor Motov	663563f64b	Geo: better handling of malformed geo_points (#35554 ) Improves handling of malformed geo_points when `ignore_malformed` is set to true Closes #35419	2018-11-26 09:44:42 -10:00
Jim Ferenczi	900caa20ef	Handles exists query in composite aggs (#35758 ) This commit adds the support for exists query in the sorted execution mode of the `composite` aggregation. We'll execute the aggregation from the sorted points and use early termination if the main query is an `exists` query over the first source of the `composite` aggregation.	2018-11-26 19:08:14 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Simon Willnauer	ca9b2b9931	Repsect indices options on _msearch (#35887 ) Today we don't respect the indices options when they are passed as request parameters to the `_msearch` endpoint. This is unintuitive and doesn't cause any errors. This changes uses the top-level indices options as the defaults for each sub search-request. Closes #35851	2018-11-26 14:26:39 +01:00
Christophe Bismuth	04ebc63e34	RoutingMissingException in more like this (#33974 ) More like this query allows to provide identifiers of documents to be retrieved as like/unlike items. It can happen that at retrieval time an error is thrown, for instance caused by missing routing value when `_routing` is set required in the mapping. Instead of ignoring such error and returning no documents for the query, the error should be re-thrown and returned to users. As part of this change also mget and mtermvectors are unified in the way they throw such exception like it happens in other places, so that a `RoutingMissingException` is raised. Closes #29678	2018-11-26 13:57:57 +01:00
Tanguy Leroux	9bdbba23f8	[Tests] Fix IndexShardTests.testAcquirePrimaryAllOperationsPermits() This test fails on CI because of an inappropriate assertion, which is I think a leftover and has no real value.	2018-11-26 13:44:12 +01:00
Luca Cavanna	e44390ac20	InitialSearchPhase minor cleanups (#35864 ) This commit simplifies the throttling logic in InitialSearchPhase and removes some asserts from it. Also, a few formatting changes are applied to its code and surrounding classes.	2018-11-26 13:42:41 +01:00
David Turner	a68a46450b	[Zen2] Add lag detector (#35685 ) A publication can succeed and complete before all nodes have applied the published state and acknowledged it, thanks to the publication timeout; however we need every node eventually either to apply the published state (or a later state) or be removed from the cluster. This change introduces the LagDetector which achieves this liveness property by removing any lagging nodes from the cluster.	2018-11-26 10:52:49 +00:00
iverase	401b814d1a	[CI] Muting method testOperationPermitOnReplicaShards in IndexShardTests Relates to #35850	2018-11-26 09:33:23 +01:00
Martijn van Groningen	7624734f14	Added wait_for_metadata_version parameter to cluster state api. (#35535 ) The `wait_for_metadata_version` parameter will instruct the cluster state api to only return a cluster state until the metadata's version is equal or greater than the version specified in `wait_for_metadata_version`. If the specified `wait_for_timeout` has expired then a timed out response is returned. (a response with no cluster state and wait for timed out flag set to true) In the case metadata's version is equal or higher than `wait_for_metadata_version` then the api will immediately return. This feature is useful to avoid external components from constantly polling the cluster state to whether somethings have changed in the cluster state's metadata.	2018-11-26 08:50:08 +01:00
Simon Willnauer	4711c5cdf3	Always return false from `refreshNeeded` on ReadOnlyEngine (#35837 ) Acquiring a searcher is unnecessary to determine if a refresh is necessary since read-only engines never refresh. Closes #35785	2018-11-24 09:25:42 +01:00
Simon Willnauer	e46e44ce38	Wrap can_match reader with ElasticsearchDirectoryReader (#35857 ) Code that operates on-top of the engine requires all readers returned to be unwrapped into ElasticsearchDirectoryReader. The special reader the FrozenEngine uses wasn't wrapped.	2018-11-24 09:23:53 +01:00
Andrey Ershov	f47636b254	[Zen2] Introduce VotingTombstone class (#35832 ) Today voting tombstones are stored in CoordinationMetaData as Set<DiscoveryNode>. DiscoveryNode is not a lightweight object and have a lot of fields. It also has toXContent method, but no fromXContent method and the output of toXContent is not enough to re-create DiscoveryNode object. And votingTombstone set should be persisted as a part of MetaData. On the other hand, the only thing required from the tombstone is the nodeId. This PR adds VotingTombstone class for voting tombstones, which consists of two fields for now - nodeId and nodeName. It could be extended/shrank in the future if needed. This PR also resolves TODO's related to the voting tombstones xcontent story. Example of CoordinationMetaData.toXContent with voting tombstones: { "term": 1, "last_committed_config": [ "fkwLdOBvXSlgRTBfgNAL", "tmQiPGHvUxXzPkkCDSJo", "HhOmtQBZAThpHIGWhxpz", "qZHWGpoDNPYRNIiqKsDl" ], "last_accepted_config": [ "lhqacKmriwhHGFZcvqbx", "MYysmBuROkvJRlDcusyd" ], "voting_tombstones": [ { "node_id": "McjbZbRkEz", "node_name": "pdKIWeNJUO" }, { "node_id": "cpXkVibGwo", "node_name": "UnCvFgdVsc" }, { "node_id": "EylRNOztbc", "node_name": "ohOhkbMWZX" } ] }	2018-11-23 18:34:06 +01:00
Yannick Welsch	51d2e986c5	Remove BWC conditions after backport of #35731 This PR was backported to 6.x, so the extra BWC conditions are not needed anymore	2018-11-23 17:11:06 +01:00
Adrien Grand	5b370316a6	Remove some legacy code from when indices could have multiple types. (#35815 ) This code is only necessary up to indices created with version 5.x while 7.0 only supports indices created with 6.x or 7.0.	2018-11-23 15:15:26 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Christoph Büscher	88d862e69f	[CI] Muting two methods in IndexShardTests Relates to #35850	2018-11-23 14:29:26 +01:00
David Turner	d01436de3c	Copy checkpoint atomically when rolling generation (#35407 ) Today when rolling a transog generation we copy the checkpoint from `translog.ckp` to `translog-nnnn.ckp` using a simple `Files.copy()` followed by appropriate `fsync()` calls. The copy operation is not atomic, so if we crash at the wrong moment we can leave an incomplete checkpoint file on disk. In practice the checkpoint is so small that it's either empty or fully written. However, we do not correctly handle the case where it's empty when the node restarts. In contrast, in `recoverFromFiles()` we _do_ copy the checkpoint atomically. This commit extracts the atomic copy operation from `recoverFromFiles()` and re-uses it in `rollGeneration()`.	2018-11-23 08:43:34 +00:00
Jim Ferenczi	be69a774df	Fix analyzed prefix query in query_string (#35756 ) This change fixes analyzed prefix queries in `query_string` to be ignored if all terms are removed during the analysis. Closes #31702	2018-11-23 09:42:23 +01:00
Tanguy Leroux	2e37f17a7d	Expose all permits acquisition in IndexShard and TransportReplicationAction (#35540 ) This pull request exposes two new methods in the IndexShard and TransportReplicationAction classes in order to allow transport replication actions to acquire all index shard operation permits for their execution. It first adds the acquireAllPrimaryOperationPermits() and the acquireAllReplicaOperationsPermits() methods to the IndexShard class which allow to acquire all operations permits on a shard while exposing a Releasable. It also refactors the TransportReplicationAction class to expose two protected methods (acquirePrimaryOperationPermit() and acquireReplicaOperationPermit()) that can be overridden when a transport replication action requires the acquisition of all permits on primary and/or replica shard during execution. Finally, it adds a TransportReplicationAllPermitsAcquisitionTests which illustrates how a transport replication action can grab all permits before adding a cluster block in the cluster state, making subsequent operations that requires a single permit to fail). Related to elastic #33888	2018-11-23 09:26:38 +01:00
Dimitris Athanasiou	43d6ec8bcd	Remove unnecessary throws IOException in CompressedXContent.string() (#35821 )	2018-11-22 15:08:46 +00:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Albert Zaharovits	4fc911a129	Mute test InternalEngineTests Relates #35823	2018-11-22 15:34:53 +02:00
Tanguy Leroux	f9f7261d60	Revert "Revert "[RCI] Check blocks while having index shard permit in TransportReplicationAction (#35332 )"" This reverts commit `d3d7c01`	2018-11-22 12:13:19 +01:00
Mayya Sharipova	b6014d971c	Forbid negative scores in functon_score query (#35709 ) * Forbid negative scores in functon_score query - Throw an exception when scores are negative in field_value_factor function - Throw an exception when scores are negative in script_score function Relates to #33309	2018-11-22 06:08:48 -05:00
Tanguy Leroux	11052b75c7	TransportResyncReplicationAction should not honour blocks (#35795 ) After #35332 has been merged, we noticed some test failures like #35597 in which one or more replica shards failed to be promoted as primaries because the primary replica re-synchronization never succeed. After some digging it appeared that the execution of the resync action was blocked because of the presence of a global cluster block in the cluster state (in this case, the "no master" block), making the resync action to fail when executed on the primary. Until #35332 such failures never happened because the TransportResyncReplicationAction is skipping the reroute phase, the only place where blocks were checked. Now with #35332 blocks are checked during reroute and also during the execution of the transport replication action on the primary. After some internal discussion, we decided that the TransportResyncReplicationAction should never be blocked. This action is part of the replica to primary promotion and makes sure that replicas are in sync and should not be blocked when the cluster state has no master or when the index is read only. This commit changes the TransportResyncReplicationAction to make obvious that it does not honor blocks. It also adds a simple test that fails if the resync action is blocked during the primary action execution. Closes #35597	2018-11-22 10:50:12 +01:00
David Turner	cfdf666672	[Zen2] Fix test failures in diff-based publishing (#35684 ) `testIncompatibleDiffResendsFullState` sometimes makes a 2-node cluster and then partitions one of the nodes from the leader, which makes the leader stand down. Then when the partition is removed the cluster re-forms but does so by sending full cluster states, not diffs, causing the test to fail. Additionally `testDiffBasedPublishing` sometimes fails if a publication is delivered out-of-order, wiping out a fresher last-received cluster state with a less-fresh one. This is fixed here by passing the received cluster state to the coordinator before recording it as the last-received one, relying on the coordinator's freshness checks.	2018-11-22 09:08:52 +00:00
Igor Motov	39789d0a10	GEO: More robust handling of ignore_malformed in geoshape parsing (#35603 ) Adds an XContent sub parser class that can to wrap another XContent parser at the beginning of an object and allow skiping all children in case of the parsing failure. It also uses this subparser to ignore the rest of the GeoJson shape if the parsing fails and we need to ignore the geoshape due to the ignore_malformed flag. Supersedes #34498 Closes #34047	2018-11-21 11:04:01 -10:00
Armin Braun	1a5553d495	MINOR: Cleanup Runnables in SnapshotsService (#35796 ) * Simplify complex `Runnable` by moving to `AbstractRunnable`	2018-11-21 19:58:51 +01:00
Nick Knize	3bee25cb70	[GEO] Add support to ShapeBuilders for building Lucene geometry (#35707 ) * [GEO] Add support to ShapeBuilders for building Lucene geometry This commit adds support for building lucene geometry from the ShapeBuilders. This is needed for integrating LatLonShape as the primary indexing approach for geo_shape field types. All unit and integration tests are updated to add randomization for testing both jts/s4j shapes and lucene shapes.	2018-11-21 11:15:01 -06:00
Yannick Welsch	c816347253	Disable testClusterJoinDespiteOfPublishingIssues for Zen2 This test is failing sometimes with Zen2 due to the lack of lag detection. Zen1 does not have this problem as it only considers a join as valid if the corresponding cluster state update is successfully published and committed on the joining node.	2018-11-21 17:21:44 +01:00
Andrey Ershov	a056bd8c1c	[Zen2] Move ClusterState fields to be persisted to ClusterState.MetaData (#35625 ) Today we have a way to atomically persist global MetaData and IndexMetaData to disk when new ClusterState is received. All other ClusterState fields are not persisted. However, there are other parts of ClusterState that should be persisted, namely: version term lastCommittedConfiguration lastAcceptedConfiguration votingTombstones version is changed frequently, other fields are not. We decided to group term, lastCommittedConfiguration, lastAcceptedConfiguration and votingTombstones into CoordinationMetaData class and make CoordinationMetaData a field inside MetaData. MetaData.toXContent and MetaData.fromXContent should take care of CoordinationMetaData. version stays as a top level field in ClusterState and will be persisted as part of Manifest in a follow-up commit. Also MetaData.isGlobalStateEquals should be extended to include coordinationMetaData in comparison. This commit favors exposing getters, such as getTerm directly in ClusterState to avoid massive code changes. An example of CoordinationMetaState.toXContent: { "term": 1, "last_committed_config": [ "TiIuBcbBtpuXyDDVHXeD", "ZIAoVbkjjLPLUuYLaTkw" ], "last_accepted_config": [ "OwkXbXZNOZPJqccdFHdz", "LouzsGYwmQzpeQMrboZe", "fCKGRZdjLTqzXAqPUtGL", "pLoxshjpJXwDhbgjfYJy", "SjINLwFIlIEFZCbjrSFo", "MDkVncJEVyZLJktopWje" ] }	2018-11-21 17:03:26 +01:00
Andrey Ershov	6ac0cb1842	Merge branch master into zen2 2 types of conflicts during the merge: 1) Line length fix 2) Classes no longer extend AbstractComponent	2018-11-21 15:36:49 +01:00
Yannick Welsch	8939a7894f	Zen2: Move disruption tests to Zen2 (#35724 ) - Moves disruption tests to Zen2 - Registers a few missing settings - Removes .put(TestZenDiscovery.USE_ZEN2.getKey(), true) from tests where Zen2 is now enabled by default through the parent test class - Moves QuorumGatewayIT back to Zen1, as it is not stable with Zen2 as it currently relies on dangling indices due to the lack of proper CS persistence, which triggers secondary failures	2018-11-21 14:43:33 +01:00
Armin Braun	bdf632b6f9	SNAPSHOTING+MINOR: Simplify SnapshortShardService (#35769 )	2018-11-21 13:50:17 +01:00
Christoph Büscher	6638708b56	Remove deprecated QueryStringQueryBuilder#splitOnWhiteSpace (#35763 ) This parameter has been deprecated and was ignored since 6.0, so its Java API methods can be removed.	2018-11-21 10:29:08 +01:00
Christoph Büscher	5847f8379c	Move ScoreAccessor to test-framework (#35766 ) This class is only used by RandomScoreFunctionIT and the MockScriptEngine, so it shouldn't be part of the server codebase.	2018-11-21 10:28:31 +01:00
Christoph Büscher	ff03443ab9	Fix problem with MatchNoDocsQuery in disjunction queries (#35726 ) Queries across multiple fields generate MatchNoDocsQuerys for fields that are unmapped. In certain situation this can lead to erroneous behaviour, for example when an umapped field is used in a query_string query across several fields. If some of the tokens in the query string get eliminated by an analyzer on the mapped fields, the same token will currently generate MatchNoDocsQuerys combined into a disjunction, which in turn leads to no matches in the overall query. Instead we should simply not add MatchNoDocsQuerys to those disjunctions. Closes #34708	2018-11-21 03:49:49 +01:00
Mayya Sharipova	643bb20137	Add a new query type - ScriptScoreQuery (#34533 ) * Add a new query type - ScriptScoreQuery script_score query uses script to calculate document scores. Added as a substitute for function_score with an intentation to deprecate function_scoreq query. ```http GET /_search { "query": { "script_score" : { "query": { "match": { "message": "elasticsearch" } }, "script" : { "source": "Math.log(2 + doc['likes'].value)" }, "min_score" : 2 } } } ``` Add several functions to painless to be used inside script_score: double rational(double, double) double sigmoid(double, double, double) double randomNotReproducible() double randomReproducible(String, int) double decayGeoLinear(String, String, String, double, GeoPoint) double decayGeoExp(String, String, String, double, GeoPoint) double decayGeoGauss(String, String, String, double, GeoPoint) double decayNumericLinear(String, String, String, double, double) double decayNumericExp(String, String, String, double, double) double decayNumericGauss(String, String, String, double, double) double decayDateLinear(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateExp(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateGauss(String, String, String, double, JodaCompatibleZonedDateTime) Date functions only works on dates in the default format and default time zone	2018-11-20 16:10:06 -05:00
Ryan Ernst	c07ad67718	Add 6.5.2 version	2018-11-20 12:33:26 -08:00
Gordon Brown	4bda469861	Fix line lengths in misc other files (#35650 )	2018-11-20 12:29:48 -07:00
Vladimir Dolzhenko	953c8586df	Fix PrimaryAllocationIT#testForceStaleReplicaToBePromotedToPrimary (#35728 ) Closes 35497	2018-11-20 20:05:23 +01:00
Christoph Büscher	e91f404d16	Remove remains of 'auto_generate_phrase_queries' (#35735 ) This parameter in the `query_string` query was deprecated in 6.0 and ignored since then. Its API methods and remaining uses can be removed in the upcoming major version. Relates to #35734	2018-11-20 16:07:17 +01:00
Jim Ferenczi	a5f5ceb869	Remove remaining line length violations for o.e.index (#35652 ) This commit removes line length violations in the classes under org.elasticsearch.index.	2018-11-20 08:09:14 +01:00
Simon Willnauer	29ef442841	Add a `_freeze` / `_unfreeze` API (#35592 ) This commit adds a rest endpoint for freezing and unfreezing an index. Among other cleanups mainly fixing an issue accessing package private APIs from a plugin that got caught by integration tests this change also adds documentation for frozen indices. Note: frozen indices are marked as `beta` and available as a basic feature. Relates to #34352	2018-11-20 08:03:24 +01:00
Benjamin Trent	7a779a9988	[TEST] fixing ClusterApplierService#testClusterStateUpdateLogging() (#35714 )	2018-11-19 14:23:49 -06:00
Yannick Welsch	47ada69c46	Zen2: Move most integration tests to Zen2 (#35678 ) Zen2 is now feature-complete enough to run most ESIntegTestCase tests. The changes in this PR are as follows: - ClusterSettingsIT is adapted to not be Zen1 specific anymore (it was using Zen1 settings). - Some of the integration tests require persistent storage of the cluster state, which is not fully implemented yet (see #33958). These tests keep running with Zen1 for now but will be switched over as soon as that is fully implemented. - Some very few integration tests are not running yet with Zen2 for other reasons, depending on some of the other open points in #32006.	2018-11-19 21:15:29 +01:00
Andrey Ershov	f9ecd0c49e	[Zen2] Write manifest file (#35049 ) Elasticsearch node is responsible for storing cluster metadata. There are 2 types of metadata: global metadata and index metadata. `GatewayMetaState` implements `ClusterStateApplier` and receives all `ClusterStateChanged` events and is responsible for storing modified metadata to disk. When new `ClusterStateChanged` event is received, `GatewayMetaState` checks if global metadata has changed and if it's the case writes new global metadata to disk. After that `GatewayMetaState` checks if index metadata has changed or there are new indices assigned to this node and if it's the case writes new index metadata to disk. Atomicity of global metadata and index metadata writes is ensured by `MetaDataStateFormat` class. Unfortunately, there is no atomicity when more than one metadata changes (global and index, or metadata for two indices). And atomicity is important for Zen2 correctness. This commit adds atomicity by adding a notion of manifest file, represented by `MetaState` class. `MetaState` contains pointers to current metadata. More precisely, it stores global state generation as long and map from `Index` to index metadata generation as long. Atomicity of writes for manifest file is ensured by `MetaStateFormat` class. The algorithm of writing changes to the disk would be the following: 1. Write global metadata state file to disk and remember it's generation. 2. For each new/changed index write state file to disk and remember it's generation. For each not-changed index use generation from previous manifest file. If index is removed or this node is no longer responsible for this index - forget about the index. 3. Create `MetaState` object using previously remembered generations and write it to disk. 4. Remove old state files for global metadata, indices metadata and manifest. Additonally new implementation relies on enhanced `MetaDataStateFormat` failure semantics, `applyClusterState` throws IOException, whose descendant `WriteStateException` could be (and should be in Zen2) explicitly handled.	2018-11-19 19:49:44 +01:00
Gordon Brown	b2057138a7	Remove AbstractComponent from AbstractLifecycleComponent (#35560 ) AbstractLifecycleComponent now no longer extends AbstractComponent. In order to accomplish this, many, many classes now instantiate their own logger.	2018-11-19 09:51:32 -07:00
Nik Everett	036c15d8a3	Build: Fix official plugins list (#35661 ) The list of official plugins accidentally included `qa` projects like, well, `qa` and `amazon-ec2`. This changes the mechanism that we use to build the list and adds a test to catch this. Closes #35623	2018-11-19 10:17:05 -05:00
Ekal Golas	450db7fcf7	[Tests] Fix slowness of AutoDateHistogramAggregatorTests (#35072 ) Randomize test assertion and test set size instead of asserting on an exhaustive list of dates with fixed test set size. Also refactor common objects used to avoid recreating them, avoid date to string conversion and reduce duplicate test code Closes #33181	2018-11-19 15:21:01 +01:00
Martijn van Groningen	c468d928b8	Remove line length violations in o.e.indices package (#35647 ) Relates to #34884	2018-11-19 07:36:24 +01:00
Lee Hinman	22933ebcd5	Fix line length for org.elasticsearch.action.* files (#35607 ) This removes the checkstyle suppressions for things in the `action` package. Relates to #34884	2018-11-16 14:11:39 -07:00
Arthur Gavlyukovskiy	022726011c	Remove use of AbstractComponent in server (#35444 ) Removed extending of AbstractComponent and changed logger usage to explicit declaration. Abstract classes still have logger declaration using this.getClass() in order to show implementation class name in its logs. See #34488	2018-11-16 16:10:32 -05:00
Julie Tibshirani	c6a0904e0e	Deprecate types in count and msearch. (#35421 ) * Deprecate types in count requests. * Move RestCountAction to the 'search' package. * Deprecate types in multi search requests. * Add tests for types deprecation in the _search endpoint.	2018-11-16 13:04:43 -08:00
Jack Conradson	d62bbca56d	[Scripting] Use Number as a return value for BucketAggregationScript (#35653 ) This change fixes #35351. Users were no longer able to return types of numbers other than doubles for bucket aggregation scripts. This change reverts to the previous behavior of being able to return any type of number and having it converted to a double outside of the script.	2018-11-16 12:33:01 -08:00
Jernej Klancic	baf33b3162	Removes AbstractComponent from several classes (#35566 ) Removes inhertiting from AbstractComponent for some classes (mostly in the plugins module). Relates to #34488	2018-11-16 20:50:18 +01:00
Julie Tibshirani	40ba4de5e6	Deprecate types in validate query requests. (#35575 )	2018-11-16 08:59:04 -08:00
Tanguy Leroux	d3d7c0158e	Revert "[RCI] Check blocks while having index shard permit in TransportReplicationAction (#35332 )" This reverts commit `31567cefb4`.	2018-11-16 15:38:51 +01:00
Tom Callahan	76b77dbaa4	Remove remaining line length violations for o.e.action.admin.cluster (#35156 ) This inserts newlines in order to reduce line lengths in the o.e.action.admin.cluster package to 140 characters or less. This also remves the checkstyle suppressions for affected files. Relates #34884, #34923	2018-11-16 09:37:42 -05:00
Christoph Büscher	2da4bc85cf	Remove usages of CharSequence in Sets (#35501 ) The javadocs of the CharSequence interface state that not all of its implementations define the general contracts of the Object#equals and Object#hashCode methods, therefore it is dangerous to use different CharSequence instances as elements in a set or as keys in a map. While we probably mostly use Strings in sets, in some places this is not enforced. To prevent this from accidentally happening, this change replaces all occurances of Set<CharSequence> which are currently mostly used in the completion suggester code with the more concrete usage of Set<String>.	2018-11-16 09:35:22 +01:00
Jim Ferenczi	903d27a8fd	Fix phrase_slop in query_string query (#35533 ) This change ensures that custom `phrase_slop`s are correctly handled in the `query_string` query. Fixes #35125	2018-11-16 09:08:07 +01:00
Lee Hinman	ce35d049e9	[TEST] Fix ClusterApplierServiceTests.testClusterStateUpdateLogging This changes the test to not use a `CountDownlatch`, instead adding an assertion for the final logging message and waiting until the `MockAppender` has seen it before proceeding. Resolves #23739	2018-11-15 14:15:23 -07:00
David Turner	86ef041539	[Zen2] Introduce ClusterBootstrapService (#35488 ) Today, the bootstrapping of a Zen2 cluster is driven externally, requiring something else to wait for discovery to converge and then to inject the initial configuration. This is hard to use in some situations, such as REST tests. This change introduces the `ClusterBootstrapService` which brings the bootstrap retry logic within each node and allows it to be controlled via an (unsafe) node setting.	2018-11-15 20:09:22 +00:00
Jim Ferenczi	c7a2c6d549	Handle IndexOrDocValuesQuery in composite aggregation (#35392 ) The `composite` aggregation can optimize its execution when the query is a `match_all` or a `range` over the field that is used in the first source of the aggregation. However we only check for instances of `PointRangeQuery` whereas the range query builder creates an `IndexOrDocValuesQuery`. This means that today the optimization does not apply to `range` query even if the code could handle it. This change fixes this issue by extracting the index query inside `IndexOrDocValuesQuery`.	2018-11-15 17:52:06 +01:00
David Turner	928ab6a5b1	Suppress CachedTimeThread in hot threads output (#35558 ) In #23175 we renamed `ThreadPool$EstimatedTimeThread` to `ThreadPool$CachedTimeThread` but did not update the corresponding entry in `HotThreads#isIdleThread`. This commit addresses this.	2018-11-15 08:25:09 +00:00
Tanguy Leroux	c9b4ef0dfd	Use RunOnce when appropriate (#35553 ) This pull request replaces some blocks of code that must be run once and that are currently based on AtomicBoolean by the convient RunOnce class added in #35489.	2018-11-15 09:24:40 +01:00
David Turner	135c3f0f07	Merge branch 'master' into zen2	2018-11-15 08:24:26 +00:00
Tanguy Leroux	5cd9b518ac	Fix usages of XContentParserUtils.ensureExpectedToken() (#35530 ) Some usages of XContentParserUtils.ensureExpectedToken() are inverting the expected and actual tokens, resulting in wrong error messages.	2018-11-15 09:18:32 +01:00
Hendrik Muhs	fc774a3776	add ES 6.5.1 (on master) (#35549 ) add ES 6.5.1	2018-11-14 21:08:21 +01:00
Tanguy Leroux	c8c8ce2374	Extract RunOnce into a dedicated class (#35489 ) This commit extracts the static inner class RunOnce from WorkerBulkByScrollTaskState so that it can be reused at other places.	2018-11-14 17:33:04 +01:00
Andrey Ershov	045fdd0d3b	Merge master into zen2	2018-11-14 15:37:13 +03:00
Tanguy Leroux	bbe50e7a86	Remove LoggingRunnable class (#35486 ) This commit removes the unused LoggingRunnable class.	2018-11-14 10:12:25 +01:00
Tanguy Leroux	31567cefb4	[RCI] Check blocks while having index shard permit in TransportReplicationAction (#35332 ) Today, the TransportReplicationAction checks the global level blocks and the index level blocks before routing the operation to the primary, in the ReroutePhase, and it happens at the very beginning of the transport replication action execution. For the upcoming rework of the Close Index API and in order to deal with primary relocation, we'll need to also check for blocks before executing the operation on the primary (while holding a permit) but before routing to the new primary. This pull request change the AsyncPrimaryAction so that it checks for replication action's blocks before executing the operation locally or before routing the primary action to the newly primary shard. The check is done while holding a PrimaryShardReference. Related to #33888	2018-11-14 09:43:55 +01:00
Hendrik Muhs	5c84708ee5	test: expose error message on failure	2018-11-14 08:25:41 +01:00
Christoph Büscher	d8b1c23e1d	Remove Comparable interface from ScoreAccessor (#35519 ) The way ScoreAccessor implements `compareTo()` is problematic because it doesn't completely follow the Comparable contract, specificaly symmetry (if x is a ScoreAccessor and y any Number then x.comparTo(y) works, but y.compareTo(x) generally does not even compile). Fortunately we don't seem to use the fact that ScoreAccessor is a Comparable anywhere, so we can simply remove it.	2018-11-14 05:58:05 +01:00
David Turner	229637fd7e	[Zen2] Remove duplicate discovered peers (#35505 ) Today the `PeerFinder` probes each address it obtains, identifies the node to which it just connected, and then returns all such nodes. However, this can lead to duplicates if a node manages to connect to another node via two distinct addresses. This causes bootstrapping to fail since `BootstrapConfiguration#resolve` forbids duplicates. This change alters the behaviour of the `PeerFinder` to remove duplicates in this situation.	2018-11-13 22:30:36 +00:00
Vladimir Dolzhenko	9728119b82	[CI] AllocationIdIT testFailedRecoveryOnAllocateStalePrimaryRequiresAnotherAllocateStalePrimary failure Closes #35504	2018-11-13 20:57:50 +01:00
David Turner	8e40a2bbe2	[Zen2] Introduce vote withdrawal (#35446 ) If shutting down half or more of the master-eligible nodes, their votes must first be explicitly withdrawn to ensure that the cluster doesn't lose its quorum. This works via _voting tombstones_, stored in the cluster state, which tell the reconfigurator to remove nodes from the voting configuration. This change introduces voting tombstones to the cluster state, together with transport APIs for adding and removing them, and makes use of these APIs in `InternalTestCluster` to support tests which remove at least half of the master-eligible nodes at once (e.g. shrinking from two master-eligible nodes to one).	2018-11-13 19:32:32 +00:00
David Turner	0e1a12122c	Merge branch 'master' into zen2	2018-11-13 15:25:35 +00:00
David Turner	fbd3cab410	[Zen2] Remove AbstractComponent usage (#35483 ) AbstractComponent was deprecated in #35140 and is looking like it will be removed at some point by #34888. Today all it does is provide a logger. This change removes the usages of AbstractComponent that live solely in the zen2 feature branch to avoid some future merge pain, and replaces it where necessary with some directly-created loggers.	2018-11-13 15:20:49 +00:00
Simon Willnauer	3229dfc4de	Allow efficient can_match phases on frozen indices (#35431 ) This change adds a special caching reader that caches all relevant values for a range query to rewrite correctly in a can_match phase without actually opening the underlying directory reader. This allows frozen indices to be filtered with can_match and in-turn searched with wildcards in a efficient way since it allows us to exclude shards that won't match based on their date-ranges without opening their directory readers. Relates to #34352 Depends on #34357	2018-11-13 14:53:55 +01:00
Christoph Büscher	0a6614a03a	Correct implemented interface of ParsedReverseNested (#35455 ) The ParsedReverseNested implementation should implement the ReverseNested interface and not the Nested interface. Although this is an empty marker interface it is confusing and can lead to casting errors. Also adding a test to check that both ParsedNested and ParsedReverseNested implement the correct interface. Closes #35449	2018-11-13 10:34:29 +01:00
Jason Tedor	a18b599d64	Handle OS pretty name on old OS without OS release (#35453 ) Some very old ancient versions of Linux do not have /etc/os-release. For example, old Red Hat-like OS. This commit adds a fallback for handling pretty name for these OS.	2018-11-12 19:31:12 -05:00
Tim Brooks	71cfb730f6	Register remote cluster compress setting (#35464 ) This is a follow up to #35357. That commit failed to register the new cluster.remote.cluster_name.transport.compress setting with `ClusterSettings`. This commit fixes that.	2018-11-12 16:07:42 -07:00
Igor Motov	e7896bcefc	Geo: enables coerce support in WKT polygon parser (#35414 ) WKT parser now automatically closes open polygons similar to GeoJSON parser if coerce flag in mapping is set to true. Closes to #35059	2018-11-12 09:40:04 -10:00
Jason Tedor	40ca62c298	Address handling of OS pretty name on some OS (#35451 ) Some OS (e.g., Oracle Linux Server 6.9) have a trailing space at the end of the PRETTY_NAME line in /etc/os-release. This commit addresses this by accounting for this trailing space when extracting the pretty name.	2018-11-12 14:27:57 -05:00
Yannick Welsch	d2ff01af13	Zen2: Add basic Zen1 transport-level BWC (#35443 ) Implements serialization compatibility between Zen1 and Zen2 transport action, allowing a Zen1 node to join a fully formed Zen2 cluster and vice-versa.	2018-11-12 19:31:10 +01:00
Nick Knize	2591f66a33	upgrade to lucene-8.0.0-snapshot-6d9c714052 (#35428 )	2018-11-12 10:48:27 -06:00
Yannick Welsch	fe29b18c26	Fix compilation	2018-11-12 11:05:11 +01:00
Yannick Welsch	4e6c58c942	Merge remote-tracking branch 'elastic/master' into zen2	2018-11-12 10:03:59 +01:00
Christoph Büscher	09cac321e7	Upgrade to Joda 2.10.1 (#35410 ) This version contains a bugfix that allows us to reenable one of our muted tests in DateTimeUnitTests. Closes #33749	2018-11-12 10:02:41 +01:00
Tim Brooks	ba478827ad	Improve MockTcpTransport memory usage (#35402 ) The MockTcpTransport is not friendly in regards to memory usage. It must allocate multiple byte arrays for every message. This improves the memory situation by failing fast if the message is improperly formatted. Additionally, it uses reusable big arrays for at least half of the allocated byte arrays.	2018-11-09 10:12:49 -07:00
David Turner	f69a5c9b3c	Fix compile error introduced by conflict in previous two commits	2018-11-09 15:50:11 +00:00
Jim Ferenczi	7054e289fa	Add trace log of the request for the query and fetch phases (#34479 ) This change adds a logger for the query and fetch phases that prints all requests before their execution at the trace level. This will help debugging cases where an issue occurs during the execution since only completed queries are logged by the slow logs.	2018-11-09 09:41:51 +01:00
Tim Brooks	bccc99c2be	Fix TcpTransport compression test (#35396 ) This commit fixes an assertion in the TcpTransportTests compresssion test.	2018-11-08 18:04:48 -07:00
Tim Brooks	93c2c604e5	Move compression config to ConnectionProfile (#35357 ) This is related to #34483. It introduces a namespaced setting for compression that allows users to configure compression on a per remote cluster basis. The transport.tcp.compress remains as a fallback setting. If transport.tcp.compress is set to true, then all requests and responses are compressed. If it is set to false, only requests to clusters based on the cluster.remote.cluster_name.transport.compress setting are compressed. However, after this change regardless of any local settings, responses will be compressed if the request that is received was compressed.	2018-11-08 10:37:59 -07:00
Jason Tedor	5c2a5f2e37	Adjust BWC version on OS pretty name This commit adjusts the BWC version the OS pretty name field on OsInfo now that this field has been backported to the 6.x development branch.	2018-11-08 12:24:10 -05:00
Jason Tedor	730ec1ddfb	Add more detailed OS name on Linux (#35352 ) Today our OS information returned in node stats only returns a high-level name of the OS (e.g., "Linux"). Yet, for some uses this is too high-level and knowing at a finer level of granularity the underlying OS can be useful. This commit extracts the pretty name on Linux from /etc/os-release. This pretty name usually includes the Linux vendor and the Linux vendor version number (e.g., Fedora 28).	2018-11-08 12:16:58 -05:00
Yannick Welsch	c315ead0ac	Zen2: Add diff-based publishing (#35290 ) Enables diff-based publishing, which is an optimization where only the changing parts of the cluster state are published to the nodes in the cluster, falling back to full cluster state publishing if the receiver does not have the previous cluster state.	2018-11-08 17:16:09 +01:00
David Turner	6885a7cb0f	Introduce transport API for cluster bootstrapping (#34961 ) - Introduces a transport API for bootstrapping a Zen2 cluster - Introduces a transport API for requesting the set of nodes that a master-eligible node has discovered and for waiting until this comprises the expected number of nodes. - Alters ESIntegTestCase to use these APIs when forming a cluster, rather than injecting the initial configuration directly.	2018-11-08 16:09:37 +00:00
Christoph Büscher	113af7996c	Make limit on number of expanded fields configurable (#35284 ) Currently we introduced a hard limit of 1024 to the number of fields a query can be expanded to in #26541. Instead of using a hard limit, we should make this configurable. This change removes the hard limit check and uses the existing `max_clause_count` setting instead. Closes #34778	2018-11-08 17:04:40 +01:00
Daniel Mitterdorfer	6980feddd2	Remove unused class MemoryCircuitBreaker The class `MemoryCircuitBreaker` is unused so we remove all its traces from the code base. Relates #35367	2018-11-08 15:33:24 +01:00
David Turner	77789a733d	Merge branch 'master' into 2018-11-08-merge-master	2018-11-08 13:38:18 +00:00
Alpar Torok	518e0de078	Mute test #35365	2018-11-08 12:27:40 +02:00
Christoph Büscher	14b811446f	Preserve `date_histogram` format when aggregating on unmapped fields (#35254 ) Currently when aggregating on an unmapped date field (e.g. using a date_histogram) we don't preserve the aggregations `format` setting but instead use the default format. This can lead to loosing the aggregations `format` when aggregating over several indices where some of them contain unmapped date fields and are encountered first in the reduce phase. Related to #31760	2018-11-08 10:22:25 +01:00
Jim Ferenczi	891fdda68e	Allow unmapped fields in composite aggregations (#35331 ) Today the `composite` aggregation throws an error if a source targets an unmapped field and `missing_bucket` is set to false. Documents without a value for a source cannot produce any bucket if `missing_bucket` is not activated so the error is a shortcut to say that the response will be empty. However this is not consistent with the `terms` aggregation which accepts unmapped field by default even if the response is also guaranteed to be empty. This commit removes this restriction, if a source contains an unmapped field we now return an empty response (no buckets). Closes #35317	2018-11-08 09:30:52 +01:00
Tanguy Leroux	1703a61fec	[RCI] Add IndexShardOperationPermits.asyncBlockOperations(ActionListener<Releasable>) (#34902 ) The current implementation of asyncBlockOperations() can be used to execute some code once all indexing operations permits have been acquired, then releases all permits immediately after the code execution. This immediate release is not suitable for treatments that need to keep all permits over multiple execution steps. This commit adds a new asyncBlockOperations() that exposes a Releasable, making it possible to acquire all permits and only release them all when needed by closing the Releasable. The existing blockOperations() method has been modified to delegate permit acquisition/releasing to this new method. Relates to #33888	2018-11-08 09:23:33 +01:00
Jason Tedor	4f4fc3b8f8	Replicate index settings to followers (#35089 ) This commit uses the index settings version so that a follower can replicate index settings changes as needed from the leader. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2018-11-07 21:20:51 -05:00
Ryan Ernst	a4d979cfc8	Scripting: Add back lookup vars in score script (#34833 ) The lookup vars under params (namely _fields and _source) were inadvertently removed when scoring scripts were converted to using script contexts. This commit adds them back, along with deprecation warnings for those that should not be used.	2018-11-07 15:09:09 -08:00
Nhat Nguyen	ed8732b161	Use soft-deleted docs to resolve strategy for engine operation (#35230 ) A CCR test failure shows that the approach in #34474 is flawed. Restoring the LocalCheckpointTracker from an index commit can cause both FollowingEngine and InternalEngine to incorrectly ignore some deletes. Here is a small scenario illustrating the problem: 1. Delete doc with seq=1 => engine will add a delete tombstone to Lucene 2. Flush a commit consisting of only the delete tombstone 3. Index doc with seq=0 => engine will add that doc to Lucene but soft-deleted 4. Restart an engine with the commit (step 2); the engine will fill its LocalCheckpointTracker with the delete tombstone in the commit 5. Replay the local translog in reverse order: index#0 then delete#1 6. When process index#0, an engine will add it into Lucene as a live doc and advance the local checkpoint to 1 (seq#1 was restored from the commit - step 4). 7. When process delete#1, an engine will skip it because seq_no=1 is less than or equal to the local checkpoint. We should have zero document after recovering from translog, but here we have one. Since all operations after the local checkpoint of the safe commit are retained, we should find them if the look-up considers also soft-deleted documents. This PR fills the disparity between the version map and the local checkpoint tracker by taking soft-deleted documents into account while resolving strategy for engine operations. Relates #34474 Relates #33656	2018-11-07 15:26:30 -05:00
Martijn van Groningen	8de3c6e618	Ignore date ranges containing 'now' when pre-processing a percolator query (#35160 ) Today when a percolator query contains a date range then the query analyzer extracts that range, so that at search time the `percolate` query can exclude percolator queries efficiently that are never going to match. The problem is that if 'now' is used it is evaluated at index time. So the idea is to rewrite date ranges with 'now' to a match all query, so that the query analyzer can't extract it and the `percolate` query is then able to evaluate 'now' at query time.	2018-11-07 20:41:27 +01:00
Simon Willnauer	0cc0fd2d15	Add a frozen engine implementation (#34357 ) This change adds a `frozen` engine that allows lazily open a directory reader on a read-only shard. The engine wraps general purpose searchers in a LazyDirectoryReader that also allows to release and reset the underlying index readers after any and before secondary search phases. Relates to #34352	2018-11-07 20:23:35 +01:00
Vladimir Dolzhenko	f789d49fb3	Put a fake allocation id on allocate stale primary command (#34140 ) removes fake allocation id after recovery is done Relates to #33432	2018-11-07 20:18:11 +01:00
Simon Willnauer	2131e119d7	Apply `ignore_throttled` also to concrete indices (#35335 ) Today we only apply `ingore_throttled` to expansions from wildcards, date math expressions and aliases. Yet, this is tricky since we might have resolved certain expressions in pre-filter steps like security. It's more consistent to apply this logic to all expressions including concrete indices. Relates to #34354	2018-11-07 18:43:27 +01:00
Alpar Torok	5ae03195d3	Make version field names more meaningful (#35334 ) * Consolidate the name of the qualified build version * Field name in response should not be redundant	2018-11-07 18:36:02 +02:00
Alpar Torok	8a85b2eada	Remove build qualifier from server's Version (#35172 ) With this change, `Version` no longer carries information about the qualifier, we still need a way to show the "display version" that does have both qualifier and snapshot. This is now stored by the build and red from `META-INF`.	2018-11-07 14:01:05 +02:00
Vladimir Dolzhenko	a467a816bc	Fix DeleteRequest / GetRequest / UpdateRequest / ExplainRequest validation for null and/or empty id/type (#35314 ) Closes #35297	2018-11-07 12:52:11 +01:00
Tim Brooks	f395b1eace	Open node connections asynchronously (#35144 ) This is related to #29023. Additionally at other points we have discussed a preference for removing the need to unnecessarily block threads for opening new node connections. This commit lays the groudwork for this by opening connections asynchronously at the transport level. We still block, however, this work will make it possible to eventually remove all blocking on new connections out of the TransportService and Transport.	2018-11-06 17:58:20 -07:00
Julie Tibshirani	ec53288fc0	Remove include_type_name from the relevant APIs. (#35192 ) We've decided that the bulk, delete, get, index, update, and search APIs should not contain this request parameter, and we will instead accept both typed and typeless calls.	2018-11-06 14:33:48 -08:00
javanna	3c18aa0c15	add version 6.4.4	2018-11-06 19:20:20 +01:00
David Turner	7e356ac29b	[Zen2] Introduce auto_shrink_voting_configuration setting (#35217 ) Today we allow the user to set the minimum size of a voting configuration. On reflection we would rather this was simply '3' where possible, and we can use the retirement API to control the removal of nodes more explicitly. This change replaces the old reconfigurator setting with a new one, `cluster.auto_shrink_voting_configuration`, which determines whether Elasticsearch should automatically remove nodes from the voting configuration or not.	2018-11-06 18:10:29 +00:00
Nik Everett	348c28d1d1	Logger: Merge ESLoggerFactory into Loggers (#35146 ) `ESLoggerFactory` is now not particularly interesting and simple enought to fold entirely into `Loggers. So let's do that. Closes #32174	2018-11-06 10:49:54 -05:00
Simon Willnauer	6e58284683	Serialize ignore_throttled also to 6.6 after backport	2018-11-06 13:50:30 +01:00
javanna	17b7d2efcb	[TEST] increase await timeout in RemoteClusterConnectionTests We have seen an improvement when we bumped the timeout from 1s to 5s, but there are still a few failures for this tests. With this commit we bump the timeout to 10 seconds hoping it will stop all the failures.	2018-11-06 13:36:22 +01:00
Jim Ferenczi	999f8f5850	Adapt Lucene BWC version Bump the Lucene version used by ES 6.6 now that the 6.x branch is upgraded to Lucene 7.6.	2018-11-06 12:15:33 +01:00
Nick Knize	a5e1f4d3a2	Upgrade to lucene-8.0.0-snapshot-31d7dfe6b1 (#35224 )	2018-11-06 11:55:23 +01:00
Simon Willnauer	833e0f8ecf	Prevent throttled indices to be searched through wildcards by default (#34354 ) Today if a wildcard, date-math expression or alias expands/resolves to an index that is search-throttled we still search it. This is likely not the desired behavior since it can unexpectedly slow down searches significantly. This change adds a new indices option that allows `search`, `count` and `msearch` to ignore throttled indices by default. Users can force expansion to throttled indices by using `ignore_throttled=true` on the rest request to expand also to throttled indices. Relates to #34352	2018-11-06 09:45:30 +01:00
David Turner	2fb3d1a465	[Zen2] Fix some rarely-failing tests (#35198 ) Recent changes have left a few Zen2 tests occasionally failing. This commit fixes them.	2018-11-05 21:54:53 +00:00
Armin Braun	216c761a5d	MINOR: Remove Dead Code in Routing (#35074 ) * MINOR: Remove Dead Code in Routing	2018-11-05 20:40:27 +01:00
Yannick Welsch	4f35eea8fe	[TEST] Fix testConcurrentTermIncreaseOnReplicaShard This test has a bug that got introduced during the refactoring of #32442. With 2 concurrent term increments, we can only assert under the operation permit that we are in the correct operation term, not that there is not already another term bump pending. Closes #34862	2018-11-05 16:18:20 +01:00
Christoph Büscher	02043a2260	[Tests] Fix rare edge case in SimpleQueryStringBuilderTests (#35201 ) If the random query string is "now" by accident _and_ we are also not setting some field names to use explicitely, then we can hit the "mapped_date" field from default test setup. This correctly leads to the query being was marked as not cacheable, but we assume and check so later. This change fixes this rare edge case by making sure we don't hit the "date" field in this rare cases. Closes #35183	2018-11-05 13:31:13 +01:00
Alexander Reelsen	409050e8de	Refactor: Remove settings from transport action CTOR (#35208 ) As settings are not used in the transport action constructor, this removes the passing of the settings in all the transport actions.	2018-11-05 13:08:18 +01:00
Boaz Leskes	28078642b3	Engine.newChangesSnapshot may cause unneeded refreshes if called concurrently (#35169 ) When the engine is asked for historical operations, we check if some of the requested operations are not yet refreshed and if so we refresh before returning the operations. The refresh check is based on capturing the local checkpoint before each refresh and comparing that value to the one requested when `newChangesSnapshot` was called. If the requested range is above the captured local checkpoint we issue a refresh. This can currently cause unneeded extra refreshes if the method is called concurrently which may cause unwanted degradation in indexing performance. This is especially relevant for CCR where we always ask for a range below the global checkpoint. That range is guaranteed to be below the local checkpoint of the shard and one refresh is enough to serve multiple changes requests. This commit fixes this by introducing a dedicated mutex to make sure the test for whether a refresh is needed actually wait for concurrents for concurrent refreshes that were caused by another change refresh. Note that this is not a big change in semantics as refreshes are serialized by lucene anyway. I also opted not to keep the synchronization to the changes snapshot request only even if in theory we can apply it to all refreshes, not matter where they come from.	2018-11-04 13:43:33 +01:00
Nhat Nguyen	855ab3fa1e	Add equals/hashCode to SeqNoStats (#35223 ) This commit adds equals/hashCode to SeqNoStats so we can verify it wholly in tests.	2018-11-02 21:31:36 -04:00
Jack Conradson	44f08717ba	[Scripting] Make Max Script Length Setting Dynamic (#35184 ) This changes the current script.max_size_in_bytes to be dynamic so it can be set through the cluster settings API. This setting is also applied to inline scripts in the compile method of ScriptService to prevent excessively long inline scripts from being compiled. The script length limit is removed from Painless as this is no longer necessary with the protection in compile.	2018-11-02 16:07:54 -07:00
Tim Brooks	0166388d74	Use single netty event loop group for transports (#35181 ) Currently we create a new netty event loop group for client connections and all server profiles. Each new group creates new threads for io processing. This means 2 * num of processors new threads for each group. A single group should be able to handle all io processing (for the transports). This also brings the netty module inline with what we do for nio. Additionally, this PR renames the worker threads to be the same for netty and nio.	2018-11-02 16:31:19 -06:00
Nhat Nguyen	d6e44129b1	TEST: Only check max_seq_no_of_updates when rollback (#35170 ) Currently, we assume that rollback always happens in the test testRestoreLocalHistoryFromTranslogOnPromotion. However, if the global checkpoint equals max_seq_no, we won't rollback. This causes the max_seq_no_of_updates assertion failed because max_seq_no_of_updates won't be advanced to the global checkpoint. With this commit, we assert max_seq_no_of_updates in two different paths.	2018-11-02 12:27:48 -04:00
Nhat Nguyen	e753e12f61	Do not alloc full buffer for small change requests (#35158 ) Today we always allocate a full buffer (1024 elements) in a LuceneChangesSnapshot even though the requesting size is smaller. With this change, we will use the requesting size as the buffer size if it's smaller than the default batch size; otherwise uses the default batch size.	2018-11-02 08:49:55 -04:00
Daniel Mitterdorfer	ccbe80c3a0	Introduce durability of circuit breaking exception With this commit we differentiate between permanent circuit breaking exceptions (which require intervention from an operator and should not be automatically retried) and transient ones (which may heal themselves eventually and should be retried). Furthermore, the parent circuit breaker will categorize a circuit breaking exception as either transient or permanent based on the categorization of memory usage of its child circuit breakers. Closes #31986 Relates #34460	2018-11-02 13:12:44 +01:00
Colin Goodheart-Smithe	fc6e1f7f3f	Merge branch 'master' into index-lifecycle	2018-11-02 10:56:35 +00:00
Andy Bristol	2a60c24043	[test] mute QueryProfilerIT.testProfileMatchesRegular	2018-11-01 16:59:06 -07:00
Tal Levy	c6c01425bb	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 11:38:42 -07:00
Julie Tibshirani	8fb3290e5c	Fix a bug in function_score queries where we use the wrong boost_mode. (#35148 )	2018-11-01 11:15:26 -07:00
Tal Levy	c3cf7dd305	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 10:13:02 -07:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Seong-hyun, Oh	9ef4788c13	Make XContentBuilder in AliasActions build `is_write_index` field (#35071 ) Make XContentBuilder in AliasesActions build `is_write_index` field	2018-10-31 14:15:46 -07:00
lipsill	d181d1bab1	Remove deprecated url parameters `_source_include` and `_source_exclude` (#35097 ) Removes `_source_include` and `_source_exclude` url parameters. These parameters have been deprecated in #33475. Closes #22792	2018-10-31 17:11:59 -04:00
Armin Braun	e6f9f0666e	NETWORKING: MockTransportService Wait for Close (#35038 ) * NETWORKING: MockTransportService Wait for Close * Make `MockTransportService` wait `30s` for close listeners to run before failing the assertion * Closes #34990	2018-10-31 21:33:49 +01:00
Andy Bristol	6492eaa84d	[test] mad tests more lenient approximation	2018-10-31 11:48:58 -07:00
Nik Everett	ca620ff4ce	Loggers: Drop last deprecated logger function (#35082 ) Drop the last function from `Loggers` that just wraps Log4j2. Relates to #32174	2018-10-31 14:38:29 -04:00
Tal Levy	d5d28420b6	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-31 10:47:07 -07:00
Luca Cavanna	ef5181c678	Allow to enable pings for specific remote clusters (#34753 ) When we connect to remote clusters, there may be a few more routers/firewalls in-between compared to when we connect to nodes in the same cluster. We've experienced cases where firewalls drop connections completely and keep-alives seem not to be enough, or they are not properly configured. With this commit we allow to enable application-level pings specifically from CCS nodes to the selected remote nodes through the new setting `cluster.remote.${clusterAlias}.transport.ping_schedule`. The new setting is similar `transport.ping_schedule` but it does not affect intra-cluster communication, pings are only sent to specific remote cluster when specifically enabled, as they are disabled by default. Relates to #34405	2018-10-31 17:32:53 +01:00
Armin Braun	3fa67c5d8a	DISCOVERY: Cleanup AbstractDisruptionTestCase (#34808 ) * DISCOVERY: Cleanup AbstractDisruptionTestCase * Make the internal test cluster manage minimum master nodes where we used the default of (nodes / 2 + 1) before * Remove use of the `NodeConfigurationSource` indirection * Relates #33675	2018-10-31 07:52:37 +01:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00
Ryan Ernst	512319cef7	Test: Filter out deprecated joda tzs in tests (#34868 ) This commit filters out usage of deprecated tzs by tests. These are tested separately and should not require checking for warnings on any test using random timezones. closes #34188	2018-10-30 11:15:34 -07:00
Vladimir Dolzhenko	be75b40a29	Fix LineLength Check Suppressions: index.mapper (#35087 ) Relates #34884	2018-10-30 18:00:14 +01:00
Tal Levy	18c72e86c5	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-30 08:09:57 -07:00
Andy Bristol	b8280ea7cc	median absolute deviation agg (#34482 ) This commit adds a new single value metric aggregation that calculates the statistic called median absolute deviation, which is a measure of variability that works on more types of data than standard deviation Our calculation of MAD is approximated using t-digests. In the collect phase, we collect each value visited into a t-digest. In the reduce phase, we merge all value t-digests, then create a t-digest of deviations using the first t-digest's median and centroids	2018-10-30 07:22:52 -07:00
Andrey Ershov	97f74c5a38	Merge branch 'master' into 'zen2' Conflicts during the merge: 1. >=140 chars line length fixed for a lot of project files and warnings for those files are no longer suppressed 2. Node name is removed from AbstractComponent, it’s no longer taken from settings, but is explicitly passed as constructor argument and there were quite a few new classes on zen2 branch that require this change 3. TransportResponseHandler interface changed (new method added) and Zen2 makes a lot of subclasses in tests 4. Deprecated way of obtaining logger was changed	2018-10-30 14:39:48 +03:00
Alan Woodward	c74232037a	Remove Accountable interface from BytesReference (#34900 )	2018-10-30 10:27:31 +00:00
Przemyslaw Gomulka	995bf0ee66	Bulk Api support for global parameters (#34528 ) Bulk Request in High level rest client should be consistent with what is possible in Rest API, therefore should support global parameters. Global parameters are passed in URL in Rest API. Some parameters are mandatory - index, type - and would fail validation if not provided before before the bulk is executed. Optional parameters - routing, pipeline. The usage of these should be consistent across sync/async execution, bulk processor and BulkRequestBuilder closes #26026	2018-10-30 09:08:12 +01:00
Ryan Ernst	5dda2b0c7a	Remove remaining line length violations in o.e.cluster (#34941 ) relates #34923, #34884	2018-10-29 19:45:35 -07:00
Tal Levy	c9e4d26a53	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-29 14:03:55 -07:00
lipsill	6df1c9e818	Deprecate `_source_include` and `_source_exclude` url parameters (#33475 ) Deprecates `_source_include` and `_source_exclude` url parameters in favor of `_source_inclues` and `_source_excludes` because those are consistent with the rest of Elasticsearch's APIs. Relates to #22792	2018-10-29 12:06:38 -04:00
Nik Everett	b093116a1e	Logging: Drop another deprecated Loggers method (#34520 ) Drop a method from `Loggers` that we deprecated because it just delegated to `LogManager`.	2018-10-29 10:05:24 -04:00
Mark Tozzi	329a94be0c	Cleanup suppressed overlength line for action.support package (#34889 ) Clean up lines over 140 characters in the `org.elasticsearch.action.support.*` packages Relates to #34884	2018-10-29 09:22:20 -04:00
Igor Motov	01c62fc06b	Fix line length for bootstrap/client/discovery/gateway files (#34905 ) Removes the checkstyle suppressions for files in org.elasticsearch.bootstrap/client/discovery/gateway packages. Relates to #34884	2018-10-26 18:13:09 -04:00
Jake Landis	11fa8d3744	Enforce 140 char line lengths for packages action.bulk/delete/explain/get/index (#34885 ) part of #34884	2018-10-26 16:14:04 -05:00
Ryan Ernst	f5200e34ad	Remove line length violations for o.e.cluster (mostly) (#34923 ) This commit removes line length violations in most of the classes under org.elasticsearch.cluster.	2018-10-26 13:37:24 -07:00
Tal Levy	d8322ca069	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-26 12:46:21 -07:00
Nik Everett	9f87fdc7ab	Drop deprecationLogger from AbstractComponent (#34859 ) Drops the `deprecationLogger` from `AbstractComponent`, moving it to places where we need it. This saves us from building a bunch of `DeprecationLogger`s that we don't need. Relates to #34488	2018-10-26 15:40:16 -04:00
Nik Everett	10295b306d	Core: Drop nodeName from AbstractComponent (#34487 ) `AbstractComponent` is trouble because its name implies that everything should extend from it. It is useful, but maybe too broadly useful. The things it offers access too, the `Settings` instance for the entire server and a logger are nice to have around, but not really needed everywhere. The `Settings` instance especially adds a fair bit of ceremony to testing without any value. This removes the `nodeName` method from `AbstractComponent` so it is more clear where we actually need the node name.	2018-10-26 15:26:14 -04:00
Armin Braun	64a044240a	MINOR: Remove Deadcode in aggregtions.support (#34323 ) * Removed methods are just unused (the exceptions being isGeoPoint() and is isFloatingPoint() but those could more efficiently be replaced by enum comparisons to simplify the code) * Remove exceptions aren't thrown	2018-10-26 20:57:57 +02:00
Jack Conradson	aefe2909c4	[Style] Remove line length violations from ingest actions (#34886 )	2018-10-26 09:15:35 -07:00
Jay Modi	a0279bc069	Responses can use Writeable.Reader interface (#34655 ) In order to remove Streamable from the codebase, Response objects need to be read using the Writeable.Reader interface which this change enables. This change enables the use of Writeable.Reader by adding the `Action#getResponseReader` method. The default implementation simply uses the existing `newResponse` method and the readFrom method. As responses are migrated to the Writeable.Reader interface, Action classes can be updated to throw an UnsupportedOperationException when `newResponse` is called and override the `getResponseReader` method. Relates #34389	2018-10-26 09:21:54 -06:00
Lee Hinman	af28d1f648	Fix line length for org.elasticsearch.common.* files (#34888 ) This removes the checkstyle suppressions for things in the `common` package. Relates to #34884	2018-10-26 08:47:39 -06:00
Jim Ferenczi	1b879ea8ac	Refactor children aggregator into a generic ParentJoinAggregator (#34845 ) This commit adds a new ParentJoinAggregator that implements a join using global ordinals in a way that can be reused by the `children` and the upcoming `parent` aggregation. This new aggregator is a refactor of the existing ParentToChildrenAggregator with two main changes: * It uses a dense bit array instead of a long array when the aggregation does not have any parent. * It uses a single aggregator per bucket if it is nested under another aggregation. For the latter case we use a `MultiBucketAggregatorWrapper` in the factory in order to ensure that each instance of the aggregator handles a single bucket. This is more inlined with the strategy we use for other aggregations like `terms` aggregation for instance since the number of buckets to handle should be low (thanks to the breadth_first strategy). This change is also required for #34210 which adds the `parent` aggregation in the parent-join module. Relates #34508	2018-10-26 16:26:45 +02:00
Gordon Brown	5c2c1f44c8	[Style] Fix line lengths in action.admin.indices (#34890 ) Clean up lines over 140 characters in the the `org.elasticsearch.action.admin.indices` packages	2018-10-26 08:01:38 -06:00
Armin Braun	db12005674	Fix LineLength Check Suppressions: index.fielddata (#34891 ) * Fix linelength suppressions in index.fielddata * Some lines that were too long were dead code => Removed them and all code that became dead because of it * Relates #34884	2018-10-26 12:56:19 +02:00
David Turner	33345d96ef	Delete flaky SettingsBasedHostProviderIT test (#34813 ) testClusterFormsByScanningPorts is flaky: sometimes in CI it's not possible to bind to any of the ports we need to in order for the port scanning to work. This change removes this test, and #34809 describes a better way to test this behaviour.	2018-10-26 07:52:31 +01:00
Tal Levy	e1fdd00420	Lowercase static final DeprecationLogger instance names (#34887 ) After discussing on the team's FixItFriday, we concluded that static final instance variables that are mutable should be lowercased. Historically, DeprecationLogger was uppercased more frequently than lowercased.	2018-10-25 21:12:19 -07:00
Tal Levy	810cd46a30	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-25 14:35:33 -07:00
Tim Brooks	cf9aff954e	Reduce channels in AbstractSimpleTransportTestCase (#34863 ) This is related to #30876. The AbstractSimpleTransportTestCase initiates many tcp connections. There are normally over 1,000 connections in TIME_WAIT at the end of the test. This is because every test opens at least two different transports that connect to each other with 13 channel connection profiles. This commit modifies the default connection profile used by this test to 6. One connection for each type, except for REG which gets 2 connections.	2018-10-25 13:37:49 -06:00
Lee Hinman	3e7042832a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-25 11:00:36 -06:00
Christophe Bismuth	70871b5af7	Check self references in metric agg after last doc collection (#33593 ) (#34001 ) * Check self references in metric agg after last doc collection (#33593) * Revert 0aff5a30c5dbad9f476be14f34b81e2d1991bb0f (#33593) * Check self refs in metric agg only once in post collection hook (#33593) * Remove unnecessary mocking (#33593)	2018-10-25 17:12:50 +01:00
Tanguy Leroux	3225b2dcd3	Add 6.6.0 version to master (#34847 ) This commit adds the 6.6.0 version constant to the master branch, and adapts the VersionTests.	2018-10-25 17:30:25 +02:00
lipsill	2b652f3242	Logging: server: clean up logging (#34593 ) Replace internal deprecated calls to `Loggers.getLogger(Class)` with direct calls to log4j `LogManager.getLogger(Class)`	2018-10-25 09:52:50 -04:00
lipsill	185c06bb7f	Logging: tests: clean up logging (#34606 ) Replace internal deprecated calls to `Loggers.getLogger(Class)` with direct calls to log4j `LogManager.getLogger(Class)`	2018-10-25 09:52:41 -04:00
Ryan Ernst	687dc1eb11	Scripting: Remove SearchScript (#34730 ) This commit removes the last non context based script class.	2018-10-24 15:03:38 -07:00
Andrey Atapin	5f588180f9	Improve IndexNotFoundException's default error message (#34649 ) This commit adds the index name to the error message when an index is not found.	2018-10-24 12:53:31 -07:00
Stéphane Campinas	04f3e67c77	Remove redundant method from RestClearScrollAction (#34268 ) The check for null argument is already done in `splitStringByCommaToArray`, hence it can be removed, which allows us to remove the whole splitScrollIds private method.	2018-10-24 21:31:29 +02:00
Mayya Sharipova	bf4d90a5dc	HLRC API for _termvectors (#33447 ) * HLRC API for _termvectors relates to #27205	2018-10-24 14:27:22 -04:00
Alpar Torok	795d57b4f9	Auto configure all test tasks (#34666 ) With this change, we apply the common test config automatically to all newly created tasks instead of opting in specifically. For plugin authors using the plugin externally this means that the configuration will be applied to their RandomizedTestingTasks as well. The purpose of the task is to simplify setup and make it easier to change projects that use the `test` task but actually run integration tests to use a task called `integTest` for clarity, but also because we may want to configure and run them differently. E.x. using different levels of concurrency.	2018-10-24 16:05:50 +03:00
Andrey Ershov	7a3cd10718	[Zen2] Change MetaDataStateFormat write semantics (#34709 ) Currently, if MetaDataStateFormat.write throws an IOExceptions if there was some problem with persisting state to disk. If an exception is thrown, loadLatestState may read either old state or new state. This is not enough for the Zen2 algorithm. In case of failure, we need to distinguish between 2 cases: storage is left in clean state or storage is left in a dirty state. If storage is left in the clean state, loadLatestState may read only old state. If storage is left in a dirty state, loadLatestState may read either old or new state. If an exception occurs when writing the manifest file to disk this distinction is important for Zen2. If storage is clean, the node can continue to be a part of the cluster and may try to accept further cluster state updates (if it fails to accept cluster state updates it will be kicked off from the cluster using different mechanism). But if storage is dirty, the node should be restarted and it will be able to start up successfully only once it successfully re-writes manifest file to disk. This commit changes MetaDataStateFormat.write signature, replacing IOException with WriteStateException, which “isDirty” method could be used to distinguish between 2 failure cases. We need to minimise the number of failures, that leave storage in a dirty state. That’s why this PR changes the algorithm that is used to store state to disk. It has the following layout: 1. For the first state location, create and fsync tmp file with state content. 2. For each extra location, copy and fsync tmp file with state content. 2. Atomically rename tmp file in the first location. 3. For each extra location, atomically rename tmp file. 4. For each location, fsync state directory. 5. Perform cleanup of old files, ignoring exceptions. If an exception occurs in steps 1-3, storage is clearly in the clean state. If an exception occurs in step 5, storage is clearly in dirty state. Exception in step 4 is questionable, there are 2 options: 1. Consider it as a failure. If the first disk fails, state disappears. So this is a failure and storage is in a dirty state. 2. Do not consider it as failure at all, ignore disk failures. This commit prefers 1st approach and MetaDataTestFormatTests.testFailRandomlyAndReadAnyState tests for disk failures.	2018-10-24 13:45:12 +03:00
Ryan Ernst	8da1c9626a	Scripting: Add back params._source access in scripted metric aggs (#34777 ) Access to special variables _source and _fields were accidentally removed in recent refactorings. This commit adds them back, along with a test. closes #33884	2018-10-23 18:07:53 -07:00
Gordon Brown	da20dfd81c	Add cluster-wide shard limit warnings (#34021 ) In a future major version, we will be introducing a soft limit on the number of shards in a cluster based on the number of nodes in the cluster. This limit will be configurable, and checked on operations which create or open shards and issue a warning if the operation would take the cluster over the limit. There is an option to enable strict enforcement of the limit, which turns the warnings into errors. In a future release, the option will be removed and strict enforcement will be the default (and only) behavior.	2018-10-23 16:35:10 -06:00
Julie Tibshirani	c5a0739381	Mute SettingsBasedHostProviderIT to avoid future test flakes.	2018-10-23 15:26:39 -07:00
Zachary Tong	299d044bfc	Collapse pipeline aggs into single package (#34658 ) - Restrict visibility of Aggregators and Factories - Move PipelineAggregatorBuilders up a level so it is consistent with AggregatorBuilders - Checkstyle line length fixes for a few classes - Minor odds/ends (swapping to method references, formatting, etc)	2018-10-23 16:01:01 -04:00
Tal Levy	62ac2fa5ec	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-23 09:43:46 -07:00
Jake Landis	89dc07bdd9	ingest: better support for conditionals with simulate?verbose (#34155 ) This commit introduces two corrections to the way simulate?verbose handles conditionals on processors. 1) Prior to this change when executing simulate?verbose for processors with conditionals that evaluate to false, that processor would still be displayed in the result set. What was displayed was correct, such that no changes to the document occurred. However, if the conditional evaluates to false, the processor should not even be displayed. 2) Prior to this change when executing simulate?verbose for pipeline processors with conditionals, the individual steps would no longer be displayed. Commit `e37e5df` addressed the issue, but failed account for a conditional on the pipeline processor. Since a pipeline processor can introduce cycles and is effectively a single processor that encapsulates multiple other processors that are potentially guarded by a single conditional, special handling is needed to for pipeline and conditional pipeline processors.	2018-10-23 11:33:48 -05:00
Zachary Tong	4dbf498721	[Rollup] Job deletion should be invoked on the allocated task (#34574 ) We should delete a job by directly talking to the allocated task and telling it to shutdown. Today we shut down a job via the persistent task framework. This is not ideal because, while the job has been removed from the persistent task CS, the allocated task continues to live until it gets the shutdown message. This means a user can delete a job, immediately delete the rollup index, and then see new documents appear in the just-deleted index. This happens because the indexer in the allocated task is still running and indexes a few more documents before getting the shutdown command. In this PR, the transport action is changed to a TransportTasksAction, and we invoke onCancelled() directly on the matching job. The race condition still exists after this PR (albeit less likely), but this was a precursor to fixing the issue and a self-contained chunk of code. A second PR will followup to fix the race itself.	2018-10-23 12:23:22 -04:00
Albert Zaharovits	11881e7b50	Empty GetAliases authorization fix (#34444 ) This fixes a bug about aliases authorization. That is, a user might see aliases which he is not authorized to see. This manifests when the user is not authorized to see any aliases and the `GetAlias` request is empty which normally is a marking that all aliases are requested. In this case, no aliases should be returned, but due to this bug, all aliases will have been returned.	2018-10-23 18:50:20 +03:00
Christoph Büscher	583f2852f0	[Test] Remove dead code from ExceptionSerializationTests (#34713 ) The `ignore` set contains entries of type Class<?>, but the check is performed on Path objects. This always returns false so is useless currently. Looking at the first commit of this test that already shows this behaviour this never excluded anything, so it can be removed.	2018-10-23 15:44:47 +02:00
Jake Landis	ad94e79350	ingest: processor stats (#34724 ) This change introduces stats per processors. Total, time, failed, current are currently supported. All pipelines will now show all top level processors that belong to it. Failure processors are not displayed, however, the time taken to execute the failure chain is part of the stats for the top level processor. The processor name is the type of the processor, ordered as defined in the pipeline. If a tag for the processor is found, then the tag is appended to the type. Pipeline processors will have the pipeline name appended to the name of the name of the processors (before the tag if one exists). If more then one pipeline is used to process the document, then each pipeline will carry its own stats. The outer most pipeline will also include the inner most pipeline stats. Conditional processors will only included in the stats if the condition evaluates to true.	2018-10-23 07:30:52 -05:00
Igor Motov	123f784e32	Tests: Add checks to GeoDistanceQueryBuilderTests (#34273 ) Adds checks for parsed geo distance query. It is a bit hack-ish since it compares with query's toString() output, but it is better than no checks. The parsed query itself has default visibility, so we cannot access it here unless we move the test to org.apache.lucene.document package. Fixes #34043	2018-10-23 07:55:41 -04:00
Armin Braun	8e155b8430	INGEST: Rename Pipeline Processor Param. (#34733 ) * `name` is more readable/ergnomic than having `pipeline` twice	2018-10-23 13:43:26 +02:00
Alexander Reelsen	83fd93b2fd	Core: Move IndexNameExpressionResolver to java time (#34507 ) This switches from joda time to java time when resolving index names using date math. This commit also removes two non registered settings from the code, which could not be used anyway. An unused method was removed as well. Relates #27330	2018-10-23 13:26:02 +02:00
Alpar Torok	0536635c44	Upgrade forbiddenapis to 2.6 (#33809 ) * Upgrade forbiddenapis to 2.6 Closes #33759 * Switch forbiddenApis back to official plugin * Remove CLI based task * Fix forbiddenApisJava9	2018-10-23 12:06:46 +03:00
Tal Levy	67bfdb16ad	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-22 13:09:37 -07:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Lee Hinman	5dd79bf58c	Make accounting circuit breaker settings dynamic (#34372 ) * Make accounting circuit breaker settings dynamic These missed the original property making them dynamic. This fixes the issue so these can now be set at any time. Resolves #34368	2018-10-22 09:55:00 -06:00
Julie Tibshirani	fbb9ac34f9	Deprecate type exists requests. (#34663 )	2018-10-22 08:46:11 -07:00
Yannick Welsch	6d6ac74a08	Zen2: Fail fast on disconnects (#34503 ) Integrates the failure detectors with the Connection lifecycle, to fail nodes as soon as: - a leader detects one of his followers disconnecting. - a follower detects its leader disconnecting.	2018-10-22 17:20:12 +02:00
Jason Tedor	0577703183	Revert "ingest: processor stats (#34202 )" This reverts commit `6567729600`.	2018-10-21 13:16:15 -04:00
Ryan Ernst	222652dfce	Scripting: Convert script fields to use script context (#34164 ) This commit removes the use of SearchScript for script fields and adds a new FieldScript.	2018-10-20 16:33:49 -07:00
Nhat Nguyen	7ab464807d	TEST: Mute testDedupByPrimaryTerm Should be fixed by #34667	2018-10-20 18:23:02 -04:00
Jake Landis	6567729600	ingest: processor stats (#34202 ) This change introduces stats per processors. Total, time, failed, current are currently supported. All pipelines will now show all top level processors that belong to it. Failure processors are not displayed, however, the time taken to execute the failure chain is part of the stats for the top level processor. The processor name is the type of the processor, ordered as defined in the pipeline. If a tag for the processor is found, then the tag is appended to the type. Pipeline processors will have the pipeline name appended to the name of the name of the processors (before the tag if one exists). If more then one pipeline is used to process the document, then each pipeline will carry its own stats. The outer most pipeline will also include the inner most pipeline stats. Conditional processors will only included in the stats if the condition evaluates to true.	2018-10-20 16:01:01 -05:00
Nhat Nguyen	d90b6730c7	CCR: Following primary should process NoOps once (#34408 ) This is a follow-up for #34288. Relates #34412	2018-10-19 21:10:13 -04:00
Jim Ferenczi	ba87c543c0	[TEST] Fix sporadic failures in CompletionSuggestSearchIT#testTiebreak Relates #34508	2018-10-20 01:05:48 +02:00
David Turner	bfd24fc030	[Zen2] Reconfigure cluster as its membership changes (#34592 ) As master-eligible nodes join or leave the cluster we should give them votes or take them away, in order to maintain the optimal level of fault-tolerance in the system. #33924 introduced the `Reconfigurator` to calculate the optimal configuration of the cluster, and in this change we add the plumbing needed to actually perform the reconfigurations needed as the cluster grows or shrinks.	2018-10-19 19:24:54 +01:00
Nhat Nguyen	bd92a28cfc	CCR: Replicate existing ops with old term on follower (#34412 ) Since #34288, we might hit deadlock if the FollowTask has more fetchers than writers. This can happen in the following scenario: Suppose the leader has two operations [seq#0, seq#1]; the FollowTask has two fetchers and one writer. 1. The FollowTask issues two concurrent fetch requests: {from_seq_no: 0, num_ops:1} and {from_seq_no: 1, num_ops:1} to read seq#0 and seq#1 respectively. 2. The second request which fetches seq#1 completes before, and then it triggers a write request containing only seq#1. 3. The primary of a follower fails after it has replicated seq#1 to replicas. 4. Since the old primary did not respond, the FollowTask issues another write request containing seq#1 (resend the previous write request). 5. The new primary has seq#1 already; thus it won't replicate seq#1 to replicas but will wait for the global checkpoint to advance at least seq#1. The problem is that the FollowTask has only one writer and that writer is waiting for seq#0 which won't be delivered until the writer completed. This PR proposes to replicate existing operations with the old primary term (instead of the current term) on the follower. In particular, when the following primary detects that it has processed an process already, it will look up the term of an existing operation with the same seq_no in the Lucene index, then rewrite that operation with the old term before replicating it to the following replicas. This approach is wait-free but requires soft-deletes on the follower. Relates #34288	2018-10-19 13:56:00 -04:00
Igor Motov	94bde37bcf	Geo: Don't flip longitude of envelopes crossing dateline (#34535 ) When a envelope that crosses the dateline is specified as a part of geo_shape query is parsed it shouldn't have its left and right points flipped. Fixes #34418	2018-10-19 13:53:54 -04:00
Jim Ferenczi	fba5d39bbb	Fix completion suggester's score tie-break (#34508 ) The shard suggestion sort uses a different tie-break than the one that is used to merge different shards responses. The former uses the internal document identifier when scores are the same whereas the latter compares the surface form first. Because of this discrepancy some suggestion outputs are linked to the wrong documents because the merge sort reorders the shard suggestions differently. This change fixes this bug by duplicating the Lucene collector in order to be able to apply the same tiebreak strategy than the merge sort. This logic will be removed when https://issues.apache.org/jira/browse/LUCENE-8529 is fixed. Closes #34378	2018-10-19 19:46:55 +02:00
Nhat Nguyen	90ca5b1fde	Fill LocalCheckpointTracker with Lucene commit (#34474 ) Today we rely on the LocalCheckpointTracker to ensure no duplicate when enabling optimization using max_seq_no_of_updates. The problem is that the LocalCheckpointTracker is not fully reloaded when opening an engine with an out-of-order index commit. Suppose the starting commit has seq#0 and seq#2, then the current LocalCheckpointTracker would return "false" when asking if seq#2 was processed before although seq#2 in the commit. This change scans the existing sequence numbers in the starting commit, then marks these as completed in the LocalCheckpointTracker to ensure the consistent state between LocalCheckpointTracker and Lucene commit.	2018-10-19 12:38:06 -04:00
David Turner	3de266e3cf	Merge branch 'master' into zen2	2018-10-19 14:30:07 +01:00
Colin Goodheart-Smithe	84ef91529c	Merge branch 'master' into index-lifecycle	2018-10-19 13:24:04 +01:00
Christophe Bismuth	3036ab1048	Don't omit default values when updating routing exclusions (#33638 ) Exclusion setting `cluster.routing.allocation.exclude._host` default value is an empty string. When an exclusion setting is sent with a null value the o.e.c.s.Setting#innerGetRaw API return an empty string (probably to avoid a NullPointerException to be raised). The o.e.c.r.a.d.FilterAllocationDecider class is developed to omit updates of default values for exclusion setting. That's why a null exclusion setting value is translated to an empty string which is equals to the exclusion default value which is configured to be ignored. A simple fix would be to not omit default values for exclusion setting and keep the NullPointerException guard. This is the purpose of this commit. Closes #32721	2018-10-19 13:57:41 +02:00
Jim Ferenczi	7b49beb9b0	Fix threshold frequency computation in Suggesters (#34312 ) The `term` and `phrase` suggesters have different options to filter candidates based on their frequencies. The `popular` mode for instance filters candidate terms that occur in less docs than the original term. However when we compute this threshold we use the total term frequency of a term instead of the document frequency. This is not inline with the actual filtering which is always based on the document frequency. This change fixes this discrepancy and clarifies the meaning of the different frequencies in use in the suggesters. It also ensures that the threshold doesn't overflow the maximum allowed value (Integer.MAX_VALUE). Closes #34282	2018-10-19 13:33:19 +02:00
markharwood	fe623acf66	Docs - removed experimental/beta markers from adjacency matrix aggregation (#34599 )	2018-10-19 09:33:59 +01:00
Daniel Mitterdorfer	dbb6fe58fa	Remove hand-coded XContent duplicate checks With this commit we cleanup hand-coded duplicate checks in XContent parsing. They were necessary previously but since we reconfigured the underlying parser in #22073 and #22225, these checks are obsolete and were also ineffective unless an undocumented system property has been set. As we also remove this escape hatch, we can remove the additional checks as well. Closes #22253 Relates #34588	2018-10-19 10:13:13 +02:00
Alexander Reelsen	e498b7d437	Core: Parse floats in epoch millis parser (#34504 ) In order to stay BWC compatible with joda time, the epoch millis date formatter needs to parse dates with a dot like `123.45`. This adds this functionality for the epoch millis parser in the same way as for the epoch seconds parser. It also adds support for scientific notations like `1.0e3` and fixes parsing of negative values for epoch seconds and epoch millis.	2018-10-19 10:02:45 +02:00
Christoph Büscher	4f7895800e	Remove unused methods in ValueType (#34624 ) The removed methods seem unused in the rest of the project.	2018-10-19 09:50:45 +02:00
David Turner	e13ce66a3c	[Zen2] Calculate optimal cluster configuration (#33924 ) We wish to commit a cluster state update after having received a response from more than half of the master-eligible nodes in the cluster. This is optimal: requiring either more or fewer votes than half harms resilience. For instance if we have three master nodes then, we want to be able to commit a cluster state after receiving responses from any two nodes; requiring responses from all three is clearly not resilient to the failure of any node, and if we could commit an update after a response from just one node then that node would be required for every commit, which is also not resilient. However, this means we must adjust the configuration (the set of voting nodes in the cluster) whenever a master-eligible node joins or leaves. The calculation of the best configuration for the cluster is the job of the Reconfigurator, introduced here.	2018-10-18 13:19:27 +01:00
Christoph Büscher	7bcf496315	[Tests] Correct map lookup in ReplicationTrackerTests (#34565 )	2018-10-18 11:23:53 +02:00
Tal Levy	09067c8942	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-17 15:37:11 -07:00
Ryan Ernst	8734540345	Ensure map keys cannot be self referencing (#34569 ) This commit improves self reference checking to map keys, as well as adds it to ingest script processing.	2018-10-17 15:16:13 -07:00
Jason Tedor	9be87adb95	Increment settings version when upgrading index (#34566 ) When we upgrade an index, we set the settings version upgraded setting. This should be considered a settings change, and therefore we need to increment the settings version. This commit addresses that.	2018-10-17 18:00:17 -04:00
Nik Everett	b6aa42777a	Search: Wrap lucene classes at 140 columns (#34491 ) Applies our line length guidance for all classes in the server in `lucene` directories except `XMoreLikeThis`. The only long line in `XMoreLikeThis` says "remove this when we upgrade to Lucene 5. Given that we're on Lucene 8, this is a little terrifying and deserves another look.	2018-10-17 15:54:35 -04:00
Armin Braun	08d4bf6e84	TESTS: Remove Dead Code in Test Infra. (#34548 ) * None of this infrastructure is used * Some redundant throws and resulting catch code removed	2018-10-17 20:08:39 +01:00
Colin Goodheart-Smithe	90f7cec7a5	Merge branch 'master' into index-lifecycle	2018-10-17 18:22:23 +01:00
Simon Willnauer	b0e98cbce2	Pass the host name on as `server_name` if proxy mode is on (#34559 ) In remote cluster setup if we see a configured proxy we should set the seed nodes host name as the `server_name` to trigger SNI based routing even for seed nodes. Since remote cluster connections are plain TCP connections we have to set the host manually since the other side can't take it from the request URL like in the HTTP case. This also adds some more informative logging to remote cluster connection.	2018-10-17 19:11:50 +02:00
Andrey Ershov	51f38ddc0c	Switch MetaDataStateFormat to Lucene directory abstraction (#33989 ) Switch MetaDataStateFormat to Lucene directory abstraction This commit switches MetaDataStateFormat class to Lucene directory abstraction to make it easier to test MetaDataStateFormat for different IO failures. This commits also adds different IO failures tests to MetaDataStateFormatTests.	2018-10-17 18:17:17 +02:00
Andrey Ershov	93bb24e1f8	Merge branch 'master' into zen2	2018-10-17 14:37:53 +02:00
Armin Braun	3954d041a0	SCRIPTING: Move sort Context to its Own Class (#33717 ) * SCRIPTING: Move sort Context to its own Class	2018-10-17 10:02:44 +01:00
Tal Levy	fbe8dc014c	Merge branch 'master' into index-lifecycle	2018-10-16 13:58:53 -07:00
Simon Willnauer	a93aefb4a4	Assume that rollover datemath tests run on the same day. (#34527 ) in #28741 RolloverIT fails because we are cutting over to the next day while the test executes. We assume that this doesn't happen based on the assertions in the test. This adds a assumeTrue to ensure we are at least 5 min away form a date-flip. Closes #28741	2018-10-16 20:22:32 +02:00
David Turner	303575f742	Fix up merge of master	2018-10-16 15:29:47 +01:00
Armin Braun	ea576a8ca2	Disc: Move AbstractDisruptionTC to filebased D. (#34461 ) * Discovery: Move AbstractDisruptionTestCase to file-based discovery. * Relates #33675 * Simplify away ClusterDiscoveryConfiguration	2018-10-16 15:28:40 +01:00
David Turner	950ca3adda	Merge branch 'master' into zen2	2018-10-16 14:41:14 +01:00
Simon Willnauer	d43a1fac33	Lock down Engine.Searcher (#34363 ) `Engine.Searcher` is non-final today which makes it error prone in the case of wrapping the underlying reader or lucene `IndexSearcher` like we do in `IndexSearcherWrapper`. Yet, there is no subclass of it yet that would be dramatic to just drop on the floor. With the start of development of frozen indices this changed since in #34357 functionality was added to a subclass which would be dropped if a `IndexSearcherWrapper` is installed on an index. This change locks down the `Engine.Searcher` to prevent such a functionality trap.	2018-10-16 14:53:07 +02:00
Martijn van Groningen	a1ec91395c	Changed CCR internal integration tests to use a leader and follower cluster instead of a single cluster (#34344 ) The `AutoFollowTests` needs to restart the clusters between each tests, because it is using auto follow stats in assertions. Auto follow stats are only reset by stopping the elected master node. Extracted the `testGetOperationsBasedOnGlobalSequenceId()` test to its own test, because it just tests the shard changes api. * Renamed AutoFollowTests to AutoFollowIT, because it is an integration test. Renamed ShardChangesIT to IndexFollowingIT, because shard changes it the name of an internal api and isn't a good name for an integration test. * move creation of NodeConfigurationSource to a seperate method * Fixes issues after merge, moved assertSeqNos() and assertSameDocIdsOnShards() methods from ESIntegTestCase to InternalTestCluster, so that ccr tests can use these methods too.	2018-10-16 14:45:46 +02:00
Jason Tedor	05911fb499	Adjust settings version BWC version after backport This commit adjusts the settings version BWC version after backporting the change to the 6.x branch which currently is versioned as 6.5.0.	2018-10-16 06:38:38 -04:00
Jim Ferenczi	544de13d8e	Disallow negative query boost (#34486 ) This change disallows negative query boosts. Negative scores are not allowed in Lucene 8 so it is easier to just disallow negative boosts entirely. We should also deprecate negative boosts in 6x in order to ensure that users are aware when they'll upgrade to ES 7. Relates #33309	2018-10-16 11:31:53 +01:00
Jason Tedor	4b2052c683	Introduce index settings version (#34429 ) This commit introduces settings version to index metadata. This value is monotonically increasing and is updated on settings updates. This will be useful in cross-cluster replication so that we can request settings updates from the leader only when there is a settings update.	2018-10-16 06:22:20 -04:00
Daniel Mitterdorfer	92b2e1a209	Remove lenient boolean handling With this commit we remove some leftovers from #26389 which cleaned up lenient boolean handling. Relates #26389 Relates #22298 Relates #34467	2018-10-16 06:30:00 +02:00
Jason Tedor	55dee53046	Do not update number of replicas on no indices (#34481 ) Today when submitting an update settings request to update the number of replicas with a wildcard that does not match any indices and allow no indices is set to true, the request ends up being interpreted as updating the number of replicas for all indices. That is, consider the following sequence: PUT /test-index { "settings": { "index.number_of_replicas": 0 } } PUT /non-existent-*/_settings?expand_wildcards=open&allow_no_indices=true { "settings": { "index.number_of_replicas": 1 } } GET /test-index/_settings The latter will show that the number of replicas on test-index is now one. This is surprising, and should be considered a bug. The underlying problem here is treating no indices in the underlying methods used to update the routing table and the metadata as meaning all indices. This commit takes away this assumption. Tests that relied on this behavior have been changed to no longer rely on this. A test for this situation is added in UpdateNumberOfReplicasIT.	2018-10-15 19:49:58 -04:00
Nik Everett	23ece922c9	Core: Remove two methods from AbstractComponent (#34336 ) This removes another two methods from `AbstractComponent`. One isn't used at all and another is only used in a single class in watcher. I've moved the method that watcher uses into the single class that uses it.	2018-10-15 16:05:14 -04:00
Nik Everett	a6d1cc6ca9	Revert "Search: Fix spelling mistake in Javadoc (#34480 )" This reverts commit `4e1d7baed0`.	2018-10-15 15:42:11 -04:00
fonxian	4e1d7baed0	Search: Fix spelling mistake in Javadoc (#34480 ) "iff" -> "if".	2018-10-15 15:38:37 -04:00
Ryan Ernst	26f1d7fc94	Tests: Handle epoch date formatters edge cases (#34437 ) This commit handles cases testing withLocale and withZone when the zone and locale in question is the same as the special base case. This can happen sometimes since the locale and zoneids are randomized.	2018-10-15 12:18:18 -07:00
Jim Ferenczi	67577fca56	Fix handling of empty keyword in terms aggregation (#34457 ) Empty values on keyword fields are filtered by the `map` execution mode of the `terms` aggregation. This commit restores them as valid buckets. Closes #34434	2018-10-15 19:33:52 +01:00
Armin Braun	ebca27371c	SCRIPTING: Move Aggregation Script Context to its own class (#33820 ) * SCRIPTING: Move Aggregation Script Context to its own class	2018-10-15 17:28:05 +01:00
Colin Goodheart-Smithe	0b42eda0e3	Merge branch 'master' into index-lifecycle	2018-10-15 16:03:37 +01:00
David Turner	9bb620eece	Mute PartitionedRoutingIT#testShrinking on Windows	2018-10-15 13:18:00 +01:00
Ryan Ernst	72d818c304	Tests: Fix DateFormatter equals tests with locale (#34435 ) This commit removes randomization of locale for DateFormatter equals tests, instead using explicit locales. The test framework already randomizes locales, so the random choice of the second locale can sometimes be equal to the already chosen locale. Randomization also does not provide any extra protection, as the equality of DateFormatter does not implement equality of the locales itself. closes #34337	2018-10-14 23:54:49 +01:00
Yannick Welsch	5fbead00a3	Zen2: Add infrastructure for integration tests (#34365 ) Adds the infrastructure to run integration tests against Zen2.	2018-10-14 20:55:04 +01:00
David Turner	8b9fa55c93	Add storage-layer disruptions to CoordinatorTests (#34347 ) Today we assume the storage layer operates perfectly in CoordinatorTests, which means we are not testing that the system's invariants are preserved if the storage layer fails for some reason. This change injects (rare) storage-layer failures during the safety phase to cover these cases.	2018-10-13 14:24:15 +01:00
David Turner	d98199df14	Extend duration of fixLag() (#34364 ) Today, fixLag() waits for a new cluster state to be committed. However, it does not account for the fact that a term bump may occur, requiring a new election to take place after the cluster state is committed. This change fixes this.	2018-10-11 23:24:08 +01:00
David Turner	a32e303b0c	Account for election duration (#34362 ) Today we may schedule two elections very close together, which can cause the first election to fail even if there are no other nodes. This change adds a delay in between subsequent elections on the same node, effectively allowing time for each election to complete before scheduling the next one.	2018-10-11 15:31:08 +01:00
Jay Modi	6d99d7dafc	ListenableFuture should preserve ThreadContext (#34394 ) ListenableFuture may run a listener on the same thread that called the addListener method or it may execute on another thread after the future has completed. Whenever the ListenableFuture stores the listener for execution later, it should preserve the thread context which is what this change does.	2018-10-11 15:24:38 +01:00
Nhat Nguyen	33791ac27c	CCR: Following primary should process operations once (#34288 ) Today we rewrite the operations from the leader with the term of the following primary because the follower should own its history. The problem is that a newly promoted primary may re-assign its term to operations which were replicated to replicas before by the previous primary. If this happens, some operations with the same seq_no may be assigned different terms. This is not good for the future optimistic locking using a combination of seqno and term. This change ensures that the primary of a follower only processes an operation if that operation was not processed before. The skipped operations are guaranteed to be delivered to replicas via either primary-replica resync or peer-recovery. However, the primary must not acknowledge until the global checkpoint is at least the highest seqno of all skipped ops (i.e., they all have been processed on every replica). Relates #31751 Relates #31113	2018-10-10 15:39:57 -04:00
Simon Willnauer	34b935ae57	Improve `getRestHandlerWrapper` JavaDocs (#34376 ) Questions on how to work with `ActionPlugin#getRestHandlerWrapper()` come up in discuss forums all the time. This change adds an example to the javadoc how this method should/could be used.	2018-10-10 17:28:07 +01:00
David Turner	52a3a19551	Add low-level bootstrap implementation (#34345 ) Today we inject the initial configuration of the cluster (i.e. the set of voting nodes) at startup. In reality we must support injecting the initial configuration after startup too. This commit adds low-level support for doing so as safely as possible.	2018-10-08 15:56:48 +01:00
Yannick Welsch	49cbcaff4f	Allow excluding folder names when scanning for dangling indices (#34349 ) ES is scanning for dangling indices on every cluster state update. For this, it lists the subfolders of the indices directory to determine which extra index directories exist on the node where there's no corresponding index in the cluster state. These are potential targets for dangling index import. On certain machine types, and with large number of indices, this subfolder listing can be horribly slow. This means that every cluster state update will be slowed down by potentially hundreds of milliseconds. One of the reasons for this poor performance is that Files.isDirectory() is a relatively expensive call on some OS and JDK versions. There is no need though to do all these isDirectory calls for folders which we know we are going to discard anyhow in the next step of the dangling indices logic. This commit allows adding an exclusion predicate to the availableIndexFolders methods which can dramatically speed up this method when scanning for dangling indices.	2018-10-08 15:35:50 +02:00
David Turner	ac99d1d66d	Fix bugs in fixLag() (#34346 ) The hack to work around lag detection had some issues: - it always called runFor(), even if no lag was detected - it looked at the last-accepted state not the last-applied state, so missed some lag situations. This fixes these issues.	2018-10-08 11:33:25 +01:00
Nik Everett	06993e0c35	Logging: Make ESLoggerFactory package private (#34199 ) Since all calls to `ESLoggerFactory` outside of the logging package were deprecated, it seemed like it'd simplify things to migrate all of the deprecated calls and declare `ESLoggerFactory` to be package private. This does that.	2018-10-06 09:54:08 -04:00
David Turner	03da4f6c51	Gather votes from all nodes (#34335 ) Today we accept that some nodes may vote for the wrong master in an election. This is mostly fine because they do end up joining the correct master in the end, but the lack of a vote from every follower may prevent a future desirable reconfiguration from taking place. The solution is to hold another election in a yet-higher term in order to collect a complete set of votes. Elections are somewhat disruptive so we should think carefully about when this election should take place. One option is to wait as late as possible (on the grounds that it might not ever be necessary). This unfortunately makes it harder to predict how an apparently-smoothly-running cluster will react to nodes leaving and joining. Instead we prefer to perform the election as soon as possible in the leader's term, adding "votes from all followers" to the invariants that we expect to hold in a stable cluster. The start of a leader's term is already a somewhat disrupted time for the cluster, so performing another election at this point does not materially change the cluster's behaviour. This change implements the logic needed to trigger a new election in order to satisfy this extra stabilisation condition.	2018-10-06 07:22:04 +01:00
Daniel Mitterdorfer	7d826916b9	Adjust size of BigArrays in circuit breaker test With this commit we restore the previous behavior in `BigArraysTests#testMaxSizeExceededOnResize` but lower the sizes that are tested to the range between 256 bytes to 16 kB so the test does not produce a whole lot of garbage. The previous attempt to reduce the amount of garbage produced by that test was to properly size the array initially but it failed to account for object alignment which lead to test failures in some cases. While it would be possible to account for object alignment, we would need to open up BigArrays or directly use the underlying Lucene API which would require us to allocate an array upfront only to find its size (incl. object alignment). Instead we have fixed this issue by conservatively sizing the array initially (so the initial allocation will never trip the circuit breaker) and reduce garbage by reducing the circuit breaker's upper bound as described previously. Closes #33750 Relates #34325	2018-10-05 15:39:08 +02:00
Jim Ferenczi	5c7b52e930	Adapt bwc version after backport Relates #33587	2018-10-05 13:07:39 +02:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
David Turner	29d7d1d503	Minor housekeeping of tests (#34315 ) From experience with #34257, here are a few things that help with analysing logs from test runs. Also we prevent trying to stabilise a cluster with raised delay variability, because lowering the delay variability requires time to allow all the extra-varied-scheduled tasks to work their way out of the system.	2018-10-05 07:57:03 +01:00
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is a beneficial for it is going to also be working once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (also it was renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
Nik Everett	09aaed4fe4	Tasks: Document that status is not semvered (#34270 ) The `status` part of the tasks API reflects the internal status of a running task. In general, we do not make backwards breaking changes to the `status` but because it is internal we reserve the right to do so. I suspect we will very rarely excercise that right but it is important that we have it so we're not boxed into any particular implementation for a request. In some sense this is policy making by documentation change. In another it is clarification of the way we've always thought of this field. I also reflect the documentation change into the Javadoc in a few places. There I acknowledge Kibana's "special relationship" with Elasticsearch. Kibana parses `_reindex`'s `status` field and, because we're friends with those folks, we should talk to them before we make backwards breaking changes to it. We want to be friends with everyone but there is only so much time in the day and we don't want to make backwards breaking fields to `status` at all anyway. So we hope that breaking changes documentation should be enough for other folks. Relates to #34245.	2018-10-04 14:42:37 -04:00
Yannick Welsch	b32abcbd00	Zen2: Add Cluster State Applier (#34257 ) Adds the cluster state applier to Coordinator, and adds tests for cluster state acking.	2018-10-04 20:33:28 +02:00
Vladimir Dolzhenko	dcfe64e0e4	[CI] Fix bogus ScheduleWithFixedDelayTests.testRunnableRunsAtMostOnceAfterCancellation Closes #34004	2018-10-04 16:31:56 +02:00
Armin Braun	3ccfc3de58	SCRIPTING: Terms set query expression (#33856 ) * SCRIPTING: Add Expr. Compile for TermSetQuery Ctx. * Follow up to #33602 adding the ability to compile TermsSetQuery scripts with the expressions engine in the same way we support SearchScript in Expressions * Duplicated the code here for now to make the change less complex, the only difference to SearchScript is that `_score` and `_value` are not handled for TermsSetQuery * remove redundant check	2018-10-04 16:03:57 +02:00
Nik Everett	ab8a5563f2	Logging: Drop remaining Settings log ctor (#34149 ) Drops the last logging constructor that takes `Settings` because it is no longer needed. Watcher goes through a lot of effort to pass `Settings` to `Logger` constructors and dropping `Settings` from all of those calls allowed us to remove quite a bit of log-based ceremony from watcher.	2018-10-04 09:18:04 -04:00
David Turner	c6b0f08472	Add safety phase to CoordinatorTests (#34241 ) Today's CoordinatorTests have a limited amount of randomisation in how things are scheduled. However, to be fully confident in Zen2's liveness we require the system to stabilise after any permitted sequence of events. We can achieve this by running the system in a much more random fashion for a while, with much larger variation in when things are scheduled (simulating GC pressure and network disruption) and then continuing to assert that the system stabilises as we expect. When running randomly, we do not expect to make significant progress and merely verify that no safety property is violated. This change introduces the runRandomly() test method which implements this idea. It also fixes a handful of liveness bugs that this first version of runRandomly() exposed.	2018-10-04 07:40:26 +01:00
Jim Ferenczi	e8b986cc37	Fix sporadic failure in NestedObjectMapperTests Relates #34225	2018-10-04 07:40:46 +02:00
Nhat Nguyen	6dd716b0c4	Replace version with reader cache key in IndicesRequestCache (#34189 ) Today we use the version of a DirectoryReader as a component of the key of IndicesRequestCache. This usage is perfectly fine since the version is advanced every time a new change is made into IndexWriter. In other words, two DirectoryReaders with the same version should have the same content. However, this invariant is only guaranteed in the context of a single IndexWriter because the version is reset to the committed version value when IndexWriter is re-opened. Since #33473, each IndexShard may have more than one IndexWriter, and using the version of a DirectoryReader as a part of the cache key can cause IndicesRequestCache to return stale cached values. For example, in #27650, we rollback the engine (i.e., re-open IndexWriter), index new documents, refresh, then make a count request, but the search layer mistakenly returns the count of the DirectoryReader of the previous IndexWriter because the current DirectoryReader has the same version of the old DirectoryReader even their documents are different. This is possible because these two readers come from different IndexWriters. This commit replaces the the version with the reader cache key of IndexReader as a component of the cache key of IndicesRequestCache. Closes #27650 Relates #33473	2018-10-03 21:03:24 -04:00
David Turner	cbe1cf98c6	Merge branch 'master' into zen2	2018-10-03 22:12:56 +01:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Jim Ferenczi	ee21067a41	Add early termination support for min/max aggregations (#33375 ) This commit adds the support to early terminate the collection of a leaf in the min/max aggregator. If the query matches all documents the min and max value for a numeric field can be retrieved efficiently in the points reader. This change applies this optimization when possible.	2018-10-03 18:33:39 +02:00
Lee Hinman	90c55f5e36	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-03 09:11:28 -06:00
albendz	f09190c14d	Require combine and reduce scripts in scripted metrics aggregation (#33452 ) * Make text message not required in constructor for slack * Remove unnecessary comments in test file * Throw exception when reduce or combine is not provided; update tests * Update integration tests for scripted metrics to always include reduce and combine * Remove some old changes from previous branches * Rearrange script presence checks to be earlier in build * Change null check order in script builder for aggregated metrics; correct test scripts in IT * Add breaking change details to PR	2018-10-03 15:22:01 +01:00
Jim Ferenczi	41528c0813	Adapt bwc version after backport (bis) Relates #34225	2018-10-03 14:24:01 +02:00
Jim Ferenczi	1aa8e72be7	Adapt bwc version after backport Relates #34225	2018-10-03 12:24:07 +02:00
Jim Ferenczi	5a3e031831	Preserve the order of nested documents in the Lucene index (#34225 ) Today we reverse the initial order of the nested documents when we index them in order to ensure that parents documents appear after their children. This means that a query will always match nested documents in the reverse order of their offsets in the source document. Reversing all documents is not needed so this change ensures that parents documents appear after their children without modifying the initial order in each nested level. This allows to match children in the order of their appearance in the source document which is a requirement to efficiently implement #33587. Old indices created before this change will continue to reverse the order of nested documents to ensure backwark compatibility.	2018-10-03 11:55:30 +02:00
Colin Goodheart-Smithe	2d64e3db9a	Adds trace logging to IndicesRequestCache (#34180 ) * Adds trace logging to IndicesRequestCache This change adds trace level logging to `IndicesrrequestCache` witht eh primary aim of helping to identify the cause of teh failures in https://github.com/elastic/elasticsearch/issues/32827. The cache will log at trace level when a cache hit or miss occurs including the reader version and the cache key. Note that this change adds a `cacheKeyRenderer` whcih supplies a human readable String of the cache key since the actual cache key itself is a `BytesReference` containing the wire protocol serialised form of the request. Logging is also added for the case where a search timeout occurs and fr that reason the cache entry is invalidated. * Adds comment to remaind us to remove cacheKeyRenderer	2018-10-03 08:58:33 +01:00
David Turner	a9eae1d068	Merge branch 'master' into zen2	2018-10-03 08:36:34 +01:00
Gordon Brown	fb907706ec	Merge branch 'master' into index-lifecycle	2018-10-02 13:43:46 -06:00
Dimitrios Liappis	f12e0a8398	Add ES version 6.4.3 (#34239 ) Version bump	2018-10-02 21:15:58 +03:00
David Turner	a7ce4b31ed	Fix logging of cluster state update descriptions (#34182 ) In #28941 we changed the computation of cluster state task descriptions but this introduced a bug in which we only log the empty descriptions (rather than the non-empty ones). This change fixes that.	2018-10-02 19:08:19 +01:00
Christoph Büscher	5183ea3d68	Use OptionalInt instead of Optional<Integer> (#34220 ) Optionals containing boxed primitive types are prohibitively costly because they have two level of boxing. For Optional<Integer> the analogous OptionalInt can be used to avoid the boxing of the contained int value.	2018-10-02 15:58:07 +02:00
Jim Ferenczi	ead6ffce54	Fix cross fields mode of the query_string query (#34216 ) This change fixes a bug in the cross fields mode of the `query_string` query. The multi fields query builder must be reseted before parsing in order to clear the list of expanded fields coming from the previous text block. Closes #34215	2018-10-02 14:53:26 +02:00
Przemyslaw Gomulka	3f8cc89c9f	Completion types with multi-fields support (#34081 ) Mappings with completion type and multi-fields, were not able to index array or object format on completion fields. Only string format was supported. This is fixed by providing multiField parser with externalValueContext with already parsed object closes #15115	2018-10-02 14:32:56 +02:00
Alexander Reelsen	b1b0f3276b	Core: Add methods to get locale/timezone in DateFormatter (#34113 ) This adds some method into the `DateFormatter` interface, namely * `withLocale()` to change the locale of a date formatter * `getLocale()` * `getZone()` * `hashCode()` * `equals()` These methods will be needed for aggregations and mapping changes, where zones and locales can be specified in the mapping or in search/aggs parts of a search request.	2018-10-02 14:13:30 +02:00
David Turner	a127805b4a	[Zen2] Simulate scheduling delays (#34181 ) Today we schedule tasks (both immediate and future ones) exactly when requested. In fact it is more realistic to allow for a small amount of delay in the scheduling of tasks, and this helps to exercise more interleavings of actions and therefore to improve test coverage. This change adds to the DeterministicTaskQueue the ability to add a random delay to the scheduling of tasks. This change also provides more explicit timeouts for stabilisation in the CoordinatorTests. Using the randomised scheduling feature in the CoordinatorTests also found a situation in which we could become a leader, then a candidate, and then a leader again very quickly, causing a clash of the _BECOME_MASTER_ and _FINISH_ELECTION_ tasks. We change their behaviour to not consider these duplicates to be problematic.	2018-10-02 11:22:05 +01:00
Jim Ferenczi	aba4a59d0d	Handle terms query when detecting if a query can match nested docs (#34072 ) When nested objects are present in the mappings, we add a filter in queries to exclude them if there is no evidence that the query cannot match in this space. In 6x we visit the query in order to find a mandatory clause that can match root documents only. If we find one we can omit the nested documents filter. Currently only `term` and `range` queries are checked, this change adds the support for `terms` query to effectively remove the nested filter if a mandatory `terms` clause targets a non-nested field. Closes #34067	2018-10-02 09:30:23 +02:00
David Turner	2aff005a69	Clean up TransportMasterNodeAction (#34076 ) Mainly this fixes a warning by replacing the unchecked `new ActionListener` with the checked `new ActionListener<Response>`, and it also fixes the line length violations in this class.	2018-10-02 03:17:55 +01:00
Lee Hinman	2d9cb21490	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-01 14:10:09 -06:00
Christophe Bismuth	2923fb5b31	Disallow "enabled" attribute change for types in mapping update (#33933 ) This commit adds a check for "enabled" attribute change for types when a RestPutMappingAction is received. A MappingException is thrown when such a change is detected. Change are prevented in both ways: "false -> true" and "true -> false". Closes #33566	2018-10-01 20:49:08 +02:00
Vladimir Dolzhenko	2e2ae19b97	drop elasticsearch-translog for 7.0 (#33373 ) #32281 adds elasticsearch-shard to provide bwc version of elasticsearch-translog for 6.x; have to remove elasticsearch-translog for 7.0 Relates to #31389	2018-10-01 16:21:14 +02:00
Christoph Büscher	17e6932bf3	[Tests] Rename DocumentMapperMergeTests (#34121 ) Renaming to simply DocumentMapperTests to indicate this is where other unit tests should go. Also removing outdates Todo in DocumentMapperParserTests.	2018-10-01 10:29:19 +02:00
Jason Tedor	e2bd2028d8	Allow specifying shard changes batch sizes in bytes (#34168 ) This commit changes the shard changes requests from using a raw byte value to being able to be specified using bytes units (e.g., 4mb).	2018-09-30 14:22:22 -04:00
Martijn van Groningen	b1a27b2e6b	[CCR] Add unfollow API (#34132 ) The unfollow API changes a follower index into a regular index, so that it will accept write requests from clients. For the unfollow api to work the index follow needs to be stopped and the index needs to be closed. Closes #33931	2018-09-30 19:19:34 +02:00
Nhat Nguyen	ad61398879	CCR: Optimize indexing ops using seq_no on followers (#34099 ) This change introduces the indexing optimization using sequence numbers in the FollowingEngine. This optimization uses the max_seq_no_updates which is tracked on the primary of the leader and replicated to replicas and followers. Relates #33656	2018-09-28 20:42:26 -04:00
Ryan Ernst	47cbae9b26	Scripting: Remove ExecutableScript (#34154 ) This commit removes the legacy ExecutableScript, which was no longer used except in tests. All uses have previously been converted to script contexts.	2018-09-28 17:13:08 -07:00
Lee Hinman	6ea396a476	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-28 15:40:12 -06:00

... 6 7 8 9 10 ...

2248 Commits