OpenSearch

Commit Graph

Author	SHA1	Message	Date
Robert Muir	d0deb28336	Merge pull request #13550 from rmuir/this_test_is_fucked_up NewPathForShardTests is broken every which way from Sunday.	2015-09-14 04:13:55 -04:00
Boaz Leskes	15f4863ef6	Discovery: Add two phased commit to Cluster State publishing When publishing a new cluster state, the master will send it to all the node of the cluster, noting down how many master nodes responded successfully. The nodes do not yet process the new cluster state, but rather park it in memory. As soon as at least minimum master nodes have ack-ed the cluster state change, it is committed and a commit request is sent to all the node that responded so far (and will respond in the future). Once receiving the commit requests the nodes continue to process the cluster state change as they did before this change. A few notable comments: 1. For this change to have effect, min master nodes must be configured. 2. All basic cluster state validation is done in the first phase of publish and is thus now part of `ShardOperationResult` 3. A new `COMMIT_TIMEOUT` settings is introduced, dictating how long a master should wait for nodes to ack the first phase. Unlike `PUBLISH_TIMEOUT`, if waiting for a commit times out, the cluster state change will be rejected. 4. Failing to achieve a min master node of acks, will cause the master to step down as it clearly doesn't have enough active followers. 5. Previously there was a short window between the moment a master lost it's followers and it stepping down because of node fault detection failures. In this short window, the master could process any change (but fail to publish it). This PR closes this gap to 0. 6. A dedicated pending cluster states queue was added to keep pending non-comitted cluster states and manage the logic around processing committed cluster states. See #13303 for details. Closes #13062 , Closes #13303	2015-09-14 09:55:41 +02:00
Simon Willnauer	562126a332	Clenup InternalClusterInfoService This commit allows to refresh the info service in a blocking fashion which allows tests to prevent installing listeners alltogether and makes the class easier to test. Installing a listnener is always subject to concurrent modifications where the listener might be called mulitple times or with stale information which causes tests to fail.	2015-09-14 09:37:19 +02:00
Simon Willnauer	a0c10850d9	Use Supplier instead of Reflection Java 8 allows for method references which in-turn will cause compile errors if a method is not visible while reflection fails late and maybe too late. We can now register Request instances via FooRequest::new instead of passing FooRequest.class and call it's ctor via reflection.	2015-09-14 09:11:32 +02:00
Boaz Leskes	564e1eca42	merge from master	2015-09-14 09:03:45 +02:00
Tanguy Leroux	561f5b590f	Fix AwarenessAllocationIT.testAwarenessZones test This test sometimes fails because the first node is elected as master and waits 30s for incoming joins but in the meanwhile the 3 other nodes form a cluster on their side. The index will be created and its shards allocated on these 3 nodes, then the test checks for the number of shards on each node (it should be 2 or 3) but because the first node has not fully join the cluster yet one node will have 5 shards. closes #13305	2015-09-14 08:57:31 +02:00
Robert Muir	4a9b8a94c9	NewPathForShardTests is broken every which way from Sunday. 1. FileSystem wrapping code is broken, thats why you get providermismatch exception! Instead of fixing this, it SuppressesForbidden!!!! 2. Because it uses SuppressForbidden on the test, the whole thing is lenient, it uses java.io.File for example! 3. Of course it fails consistently on windows because it can't remove files, because it leaks file handles (locks) like a sieve since it does not close node environment. With correct wrapping this is always detected by e.g. our leak detection FS. Instead of fixing the leak, it assumesFalse(WINDOWS) !!!!! I do not know how this snuck past me, but I need this fixed to remove setAccessible.	2015-09-14 01:30:09 -04:00
Jason Tedor	b4adad7c0a	Code cleanup in TransportBroadcastByNodeAction	2015-09-13 21:35:24 -04:00
Boaz Leskes	a758eec78a	Recovery: failed to properly ack translog ops during wait on mapping changes During the second phase of recovery, replayed transaction log entries may need to wait on mapping changes that have not yet propagated to the target node. Currently we correctly replay the operation at a later stage, but we acknowledge the replay request before actually performing the work. Example failure: http://build-us-00.elastic.co/job/es_feature_two_phase_pub/859/ Closes #13535	2015-09-13 21:14:40 +02:00
Simon Willnauer	5098dcaf96	[TEST] Don't rely on currentTimeMillis in RoutingServiceTests If the machine is very slow this test fails if the delta of the unallocaiton timestamp and the last scheduled delay is greater than the scheduled delay time.	2015-09-13 16:58:56 +02:00
Britta Weber	5a35a8582b	[test] mute and add link to issue see #13542	2015-09-13 15:06:28 +02:00
Robert Muir	7703aab95a	support mocking these components in tests without setAccessible	2015-09-12 20:49:20 -04:00
Robert Muir	b16e1569fe	Remove all setAccessible in tests and forbid	2015-09-12 19:46:39 -04:00
Robert Muir	3a0dd59a15	Merge pull request #13537 from rmuir/die_setaccessible_in_tests_die Remove easy uses of setAccessible in tests.	2015-09-12 17:19:38 -04:00
Robert Muir	6a15518e8c	don't list this class twice (vi typo)	2015-09-12 17:07:18 -04:00
Robert Muir	174ca77ebf	Hack around aws security hole of accessing sun.security.ssl, s3 repository works on java 9 again Today this is really horrible, and we have a PR sent to fix it, but nobody does anything: https://github.com/aws/aws-sdk-java/pull/432 With java 9, we cannot even grant the permission, this kind of sheistiness is not allowed, and s3 repository is completely broken. The problem is their code is still broken, and won't handle neither SecurityException (our PR) nor the new InaccessibleObjectException they will get from java 9. We use a really hacky hack to deliver an exception that their code catches (IllegalAccessException) instead. This means s3 repository is working on java 9, and we close off access to sun.security.ssl completely	2015-09-12 14:16:16 -04:00
Robert Muir	a7418cd9bb	Remove easy uses of setAccessible in tests. Don't worry, I will fix the rest. But some of those remaining will need a lucene upgrade, we need to add a getter or two for tests to do things cleanly.	2015-09-12 11:57:30 -04:00
Robert Muir	d6f56030d8	ban setAccessible from core code. In addition to being a big security problem, setAccessible is a risk for java 9 migration. We need to clean up our code so we can ban it and eventually enforce this with security manager for third-party code, too, or we may have problems. Instead of using setAccessible, use the correct modifier (e.g. public). TODO: ban in tests TODO: ban in security manager at runtime	2015-09-12 02:11:06 -04:00
Robert Muir	3ec5cf6263	get our stats back by reflecting mxbeans correctly	2015-09-11 22:28:37 -04:00
Robert Muir	8be7fde181	Revert "Update monitor probe tests for java 9: this stuff is no longer accessible" This reverts commit `af2df9aef6`.	2015-09-11 22:21:02 -04:00
Jason Tedor	d6d8d30d47	Remove and forbid use of com.google.common.collect.ImmutableSortedMap This commit removes and now forbids all uses of com.google.common.collect.ImmutableSortedMap across the codebase. This is one of many steps in the eventual removal of Guava as a dependency. Relates #13224	2015-09-11 17:42:19 -04:00
Igor Motov	39ca45050a	Simplify the BlobContainer blob writing interface Instead of asking blob store to create output for posting blob content, this change provides that content of the blob to the blob store for writing. This will significantly simplify the interface for S3 and Azure plugins.	2015-09-11 16:39:58 -04:00
Simon Willnauer	7b0f086946	Remove and forbid use of several com.google.common.util. classes This commit replaces: * com.google.common.util.concurrent.ListenableFuture * com.google.common.util.concurrent.SettableFuture * com.google.common.util.concurrent.Futures * com.google.common.util.concurrent.MoreExecutors And forbits its usage via forbidden APIs. This is one of many steps in the eventual removal of Guava as a dependency. Relates to #13224	2015-09-11 21:15:12 +02:00
Igor Motov	3fd7d95f52	Snapshot restore request should accept indices options Fixes an issue introduced in #10744 as a result of which the restore request stopped accepting indices options such as ignore_unavailable. Fixes #13335	2015-09-11 13:43:00 -04:00
Simon Willnauer	e35293b055	Merge pull request #13518 from s1monw/cleanup_search_request Cleanup SearchRequest & SearchRequestBuilder	2015-09-11 19:41:52 +02:00
Jason Tedor	67dea6b3f6	Merge pull request #13517 from jasontedor/settings-should-not-leak-guava-dependency Stop o.e.c.s.Settings from leaking Guava dependency	2015-09-11 11:44:38 -04:00
Simon Willnauer	01a2ba1b95	Cleanup SearchRequest & SearchRequestBuilder We have a gazillion ways to specify the source of the search request. There is no need to so many we can reduce them dramatically and also remove some deprecated API.	2015-09-11 17:38:35 +02:00
Lee Hinman	016ba3531a	Merge remote-tracking branch 'dakrone/remove-disable-decider'	2015-09-11 09:28:13 -06:00
Boaz Leskes	e370b83685	fix old style plugin injection	2015-09-11 17:10:41 +02:00
Jason Tedor	f1ff6706aa	Stop o.e.c.s.Settings from leaking Guava dependency The public org.elasticsearch.common.settings.Settings#getAsMap method leaks the dependency on Guava by returning a com.google.common.collect.ImmutableMap. The leaking of this dependency should be removed in preparation for the eventual complete removal of Guava as a dependency. Relates #13224	2015-09-11 11:09:08 -04:00
Nicholas Knize	bc75fe4a45	add a weighted centroid to the geohash_grid aggregator and cut over to lucene GeoHashUtils	2015-09-11 09:57:08 -05:00
Britta Weber	73b4391a46	[test] fix urls for windows	2015-09-11 16:25:19 +02:00
Britta Weber	53bb17c83c	[test] use tmp dir otherwise the security manager will complain	2015-09-11 14:48:19 +02:00
Masaru Hasegawa	6a9ef99d0f	Take relocating shard into consideration during awareness allocation Previous fix #12551 counted twice for relocating shard (source and target). Fix it to consider only target node.	2015-09-11 21:09:31 +09:00
Simon Willnauer	6b4699fbf7	Only add unicast provider if all settings are present Closes #13492	2015-09-11 13:21:03 +02:00
Britta Weber	2c618a11de	plugins: fix print of url when it contains spaces	2015-09-11 13:13:48 +02:00
javanna	380fe52b30	Java api: remove filterBuilder element from FunctionScoreQueryBuilder The filter element has been deprecated in the function_score query parser. Whenever a filter is found it gets wrapped into a query automatically. The filter in the java api builder is always null, there is no way to set its value, just a leftover.	2015-09-11 13:08:33 +02:00
javanna	a641c7ae52	Function score query: remove deprecated support for boost_factor boost_factor was deprecated in 1.4.0.beta1. We can remove the support for it in 3.0, its replacement is `weight`. Closes #13510	2015-09-11 12:58:25 +02:00
Boaz Leskes	1b8047e51c	merge master	2015-09-11 09:56:13 +02:00
Adrien Grand	e988a9edc6	Merge pull request #13439 from rmuir/1702090 upgrade lucene to r1702265	2015-09-11 09:48:43 +02:00
Boaz Leskes	80b59e0d66	Discovery: Add a dedicate queue for incoming ClusterStates The initial implementation of two phase commit based cluster state publishing (#13062) relied on a single in memory "pending" cluster state that is only processed by ZenDiscovery once committed by the master. While this is fine on it's own, it resulted in an issue with acknowledged APIs, such as the open index API, in the extreme case where a node falls behind and receives a commit message after a new cluster state has been published. Specifically: 1) Master receives and acked-API call and publishes cluster state CS1 2) Master waits for a min-master nodes to receives CS1 and commits it. 3) All nodes that have responded to CS1 are sent a commit message, however, node N didn't respond yet 4) Master waits for publish timeout (defaults to 30s) for all nodes to process the commit. Node N fails to do so. 5) Master publishes a cluster state CS2. Node N responds to cluster state CS1's publishing but receives cluster state CS2 before the commit for CS1 arrives. 6) The commit message for cluster CS1 is processed on node N, but fails because CS2 is pending. This caused the acked API in step 1 to return (but CS2 , is not yet processed). In this case, the action indicated by CS1 is not yet executed on node N and therefore the acked API calls return pre-maturely. Note that once CS2 is processed but the change in CS1 takes effect (cluster state operations are safe to batch and we do so all the time). An example failure can be found on: http://build-us-00.elastic.co/job/es_feature_two_phase_pub/314/ This commit extracts the already existing pending cluster state queue (processNewClusterStates) from ZenDiscovery into it's own class, which serves as a temporary container for in-flight cluster states. Once committed the cluster states are transferred to ZenDiscovery as they used to before. This allows "lagging" cluster states to still be successfully committed and processed (and likely to be ignored as a newer cluster state has already been processed). As a side effect, all batching logic is now extracted from ZenDiscovery and is unit tested.	2015-09-11 09:23:41 +02:00
Masaru Hasegawa	5ae00a6129	Take initializing shards into consideration during awareness allocation It makes decision consistent. Fixes #12522	2015-09-11 13:13:36 +09:00
Ryan Ernst	f3142e0c93	Merge pull request #13497 from rjernst/test_jar_cleanup Reorganize sharing of constants for mock cluster info service	2015-09-10 21:10:29 -07:00
Jason Tedor	b5b22d4b01	Remove and forbid use of com.google.common.collect.Queues This commit removes and now forbids all uses of com.google.common.collect.Queues across the codebase. This is one of many steps in the eventual removal of Guava as a dependency. Relates #13224	2015-09-10 23:59:59 -04:00
Jason Tedor	65353b8a32	Remove and forbid use of com.google.common.collect.ImmutableSortedSet This commit removes and now forbids all uses of com.google.common.collect.ImmutableSortedSet across the codebase. This is one of many steps in the eventual removal of Guava as a dependency. Relates #13224	2015-09-10 23:33:55 -04:00
Robert Muir	af2df9aef6	Update monitor probe tests for java 9: this stuff is no longer accessible	2015-09-10 23:18:58 -04:00
Ryan Ernst	4a9f5bcf1d	Tests: Reorganize sharing of constants for mock cluster info service The MockInternalClusterInfoService depends on a constant and helper method from actual test cases. This moves the constant and helper method into the mock itself. Without this change, the classes put into the test jar are not completely useable on their own (since this mock is in the test jar, but the test cases are not).	2015-09-10 20:08:04 -07:00
Robert Muir	6ba4d132df	Disable some exception serialization asserts on java 9, likely StackTraceElement has new stuff in it and we have to think about what to do	2015-09-10 22:45:53 -04:00
Robert Muir	552d5b0f65	Add java 9 support for bootclasspath to JvmInfo (it throws UOE otherwise which is not properly handled)	2015-09-10 22:15:27 -04:00
Robert Muir	c663d9b6a5	Merge branch 'jarhell2.0' into 1702090	2015-09-10 22:07:37 -04:00

1 2 3 4 5 ...

1264 Commits