OpenSearch

Commit Graph

Author	SHA1	Message	Date
Brian Murphy	c165e640fc	Indexed Scripts/Templates : Change the default auto_expand to 0-all This commit changes the auto_expand_replicas setting for the ````.scripts```` index to 0-all from 1-all.	2014-08-28 15:31:44 +01:00
Brian Murphy	f44bb502ee	Indexed Scripts/Templates : Fix .script index template. This commit makes the default number of shards for the .scripts index to ````1````, it also forces the auto_expand replicas to ````1-all````. This change means that script index GET requests to load scripts from the index should always use the local copy of the scripts index, preventing any network traffic or calls on script GET.	2014-08-28 14:54:24 +01:00
javanna	88839ec546	[TEST] apply default settings by calling super.nodeSettings method when providing test specific methods	2014-08-28 15:35:35 +02:00
javanna	a0e9532dca	[TEST] make default settings don't override test specific settings	2014-08-28 15:35:34 +02:00
javanna	645db6867b	[TEST] apply default settings before test specific ones to external nodes in bw comp tests, otherwise the defaults win all the time	2014-08-28 15:35:34 +02:00
Britta Weber	44dbd9b0c9	test: write heap dump to log folder Per default the heap dump is written to target/JX/pidXYZ.hprof In order to keep them when a new test is is started, they should be written to log folder which is not cleared in a new test run. Heap dump location can be set with -Dtests.heapdump.path=/path/to/heapdump closes #7452	2014-08-28 14:51:10 +02:00
Lee Hinman	09816fdf57	Validate create index requests' number of primary/replica shards Fixes #7495	2014-08-28 14:20:32 +02:00
Simon Willnauer	cc37ae13bc	[CORE] Make network interface iteration order consistent Today the iteration order of the interfaces might change across JVMs this commit cleans up the NetworkUtils class and attempts to ensure consistent iteration order across JVMs.	2014-08-28 12:35:56 +02:00
Simon Willnauer	c93e6e3f67	[TEST] Fix RandomScoreFunctionTests#testConsistentHitsWithSameSeed	2014-08-28 12:31:47 +02:00
Boaz Leskes	c6090e5d9b	[Tests] add a debug logging message when starting an external node	2014-08-28 12:13:05 +02:00
Martijn van Groningen	6de18262dd	Test: Increase the ping timeout to avoid that a candidate master node makes the decision to elect itself too soon.	2014-08-28 11:49:30 +02:00
Simon Willnauer	1d960d08f7	[TEST] only expand to 1 replica in SnapshotBackwardsCompatibilityTest	2014-08-28 11:20:33 +02:00
Simon Willnauer	d062b2b0a4	[TEST] use a dedicated port range per test JVM For reliability and debug purposes each test JVM should use it's own TCP port range if executed in parallel. This also moves away from the default port range to prevent conflicts with running ES instance on the local machine.	2014-08-28 09:18:39 +02:00
londocr	1213eec834	Spelling error of aggregation	2014-08-28 08:57:12 +02:00
Ryan Ernst	eb22d9ec24	FunctionScore: Fixed RandomScoreFunction to guard against _uid field not existing. Also added a test case to check the random score works with queries on an empty index.	2014-08-27 17:01:01 -07:00
Simon Willnauer	59da079bae	[SNAPSHOT] Ensure BWC layer can read chunked blobs	2014-08-27 21:33:40 +02:00
Martijn van Groningen	94eed4ef56	Introduced FixedBitSetFilterCache that guarantees to produce a FixedBitSet and does evict based on size or time. Only when segments are merged away due to merging then entries in this cache are cleaned up. Nested and parent/child rely on the fact that type filters produce a FixedBitSet, the FixedBitSetFilterCache does this. Also if nested and parent/child is configured the type filters are eagerly loaded by default via the FixedBitSetFilterCache. Closes #7037 Closes #7031	2014-08-27 21:28:36 +02:00
Boaz Leskes	852a1103f3	[Internal] user node's cluster name as a default for an incoming cluster state who misses it ClusterState has a reference to the cluster name since version 1.1.0 (`df7474b9fc`) . However, if the state was sent from a master of an older version, this name can be set to null. This is an unexpected and can cause bugs. The bad part is that it will never correct it self until a full cluster restart where the cluster state is rebuilt using the code of the latest version. This commit changes the default to the node's cluster name. Relates to #7386 Closes #7414	2014-08-27 20:24:27 +02:00
Boaz Leskes	55e9f169c3	[Tests] change BasicBackwardsCompatibilityTest to be compatible with 1.0.3 Also increase the time we wait for an external node to join Sadly tests are not yet stable enough, testing with 1.0.3 is still disabled	2014-08-27 20:14:45 +02:00
Ryan Ernst	65afa1d93b	FunctionScore: Refactor RandomScoreFunction to be consistent, and return values in rang [0.0, 1.0] RandomScoreFunction previously relied on the order the documents were iterated in from Lucene. This caused changes in ordering, with the same seed, if documents moved to different segments. With this change, a murmur32 hash of the _uid for each document is used as the "random" value. Also, the hash is adjusted so as to only return values between 0.0 and 1.0 to enable easier manipulation to fit into users' scoring models. closes #6907, #7446	2014-08-27 08:37:25 -07:00
Alexander Reelsen	3aa72f2738	Test: Allow global test cluster to have configurable settings source This allows to reuse the global test cluster with specific configurations, which is useful in plugins.	2014-08-27 17:04:14 +02:00
Boaz Leskes	d5552a980f	[Discovery] UnicastZenPing should also ping last known discoNodes At the moment, when a node looses connection to the master (due to a partition or the master was stopped), we ping the unicast hosts in order to discover other nodes and elect a new master or get of another master than has been elected in the mean time. This can go wrong if all unicast targets are on the same side of a minority partition and therefore will never rejoin once the partition is healed. Closes #7336	2014-08-27 15:47:42 +02:00
Boaz Leskes	ff8b7409f7	[Discovery] add a debug log if a node responds to a publish request after publishing timed out.	2014-08-27 15:47:41 +02:00
Martijn van Groningen	5932371f21	[TEST] Adapt testNoMasterActions since metadata isn't cleared if there is a no master block	2014-08-27 15:47:41 +02:00
Martijn van Groningen	c8919e4bf5	[TEST] Changed action names.	2014-08-27 15:47:41 +02:00
Martijn van Groningen	702890e461	[TEST] Remove the forceful `network.mode` setting in DiscoveryWithServiceDisruptions#testMasterNodeGCs now local transport use worker threads.	2014-08-27 15:47:41 +02:00
Boaz Leskes	26d90882e5	[Transport] Introduced worker threads to prevent alien threads of entering a node. Requests are handled by the worked thread pool of the target node instead of the generic thread pool of the source node. Also this change is required in order to make GC disruption work with local transport. Previously the handling of the a request was performed on on a node that that was being GC disrupted, resulting in some actions being performed while GC was being simulated.	2014-08-27 15:47:40 +02:00
Martijn van Groningen	966a55d21c	Typo: s/Recieved/Received	2014-08-27 15:47:40 +02:00
Martijn van Groningen	47326adb67	[TEST] Make sure all shards are allocated before killing a random data node.	2014-08-27 15:47:40 +02:00
Martijn van Groningen	403ebc9e07	[Discovery] Added cluster version and master node to the nodes fault detecting ping request The cluster state version allows resolving the case where a old master node become unresponsive and later wakes up and pings all the nodes in the cluster, allowing the newly elected master to decide whether it should step down or ask the old master to rejoin.	2014-08-27 15:47:40 +02:00
Boaz Leskes	50f852ffeb	[TEST] Added LongGCDisruption and a test simulating GC on master nodes Also rename DiscoveryWithNetworkFailuresTests to DiscoveryWithServiceDisruptions which better suites what we do.	2014-08-27 15:47:40 +02:00
Martijn van Groningen	4b8456e954	[Discovery] Master fault detection and nodes fault detection should take cluster name into account. Both master fault detection and nodes fault detection request should also send the cluster name, so that on the receiving side the handling of these requests can be failed with an error. This error can be caught on the sending side and for master fault detection the node can fail the master locally and for nodes fault detection the node can be failed. Note this validation will most likely never fail in a production cluster, but in during automated tests where cluster / nodes are created and destroyed very frequently.	2014-08-27 15:47:39 +02:00
Martijn van Groningen	364374dd03	[TEST] Added test that verifies that no shard relocations happen during / after a master re-election.	2014-08-27 15:47:39 +02:00
Martijn van Groningen	130e680cfb	[Discovery] Made the handeling of the join request batch oriented. In large clusters when a new elected master is chosen, there are many join requests to handle. By batching them up the the cluster state doesn't get published for each individual join request, but many handled at the same time, which results into a single new cluster state which ends up be published. Closes #6984	2014-08-27 15:47:39 +02:00
Shay Banon	0244ddb0cd	retry logic to unwrap exception to check for illegal state it probably comes wrapped in a remote exception, which we should unwrap in order to detect it..., also, simplified a bit the retry logic	2014-08-27 15:47:39 +02:00
Boaz Leskes	cccd060a0c	[Discovery] verify we have a master after a successful join request After master election, nodes send join requests to the elected master. Master is then responsible for publishing a new cluster state which sets the master on the local node's cluster state. If something goes wrong with the cluster state publishing, this process will not successfully complete. We should check it after the join request returns and if it failed, retry pinging. Closes #6969	2014-08-27 15:47:38 +02:00
Boaz Leskes	ffcf1077d8	[Discovery] join master after first election Currently, pinging results are only used if the local node is elected master or if they detect another already active master. This has the effect that master election requires two pinging rounds - one for the elected master to take is role and another for the other nodes to detect it and join the cluster. We can be smarter and use the election of the first round on other nodes as well. Those nodes can try to join the elected master immediately. There is a catch though - the elected master node may still be processing the election and may reject the join request if not ready yet. To compensate a retry mechanism is introduced to try again (up to 3 times by default) if this happens. Closes #6943	2014-08-27 15:47:38 +02:00
Boaz Leskes	a40984887b	[Tests] Fixed some issues with SlowClusterStateProcessing Reduced expected time to heal to 0 (we interrupt and wait on stop disruption). It was also wrongly indicated in seconds. We didn't properly wait between slow cluster state tasks	2014-08-27 15:47:38 +02:00
Martijn van Groningen	c2142c0f6d	Discovery: Don't include local node to pingMasters list. We might end up electing ourselves without any form of verification.	2014-08-27 15:47:38 +02:00
Martijn van Groningen	5e38e9eb4f	Discovery: Only add local node to possibleMasterNodes if it is a master node.	2014-08-27 15:47:37 +02:00
Martijn van Groningen	67685cb026	Discovery: If not enough possible masters are found, but there are masters to ping (ping responses did include master node) then these nodes should be resolved. After the findMaster() call we try to connect to the node and if it isn't the master we start looking for a new master via pinging again. Closes #6904	2014-08-27 15:47:37 +02:00
Boaz Leskes	f029a24d53	[Store] migrate non-allocated shard deletion to use ClusterStateNonMasterUpdateTask	2014-08-27 15:47:37 +02:00
Boaz Leskes	bebaf9799c	[Tests] stability improvements added explicit cleaning of temp unicast ping results reduce gateway local.list_timeout to 10s. testVerifyApiBlocksDuringPartition: verify master node has stepped down before restoring partition	2014-08-27 15:47:30 +02:00
Boaz Leskes	ea2783787c	[Tests] Introduced ClusterDiscoveryConfiguration Closes #6890	2014-08-27 15:47:23 +02:00
Boaz Leskes	ccabb4aa20	Remove unneeded reference to DiscoveryService which potentially causes circular references	2014-08-27 15:47:23 +02:00
Boaz Leskes	7fa3d7081b	[logging] don't log an error if scheduled reroute is rejected because local node is no longer master Since it runs in a background thread after a node is added, or submits a cluster state update when a node leaves, it may be that by the time it is executed the local node is no longer master.	2014-08-27 15:47:23 +02:00
Boaz Leskes	e0543b3426	[Internal] Migrate new initial state cluster update task to a ClusterStateNonMasterUpdateTask	2014-08-27 15:47:23 +02:00
Boaz Leskes	c12d0901f6	[Tests] Increase timeout when waiting for partitions to heal the current 30s addition is tricky because we use 30s as timeout in many places...	2014-08-27 15:47:22 +02:00
Boaz Leskes	7b6e194923	[Tests] Don't log about restoring a partition if the partition is not active.	2014-08-27 15:47:22 +02:00
Boaz Leskes	522d4afe0c	[Tests] Use local gateway This is important to for proper primary allocation decisions	2014-08-27 15:47:22 +02:00

... 3 4 5 6 7 ...

9533 Commits All Branches Search

9533 Commits

All Branches