OpenSearch

Commit Graph

Author	SHA1	Message	Date
Masaru Hasegawa	3df2a086d4	Resolve index names in indices_boost This change allows specifying alias/wildcard expression in indices_boost. And added another format for specifying indices_boost. It accepts array of index name and boost pair. If an index is included in multiple aliases/wildcard expressions, the first match will be used. With new format, old format is marked as deprecated. Closes #4756	2016-12-11 21:41:49 +09:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the nodes connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connect and publish the connection.	2016-12-10 10:03:26 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Nik Everett	e9bb8d8b38	Don't allow yaml tests with `warnings` that don't skip `warnings` (#21989 ) If you write a yaml test with a `warnings` section in a `do` block that doesn't also have a corresponding `skip` section for `warnings` then client test runners that don't support `warnings` will fail. This causes the elasticsearch build to fail so we catch these errors earlier. Related to #21811	2016-12-08 13:17:31 -05:00
Ali Beyad	e6e7bab58c	Prepares allocator decision objects for use with the allocation explain API (#21691 ) This commit enhances the allocator decision result objects (namely, AllocateUnassignedDecision, MoveDecision, and RebalanceDecision) to enable them to be used directly by the cluster allocation explain API. In particular, this commit does the following: - Adds serialization and toXContent methods to the response objects, which will form the explain API responses. - Moves the calculation of the final explanation to the response object itself, removing it from the responsibility of the allocators. - Adds shard store information to the NodeAllocationResult, so that store information is available for each node, when explaining a shard allocation by the PrimaryShardAllocator or the ReplicaShardAllocator. - Removes RebalanceDecision in favor of using MoveDecision for both moving and rebalancing shards. - Removes NodeRebalanceResult in favor of using NodeAllocationResult. - Changes the notion of weight ranking to be relative to the current node, instead of an absolute weight that doesn't convey any added value to the API user and can be confusing. - Introduces a new enum AllocationDecision to convey the decision type, which enables conveying unassigned, moving, and rebalancing scenarios with more detail as opposed to just Decision.Type and AllocationStatus.	2016-12-07 17:37:51 -05:00
Adrien Grand	c746854e03	Pre-built analysis factories do not implement MultiTermAware correctly. (#21981 ) We had tests for the regular factories, but not for the pre-built ones, that ship by default without requiring users to define them in the analysis settings.	2016-12-07 10:32:25 +01:00
Boaz Leskes	4519bdfeb0	InternalTestCluster shouldn't auto heal an active disruption when a new one is set Instead people should explicitly clear the existing one so it's clear what's going on.	2016-12-06 19:58:11 +01:00
Boaz Leskes	a7050b2d56	Remove `InternalTestCluster.startNode(s)Async` (#21846 ) Since the removal of local discovery of #https://github.com/elastic/elasticsearch/pull/20960 we rely on minimum master nodes to be set in our test cluster. The settings is automatically managed by the cluster (by default) but current management doesn't work with concurrent single node async starting. On the other hand, with `MockZenPing` and the `discovery.initial_state_timeout` set to `0s` node starting and joining is very fast making async starting an unneeded complexity. Test that still need async starting could, in theory, still do so themselves via background threads. Note that this change also removes the usage of `INITIAL_STATE_TIMEOUT_SETTINGS` as the starting of nodes is done concurrently (but building them is sequential)	2016-12-06 12:06:15 +01:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Tanguy Leroux	fe95aef6a9	[TEST] Remove CompositeTestCluster and ExternalNode (#21933 ) They are not used anymore. Related #21915	2016-12-02 13:25:40 +01:00
Simon Willnauer	20177f6eee	[TEST] Add back ExternalTestCluster - downstream tests still use it	2016-12-02 10:54:27 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	6522538033	Add validation for supported index version on node join, restore, upgrade & open index (#21830 ) Today we can easily join a cluster that holds an index we don't support since we currently allow rolling upgrades from 5.x to 6.x. Along the same lines we don't check if we can support an index based on the nodes in the cluster when we open, restore or metadata-upgrade and index. This commit adds additional safety that fails cluster state validation, open, restore and /or upgrade if there is an open index with an incompatible index version created in the cluster. Realtes to #21670	2016-12-01 15:40:35 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections this commit allows to specify a connection timeout per node such that depending on the context connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	087a85a4e7	always auto manage min master node in testTwoNodeCluster	2016-12-01 12:57:44 +01:00
Boaz Leskes	9097abee04	Add before and after logging for unit tests Currently we have these logs for integration tests only. This adds the following log at the start: ``` logger.info("[{}]: before test", getTestName()); ``` and this is logged at the end, but before any clean up done in sub classes ``` logger.info("[{}]: after test", getTestName()); ```	2016-12-01 12:56:37 +01:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	34e682d3bc	Prevent testing on double values whose toString may use the scientific notation. This might break query parsers because the standard analyzer splits on punctuation.	2016-11-30 16:48:46 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Boaz Leskes	be4074e13d	improve debug logging when node waits for initial cluster state And enabled debug logging in InternalTestClusterTests so we can see it.	2016-11-29 20:38:19 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concepts is confusing and not very flexible. neither really testable on a unittest level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows to connect to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Luca Cavanna	360b74eda8	[TEST] Don't reinitialize YamlTestClient and RestClient before each single test (#21807 ) In the past we ran yaml tests against an internal cluster, which would get restarted after each test failure, hence the client objects needed to eventually be refreshed before each test. That is why we had the initClient method to re-initialize the YamlTestClient in the execution context. We ended up though re-initializing the client unconditionally, which is not needed. Also, ESRestTestCase recreates the RestClient against the external cluster before each test, which is not needed given that nothing changes in the external cluster. This commit removes the initClient method from the yaml tests execution context. The YamlTestClient can be eagerly created before the first yaml test runs and then re-used in subsequent tests. Also api calls to check for nodes versions etc. are moved out of YamlTestClient to ESClientYamlSuiteTestCase. Also the RestClient is now initialized in ESRestTestCase before the first test runs, and kept around afterwards as a static member. Basically each subclass of EsRestTestCase will have its own RestClient instance, but the client will be shared across the different tests within the same class. The yaml test suite is just a special suite, composed of 600+ tests that are loaded from files, which will share the same client instance. This change should speed tests up as well, as we don't recreate the RestClient before each single test, and we don't call _cat/nodes either before each single test.	2016-11-28 18:43:27 +01:00
Simon Willnauer	b7292a6005	Remove TcpTransport#addressSupported since TransportAddress is now final TransportAddress used to be customizable per transport but this has been removed a while ago. Therefore we can remove all usage of this method as well. Relates to #20695	2016-11-28 16:06:59 +01:00
Yannick Welsch	8390648709	Minor clean-ups in MockBigArrays (#21822 ) Removes an unused static variable and an unused instance variable.	2016-11-28 14:09:26 +01:00
Yannick Welsch	7e198f0e41	Detect nodes being blocked by GC-disrupted node (#21797 ) The disruption type LongGCDisruption simulates GCs on a node by suspending all the threads of that node. If the suspended threads are in a code section with shared JVM locks, however, it can prevent the other nodes from doing their thing. The class LongGCDisruption has a list of class names for which we know that this can occur. Whenever a test using the GC disruption type fails in mysterious ways, it becomes a long guessing game to find the offending class. This commit adds code to LongGCDisruption to automatically detect these situations, fail the test early and report the offending class and all relevant context.	2016-11-28 11:24:25 +01:00
Simon Willnauer	41e9ed13d6	[TEST] Fix AbstractBytesReferenceTestCase#testSlice to not assert on offset	2016-11-24 15:31:36 +01:00
Jason Tedor	8416b16dfd	Improve handling of unreleased versions Today when handling unreleased versions for backwards compatilibity support, we scatted version constants across the code base and add some asserts to support removing these constants when the version in question is actually released. This commit improves this situation, enabling us to just add a single unreleased version constant that can be renamed when the version is actually released. This should make maintenance of these versions simpler. Relates #21760	2016-11-23 15:49:05 -05:00
Ryan Ernst	6940b2b8c7	Remove groovy scripting language (#21607 ) * Scripting: Remove groovy scripting language Groovy was deprecated in 5.0. This change removes it, along with the legacy default language infrastructure in scripting.	2016-11-22 19:24:12 -08:00
Nik Everett	1791623700	Document `error_trace` The `error_trace` parameter turns on the `stack_trace` field in errors which returns stack traces. Removes documentation for `camelCase` because it hasn't worked in a while.... Documents the internal parameters used to render stack traces as internal only. Closes #21708	2016-11-22 19:16:07 -05:00
Simon Willnauer	a9a2753f0b	Add a HostFailureListener to notify client code if a node got disconnected (#21709 ) Today there is no way to get notified if a node is disconnected. Client code must poll the TransportClient constantly to detect that a node is not connected anymore in order to react and add new nodes or notify altering etc. For instance if a hostname gets resolved to an IP but that host is disconnected clients want to reconnect by resolving the hostname again which is a common situation in cloud environments. Closes #21424	2016-11-22 20:46:28 +01:00
Jason Tedor	9dc65037bc	Lazy resolve unicast hosts Today we eagerly resolve unicast hosts. This means that if DNS changes, we will never find the host at the new address. Moreover, a single host failng to resolve causes startup to abort. This commit introduces lazy resolution of unicast hosts. If a DNS entry changes, there is an opportunity for the host to be discovered. Note that under the Java security manager, there is a default positive cache of infinity for resolved hosts; this means that if a user does want to operate in an environment where DNS can change, they must adjust networkaddress.cache.ttl in their security policy. And if a host fails to resolve, we warn log the hostname but continue pinging other configured hosts. When doing DNS resolutions for unicast hostnames, we wait until the DNS lookups timeout. This appears to be forty-five seconds on modern JVMs, and it is not configurable. If we do these serially, the cluster can be blocked during ping for a lengthy period of time. This commit introduces doing the DNS lookups in parallel, and adds a user-configurable timeout for these lookups. Relates #21630	2016-11-22 14:17:04 -05:00
Areek Zillur	0ccf8a742d	Add support for merging custom meta data in tribe node (#21552 ) * Add support for merging custom meta data in tribe node Currently, when any underlying cluster has custom metadata (via plugin), tribe node does not store custom meta data in its cluster state. This is because the tribe node has no idea how to select the appropriate custom metadata from one or many custom metadata (corresponding to the number of underlying clusters). This change adds an interface that custom metadata implementations can extend to add support for merging mulitple custom metadata of the same type for storing in the tribe state. Relates to #20544 Supersedes #20791 * Simplify updating tribe state * Add tests for merging multiple custom metadata types in tribe node * cleanup merging custom md logic in tribe service	2016-11-21 12:03:01 -05:00
Tanguy Leroux	e7b9e65fc3	Add checkstyle rule to forbid empty javadoc comments (#20881 ) This commit adds a RegexpMultiline check to checkstyle that yells when an empty Javadoc comment is found in Java files. Related #20871	2016-11-21 12:36:44 +01:00
Adrien Grand	6581b77198	Remove store throttling. (#21573 ) Store throttling has been disabled by default since Lucene added automatic throttling of merge operations based on the indexing rate.	2016-11-17 09:33:32 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Adrien Grand	00de8e07fc	Do not cache term queries. (#21566 ) There have been reports that the query cache did not manage to speed up search requests when the query includes a large number of different sub queries since a single request may manage to exhaust the whole history (256 queries) while the query cache only starts caching queries once they appear multiple times in the history (#16031). On the other hand, increasing the size of the query cache is a bit controversial (#20116) so this pull request proposes a different approach that consists of never caching term queries, and not adding them to the history of queries either. The reasoning is that these queries should be fast anyway, regardless of caching, so taking them out of the equation should not cause any slow down. On the other hand, the fact that they are not added to the cache history anymore means that other queries have greater chances of being cached.	2016-11-16 10:02:24 +01:00
Boaz Leskes	d99d02ecc3	InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted This is to deal with potential delays in processing the fact that node was node is restarted.	2016-11-15 17:58:08 +00:00
Boaz Leskes	9171407906	remove an unneeded assert busy	2016-11-15 17:36:06 +00:00
Boaz Leskes	2c0338fa87	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 17:09:08 +00:00
Boaz Leskes	d6c2b4f7c5	Adapt InternalTestCluster to auto adjust `minimum_master_nodes` (#21458 ) #20960 removed `LocalDiscovery` and we now use `ZenDiscovery` in all our tests. To keep cluster forming fast, we are using a `MockZenPing` implementation which uses static maps to return instant results making master election fast. Currently, we don't set `minimum_master_nodes` causing the occasional split brain when starting multiple nodes concurrently and their pinging is so fast that it misses the fact that one of the node has elected it self master. To solve this, `InternalTestCluster` is modified to behave like a true cluster and manage and set `minimum_master_nodes` correctly with every change to the number of nodes. Tests that want to manage the settings themselves can opt out using a new `autoMinMasterNodes` parameter to the `ClusterScope` annotation. Having `min_master_nodes` set means the started node may need to wait for other nodes to be started as well. To combat this, we set `discovery.initial_state_timeout` to `0` and wait for the cluster to form once all node have been started. Also, because a node may wait and ping while other nodes are started, `MockZenPing` is adapted to wait rather than busy-ping.	2016-11-15 13:42:26 +00:00
Boaz Leskes	c9f49039d3	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 10:14:47 +00:00
Ryan Ernst	c7bd4f3454	Tests: Add TestZenDiscovery and replace uses of MockZenPing with it (#21488 ) This changes adds a test discovery (which internally uses the existing mock zenping by default). Having the mock the test framework selects be a discovery greatly simplifies discovery setup (no more weird callback to a Node method).	2016-11-14 21:46:10 -08:00
Ryan Ernst	d14c470b89	Remove generics from ActionRequest closes #21368	2016-11-14 15:32:01 -08:00
Jason Tedor	491a945ac8	Add socket permissions for tribe nodes Today when a node starts, we create dynamic socket permissions based on the configured HTTP ports and transport ports. If no ports are configured, we use the default port ranges. When a tribe node starts, a tribe node creates an internal node client for connecting to each remote cluster. If neither an explicit HTTP port nor transport ports were specified, the default port ranges are large enough for the tribe node and its internal node clients. If an explicit HTTP port or transport port was specified for the tribe node, then socket permissions for those ports will be created, but not for the internal node clients. Whether the internal node clients have explicit ports specified, or attempt to bind within the default range, socket permissions for these will not have been created and the internal node clients will hit a permissions issue when attempting to bind. This commit addresses this issue by also accounting for tribe nodes when creating the dynamic socket permissions. Additionally, we add our first real integration test for tribe nodes.	2016-11-14 11:58:44 -05:00
Simon Willnauer	bdc942fa72	Enable 5.x to 6.x BWC tests This commit enables real BWC testing against a 5.1 snapshot. All REST tests plus rolling upgrade test now run against a mixed version cross major version cluster.	2016-11-14 14:26:49 +01:00
Jason Tedor	c7a1b3eb50	Merge branch 'master' into feature/seq_no * master: Hack around cluster service and logging race Do not prematurely shutdown Log4j Support decimal constants with trailing [dD] in painless (#21412) In painless suggest a long constant if int won't do (#21415) Account for different paths for sysctl utilities [TEST] testRebalancePossible() may not have an assigned node id Tests: Disable merge in SearchCancellationTests Tests: clean search scroll at the end of SearchCancellationIT	2016-11-13 20:01:44 -05:00
Jason Tedor	d273419d00	Do not prematurely shutdown Log4j When a node closes, we shutdown logging as the last statement. This statement must be last lest any subsequent attempts to log will blow up by running into security permissions. Yet, in the case of a tribe node this isn't enough. The first internal tribe node to close will shutdown logging, and subsequent node closes will blow up with the aforementioned problem. This commit migrate the Log4j shutdown to occur as part of the shutdown hook that closes the node, after all nodes have closed. Consequently, we can remove a hack in the test infrastructure to prevent Log4j shutdowns when internal test nodes close and instead just register a single shutdown hook that runs when the test JVM exits. Relates #21519	2016-11-13 17:27:30 -05:00
Jason Tedor	1e7c424479	Merge branch 'master' into feature/seq_no * master: ShardActiveResponseHandler shouldn't hold to an entire cluster state Ensures cleanup of temporary index-* generational blobs during snapshotting (#21469) Remove (again) test uses of onModule (#21414) [TEST] Add assertBusy when checking for pending operation counter after tests Revert "Add trace logging when aquiring and releasing operation locks for replication requests" Allows multiple patterns to be specified for index templates (#21009) [TEST] fixes rebalance single shard check as it isn't guaranteed that a rebalance makes sense and the method only tests if rebalance is allowed Document _reindex with random_score	2016-11-11 11:25:27 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Ryan Ernst	48bfb142b9	Remove (again) test uses of onModule (#21414 ) This change was reverted after it caused random test failures. This was due to a copy/paste error in the original PR which caused the mock version of ClusterInfoService to be used whenever the mock ZenPing was used, and the real ClusterInfoService to be used when MockZenPing was not used.	2016-11-10 16:06:14 -08:00
Areek Zillur	7ed195fe93	[TEST] Add assertBusy when checking for pending operation counter after tests Currently, pending operations can complete after tests with disruption scheme completes. This commit waits for the pending operation counter to complete after the tests are run	2016-11-10 18:35:52 -05:00
Alexander Lin	0219a211d3	Allows multiple patterns to be specified for index templates (#21009 ) * Allows for an array of index template patterns to be provided to an index template, and rename the field from 'template' to 'index_pattern'. Closes #20690	2016-11-10 18:00:30 -05:00
javanna	2f32c1173b	Revert "Tests: Remove a couple test uses of onModule (#21414 )" This reverts commit `b326f0bc51`.	2016-11-09 11:32:16 +01:00
Ryan Ernst	b326f0bc51	Tests: Remove a couple test uses of onModule (#21414 ) There were still a couple test use cases and examples that were using onModule. This change cleans those cases up.	2016-11-08 13:50:13 -08:00
Nik Everett	b7531984a9	Ignore IAE when checking for version serialization This allows us to throw IllegalArgumentException from serialization code when the destination node can't support the request.	2016-11-08 11:36:12 -05:00
Yannick Welsch	cd34eed03e	Make ensureGreen and ensureYellow wait for cluster size consistency (#21344 ) We currently often use ensureGreen or ensureYellow to check whether the cluster is in a good state again after shutting down a node. With the change in #21092, however, it can happen that if the node that is stopped is the master node, another node will become master and publish a cluster state where it is master but where the node that was stopped hasn't been removed yet from the cluster state. It will only publish a second state thereafter where the old master is removed. If the ensureGreen/ensureYellow is timed just right, it will get to execute before the second cluster state update removing the old master and the condition ensureGreen / ensureYellow might not hold at that point anymore.	2016-11-08 11:07:54 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproduceability in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Nik Everett	a13a050271	Add automatic parallelization support to reindex and friends (#20767 ) Adds support for `?slices=N` to reindex which automatically parallelizes the process using parallel scrolls on `_uid`. Performance testing sees a 3x performance improvement for simple docs on decent hardware, maybe 30% performance improvement for more complex docs. Still compelling, especially because clusters should be able to get closer to the 3x than the 30% number. Closes #20624	2016-11-04 20:59:15 -04:00
Jason Tedor	f16c308efd	Assert status logger does not warn on Log4j usage Today if you start Elasticsearch with the status logger configured to the warn level, or use a transport client with the default status logger level, you will see warn messages about deprecation loggers being created with different message factories and that formatting might be broken. This happens because the deprecation logger is constructed using the message factory from its parent, an artifact leftover from the first Log4j 2 implementation that used a custom message factory. When that custom message factory was removed, this constructor invocation should have been changed to not explicitly use the message factory from the parent. This commit fixes this invocation. However, we also had some status checking to all tests to ensure that there are no warn status log messages that might indicate a configuration problem with Log4j 2. These assertions blow up badly without the fix for the deprecation logger construction, and also caught a misconfiguration in one of the logging tests. Relates #21339	2016-11-04 14:19:59 -04:00
Nik Everett	8943421494	Only log rest connection setup once per suite (#21280 ) This is a bit funky to do with junit because we need per test state but we only want to log it per suite. So we use a static flag that we test per test and reset before every suite.	2016-11-03 21:47:11 -04:00
Yannick Welsch	39f4229594	Add information about in-flight requests when checking IndexShard operation counter (#21308 ) Our test infrastructure checks after running each test that there are no more in-flight requests on the shard level. Whenever the check fails, we only know that there were in-flight requests but don't know what requests were causing this issue. This commit adds the replication tasks that are still active at that moment to the assertion error.	2016-11-03 18:36:07 +01:00
Ryan Ernst	dc6ed7b8d4	Remove pluggability of ZenPing (#21049 ) Plugins: Remove pluggability of ZenPing ZenPing is the part of zen discovery which knows how to ping nodes. There is only one alternative implementation, which is just for testing. This change removes the ability to add custom zen pings, and instead hooks in the MockZenPing for tests through an overridden method in MockNode. This also folds in the ZenPingService (which was really just a single method) into ZenDiscovery, and removes the idea of having multiple ZenPing instances. Finally, this was the last usage of the ExtensionPoint classes, so that is also removed here.	2016-11-03 08:20:20 -07:00
Boaz Leskes	be1772b70d	pending states assertion should dump states This was removed in a cleanup assuming that Hamcrest will dump the array content. Sadly it only dumps the size.	2016-11-03 09:02:29 +01:00
Christoph Büscher	b3370de715	Tests: Add warning header checks to QueryBuilder tests and QueryParseContextTests This adds checks for expected warning headers to the query builder test infrastructure. Tests that are adding deprecation warnings to the response headers need to check those, otherwise the abstract base class for the test class will complain at teardown.	2016-11-02 15:45:33 +01:00
Yannick Welsch	6930a4846c	[TEST] Check static test state after suite scoped cluster is shut down (#21256 ) Checks on static test state are run by an @After method in ESTestCase. Suite-scoped tests in ESIntegTestCase only shut down in an @AfterClass method, which executes after the @After method in ESTestCase. The suite-scoped cluster can thus still execute actions that will violate the checks in @After without those being caught. A subsequent test executing within the same JVM will fail these checks however when @After gets called for that test. This commit adds an explicit call to check the static test state after the suite-scoped cluster has been shut down.	2016-11-02 15:00:16 +01:00
Boaz Leskes	0daf483587	Change ClusterState and PendingClusterTasksResponse's toString() to their prettyPrint format (#21245 ) The current XContent output is much harder to read than the prettyPrint format. This commit folds prettyPrint into toString and removes it.	2016-11-02 13:43:39 +01:00
Simon Willnauer	cf1457ed22	Allow skip test by version OR feature (#21240 ) Today these two are considered mutual exclusive but they are not in practice. For instance a mixed version cluster might not return a given warning depending on which node we talk to but on the other hand some runners might not even support warnings at all so the test might be skipped either by version or by feature.	2016-11-02 12:24:20 +01:00
Adrien Grand	aa6cd93e0f	Require arguments for QueryShardContext creation. (#21196 ) The `IndexService#newQueryShardContext()` method creates a QueryShardContext on shard `0`, with a `null` reader and that uses `System.currentTimeMillis()` to resolve `now`. This may hide bugs, since the shard id is sometimes used for query parsing (it is used to salt random score generation in `function_score`), passing a `null` reader disables query rewriting and for some use-cases, it is simply not ok to rely on the current timestamp (eg. percolation). So this pull request removes this method and instead requires that all call sites provide these parameters explicitly.	2016-11-02 09:48:49 +01:00
Simon Willnauer	2ba4dadea0	[TEST] fix extrasFS file filtering in OldIndexUtils	2016-11-02 09:38:51 +01:00
Simon Willnauer	4db1ac931f	Fix InternalEngineTests#testUpgradeOldIndex for 5.0.0 BWC indices Relates to #21147	2016-11-02 09:38:44 +01:00
Jason Tedor	7751049c14	Add version for 5.0.0 This commit adds the version constant for 5.0.0. Relates #21244	2016-11-01 14:09:00 -04:00
Boaz Leskes	523f7ea71e	Fix a racing condition in MockTransportService#addUnresponsiveRule where a request can be delayed even if the rule was removed. Relates to #21129 Also properly reset DiscoveryWithServiceDisruptionsIT#disableBeforeIndexDeletion	2016-11-01 14:08:18 +01:00
Boaz Leskes	ef192ff2cf	ESIntegTestCase.jav: use ClusterState.prettyPrint for pending ClusterState assertions	2016-11-01 12:54:20 +01:00
Yannick Welsch	d7d5909e69	Disconnect from newly added nodes if cluster state publishing fails (#21197 ) Before publishing a cluster state the master connects to the nodes that are added in the cluster state. When publishing fails, however, it does not disconnect from these nodes, leaving NodeConnectionsService out of sync with the currently applied cluster state.	2016-10-31 15:09:43 +01:00
Simon Willnauer	9598616dfe	Fallback to '/' info call to fetch cluster version The `_cat/nodes` API might not be available in all clusters for instance if they have authorization enabled. This change falls back to the previously used method of using the '/' endpoint to fetch the nodes version, this is best effort and will emit a warning.	2016-10-28 16:22:53 +02:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Simon Willnauer	43dbf9c7b6	Use all available hosts in REST tests and allow for real master election (#21161 ) Today we only use a single node to send requests to when we run REST tests. In some cases we have more than one node (ie. in the BWC case) where we should send requests to all nodes in a round-robin fashion. This change passes all available node endpoints to the rest test. Additionally, this change adds the setting of `discovery.zen.minimum_master_nodes` to the cluster formation forcing the nodes to wait for all other nodes until the cluster is formed. This allows for a more realistic master election and allows all master eligable nodes to become master while before always the first node in the cluster became the master. This also adds logging to each test run to log the master nodes version and the minimum node version in the cluster to help debugging BWC test failures.	2016-10-28 12:18:47 +02:00
Simon Willnauer	97cc426a89	Fix bwc cluster formation in order to run BWC tests against a mixed version cluster (#21145 ) This fixes our cluster formation task to run REST tests against a mixed version cluster. Yet, due to some limitations in our test framework `indices.rollover` tests are currently disabled for the BWC case since they select the current master as the merge node which happens to be a BWC node and we can't relocate all shards to it since the primaries are on a higher version node. This will be fixed in a followup. Closes #21142 Note: This has been cherry-picked from 5.0 and fixes several rest tests as well as a BWC break in `OsStats.java`	2016-10-27 17:03:53 +02:00
Yannick Welsch	f3e578f942	Stop delaying existing requests after network delay rule is cleared (#21129 ) The network disruption type "network delay" continues delaying existing requests even after the disruption has been cleared. This commit ensures that the requests get to execute right after the delay rule is cleared.	2016-10-27 13:48:17 +02:00
Jason Tedor	9c3e4d6e22	Add correct Content-Length on HEAD requests This commit fixes responses to HEAD requests so that the value of the Content-Length is correct per the HTTP spec. Namely, the value of this header should be equal to the Content-Length if the request were not a HEAD request. This commit also fixes a memory leak on HEAD requests to the main action that arose from the bytes on a builder not being released due to them being dropped on the floor to ensure that the response to the main action did not have a body. Relates #21123	2016-10-25 23:08:19 -04:00
Igor Motov	17ad88d539	Makes search action cancelable by task management API Long running searches now can be cancelled using standard task cancellation mechanism.	2016-10-25 12:27:34 -10:00
Christoph Büscher	f6f129b21f	Consolidate code for equals/hashCode testing in central utility class Currently test that check that equals() and hashCode() are working as expected for classes implementing them are quiet similar. This change moves common assertions in this method to a common utility class. In addition, another common utility function in most of these test classes that creates copies of input object by running them through a StreamOutput and reading them back in, is moved to ESTestCase so it can be shared across all these classes. Closes #20629	2016-10-24 15:50:40 +02:00
Simon Willnauer	0a410d3916	Pass executor name to request interceptor to support async intercept calls (#21089 ) Today the request interceptor can't support async calls since the response of the async call would execute on a different thread ie. a client or listener thread. This means in-turn that the intercepted handler is not executed with the thread it was supposed to run and therefor can, if it's executing blocking operations, potentially deadlock an entire server.	2016-10-24 13:57:07 +02:00
Ryan Ernst	53cff0f00f	Move all zen discovery classes into o.e.discovery.zen (#21032 ) * Move all zen discovery classes into o.e.discovery.zen This collapses sub packages of zen into zen. These all had just a couple classes each, and there is really no reason to have the subpackages. * fix checkstyle	2016-10-20 00:44:48 -07:00
javanna	c92b550df2	[TEST] Remove create special case in yaml test client Now that the create api has its own spec, we can remove the special case in the yaml test client for it Relates to #20924	2016-10-20 08:48:15 +02:00
Boaz Leskes	c3987156ab	Remove local discovery in favor of a simpler `MockZenPings` (#20960 ) `LocalDiscovery` is a discovery implementation that uses static in memory maps to keep track of current live nodes. This is used extensively in our tests in order to speed up cluster formation (i.e., shortcut the 3 second ping period used by `ZenDiscovery` by default). This is sad as that mean that most of the test run using a different discovery semantics than what is used in production. Instead of replacing the entire discovery logic, we can use a similar approach to only shortcut the pinging components.	2016-10-18 21:12:15 +02:00
Boaz Leskes	eaa105951f	Simplify GlobalCheckpointService and properly hook it for cluster state updates (#20720 ) During a recent merge from master, we lost the bridge from IndicesClusterStateService to the GlobalCheckpointService of primary shards, notifying them of changes to the current set of active/initializing shards. This commits add the bridge back (with unit tests). It also simplifies the GlobalCheckpoint tracking to use a simpler model (which makes use the fact that the global check point sync is done periodically). The old integration CheckpointIT test is moved to IndexLevelReplicationTests. I also added similar assertions to RelocationsIT, which surfaced a bug in the primary relocation logic and how it plays with global checkpoint updates. The test is currently await-fixed and will be fixed in a follow up issue.	2016-10-17 16:33:03 +02:00
Tanguy Leroux	1755cc08f3	REST API parser should fail on duplicate params/paths/methods/parts (#20940 ) This commit changes the current REST API parser to make it fail and throw an exception when a REST specification file contains a duplicated parameters, or path, or method, or path part.	2016-10-17 09:19:07 +02:00
Simon Willnauer	5137f44bd6	[TEST] return empty array if AbstractQueryTestCase#currentTypes is null This is important to allow any test to use RandomQueryBuilder#createQuery() since some of the query builders that are used in this test test the length of the types array and otherwise will thow NPE if the test is not a subclass of AbstractQueryTestCase.	2016-10-15 14:46:54 +02:00
Boaz Leskes	bc8ad8de5a	MockBigArrays should tell you who originally released them	2016-10-12 13:03:40 +02:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many as empty javadocs comments my regexp has found	2016-10-12 10:27:09 +02:00
Adrien Grand	1914df7b5f	Do not cache script queries. (#20799 ) The cache relies on the equals() method so we just need to make sure script queries can never be equals, even to themselves in the case that a weight is used to produce a Scorer on the same segment multiple times. Closes #20763	2016-10-11 09:17:21 +02:00
Simon Willnauer	4fd1276542	Prevent AbstractArrays from release bytes more than once (#20819 ) Today we throw an assertion error if we release an AbstractArray more than once. Yet, it's recommended to implement close methods such that they can be invoked more than once. Guaranteed single release calls are hard to implement and some situations might not be tested causing for instance `CircuitBreaker` to operate on corrupted memory stats.	2016-10-10 17:30:37 +02:00
javanna	e154e6a758	[TEST] reformatted comment in query tests	2016-10-10 10:53:17 +02:00
Nik Everett	cf4038b668	DeGuice some of IndicesModule UpdateHelper, MetaDataIndexUpgradeService, and some recovery stuff. Move ClusterSettings to nullable ctor parameter of TransportService so it isn't forgotten.	2016-10-07 11:14:38 -04:00
Simon Willnauer	7452028e50	Simplify TransportAddress (#20798 ) since TransportAddress is now final we can simplify it's interface a bit and remove methods that are only used in tests or are plain delegates.	2016-10-07 15:56:54 +02:00
Simon Willnauer	194a6b1df0	Remove LocalTransport in favor of MockTcpTransport (#20695 ) This change proposes the removal of all non-tcp transport implementations. The mock transport can be used by default to run tests instead of local transport that has roughly the same performance compared to TCP or at least not noticeably slower. This is a master only change, deprecation notice in 5.x will be committed as a separate change.	2016-10-07 11:27:47 +02:00
Simon Willnauer	9c9afe3f01	Remove SearchContext#current and all it's threadlocals (#20778 ) Today SearchContext expose the current context as a thread local which makes any kind of sane interface design very very hard. This PR removes the thread local entirely and instead passes the relevant context anywhere needed. This simplifies state management dramatically and will allow for a much leaner SearchContext interface down the road.	2016-10-06 19:51:54 +02:00
Colin Goodheart-Smithe	40f8f281e0	Merge branch 'master' into dont_cache_scripts	2016-10-06 09:09:23 +01:00
Colin Goodheart-Smithe	ce6f6d3835	Review comments	2016-10-06 08:55:31 +01:00
Simon Willnauer	134b1f9b4d	Prevent thread suspension when inside SecurityManager (#20770 ) LongGCDisruption suspends and resumes node threads but respects several `unsafe` class name patterns where it's unsafe to suspend. For instance log4j uses a global lock so we can't suspend a thread that is currently calling into log4j. The same is true for the security manager, it's similar to log4j a shared resource between the test and the node that is _suspended_. This change adds `java.lang.SecrityManager` to the unsafe patterns. This prevents test framework deadlocking if a nodes thread is supended while it's calling into the security manager that uses synchronized maps etc.	2016-10-05 21:40:27 +02:00
Simon Willnauer	e556c289b9	use a private rewrite context to prevent exposing isCachable	2016-10-05 11:41:49 +02:00
Simon Willnauer	7ba22bb75b	fix random score function builder to deal with empty seeds	2016-10-05 10:45:24 +02:00
Simon Willnauer	587bdcef38	add extra safety when accessing scripts or now and reqeusts are cached	2016-10-05 09:41:48 +02:00
Simon Willnauer	94b7873b49	Add a #markAsNotCachable() method to context to mark requests as not cachable	2016-10-04 18:05:00 +02:00
Simon Willnauer	56f35baf47	Add date-math support to `_rollover` (#20709 ) today it's not possible to use date-math efficiently with the `_rollover` API. This change adds support for date-math in the target index as well as support for preserving the math logic when an existing index that was created with a date math expression all subsequent indices are created with the same expression.	2016-10-03 16:52:33 +02:00
Boaz Leskes	27eab74510	merge from master	2016-09-30 17:19:30 +02:00
Jason Tedor	3a4ffd7b86	Fix failing logging listener tests The logging listener tests started failing after `953a8a959b` when the tests are run with tests.es.logger.level set to any level other than debug. This is because these tests were based around the assumption that the default logging level was info, which was the case before that commit fixed setting the default logging level via that system property. This commit fixes these failing tests by adjusting this assumption to account for the fact that the default logging level could be different.	2016-09-30 08:09:35 +02:00
Boaz Leskes	a16d644c68	allow settings logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 03:04:43 +02:00
Jason Tedor	0808611184	Fix failing tests after merge This commit fixes failing tests in feature/seq_no after merging master in.	2016-09-29 03:04:37 +02:00
Boaz Leskes	953a8a959b	allow settings logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 01:33:13 +02:00
Jason Tedor	25fd9e26c4	Merge branch 'master' into feature/seq_no * master: (1199 commits) [DOCS] Remove non-valid link to mapping migration document Revert "Default `include_in_all` for numeric-like types to false" test: add a test with ipv6 address docs: clearify that both ip4 and ip6 addresses are supported Include complex settings in settings requests Add production warning for pre-release builds Clean up confusing error message on unhandled endpoint [TEST] Increase logging level in testDelayShards() change health from string to enum (#20661) Provide error message when plugin id is missing Document that sliced scroll works for reindex Make reindex-from-remote ignore unknown fields Remove NoopGatewayAllocator in favor of a more realistic mock (#20637) Remove Marvel character reference from guide Fix documentation for setting Java I/O temp dir Update client benchmarks to log4j2 Changes the API of GatewayAllocator#applyStartedShards and (#20642) Removes FailedRerouteAllocation and StartedRerouteAllocation IndexRoutingTable.initializeEmpty shouldn't override supplied primary RecoverySource (#20638) Smoke tester: Adjust to latest changes (#20611) ...	2016-09-29 00:22:31 +02:00
Jason Tedor	3c8ff45917	Add production warning for pre-release builds This commit adds a usage warning when Elasticsearch is started with a pre-release build. Relates #20674	2016-09-27 20:13:12 -04:00
Boaz Leskes	ee76c1a5c9	Remove NoopGatewayAllocator in favor of a more realistic mock (#20637 ) Many of our unit tests instantiate an `AllocationService`, which requires having a `GatewayAllocator`. Today almost all of our test use a class called `NoopGatewayAllocator` which does nothing, effectively leaving all shard assignments to the balanced allocator. This is sad as it means we test a system that behaves differently than our production logic in very basic things. For example, a started primary that is lost will be assigned to a node that didn't use to have it. This PR removes `NoopGatewayAllocator` in favor of a new `TestGatewayAllocator` that inherits the standard `GatewayAllocator` and overrides shard information fetching to return information based on historical assignments the allocator has done. The only exception is `BalanceConfigurationTests` which does test only the balancer and I opted to not have it work around the `GatewayAllocator` being in it's way.	2016-09-25 20:15:30 +02:00
Ali Beyad	ac1b13dde7	Changes the API of GatewayAllocator#applyStartedShards and (#20642 ) Changes the API of GatewayAllocator#applyStartedShards and GatewayAllocator#applyFailedShards to take both a RoutingAllocation and a list of shards to apply. This allows better mock allocators to be created as being done in #20637. Closes #20642	2016-09-23 09:31:46 -04:00
Ali Beyad	029fc909b5	Removes FailedRerouteAllocation and StartedRerouteAllocation Removes the FailedRerouteAllocation class and StartedRerouteAllocation class, as they were just wrappers for RerouteAllocation that stored started and failed shards, but these started and failed shards can be passed in directly to the methods that needed them, removing the need for this wrapper class and extra level of indirection. Closes #20626	2016-09-23 09:02:36 -04:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitive. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Yannick Welsch	191fadafcc	Fix logger usage checker for Log4j 2 With the switch to Log4j 2 throughout our code base, the logger usage checker was temporarily disabled. This commit adapts the checks to work with Log4j 2 and re-enables the Gradle checks. Closes #20243	2016-09-21 14:44:14 +02:00
Simon Willnauer	0151974500	`_flush` should block by default (#20597 ) This commit changes the default behavior of `_flush` to block if other flushes are ongoing. This also removes the use of `FlushNotAllowedException` and instead simply return immediately by skipping the flush. Users should be aware if they set this option that the flush might or might not flush everything to disk ie. no transactional behavior of some sort. Closes #20569	2016-09-21 14:20:24 +02:00
Tanguy Leroux	7645abaad9	Remove duplicate methods in ByteSizeValue (#20560 ) This commit removes `ByteSizeValue`'s methods that are duplicated (ex: `mbFrac()` and `getMbFrac()`) in order to only keep the `getN` form. It also renames `mb()` -> `getMb()`, `kb()` -> `getKB()` in order to be more coherent with the `ByteSizeUnit` method names.	2016-09-20 14:07:23 +02:00
Ali Beyad	50584c4103	Merge pull request #20532 from rjernst/rolling_upgrades This PR introduces backward compatibility index tests to test the rolling upgrade process amongst Elasticsearch instances within the same major version. The test executes in three phases. In the first phase, we form a cluster of 2 ES instances on an old version. In the second phase, we keep one of the nodes from the old cluster, kill the other node, but preserve its data directory and start an instance of the current version of ES using the same data directory as the killed instance. In the third phase, we kill the other old version ES instance from the first phase and launch a new instance, using the same data directory as the killed instance. Therefore, during phase 3, we have fully migrated and have all current versions of ES running. In each phase, we run REST tests that index documents and search them, ensuring at each stage that the documents from the previous phase are still there. Note that because we haven't released a GA yet of 5.0, the tests currently don't start an old version cluster in the first phase. Once GA is released, this will be changed to make the backward compatibility version 5.0, while the current version in the cluster will be 5.x.	2016-09-19 16:14:38 -04:00
Simon Willnauer	ee8d14798f	Unguice Transport and friends (#20526 ) This change removes all guice interaction from Transport, HttpServerTransport, HttpServer and TransportService. All these classes as well as their subclasses or extended version configured via plugins are now created by using plain old bloody java constructors. YAY!	2016-09-19 22:10:47 +02:00
Boaz Leskes	2ee9ab25d9	Remove `RoutingAllocation.Result` (#20538 ) Currently all the reroute-like methods of `AllocationService` return a result object of type `RoutingAllocation.Result`. The result object contains the new `RoutingTable` and `MetaData` plus an indication whether those were changed. The caller is then responsible of updating a cluster state with these. These means that things can easily go wrong and one can take one of these but not the other causing inconsistencies. We already have a utility method on the `ClusterState` builder that does but no one forces you to do so. Also 99% of the callers do the same thing: i.e., check if the result was changed and if so update the very same cluster state that was passed to `AllocationService`. This PR folds this pattern into `AllocationService` and changes almost all it's methods to return a new cluster state (potentially the original one). This saves some 500 lines of code. The one exception here is the reroute API which executes allocation commands and potentially returns an explanation as well (next to the routing table and metadata). That API now returns a `CommandsResult` object which encapsulate a cluster state and the explanation.	2016-09-19 13:54:35 +02:00
Ali Beyad	98230d035a	Adds a preserveIndicesUponCompletion method to ESRestTestCase that can be overridden by subclasses if the test must not delete indices it created after exiting.	2016-09-16 19:21:26 -04:00
Ali Beyad	ce86ed1fdd	Merge remote-tracking branch 'upstream/master' into rolling_upgrades	2016-09-16 10:43:38 -04:00
Simon Willnauer	f5daa165f1	Remove ability to plug-in TransportService (#20505 ) TransportService is such a central part of the core server, replacing it's implementation is risky and can cause serious issues. This change removes the ability to plug in TransportService but allows registering a TransportInterceptor that enables plugins to intercept requests on both the sender and the receiver ends. This is a commonly used and overwritten functionality but encapsulates the custom code in a contained manner.	2016-09-16 09:47:53 +02:00
Boaz Leskes	577dcb3237	Add current cluster state version to zen pings and use them in master election (#20384 ) During a networking partition, cluster states updates (like mapping changes or shard assignments) are committed if a majority of the masters node received the update correctly. This means that the current master has access to enough nodes in the cluster to continue to operate correctly. When the network partition heals, the isolated nodes catch up with the current state and get the changes they couldn't receive before. However, if a second partition happens while the cluster is still recovering from the previous one and the old master is put in the minority side, it may be that a new master is elected which did not yet catch up. If that happens, cluster state updates can be lost. This commit fixed 95% of this rare problem by adding the current cluster state version to `PingResponse` and use them when deciding which master to join (and thus casting the node's vote). Note: this doesn't fully mitigate the problem as a cluster state update which is issued concurrently with a network partition can be lost if the partition prevents the commit message (part of the two phased commit of cluster state updates) from reaching any single node in the majority side and the partition does allow for the master to acknowledge the change. We are working on a more comprehensive fix but that requires considerate work and is targeted at 6.0.	2016-09-15 23:39:11 +02:00
Nik Everett	d0be96df7b	Clean up snapshots after each REST test The only repository we can be sure is safe to clean is `fs` so we clean any snapshots in those repositories after each test. Other repositories like url and azure tend to throw exceptions rather than let us fetch their contents during the REST test. So we clean what we can.... Closes #18159	2016-09-15 14:49:11 -04:00
Boaz Leskes	8469c98e34	Fix LongGCDisruption to be aware of log4j2 (#20348 ) LongGCDisruption simulates a Long GC by suspending all threads belonging to a node. That's fine, unless those threads hold shared locks that can prevent other nodes from running. Concretely the logging infrastructure, which is shared between the nodes, can cause some deadlocks. LongGCDisruption has protection for this, but it needs to be updated to point at log4j2 classes, introduced in #20235 This commit also fixes improper handling of retry logic in LongGCDisruption and adds a protection against deadlocking the test code which activates the disruption (and uses logging too! :)). On top of that we have some new, evil and nasty tests.	2016-09-15 08:50:18 +02:00
Ali Beyad	3f79874042	Prevent the rolling upgrades rest tests from cleaning up indices after finishing if a the tests.rest.preserve_indices system property is set	2016-09-14 23:34:19 -04:00
Simon Willnauer	17ddee7011	Remove TransportService#registerRequestHandler leniency (#20469 ) `TransportService#registerRequestHandler` allowed to register handlers more than once and issues an annoying warn log message when this happens. This change simple throws an exception to prevent regsitering the same handler more than once. This commit also removes the ability to remove request handlers. Relates to #20468	2016-09-14 20:32:29 +02:00
Luca Cavanna	14e17f44a1	Replace usage of LuceneTestCase#getBaseTempDirForTestClass (#20484 ) LuceneTestCase#getBaseTempDirForTestClass is deprecated, we should not use it. Closes #15845	2016-09-14 19:35:20 +02:00
Simon Willnauer	89640965d2	Unguice SearchModule (#20456 ) After this change SearchModule doesn't subclass AbstractModule anymore and all wiring happens in `Node.java`. As a side-effect several tests don't need a guice injector anymore.	2016-09-14 10:07:53 +02:00
Jason Tedor	7560101ec7	Complete Elasticsearch logger names This commit modifies the logger names within Elasticsearch to be the fully-qualified class name as opposed removing the org.elasticsearch prefix and dropping the class name. This change separates the root logger from the Elasticsearch loggers (they were equated from the removal of the org.elasticsearch prefix) and enables log levels to be set at the class level (instead of the package level). Relates #20457	2016-09-13 22:46:54 -04:00
Jason Tedor	fbe27664a6	Fix prefix logging Today we add a prefix when logging within Elasticsearch. This prefix contains the node name, and index and shard-level components if appropriate. Due to some implementation details with Log4j 2 , this does not work for integration tests; instead what we see is the node name for the last node to startup. The implementation detail here is that Log4j 2 there is only one logger for a name, message factory pair, and the key derived from the message factory is the class name of the message factory. So, when the last node starts up and starts setting prefixes on its message factories, it will impact the loggers for the other nodes. Additionally, the prefixes are lost when logging an exception. This is due to another implementation detail in Log4j 2. Namely, since we log exceptions using a parameterized message, Log4j 2 decides that that means that we do not want to use the message factory that we have provided (the prefix message factory) and so logs the exception without the prefix. This commit fixes both of these issues. Relates #20429	2016-09-13 14:46:34 -04:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Nicholas Knize	ef926894f4	Cut over geo_point field and queries to new LatLonPoint type This commit cuts over geo_point fields to use Lucene's new point-based LatLonPoint type for indexes created in 5.0. Indexes created prior to 5.0 continue to use their respective encoding type. Below is a description of the changes made to support the new encoding type: * New indexes use a new LatLonPointFieldMapper which provides a parse method for the new type * The new LatLonPoint parse method removes support for lat_lon and geohash parameters * Backcompat testing for deprecated lat_lon and geohash parameters is added to all unit and integration tests * LatLonPointFieldMapper provides DocValues support (enabled by default) which uses Lucene's new LatLonDocValuesField type * New LatLonPoint field data classes are added for aggregation support (wraps LatLonPoint's Numeric Doc Values) * MultiFields use the geohash as the string value instead of the lat,lon string making it easier to perform geo string queries on the geohash instead of a lat,lon comma delimited string. Removed Features: * With the removal of geohash indexing, GeoHashCellQuery support is removed for all new indexes (still supported on existing indexes) * LatLonPoint does not support a Distance Range query because it is super inefficient. Instead, the geo_distance_range query should be accomplished using either the geo_distance aggregation, sorting by descending distance on a geo_distance query, or a boolean must not of the excluded distance (which is what the distance_range query did anyway). TODO: * fix/finish yaml changes for plugin and rest integration tests * update documentation	2016-09-13 12:17:36 -05:00
Jason Tedor	013e3f6fcc	Remove unused import from BootstrapForTesting This commit removes an unused import for o.e.c.l.LogConfigurator from o.e.b.BootstrapForTesting.	2016-09-13 09:49:15 -04:00
Tanguy Leroux	6090c51fc5	Add quiet option to disable console logging (#20422 ) This commit adds a -q/--quiet option to Elasticsearch so that it does not log anything in the console and closes stdout & stderr streams. This is useful for SystemD to avoid duplicate logs in both journalctl and /var/log/elasticsearch/elasticsearch.log while still allows the JVM to print error messages in stdout/stderr if needed. closes #17220	2016-09-13 14:08:24 +02:00
Lee Hinman	44278db1bc	Merge pull request #20433 from dakrone/remove-cluster-name-folder-fallback No longer allow cluster name in data path	2016-09-12 17:01:49 -05:00
Lee Hinman	94625d74e4	No longer allow cluster name in data path In 5.x we allowed this with a deprecation warning. This removes the code added for that deprecation, requiring the cluster name to not be in the data path. Resolves #20391	2016-09-12 15:47:01 -06:00
Simon Willnauer	686994ae2d	Deguice SearchService and friends (#20423 ) This change removes the guice dependency handling for SearchService and several related classes like SearchTransportController and SearchPhaseController. The latter two now have package private constructors and dependencies like FetchPhase are now created by calling their constructors explicitly. This also cleans up several users of the DefaultSearchContext and centralized it's creation inside SearchService.	2016-09-12 22:42:55 +02:00
Ali Beyad	b1e87aa13c	Split allocator decision making from decision application (#20347 ) Splits the PrimaryShardAllocator and ReplicaShardAllocator's decision making for a shard from the implementation of that decision on the routing table. This is a step toward making it easier to use the same logic for the cluster allocation explain APIs.	2016-09-12 16:21:39 -04:00
Boaz Leskes	b08352047d	Introduce IndexShardTestCase (#20411 ) Introduce a base class for unit tests that are based on real `IndexShard`s. The base class takes care of all the little details needed to create and recover shards. This commit also moves `IndexShardTests` and `ESIndexLevelReplicationTestCase` to use the new base class. All tests in `IndexShardTests` that required a full node environment were moved to a new `IndexShardIT` suite.	2016-09-12 18:20:25 +02:00
Ali Beyad	f39f9b9760	Update discovery nodes after cluster state is published (#20409 ) Before, when there was a new cluster state to publish, zen discovery would first update the set of nodes to ping based on the new cluster state, then publish the new cluster state. This is problematic because if the cluster state failed to publish, then the set of nodes to ping should not have been updated. This commit fixes the issue by updating the set of nodes to ping for fault detection only after the new cluster state has been published.	2016-09-12 12:07:51 -04:00
Luca Cavanna	4b00cc37a1	Merge pull request #20382 from javanna/enhancement/cleanup_parse_elements Cleanup sub fetch phase extension point	2016-09-09 22:47:15 +02:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
javanna	90ab460fcc	move parsing of search ext sections to the coordinating node	2016-09-09 19:10:42 +02:00

1 2 3 4 5 ...

874 Commits