OpenSearch

Commit Graph

Author	SHA1	Message	Date
Adrien Grand	2b8fa64cf7	ESIntegTestCase.indexRandom should not introduce types. (#24202 ) Since we plan on removing types, `indexRandom` should not introduce new types. This commit refactors `indexRandom` to reuse existing types.	2017-04-21 10:38:36 +02:00
Nik Everett	caf376c8af	Start building analysis-common module (#23614 ) Start moving built in analysis components into the new analysis-common module. The goal of this project is: 1. Remove core's dependency on lucene-analyzers-common.jar which should shrink the dependencies for transport client and high level rest client. 2. Prove that analysis plugins can do all the "built in" things by moving all "built in" behavior to a plugin. 3. Force tests not to depend on any oddball analyzer behavior. If tests need anything more than the standard analyzer they can use the mock analyzer provided by Lucene's test infrastructure.	2017-04-19 18:51:34 -04:00
Ali Beyad	3c82eea5fb	Wait for cluster to become quiescent between REST tests (#24148 ) [TEST] ensures REST tests wait for cluster state updates to finish processing before moving to the next test	2017-04-19 13:17:09 -04:00
Jim Ferenczi	f05af0a382	Enable index-time sorting (#24055 ) This change adds an index setting to define how the documents should be sorted inside each segment. It allows any numeric, date, boolean or keyword field inside a mapping to be used to sort the index on disk. `nested` fields are not allowed inside an index that defines an index sort, since `nested` fields rely on the original sort of the index. This change does not add early termination capabilities in the search layer. This will be added in a follow up. Relates #6720	2017-04-19 14:36:11 +02:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now a function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Jason Tedor	8033c576b7	Detect remnants of path.data/default.path.data bug In Elasticsearch 5.3.0 a bug was introduced in the merging of default settings when the target setting existed as an array. When this bug concerns path.data and default.path.data, we ended up in a situation where the paths specified in both settings would be used to write index data. Since our packaging sets default.path.data, users that configure multiple data paths via an array and use the packaging are subject to having shards land in paths in default.path.data when that is very likely not what they intended. This commit is an attempt to rectify this situation. If path.data and default.path.data are configured, we check for the presence of indices there. If we find any, we log messages explaining the situation and fail the node. Relates #24099	2017-04-17 07:03:46 -04:00
Ali Beyad	0afcaf5627	[TEST] fix BytesReference tests to never have a negative slice offset	2017-04-13 16:16:53 -04:00
Lee Hinman	5cace8e48a	Remove shadow replicas Resolves #22024	2017-04-11 11:26:26 -06:00
Colin Goodheart-Smithe	0114f0061c	Removes version 2.x constants from Version (#24011 ) * Removes version 2.x constants from Version Closes #21887 * Addresses review comments	2017-04-11 08:31:22 +01:00
Ryan Ernst	65f7a76630	Settings: Add secure file setting to keystore (#24001 ) Some systems like GCE rely on a plaintext file containing credentials. Rather than extract the information out of that credentials file and store each piece individually in the keystore, it is cleaner to just store the entire file. This commit adds support to the keystore wrapper for secure file settings. These are settings that contain an entire file that would normally be stored on the local filesystem. Retrieving the file returns an input stream to the file contents. This also adds an `add-file` command to the keystore cli. In order to support both strings and files as values for settings, the metadata format of the keystore has also been updated (with backcompat) to keep a map of setting name to type.	2017-04-10 13:10:42 -07:00
Jay Modi	42b0b05af1	Test: add support for replacing stashed values within headers of REST tests (#24014 ) This commit adds support for replacing a stashed value within a header of a REST test. This is useful for requests that may want to use a value previously obtained within a header.	2017-04-10 12:10:01 -04:00
javanna	3b7bc8012a	[TEST] increase minimum length of randomly generated fields in RandomObjects We had a couple of unfortunate field name collisions in our CI, where the json duplicate check tripped. Increasing the minimum length of randomly generated field names should decrease the chance of this issue happening again.	2017-04-10 11:32:23 +02:00
Ryan Ernst	d4c0ef0028	Settings: Migrate ec2 discovery sensitive settings to elasticsearch keystore (#23961 ) This change adds secure settings for access/secret keys and proxy username/password to ec2 discovery. It adds the new settings with the prefix `discovery.ec2`, copies other relevant ec2 client settings to the same prefix, and deprecates all other settings (`cloud.aws.` and `cloud.aws.ec2.`). Note that this is simpler than the client configs in repository-s3 because discovery is only initialized once for the entire node, so there is no reason to complicate the configuration with the ability to have multiple sets of client settings. relates #22475	2017-04-07 13:28:15 -07:00
Yannick Welsch	a3cceb8a00	[TEST] Fix testMultipleNodesShutdownNonMasterNodes to wait for the right nodes to rejoin the cluster This test was sporadically failing for the following reason: - 4 nodes (nodes 0, 1, 2, and 3) running with `minimum_master_nodes` set to 3 - we stop 2 nodes (node 0 and 3) - wait for cluster block to be in place on all nodes - start 2 nodes (node 4 and node 5) and do a `prepareHealth().setWaitForNodes("4")` - then do a search request The search request runs into the `ClusterBlockException` as the `prepareHealth().setWaitForNodes("4")` check succeeds on a cluster state that has nodes 1, 2, 3, and 4, i.e., only one of the two new nodes has joined the cluster and only one of the two dead nodes was removed by the master (removing the dead nodes only happens after there are again `minimum_master_nodes` nodes in the cluster). This commit fixes the issue by reusing a method from InternalTestCluster that checks that the right nodes have rejoined the cluster.	2017-04-07 15:26:21 +02:00
Luca Cavanna	13cf8aaa52	[TEST] fix shuffling of xContent keys (#23929 ) ESTestCase has methods to shuffle xContent keys given a builder or a parser. Shuffling wasn't actually doing what was expected but rather reordering the keys in their natural ordering, hence the output was always the same at every run. Corrected that and added tests, also fixed a couple of tests that were affected by this fix.	2017-04-07 10:20:32 +02:00
Lee Hinman	0257a7b97a	Only re-parse operation if a mapping update was needed When executing an index operation on the primary shard, `TransportShardBulkAction` first parses the document, sees if there are any mapping updates that need to be applied, and then updates the mapping on the master node. It then re-parses the document to make sure that the mappings have been applied and propagated. This adds a check that skips the second parsing of the document in the event there was not a mapping update applied in the first case. Fixes a performance regression introduced in #23665	2017-04-05 09:29:44 -06:00
Luca Cavanna	318d365b12	[TEST] make sure that fromXContent doesn't rely on keys ordering (#23901 ) We shuffle the keys before we parse our responses for the high level client so that we make sure we never rely on keys ordering.	2017-04-05 11:12:34 +02:00
Jason Tedor	3136ed1490	Rename random ASCII helper methods This commit renames the random ASCII helper methods in ESTestCase. This is because this method ultimately uses the random ASCII methods from randomized runner, but these methods actually only produce random strings generated from [a-zA-Z]. Relates #23886	2017-04-04 11:04:18 -04:00
Boaz Leskes	2266947ac5	testDifferentRolesMaintainPathOnRestart - fix broken comment	2017-04-04 11:03:44 +02:00
Boaz Leskes	20b274d7b9	testDifferentRolesMaintainPathOnRestart - lower join timeout as split elections are likely The test reduces the wait for initial cluster state to 0, causing multiple nodes to be started while elections are going on. This means there is a chance of a split election, which shouldn't cause the test to time out.	2017-04-04 10:36:09 +02:00
Jason Tedor	71293a89bf	Introduce single-node discovery This commit adds a single node discovery type. With this discovery type, a node will elect itself as master and never form a cluster with another node. Relates #23595	2017-04-04 03:02:58 -04:00
Boaz Leskes	40eb68c95a	testRestorePersistentSettings doesn't need to mess with discovery settings	2017-04-03 16:23:17 +02:00
Boaz Leskes	55a3fd1919	testDifferentRolesMaintainPathOnRestart shouldn't use auto managing of min master nodes It starts nodes in any order and thus disables the wait for the first cluster state at node start-up time; the latter is required for the auto management logic. Closes #23728	2017-04-03 16:23:17 +02:00
Boaz Leskes	5cf1d4ae90	mute testDifferentRolesMaintainPathOnRestart See https://github.com/elastic/elasticsearch/issues/23728	2017-04-03 10:23:04 +02:00
Jason Tedor	1d648a3d46	Fix BootstrapForTesting blowup This commit fixes an issue with BootstrapForTesting where the common case was to invoke a method with a null parameter that does not accept null.	2017-04-01 17:49:40 -04:00
Jason Tedor	8c554215e0	Ban Boolean#getBoolean The method Boolean#getBoolean is dangerous. It is too easy to mistakenly invoke this method thinking that it is parsing a string as a boolean. However, what it actually does is get a system property with the specified string, and then attempts to use usual crappy boolean parsing in the JDK to parse that system property as boolean with complete leniency (it parses every input value into either true or false); that is, this method amounts to invoking Boolean#parseBoolean(String) on the result of System#getProperty(String). Boo. This commit bans usage of this method. Relates #23864	2017-04-01 17:02:19 -04:00
Tim Brooks	5fa80a6521	Pass exception from sendMessage to listener (#23559 ) This commit changes the listener passed to sendMessage from a Runnable to an ActionListener. This change also removes IOException from the sendMessage signature. That signature is misleading as it allows implementers to assume an exception will be thrown in case of failure. That does not happen due to Netty's async nature.	2017-03-30 15:08:23 -05:00
Jason Tedor	48357e43d3	Honor update request timeout When executing an update request, the request timeout is not transferred to the index/delete request executed on behalf of the update request. This leads to update requests not timing out when they should (e.g., if not all shards are available when the request specifies wait_for_shards=all with a small timeout). This commit causes the index/delete requests to honor the update request timeout. Relates #23825	2017-03-30 14:38:34 -04:00
Ryan Ernst	f8453aca57	Packaging: Remove classpath ordering hack (#23596 ) After the removal of the joda time hack we used to have, we can cleanup the codebase handling in security, jarhell and plugins to be more picky about uniqueness. This was originally in #18959 which was never merged. closes #18959	2017-03-21 12:12:16 -07:00
Jason Tedor	7b17689458	Search took time should use a relative clock Search took time uses an absolute clock to measure elapsed time, and then tries to deal with the complexities of using an absolute clock for this purpose. Instead, we should use a high-precision monotonic relative clock that is designed exactly for measuring elapsed time. This commit modifies the search infrastructure to use a relative clock for measuring took time, but still provides an absolute clock for the components of search that require a real clock (e.g., index name expression resolution, etc.). Relates #23662	2017-03-20 18:48:51 -04:00
Igor Motov	1bd66136d7	Task Manager should be able to support non-transport tasks (#23619 ) Currently the task manager is tied to the transport and can only create tasks based on TransportRequests. This commit enables task manager to support tasks created by non-transport services such as the persistent tasks service.	2017-03-17 19:29:18 -04:00
Christoph Büscher	d02b6f58fa	Tests: Adapt ExistsQueryBuilderTests to changes in ExistQueryBuilder#toQuery() (#23462 ) Recent changes in the Lucene query that the ExistsQueryBuilder creates broke this test.	2017-03-02 18:27:30 +01:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as when they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks to (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Jason Tedor	64e193874f	Properly clean up thread context after tests Today when resetting the deprecation logger after a test is torn down, we attach a new thread context to the deprecation logger. This thread context is never cleared and we are left with a thread context attached to the deprecation logger for every test method that ran in the same JVM. This commit adds a flag when resetting the deprecation logger to not attach a new thread context when the test is being torn down. Relates #23441	2017-03-01 16:34:10 -05:00
Adrien Grand	3134d6b520	Add unit tests to percentile ranks aggregations. (#23240 ) Relates #22278	2017-03-01 13:57:40 +01:00
Jason Tedor	7ce06aeb8c	Fix date format in warning headers This commit fixes the date format in warning headers. There is some confusion around whether or not RFC 1123 requires two-digit days. However, the warning header specification very clearly relies on a format that requires two-digit days. This commit removes the usage of RFC 1123 date/time format from Java 8, which allows for one-digit days, in favor of a format that forces two-digit days (it's otherwise identical to RFC 1123 format, it is just fixed width). Relates #23418	2017-02-28 20:28:07 -05:00
Jason Tedor	ee2f6ccf32	Add convenience method for asserting deprecations This commit adds a convenience method for simultaneously asserting settings deprecations and other warnings and fixes some tests where setting deprecations and general warnings were present.	2017-02-28 18:24:39 -05:00
Ali Beyad	5e2e45cad9	Makes the same_shard host dynamically updatable (#23397 ) Previously, cluster.routing.allocation.same_shard.host was not a dynamic setting and could not be updated after startup. This commit changes the behavior to allow the setting to be dynamically updatable. The documentation already states that the setting is dynamic so no documentation changes are required. Closes #22992	2017-02-28 12:48:54 -05:00
Jim Ferenczi	5c84640126	Upgrade to lucene-6.5.0-snapshot-d00c5ca (#23385 ) Lucene upgrade	2017-02-27 18:39:04 +01:00
Jason Tedor	577e6a5e14	Correct warning header to be compliant The warning header used by Elasticsearch for delivering deprecation warnings has a specific format (RFC 7234, section 5.5). The format specifies that the warning header should be of the form warn-code warn-agent warn-text [warn-date] Here, the warn-code is a three-digit code which communicates various meanings. The warn-agent is a string used to identify the source of the warning (either a host:port combination, or some other identifier). The warn-text is a quoted string which conveys the semantic meaning of the warning. The warn-date is an optional quoted date that can be in a few different formats. This commit corrects the warning header within Elasticsearch to follow this specification. We use the warn-code 299 which means a "miscellaneous persistent warning." For the warn-agent, we use the version of Elasticsearch that produced the warning. The warn-text is unchanged from what we deliver today, but is wrapped in quotes as specified (this is important as a problem that exists today is that multiple warnings cannot be split by comma to obtain the individual warnings as the warnings might themselves contain commas). For the warn-date, we use the RFC 1123 format. Relates #23275	2017-02-27 12:14:21 -05:00
javanna	756e26cb33	[TEST] make headers case-insensitive when running yaml tests	2017-02-27 12:27:03 +01:00
javanna	4f487ab1b9	[TEST] randomize request content_type between all of the supported formats	2017-02-27 12:27:03 +01:00
javanna	9a2dba3036	[TEST] add support for binary responses to REST tests infra	2017-02-27 12:27:03 +01:00
javanna	ca858befab	[TEST] create HttpEntity earlier in REST tests This allows setting the content-type together with the body itself. At the moment it is always json, but this change makes it easier to randomize it later	2017-02-27 12:27:03 +01:00
javanna	04aaedc083	[TEST] Remove content type auto-detection while parsing request body in REST tests	2017-02-27 12:27:03 +01:00
Ryan Ernst	48548f6c3d	CLI: Fix prompting for yes/no to handle console returning null (#23320 ) Console.readText may return null in certain cases. This commit fixes a bug in Terminal.promptYesNo which assumed a non-null return value. It also adds a test for this, and modifies mock terminal to be able to handle null input values.	2017-02-24 20:20:17 -08:00
Simon Willnauer	ce625ebdcc	Expose `batched_reduce_size` via `_search` (#23288 ) In #23253 we added the ability to incrementally reduce search results. This change exposes the parameter to control the batch size and therefore the memory consumption of a large search request.	2017-02-21 18:36:59 +01:00
Tanguy Leroux	3a0fc526bb	UpdateRequest implements ToXContent (#23289 ) This commit changes UpdateRequest so that it implements the ToXContentObject interface.	2017-02-21 15:20:15 +01:00
Simon Willnauer	f933f80902	First step towards incremental reduction of query responses (#23253 ) Today all query results are buffered up until we received responses of all shards. This can hold on to a significant amount of memory if the number of shards is large. This commit adds a first step towards incrementally reducing aggregations results if a, per search request, configurable amount of responses are received. If enough query results have been received and buffered all so-far received aggregation responses will be reduced and released to be GCed.	2017-02-21 13:02:48 +01:00
Tanguy Leroux	872412f645	[Tests] Cleans up DocWriteResponse parsing tests (#23233 ) This commit cleans up some parsing tests added from the High Level Rest Client: IndexResponseTests, DeleteResponseTests, UpdateResponseTests, BulkItemResponseTests. These tests are now more uniform with the other test-from-to-XContent tests we have, they now shuffle the XContent fields before parsing, the asserting method for parsed objects does not use a Map<String, Object> anymore, and buggy equals/hashCode methods in ShardInfo and ShardInfo.Failure have been removed.	2017-02-20 09:45:33 +01:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Boaz Leskes	f83db675c8	Ensure network connections are restored after disruptions (#23135 ) With #22977, network disruption also disconnects nodes from the transport service. That has the side effect that when the disruption is healed, the disconnected node stay disconnected until the `NodeConnectionsService` restores the connection. This can take too long for the tests. This PR adds logic to the cluster healing to restore connections immediately. See https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=debian/611/console for an example failure.	2017-02-15 11:04:52 +02:00
Adrien Grand	8d6a41f671	Nested queries should avoid adding unnecessary filters when possible. (#23079 ) When nested objects are present in the mappings, many queries get deoptimized due to the need to exclude documents that are not in the right space. For instance, a filter is applied to all queries that prevents them from matching non-root documents (`+: -_type:__`). Moreover, a filter is applied to all child queries of `nested` queries in order to make sure that the child query only matches child documents (`_type:__nested_path`), which is required by `ToParentBlockJoinQuery` (the Lucene query behind Elasticsearch's `nested` queries). These additional filters slow down `nested` queries. In 1.7-, the cost was somehow amortized by the fact that we cached filters very aggressively. However, this has proven to be a significant source of slowdowns since 2.0 for users of `nested` mappings and queries, see #20797. This change makes the filtering a bit smarter. For instance if the query is a `match_all` query, then we need to exclude nested docs. However, if the query is `foo: bar` then it may only match root documents since `foo` is a top-level field, so no additional filtering is required. Another improvement is to use a `FILTER` clause on all types rather than a `MUST_NOT` clause on all nested paths when possible since `FILTER` clauses are more efficient. Here are some examples of queries and how they get rewritten: ``` "match_all": {} ``` This query gets rewritten to `ConstantScore(+:* -_type:__)` on master and `ConstantScore(_type:AutomatonQuery {\norg.apache.lucene.util.automaton.Automaton@4371da44})` with this change. The automaton is the complement of `_type:__` so it matches the same documents, but is faster since it is now a positive clause. Simplistic performance testing on a 10M index where each root document has 5 nested documents on average gave a latency of 420ms on master and 90ms with this change applied. 
``` "term": { "foo": { "value": "0" } } ``` This query is rewritten to `+foo:0 #(ConstantScore(+: -_type:__))^0.0` on master and `foo:0` with this change: we do not need to filter nested docs out since the query cannot match nested docs. While doing performance testing in the same conditions as above, response times went from 250ms to 50ms. ``` "nested": { "path": "nested", "query": { "term": { "nested.foo": { "value": "0" } } } } ``` This query is rewritten to `+ToParentBlockJoinQuery (+nested.foo:0 #_type:__nested) #(ConstantScore(+:* -_type:__))^0.0` on master and `ToParentBlockJoinQuery (nested.foo:0)` with this change. The top-level filter (`-_type:__`) could be removed since `nested` queries only match documents of the parent space, as well as the child filter (`#_type:__nested`) since the child query may only match nested docs since the `nested` object has both `include_in_parent` and `include_in_root` set to `false`. While doing performance testing in the same conditions as above, response times went from 850ms to 270ms.	2017-02-14 16:05:19 +01:00
Christoph Büscher	5b459a0bdc	[Tests] increase minimal field name when creating random objects I encountered several cases of duplicate field names when generating random fields using the RandomObjects helper. This leads to invalid json in some tests, so increasing the minimum field name length to four to make this less likely to happen.	2017-02-14 11:31:37 +01:00
Jason Tedor	5343b87502	Handle bad HTTP requests When Netty decodes a bad HTTP request, it marks the decoder result on the HTTP request as a failure, and reroutes the request to GET /bad-request. This leads to puzzling responses when a bad request is sent to Elasticsearch (if an index named "bad-request" does not exist then it produces an index not found exception and otherwise responds with the index settings for the index named "bad-request"). This commit addresses this by inspecting the decoder result on the HTTP request and dispatching the request to a bad request handler preserving the initial cause of the bad request and providing an error message to the client. Relates #23153	2017-02-13 17:39:25 -05:00
Jay Modi	61e383813d	Make the version of the remote node accessible on a transport channel (#23019 ) This commit adds a new method to the TransportChannel that provides access to the version of the remote node that the response is being sent on and that the request came from. This is helpful for serialization of data attached as headers.	2017-02-13 15:15:57 -05:00
jaymode	d8d03f45c2	Fix communication with 5.3.0 nodes This commit fixes communication with 5.3.0 nodes to send XContentType to these nodes since #22691 was backported to the 5.3 branch.	2017-02-13 13:15:51 -05:00
Boaz Leskes	6a8ef0ea74	Traces in testAdapterSendReceiveCallbacks should only listen to the relevant actions The tracer callback is only called after responses are sent. This can lead to concurrency issues where the tracer is notified of previously sent responses if it was added after the response was sent (enabling further execution of the test) but before the tracer callbacks are called.	2017-02-12 09:20:18 +02:00
Boaz Leskes	c2494bbaed	log extra information on failure of testAdapterSendReceiveCallbacks	2017-02-11 19:41:19 +02:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Boaz Leskes	cd1cb41603	Move EvilPeerRecoveryIT to a unit test in RecoveryDuringReplicationTests (#22900 ) EvilPeerRecoveryIT checks the scenario where recovery is happening while there are ongoing indexing operations that have already been assigned a seq#. This is fairly hard to achieve and the test goes through a couple of hoops via the plugin infra to achieve that. This PR extends the unit tests infra to allow for those hoops to happen in unit tests. This allows the test to be moved to RecoveryDuringReplicationTests Relates to #22484	2017-02-09 20:14:03 +02:00
Simon Willnauer	ecb01c15b9	Fold InternalSearchHits and friends into their interfaces (#23042 ) We have a bunch of interfaces that have only a single implementation for 6 years now. These interfaces are pretty useless from a SW development perspective and only add unnecessary abstractions. They also require lots of casting in many places where we expect that there is only one concrete implementation. This change removes the interfaces, makes all of the classes final and removes the duplicate `foo` `getFoo` accessors in favor of `getFoo` from these classes.	2017-02-08 14:40:08 +01:00
Yannick Welsch	9154686623	Remove legacy primary shard allocation mode based on versions (#23016 ) Elasticsearch v5.0.0 uses allocation IDs to safely allocate primary shards whereas prior versions of ES used a version-based mode instead. Elasticsearch v5 still has support for version-based primary shard allocation as it needs to be able to load 2.x shards. ES v6 can drop the legacy support.	2017-02-08 10:00:55 +01:00
Boaz Leskes	ba06c14a97	TransportService.connectToNode should validate remote node ID (#22828 ) #22194 gave us the ability to open low level temporary connections to remote node based on their address. With this use case out of the way, actual full blown connections should validate the node on the other side, making sure we speak to who we think we speak to. This helps in case where multiple nodes are started on the same host and a quick node restart causes them to swap addresses, which in turn can cause confusion down the road.	2017-02-07 22:11:32 +02:00
Ryan Ernst	470ad1ae4a	Settings: Add secure settings validation on startup (#22894 ) Secure settings from the elasticsearch keystore were not yet validated. This change improves support in Settings so that secure settings more seamlessly blend in with normal settings, allowing the existing settings validation to work. Note that the setting names are still not validated (yet) when using the elasticsearch-keystore tool.	2017-02-07 09:34:41 -08:00
Tim Brooks	27b7d9bd8d	Add FileSystemUtil method to read 'file:/' URLs (#23020 ) As part of #22116 we are going to forbid usage of api java.net.URL#openStream(). However in a number of places across the codebase we use this method to read files from the local filesystem. This commit introduces a helper method openFileURLStream(URL url) to read files from URLs. It does specific validation to ensure that only file:/ urls are read. Additionally, this commit removes the unneeded method FileSystemUtil.newBufferedReader(URL, Charset). This method used the openStream() method which will soon be forbidden. Instead we use the Files.newBufferedReader(Path, Charset).	2017-02-07 10:24:22 -06:00
Boaz Leskes	03ef756539	MockTransportService should physically disconnect when simulating it (#22977 ) This is in order to trigger listeners for disconnect events, most importantly the NodeFaultDetection. MockTransportService now does a slightly better job at mimicking real life failures: connecting to an already connected node will be a noop (we don't detect any errors here in production either) and failing to send will cause the target node to be disconnected. This is the cause of failure in https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.2+multijob-unix-compatibility/os=debian/72	2017-02-06 17:44:29 +01:00
Boaz Leskes	5e7d22357f	Connect to new nodes concurrently (#22984 ) When a node receives a new cluster state from the master, it opens up connections to any new node in the cluster state. That has always been done serially on the cluster state thread but it has been a long standing TODO to do this concurrently, which is done by this PR. This is a spin-off of #22828, where an extra handshake is done whenever connecting to a node, which may slow down connecting. Also, the handshake is done in a blocking fashion which triggers assertions w.r.t blocking requests on the cluster state thread. Instead of adding an exception, I opted to implement concurrent connections which both side steps the assertion and compensates for the extra handshake.	2017-02-06 16:32:41 +01:00
Jason Tedor	9a0b216c36	Upgrade checkstyle to version 7.5 This commit upgrades the checkstyle configuration from version 5.9 to version 7.5, the latest version as of today. The main enhancement obtained via this upgrade is better detection of redundant modifiers. Relates #22960	2017-02-03 09:46:44 -05:00
Jay Modi	7520a107be	Optionally require a valid content type for all rest requests with content (#22691 ) This change adds a strict mode for xcontent parsing on the rest layer. The strict mode will be off by default for 5.x and in a separate commit will be enabled by default for 6.0. The strict mode, which can be enabled by setting `http.content_type.required: true` in 5.x, will require that all incoming rest requests have a valid and supported content type header before the request is dispatched. In the non-strict mode, the Content-Type header will be inspected and if it is not present or not valid, we will continue with auto detection of content like we have done previously. The content type header is parsed to the matching XContentType value with the only exception being for plain text requests. This value is then passed on with the content bytes so that we can reduce the number of places where we need to auto-detect the content type. As part of this, many transport requests and builders were updated to provide methods that accepted the XContentType along with the bytes and the methods that would rely on auto-detection have been deprecated. In the non-strict mode, deprecation warnings are issued whenever a request with body doesn't provide the Content-Type header. See #19388	2017-02-02 14:07:13 -05:00
Igor Motov	c34b63dadd	Expand AbstractSerializingTestCase and AbstractWireSerializingTestCase to test diff serialization This commit adds two additional test cases that can be used to verify correct diff serialization in addition to binary and xcontent serialization.	2017-02-02 12:19:53 -05:00
Tanguy Leroux	f86fd62821	Parse elasticsearch exception's root causes (#22924 ) This commit changes the ElasticsearchException.failureFromXContent() method so that it now parses root causes which were ignored before, and adds them as suppressed exceptions of the returned exception.	2017-02-02 17:00:16 +01:00
Boaz Leskes	eb36b82de4	Seq Number based recovery should validate last lucene commit max seq# (#22851 ) The seq# based recovery logic relies on rolling back lucene to remove any operations above the global checkpoint. This part of the plan is not implemented yet but we have to have these guarantees. Instead we should make the seq# logic validate that the last commit point (and the only one we have) maintains the invariant and if not, fall back to file based recovery. This commit adds a test that creates a situation where rollback is needed (primary failover with ops in flight) and fixes another issue that was surfaced by it - if a primary can't serve a seq# based recovery request and does a file copy, it still used the incoming `startSeqNo` as a filter. Relates to #22484 & #10708	2017-01-31 20:27:31 +01:00
Ryan Ernst	29f63c78cc	Internal: Convert empty and size checks of settings to not use getAsMap() (#22890 ) With the new secure settings, methods like getAsMap() no longer work correctly as a means of checking for empty settings, or the total size. This change converts the existing uses of that method to use methods directly on Settings. Note this does not update the implementations to account for SecureSettings, as that will require a followup which changes how secure settings work.	2017-01-31 10:44:09 -08:00
Nik Everett	e042c77301	Add tests for reducing top hits (#22837 ) Also adds many `equals` and `hashCode` implementations and moves the failure printing in `MatchAssertion` into a common spot and exposes it over `assertEqualsWithErrorMessageFromXContent` which does an object equality test but then uses `toXContent` to print the differences. Relates to #22278	2017-01-27 20:54:11 -05:00
Nik Everett	2e48fb8294	Move delete by query helpers into core (#22810 ) This moves the building blocks for delete by query into core. This should enable two things: 1. Plugins other than reindex to implement "bulk by scroll" style operations. 2. Plugins to directly call delete by query. Those plugins should be careful to make sure that task cancellation still works, but this should be possible. Notes: 1. I've mostly just moved classes and moved around test methods. 2. I haven't been super careful about cohesion between these core classes and reindex. They are quite interconnected because I wanted to make the change as mechanical as possible. Closes #22616	2017-01-27 16:09:18 -05:00
Ryan Ernst	aad51d44ab	S3 repository: Add named configurations (#22762 ) * S3 repository: Add named configurations This change implements named configurations for s3 repository as proposed in #22520. The access/secret key secure settings which were added in #22479 are reverted, and the only secure settings are those with the new named configs. All other previously used settings for the connection are deprecated. closes #22520	2017-01-27 10:42:45 -08:00
Nik Everett	8abd4101eb	Add tests for reducing top hits Also adds many `equals` and `hashCode` implementations and moves the failure printing in `MatchAssertion` into a common spot and exposes it over `assertEqualsWithErrorMessageFromXContent` which does an object equality test but then uses `toXContent` to print the differences. Relates to #22278	2017-01-27 12:32:17 -05:00
Jason Tedor	930282e161	Introduce sequence-number-based recovery This commit introduces sequence-number-based recovery. When a replica has fallen out of sync, rather than performing a file-based recovery we first attempt to replay operations since the last local checkpoint on the replica. To do this, at the start of recovery the replica tells the primary what its local checkpoint is. The primary will then wait for all operations between that local checkpoint and the current maximum sequence number to complete; this is to ensure that there are no gaps in the operations that will be replayed from the primary to the replica. This is a best-effort attempt as we currently have no guarantees on the primary that these operations will be available; if we are not able to replay all operations in the desired range, we just fallback to file-based recovery. Later work will strengthen the guarantees. Relates #22484	2017-01-27 08:16:38 -08:00
Jim Ferenczi	e48bc2eed7	Add field collapsing for search request (#22337 ) * Add top hits collapsing to search request The field collapsing is done with a custom top docs collector that "collapses" search hits with the same field value. The distributed aspect is resolved using the two passes that the regular search uses. The first pass "collapses" the top hits, then the coordinating node merges/collapses the top hits from each shard. ``` GET _search { "collapse": { "field": "category" } } ``` This change also adds an ExpandCollapseSearchResponseListener that intercepts the search response and expands collapsed hits using the CollapseBuilder#innerHit options. The retrieval of each inner_hits is done by sending a query to all shards filtered by the collapse key. ``` GET _search { "collapse": { "field": "category", "inner_hits": { "size": 2 } } } ```	2017-01-23 16:33:51 +01:00
Simon Willnauer	27b5c2ad54	Pass `forceExecution` flag to transport interceptor (#22739 ) To effectively allow a plugin to intercept a transport handler it needs to know if the handler must be executed even if there is a rejection on the thread pool in the case the wrapper forks a thread to execute the actual handler.	2017-01-23 11:04:27 +01:00
Simon Willnauer	824beea89d	Fix handling of document failure exception in InternalEngine (#22718 ) Today we try to be smart and make a generic decision if an exception should be treated as a document failure but in some cases concurrency in the index writer makes this decision very difficult since we don't have a consistent state in the case another thread is currently failing the IndexWriter/InternalEngine due to a tragic event. This change simplifies the exception handling and makes specific decisions about document failures rather than using a generic heuristic. This prevents exceptions from being treated as document failures when they should have failed the engine but backed out of failing since some other thread has already taken over the failure procedure but didn't finish yet.	2017-01-20 16:55:00 +01:00
Ryan Ernst	c5b4bba30b	S3 repository: Deprecate specifying credentials through env vars, sys props, and remove profile files (#22567 ) * S3 repository: Deprecate specifying credentials through env vars and sys props This is a follow up to #22479, where a secure way of storing credentials was added.	2017-01-19 12:36:32 -08:00
Simon Willnauer	24e2847af2	Streamline foreign stored context restore and allow to preserve response headers (#22677 ) Today we do not preserve response headers if they are present on a transport protocol response. While preserving these headers is not always desired, in most cases we should pass on these headers to have consistent results for deprecation headers etc. Yet this hasn't been much of a problem since most of the deprecations are detected early, i.e. on the coordinating node, such that this bug wasn't uncovered until #22647. This commit allows optionally preserving headers when a context is restored and also streamlines the context restore since it leaked frequently into the caller's thread context when the caller's context wasn't restored again.	2017-01-18 16:17:54 +01:00
Simon Willnauer	19f9cb307a	Merge branch 'master' into feature/multi_cluster_search	2017-01-18 09:24:35 +01:00
Luca Cavanna	bc5b604cbd	[TEST] parse global parameters from _common.json (#22655 ) Replace the hardcoded global parameters in the yaml test suite with parameters parsed from the newly added _common.json file. Relates to #22569	2017-01-17 16:13:09 +01:00
Ali Beyad	e2977889b8	Allow comma delimited array settings to have a space after each entry (#22591 ) Previously, certain settings that could take multiple comma delimited values would pick up incorrect values for all entries but the first if each comma separated value was followed by a whitespace character. For example, the multi-value "A,B,C" would be correctly parsed as ["A", "B", "C"] but the multi-value "A, B, C" would be incorrectly parsed as ["A", " B", " C"]. This commit allows a comma separated list to have whitespace characters after each entry. The specific settings that were affected by this are: cluster.routing.allocation.awareness.attributes index.routing.allocation.require.* index.routing.allocation.include.* index.routing.allocation.exclude.* cluster.routing.allocation.require.* cluster.routing.allocation.include.* cluster.routing.allocation.exclude.* http.cors.allow-methods http.cors.allow-headers For the allocation filtering related settings, this commit also provides validation of each specified entry if the filtering is done by _ip, _host_ip, or _publish_ip, to ensure that each entry is a valid IP address. Closes #22297	2017-01-17 08:51:04 -06:00
Simon Willnauer	709cb9a39e	Merge branch 'master' into feature/multi_cluster_search	2017-01-17 12:34:36 +01:00
Michael McCandless	ebd38e2a6a	Expose FlattenGraphTokenFilter (#22643 ) FlattenGraphTokenFilter is necessary for using graph-based token streams (e.g. the new SynonymGraphFilter) during indexing.	2017-01-16 16:53:32 -05:00
Simon Willnauer	f30b1f82ee	Remove HttpServer and HttpServerAdapter in favor of a simple dispatch method (#22636 ) Today we have quite some abstractions that are essentially providing a simple dispatch method to the plugins defining a `HttpServerTransport`. This commit removes `HttpServer` and `HttpServerAdapter` and introduces a simple `Dispatcher` functional interface that delegates to `RestController` by default. Relates to #18482	2017-01-16 21:06:08 +01:00
Luca Cavanna	193111919c	move ignore parameter support from yaml test client to low level rest client (#22637 ) All the language clients support a special ignore parameter that doesn't get passed to elasticsearch with the request, but is used to indicate which error codes should not lead to an exception if returned for a specific request. Moving this to the low level REST client will allow the high level REST client to make use of it too, for instance so that it doesn't have to intercept ResponseExceptions when the get api returns a 404.	2017-01-16 18:54:44 +01:00
Simon Willnauer	895124e67e	Merge branch 'master' into feature/multi_cluster_search	2017-01-16 13:20:45 +01:00
Simon Willnauer	5f0344a918	Pass ThreadContext to transport interceptors to allow header modification (#22618 ) TransportInterceptors are commonly used to enrich requests with headers etc. which requires access to the thread context. This is not always easily possible since threadpools are hard to access, for instance if the interceptor is used on a transport client. This commit passes on the thread context to all the interceptors for further consumption. Closes #22585	2017-01-15 13:35:39 +01:00
Simon Willnauer	3f784a4424	Merge branch 'master' into feature/multi_cluster_search	2017-01-15 10:28:34 +01:00
Simon Willnauer	2dd0ec57b2	[TEST] Remove connection listener from all transports in AbstractSimpleTransportTestCase#testSendRandomRequests	2017-01-13 23:19:04 +01:00
Simon Willnauer	63e4552c0d	Merge branch 'master' into feature/multi_cluster_search	2017-01-13 23:07:20 +01:00
Simon Willnauer	4c1ee018f6	Remove setLocalNode from ClusterService and TransportService (#22608 ) ClusterService and TransportService expect the local discovery node to be set before they are started but this requires manual interaction and is error prone since to work correctly they should share the same instance (same ephemeral ID). TransportService also has 2 modes of operation, mainly related to transport client vs. internal to a node. This change removes the mode where we don't maintain a local node and uses a dummy local node in the transport client since we don't bind to any port in such a case. Local discovery node instances are now managed by the node itself and only suppliers and factories that allow creation only once are passed to TransportService and ClusterService.	2017-01-13 16:12:27 +01:00
Simon Willnauer	d5fa84f869	Harder close and remove reference concurrency in MockTcpTransport (#22613 ) There was still a small race in MockTcpTransport where channels that are concurrently closing are not yet removed from the reference tracking, causing tests to fail. Compared to the other races before, this is a rather small window and requires very, very short test durations.	2017-01-13 16:04:05 +01:00
Simon Willnauer	6779ea9c2a	Merge branch 'master' into feature/multi_cluster_search	2017-01-13 12:10:23 +01:00
Simon Willnauer	acf2d2f86f	Ensure new connections won't be opened if transport is closed or closing (#22589 ) Today there are several races / holes in TcpTransport and MockTcpTransport that can allow connections to be opened and remain unclosed while the actual transport implementation is closed. A recently added assertion in #22554 exposes these problems. This commit fixes several issues related to missed locks or channel creations outside of a lock not checking if the resource is still open.	2017-01-12 20:27:09 +01:00
javanna	8072f168a3	Remove ParseFieldMatcher usages from QueryParseContext	2017-01-12 14:43:35 +01:00
Luca Cavanna	7674de9e1f	Move human flag under always accepted query_string params (#22562 ) There are some parameters that are accepted by each and every api we expose. Those (pretty, source, error_trace and filter_path) are not explicitly listed in the spec of every api, rather whitelisted in clients test runners so that they are always accepted. The `human` flag has been treated up until now as a parameter that's accepted by only some stats and info api, but that doesn't reflect reality as es core treats it exactly like `pretty` (relevant especially now that we validate params and throw exception when we find one that is not supported). Furthermore, the human flag has effect on every api that outputs a date, time, percentage or byte size field. For instance the tasks api outputs a date field although they don't have the human flag explicitly listed in their spec. There are other similar cases. This commit removes the human flag from the rest spec and makes it an always accepted query_string param.	2017-01-12 10:04:45 +01:00
Simon Willnauer	00781d24ce	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 23:40:46 +01:00
Simon Willnauer	8a0393f718	Move assertion for open channels under TcpTransport lock TcpTransport has an actual mechanism to stop resources in subclasses. Instead of overriding `doStop` subclasses should override `stopInternal` that is executed under the connection lock guaranteeing that there is no concurrency etc. Relates to #22554	2017-01-11 23:37:12 +01:00
Ryan Ernst	8015fbbf25	Make s3 repository sensitive settings use secure settings (#22479 ) * Settings: Make s3 repository sensitive settings use secure settings This change converts repository-s3 to use the new secure settings. In order to support the multiple ways we allow aws creds to be configured, it also moves the main methods for the keystore wrapper into a SecureSettings interface, in order to allow settings prefixing to work.	2017-01-11 11:19:46 -08:00
Simon Willnauer	d3124dd62b	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 17:03:30 +01:00
Simon Willnauer	6810125a8b	Prevent open channel leaks if handshake times out or is interrupted (#22554 ) The low level TCP handshake can cause channel / connection leaks if it's interrupted since the caller doesn't close the channel / connection if the handshake was not successful. This commit fixes the channel leak and adds general test infrastructure to detect channel leaks in the future.	2017-01-11 17:02:36 +01:00
Simon Willnauer	6d2d878068	Merge branch 'master' into feature/multi_cluster_search	2017-01-11 09:28:00 +01:00
Tanguy Leroux	2dcb05fca8	Add fromxcontent methods to index response (#22229 ) This commit adds the parsing fromXContent() methods to the IndexResponse class. The method is based on an ObjectParser because it is easier to use when parsing parent abstract classes like DocWriteResponse. It also changes the ReplicationResponse.ShardInfo so that it now implements ToXContentObject. This way, the ShardInfo.fromXContent() method can be used by the IndexResponse's ObjectParser.	2017-01-10 20:25:32 +01:00
Yannick Welsch	c35277e623	[TEST] Fix JSON generation of failure in InternalTestCluster Relates to #22387	2017-01-10 17:53:04 +01:00
Boaz Leskes	f387848f83	MockTransportService.doClose assertions should check openConnections under lock	2017-01-10 14:03:31 +01:00
Yannick Welsch	9fc1a735cc	Keep NodeConnectionsService in sync with current nodes in the cluster state (#22509 ) The NodeConnectionsService currently determines which nodes to connect to / disconnect from by inspecting cluster state changes and connecting to added nodes / disconnecting from removed nodes. When a master steps down (for example due to another master-eligible node shutting down which brings the number of master-eligible nodes below minimum_master_nodes), and the connection to other existing nodes was dropped while pinging, however, the connection to these nodes is not re-established while publishing the first cluster state that establishes the node as master. This commit changes the NodeConnectionsService connect / disconnect logic to always rely on the state that is to be / was published, looking not only at the added / removed nodes, but validating that exactly all nodes that are currently registered in NodeConnectionsService are connected (corresponds to a NOOP if the node is already connected).	2017-01-10 13:29:49 +01:00
Simon Willnauer	1ef98ede17	Merge branch 'master' into feature/multi_cluster_search	2017-01-09 12:09:23 +01:00
Nik Everett	12923ef896	Close and flush refresh listeners on shard close Right now closing a shard looks like it strands refresh listeners, causing tests like `delete/50_refresh/refresh=wait_for waits until changes are visible in search` to fail. Here is a build that fails: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+multi_cluster_search+multijob-darwin-compatibility/4/console This attempts to fix the problem by implementing `Closeable` on `RefreshListeners` and rejecting listeners when closed. More importantly the act of closing the instance flushes all pending listeners so we shouldn't have any stranded listeners on close. Because it was needed for testing, this also adds the number of pending listeners to the `CommonStats` object and all APIs to which that flows: `_cat/nodes`, `_cat/indices`, `_cat/shards`, and `_nodes/stats`.	2017-01-06 20:03:32 -05:00
Ryan Ernst	cd6e3f4cea	Merge branch 'master' into keystore	2017-01-06 09:32:08 -08:00
Tim B	b9c2c2f6f0	Move IfConfig.logIfNecessary call into bootstrap (#22455 ) This is related to #22116. A logIfNecessary() call makes a call to NetworkInterface.getInterfaceAddresses() requiring SocketPermission connect privileges. By moving this to bootstrap the logging call can be made before installing the SecurityManager.	2017-01-06 11:10:53 -06:00
Simon Willnauer	418ec62bfb	Merge branch 'master' into feature/multi_cluster_search	2017-01-06 10:24:40 +01:00
Ryan Ernst	eb596d7270	more renames	2017-01-06 01:03:45 -08:00
javanna	d87a30647b	remove ParseFieldMatcher usages from SearchAfterBuilder	2017-01-05 19:33:04 +01:00
Simon Willnauer	0183b0c5a8	More cleanups	2017-01-05 15:23:55 +01:00
Simon Willnauer	80bf01d3c0	Merge branch 'master' into feature/multi_cluster_search	2017-01-05 08:00:03 +01:00
Simon Willnauer	a5daa5d3a2	Execute low level handshake in #openConnection (#22440 ) Today we execute the low level handshake on the TCP layer in #connectToNode. If #openConnection is used directly, which is truly expert, no handshake is executed which allows connecting to nodes that are not necessarily compatible. This change moves the handshake to #openConnection to prevent bypassing this logic.	2017-01-05 07:32:53 +01:00
Tim B	be22a250b6	Replace Socket, ServerSocket, and HttpServer usages in tests with mocksocket versions (#22287 ) This integrates the mocksocket jar with elasticsearch tests. Mocksocket wraps actions requiring SocketPermissions in doPrivileged blocks. This will eventually allow SocketPermissions to be assigned to the mocksocket jar as opposed to the entire elasticsearch codebase.	2017-01-04 14:38:51 -06:00
Adrien Grand	f8998fece5	Upgrade to lucene-6.4.0-snapshot-084f7a0. (#22413 )	2017-01-04 19:03:52 +01:00
Simon Willnauer	e642965804	Cleanup lots of code, add javadocs and tests	2017-01-04 17:26:00 +01:00
Simon Willnauer	c6573e6e56	Filter actions to trace in test Notifications for request tracing are invoked concurrently and can still be in flight once a tracer is installed in the test. This can lead to side-effects since the test relied on exact invocations. This commit adds action filtering to the test tracer to only count invocations for the relevant actions. Closes #22418	2017-01-03 23:40:52 +01:00
Simon Willnauer	422cd1ef77	Add support for proxy nodes This commit adds full support for proxy nodes on the search layer. This allows connecting only to a small set of nodes on a remote cluster to execute the search. The nodes will proxy the request to the correct node in the cluster while the coordinating node doesn't need to be connected to the target node.	2017-01-03 17:24:32 +01:00
javanna	6329a98a97	Remove ParseFieldMatcher usages from SearchContext	2017-01-03 15:52:32 +01:00
javanna	71d6a37032	[TEST] assign blacklistPathMatchers only after the contexts have been assigned There could be an issue creating the REST clients and/or making the first request to the external cluster. If that happens, the blacklist has already been assigned and the following tests will fail because of an assertion that checks that the blacklist is not already assigned when the contexts are not.	2017-01-03 15:25:05 +01:00
Daniel Mitterdorfer	1ed64f0551	Eliminate unnecessary declaration of IOException With this commit we remove the declaration of IOException from assertWarnings and modify all call sites. Checked with @javanna	2017-01-03 12:36:28 +01:00
Igor Motov	ca90d9ea82	Remove PROTO-based custom cluster state components Switches custom cluster state components from PROTO-based de-serialization to named objects based de-serialization	2016-12-28 13:32:35 -05:00
Adrien Grand	2d81750a13	Make ESTestCase resilient to initialization errors.	2016-12-26 14:55:22 +01:00
Adrien Grand	d89757b848	Fix mutate function to always actually modify the failure object.	2016-12-26 10:34:50 +01:00
Jason Tedor	ddf4a463f3	Reject invalid test logging annotations Today we silently ignore invalid test logging annotations. This commit rejects these annotations, failing the processing of the annotation and aborting the test.	2016-12-23 07:51:35 -05:00
Jason Tedor	432ec54347	Apply logging levels in hierarchical order This commit adds a test for applying logging levels in hierarchical order, and addresses an issue with restoring the logging levels at the end of a test or suite.	2016-12-23 07:51:19 -05:00
Yannick Welsch	baea17b53f	Separate cluster update tasks that are published from those that are not (#21912 ) This commit factors out the cluster state update tasks that are published (ClusterStateUpdateTask) from those that are not (LocalClusterUpdateTask), serving as a basis for future refactorings to separate the publishing mechanism out of ClusterService.	2016-12-23 12:23:52 +01:00
Boaz Leskes	215874aff3	process TestLogging annotation value in prefix-first order We have to sort the logger names so they don't override each other. Processing org.elasticsearch:DEBUG after org.elasticsearch.transport:TRACE resets the setting of the latter	2016-12-23 09:03:43 +01:00
Ryan Ernst	fb690ef748	Settings: Add infrastructure for elasticsearch keystore This change is the first towards providing the ability to store sensitive settings in elasticsearch. It adds the `elasticsearch-keystore` tool, which allows managing a java keystore. The keystore is loaded upon node startup in Elasticsearch, and used by the Setting infrastructure when a setting is configured as secure. There are a lot of caveats to this PR. The most important is it only provides the tool and setting infrastructure for secure strings. It does not yet provide for keystore passwords, keypairs, certificates, or even convert any existing string settings to secure string settings. Those will all come in follow up PRs. But this PR was already too big, so this at least gets a basic version of the infrastructure in. There are two main things to look at. The first is the `SecureSetting` class, which extends `Setting`, but removes the assumption for the raw value of the setting to be a string. SecureSetting provides, for now, a single helper, `stringSetting()` to create a SecureSetting which will return a SecureString (which is like String, but is closeable, so that the underlying character array can be cleared). The second is the `KeyStoreWrapper` class, which wraps the java `KeyStore` to provide a simpler api (we do not need the entire keystore api) and also extend the serialized format to add metadata needed for loading the keystore with no assumptions about keystore type (so that we can change this in the future) as well as whether the keystore has a password (so that we can know whether prompting is necessary when we add support for keystore passwords).	2016-12-22 16:28:34 -08:00
Nik Everett	f5f2149ff2	Remove much ceremony from parsing client yaml test suites (#22311 ) * Remove a checked exception, replacing it with `ParsingException`. * Remove all Parser classes for the yaml sections, replacing them with static methods. * Remove `ClientYamlTestFragmentParser`. Isn't used any more. * Remove `ClientYamlTestSuiteParseContext`, replacing it with some static utility methods. I did not rewrite the parsers using `ObjectParser` because I don't think it is worth it right now.	2016-12-22 11:00:34 -05:00
Colin Goodheart-Smithe	06576ed13b	Adds abstract test classes for serialisation (#22281 ) This adds test classes that can be used to test the wire serialisation and (optionally) the XContent serialisation of objects that implement Streamable/Writeable and ToXContent. These test classes will enable classes such as InternalAggregation (or at least its implementations) to be tested in a consistent way when it comes to testing serialisation.	2016-12-22 10:49:18 +00:00
Jason Tedor	7946396fe6	Introduce translog no-op As the translog evolves towards a full operations log as part of the sequence numbers push, there is a need for the translog to be able to represent operations for which a sequence number was assigned, but the operation did not mutate the index. Examples of how this can arise are operations that fail after the sequence number is assigned, and gaps in this history that arise when an operation is assigned a sequence number but the operation never completed (e.g., a node crash). It is important that these operations appear in the history so that they can be replicated and replayed during recovery as otherwise the history will be incomplete and local checkpoints will not be able to advance. This commit introduces a no-op to the translog to set the stage for these efforts. Relates #22291	2016-12-21 23:08:16 -05:00
Boaz Leskes	0e9186e137	Simplify Unicast Zen Ping (#22277 ) The `UnicastZenPing` shows its age and is the result of many small changes. The current state of affairs is confusing and is hard to reason about. This PR cleans it up (while following the same original intentions). Highlights of the changes are: 1) Clear 3 round flow - no interleaving of scheduling. 2) The previous implementation did a best effort attempt to wait for ongoing pings to be sent and completed. The pings were guaranteed to complete because each used the total ping duration as a timeout. This did make it hard to reason about the total ping duration and the flow of the code. All of this is removed now and ping should just complete within the given duration or not be counted (note that it was very handy for testing, but I moved the needed sync logic to the test). 3) Because of (2) the pinging scheduling changed a bit, to give a chance for the last round to complete. We now ping at the beginning, 1/3 and 2/3 of the duration. 4) To offset for (3) a bit, incoming ping requests are now added to ongoing ping collections. 5) UnicastZenPing never establishes full blown connections (but does reuse them if there). Relates to #22120 6) Discovery host providers are only used once per pinging round. Closes #21739 7) Usage of the ability to open a connection without connecting to a node ( #22194 ) and shorter connection timeouts helps with connections piling up. Closes #19370 8) Beefed up testing and sped them up. 9) removed light profile from production code	2016-12-21 15:09:58 +01:00
Nik Everett	567c65b0d5	Replace IndicesQueriesRegistry (#22289 ) * Switch query parsing to namedObject * Remove IndicesQueriesRegistry	2016-12-21 09:05:14 -05:00
javanna	7141f6b554	[TEST] improve error message in ESTestCase#assertWarnings	2016-12-21 13:31:02 +01:00
Luca Cavanna	ae01a51b44	[TEST] make ESSingleNodeTestCase tests repeatable (#22283 ) If we conditionally do random things, e.g. initialize a node only after the first test, we have to make sure that we unconditionally create a new seed calling random.nextLong(), then initialize the node under a private randomness context. This makes sure that any random usage through Randomness.get() will retrieve the proper random instance through RandomizedContext.current().getRandom(). When running under private randomness, the context will return the Random instance that was created with the provided seed (forked from the main random instance) rather than the main Random that's exposed to tests as well. Otherwise tests become non repeatable because that initialization part happens only before the first executed test.	2016-12-21 11:44:24 +01:00
Nik Everett	a04dcfb95b	Introduce XContentParser#namedObject (#22003 ) Introduces `XContentParser#namedObject` which works a little like `StreamInput#readNamedWriteable`: on startup components register parsers under names and a superclass. At runtime we look up the parser and call it to parse the object. Right now the parsers take a context object they use to help with the parsing but I hope to be able to eliminate the need for this context as most of what it is used for at this point is to move around parser registries which should be replaced by this method eventually. I make no effort to do so in this PR because it is big enough already. This is meant to be a start down a road that allows us to remove classes like `QueryParseContext`, `AggregatorParsers`, `IndicesQueriesRegistry`, and `ParseFieldRegistry`. The goal here is to reduce the amount of plumbing required to allow parsing pluggable things. With this you don't have to pass registries all over the place. Instead you must pass a super registry to fewer places and use it to wrap the reader. This is the same tradeoff that we use for NamedWriteable and it allows much, much simpler binary serialization. We think we want that same thing for xcontent serialization. The only parsing actually converted to this method is parsing `ScoreFunctions` inside of `FunctionScoreQuery`. I chose this because it is relatively self contained.	2016-12-20 11:05:24 -05:00
Ryan Ernst	850f51db01	Internal: Refactor SettingCommand into EnvironmentAwareCommand (#22175 ) * Internal: Refactor SettingCommand into EnvironmentAwareCommand This change renames and changes the behavior of SettingCommand to have its primary method take in a fully initialized Environment for elasticsearch instead of just a map of settings. All of the subclasses of SettingCommand already did this at some point, so this just removes duplication.	2016-12-19 15:23:44 -08:00
javanna	5dae10db11	[TEST] add warnings check to ESTestCase We are currently checking that no deprecation warnings are emitted in our query tests. That can be moved to ESTestCase (disabled in ESIntegTestCase) as it allows us to easily catch where our tests use deprecated features and assert on the expected warnings.	2016-12-19 19:39:56 +01:00
javanna	6a27628f12	Remove support for strict parsing mode We return deprecation warnings as response headers, besides logging them. Strict parsing mode stayed around, but was only used in query tests, though we also introduced checks for deprecation warnings there that don't need strict parsing anymore (see #20993). We can then safely remove support for strict parsing mode. The final goal is to remove the ParseFieldMatcher class, but there are many many users of it. This commit prepares the field for the removal, by deprecating ParseFieldMatcher and making it effectively not needed. Strict parsing is removed from ParseFieldMatcher, and strict parsing is replaced in tests where needed with deprecation warnings checks. Note that the setting to enable strict parsing was never ported to the new settings infra, hence it cannot be set in production. It is really only used in our own tests. Relates to #19552	2016-12-19 19:39:56 +01:00
javanna	38914f17ed	[TEST] improve ElasticsearchAssertions#assertEquivalent for ToXContent Rename the method to assertToXContentEquivalent to highlight that it's tailored to ToXContent comparisons. Rather than parsing into a map and replacing byte[] in both those maps, add custom equality assertions that recursively walk maps and lists and call Arrays.equals whenever a byte[] is encountered.	2016-12-19 19:32:50 +01:00
Luca Cavanna	3421e54a42	Add fromXContent method to GetResponse (#22082 ) Moved field values `toXContent` logic to `GetField` (from `GetResult`), which outputs its own fields, and can also parse them now. Also added `fromXContent` to `GetResult` and `GetResponse`. The start object and end object for `GetResponse` output have been moved to `GetResult#toXContent`, from the corresponding rest action. This makes it possible to have `toXContent` and `fromXContent` completely symmetric, as parsing requires looping till an end object is found which is weird when the corresponding `toXContent` doesn't print that out. This also introduces the foundation for testing retrieval of _source and stored field values.	2016-12-19 17:21:26 +01:00
Nik Everett	5bec4f8024	Unescape \\r in stash dump Oh windows..... Relates to #22195	2016-12-19 10:57:26 -05:00
Yannick Welsch	63af03a104	Atomic mapping updates across types (#22220 ) This commit makes mapping updates atomic when multiple types in an index are updated. Mappings for an index are now applied in a single atomic operation, which also allows to optimize some of the cross-type updates and checks.	2016-12-19 14:39:50 +01:00
Boaz Leskes	b857b316b6	Add BWC layer to seq no infra and enable BWC tests (#22185 ) Sequence BWC logic consists of two elements: 1) Wire level BWC using stream versions. 2) A change to the global checkpoint maintenance semantics. For the sequence number infra to work with mixed version clusters, we have to consider the situation where the primary is on an old node and replicas are on new ones (i.e., the replicas will receive operations without seq#) and also the reverse (i.e., the primary sends operations to a replica but the replica can't process the seq# and respond with local checkpoint). A new primary with an old replica is rare because we do not allow a replica to recover from a new primary. However, it can occur if the old primary failed and a new replica was promoted or during primary relocation where the source primary is treated as a replica until the master starts the target. 1) Old Primary & New Replica - this case is easy as it is taken care of by the wire level BWC. All incoming requests will have their seq# set to `UNASSIGNED_SEQ_NO`, which doesn't confuse the local checkpoint logic (keeping it at `NO_OPS_PERFORMED`) 2) New Primary & Old replica - this one is trickier as the global checkpoint service currently takes all in sync replicas into consideration for the global checkpoint calculation. In order to deal with old replicas, we change the semantics to say all new node in sync replicas. That means the replicas on old nodes don't count for the global checkpointing. In this state the seq# infra is not fully operational (you can't search on it, because copies may miss it) but it is maintained on shards that can support it. The old replicas will have to go through a file based recovery at some point and will get the seq# information at that point. There is still an edge case where a new primary fails and an old replica takes over. I'll discuss this one with @ywelsch as I prefer to avoid it completely. 
This PR also re-enables the BWC tests which were disabled. As such it had to fix any BWC issue that had crept in. Most notably an issue with the removal of the `timestamp` field in #21670. The commit also includes a fix for the default value of the seq number field in replicated write requests (it was 0 but should be -2), which surfaced some other minor bugs that are fixed as well. Last - I added some debugging tools like more sane node names and forcing replication request to implement a `toString`	2016-12-19 13:08:24 +01:00
Daniel Mitterdorfer	3ce7b119d2	Enable strict duplicate checks for all XContent types (#22225 ) With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default for all XContent types (not only JSON). We have also changed the name of the system property to disable this feature from `es.json.strict_duplicate_detection` to the now more appropriate name `es.xcontent.strict_duplicate_detection`. Relates elastic/elasticsearch#19614 Relates elastic/elasticsearch#22073	2016-12-19 09:29:47 +01:00
Simon Willnauer	ccfeac8dd5	Remove `doHandshake` test-only settings from TcpTransport (#22241 ) In #22094 we introduce a test-only setting to simulate transport impls that don't support handshakes. This commit implements the same logic without a setting.	2016-12-18 09:26:53 +01:00
Jason Tedor	58d73bae74	Tighten sequence numbers recovery This commit addresses issues related to recovery and sequence numbers: - A sequence number can be assigned and a Lucene commit created with a maximum sequence number at least as large as that sequence number, yet the operation corresponding to that sequence number can be missing from both the Lucene commit and the translog. This means that upon recovery the local checkpoint will be stuck at or below this missing sequence number. To address this, we force the local checkpoint to the maximum sequence number in the Lucene commit when opening the engine. Note that there can still be gaps in the history in the translog but we do not address those here. - The global checkpoint is transferred to the target shard at the end of peer recovery. - Additionally, we reenable the relocation integration tests. Lastly, this work uncovered some bugs in the assignment of sequence numbers on replica operations: - setting the sequence number on replica write requests was missing, very likely introduced as a result of resolving merge conflicts - handling operations that arrive out of order on a replica and have a version conflict with a previous operation were never marked as processed Relates #22212	2016-12-17 09:20:46 -05:00
Simon Willnauer	1f3eb068d5	Add infrastructure to manage network connections outside of Transport/TransportService (#22194 ) Some expert users like UnicastZenPing today establish real connections to nodes during their ping phase that can be used by other parts of the system. Yet, this is potentially dangerous and undesirable unless the nodes have been fully verified and should be connected to in the case of a cluster state update or if we join a newly elected master. For use-cases like this, this change adds the infrastructure to manually handle connections that are not publicly available on the node, i.e. should not be managed by `Transport`/`TransportService`	2016-12-17 11:49:57 +01:00
Simon Willnauer	0c0353fc7d	[TEST] Add some testlogging	2016-12-16 14:25:17 +01:00
Masaru Hasegawa	a0185c83a7	Merge pull request #21393 from masaruh/alias_boost Resolve index names in indices_boost	2016-12-16 15:07:51 +09:00
Nik Everett	61597f2c20	Send error_trace by default when testing (#22195 ) Sends the `error_trace` parameter with all requests sent by the yaml test framework, including the doc snippet tests. This can be overridden by setting `error_trace: false`. While this drifts core's handling of the yaml tests from the client's slightly this should only be a problem for tests that rely on the default value, both of which I've fixed by setting the value explicitly. This also escapes `\n` and `\t` in the `Stash dump on failure` so the `stack_trace` is more readable. Also fixes `RestUpdateSettingsAction` to not think of the `error_trace` parameter as a setting.	2016-12-15 13:35:14 -05:00
Boaz Leskes	b6cbcc49ba	ClusterService should expose "applied" cluster states (i.e., remove ClusterStateStatus) (#21817 ) `ClusterService` is responsible for updating the cluster state on every node (as a response to an API call on the master and when non-masters receive a new state from the master). When a new cluster state is processed, it is made visible via the `ClusterService#state` method and is sent to a series of listeners. Those listeners come in two flavours - one is to change the state of the node in response to the new cluster state (call these cluster state appliers), the other is to start a secondary process. Examples for the latter include an indexing operation waiting for a shard to be started or a master node action waiting for a master to be elected. The fact that we expose the state before applying it means that samplers of the cluster state had to worry about two things - working based on a stale CS and working based on a future, i.e., "being applied" CS. The `ClusterStateStatus` was used to allow distinguishing between the two. Working with a stale cluster state is not avoidable. This PR changes things to make sure consumers don't need to worry about future CS, removing the need for the status and simplifying the waiting logic. This change does come with a price as "cluster state appliers" can't sample the cluster state from `ClusterService` whenever they want as the cluster state isn't exposed yet. However, recent clean ups made this situation easier and this PR takes the last steps to remove such sampling. This also helps clarify the "information flow" and helps component separation (and thus potential unit testing). It also adds an assertion that will trigger if the cluster state is sampled by such listeners. Note that there are still many "appliers" that could be made a simpler, unrestricted "listener" but this can be done in smaller bits in the future. 
The commit also makes it clear what the `appliers` and what the `listeners` are by using dedicated interfaces. Also, since I had to change the listener types I went ahead and changed the data structure for temporary/timeout listeners (used for the observer) so addition and removal won't be an O(n) operation.	2016-12-15 17:06:25 +01:00
Simon Willnauer	ef610636b6	Remove TCP handshake BWC from master (#22151 ) Since #22094 has been back-ported to 5.2 we can remove all BWC layers from master since all supported version will handle handshake requests. Relates to #22094	2016-12-15 12:47:01 +01:00
Simon Willnauer	d27a12510b	Handle race-condition when connection is closed before handshake listener was added Today sending a message on a closed channel doesn't throw an exception. The channel might just swallow the exception and informs the internal async exception handler that a channel got disconnected. This change adds a safety check that we fail the handshake if we registered a handler but the channel has been closed already for instance due to a reset by peer.	2016-12-15 12:41:50 +01:00
Simon Willnauer	80d6539e9c	Handle connection close / reset events gracefully during handshake (#22178 ) Low level handshake code doesn't handle situations gracefully if the connection is concurrently closed or reset by peer. This commit adds the relevant code to fail the handshake if the connection is closed.	2016-12-14 23:04:14 +01:00
Boaz Leskes	bf65a69bbf	Enforce min master nodes in test cluster (#22065 ) In order to start clusters with min master nodes set without setting `discovery.initial_state_timeout`, #21846 has changed the way we start nodes. Instead of the previous serial start up, we now always start the nodes in an async fashion (internally). This means that starting a cluster is unsafe without `min_master_nodes` being set. We should therefore make it mandatory.	2016-12-14 20:14:16 +01:00
Daniel Mitterdorfer	7e5058037b	Enable strict duplicate checks for JSON content With this commit we enable the Jackson feature 'STRICT_DUPLICATE_DETECTION' by default. This ensures that JSON keys are always unique. While this has a performance impact, benchmarking has indicated that the typical drop in indexing throughput is around 1 - 2%. As a last resort, we allow users to still disable strict duplicate checks by setting `-Des.json.strict_duplicate_detection=false` which is intentionally undocumented. Closes #19614	2016-12-14 09:35:53 +01:00
Nik Everett	49bdd29f91	Consolidate more parser creation into ESTestCase This will make it easier to add the forthcoming required argument, `NamedXContentRegistry`.	2016-12-13 20:28:41 -05:00
Jason Tedor	510ad7b9c7	Add shutdown hook for closing CLI commands This commit enables CLI commands to be closeable and installs a runtime shutdown hook to ensure that if the JVM shuts down (as opposed to aborting) the close method is called. It is not enough to wrap uses of commands in main methods in try-with-resources blocks as these will not run if, say, the virtual machine is terminated in response to SIGINT, or system shutdown event. Relates #22126	2016-12-13 19:10:11 -05:00
Nik Everett	872984d21a	Continue consolidating `XContentParser` construction in tests (#22145 ) Consolidate more parser creation in tests Moves more parser creation in tests to the `createParser` methods in `ESTestCase`.	2016-12-13 17:22:39 -05:00
Simon Willnauer	7a9b667e98	Introduce a low level protocol handshake (#22094 ) Today we rely on the version that the API user passes in together with the DiscoveryNode. This commit introduces a low level handshake where nodes exchange their version to be used with the transport protocol that is executed every time a connection to a node is established. This, on the one hand allows to change the wire protocol based on the version we are talking to even without a full cluster restart. Today we would need to carry on a BWC layer across major versions but with a handshake we can rely on the fact that the latest version of the previous minor executes a handshake and uses the latest protocol version across all communication with the N+1 version nodes. This change is yet fully backwards compatible, a followup PR will remove the BWC in 6.0 once this has been back-ported to the 5.x branch	2016-12-13 21:06:23 +01:00
Nik Everett	ce86405394	Start to centralize creation of XContentParser in tests (#22096 ) Starts to centralize creation of the `XContentParser` in `protected final` methods on `ESTestCase`. The idea is to enable adding `NamedXContentRegistry` relatively easily by giving tests a single place they can override to define the `NamedXContentRegistry`. Since `NamedXContentRegistry` doesn't exist yet neither does the override point. This doesn't attempt to migrate all the tests to calling the new methods to build the parsers. I wanted to make this so we could review the concept and then I'll merge a followup to migrate the tests.	2016-12-13 11:22:15 -05:00
Simon Willnauer	b667ff46c4	Allow plugins to install bootstrap checks (#22110 ) Plugins also have the need to provide better OOTB experience by configuring defaults unless the plugin is used in _production_ mode. This change exposes the bootstrap check infrastructure as part of the plugin API to allow plugins to specify / install their own bootstrap checks if necessary.	2016-12-12 17:35:00 +01:00
Luca Cavanna	6d987a9b69	Remove support for empty queries (#22092 ) Our query DSL supports empty queries (`{}`), which have a different meaning depending on the query that holds it, either ignored, match_all or match_none. We deprecated the support for empty queries in 5.0, where we log a deprecation warning wherever they are used. The way we supported it once we moved query parsing to the coordinating node was having an Optional<QueryBuilder> return type in all of our parse methods (called fromXContent). See #17624. The central place for this was QueryParseContext#parseInnerQueryBuilder. We can now remove all the optional return types and simply throw an exception whenever an empty query is found.	2016-12-12 12:37:12 +01:00
Masaru Hasegawa	3df2a086d4	Resolve index names in indices_boost This change allows specifying alias/wildcard expression in indices_boost. And added another format for specifying indices_boost. It accepts array of index name and boost pair. If an index is included in multiple aliases/wildcard expressions, the first match will be used. With new format, old format is marked as deprecated. Closes #4756	2016-12-11 21:41:49 +09:00
Simon Willnauer	01d67e09b9	Detach handshake from connect to node (#22037 ) Today we connect and publish the nodes connection before we execute a handshake with the node we connect to. In the case of connecting to a node that won't pass the handshake this connection is already `published` and other code paths can use it. This commit detaches the connection and the publish of the connection such that `TransportService` can do a handshake before actually connect and publish the connection.	2016-12-10 10:03:26 +01:00
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Nik Everett	e9bb8d8b38	Don't allow yaml tests with `warnings` that don't skip `warnings` (#21989 ) If you write a yaml test with a `warnings` section in a `do` block that doesn't also have a corresponding `skip` section for `warnings` then client test runners that don't support `warnings` will fail. This causes the elasticsearch build to fail so we catch these errors earlier. Related to #21811	2016-12-08 13:17:31 -05:00
Ali Beyad	e6e7bab58c	Prepares allocator decision objects for use with the allocation explain API (#21691 ) This commit enhances the allocator decision result objects (namely, AllocateUnassignedDecision, MoveDecision, and RebalanceDecision) to enable them to be used directly by the cluster allocation explain API. In particular, this commit does the following: - Adds serialization and toXContent methods to the response objects, which will form the explain API responses. - Moves the calculation of the final explanation to the response object itself, removing it from the responsibility of the allocators. - Adds shard store information to the NodeAllocationResult, so that store information is available for each node, when explaining a shard allocation by the PrimaryShardAllocator or the ReplicaShardAllocator. - Removes RebalanceDecision in favor of using MoveDecision for both moving and rebalancing shards. - Removes NodeRebalanceResult in favor of using NodeAllocationResult. - Changes the notion of weight ranking to be relative to the current node, instead of an absolute weight that doesn't convey any added value to the API user and can be confusing. - Introduces a new enum AllocationDecision to convey the decision type, which enables conveying unassigned, moving, and rebalancing scenarios with more detail as opposed to just Decision.Type and AllocationStatus.	2016-12-07 17:37:51 -05:00
Adrien Grand	c746854e03	Pre-built analysis factories do not implement MultiTermAware correctly. (#21981 ) We had tests for the regular factories, but not for the pre-built ones, that ship by default without requiring users to define them in the analysis settings.	2016-12-07 10:32:25 +01:00
Boaz Leskes	4519bdfeb0	InternalTestCluster shouldn't auto heal an active disruption when a new one is set Instead people should explicitly clear the existing one so it's clear what's going on.	2016-12-06 19:58:11 +01:00
Boaz Leskes	a7050b2d56	Remove `InternalTestCluster.startNode(s)Async` (#21846 ) Since the removal of local discovery in https://github.com/elastic/elasticsearch/pull/20960 we rely on minimum master nodes to be set in our test cluster. The setting is automatically managed by the cluster (by default) but current management doesn't work with concurrent single node async starting. On the other hand, with `MockZenPing` and the `discovery.initial_state_timeout` set to `0s` node starting and joining is very fast, making async starting an unneeded complexity. Tests that still need async starting could, in theory, still do so themselves via background threads. Note that this change also removes the usage of `INITIAL_STATE_TIMEOUT_SETTINGS` as the starting of nodes is done concurrently (but building them is sequential)	2016-12-06 12:06:15 +01:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and make the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Nik Everett	0c724b1878	Keep context during reindex's retries (#21941 ) * Keep context during reindex's retries This fixes reindex and friend's retries to keep the context. * Docs	2016-12-02 13:48:51 -05:00
Tanguy Leroux	fe95aef6a9	[TEST] Remove CompositeTestCluster and ExternalNode (#21933 ) They are not used anymore. Related #21915	2016-12-02 13:25:40 +01:00
Simon Willnauer	20177f6eee	[TEST] Add back ExternalTestCluster - downstream tests still use it	2016-12-02 10:54:27 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
Simon Willnauer	6522538033	Add validation for supported index version on node join, restore, upgrade & open index (#21830 ) Today we can easily join a cluster that holds an index we don't support since we currently allow rolling upgrades from 5.x to 6.x. Along the same lines we don't check if we can support an index based on the nodes in the cluster when we open, restore or metadata-upgrade an index. This commit adds additional safety that fails cluster state validation, open, restore and/or upgrade if there is an open index with an incompatible index version created in the cluster. Relates to #21670	2016-12-01 15:40:35 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections. This commit allows specifying a connection timeout per node such that, depending on the context, connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	087a85a4e7	always auto manage min master node in testTwoNodeCluster	2016-12-01 12:57:44 +01:00
Boaz Leskes	9097abee04	Add before and after logging for unit tests Currently we have these logs for integration tests only. This adds the following log at the start: ``` logger.info("[{}]: before test", getTestName()); ``` and this is logged at the end, but before any clean up done in sub classes ``` logger.info("[{}]: after test", getTestName()); ```	2016-12-01 12:56:37 +01:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	34e682d3bc	Prevent testing on double values whose toString may use the scientific notation. This might break query parsers because the standard analyzer splits on punctuation.	2016-11-30 16:48:46 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Boaz Leskes	be4074e13d	improve debug logging when node waits for initial cluster state And enabled debug logging in InternalTestClusterTests so we can see it.	2016-11-29 20:38:19 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concept is confusing and not very flexible, nor is it really testable on a unit test level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows connecting to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Luca Cavanna	360b74eda8	[TEST] Don't reinitialize YamlTestClient and RestClient before each single test (#21807 ) In the past we ran yaml tests against an internal cluster, which would get restarted after each test failure, hence the client objects needed to eventually be refreshed before each test. That is why we had the initClient method to re-initialize the YamlTestClient in the execution context. We ended up though re-initializing the client unconditionally, which is not needed. Also, ESRestTestCase recreates the RestClient against the external cluster before each test, which is not needed given that nothing changes in the external cluster. This commit removes the initClient method from the yaml tests execution context. The YamlTestClient can be eagerly created before the first yaml test runs and then re-used in subsequent tests. Also api calls to check for nodes versions etc. are moved out of YamlTestClient to ESClientYamlSuiteTestCase. Also the RestClient is now initialized in ESRestTestCase before the first test runs, and kept around afterwards as a static member. Basically each subclass of EsRestTestCase will have its own RestClient instance, but the client will be shared across the different tests within the same class. The yaml test suite is just a special suite, composed of 600+ tests that are loaded from files, which will share the same client instance. This change should speed tests up as well, as we don't recreate the RestClient before each single test, and we don't call _cat/nodes either before each single test.	2016-11-28 18:43:27 +01:00
Simon Willnauer	b7292a6005	Remove TcpTransport#addressSupported since TransportAddress is now final TransportAddress used to be customizable per transport but this has been removed a while ago. Therefore we can remove all usage of this method as well. Relates to #20695	2016-11-28 16:06:59 +01:00
Yannick Welsch	8390648709	Minor clean-ups in MockBigArrays (#21822 ) Removes an unused static variable and an unused instance variable.	2016-11-28 14:09:26 +01:00
Yannick Welsch	7e198f0e41	Detect nodes being blocked by GC-disrupted node (#21797 ) The disruption type LongGCDisruption simulates GCs on a node by suspending all the threads of that node. If the suspended threads are in a code section with shared JVM locks, however, it can prevent the other nodes from doing their thing. The class LongGCDisruption has a list of class names for which we know that this can occur. Whenever a test using the GC disruption type fails in mysterious ways, it becomes a long guessing game to find the offending class. This commit adds code to LongGCDisruption to automatically detect these situations, fail the test early and report the offending class and all relevant context.	2016-11-28 11:24:25 +01:00
Simon Willnauer	41e9ed13d6	[TEST] Fix AbstractBytesReferenceTestCase#testSlice to not assert on offset	2016-11-24 15:31:36 +01:00
Jason Tedor	8416b16dfd	Improve handling of unreleased versions Today when handling unreleased versions for backwards compatibility support, we scatter version constants across the code base and add some asserts to support removing these constants when the version in question is actually released. This commit improves this situation, enabling us to just add a single unreleased version constant that can be renamed when the version is actually released. This should make maintenance of these versions simpler. Relates #21760	2016-11-23 15:49:05 -05:00
Ryan Ernst	6940b2b8c7	Remove groovy scripting language (#21607 ) * Scripting: Remove groovy scripting language Groovy was deprecated in 5.0. This change removes it, along with the legacy default language infrastructure in scripting.	2016-11-22 19:24:12 -08:00
Nik Everett	1791623700	Document `error_trace` The `error_trace` parameter turns on the `stack_trace` field in errors which returns stack traces. Removes documentation for `camelCase` because it hasn't worked in a while.... Documents the internal parameters used to render stack traces as internal only. Closes #21708	2016-11-22 19:16:07 -05:00
Simon Willnauer	a9a2753f0b	Add a HostFailureListener to notify client code if a node got disconnected (#21709 ) Today there is no way to get notified if a node is disconnected. Client code must poll the TransportClient constantly to detect that a node is not connected anymore in order to react and add new nodes or notify alerting etc. For instance, if a hostname gets resolved to an IP but that host is disconnected, clients want to reconnect by resolving the hostname again, which is a common situation in cloud environments. Closes #21424	2016-11-22 20:46:28 +01:00
Jason Tedor	9dc65037bc	Lazy resolve unicast hosts Today we eagerly resolve unicast hosts. This means that if DNS changes, we will never find the host at the new address. Moreover, a single host failing to resolve causes startup to abort. This commit introduces lazy resolution of unicast hosts. If a DNS entry changes, there is an opportunity for the host to be discovered. Note that under the Java security manager, there is a default positive cache of infinity for resolved hosts; this means that if a user does want to operate in an environment where DNS can change, they must adjust networkaddress.cache.ttl in their security policy. And if a host fails to resolve, we warn log the hostname but continue pinging other configured hosts. When doing DNS resolutions for unicast hostnames, we wait until the DNS lookups timeout. This appears to be forty-five seconds on modern JVMs, and it is not configurable. If we do these serially, the cluster can be blocked during ping for a lengthy period of time. This commit introduces doing the DNS lookups in parallel, and adds a user-configurable timeout for these lookups. Relates #21630	2016-11-22 14:17:04 -05:00
Areek Zillur	0ccf8a742d	Add support for merging custom meta data in tribe node (#21552 ) * Add support for merging custom meta data in tribe node Currently, when any underlying cluster has custom metadata (via plugin), tribe node does not store custom meta data in its cluster state. This is because the tribe node has no idea how to select the appropriate custom metadata from one or many custom metadata (corresponding to the number of underlying clusters). This change adds an interface that custom metadata implementations can extend to add support for merging multiple custom metadata of the same type for storing in the tribe state. Relates to #20544 Supersedes #20791 * Simplify updating tribe state * Add tests for merging multiple custom metadata types in tribe node * cleanup merging custom md logic in tribe service	2016-11-21 12:03:01 -05:00
Tanguy Leroux	e7b9e65fc3	Add checkstyle rule to forbid empty javadoc comments (#20881 ) This commit adds a RegexpMultiline check to checkstyle that yells when an empty Javadoc comment is found in Java files. Related #20871	2016-11-21 12:36:44 +01:00
Adrien Grand	6581b77198	Remove store throttling. (#21573 ) Store throttling has been disabled by default since Lucene added automatic throttling of merge operations based on the indexing rate.	2016-11-17 09:33:32 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Adrien Grand	00de8e07fc	Do not cache term queries. (#21566 ) There have been reports that the query cache did not manage to speed up search requests when the query includes a large number of different sub queries since a single request may manage to exhaust the whole history (256 queries) while the query cache only starts caching queries once they appear multiple times in the history (#16031). On the other hand, increasing the size of the query cache is a bit controversial (#20116) so this pull request proposes a different approach that consists of never caching term queries, and not adding them to the history of queries either. The reasoning is that these queries should be fast anyway, regardless of caching, so taking them out of the equation should not cause any slow down. On the other hand, the fact that they are not added to the cache history anymore means that other queries have greater chances of being cached.	2016-11-16 10:02:24 +01:00
Boaz Leskes	d99d02ecc3	InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted This is to deal with potential delays in processing the fact that the node was restarted.	2016-11-15 17:58:08 +00:00
Boaz Leskes	9171407906	remove an unneeded assert busy	2016-11-15 17:36:06 +00:00
Boaz Leskes	2c0338fa87	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 17:09:08 +00:00
Boaz Leskes	d6c2b4f7c5	Adapt InternalTestCluster to auto adjust `minimum_master_nodes` (#21458 ) #20960 removed `LocalDiscovery` and we now use `ZenDiscovery` in all our tests. To keep cluster forming fast, we are using a `MockZenPing` implementation which uses static maps to return instant results making master election fast. Currently, we don't set `minimum_master_nodes` causing the occasional split brain when starting multiple nodes concurrently and their pinging is so fast that it misses the fact that one of the nodes has elected itself master. To solve this, `InternalTestCluster` is modified to behave like a true cluster and manage and set `minimum_master_nodes` correctly with every change to the number of nodes. Tests that want to manage the settings themselves can opt out using a new `autoMinMasterNodes` parameter to the `ClusterScope` annotation. Having `min_master_nodes` set means the started node may need to wait for other nodes to be started as well. To combat this, we set `discovery.initial_state_timeout` to `0` and wait for the cluster to form once all nodes have been started. Also, because a node may wait and ping while other nodes are started, `MockZenPing` is adapted to wait rather than busy-ping.	2016-11-15 13:42:26 +00:00
Boaz Leskes	c9f49039d3	Merge remote-tracking branch 'upstream/master' into feature/seq_no	2016-11-15 10:14:47 +00:00
Ryan Ernst	c7bd4f3454	Tests: Add TestZenDiscovery and replace uses of MockZenPing with it (#21488 ) This change adds a test discovery (which internally uses the existing mock zenping by default). Having the mock the test framework selects be a discovery greatly simplifies discovery setup (no more weird callback to a Node method).	2016-11-14 21:46:10 -08:00
Ryan Ernst	d14c470b89	Remove generics from ActionRequest closes #21368	2016-11-14 15:32:01 -08:00
Jason Tedor	491a945ac8	Add socket permissions for tribe nodes Today when a node starts, we create dynamic socket permissions based on the configured HTTP ports and transport ports. If no ports are configured, we use the default port ranges. When a tribe node starts, a tribe node creates an internal node client for connecting to each remote cluster. If neither an explicit HTTP port nor transport ports were specified, the default port ranges are large enough for the tribe node and its internal node clients. If an explicit HTTP port or transport port was specified for the tribe node, then socket permissions for those ports will be created, but not for the internal node clients. Whether the internal node clients have explicit ports specified, or attempt to bind within the default range, socket permissions for these will not have been created and the internal node clients will hit a permissions issue when attempting to bind. This commit addresses this issue by also accounting for tribe nodes when creating the dynamic socket permissions. Additionally, we add our first real integration test for tribe nodes.	2016-11-14 11:58:44 -05:00
Simon Willnauer	bdc942fa72	Enable 5.x to 6.x BWC tests This commit enables real BWC testing against a 5.1 snapshot. All REST tests plus rolling upgrade test now run against a mixed version cross major version cluster.	2016-11-14 14:26:49 +01:00
Jason Tedor	c7a1b3eb50	Merge branch 'master' into feature/seq_no * master: Hack around cluster service and logging race Do not prematurely shutdown Log4j Support decimal constants with trailing [dD] in painless (#21412) In painless suggest a long constant if int won't do (#21415) Account for different paths for sysctl utilities [TEST] testRebalancePossible() may not have an assigned node id Tests: Disable merge in SearchCancellationTests Tests: clean search scroll at the end of SearchCancellationIT	2016-11-13 20:01:44 -05:00
Jason Tedor	d273419d00	Do not prematurely shutdown Log4j When a node closes, we shutdown logging as the last statement. This statement must be last lest any subsequent attempts to log will blow up by running into security permissions. Yet, in the case of a tribe node this isn't enough. The first internal tribe node to close will shutdown logging, and subsequent node closes will blow up with the aforementioned problem. This commit migrates the Log4j shutdown to occur as part of the shutdown hook that closes the node, after all nodes have closed. Consequently, we can remove a hack in the test infrastructure to prevent Log4j shutdowns when internal test nodes close and instead just register a single shutdown hook that runs when the test JVM exits. Relates #21519	2016-11-13 17:27:30 -05:00
Jason Tedor	1e7c424479	Merge branch 'master' into feature/seq_no * master: ShardActiveResponseHandler shouldn't hold to an entire cluster state Ensures cleanup of temporary index-* generational blobs during snapshotting (#21469) Remove (again) test uses of onModule (#21414) [TEST] Add assertBusy when checking for pending operation counter after tests Revert "Add trace logging when aquiring and releasing operation locks for replication requests" Allows multiple patterns to be specified for index templates (#21009) [TEST] fixes rebalance single shard check as it isn't guaranteed that a rebalance makes sense and the method only tests if rebalance is allowed Document _reindex with random_score	2016-11-11 11:25:27 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Ryan Ernst	48bfb142b9	Remove (again) test uses of onModule (#21414 ) This change was reverted after it caused random test failures. This was due to a copy/paste error in the original PR which caused the mock version of ClusterInfoService to be used whenever the mock ZenPing was used, and the real ClusterInfoService to be used when MockZenPing was not used.	2016-11-10 16:06:14 -08:00
Areek Zillur	7ed195fe93	[TEST] Add assertBusy when checking for pending operation counter after tests Currently, pending operations can complete after tests with disruption scheme completes. This commit waits for the pending operation counter to complete after the tests are run	2016-11-10 18:35:52 -05:00
Alexander Lin	0219a211d3	Allows multiple patterns to be specified for index templates (#21009 ) * Allows for an array of index template patterns to be provided to an index template, and rename the field from 'template' to 'index_pattern'. Closes #20690	2016-11-10 18:00:30 -05:00
javanna	2f32c1173b	Revert "Tests: Remove a couple test uses of onModule (#21414 )" This reverts commit `b326f0bc51`.	2016-11-09 11:32:16 +01:00
Ryan Ernst	b326f0bc51	Tests: Remove a couple test uses of onModule (#21414 ) There were still a couple test use cases and examples that were using onModule. This change cleans those cases up.	2016-11-08 13:50:13 -08:00
Nik Everett	b7531984a9	Ignore IAE when checking for version serialization This allows us to throw IllegalArgumentException from serialization code when the destination node can't support the request.	2016-11-08 11:36:12 -05:00
Yannick Welsch	cd34eed03e	Make ensureGreen and ensureYellow wait for cluster size consistency (#21344 ) We currently often use ensureGreen or ensureYellow to check whether the cluster is in a good state again after shutting down a node. With the change in #21092, however, it can happen that if the node that is stopped is the master node, another node will become master and publish a cluster state where it is master but where the node that was stopped hasn't been removed yet from the cluster state. It will only publish a second state thereafter where the old master is removed. If the ensureGreen/ensureYellow is timed just right, it will get to execute before the second cluster state update removing the old master and the condition ensureGreen / ensureYellow might not hold at that point anymore.	2016-11-08 11:07:54 +01:00
Ryan Ernst	7a2c984bcc	Test: Remove multi process support from rest test runner (#21391 ) At one point in the past when moving out the rest tests from core to their own subproject, we had multiple test classes which evenly split up the tests to run. However, we simplified this and went back to a single test runner to have better reproducibility in tests. This change removes the remnants of that multiplexing support.	2016-11-07 15:07:34 -08:00
Nik Everett	a13a050271	Add automatic parallelization support to reindex and friends (#20767 ) Adds support for `?slices=N` to reindex which automatically parallelizes the process using parallel scrolls on `_uid`. Performance testing sees a 3x performance improvement for simple docs on decent hardware, maybe 30% performance improvement for more complex docs. Still compelling, especially because clusters should be able to get closer to the 3x than the 30% number. Closes #20624	2016-11-04 20:59:15 -04:00
Jason Tedor	f16c308efd	Assert status logger does not warn on Log4j usage Today if you start Elasticsearch with the status logger configured to the warn level, or use a transport client with the default status logger level, you will see warn messages about deprecation loggers being created with different message factories and that formatting might be broken. This happens because the deprecation logger is constructed using the message factory from its parent, an artifact leftover from the first Log4j 2 implementation that used a custom message factory. When that custom message factory was removed, this constructor invocation should have been changed to not explicitly use the message factory from the parent. This commit fixes this invocation. However, we also had some status checking to all tests to ensure that there are no warn status log messages that might indicate a configuration problem with Log4j 2. These assertions blow up badly without the fix for the deprecation logger construction, and also caught a misconfiguration in one of the logging tests. Relates #21339	2016-11-04 14:19:59 -04:00
Nik Everett	8943421494	Only log rest connection setup once per suite (#21280 ) This is a bit funky to do with junit because we need per test state but we only want to log it per suite. So we use a static flag that we test per test and reset before every suite.	2016-11-03 21:47:11 -04:00
Yannick Welsch	39f4229594	Add information about in-flight requests when checking IndexShard operation counter (#21308 ) Our test infrastructure checks after running each test that there are no more in-flight requests on the shard level. Whenever the check fails, we only know that there were in-flight requests but don't know what requests were causing this issue. This commit adds the replication tasks that are still active at that moment to the assertion error.	2016-11-03 18:36:07 +01:00
Ryan Ernst	dc6ed7b8d4	Remove pluggability of ZenPing (#21049 ) Plugins: Remove pluggability of ZenPing ZenPing is the part of zen discovery which knows how to ping nodes. There is only one alternative implementation, which is just for testing. This change removes the ability to add custom zen pings, and instead hooks in the MockZenPing for tests through an overridden method in MockNode. This also folds in the ZenPingService (which was really just a single method) into ZenDiscovery, and removes the idea of having multiple ZenPing instances. Finally, this was the last usage of the ExtensionPoint classes, so that is also removed here.	2016-11-03 08:20:20 -07:00
Boaz Leskes	be1772b70d	pending states assertion should dump states This was removed in a cleanup assuming that Hamcrest will dump the array content. Sadly it only dumps the size.	2016-11-03 09:02:29 +01:00
Christoph Büscher	b3370de715	Tests: Add warning header checks to QueryBuilder tests and QueryParseContextTests This adds checks for expected warning headers to the query builder test infrastructure. Tests that are adding deprecation warnings to the response headers need to check those, otherwise the abstract base class for the test class will complain at teardown.	2016-11-02 15:45:33 +01:00
Yannick Welsch	6930a4846c	[TEST] Check static test state after suite scoped cluster is shut down (#21256 ) Checks on static test state are run by an @After method in ESTestCase. Suite-scoped tests in ESIntegTestCase only shut down in an @AfterClass method, which executes after the @After method in ESTestCase. The suite-scoped cluster can thus still execute actions that will violate the checks in @After without those being caught. A subsequent test executing within the same JVM will fail these checks however when @After gets called for that test. This commit adds an explicit call to check the static test state after the suite-scoped cluster has been shut down.	2016-11-02 15:00:16 +01:00
Boaz Leskes	0daf483587	Change ClusterState and PendingClusterTasksResponse's toString() to their prettyPrint format (#21245 ) The current XContent output is much harder to read than the prettyPrint format. This commit folds prettyPrint into toString and removes it.	2016-11-02 13:43:39 +01:00
Simon Willnauer	cf1457ed22	Allow skip test by version OR feature (#21240 ) Today these two are considered mutual exclusive but they are not in practice. For instance a mixed version cluster might not return a given warning depending on which node we talk to but on the other hand some runners might not even support warnings at all so the test might be skipped either by version or by feature.	2016-11-02 12:24:20 +01:00
Adrien Grand	aa6cd93e0f	Require arguments for QueryShardContext creation. (#21196 ) The `IndexService#newQueryShardContext()` method creates a QueryShardContext on shard `0`, with a `null` reader and that uses `System.currentTimeMillis()` to resolve `now`. This may hide bugs, since the shard id is sometimes used for query parsing (it is used to salt random score generation in `function_score`), passing a `null` reader disables query rewriting and for some use-cases, it is simply not ok to rely on the current timestamp (eg. percolation). So this pull request removes this method and instead requires that all call sites provide these parameters explicitly.	2016-11-02 09:48:49 +01:00
Simon Willnauer	2ba4dadea0	[TEST] fix extrasFS file filtering in OldIndexUtils	2016-11-02 09:38:51 +01:00
Simon Willnauer	4db1ac931f	Fix InternalEngineTests#testUpgradeOldIndex for 5.0.0 BWC indices Relates to #21147	2016-11-02 09:38:44 +01:00
Jason Tedor	7751049c14	Add version for 5.0.0 This commit adds the version constant for 5.0.0. Relates #21244	2016-11-01 14:09:00 -04:00
Boaz Leskes	523f7ea71e	Fix a race condition in MockTransportService#addUnresponsiveRule where a request can be delayed even if the rule was removed. Relates to #21129 Also properly reset DiscoveryWithServiceDisruptionsIT#disableBeforeIndexDeletion	2016-11-01 14:08:18 +01:00
Boaz Leskes	ef192ff2cf	ESIntegTestCase.jav: use ClusterState.prettyPrint for pending ClusterState assertions	2016-11-01 12:54:20 +01:00
Yannick Welsch	d7d5909e69	Disconnect from newly added nodes if cluster state publishing fails (#21197 ) Before publishing a cluster state the master connects to the nodes that are added in the cluster state. When publishing fails, however, it does not disconnect from these nodes, leaving NodeConnectionsService out of sync with the currently applied cluster state.	2016-10-31 15:09:43 +01:00
Simon Willnauer	9598616dfe	Fallback to '/' info call to fetch cluster version The `_cat/nodes` API might not be available in all clusters for instance if they have authorization enabled. This change falls back to the previously used method of using the '/' endpoint to fetch the nodes version, this is best effort and will emit a warning.	2016-10-28 16:22:53 +02:00
Adrien Grand	b3cc54cf0d	Upgrade to lucene-6.3.0-snapshot-ed102d6 (#21150 ) Lucene 6.3 is expected to be released in the next weeks so it'd be good to give it some integration testing. I had to upgrade randomized-testing too so that both Lucene and Elasticsearch are on the same version.	2016-10-28 14:47:15 +02:00
Simon Willnauer	43dbf9c7b6	Use all available hosts in REST tests and allow for real master election (#21161 ) Today we only use a single node to send requests to when we run REST tests. In some cases we have more than one node (ie. in the BWC case) where we should send requests to all nodes in a round-robin fashion. This change passes all available node endpoints to the rest test. Additionally, this change adds the setting of `discovery.zen.minimum_master_nodes` to the cluster formation forcing the nodes to wait for all other nodes until the cluster is formed. This allows for a more realistic master election and allows all master eligible nodes to become master while before always the first node in the cluster became the master. This also adds logging to each test run to log the master node's version and the minimum node version in the cluster to help debugging BWC test failures.	2016-10-28 12:18:47 +02:00
Simon Willnauer	97cc426a89	Fix bwc cluster formation in order to run BWC tests against a mixed version cluster (#21145 ) This fixes our cluster formation task to run REST tests against a mixed version cluster. Yet, due to some limitations in our test framework `indices.rollover` tests are currently disabled for the BWC case since they select the current master as the merge node which happens to be a BWC node and we can't relocate all shards to it since the primaries are on a higher version node. This will be fixed in a followup. Closes #21142 Note: This has been cherry-picked from 5.0 and fixes several rest tests as well as a BWC break in `OsStats.java`	2016-10-27 17:03:53 +02:00
Yannick Welsch	f3e578f942	Stop delaying existing requests after network delay rule is cleared (#21129 ) The network disruption type "network delay" continues delaying existing requests even after the disruption has been cleared. This commit ensures that the requests get to execute right after the delay rule is cleared.	2016-10-27 13:48:17 +02:00
Jason Tedor	9c3e4d6e22	Add correct Content-Length on HEAD requests This commit fixes responses to HEAD requests so that the value of the Content-Length is correct per the HTTP spec. Namely, the value of this header should be equal to the Content-Length if the request were not a HEAD request. This commit also fixes a memory leak on HEAD requests to the main action that arose from the bytes on a builder not being released due to them being dropped on the floor to ensure that the response to the main action did not have a body. Relates #21123	2016-10-25 23:08:19 -04:00
Igor Motov	17ad88d539	Makes search action cancelable by task management API Long running searches now can be cancelled using standard task cancellation mechanism.	2016-10-25 12:27:34 -10:00
Christoph Büscher	f6f129b21f	Consolidate code for equals/hashCode testing in central utility class Currently tests that check that equals() and hashCode() are working as expected for classes implementing them are quite similar. This change moves common assertions in this method to a common utility class. In addition, another common utility function in most of these test classes that creates copies of input object by running them through a StreamOutput and reading them back in, is moved to ESTestCase so it can be shared across all these classes. Closes #20629	2016-10-24 15:50:40 +02:00
Simon Willnauer	0a410d3916	Pass executor name to request interceptor to support async intercept calls (#21089 ) Today the request interceptor can't support async calls since the response of the async call would execute on a different thread ie. a client or listener thread. This means in-turn that the intercepted handler is not executed with the thread it was supposed to run and therefore can, if it's executing blocking operations, potentially deadlock an entire server.	2016-10-24 13:57:07 +02:00
Ryan Ernst	53cff0f00f	Move all zen discovery classes into o.e.discovery.zen (#21032 ) * Move all zen discovery classes into o.e.discovery.zen This collapses sub packages of zen into zen. These all had just a couple classes each, and there is really no reason to have the subpackages. * fix checkstyle	2016-10-20 00:44:48 -07:00
javanna	c92b550df2	[TEST] Remove create special case in yaml test client Now that the create api has its own spec, we can remove the special case in the yaml test client for it Relates to #20924	2016-10-20 08:48:15 +02:00
Boaz Leskes	c3987156ab	Remove local discovery in favor of a simpler `MockZenPings` (#20960 ) `LocalDiscovery` is a discovery implementation that uses static in memory maps to keep track of current live nodes. This is used extensively in our tests in order to speed up cluster formation (i.e., shortcut the 3 second ping period used by `ZenDiscovery` by default). This is sad as that means that most of the tests run using different discovery semantics than what is used in production. Instead of replacing the entire discovery logic, we can use a similar approach to only shortcut the pinging components.	2016-10-18 21:12:15 +02:00
Boaz Leskes	eaa105951f	Simplify GlobalCheckpointService and properly hook it for cluster state updates (#20720 ) During a recent merge from master, we lost the bridge from IndicesClusterStateService to the GlobalCheckpointService of primary shards, notifying them of changes to the current set of active/initializing shards. This commit adds the bridge back (with unit tests). It also simplifies the GlobalCheckpoint tracking to use a simpler model (which makes use of the fact that the global check point sync is done periodically). The old integration CheckpointIT test is moved to IndexLevelReplicationTests. I also added similar assertions to RelocationsIT, which surfaced a bug in the primary relocation logic and how it plays with global checkpoint updates. The test is currently await-fixed and will be fixed in a follow up issue.	2016-10-17 16:33:03 +02:00
Tanguy Leroux	1755cc08f3	REST API parser should fail on duplicate params/paths/methods/parts (#20940 ) This commit changes the current REST API parser to make it fail and throw an exception when a REST specification file contains a duplicated parameter, path, method, or path part.	2016-10-17 09:19:07 +02:00
Simon Willnauer	5137f44bd6	[TEST] return empty array if AbstractQueryTestCase#currentTypes is null This is important to allow any test to use RandomQueryBuilder#createQuery() since some of the query builders that are used in this test test the length of the types array and otherwise will throw NPE if the test is not a subclass of AbstractQueryTestCase.	2016-10-15 14:46:54 +02:00
Boaz Leskes	bc8ad8de5a	MockBigArrays should tell you who originally released them	2016-10-12 13:03:40 +02:00
Tanguy Leroux	44ac5d057a	Remove empty javadoc (#20871 ) This commit removes as many empty javadoc comments as my regexp has found	2016-10-12 10:27:09 +02:00
Adrien Grand	1914df7b5f	Do not cache script queries. (#20799 ) The cache relies on the equals() method so we just need to make sure script queries can never be equals, even to themselves in the case that a weight is used to produce a Scorer on the same segment multiple times. Closes #20763	2016-10-11 09:17:21 +02:00
Simon Willnauer	4fd1276542	Prevent AbstractArrays from release bytes more than once (#20819 ) Today we throw an assertion error if we release an AbstractArray more than once. Yet, it's recommended to implement close methods such that they can be invoked more than once. Guaranteed single release calls are hard to implement and some situations might not be tested causing for instance `CircuitBreaker` to operate on corrupted memory stats.	2016-10-10 17:30:37 +02:00
javanna	e154e6a758	[TEST] reformatted comment in query tests	2016-10-10 10:53:17 +02:00
Nik Everett	cf4038b668	DeGuice some of IndicesModule UpdateHelper, MetaDataIndexUpgradeService, and some recovery stuff. Move ClusterSettings to nullable ctor parameter of TransportService so it isn't forgotten.	2016-10-07 11:14:38 -04:00
Simon Willnauer	7452028e50	Simplify TransportAddress (#20798 ) since TransportAddress is now final we can simplify its interface a bit and remove methods that are only used in tests or are plain delegates.	2016-10-07 15:56:54 +02:00
Simon Willnauer	194a6b1df0	Remove LocalTransport in favor of MockTcpTransport (#20695 ) This change proposes the removal of all non-tcp transport implementations. The mock transport can be used by default to run tests instead of local transport that has roughly the same performance compared to TCP or at least not noticeably slower. This is a master only change, deprecation notice in 5.x will be committed as a separate change.	2016-10-07 11:27:47 +02:00
Simon Willnauer	9c9afe3f01	Remove SearchContext#current and all its threadlocals (#20778 ) Today SearchContext exposes the current context as a thread local which makes any kind of sane interface design very very hard. This PR removes the thread local entirely and instead passes the relevant context anywhere needed. This simplifies state management dramatically and will allow for a much leaner SearchContext interface down the road.	2016-10-06 19:51:54 +02:00
Colin Goodheart-Smithe	40f8f281e0	Merge branch 'master' into dont_cache_scripts	2016-10-06 09:09:23 +01:00
Colin Goodheart-Smithe	ce6f6d3835	Review comments	2016-10-06 08:55:31 +01:00
Simon Willnauer	134b1f9b4d	Prevent thread suspension when inside SecurityManager (#20770 ) LongGCDisruption suspends and resumes node threads but respects several `unsafe` class name patterns where it's unsafe to suspend. For instance log4j uses a global lock so we can't suspend a thread that is currently calling into log4j. The same is true for the security manager: like log4j, it is a shared resource between the test and the node that is _suspended_. This change adds `java.lang.SecurityManager` to the unsafe patterns. This prevents the test framework from deadlocking if a node's thread is suspended while it's calling into the security manager that uses synchronized maps etc.	2016-10-05 21:40:27 +02:00
Simon Willnauer	e556c289b9	use a private rewrite context to prevent exposing isCachable	2016-10-05 11:41:49 +02:00
Simon Willnauer	7ba22bb75b	fix random score function builder to deal with empty seeds	2016-10-05 10:45:24 +02:00
Simon Willnauer	587bdcef38	add extra safety when accessing scripts or now and requests are cached	2016-10-05 09:41:48 +02:00
Simon Willnauer	94b7873b49	Add a #markAsNotCachable() method to context to mark requests as not cachable	2016-10-04 18:05:00 +02:00
Simon Willnauer	56f35baf47	Add date-math support to `_rollover` (#20709 ) today it's not possible to use date-math efficiently with the `_rollover` API. This change adds support for date-math in the target index as well as support for preserving the math logic: when an existing index was created with a date math expression, all subsequent indices are created with the same expression.	2016-10-03 16:52:33 +02:00
Boaz Leskes	27eab74510	merge from master	2016-09-30 17:19:30 +02:00
Jason Tedor	3a4ffd7b86	Fix failing logging listener tests The logging listener tests started failing after `953a8a959b` when the tests are run with tests.es.logger.level set to any level other than debug. This is because these tests were based around the assumption that the default logging level was info, which was the case before that commit fixed setting the default logging level via that system property. This commit fixes these failing tests by adjusting this assumption to account for the fact that the default logging level could be different.	2016-09-30 08:09:35 +02:00
Boaz Leskes	a16d644c68	allow setting logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 03:04:43 +02:00
Jason Tedor	0808611184	Fix failing tests after merge This commit fixes failing tests in feature/seq_no after merging master in.	2016-09-29 03:04:37 +02:00
Boaz Leskes	953a8a959b	allow setting logging level via a sys config in unit tests Pipe in the `tests.es.logger.level` system property to the log4j config file used in tests. We still default to info. Also adapts the logger name to use the first letter of packages.	2016-09-29 01:33:13 +02:00
Jason Tedor	25fd9e26c4	Merge branch 'master' into feature/seq_no * master: (1199 commits) [DOCS] Remove non-valid link to mapping migration document Revert "Default `include_in_all` for numeric-like types to false" test: add a test with ipv6 address docs: clearify that both ip4 and ip6 addresses are supported Include complex settings in settings requests Add production warning for pre-release builds Clean up confusing error message on unhandled endpoint [TEST] Increase logging level in testDelayShards() change health from string to enum (#20661) Provide error message when plugin id is missing Document that sliced scroll works for reindex Make reindex-from-remote ignore unknown fields Remove NoopGatewayAllocator in favor of a more realistic mock (#20637) Remove Marvel character reference from guide Fix documentation for setting Java I/O temp dir Update client benchmarks to log4j2 Changes the API of GatewayAllocator#applyStartedShards and (#20642) Removes FailedRerouteAllocation and StartedRerouteAllocation IndexRoutingTable.initializeEmpty shouldn't override supplied primary RecoverySource (#20638) Smoke tester: Adjust to latest changes (#20611) ...	2016-09-29 00:22:31 +02:00
Jason Tedor	3c8ff45917	Add production warning for pre-release builds This commit adds a usage warning when Elasticsearch is started with a pre-release build. Relates #20674	2016-09-27 20:13:12 -04:00
Boaz Leskes	ee76c1a5c9	Remove NoopGatewayAllocator in favor of a more realistic mock (#20637 ) Many of our unit tests instantiate an `AllocationService`, which requires having a `GatewayAllocator`. Today almost all of our tests use a class called `NoopGatewayAllocator` which does nothing, effectively leaving all shard assignments to the balanced allocator. This is sad as it means we test a system that behaves differently than our production logic in very basic things. For example, a started primary that is lost will be assigned to a node that didn't use to have it. This PR removes `NoopGatewayAllocator` in favor of a new `TestGatewayAllocator` that inherits the standard `GatewayAllocator` and overrides shard information fetching to return information based on historical assignments the allocator has done. The only exception is `BalanceConfigurationTests` which does test only the balancer and I opted to not have it work around the `GatewayAllocator` being in its way.	2016-09-25 20:15:30 +02:00
Ali Beyad	ac1b13dde7	Changes the API of GatewayAllocator#applyStartedShards and (#20642 ) Changes the API of GatewayAllocator#applyStartedShards and GatewayAllocator#applyFailedShards to take both a RoutingAllocation and a list of shards to apply. This allows better mock allocators to be created as being done in #20637. Closes #20642	2016-09-23 09:31:46 -04:00
Ali Beyad	029fc909b5	Removes FailedRerouteAllocation and StartedRerouteAllocation Removes the FailedRerouteAllocation class and StartedRerouteAllocation class, as they were just wrappers for RerouteAllocation that stored started and failed shards, but these started and failed shards can be passed in directly to the methods that needed them, removing the need for this wrapper class and extra level of indirection. Closes #20626	2016-09-23 09:02:36 -04:00
Simon Willnauer	fe1803c957	Remove AnalysisService and reduce it to a simple name to analyzer mapping (#20627 ) Today we hold on to all possible tokenizers, tokenfilters etc. when we create an index service on a node. This was mainly done to allow the `_analyze` API to directly access all these primitives. We fixed this in #19827 and can now get rid of the AnalysisService entirely and replace it with a simple map-like class. This ensures we don't create a gazillion long living objects that are entirely useless since they are never used in most of the indices. Also those objects might consume a considerable amount of memory since they might load stopwords or synonyms etc. Closes #19828	2016-09-23 08:53:50 +02:00
Simon Willnauer	0151974500	`_flush` should block by default (#20597 ) This commit changes the default behavior of `_flush` to block if other flushes are ongoing. This also removes the use of `FlushNotAllowedException` and instead simply returns immediately by skipping the flush. Users should be aware if they set this option that the flush might or might not flush everything to disk, i.e. no transactional behavior of some sort. Closes #20569	2016-09-21 14:20:24 +02:00
Tanguy Leroux	7645abaad9	Remove duplicate methods in ByteSizeValue (#20560 ) This commit removes `ByteSizeValue`'s methods that are duplicated (ex: `mbFrac()` and `getMbFrac()`) in order to only keep the `getN` form. It also renames `mb()` -> `getMb()`, `kb()` -> `getKB()` in order to be more coherent with the `ByteSizeUnit` method names.	2016-09-20 14:07:23 +02:00
Ali Beyad	50584c4103	Merge pull request #20532 from rjernst/rolling_upgrades This PR introduces backward compatibility index tests to test the rolling upgrade process amongst Elasticsearch instances within the same major version. The test executes in three phases. In the first phase, we form a cluster of 2 ES instances on an old version. In the second phase, we keep one of the nodes from the old cluster, kill the other node, but preserve its data directory and start an instance of the current version of ES using the same data directory as the killed instance. In the third phase, we kill the other old version ES instance from the first phase and launch a new instance, using the same data directory as the killed instance. Therefore, during phase 3, we have fully migrated and have all current versions of ES running. In each phase, we run REST tests that index documents and search them, ensuring at each stage that the documents from the previous phase are still there. Note that because we haven't released a GA yet of 5.0, the tests currently don't start an old version cluster in the first phase. Once GA is released, this will be changed to make the backward compatibility version 5.0, while the current version in the cluster will be 5.x.	2016-09-19 16:14:38 -04:00
Simon Willnauer	ee8d14798f	Unguice Transport and friends (#20526 ) This change removes all guice interaction from Transport, HttpServerTransport, HttpServer and TransportService. All these classes as well as their subclasses or extended version configured via plugins are now created by using plain old bloody java constructors. YAY!	2016-09-19 22:10:47 +02:00
Boaz Leskes	2ee9ab25d9	Remove `RoutingAllocation.Result` (#20538 ) Currently all the reroute-like methods of `AllocationService` return a result object of type `RoutingAllocation.Result`. The result object contains the new `RoutingTable` and `MetaData` plus an indication whether those were changed. The caller is then responsible for updating a cluster state with these. This means that things can easily go wrong and one can take one of these but not the other causing inconsistencies. We already have a utility method on the `ClusterState` builder that does this but no one forces you to do so. Also 99% of the callers do the same thing: i.e., check if the result was changed and if so update the very same cluster state that was passed to `AllocationService`. This PR folds this pattern into `AllocationService` and changes almost all its methods to return a new cluster state (potentially the original one). This saves some 500 lines of code. The one exception here is the reroute API which executes allocation commands and potentially returns an explanation as well (next to the routing table and metadata). That API now returns a `CommandsResult` object which encapsulates a cluster state and the explanation.	2016-09-19 13:54:35 +02:00
Ali Beyad	98230d035a	Adds a preserveIndicesUponCompletion method to ESRestTestCase that can be overridden by subclasses if the test must not delete indices it created after exiting.	2016-09-16 19:21:26 -04:00
Ali Beyad	ce86ed1fdd	Merge remote-tracking branch 'upstream/master' into rolling_upgrades	2016-09-16 10:43:38 -04:00
Simon Willnauer	f5daa165f1	Remove ability to plug-in TransportService (#20505 ) TransportService is such a central part of the core server, replacing its implementation is risky and can cause serious issues. This change removes the ability to plug in TransportService but allows registering a TransportInterceptor that enables plugins to intercept requests on both the sender and the receiver ends. This is a commonly used and overwritten functionality but encapsulates the custom code in a contained manner.	2016-09-16 09:47:53 +02:00
Boaz Leskes	577dcb3237	Add current cluster state version to zen pings and use them in master election (#20384 ) During a networking partition, cluster state updates (like mapping changes or shard assignments) are committed if a majority of the master nodes received the update correctly. This means that the current master has access to enough nodes in the cluster to continue to operate correctly. When the network partition heals, the isolated nodes catch up with the current state and get the changes they couldn't receive before. However, if a second partition happens while the cluster is still recovering from the previous one and the old master is put in the minority side, it may be that a new master is elected which did not yet catch up. If that happens, cluster state updates can be lost. This commit fixed 95% of this rare problem by adding the current cluster state version to `PingResponse` and using it when deciding which master to join (and thus casting the node's vote). Note: this doesn't fully mitigate the problem as a cluster state update which is issued concurrently with a network partition can be lost if the partition prevents the commit message (part of the two phased commit of cluster state updates) from reaching any single node in the majority side and the partition does allow for the master to acknowledge the change. We are working on a more comprehensive fix but that requires considerable work and is targeted at 6.0.	2016-09-15 23:39:11 +02:00
Nik Everett	d0be96df7b	Clean up snapshots after each REST test The only repository we can be sure is safe to clean is `fs` so we clean any snapshots in those repositories after each test. Other repositories like url and azure tend to throw exceptions rather than let us fetch their contents during the REST test. So we clean what we can.... Closes #18159	2016-09-15 14:49:11 -04:00
Boaz Leskes	8469c98e34	Fix LongGCDisruption to be aware of log4j2 (#20348 ) LongGCDisruption simulates a Long GC by suspending all threads belonging to a node. That's fine, unless those threads hold shared locks that can prevent other nodes from running. Concretely the logging infrastructure, which is shared between the nodes, can cause some deadlocks. LongGCDisruption has protection for this, but it needs to be updated to point at log4j2 classes, introduced in #20235. This commit also fixes improper handling of retry logic in LongGCDisruption and adds a protection against deadlocking the test code which activates the disruption (and uses logging too! :)). On top of that we have some new, evil and nasty tests.	2016-09-15 08:50:18 +02:00
Ali Beyad	3f79874042	Prevent the rolling upgrades rest tests from cleaning up indices after finishing if the tests.rest.preserve_indices system property is set	2016-09-14 23:34:19 -04:00
Simon Willnauer	17ddee7011	Remove TransportService#registerRequestHandler leniency (#20469 ) `TransportService#registerRequestHandler` allowed handlers to be registered more than once and issued an annoying warn log message when this happened. This change simply throws an exception to prevent registering the same handler more than once. This commit also removes the ability to remove request handlers. Relates to #20468	2016-09-14 20:32:29 +02:00
Luca Cavanna	14e17f44a1	Replace usage of LuceneTestCase#getBaseTempDirForTestClass (#20484 ) LuceneTestCase#getBaseTempDirForTestClass is deprecated, we should not use it. Closes #15845	2016-09-14 19:35:20 +02:00
Simon Willnauer	89640965d2	Unguice SearchModule (#20456 ) After this change SearchModule doesn't subclass AbstractModule anymore and all wiring happens in `Node.java`. As a side-effect several tests don't need a guice injector anymore.	2016-09-14 10:07:53 +02:00
Jason Tedor	7560101ec7	Complete Elasticsearch logger names This commit modifies the logger names within Elasticsearch to be the fully-qualified class name as opposed to removing the org.elasticsearch prefix and dropping the class name. This change separates the root logger from the Elasticsearch loggers (they were equated from the removal of the org.elasticsearch prefix) and enables log levels to be set at the class level (instead of the package level). Relates #20457	2016-09-13 22:46:54 -04:00
Jason Tedor	fbe27664a6	Fix prefix logging Today we add a prefix when logging within Elasticsearch. This prefix contains the node name, and index and shard-level components if appropriate. Due to some implementation details with Log4j 2, this does not work for integration tests; instead what we see is the node name for the last node to startup. The implementation detail here is that in Log4j 2 there is only one logger for a name, message factory pair, and the key derived from the message factory is the class name of the message factory. So, when the last node starts up and starts setting prefixes on its message factories, it will impact the loggers for the other nodes. Additionally, the prefixes are lost when logging an exception. This is due to another implementation detail in Log4j 2. Namely, since we log exceptions using a parameterized message, Log4j 2 decides that that means that we do not want to use the message factory that we have provided (the prefix message factory) and so logs the exception without the prefix. This commit fixes both of these issues. Relates #20429	2016-09-13 14:46:34 -04:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Nicholas Knize	ef926894f4	Cut over geo_point field and queries to new LatLonPoint type This commit cuts over geo_point fields to use Lucene's new point-based LatLonPoint type for indexes created in 5.0. Indexes created prior to 5.0 continue to use their respective encoding type. Below is a description of the changes made to support the new encoding type: * New indexes use a new LatLonPointFieldMapper which provides a parse method for the new type * The new LatLonPoint parse method removes support for lat_lon and geohash parameters * Backcompat testing for deprecated lat_lon and geohash parameters is added to all unit and integration tests * LatLonPointFieldMapper provides DocValues support (enabled by default) which uses Lucene's new LatLonDocValuesField type * New LatLonPoint field data classes are added for aggregation support (wraps LatLonPoint's Numeric Doc Values) * MultiFields use the geohash as the string value instead of the lat,lon string making it easier to perform geo string queries on the geohash instead of a lat,lon comma delimited string. Removed Features: * With the removal of geohash indexing, GeoHashCellQuery support is removed for all new indexes (still supported on existing indexes) * LatLonPoint does not support a Distance Range query because it is super inefficient. Instead, the geo_distance_range query should be accomplished using either the geo_distance aggregation, sorting by descending distance on a geo_distance query, or a boolean must not of the excluded distance (which is what the distance_range query did anyway). TODO: * fix/finish yaml changes for plugin and rest integration tests * update documentation	2016-09-13 12:17:36 -05:00
Jason Tedor	013e3f6fcc	Remove unused import from BootstrapForTesting This commit removes an unused import for o.e.c.l.LogConfigurator from o.e.b.BootstrapForTesting.	2016-09-13 09:49:15 -04:00
Tanguy Leroux	6090c51fc5	Add quiet option to disable console logging (#20422 ) This commit adds a -q/--quiet option to Elasticsearch so that it does not log anything in the console and closes stdout & stderr streams. This is useful for SystemD to avoid duplicate logs in both journalctl and /var/log/elasticsearch/elasticsearch.log while still allowing the JVM to print error messages in stdout/stderr if needed. closes #17220	2016-09-13 14:08:24 +02:00
Lee Hinman	44278db1bc	Merge pull request #20433 from dakrone/remove-cluster-name-folder-fallback No longer allow cluster name in data path	2016-09-12 17:01:49 -05:00
Lee Hinman	94625d74e4	No longer allow cluster name in data path In 5.x we allowed this with a deprecation warning. This removes the code added for that deprecation, requiring the cluster name to not be in the data path. Resolves #20391	2016-09-12 15:47:01 -06:00
Simon Willnauer	686994ae2d	Deguice SearchService and friends (#20423 ) This change removes the guice dependency handling for SearchService and several related classes like SearchTransportController and SearchPhaseController. The latter two now have package private constructors and dependencies like FetchPhase are now created by calling their constructors explicitly. This also cleans up several users of the DefaultSearchContext and centralizes its creation inside SearchService.	2016-09-12 22:42:55 +02:00
Ali Beyad	b1e87aa13c	Split allocator decision making from decision application (#20347 ) Splits the PrimaryShardAllocator and ReplicaShardAllocator's decision making for a shard from the implementation of that decision on the routing table. This is a step toward making it easier to use the same logic for the cluster allocation explain APIs.	2016-09-12 16:21:39 -04:00
Boaz Leskes	b08352047d	Introduce IndexShardTestCase (#20411 ) Introduce a base class for unit tests that are based on real `IndexShard`s. The base class takes care of all the little details needed to create and recover shards. This commit also moves `IndexShardTests` and `ESIndexLevelReplicationTestCase` to use the new base class. All tests in `IndexShardTests` that required a full node environment were moved to a new `IndexShardIT` suite.	2016-09-12 18:20:25 +02:00
Ali Beyad	f39f9b9760	Update discovery nodes after cluster state is published (#20409 ) Before, when there was a new cluster state to publish, zen discovery would first update the set of nodes to ping based on the new cluster state, then publish the new cluster state. This is problematic because if the cluster state failed to publish, then the set of nodes to ping should not have been updated. This commit fixes the issue by updating the set of nodes to ping for fault detection only after the new cluster state has been published.	2016-09-12 12:07:51 -04:00
Luca Cavanna	4b00cc37a1	Merge pull request #20382 from javanna/enhancement/cleanup_parse_elements Cleanup sub fetch phase extension point	2016-09-09 22:47:15 +02:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
javanna	90ab460fcc	move parsing of search ext sections to the coordinating node	2016-09-09 19:10:42 +02:00
javanna	65c7f61ad9	decouple registration of SearchExtParsers from sub fetch phases Search section supports an ext section that is used to provide additional config needed from plugins. It is now tied to sub fetch phases because it is the only section that may need additional config, but there is no reason for the two to be tightly coupled. It is now possible to register a searchExtParser independently from a sub fetch phase. All a search ext parser does is parse some ext section of a search request, whose parsed resulting object is stored in the search context for later retrieval.	2016-09-09 18:05:49 +02:00
javanna	f9530dfe8f	remove FetchSubPhaseContext in favour of generic fetch sub phase builder of type object The context was an object where the parsed info is stored. That is more of what we call the builder since the search refactoring. No need for generics in FetchSubPhaseParser then. Also the previous setHitsExecutionNeeded wasn't useful, it can be removed as well, given that once there is a parsed ext section, it will become a builder that can be retrieved by the sub fetch phase. The sub fetch phase is responsible for doing nothing in case the builder is not set, meaning that the fetch sub phase is plugged in but the request didn't have the corresponding section.	2016-09-09 18:05:49 +02:00
javanna	dc2ba90f48	clarify that SearchParseElement is only used for custom fetch sub phases and clean up extension point SearchParseElement is renamed to FetchSubPhaseParser and moved to the search.fetch package. Its parse method doesn't get the SearchContext as argument anymore, only the XContentParser, and the return type is what gets parsed (the fetch sub phase context which we may as well rename later). It is the parser that initializes the FetchSubPhaseContext then. SearchService retrieves the parser by name, calls parse against it and stores the result of parsing by name. No need for FetchSubPhase.ContextFactory anymore, which can be removed.	2016-09-09 18:05:49 +02:00
javanna	a33ca70ff5	make docValueFields similar to other standard sub fetch phases Given that doc value fields is our own fetch sub phase, it doesn't need to be implemented as if it were plugged in from the outside. It doesn't need its own fetch sub phase context, but it can just be an instance member in SearchContext	2016-09-09 18:05:49 +02:00
Jason Tedor	d8475488b8	Disable console logging Previously we would disable console logging in certain circumstances (for example, if Elasticsearch is not in the foreground, or if Elasticsearch is in the foreground but an exception was thrown during bootstrap). This commit makes this handling work with Log4j 2. This will prevent users from seeing double bootstrap check failure messages. Relates #20387	2016-09-09 09:15:35 -04:00
Jason Tedor	de43565abc	Do not log full bootstrap checks exception By default, when an exception causes the JVM to terminate, the stack trace is printed. In the case of failing bootstrap checks, this stack trace is useless to the user, and might even distract them from seeing that the bootstrap checks failed for reasons under their control. With this commit, we cause the stack trace for a failing bootstrap check to be truncated. We also modify some methods to not declare that they throw the top level checked exception type Exception, but instead explicitly declare the exceptions that they throw. These exceptions are caught and wrapped in a BootstrapException so that we can percolate only two exception types out of Bootstrap#init as checked exception, BootstrapException and NodeValidationException. Relates #19989	2016-09-08 10:56:11 -04:00
Tanguy Leroux	4fb7ac8254	Clean up XContentBuilder This commit cleans most of the methods of XContentBuilder so that: - Jackson's convenience methods are used instead of our custom ones (ie field(String,long) now uses Jackson's writeNumberField(String, long) instead of calling writeField(String) then writeNumber(long)) - null checks are added for all field names and values - methods are grouped by type in the class source - methods have the same parameters names - duplicated methods like field(String, String...) and array(String, String...) are removed - varargs methods now have the "array" name to reflect that it builds arrays - unused methods like field(String,BigDecimal) are removed - all methods now follow the execution path: field(String,?) -> field(String) then value(?), and value(?) -> writeSomething() method. Methods to build arrays also follow the same execution path.	2016-09-08 15:09:09 +02:00
Alexander Lin	f825e8f4cb	Exposing lucene 6.x minhash filter. (#20206 ) Exposing lucene 6.x minhash tokenfilter Generate min hash tokens from an incoming stream of tokens that can be used to estimate document similarity. Closes #20149	2016-09-07 09:38:12 +02:00
Simon Willnauer	11f2da5f14	Skip loading of jansi from log4j2 (#20334 ) Jython shades `jansi` into its classpath without changing its package or anything like that. This causes attempts to load native code on windows which blows up tests. This change adds `log4j.skipJansi=true` system property to our tests as well as to the JVM properties we set.	2016-09-06 05:53:00 -04:00
Simon Willnauer	5c2d9fa158	Improve error reporting for tests with BackgroundIndexer (#20324 ) The BackgroundIndexer now uses auto-generated IDs randomly. This causes some problems for tests that still rely on the fact that the IDs are increasing integers. This change exposes all IDs via a Set<String> to iterate over for tests.	2016-09-05 16:28:49 +02:00
Nik Everett	549ca3178b	Rename method in OldIndexUtils loadIndexList -> loadDataFilesList. The new method name is more accurate.	2016-09-02 10:16:30 -04:00
javanna	7c03f65c36	[TEST] adjusted EsTestCase#randomPositiveLong	2016-09-02 10:23:49 +02:00
javanna	536d13ff11	ProcessInfo to implement Writeable rather than Streamable	2016-09-02 10:23:05 +02:00
Simon Willnauer	825b80f2a6	[TEST] fix possible NPE in ClientYamlTestExecutionContext	2016-09-02 10:07:58 +02:00
Jason Tedor	1e80adbfbe	Configure test logging with Log4j 2 This commit configures test logging for Log4j 2. The default logger configuration uses the console appender but at the error level, so most tests are missing logging. Instead, this commit provides a configuration for tests which is picked up from the classpath by Log4j 2 when it initializes. However, this now means that we can no longer initialize Log4j with a bare-bones configuration when tests run as doing so will prevent Log4j 2 from attempting to configure logging via the classpath. Consequently, we move this needed initialization (as commented, to avoid a message about a status logger not being configured when we are preparing to configure Log4j from properties files in the config directory) to only run when we are explicitly configuring Log4j from properties files. Relates #20284	2016-09-01 14:00:47 -04:00
Simon Willnauer	a0becd26b1	Optimize indexing for the autogenerated ID append-only case (#20211 ) If elasticsearch controls the ID values as well as the document's version we can optimize the code that adds / appends the documents to the index. Essentially we can skip the version lookup for all documents unless the same document is delivered more than once. On the lucene level we can simply call IndexWriter#addDocument instead of #updateDocument but on the Engine level we need to ensure that we deoptimize the case once we see the same document more than once. This is done as follows: 1. Mark every request with a timestamp. This is done once on the first node that receives a request and is fixed for this request. This can be even the machine local time (see why later). The important part is that retry requests will have the same value as the original one. 2. In the engine we make sure we keep the highest seen time stamp of "retry" requests. This is updated while the retry request has its doc id lock. Call this `maxUnsafeAutoIdTimestamp`. 3. When an "optimized" request comes in, the engine compares its timestamp with the current `maxUnsafeAutoIdTimestamp` (but doesn't update it). If the request timestamp is higher it is safe to execute it as optimized (no retry request with the same timestamp has been run before). If not we fall back to "non-optimized" mode and run the request as a retry one and update the `maxUnsafeAutoIdTimestamp` unless it's been updated already to a higher value. Relates to #19813	2016-09-01 10:39:40 +02:00
Simon Willnauer	419627c460	Ensure ESTestCase is initialized before we run tests	2016-09-01 09:39:44 +02:00
Jason Tedor	76ab02e002	Merge branch 'master' into log4j2 * master: Avoid NPE in LoggingListener Randomly use Netty 3 plugin in some tests Skip smoke test client on JDK 9 Revert "Don't allow XContentBuilder#writeValue(TimeValue)" [docs] Remove coming in 2.0.0 Don't allow XContentBuilder#writeValue(TimeValue) [doc] Remove leftover from CONSOLE conversion Parameter improvements to Cluster Health API wait for shards (#20223) Add 2.4.0 to packaging tests list Docs: clarify scale is applied at origin+offest (#20242)	2016-08-31 16:37:55 -04:00
Stian Lindhom	c2eddaf2c9	Avoid NPE in LoggingListener This commit avoids an NPE that could arise when implementing an ESTestCase for test classes placed in the default package. Relates #20269	2016-08-31 16:11:12 -04:00
Ali Beyad	4641254ea6	Parameter improvements to Cluster Health API wait for shards (#20223 ) * Params improvements to Cluster Health API wait for shards Previously, the cluster health API used a strictly numeric value for `wait_for_active_shards`. However, with the introduction of ActiveShardCount and the removal of write consistency level for replication operations, `wait_for_active_shards` is used for write operations to represent values for ActiveShardCount. This commit moves the cluster health API's usage of `wait_for_active_shards` to be consistent with its usage in the write operation APIs. This commit also changes `wait_for_relocating_shards` from a numeric value to a simple boolean value `wait_for_no_relocating_shards` to set whether the cluster health operation should wait for all relocating shards to complete relocation. * Addresses code review comments * Don't be lenient if `wait_for_relocating_shards` is set	2016-08-31 11:58:19 -04:00
Jason Tedor	e166459bbe	Merge branch 'master' into log4j2 * master: Increase visibility of deprecation logger Skip transport client plugin installed on JDK 9 Explicitly disable Netty key set replacement percolator: Fail indexing percolator queries containing either a has_child or has_parent query. Make it possible for Ingest Processors to access AnalysisRegistry Allow RestClient to send array-based headers Silence rest util tests until the bogusness can be simplified Remove unknown HttpContext-based test as it fails unpredictably on different JVMs Tests: Improve rest suite names and generated test names for docs tests Add support for a RestClient base path	2016-08-31 10:59:27 -04:00
Jason Tedor	abf8a1a3f0	Avoid allocating log parameterized messages This commit modifies the call sites that allocate a parameterized message to use a supplier so that allocations are avoided unless the log level is fine enough to emit the corresponding log message.	2016-08-30 18:17:09 -04:00
Ryan Ernst	2a7a187bf8	Silence rest util tests until the bogusness can be simplified	2016-08-30 14:58:44 -07:00
Ryan Ernst	e19f2b6348	Tests: Improve rest suite names and generated test names for docs tests Rest test suites are currently named only after the directory above the yaml test file. That is confusing when more than one directory level contains yaml tests, as is the case in generated docs tests. This change makes rest tests use the full relative path to the rest test root as the suite name, and also makes the test names for docs tests a little clearer (that they are testing an example from a specific line number, instead of just the line number as an opaque test name).	2016-08-30 13:55:44 -07:00
Jason Tedor	7da0cdec42	Introduce Log4j 2 This commit introduces Log4j 2 to the stack.	2016-08-30 13:31:24 -04:00
javanna	61145bfb2f	[TEST] minor cleanups to AbstractQueryTestCase Removed null check for token, if we are outside the null it already means it is null. Fixed typo in comment and remove leftover assignment to unused local variable.	2016-08-29 16:52:11 +02:00
Yannick Welsch	f070c8727b	[TEST] Add additional logging to testStaleMasterNotHijackingMajority This test is periodically failing. As I suspect that the GCDisruption scheme is somehow making the wrong node block on its cluster state update thread, I've added some more logging and a thread dump once the given assertion triggers again.	2016-08-29 13:42:13 +02:00
Yannick Welsch	1b75cb63a2	Add recovery source to ShardRouting (#19516 ) Adds an explicit recoverySource field to ShardRouting that characterizes the type of recovery to perform: - fresh empty shard copy - existing local shard copy - recover from peer (primary) - recover from snapshot - recover from other local shards on same node (shrink index action)	2016-08-27 16:11:10 +02:00
Tanguy Leroux	68b943dc53	Fix MoreLikeThisQueryBuilderTests.testUnknownObjectException() Objects hierarchy must be tracked when entering/leaving an object so that it better knows if the "newField" has been inserted into an arbitrary holding object. Can be reproduced with gradle :core:test -Dtests.seed=760F8BD0F7E46D45 -Dtests.class=org.elasticsearch.index.query.MoreLikeThisQueryBuilderTests -Dtests.method="testUnknownObjectException" -Dtests.security.manager=true -Dtests.locale=ko -Dtests.timezone=Etc/Zulu	2016-08-25 20:54:06 +02:00
Tanguy Leroux	fbcfddbb77	Fix AbstractQueryTestCase.testUnknownObjectException() We need to check the whole hierarchy of objects to know if the newly inserted "newField" object is part of an arbitrary holding object or not. Reproduced with `gradle :modules:percolator:test -Dtests.seed=736B0B67DA7A3632 -Dtests.class=org.elasticsearch.percolator.PercolateQueryBuilderTests -Dtests.method="testUnknownObjectException" -Dtests.security.manager=true -Dtests.locale=es-ES -Dtests.timezone=ART`	2016-08-25 16:24:22 +02:00
Michael McCandless	1fe3e36934	Merge pull request #20147 from mikemccand/lucene_620_upgrade Upgrade to Lucene 6.2.0	2016-08-25 06:03:34 -04:00
Tanguy Leroux	20719f9b2f	Improve AbstractQueryTestCase#unknownObjectExceptionTest() This method fails when a randomized string value contains a double-quote. This commit changes the method so that it is not based on string concatenation anymore. It now uses XContentGenerator & XContentParser to mutate the valid queries. Related #19864	2016-08-25 10:57:30 +02:00
Mike McCandless	5eb66e3378	Mark Scandinavian analysis components as multi term aware	2016-08-24 19:50:25 -04:00
Mike McCandless	7492300544	Remove now unused Store.renameFile, and obsolete commented out code	2016-08-24 18:20:30 -04:00
Mike McCandless	0ccfe69789	Upgrade to Lucene 6.2.0	2016-08-24 17:26:28 -04:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows disabling the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Nicholas Knize	28ed0e7abf	Deprecate optimize_bbox on geodistance queries Deprecates the optimize_bbox parameter on geodistance queries. This has no longer been needed since version 2.2 because lucene geo distance queries (postings and LatLonPoint) already optimize by bounding box.	2016-08-23 09:14:54 -05:00
Yannick Welsch	771668f380	Use routingResult method to update cluster state after reroute This ensures that the routing table as well as the metadata (with the primary terms and in-sync allocation ids) is updated.	2016-08-19 17:15:02 +02:00
Ryan Ernst	8c60455ed6	Fix checkstyle line length violations in allocation tests	2016-08-17 16:28:31 -07:00
Ryan Ernst	1ff348ed7f	Plugins: Make custom allocation deciders use pull based extensions This change converts AllocationDecider registration from push based on ClusterModule to implementing with a new ClusterPlugin interface. AllocationDecider instances are allowed to use only Settings and ClusterSettings.	2016-08-17 15:55:31 -07:00
Ryan Ernst	2ea50bc162	Merge pull request #20018 from rjernst/split_disk_threshold Internal: Split disk threshold monitoring from decider	2016-08-17 07:57:50 -07:00
Yannick Welsch	27a760f9c1	Add routing changes API to RoutingAllocation (#19992 ) Adds a class that records changes made to RoutingAllocation, so that at the end of the allocation round other values can be more easily derived based on these changes. Most notably, it: - replaces the explicit boolean flag that is passed around everywhere to denote changes to the routing table. The boolean flag is automatically updated now when changes actually occur, preventing issues where it got out of sync with actual changes to the routing table. - records actual changes made to RoutingNodes so that primary term and in-sync allocation ids, which are part of index metadata, can be efficiently updated just by looking at the shards that were actually changed.	2016-08-17 10:46:59 +02:00
Ryan Ernst	b2c0f2d08f	Internal: Split disk threshold monitoring from decider In addition to being an allocation decider, DiskThresholdDecider also monitors the used disk in order to trigger a reroute when the thresholds are crossed. This change splits out the settings for disk thresholds into DiskThresholdSettings, and moves the monitoring to a new DiskThresholdMonitor. DiskThresholdDecider is then in line with other allocation deciders, needing only Settings and ClusterSettings for construction, which will allow deguicing allocation deciders.	2016-08-17 00:22:16 -07:00
Lee Hinman	1825d8060c	Merge remote-tracking branch 'dakrone/lockobtainfailed-replacement'	2016-08-16 14:41:27 -06:00
Lee Hinman	1de3388fa3	Switching LockObtainFailedException over to ShardLockObtainFailedException `LockObtainFailedException` should be reserved for on-disk locks that Lucene attempts (like `write.lock`). This switches our in-memory semaphore locks for shards to use a different exception. Additionally, ShardLockObtainFailedException no longer subclasses IOException, since no IO is being done in this case. Resolves #19978	2016-08-16 14:37:36 -06:00
Nik Everett	46bf8baf2e	Switch aggregation registration from push to pull Adds `getAggregations` to `SearchPlugin` which can be used to register aggregations. Fixup MockNode which wasn't creating MockBigArrays.	2016-08-16 09:08:36 -04:00
Nik Everett	cf6e1a4362	Move all FetchSubPhases to `o.e.search.fetch.subphase` As the most complicated `FetchSubPhase`, highlighting gets its own package (`o.e.search.fetch.subphase.highlight`). No other `FetchSubPhase`s get their own package. Instead they all reside together in `o.e.search.fetch.subphase`. Add package descriptions to `o.e.search.fetch` and subpackages.	2016-08-12 18:21:15 -04:00
Jason Tedor	1f0673c9bd	Default max local storage nodes to one This commit defaults the max local storage nodes to one. The motivation for this change is that a default value greater than one is dangerous as users sometimes end up unknowingly starting a second node and start thinking that they have encountered data loss. Relates #19964	2016-08-12 09:26:20 -04:00
Nik Everett	9f8f2ea54b	Remove ESIntegTestCase#pluginList It was a useful method in 1.7 when javac's type inference wasn't as good, but now we can just replace it with `Arrays.asList`.	2016-08-11 15:44:02 -04:00
Yannick Welsch	522b137097	Make NetworkPartition disruption scheme configurable (#19534 ) This commit separates the description of the links in the network that are to be disrupted from the failure that is to be applied to the links (disconnect/unresponsive/delay). Previously we had subclasses for the various kinds of network disruption schemes, each combining the failure mode (disconnect/unresponsive/delay) and the network links to cut (two partitions / bridge partitioning) into a single class.	2016-08-11 14:55:06 +02:00
Adrien Grand	0d6ac57acf	Collapse o.e.index.mapper packages. #19921 I also reduced the visibility of a couple classes and renamed/consolidated some test classes for consistency, eg. removing the `Simple` prefix or using the `<Type>FieldMapperTests` convention for testing field mappers.	2016-08-10 17:51:11 +02:00
javanna	7d4a6499e1	[TEST] add inline comments to AbstractQueryTestCase#unknownObjectExceptionTest	2016-08-10 12:21:25 +02:00
javanna	8391e6de37	[TEST] enable testUnknownObjectException for alternate query versions too	2016-08-10 12:21:25 +02:00
javanna	0a98b5e56e	[TEST] make AbstractQueryTestCase#testUnknownObjectException more accurate testUnknownObjectException used to generate malformed json objects in some cases, due to the existence of arrays as it was not closing the injected object correctly. That is why the test was catching JsonParseException among the exceptions that are expected to be thrown. That is fixed by tracking where the new object is placed and placing its end object marker at the right level rather than always at the end. Also introduced a mechanism to explicitly declare objects that won't cause any exception when they get additional objects injected, so that there is no need to override the method anymore as that caused copy pasting of the whole test method. This also makes sure that changes are reflected in tests, as those inner objects are not skipped but we actually check that what is declared is true (no exceptions get thrown when an additional object is added within them).	2016-08-10 11:48:51 +02:00
Lee Hinman	5849c488b5	Merge remote-tracking branch 'dakrone/compliation-breaker'	2016-08-09 11:57:26 -06:00
Lee Hinman	2be52eff09	Circuit break the number of inline scripts compiled per minute When compiling many dynamically changing scripts, parameterized scripts (<https://www.elastic.co/guide/en/elasticsearch/reference/master/modules-scripting-using.html#prefer-params>) should be preferred. This enforces a limit to the number of scripts that can be compiled within a minute. A new dynamic setting is added - `script.max_compilations_per_minute`, which defaults to 15. If more dynamic scripts are sent, a user will get the following exception: ```json { "error" : { "root_cause" : [ { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } ], "type" : "search_phase_execution_exception", "reason" : "all shards failed", "phase" : "query", "grouped" : true, "failed_shards" : [ { "shard" : 0, "index" : "i", "node" : "a5V1eXcZRYiIk8lecjZ4Jw", "reason" : { "type" : "general_script_exception", "reason" : "Failed to compile inline script [\"aaaaaaaaaaaaaaaa\"] using lang [painless]", "caused_by" : { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } } } ], "caused_by" : { "type" : "general_script_exception", "reason" : "Failed to compile inline script [\"aaaaaaaaaaaaaaaa\"] using lang [painless]", "caused_by" : { "type" : "circuit_breaking_exception", "reason" : "[script] Too many dynamic script compilations within one minute, max: [15/min]; please use on-disk, indexed, or scripts with parameters instead", "bytes_wanted" : 0, "bytes_limit" : 0 } } }, "status" : 500 } ``` This also fixes a bug in `ScriptService` where requests being executed concurrently on a single node could cause a script to be compiled multiple times 
(many in the case of a powerful node with many shards) due to no synchronization between checking the cache and compiling the script. There is now synchronization so that a script being compiled will only be compiled once regardless of the number of concurrent searches on a node. Relates to #19396	2016-08-09 10:26:27 -06:00
javanna	329eaaea65	[TEST] expand AbstractQueryTestCase#testQueryWrappedInArray to run against query alternate versions	2016-08-08 19:09:43 +02:00
javanna	2437226802	[TEST] restore tests repeatability in AbstractQueryTestCase Some random operations were conditionally performed in the before test, which made tests not repeatable. For instance take the seed chain to repeat a specific iteration and try to reproduce it, this conditional code would get executed in both cases when trying to isolate the failure, but not among the different iterations (as only the first method/iteration executes it), hence the failure will not reproduce. Moved the random operations to beforeClass and left the non random part in the before method, which is needed as it depends on some method that can be overridden by subclasses.	2016-08-05 22:38:31 +02:00
Luca Cavanna	4c1a3b9a53	Merge pull request #19791 from javanna/fix/multiple_fields_queries Query parsers to throw exception when multiple field names are provided	2016-08-05 15:53:35 +02:00
Ali Beyad	f59ca9083b	Snapshot repository cleans up empty index folders (#19751 ) This commit cleans up indices in a snapshot repository when all snapshots containing the index are deleted. Previously, empty indices folders would lie around after all snapshots containing them were deleted.	2016-08-05 09:39:02 -04:00
javanna	7f0bd56094	[TEST] use expectThrows wherever possible in query builder unit tests	2016-08-05 13:55:18 +02:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Daniel Mitterdorfer	4598c36027	Fix various concurrency issues in transport (#19675 ) Due to various issues (most notably a missing happens-before edge between socket accept and channel close in MockTcpTransport), MockTcpTransportTests sometimes did not terminate. With this commit we fix various concurrency issues that led to this hanging test. Failing example build: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-os-compatibility/os=oraclelinux/835/console	2016-08-04 21:00:59 +02:00
javanna	cd9388ce66	[TEST] parse query alternate versions in strict mode AbstractQueryTestCase parses the main version of the query in strict mode, meaning that it will fail if any deprecated syntax is used. It should do the same for alternate versions (e.g. short versions). This is the way it is because the two alternate versions for ids query are both deprecated. Moved testing for those to a specific test method that isolates the deprecations and actually tests that the two are deprecated.	2016-08-04 19:49:43 +02:00
javanna	146f02183d	[TEST] remove unused methods and fix some warnings in AbstractQueryTestCase Also fix line length issues	2016-08-04 10:06:25 +02:00
Luca Cavanna	c5a9427293	Merge pull request #19750 from javanna/fix/npe_parse_field_array Throw ParsingException if a query is wrapped in an array	2016-08-03 18:21:39 +02:00
javanna	4805250ecf	Throw ParsingException if a query is wrapped in an array Our parsing code accepted up until now queries in the following form (note that the query starts with `[`): ``` { "bool" : [ { "must" : [] } ] } ``` This would lead to a null pointer exception as most parsers assume that the field name ("must" in this example) is the first thing that can be found in a query if its json is valid, hence always non null while parsing. Truth is that the additional array layer doesn't make the json invalid, hence the following code fragment would cause NPE within ParseField, because null gets passed to `parseContext.isDeprecatedSetting`: ``` if (token == XContentParser.Token.FIELD_NAME) { currentFieldName = parser.currentName(); } else if (parseContext.isDeprecatedSetting(currentFieldName)) { // skip } else if (token == XContentParser.Token.START_OBJECT) { ``` We could add null checks in each of our parsers in lots of places, but we rely on `currentFieldName` being non null in all of our parsers, and we should consider it a bug when these unexpected situations are not caught explicitly. It would be best to find a way to prevent such queries altogether without changing all of our parsers. The reason why such a query goes through is that we've been allowing a query to start with either `[` or `{`. The only reason I found is that we accept `match_all : []`. This seems like an undocumented corner case that we could drop support for. Then we can be stricter and accept only `{` as start token of a query. That way the only next token that the parser can encounter if the json is valid (otherwise the json parser would barf earlier) is actually a field_name, hence the assumption that all our parsers make holds. The downside of this is simply dropping support for `match_all : []` Relates to #12887	2016-08-03 17:05:14 +02:00
Nik Everett	ca8f666c66	Add line number to yaml test failures Old: ``` > Throwable #1: java.lang.AssertionError: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at __randomizedtesting.SeedInfo.seed([9325F8C5C6F227DD:1B71C71F680E4A25]:0) > at org.elasticsearch.test.rest.yaml.section.DoSection.execute(DoSection.java:119) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.test(ESClientYamlSuiteTestCase.java:309) > at java.lang.Thread.run(Thread.java:745) ``` New: ``` > Throwable #1: java.lang.AssertionError: Failure at [reindex/10_basic:12]: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at __randomizedtesting.SeedInfo.seed([444DEEAF47322306:CC19D175E9CE4EFE]:0) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.executeSection(ESClientYamlSuiteTestCase.java:329) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.test(ESClientYamlSuiteTestCase.java:309) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.AssertionError: expected [2xx] status code but api [reindex] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parsing_exception","reason":"[reindex] failed to parse field [dest]","line":1,"col":25}],"type":"parsing_exception","reason":"[reindex] failed to parse field 
[dest]","line":1,"col":25,"caused_by":{"type":"illegal_argument_exception","reason":"[dest] unknown field [asdfadf], parser not found"}},"status":400}] > at org.elasticsearch.test.rest.yaml.section.DoSection.execute(DoSection.java:119) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.executeSection(ESClientYamlSuiteTestCase.java:325) > ... 37 more ``` Sorry for the longer stack trace, but I wanted to be sure I didn't throw anything away by accident.	2016-08-03 10:59:57 -04:00
Britta Weber	abcb4c8a97	[Test] move methods from bwc test to test package for use in plugins (#19738 ) * [Test] move methods from bwc test to test package for use in other plugins	2016-08-03 11:41:46 +02:00
Ryan Ernst	df8dc64e9b	Plugins: Make NamedWriteableRegistry immutable and add extension point for named writeables Currently any code that wants to add NamedWriteables to the NamedWriteableRegistry can do so via guice injection of the registry, and registering at construction time. However, this makes the registry complex: it has both get and register methods synchronized, and there is likely contention on the read side from multiple threads. The registration has mostly already been contained to guice modules at node construction time. This change makes the registry immutable, taking all of the NamedWriteable readers at construction time. It also allows plugins to add arbitrary named writeables that they may use in their own transport actions.	2016-08-02 15:56:25 -07:00
Ali Beyad	c4ae23f5d8	Enables implementations of the BlobContainer interface to (#19749 ) conform with the requirements of the writeBlob method by throwing a FileAlreadyExistsException if attempting to write to a blob that already exists. This change means implementations of BlobContainer should never overwrite blobs - to overwrite a blob, it must first be deleted and then can be written again. Closes #15579	2016-08-02 09:48:21 -04:00
Ali Beyad	456ea56527	Cleans up the BlobContainer interface by removing the (#19727 ) writeBlob method takes a BytesReference in favor of just the writeBlob method that takes an InputStream. Closes #18528	2016-08-02 09:21:43 -04:00
Ali Beyad	25d8eca62d	Removes the notion of write consistency level across all APIs in favor of waiting for active shard copy count (wait_for_active_shards).	2016-08-01 13:35:29 -04:00
Ali Beyad	9f88a8194a	Merge pull request #19706 from elastic/enhancement/snapshot-blob-handling More resilient blob handling in snapshot repositories	2016-08-01 12:03:53 -04:00
Tanguy Leroux	386902903e	[TEST] Kill remaining lang-groovy messy tests After #13834 many tests that used Groovy scripts (for good or bad reason) in their tests have been moved into the lang-groovy module and the issue #13837 has been created to track these messy tests in order to clean them up. The work started with #19280, #19302 and #19336 and this PR moves the remaining messy tests back in core, removes the dependency on Groovy, changes the scripts in order to use the mocked script engine, and changes the tests to integration tests. It also moves IndexLookupIT test back (even if it has a good chance of being removed soon) and fixes its tests. It also changes AbstractQueryTestCase to use custom script plugins in tests. closes #13837	2016-08-01 16:59:47 +02:00
Alexander Lin	9ac6389e43	Rename operation to result and reworking responses * Rename operation to result and reworking responses * Rename DocWriteResponse.Operation enum to DocWriteResponse.Result These are just easier to interpret names. Closes #19664	2016-08-01 10:42:58 -04:00
Alexander Lin	119026b4fb	Remove isCreated and isFound from the Java API This is cleanup work from #19566, where @nik9000 suggested trying to nuke the isCreated and isFound methods. I've combined nuking the two methods with removing UpdateHelper.Operation in favor of DocWriteResponse.Operation here. Closes #19631.	2016-07-29 14:21:43 -04:00
Nik Everett	2e7336dc10	Add package-info to o.e.test.rest This removes two packages, consolidating them into their parent package and adds `package-info.java` files to describe all of the packages under `org.elasticsearch.test.rest`.	2016-07-28 16:07:44 -04:00
David Pilato	0d2ccf0989	Merge branch 'pr/15724-gce-network-host-master'	2016-07-28 16:59:18 +02:00
Nik Everett	fb45f6a8a8	Add authentication to reindex-from-remote The tests for authentication extend ESIntegTestCase and use a mock authentication plugin. This way the clients don't have to worry about running it. Sadly, that means we don't really have good coverage on the REST portion of the authentication. This also adds ElasticsearchStatusException, an exception on which you can set an explicit status. The nice thing about it is that you can set the RestStatus that it returns to whatever arbitrary status you like based on the status that comes back from the remote system. reindex-from-remote then uses it to wrap all remote failures, preserving the status from the remote Elasticsearch or whatever proxy is between us and the remote Elasticsearch.	2016-07-27 14:17:41 -04:00
David Pilato	e9339a1960	Merge branch 'master' into pr/15724-gce-network-host-master	2016-07-27 11:24:53 +02:00
Boaz Leskes	6f76740a58	await fix testConcurrentSendRespondAndDisconnect	2016-07-26 23:42:10 +02:00
Nik Everett	9270e8b22b	Rename client yaml test infrastructure This makes it obvious that these tests are for running the client yaml suites. Now that there are other ways of running tests using the REST client against a running cluster we can't go on calling the shared client yaml tests "REST tests". They are rest tests, but they aren't the rest tests.	2016-07-26 13:53:44 -04:00
David Pilato	0d3edee928	Merge branch 'master' into pr/15724-gce-network-host-master	2016-07-26 18:51:01 +02:00
David Pilato	fde15ae470	Move custom name resolvers to NetworkService CTOR Instead of using NetworkModule we can directly inject them in NetworkService CTOR. See https://github.com/elastic/elasticsearch/pull/15765#issuecomment-235307974	2016-07-26 18:26:30 +02:00
Boaz Leskes	fabfd425f0	remove socket timeout from MockTcpTransport added in `b208a7dbae`	2016-07-26 18:04:05 +02:00
Boaz Leskes	dbdb6341a5	increase logging information in testConcurrentSendRespondAndDisconnect	2016-07-26 18:02:22 +02:00
Daniel Mitterdorfer	b208a7dbae	Add socket timeout in MockTcpTransport With this commit we set an explicit socket timeout in MockTcpTransport to avoid hanging tests in case of disconnections.	2016-07-26 16:04:51 +02:00
Nik Everett	a95d4f4ee7	Add Location header and improve REST testing This adds a header that looks like `Location: /test/test/1` to the response for the index/create/update API. The requirement for the header comes from https://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html and https://tools.ietf.org/html/rfc7231#section-7.1.2 claims that relative URIs are OK. So we use an absolute path which should resolve to the appropriate location. Closes #19079 This makes large changes to our rest test infrastructure, allowing us to write junit tests that test a running cluster via the rest client. It does this by splitting ESRestTestCase into two classes: * ESRestTestCase is the superclass of all tests that use the rest client to interact with a running cluster. * ESClientYamlSuiteTestCase is the superclass of all tests that use the rest client to run the yaml tests. These tests are shared across all official clients, thus the `ClientYamlSuite` part of the name.	2016-07-25 17:02:40 -04:00
Boaz Leskes	b90dff7292	increase log level to debug in testConcurrentSendRespondAndDisconnect	2016-07-25 22:01:09 +02:00
Ali Beyad	2f831c3abb	BytesArray tests fix: offsets don't matter on a zero bytes array Closes #19582	2016-07-25 15:22:08 -04:00
Tanguy Leroux	f745c96949	Clean up more messy tests After #13834 many tests that used Groovy scripts (for good or bad reason) in their tests have been moved into the lang-groovy module and the issue #13837 has been created to track these messy tests in order to clean them up. This commit moves more tests back in core, removes the dependency on Groovy, changes the scripts in order to use the mocked script engine, and changes the tests to integration tests.	2016-07-25 17:02:49 +02:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes across restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node name is part of each log message. The PR moves the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, setting `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. 
The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Jason Tedor	2d1b0587dd	Introduce Netty 4 This commit adds transport-netty4, a transport and HTTP implementation based on Netty 4. Relates #19526	2016-07-22 22:26:35 -04:00
Ali Beyad	a0a4d67eae	All snapshot metadata files use UUID for the blob ID	2016-07-22 13:52:13 -04:00
gfyoung	d98fd36dad	Added deleteBlob IOException test	2016-07-22 13:48:45 -04:00
javanna	db8beeba3b	Merge branch 'master' into feature/async_rest_client	2016-07-22 15:51:03 +02:00
Boaz Leskes	bd574d92ae	Verify lower level transport exceptions don't bubble up on disconnects (#19518 ) #19096 introduced a generic TCPTransport base class so we can have multiple TCP based transport implementations. These implementations can vary in how they respond internally to situations where we concurrently send, receive and handle disconnects and can have different exceptions. However, disconnects are important events for the rest of the code base and should be distinguished from other errors (for example, it signals TransportMasterAction that it needs to retry and wait for a (new) master to come back). Therefore, we should make sure that all the implementations do the proper translation from their internal exceptions into ConnectTransportException which is used externally. Similarly we should make sure that the transport implementations properly recognize errors that were caused by a disconnect as such and deal with them correctly. This was, for example, the source of a build failure at https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/1080 , where a concurrency issue caused SocketException to bubble out of MockTcpTransport. This PR adds a test which concurrently simulates connects, disconnects, sending and receiving and makes sure the above holds. It also fixes anything (not much!) that was found by it.	2016-07-22 14:35:47 +02:00
Tal Levy	f7cd86ef6d	rethrow script compilation exceptions into ingest configuration exceptions (#19318 ) * rethrow script compilation exceptions into ingest configuration exceptions * update readProcessor to rethrow any exception as an ElasticsearchException	2016-07-20 10:37:56 -07:00
javanna	a9b5c5adbe	restore throws IOException clause on all performRequest sync methods We throw IOException, which is the exception that is going to be thrown in 99% of the cases. A more generic exception can happen, and if it is a runtime one we just let it bubble up as is, otherwise we wrap it into runtime one so that we don't require to catch Exception everywhere, which seems odd. Also adjusted javadocs for all performRequest methods	2016-07-19 15:18:05 +02:00
javanna	1bb33cf572	Remove RestClient#JSON_CONTENT_TYPE constant, already available in ContentType class	2016-07-19 15:17:12 +02:00
javanna	e742d65e02	[TEST] Make sure the last response body is always available in our REST tests With the introduction of the async client, ResponseException doesn't eagerly read the response body anymore into a string. That is better, but raised a problem in our REST tests infra: we were reading the response body twice, while it can only be consumed once. Introduced a RestTestResponseException that wraps a ResponseException and exposes the body which now gets read only once.	2016-07-19 15:16:45 +02:00
javanna	41e97a7cb1	RestClient: take builder out to its own class The RestClient class is getting bigger and bigger, its builder can definitely be taken out to its own top level class: RestClientBuilder	2016-07-19 15:16:45 +02:00
javanna	1fbec71243	Rest client: introduce async performRequest method and use async client under the hood for sync requests too The new method accepts the usual parameters (method, endpoint, params, entity and headers) plus a response listener and an async response consumer. Shortcut methods are also added that make params, entity and the async response consumer optional. There are a few relevant api changes as a consequence of the move to async client that affect sync methods: - Response doesn't implement Closeable anymore, responses don't need to be closed - performRequest throws Exception rather than just IOException, as that is the exception that we get from the FutureCallback#failed method in the async http client - ssl configuration is a bit simpler, one only needs to call setSSLStrategy from a custom HttpClientConfigCallback, that doesn't end up overriding any other default around connection pooling (it used to happen with the sync client and make ssl configuration more complex) Relates to #19055	2016-07-19 15:15:58 +02:00
Nik Everett	a2a7ea1f17	Make ExtendedBounds immutable We used to mutate it as part of building the aggregation. That caused assertVersionSerializable to fail because it assumes that requests aren't mutated after they are sent. Closes #19481	2016-07-19 08:48:14 -04:00
Simon Willnauer	8394544548	Add a dedicated client/transport project for transport-client (#19435 ) The `client/transport` project adds a new jar build project that pulls in all dependencies and configures all required modules. Preinstalled modules are: * transport-netty * lang-mustache * reindex * percolator The `TransportClient` classes are still in core while `TransportClient.Builder` has only a protected constructor such that users are redirected to use the new `TransportClientBuilder` from the new jar. Closes #19412	2016-07-18 15:42:24 +02:00
Martijn van Groningen	e0ebf5da1c	Template cleanup: * Removed `Template` class and unified script & template parsing logic. Templates are scripts, so they should be defined as a script. Unless there will be separate template infrastructure, templates should share as much code as possible with scripts. * Removed ScriptParseException in favour for ElasticsearchParseException * Moved TemplateQueryBuilder to lang-mustache module because this query is hard coded to work with mustache only	2016-07-18 10:16:01 +02:00
Ali Beyad	687e2e12b3	Merge pull request #19450 from elastic/feature/friendly-index-creation Makes index creation more friendly	2016-07-15 11:48:21 -04:00
Ali Beyad	d78f40fb1e	Index creation waits for active shard copies before returning (#18985 ) Before returning, index creation now waits for the configured number of shard copies to be started. In the past, a client would create an index and then potentially have to check the cluster health to wait to execute write operations. With the cluster health semantics changing so that index creation does not cause the cluster health to go RED, this change enables waiting for the desired number of active shards to be active before returning from index creation. Relates #9126	2016-07-15 11:19:27 -04:00
Martijn van Groningen	d0069f0fbb	Provide access to ThreadContext in ingest plugins Also introduced a `Processor.Parameters` class that is a holder for several services processors rely on. The IngestPlugin#getProcessors(...) method has been changed to accept `Processor.Parameters` instead of each service separately.	2016-07-15 08:16:15 +02:00
Jason Tedor	31c648eee8	Rename transport-netty to transport-netty3 This commit renames the Netty 3 transport module from transport-netty to transport-netty3. This is to make room for a Netty 4 transport module, transport-netty4. Relates #19439	2016-07-14 22:03:14 -04:00
Jason Tedor	575fa4e00a	Fix line-length in o/e/t/r/s/Features.java This commit fixes a line-length checkstyle violation in o/e/t/r/s/Features.java.	2016-07-14 18:10:35 -04:00
Honza Král	e21b1e8066	[TEST] add 'yaml' feature for the test runner (#19436 ) Also renamed 30_yaml.yaml to 30_json.yaml since it tests json, not yaml	2016-07-14 17:30:32 +02:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428 ) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode is only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implementation since netty is now a module. Yet, to make the user experience flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just startup with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Simon Willnauer	29fd0f1bd8	[TEST] Remove wrong transportName from MockTcpTransport#ctor	2016-07-13 12:50:52 +02:00
Simon Willnauer	067ca1f996	[TEST] Use a semaphore to block until all in-flight requests are released	2016-07-13 10:31:05 +02:00
Simon Willnauer	814c7224f9	Merge pull request #19392 from elastic/modularize_netty This moves all netty related code into modules/transport-netty the module is build as a zip file as well as a JAR to serve as a dependency for transport client. For the time being this is required otherwise we have no network based impl. for transport client users. This might be subject to change given that we move forward http client.	2016-07-13 09:52:03 +02:00
Simon Willnauer	eba69ffade	[TEST] First decrement in-flight requests before releasing the latch	2016-07-12 22:58:03 +02:00
Simon Willnauer	ec55f9fff7	[TEST] Make AbstractSimpleTransportTestCase#testTimeoutSendExceptionWithDelayedResponse more robust and wait for in-flight request	2016-07-12 20:41:37 +02:00
Simon Willnauer	4fb79707bd	Fix remaining tests that either need access to the netty module or require explicit configuration Some tests still start http implicitly or miss configuring the transport clients correctly. This commit fixes all remaining tests and adds a dependency to `transport-netty` from `qa/smoke-test-http` and `modules/reindex` since they need an http server running on the nodes. This also moves all required permissions for netty into its module and out of core.	2016-07-12 16:29:57 +02:00
Luca Cavanna	f6aec3fdb5	Merge pull request #19373 from javanna/enhancement/rest_client_builder_callback Rest Client: add callback to customize http client settings	2016-07-12 13:30:27 +02:00
javanna	512b8be791	RestClient: simplify ssl configuration and make http config callback functional friendly	2016-07-12 13:25:55 +02:00
Boaz Leskes	081d04afac	Make NotMasterException a first class citizen (#19385 ) That exception is currently serialized as its current base class IllegalStateException which confuses code supposed to deal with the stepping down of a master. This is an important exception and we should be able to serialize it correctly. This commit fixes it by moving the exception to inherit from ElasticsearchException and properly register it. As a bonus I adapted CapturingTransport to properly simulate serialized exceptions.	2016-07-12 12:44:40 +02:00
javanna	fa0b354e66	Rest Client: add callback to customize http client settings The callback replaces the ability to fully replace the http client instance. By doing that, one used to lose any default that the RestClient had set for the underlying http client. Given that you'd usually override one or two things only, like a couple of timeout values, the ssl factory or the default credentials providers, it is not user friendly if by doing that users end up replacing the whole http client instance and lose any default set by us.	2016-07-12 12:31:28 +02:00
Simon Willnauer	199a5a1f04	Fix TcpTransport#sendRequest to raise NotConnectedException if we get disconnected while sending This also fixes a race in AbstractSimpleTransportTestCase where we never wait long enough for all responses to finish causing expected failures.	2016-07-12 10:56:20 +02:00
Ryan Ernst	93aebbef0f	Merge branch 'master' into modularize_netty	2016-07-11 23:49:00 -07:00
Ryan Ernst	7195d1e0ff	Fix plugins service to not double bind plugin components	2016-07-11 17:05:56 -07:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Ryan Ernst	99ac65931a	Plugins: Add components creator as bridge between guice and new plugin init world This change adds a createComponents() method to Plugin implementations which they can use to return already constructed components/services. Eventually this should be just services ("components" don't really do anything), but for now it allows any object so that preconstructed instances by plugins can still be bound to guice. Over time we should add basic services as arguments to this method, but for now I have left it empty so as to not presume what is a necessary service.	2016-07-11 14:14:06 -07:00
Simon Willnauer	048e4416e7	Move netty transport and http into a module This moves all netty code and its dependency into a module.	2016-07-11 22:21:29 +02:00
Ali Beyad	0faf638710	Blocked allocations on primary causes RED health If the allocation decision for a primary shard was NO, this should cause the cluster health for the shard to go RED, even if the shard belongs to a newly created index or is part of cluster recovery. Relates #9126	2016-07-11 15:32:13 -04:00
Ali Beyad	417bd0cd63	Index creation does not cause the cluster health to go RED Previously, index creation would momentarily cause the cluster health to go RED, because the primaries were still being assigned and activated. This commit ensures that when an index is created or an index is being recovered during cluster recovery and it does not have any active allocation ids, then the cluster health status will not go RED, but instead be YELLOW. Relates #9126	2016-07-11 15:30:47 -04:00
Simon Willnauer	47bd2f9ca5	More cleanups around tests that require HTTP to be enabled. (#19363 ) This commit moves most of the http related integ tests out into their own `qa/smoke-test-http` project where most of the tests can run against the external cluster.	2016-07-11 20:44:57 +02:00
Nik Everett	4b171b84cb	Fix modifier order checkstyle	2016-07-11 12:59:45 -04:00
Christoph Büscher	0d428b6ba8	Add test for GeoHashUtils#bbox()	2016-07-11 10:46:31 -05:00
Simon Willnauer	ee193f7697	[TEST] Catch RejectedOperationException when disconnecting from node in MockTcpTransport	2016-07-11 16:36:26 +02:00
Simon Willnauer	07260d4351	[TEST] Use AbstractRunnable when forking off threads on an executor	2016-07-11 16:27:07 +02:00
Simon Willnauer	3f3c93ec65	Add blocking socket based MockTcpTransport (#19332 ) Today we have a bunch of tests that use netty transport; for several reasons these tests use it because they need to run some tcp based transport. Yet, this couples our tests tightly to the netty implementation which should be tested on its own. This change adds a plain socket based blocking TcpTransport implementation that is used by default in tests if local transport is suppressed or if network is selected. It also adds another tcp network implementation as a showcase of how the interface works.	2016-07-11 12:17:52 +02:00
javanna	942e342662	Rest Client: use short performRequest methods when possible	2016-07-11 10:36:26 +02:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Tanguy Leroux	b58f2eb5c2	Move back some messy tests from Groovy plugin to core This commit moves back some messy tests that have been placed in lang-groovy module in https://github.com/elastic/elasticsearch/pull/13834. It removes the dependency on Groovy plugin as well as change back the tests to integration tests (IT suffix). It also changes the current MockScriptEngine and MockScriptPlugin to make it easier to use.	2016-07-07 15:26:36 +02:00
Alexander Reelsen	71b48fb16c	Dependencies: Update to jopt-5.0 (#19278 ) The new version of jopt allows us to remove a couple of TODOs in the code. Closes #12368	2016-07-07 08:50:10 +02:00
Ryan Ernst	e7818f75e1	Fix checkstyle for TestProcessor	2016-07-05 22:33:08 -07:00
Ryan Ernst	2fc41adeb5	Merge branch 'master' into ingest_plugin_api	2016-07-05 20:53:03 -07:00
Jason Tedor	d0765d0761	Merge branch 'master' into feature/seq_no * master: (192 commits) [TEST] Fix rare OBOE in AbstractBytesReferenceTestCase Reindex from remote Rename writeThrowable to writeException Start transport client round-robin randomly Reword Refresh API reference (#19270) Update fielddata.asciidoc Fix stored_fields message Add missing footer notes in mapper size docs Remote BucketStreams Add doc values support to the _size field in the mapper-size plugin Bump version to 5.0.0-alpha5. Update refresh.asciidoc Update shrink-index.asciidoc Change Debian repository for Vagrant debian-8 box [TEST] fix test to account for internal empyt reference optimization Upgrade to netty 3.10.6.Final (#19235) [TEST] fix histogram test when extended bounds overlaps data Remove redundant modifier Simplify TcpTransport interface by reducing send code to a single send method (#19223) Fix style violation in InstallPluginCommand.java ...	2016-07-05 22:01:07 -04:00
Nik Everett	b3c015e2bb	Reindex from remote This adds a remote option to reindex that looks like ``` curl -POST 'localhost:9200/_reindex?pretty' -d'{ "source": { "remote": { "host": "http://otherhost:9200" }, "index": "target", "query": { "match": { "foo": "bar" } } }, "dest": { "index": "target" } }' ``` This reindex has all of the features of local reindex: * Using queries to filter what is copied * Retry on rejection * Throttle/rethrottle The big advantage of this version is that it goes over the HTTP API which can be made backwards compatible. Some things are different: The query field is sent directly to the other node rather than parsed on the coordinating node. This should allow it to support constructs that are invalid on the coordinating node but are valid on the target node. Mostly, that means old syntax.	2016-07-05 16:13:17 -04:00
Jason Tedor	96f283c195	Rename writeThrowable to writeException This commit renames writeThrowable to writeException. The situation here stems from the fact that the StreamOutput method for serializing Exceptions needs to accept Throwables too as Throwables can be the cause of serialized Exceptions. Yet, we do not serialize Throwables in the Error sub-hierarchy in a way that they can be deserialized into their initial type. This leads to an asymmetry in the StreamOutput method for serializing Exceptions and the StreamInput method for reading Exceptions. Namely, the former will accept Throwables but the latter will only return Exceptions. A goal with the stream methods has always been symmetry in the method names so that serialization/deserialization routines appear symmetrical in code. It is this asymmetry on the input/output types for Exceptions on StreamOutput/StreamInput that clashes with the desired symmetry of naming. Despite this, we should favor symmetry in the naming of the methods. This commit renames StreamOutput#writeThrowable to StreamOutput#writeException which leaves us with Exception StreamInput#readException and void StreamOutput#writeException(Throwable).	2016-07-05 14:37:01 -04:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitations, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the node's data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). It does however also have downsides - we now run the risk of two nodes having the same id, if someone clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. 
Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` parameter from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Tanguy Leroux	0e7faf1005	Enable Checkstyle RedundantModifier	2016-07-04 15:22:12 +02:00
Jason Tedor	3343ceeae4	Do not catch throwable Today throughout the codebase, catch throwable is used with reckless abandon. This is dangerous because the throwable could be a fatal virtual machine error resulting from an internal error in the JVM, or an out of memory error or a stack overflow error that leaves the virtual machine in an unstable and unpredictable state. This commit removes catch throwable from the codebase and removes the temptation to use it by modifying listener APIs to receive instances of Exception instead of the top-level Throwable. Relates #19231	2016-07-04 08:41:06 -04:00
Ryan Ernst	5a66c08ae9	Merge branch 'master' into ingest_plugin_api	2016-07-01 16:27:52 -07:00
Ryan Ernst	822c995367	Internal: Remove generics from LifecycleComponent The only reason for LifecycleComponent taking a generic type was so that it could return that type on its start and stop methods. However, this chaining has no practical necessity. Instead, start and stop can be void, and a whole bunch of confusing generics disappear.	2016-07-01 16:17:42 -07:00
Ryan Ernst	e5caadc4f3	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:35:26 -07:00
Nik Everett	f30a70c51f	Fix comment I forgot a word....	2016-07-01 14:48:08 -04:00
Nik Everett	ff42d7cfc6	Add embedded stash key support to rest tests This allows embedding stash keys in strings like `t${key}est`. This allows simple string concatenation like actions. The test for this is in `ObjectPathTests` because `Stash` doesn't seem to have a test on its own and it is simple enough to test embedded stashes this way. And this is a way I expect them to be used eventually.	2016-07-01 14:11:11 -04:00
Ryan Ernst	65c9b0b588	Merge branch 'master' into ingest_plugin_api	2016-07-01 09:26:17 -07:00
Tanguy Leroux	8c40b2b54e	Fix order of modifiers	2016-07-01 16:57:14 +02:00
Simon Willnauer	5c8164a561	Clean up BytesReference (#19196 ) BytesReference should be a really simple interface, yet it has a gazillion ways to achieve the same thing. Methods like `#hasArray`, `#toBytesArray`, `#copyBytesArray` `#toBytesRef` `#bytes` are all really duplicates. This change simplifies the interface dramatically and makes implementations of it much simpler. All array access has been removed and is streamlined through a single `#toBytesRef` method. Utility methods to materialize a compact byte array have been added too for convenience.	2016-07-01 16:09:31 +02:00
javanna	dd781d410a	fix line length problems in all classes under o.e.test.rest package	2016-07-01 11:13:10 +02:00
javanna	0b5a549305	[TEST] remove special treatment for stashed $body in REST tests, instead always evaluate the stash through ObjectPath When we introduced docs testing we added a special case for $body in Stash, so that the last stashed body could be evaluated, and expressions like "$body.took" could be extracted out of it. We can instead do that for any object in the stash, by simply wrapping the internal map in an ObjectPath instance. We can then drop the special stashResponse method and go back to using the ordinary stashValue too. The downside of this change is that it adds a feature that may not be supported by other REST test runners, namely the evaluation of compound paths from the stash. If we have "object" stashed as an object, it is now possible to extract directly each subobject of it as well e.g. "object.subobject.field1". None of the current REST tests rely on this, but our docs snippets tests do.	2016-07-01 11:13:10 +02:00
javanna	43b82ce244	[TEST] remove feature yaml from REST tests The only runner that supported it was the java runner, we can use json format instead given that the default one with cat apis is text	2016-07-01 11:13:10 +02:00
javanna	60bafa5d78	[TEST] parse yaml responses too through ObjectPath rather than only json responses No need to match against yaml responses via regexes in REST tests, yaml responses can be properly parsed via ObjectPath instead. Few REST tests need to be updated accordingly.	2016-07-01 11:13:10 +02:00
javanna	34f5c50a7f	[TEST] eagerly parse response body at ObjectPath initialization and read content type from response headers We are going to parse the body anyways whenever it's in json format as it is going to be stashed. It is not useful to lazily parse it anymore. Also this allows us to not rely on automatic detection of the xcontent type based on the content of the response, but rather read the content type from the response headers.	2016-07-01 11:13:10 +02:00
javanna	d5df738538	[TEST] ObjectPath to support parsing yaml or json that have an array as root object ObjectPath used a Map up until now for the internal representation of its navigable object. That works in most of the cases, but there could also be an array as root object, in which case a List needs to be used instead of a Map. This commit changes the internal representation of the object to Object which can either be a List or a Map. The change is minimal as ObjectPath already had the checks in place to verify the type of the object in the current position and navigate through it. Note: The new test added to ObjectPathTest uses yaml format explicitly as auto-detection of json format works only for a json object that starts with '{', not if the root object is actually an array and starts with '['.	2016-07-01 11:13:10 +02:00
javanna	bbaa23bdfd	[TEST] extend ObjectPathTests to support also yaml format	2016-07-01 11:13:10 +02:00
javanna	44dc801e90	[TEST] make JsonPath independent of data format, rename to ObjectPath The internal representation of the object that JsonPath gives access to is a map. That is independent of the initial input format, which is json but could also be yaml etc. This commit renames JsonPath to ObjectPath and adds a static method to create an ObjectPath from an XContent	2016-07-01 11:13:10 +02:00
javanna	76199ce497	[TEST] rename REST tests Stash methods to distinguish between retrieving a value and replacing values within a map Stash#unstashMap -> replaceStashedValues Stash#unstashValue -> getValue	2016-07-01 11:13:10 +02:00
javanna	62462f5d9b	[TEST] replace ResponseBodyAssertion with existing MatchAssertion We introduced a special response_body assertion to test our docs snippets. The match assertion does the same job though and can be reused and adapted where needed. ResponseBodyAssertion provides much better and more accurate errors though, which can now be utilized in MatchAssertion so that many more REST tests can benefit from readable error messages. Each response body always gets stashed and can be retrieved for later evaluations already. Instead of providing the response body as strings that get parsed to json objects separately, then converted to maps as ResponseBodyAssertion did, we parse everything once, the json is part of the yaml test, which is supported. The only downside is that json comments cannot be used, rather yaml comments should be used (// C style vs # ). There were only two docs tests that were using comments in ingest-node.asciidoc where I went ahead and removed the comments which didn't seem that useful anyways.	2016-07-01 11:13:10 +02:00
javanna	598c36128e	Revert "Raised IOException on deleteBlob (#18815 )" This reverts commit `d24cc65cad` as it seems to be causing test failures.	2016-07-01 11:00:32 +02:00
gfyoung	d24cc65cad	Raised IOException on deleteBlob (#18815 ) Raise IOException on deleteBlob if the blob doesn't exist This commit raises an IOException on BlobContainer#deleteBlob if the blob does not exist, in conformance with the BlobContainer interface contract. Each implementation of BlobContainer now conforms to this contract (file system, S3, Azure, HDFS). This commit also contains blob container tests for each of the repository implementations. Closes #18530	2016-06-30 23:00:10 -04:00
Nik Everett	f5a269b029	Start migration away from aggregation streams We'll migrate to NamedWriteable so we can share code with the rest of the system. So we can work on this in multiple pull requests without breaking Elasticsearch in between the commits this change supports both old style `InternalAggregations.stream` serialization and `NamedWriteable` style serialization. As such it creates about a half dozen `// NORELEASE` comments that will have to be removed once the migration is complete. This also introduces a boolean `transportClient` flag to `SearchModule` which is used to skip inappropriate registrations for the transport client while still registering the things it needs. In this case that means that the `InternalAggregation` subclasses are registered with the `NamedWriteableRegistry` but the `AggregationBuilder` subclasses are not. Finally, this moves aggregation registration from guice configuration time to `SearchModule` construction time. This will make it simpler to work with in the future as we further clean up Elasticsearch's extension points.	2016-06-30 12:57:34 -04:00
Boaz Leskes	09ca6d6ed2	Add a BridgePartition to be used by testAckedIndexing (#19172 ) We have long worked to capture different partitioning scenarios in our testing infra. This PR adds a new variant, inspired by the Jepsen blogs, which was forgotten so far - namely a partition where one node can still see and be seen by all other nodes. It also updates the resiliency page to better reflect all the work that was done in this area.	2016-06-30 17:58:12 +02:00
jaymode	983a64c833	Add support for `teardown` section in REST tests This commits adds support for a `teardown` section that can be defined in REST tests to clean up any items that may have been created by the test and are not cleaned up by deletion of indices and templates.	2016-06-30 11:33:29 -04:00
Ryan Ernst	0732004ae8	Merge pull request #19177 from rjernst/ingest_factory_generic Remove generics from ingest Processor.Factory	2016-06-30 08:08:26 -07:00
Simon Willnauer	40ec639c89	Factor out abstract TCPTransport* classes to reduce the netty footprint (#19096 ) Today we have a ton of logic inside the NettyTransport* codebase. The footprint of the code that has a direct netty dependency is large and alternative implementations are pretty hard today since they need to know all about our protocol etc. This change moves most of the code into TCPTransport* baseclasses and moves all the protocol send code together. The base classes now contain the majority of the logic while NettyTransport* classes remain to implement the glue code, configuration and optimization.	2016-06-30 13:41:53 +02:00
Ryan Ernst	e4f265eb3a	Ingest: Remove generics from Processor.Factory The factory for ingest processor is generic, but that is only for the return type of the create method. However, the actual consumer of the factories only cares about Processor, so generics are not needed. This change removes the generic type from the factory. It also removes AbstractProcessorFactory which only existed in order to pull the optional tag from config. This functionality is moved to the caller of the factories in ConfigurationUtil, and the create method now takes the tag. This allows the covariant return of the implementation to work with tests not needing casts.	2016-06-30 02:33:54 -07:00
Ryan Ernst	08b3b6264e	Tests pass, started removing generics from processor factory	2016-06-30 01:49:22 -07:00
Ryan Ernst	f1376262fe	Merge branch 'master' into ingest_plugin_api	2016-06-29 14:16:16 -07:00
Simon Willnauer	872cdffc27	Factor out ChannelBuffer from BytesReference (#19129 ) The ChannelBuffer interface today leaks into the BytesReference abstraction which causes a hard dependency on Netty across the board. This change moves this dependency and all BytesReference -> ChannelBuffer conversion into NettyUtils and removes the abstraction leak on BytesReference. This change also removes unused methods on the BytesReference interface and simplifies access to internal pages.	2016-06-29 10:45:05 +02:00
Ryan Ernst	258c3e86ab	Added IngestPlugin api, cutover common and geoip, changed ingest factory api to take ProcessorsRegistry	2016-06-28 10:52:07 -07:00
Yannick Welsch	3cc2251e33	Fix number of arguments provided to logger calls	2016-06-28 17:38:56 +02:00
Boaz Leskes	2512594d9e	Testing infra - stabilize data folder usage and clean up (#19111 ) The plan for persistent node ids ( #17811 ) is to tie the node identity to a file stored in its data folders. As such it becomes important that nodes in our testing infra have better affinity with their data folders and that their data folders are not cleaned underneath them. The first is important because we fix the random seed used for node id generation (for reproducibility) and allowing the same node to use two different data folders causes two separate nodes to have the same id, which prevents the cluster from forming. The second is important, for example, where a full cluster restart / single node restart need to maintain node identity and wiping the data folders at the wrong moment prevents this. Concretely this commit does the following: 1) Remove previous attempts to have data folder per role using a prefix. This wasn't effective as it was using the data paths settings which are only used for part of the runs. An attempt to completely separate the paths via the home dir failed due to assumptions made by index custom path about node data folder ordinal uniqueness (see #19076) 2) Change full cluster restarts to start up nodes in the same order they were first created in, only randomly swapping nodes with the same roles. 3) Change test cluster reset methods to first shutdown the unneeded nodes and then re-start the shared nodes that were shut down, so they'll reclaim their data folders. 4) Improve data folder wiping logic and make sure it wipes only folders of "offline" nodes. 5) Add some very basic tests	2016-06-28 16:38:56 +02:00
Jason Tedor	2f638b5a23	Keep input time unit when parsing TimeValues This commit modifies TimeValue parsing to keep the input time unit. This enables round-trip parsing from instances of String to instances of TimeValue and vice-versa. With this, this commit removes support for the unit "w" representing weeks, and also removes support for fractional values of units (e.g., 0.5s). Relates #19102	2016-06-27 18:41:18 -04:00
Nik Everett	79fa778e33	Fix percolator tests They need their plugin or they'll break!	2016-06-27 15:34:36 -04:00
Ryan Ernst	33ccc5aead	Merge branch 'master' into mapper_plugin_api	2016-06-27 11:19:59 -07:00
Boaz Leskes	cb0824e957	Make shard store fetch less dependent on the current cluster state, both on master and non data nodes (#19044 ) #18938 has changed the timing in which we send out to nodes to fetch their shard stores. Instead of doing this after the cluster state resulting of the node's join was published, #18938 made it be sent concurrently to the publishing processes. This revealed a couple of points where the shard store fetching is dependent on the current state of affairs of the cluster state, both on the master and the data nodes. The problems discovered were already present without #18938 but required a failure/extreme situations to make them happen. This PR tries to remove as much as possible of these dependencies making shard store fetching simpler and make the way to re-introduce #18938 which was reverted. These are the notable changes: 1) Allow TransportNodesAction (of which shard store fetching is derived) callers to supply concrete disco nodes, so it won't need the cluster state to resolve them. This was a problem because the cluster state containing the needed nodes was not yet made available through ClusterService. Note that long term we can expect the rest layer to resolve node ids to concrete nodes, making this mode the only one needed. 2) The data node relied on the cluster state to have the relevant index meta data so it can find data when custom paths are used. We now fall back to read the meta data from disk if needed. 3) The data node was relying on its own IndexService state to indicate whether the data it has corresponds to an existing allocation. This is of course something it can not know until it got (and processed) the new cluster state from the master. This flag in the response is now removed. This is not a problem because we used that flag to protect against double assigning of a shard to the same node, but we are already protected from it by the allocation deciders. 4) I removed the redundant filterNodeIds method in TransportNodesAction - if people want to filter they can override resolveRequest.	2016-06-27 15:05:06 +02:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing analysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is setup. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnalysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This means it is available at module construction time so we slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Ryan Ernst	6995bde710	Merge branch 'master' into mapper_plugin_api	2016-06-24 11:15:06 -07:00
Jason Tedor	112669daed	Merge branch 'master' into feature/seq_no * master: (416 commits) docs: removed obsolete information, percolator queries are not longer loaded into jvm heap memory. Upgrade JNA to 4.2.2 and remove optionality [TEST] Increase timeouts for Rest test client (#19042) Update migrate_5_0.asciidoc Add ThreadLeakLingering option to Rest client tests Add a MultiTermAwareComponent marker interface to analysis factories. #19028 Attempt at fixing IndexStatsIT.testFilterCacheStats. Fix docs build. Move templates out of the Search API, into lang-mustache module revert - Inline reroute with process of node join/master election (#18938) Build valid slices in SearchSourceBuilderTests Docs: Convert aggs/misc to CONSOLE Docs: migration notes for _timestamp and _ttl Group client projects under :client [TEST] Add client-test module and make client tests use randomized runner directly Move upgrade test to upgrade from version 2.3.3 Tasks: Add completed to the mapping Fail to start if plugin tries broken onModule Remove duplicated read byte array methods Rename `fields` to `stored_fields` and add `docvalue_fields` ...	2016-06-23 11:52:11 -04:00
Yannick Welsch	a5908a5da5	[TEST] Increase timeouts for Rest test client (#19042 ) Some Rest / Doc tests were running into the default socket timeout of 10 seconds.	2016-06-23 14:05:56 +02:00
Adrien Grand	7ba5bceebe	Add a MultiTermAwareComponent marker interface to analysis factories. #19028 This is the same as what Lucene does for its analysis factories, and we have tests that make sure that the elasticsearch factories are in sync with Lucene's. This is a first step to move forward on #9978 and #18064.	2016-06-23 10:19:24 +02:00
Tanguy Leroux	04da1bda0d	Move templates out of the Search API, into lang-mustache module This commit moves template support out of the Search API to its own dedicated Search Template API in the lang-mustache module. It provides a new SearchTemplateAction that can be used to render templates before it gets delegated to the usual Search API. The current REST endpoints are identical, but the Render Search Template endpoint now uses the same Search Template API with a new "simulate" option. When this option is enabled, the Search Template API only renders the template and returns immediately, without executing the search. Closes #17906	2016-06-23 09:30:53 +02:00
Nik Everett	0bf447c697	Group client projects under :client :client ---------> :client:rest :client-sniffer -> :client:sniffer :client-test ----> :client:test This lines the client up with how we do things like modules and plugins.	2016-06-22 14:26:41 -04:00
javanna	490d9c8cf7	Merge branch 'master' into feature/http_client	2016-06-22 09:50:07 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Ryan Ernst	e817b5daa3	Plugins: Remove guice from Mapper plugins This change adds a MapperPlugin interface which allows pull style retrieval of mappers and metadata mappers added by plugins. For now, I have kept the MapperRegistry, but this should be removed in the future as it is just a silly container for 2 maps which could themselves be passed around.	2016-06-21 22:50:39 -07:00
Nik Everett	8925400f67	Remove guice from ScriptService Makes ScriptModule just a plain class that manages building the ScriptSettings and ScriptService from plugins. When we need to bind ScriptService with guice we bind it in a lambda.	2016-06-21 16:45:45 -04:00
Adrien Grand	8078c205f9	Revert "Remove `_timestamp` and `_ttl` on 5.x indices. #18980" This reverts commit `969e953645`. Docs are failing because of the removed functionality. I will fix the docs before pushing it again.	2016-06-21 19:19:49 +02:00
Adrien Grand	969e953645	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-21 18:04:58 +02:00
javanna	886cb37efb	Merge branch 'master' into feature/http_client	2016-06-21 15:53:37 +02:00
Nik Everett	ba1d6907ab	Quiet the logging of the docs tests Significantly quiets the logging of the docs tests by: 1. Switching two log statements to debug level. 2. Only calling ESTestCase#afterIfFailed if the test failure wasn't just assumptions being violated.	2016-06-21 08:31:09 -04:00
Martijn van Groningen	82f7bfad98	ingest: merged o.e.ingest.core with o.e.ingest and in ingest-common module added o.e.ingest.common package and moved all code to that package.	2016-06-21 09:24:00 +02:00
Simon Willnauer	459665914b	Detach BigArrays from Guice (#18973 ) BigArrays can be fully constructed without Guice, this change cleans up its creation and the mocking in MockNode.	2016-06-20 13:18:19 +02:00
Simon Willnauer	e50314bb6e	Remove NodeClientModule and PluginsModule	2016-06-20 11:53:07 +02:00
Simon Willnauer	7fea5bd8e7	Remove obsolete Modules that can simply be inlined in node creation	2016-06-20 11:28:14 +02:00
Simon Willnauer	260f38fd76	Remove VersionModule and use Version#current consistently. We pretended to be able to act like a different version node for so long it's time to be honest and remove this ability. It's just confusing and where needed and tested we should build dedicated extension points.	2016-06-20 10:55:52 +02:00
Tanguy Leroux	98951b1203	Compile each Groovy script in its own classloader closes #18572	2016-06-20 08:17:09 +02:00
Boaz Leskes	14cd8a6794	Introduce Replication unit tests using real shards (#18930 ) This commit introduces unit testing infrastructure to test replication operations using real index shards. This infra is complementary to the full integration tests and unit testing of ReplicationOperation we already have. The new ESIndexLevelReplicationTestCase base makes it easier to test and simulate failure modes that require real shards but do not need the full-blown stack of a complete node. The commit also adds a simple "nothing is wrong" test plus a test that checks we don't drop docs during the various stages of recovery. For now, only single doc indexing is supported but this can be easily extended in the future.	2016-06-18 18:53:47 +02:00
Areek Zillur	9356a6090f	Merge branch 'master' into enhancement/rollover_api	2016-06-17 11:35:57 -04:00
Simon Willnauer	bdb6dcea3a	Cleanup ClusterService dependencies and detached from Guice (#18941 ) This change removes some unnecessary dependencies from ClusterService and cleans up ClusterName creation. ClusterService is now not created by guice anymore.	2016-06-17 17:07:19 +02:00
Areek Zillur	545ffa7801	Merge branch 'master' into enhancement/rollover_api	2016-06-17 10:33:11 -04:00
javanna	af93533a17	Merge branch 'master' into feature/http_client	2016-06-17 13:50:18 +02:00
Areek Zillur	6adffa6b7b	Merge branch 'master' into enhancement/rollover_api	2016-06-16 17:27:32 -04:00
Ryan Ernst	8196cf01e3	Merge branch 'master' into plugin_name_api	2016-06-16 13:49:28 -07:00
Simon Willnauer	b22c526b34	Cut over settings registration to a pull model (#18890 ) Today we have a push model for registering basically anything. All our extension points are defined on modules which we pass in to plugins. This is harder to maintain and adds unnecessary dependencies on the modules itself. This change moves towards a pull model where the plugin offers a getter kind of method to get the extensions. This will also help in the future if we need to pass dependencies to the extension points which can easily be defined on the method as arguments if a pull model is used.	2016-06-16 15:52:58 +02:00
Nik Everett	5aa4769b25	Move waitForTaskCompletion into TaskManager This allows for listening for the waiting to start using MockTaskManager. This allows us to work around a race condition in the TasksIT.	2016-06-16 09:45:46 -04:00
Simon Willnauer	18ff051ad5	Simplify ScriptModule and script registration (#18903 ) Registering a script engine or native scripts still uses Guice today and is much more complicated than needed. This change moves to a pull based model where script plugins have to implement a dedicated interface `ScriptPlugin` and defines simple getter returning instances rather than classes.	2016-06-16 09:35:13 +02:00
Ryan Ernst	a4503c2aed	Plugins: Remove name() and description() from api In 2.0 we added plugin descriptors which require defining a name and description for the plugin. However, we still have name() and description() which must be overridden from the Plugin class. This still exists for classpath plugins. But classpath plugins are mainly for tests, and even then, referring to classpath plugins with their class is a better idea. This change removes name() and description(), replacing the name for classpath plugins with the full class name.	2016-06-15 17:12:22 -07:00
Tal Levy	a26260fb72	new ScriptProcessor for Ingest (#18193 ) add new ScriptProcessor for executing ES Scripts within pipelines	2016-06-15 14:57:18 -07:00
Daniel Mitterdorfer	f32b700472	Exclude admin / diagnostic requests from HTTP request limiting With this commit we exclude certain HTTP requests that are needed to inspect the cluster from HTTP request limiting to ensure these commands are processed even in critical memory conditions. Relates #17951, relates #18145, closes #18833	2016-06-15 14:29:46 +02:00
javanna	ace3a7b146	Merge branch 'master' into feature/http_client	2016-06-15 11:44:46 +02:00
Simon Willnauer	429dd3a876	Simplify FetchSubPhase registration and detach it from Guice (#18862 ) This commit changes FetchSubPhase registration by class to registration by instance. No Guice binding needed anymore.	2016-06-15 09:13:02 +02:00
Nik Everett	d0e4485d42	Move NamingConventionsCheck into buildSrc This will let things that don't depend on :test:framework like the client use it. Also skip initializing the classes we check because we don't care about their initialization behavior because we're not executing them. This makes the naming conventions check pretty close to instant from a "human eye" perspective.	2016-06-14 18:30:34 -04:00
Colin Goodheart-Smithe	d7e3f9e4eb	#18854 Remove size 0 options in aggregations Remove size 0 options in aggregations	2016-06-14 15:32:42 +01:00
Simon Willnauer	4d78f280ed	Remove dead code and dead parameters (#18855 )	2016-06-14 15:25:44 +02:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Adrien Grand	44c653f5a8	Upgrade to lucene-6.1.0-snapshot-3a57bea.	2016-06-10 16:18:12 +02:00
javanna	cf6e713d77	Merge branch 'master' into feature/http_client	2016-06-09 17:43:45 +02:00
javanna	437c4f210b	rename ElasticsearchResponse to Response and ElasticsearchResponseException to ResponseException	2016-06-09 14:38:32 +02:00
javanna	04d620da74	require hosts when creating RestClient.Builder Also fix order of arguments when using assertEquals	2016-06-08 12:37:50 +02:00
Jason Tedor	d896886973	Merge branch 'master' into feature/seq_no * master: (51 commits) Switch QueryBuilders to new MatchPhraseQueryBuilder Added method to allow creation of new methods on-the-fly. more cleanups Remove cluster name from data path Remove explicit parallel new GC flag rehash the docvalues in DocValuesSliceQuery using BitMixer.mix instead of the naive Long.hashCode. switch FunctionRef over to methodhandles ingest: Move processors from core to ingest-common module. Fix some typos (#18746) Fix ut convert FunctionRef/Def usage to methodhandles. Add the ability to partition a scroll in multiple slices. API: use painless types in FunctionRef Update ingest-node.asciidoc compute functional interface stuff in Definition Use method name in bootstrap check might fork test Make checkstyle happy (add Lookup import, line length) Don't hide LambdaConversionException and behave like real javac compiled code when a conversion fails. This works anyways, because fallback is allowed to throw any Throwable Pass through the lookup given by invokedynamic to the LambdaMetaFactory. Without it real lambdas won't work, as their implementations are private to script class checkstyle have your upper L ...	2016-06-07 17:57:53 -04:00
Martijn van Groningen	f611f1c99e	ingest: Move processors from core to ingest-common module. Folded grok processor into ingest-common module. The rest tests have been moved to ingest-common module as well, because these tests don't run in the rest-api-spec module but in the distribution:integ-test-zip module and adding a test plugin there felt just wrong to me. I think this is ok. I left a tiny ingest rest test behind in that tests with an empty pipeline. Removed messy tests, these tests were already covered in the rest tests Added ingest test plugin in test infra so that each module testing integration with ingest doesn't need write its own plugin Moved reindex ingest tests to qa module Closes #18490	2016-06-07 17:32:52 +02:00
Jason Tedor	da74323141	Register thread pool settings This commit refactors the handling of thread pool settings so that the individual settings can be registered rather than registering the top level group. With this refactoring, individual plugins must now register their own settings for custom thread pools that they need, but a dedicated API is provided for this in the thread pool module. This commit also renames the prefix on the thread pool settings from "threadpool" to "thread_pool". This enables a hard break on the settings so that: - some of the settings can be given more sensible names (e.g., the max number of threads in a scaling thread pool is now named "max" instead of "size") - change the soft limit on the number of threads in the bulk and indexing thread pools to a hard limit - the settings names for custom plugins for thread pools can be prefixed (e.g., "xpack.watcher.thread_pool.size") - remove dynamic thread pool settings Relates #18674	2016-06-06 22:09:12 -04:00
Areek Zillur	d96fe20e3a	add named writable registry glue	2016-06-06 16:11:46 -04:00
Jason Tedor	a60b8948ba	Merge branch 'master' into feature/seq_no * master: (184 commits) Add back pending deletes (#18698) refactor matrix agg documentation from modules to main agg section Implement ctx.op = "delete" on _update_by_query and _reindex Close SearchContext if query rewrite failed Wrap lines at 140 characters (:qa projects) Remove log file painless: Add support for the new Java 9 MethodHandles#arrayLength() factory (see https://bugs.openjdk.java.net/browse/JDK-8156915) More complete exception message in settings tests Use java from path if JAVA_HOME is not set Fix uncaught checked exception in AzureTestUtils [TEST] wait for yellow after setup doc tests (#18726) Fix recovery throttling to properly handle relocating non-primary shards (#18701) Fix merge stats rendering in RestIndicesAction (#18720) [TEST] mute RandomAllocationDeciderTests.testRandomDecisions Reworked docs for index-shrink API (#18705) Improve painless compile-time exceptions Adds UUIDs to snapshots Add test rethrottle test case for delete-by-query Do not start scheduled pings until transport start Adressing review comments ...	2016-06-06 11:16:22 -04:00
Yannick Welsch	0a8afa2e72	Add back pending deletes (#18698 ) Triggering the pending deletes logic was accidentally removed in the clean up PR #18602.	2016-06-06 15:14:09 +02:00
javanna	a461dd84d2	Build: add hamcrest and securemock to version.properties	2016-06-06 15:02:52 +02:00
Boaz Leskes	4844325921	Introduced Global checkpoints for Sequence Numbers (#15485 ) Global checkpoints are updated by the primary and represent the common part of history across shard copies, as known at a given time. The primary is also in charge of periodically broadcasting this information to the replicas. See #10708 for more details.	2016-06-06 12:53:04 +02:00
javanna	56e689e1b3	[TEST] remove unused method	2016-06-04 01:05:53 +02:00
javanna	b15279b5ef	Allow to pass socket factory registry to createDefaultHttpClient method	2016-06-03 23:59:26 +02:00
javanna	b891c46657	[TEST] remove status matcher and hasStatus assertion All it does is checking the status code of a response, which can be done with a single line in each test	2016-06-03 23:25:17 +02:00
javanna	f17f0f9247	rename ElasticsearchResponse#getFirstHeader to getHeader	2016-06-03 18:28:31 +02:00
javanna	23a94bb974	[TEST] create standard RestClient at first request and reuse it A RestClient instance is now created whenever EsIntegTestCase#getRestClient is invoked for the first time. It is then kept until the cluster is cleared (depending on the cluster scope of the test). Renamed other two restClient methods to createRestClient, as that instance needs to be closed and managed in the tests.	2016-06-03 18:00:54 +02:00
javanna	e81aad972a	remove usage of deprecated api	2016-06-03 16:01:07 +02:00
javanna	eae914ae8e	Replace rest test client with low level RestClient We still have a wrapper called RestTestClient that is very specific to Rest tests, as well as RestTestResponse etc. but all the low level bits around http connections etc. are now handled by RestClient.	2016-06-03 16:01:07 +02:00
javanna	325b723930	[TEST] add rest client test dependency and replace usage of HttpRequestBuilder with RestClient in integration tests	2016-06-03 16:01:07 +02:00
Ali Beyad	b720216395	Adds UUIDs to snapshots This commit adds a UUID for each snapshot, in addition to the already existing repository and snapshot name. The addition of UUIDs will enable more robust handling of the deletion of previous snapshots and lingering files from partially failed delete operations, on top of being able to uniquely track each snapshot. Closes #18228 Relates #18156	2016-06-02 17:01:48 -04:00
Christoph Büscher	9067407cdd	Addressing review comments	2016-06-02 16:19:23 +02:00
Christoph Büscher	e2b6dbc020	Add tests to check that toQuery() doesn't return null	2016-06-02 11:25:56 +02:00
Christoph Büscher	359f45988f	Handle empty query bodies at parse time and remove EmptyQueryBuilder Currently we support empty query clauses like the filter in "constant_score" : { "filter" : { } } How these clauses are handled depends on the surrounding query. They later are either ignored, converted to match all or no documents or passed up further in the query hierarchy. During parsing these clauses are currently represented as EmptyQueryBuilders. When not handled anywhere else, these special cases need to be checked for on the shard when building the lucene query. This is trappy, so this PR changes the parsing of compound queries. Instead of returning QueryBuilder, the core query parsing method QueryShardContext#parseInnerQueryBuilder() now returns an Optional which can be empty in the case of empty query clauses. This has the advantage of forcing callers to deal with this sooner or later. When encountering empty Optionals, compound query builders now have the choice to ignore them, pass them on or rewrite to a different query, depending on context.	2016-06-02 11:25:56 +02:00
Yannick Welsch	c20bf5d747	[TEST] Fix tests that rely on assumption that data dirs are removed after index deletion (#18681 ) Relates to #18602	2016-06-01 17:02:09 +02:00
Simon Willnauer	88800e8e47	Move PageCacheRecycler into BigArrays (#18666 ) PageCacheRecycler is really just an implementation detail of BigArrays. There is no need to leak this class anywhere outside of it.	2016-06-01 09:43:11 +02:00
Ali Beyad	0efac76f01	Clarify the semantics of the BlobContainer interface This commit clarifies the behavior that must be adhered to by any implementors of the BlobContainer interface. This is done through expanded Javadocs. Closes #18157 Closes #15580	2016-05-31 19:22:55 -04:00
Jason Tedor	e21d8b31f1	Remove thread pool from page cache recycler The page cache recycler has a dependency on thread pool that was there for historical reasons but is no longer needed. This commit removes this now unneeded dependency. Relates #18664	2016-05-31 14:51:58 -04:00
Simon Willnauer	502a775a7c	Add primitive to shrink an index into a single shard (#18270 ) This adds a low level primitive operation to shrink an existing index into a new index with a single shard. This primitive expects all shards of the source index to be allocated on a single node. Once the target index is initializing on the shrink node it takes a snapshot of the source index shards and copies all files into the target index's data folder. An [optimization](https://issues.apache.org/jira/browse/LUCENE-7300) coming in Lucene 6.1 will also allow for optional constant time copy if hard-links are supported by the filesystem. All mappings are merged into the new index's metadata once the snapshots have been taken on the merge node. To shrink an existing index all shards must be moved to a single node (one instance of each shard) and the index must be read-only: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_settings' -d '{ "settings" : { "index.routing.allocation.require._name" : "shrink_node_name", "index.blocks.write" : true } }' ``` Once all shards are started on the shrink node, the new index can be created via: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_shrink/logs_single_shard' -d '{ "settings" : { "index.codec" : "best_compression", "index.number_of_replicas" : 1 } }' ``` This API will perform all needed checks before the new index is created and selects the shrink node based on the allocation of the source index. This call returns immediately, to monitor shrink progress the recovery API should be used since all copy operations are reflected in the recovery API with byte copy progress etc. The shrink operation does not modify the source index, if a shrink operation should be canceled or if the shrink failed, the target index can simply be deleted and all resources are released.	2016-05-31 10:41:44 +02:00
Boaz Leskes	318a4e3ef6	Introduce dedicated master nodes in testing infrastructure (#18514 ) This PR changes the InternalTestCluster to support dedicated master nodes. The creation of dedicated master nodes can be controlled using a new `supportsMasterNodes` parameter to the ClusterScope annotation. If set to true (the default), dedicated master nodes will randomly be used. If set to false, no master nodes will be created and data nodes will also be allowed to become masters. If active, test runs will either have 1 or 3 master nodes	2016-05-27 08:44:20 +02:00
Yannick Welsch	31b0777c91	Simplify delayed shard allocation (#18351 ) This commit simplifies the delayed shard allocation implementation by assigning clear responsibilities to the various components that are affected by delayed shard allocation: - UnassignedInfo gets a boolean flag delayed which determines whether assignment of the shard should be delayed. The flag gets persisted in the cluster state and is thus available across nodes, i.e. each node knows whether a shard was delayed-unassigned in a specific cluster state. Before, nodes other than the current master were unaware of that information. - This flag is initially set as true if the shard becomes unassigned due to a node leaving and the index setting index.unassigned.node_left.delayed_timeout being strictly positive. From then on, unassigned shards can only transition from delayed to non-delayed, never in the other direction. - The reroute step is in charge of removing the delay marker (comparing timestamp when node left to current timestamp). - A dedicated service DelayedAllocationService, reacting to cluster change events, has the responsibility to schedule reroutes to remove the delay marker. Closes #18293	2016-05-26 13:39:55 +02:00
Adrien Grand	cad959b980	Validate parameters of native sig score scripts so that we know which ones are not set.	2016-05-26 10:07:38 +02:00
Jason Tedor	9d39b05845	Remove deprecation suppression Failing the build on deprecation warnings was removed in `19b3ec88af`. This commit removes the suppressed deprecation warnings so that their use is surfaced in the build now. Relates #18582	2016-05-25 17:15:36 -04:00
Nik Everett	bef1c8511d	s/tests.logger.level/tests.es.logger.level/ This is a leftover spot that wasn't changed. It was breaking ClusterSettingsIT#ClusterSettingsIT because that test expected the test's log level to default to the default logger level for the nodes.	2016-05-24 13:25:16 -04:00
Martijn van Groningen	27cc2fe4dc	Moved the percolator from core to its own module Significant changes: * AbstractQueryTestCase has moved to the test framework module, so that query builder tests in modules and plugins can use it * Added support to AbstractQueryTestCase to register plugins * Lift the restriction that only one percolator could be added per index. This validation existed in MapperService, but because the percolator moved to a module it could no longer exist there. Instead of bringing it back it was removed. This validation existed because the percolator cache only supported one percolator query per document; since the percolator cache has been removed, this restriction could be removed as well. * While moving percolator tests to the new module, also removed a couple of tests for the deprecated percolate and mpercolate api. These APIs are now sugar APIs for bwc and redirect to the search and msearch APIs. Some tests were still testing as if percolate and mpercolate API did the percolation, but this is no longer the case and these tests could be removed.	2016-05-24 11:01:57 +02:00
Ryan Ernst	f6074d383b	Merge pull request #18532 from rjernst/less_assert_busy Tests: Remove unnecessary Callable variant of assertBusy	2016-05-23 17:11:54 -07:00
Chris Earle	b49635539d	Remove support for -Des.* system properties in integration tests This now requires that system properties passed to Gradle must be in the form of "-Dtests.es." instead of "-Des.". It then chops off "tests.es." and passes that as a "-E" property to Elasticsearch. Also changed system properties: - `tests.logger.level` became `tests.es.logger.level` - `node.mode` became `tests.es.node.mode` - `node.local` became `tests.es.node.local`	2016-05-23 19:38:21 -04:00
Ryan Ernst	c7b45b2cc7	Tests: Remove unnecessary Callable variant of assertBusy The assertBusy method currently has both a Runnable and Callable version. This has caused confusion with type inference and lambdas sometimes, in particular with java 9. This change removes the callable version as nothing was actually using it.	2016-05-23 16:17:43 -07:00
Jason Tedor	f63d1255d1	Cleanup settings and system properties entanglement This commit cleans up some additional places where system properties were being used to pass settings to Elasticsearch. Relates #18524	2016-05-23 14:47:22 -04:00
Luca Cavanna	d2afe759a7	prevent registration of duplicated rest spec (#18504 ) Rather than having one win against the other, reject duplicated apis. Also enforce the convention that the api name is the same as the name of the rest spec file that defines it.	2016-05-23 12:17:42 +02:00
Jason Tedor	ad7229fe72	Merge branch 'master' into feature/seq_no * master: (158 commits) Document the hack Refactor property placeholder use of env. vars Force java9 log4j hack in testing Fix log4j buggy java version detection Make java9 work again Don't mkdir directly in deb init script Fix env. var placeholder test so it's reproducible Remove ScriptMode class in favor of boolean true/false [rest api spec] fix doc urls Netty request/response tracer should wait for send Filter client/server VM options from jvm.options [rest api spec] fix url for reindex api docs Remove use of a Fields class in snapshot responses that contains x-content keys, in favor of declaring/using the keys directly. Limit retries of failed allocations per index (#18467) Proxy box method to use valueOf. Use the build-in valueOf method instead of the custom one. Fixed tests and added a comment to the box method. Fix boxing. Do not decode path when sending error Fix race condition in snapshot initialization ...	2016-05-21 21:04:43 -04:00
Ryan Ernst	37d36f2f4c	Merge branch 'master' into java9	2016-05-21 14:19:58 -07:00
Ryan Ernst	41a5c0cfa1	Force java9 log4j hack in testing	2016-05-21 13:41:38 -07:00
Ryan Ernst	1d40c4bbc1	Make java9 work again This change makes ES compile with java9 again, build 118. * There are a handful of changes due to failure to determine types during compile. * The attachment plugins which use tika needed to have tika upgraded in order to pickup fixes there for java 9. * azure discovery and s3 repository indirectly depend on jaxb, which is no longer in the default modules. They now add a jaxb dependency externally, and make JarHell allow for this package.	2016-05-21 09:41:51 -07:00
Lee Hinman	fdfd2a2f18	Remove ScriptMode class in favor of boolean true/false This removes the ScriptMode class entirely, which was an enum with two options (ON and OFF) which essentially boiled down to true and false. Now the boolean values are used instead.	2016-05-20 15:01:30 -06:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime. Today these changes are visible in near realtime, so updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive, especially when there are many percolator queries, since these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00
Luca Cavanna	fcee329332	update http client version to 4.5.2 and http-core 4.4.4 (#18399 ) StrictHostnameVerifier can now be removed	2016-05-20 12:02:42 +02:00
Jason Tedor	c257e2c51f	Remove settings and system properties entanglement Today when parsing settings during bootstrap, we add a system property for every Elasticsearch setting. Additionally, settings can be set via system properties. This commit simplifies this situation. - settings are no longer propagated to system properties - system properties can not be used to set settings - the "es." prefix on settings is no longer required (nor permitted) - test logging has a dedicated system property (tests.logger.level) Relates #18198	2016-05-19 14:08:08 -04:00
Christoph Büscher	d2515727d0	Improve random DateTimeZone creation in tests We often require a random joda DateTimeZone in our tests. Currently there are a few options for generating such a random DateTimeZone from the set of available ids. Currently most random picks are not really reproducible across different jvms because they rely on order in the ids set implementation. The helper in DateProcessorFactoryTests thus performs a sort on the set of ids before random picking from the result, so I moved this to ESTestCase to make it publicly available and changed all other tests to use that method.	2016-05-19 18:12:48 +02:00
Tanguy Leroux	35d3bdab84	Add Google Cloud Storage repository plugin Closes #12880	2016-05-19 13:26:23 +02:00
Jason Tedor	ecce53f0df	Add I/O statistics on Linux This commit adds a variety of real disk metrics for the block devices that back Elasticsearch data paths. A collection of statistics are read from /proc/diskstats and are used to report the raw metrics for operations and read/write bytes. Relates #15915	2016-05-17 16:16:39 -04:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us from storing them more efficiently in the future, e.g. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Jason Tedor	15d3d74444	Merge branch 'master' into feature/seq_no * master: (904 commits) Removes unused methods in the o/e/common/Strings class. Add note regarding thread stack size on Windows painless: restore accidentally removed test Documented fuzzy_transpositions in match query Add not-null precondition check in BulkRequest Build: Make run task you full zip distribution Build: More pom generation improvements Add test for wrong array index Take return type from "after" field. painless: build descriptor of array and field load/store in code; fix array index to adapt type not DEF Build: Add developer info to generated pom files painless: improve exception stacktraces painless: Rename the dynamic call site factory to DefBootstrap and make the inner class very short (PIC = Polymorphic Inline Cache) Remove dead code. Avoid race while retiring executors Allow only a single extension for a scripting engine Adding REST tests to ensure key_as_string behavior stays consistent [test] Set logging to 11 on reindex test [TEST] increase logger level until we know what is going on Don't allow `fuzziness` for `multi_match` types cross_fields, phrase and phrase_prefix ...	2016-05-14 20:23:59 -04:00
Robert Muir	2028691e66	painless: improve exception stacktraces closes #18319	2016-05-13 15:40:45 -04:00
Lee Hinman	9bcdafedda	Allow only a single extension for a scripting engine Previously multiple extensions could be provided, however, this can lead to confusion with on-disk scripts (ie, "foo.js" and "foo.javascript") having different content. Only a single extension is now supported. The only language currently supporting multiple extensions was the Javascript engine ("js" and "javascript"). It now only supports the `.js` extension. Relates to #10598	2016-05-13 09:54:31 -06:00
Lee Hinman	efff3918d8	Remove support for multiple languages per scripting engine	2016-05-13 09:24:31 -06:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
Yannick Welsch	7753420540	Make ShardRouting and UnassignedInfo immutable (#17821 ) This makes defensive copying of ShardRouting objects obsolete whenever we do a reroute and trashes less objects.	2016-05-10 19:11:04 +02:00
Nik Everett	ddc531e729	Build a plugin for testing docs This makes it much easier to apply to other projects. Fixes to doc tests infrastructure: * Fix comparing lists. Was totally broken. * Fix order of actual vs expected parameters. * Allow multiple `// TESTRESPONSE` lines with substitutions to join into one big list of substitutions. This lets the docs look tidier. * Exclude build from snippet scanning * Allow subclasses of ESRestTestCase access to the admin execution context	2016-05-09 14:07:27 -04:00
Nik Everett	b7d02fbd1e	Improve logging of raw rest actions on failure Log the method and the path.	2016-05-09 13:04:33 -04:00
Nik Everett	ef2e3a8c39	Rest tests: More defense around stashing body Integration tests failed: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/483/console We'll see if the rest tests were hiding some other failure.	2016-05-09 09:52:23 -04:00
Chris Earle	5be79ed02c	Add Failure Details to every NodesResponse Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions. - This adds a helper function to do the grouping to TransportNodesAction - Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor - Reads/writes the array to output - Also adds StreamInput and StreamOutput methods for generically reading and writing arrays	2016-05-06 14:59:43 -04:00
Christoph Büscher	7d14728960	Add xContent shuffling to some more tests This adds some random shuffling of xContent to some more test cases. Relates to #5831	2016-05-06 10:46:39 +02:00
Adrien Grand	de8354dd7f	Allow binary sort values. #17959 The `ip` field uses a binary representation internally. This breaks when rendering sort values in search responses since elasticsearch tries to write a binary byte[] as an utf8 json string. This commit extends the `DocValueFormat` API in order to give fields a chance to choose how to render values. Closes #6077	2016-05-06 09:27:02 +02:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Jason Tedor	784c9e5fb9	Introduce node handshake This commit introduces a handshake when initiating a light connection. During this handshake, node information, cluster name, and version are received from the target node of the connection. This information can be used to immediately validate that the target node is a member of the same cluster, and used to set the version on the stream. This will allow us to extend APIs that are used during initial cluster recovery without a major version change. Relates #15971	2016-05-04 20:06:47 -04:00
Jason Tedor	78d615f320	Merge pull request #18110 from jasontedor/strings-split-as-array Remove Strings#splitStringToArray Remove arbitrary separator/wildcard from PathTrie	2016-05-04 09:38:47 -04:00
Jason Tedor	2dea449949	Remove Strings#splitStringToArray This commit removes the method Strings#splitStringToArray and replaces the call sites with invocations to String#split. There are only two explanations for the existence of this method. The first is that String#split is slightly tricky in that it accepts a regular expression rather than a character to split on. This means that if s is a string, s.split(".") does not split on the character '.', but rather splits on the regular expression '.' which splits on every character (of course, this is easily fixed by invoking s.split("\\.") instead). The second possible explanation is that (again) String#split accepts a regular expression. This means that there could be a performance concern compared to just splitting on a single character. However, it turns out that String#split has a fast path for the case of splitting on a single character and microbenchmarks show that String#split has 1.5x--2x the throughput of Strings#splitStringToArray. There is a slight behavior difference between Strings#splitStringToArray and String#split: namely, the former would return an empty array in cases when the input string was null or empty but String#split will just NPE at the call site on null and return a one-element array containing the empty string when the input string is empty. There was only one place relying on this behavior and the call site has been modified accordingly.	2016-05-04 08:12:41 -04:00
Isabel Drost-Fromm	a8bf75983f	Merge branch 'master' into tests/switch_to_random_value_other_than_for_sort	2016-05-04 10:24:46 +02:00
Daniel Mitterdorfer	0a6f40c7f5	Enable HTTP compression by default with compression level 3 With this commit we compress HTTP responses provided the client supports it (as indicated by the HTTP header 'Accept-Encoding'). We're also able to process compressed HTTP requests if needed. The default compression level is lowered from 6 to 3 as benchmarks have indicated that this reduces query latency with a negligible increase in network traffic. Closes #7309	2016-05-03 08:53:15 +02:00
Isabel Drost-Fromm	372eceb854	Switch to using predicate for testing existing value	2016-05-02 15:41:05 +02:00
Isabel Drost-Fromm	47fefdd273	Switch from separate sort_mode to more general randomValueOtherThan ... for sort tests only ...	2016-04-28 14:45:56 +02:00
Jason Tedor	efeec4d096	Merge pull request #17017 from jasontedor/generic-thread-pool Actually bound the generic thread pool	2016-04-26 08:27:48 -04:00
Alexander Reelsen	486c783f08	Testing: Remove unused junit rule (#17947 ) This rule was used to repeat failed tests due to binding on an already bound port. The test has been fixed so we can get rid of this rule as well.	2016-04-26 09:53:49 +02:00
Adrien Grand	31a9845bc2	Remove the `SearchType` setter on `SearchContext`. #17955 It was not used.	2016-04-26 09:08:37 +02:00
Ali Beyad	d39eb2d691	Adds tombstones to cluster state for index deletions Previously, we would determine index deletes in the cluster state by comparing the index metadatas between the current cluster state and the previous cluster state and decipher which ones were missing (the missing ones are deleted indices). This led to a situation where a node that went offline and rejoined the cluster could potentially cause dangling indices to be imported which should have been deleted, because when a node rejoins, its previous cluster state does not contain reliable state. This commit introduces the notion of index tombstones in the cluster state, where we are explicit about which indices have been deleted. In the case where the previous cluster state is not useful for index metadata comparisons, a node now determines which indices are to be deleted based on these tombstones in the cluster state. There is also functionality to purge the tombstones after exceeding a certain amount. Closes #17265 Closes #16358 Closes #17435	2016-04-25 15:43:20 -04:00
Jason Tedor	5608fa7ac1	Actually bound the generic thread pool This commit actually bounds the size of the generic thread pool. The generic thread pool was of type cached, a thread pool with an unbounded number of workers and an unbounded work queue. With this commit, the generic thread pool is now of type scaling. As such, the cached thread pool type has been removed. By default, the generic thread pool is constructed with a core pool size of four, a max pool size of 128 and idle workers can be reaped after a keep-alive time of thirty seconds expires. The work queue for this thread pool remains unbounded.	2016-04-25 06:47:26 -04:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Nik Everett	65f6f6bc8d	Normalize registration for SignificanceHeuristics When I pulled on the thread that is "Remove PROTOTYPEs from SignificanceHeuristics" I ended up removing SignificanceHeuristicStreams and replacing it with readNamedWriteable. That seems like a lot at once but it made sense at the time. And it is what we want in the end, I think. Anyway, this also converts registration of SignificanceHeuristics to use ParseFieldRegistry to make them consistent with Queries, Aggregations and lots of other stuff. Adds a new and wondrous hack to support serialization checking of NamedWriteables registered by plugins! Related to #17085	2016-04-19 09:47:37 -04:00
Daniel Mitterdorfer	3688629e11	Adjust line-length of transport related classes to coding standard	2016-04-15 10:12:24 +02:00
Ali Beyad	b87fd54ba9	Improvements to the IndicesService class This commit contains the following improvements/fixes: 1. Renaming method names and variables to better reflect the purpose of the method and the semantics of the variable. 2. For deleting indexes, replace the closed parameter passed to the delete index/store methods with obtaining the index's state from the IndexSettings that is already passed in. 3. Added tests to the IndexWithShadowReplicaIT suite, some of which show issues in the shadow replica delete process that are captured in Github issue 17695. Closes #17638	2016-04-14 11:14:02 -04:00
Nik Everett	64f5a4f848	Stop map collisions on FiltersTests Adds randomUnique to generate unique things and uses it to make unique keys. The offending seed was 81AE616FEAD10F17.	2016-04-13 08:35:45 -04:00
Daniel Mitterdorfer	117bc68af3	Limit request size on HTTP level With this commit we limit the size of all in-flight requests on HTTP level. The size is guarded by the same circuit breaker that is also used on transport level. Similarly, the size that is used is HTTP content length. Relates #16011	2016-04-13 09:58:08 +02:00
Daniel Mitterdorfer	52b2016447	Limit request size on transport level With this commit we limit the size of all in-flight requests on transport level. The size is guarded by a circuit breaker and is based on the content size of each request. By default we use 100% of available heap meaning that the parent circuit breaker will limit the maximum available size. This value can be changed by adjusting the setting network.breaker.inflight_requests.limit Relates #16011	2016-04-13 09:54:59 +02:00
Adrien Grand	226644ea2c	Do not assume term queries use the inverted index. #17532 We have a couple places in the code base that assume that search is always done on the inverted index. However with the new points API in Lucene 6, this is not true anymore. This commit makes MappedFieldType.indexedValueForSearch protected and fixes call sites to keep working for field types that use the inverted index and either work differently or throw an exception otherwise. For instance, it will still be possible to run cross_fields multi match queries on numeric fields, but the score contributions will not be blended as well as before, and significant terms aggregations on long terms will not be possible anymore since points do not record document frequencies.	2016-04-12 09:47:20 +02:00
Adrien Grand	0eb1a816c8	Allow the query cache to be disabled. #16268 This replaces the internal `index.queries.cache.type` setting with a new `index.queries.cache.enabled` setting, which is documented. Closes #15802	2016-04-11 18:06:16 +02:00
Alexander Reelsen	da19ddf3e6	Ingest Attachment: Allow to prevent base64 conversions by using raw bytes (#16601 ) CBOR is natively supported in Elasticsearch and allows for byte arrays. This means, that by using CBOR the user can prevent base64 conversions for the data being sent back and forth. This PR adds support to extract data from a byte array in addition to a string. This also required to add a ByteArrayValueSource class.	2016-04-11 14:14:56 +02:00
David Pilato	1e346d1ac1	Merge branch 'fix/17625-close-ingest-factory'	2016-04-11 10:00:19 +02:00
Nik Everett	525ce40d1c	Give SearchContext a toString and move the string capturing to capture time.	2016-04-10 20:55:31 -04:00
Nik Everett	ac94e5f287	Provide more information about open contexts Sometimes we get a test failure caused by search contexts left open. The tests include a stack trace of the call that opened the context but nothing else about the context. This adds more information about the context that has been left open like what query it was running, what shard it targeted, and whether or not it was a scroll. Relates to #17582	2016-04-10 20:55:31 -04:00
David Pilato	24f48b86b5	Update after review and add a Test	2016-04-09 13:14:25 +02:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
Chris Earle	d97d5ebb8b	Remove hostname from NetworkAddress.format This removes the inconsistent output of IP addresses. The format was parsing-unfriendly and it makes it hard to reason about API responses, such as to _nodes. With this change in place, it will never print the hostname as part of the default format, which has the added benefit that it can be used consistently for URIs, which was not the case when the hostname might appear at the front with "hostname/ip:port".	2016-04-07 17:27:59 -04:00
Adrien Grand	c33300c543	Make MappedFieldType responsible for providing a parser/formatter. #17546 Aggregations need to perform instanceof calls on MappedFieldType instances in order to know how they should be parsed or formatted. Instead, we should let the field types provide a formatter/parser that can be used.	2016-04-07 16:57:50 +02:00
jaymode	f9d1e8a5f3	Root rest api delegates to a transport action This change makes the root (/) rest api delegate to a transport action to get the data for the response. This aligns this rest api with all of the other apis, which delegate to one or more actions. In doing this, unit tests were added to provide coverage of the RestMainAction and the associated classes.	2016-04-07 10:03:49 -04:00
Jason Tedor	0a69985153	Merge pull request #17038 from jasontedor/enable_acked Prepare for enabling acked indexing	2016-04-06 18:13:28 -04:00
Jimmy Jones	f157dae053	Disallow unquoted field names, fix testcases using unquoted JSON	2016-04-06 14:37:15 -06:00
Clinton Gormley	cbbf80ca35	v2.3.0 has been released and no longer needs to be hardcoded as -SNAPSHOT	2016-04-04 19:03:43 +02:00
Jason Tedor	c7c8b1d825	Merge branch 'master' into enable_acked * master: (156 commits) Make JNA calls optional Added RPM metadata Remove PROTOTYPE from MLT.Item Remove PROTOTYPE from VersionType Fix mistake in TopHits change Remove PROTOTYPEs from highlighting Clean up some log messages Command line arguments with comma must be quoted on windows Cluster Health should run on applied states, even if waitFor=0 #17440 ingest: make concrete processor impl final, like all other processor concrete impls. Improve some test method comments. Document task id's as string in the rest spec Replace FieldStatsProvider with a method on MappedFieldType. #17334 cleanup test Remove MathUtils. #17454 Addressing review comments fix javadocs Make TranslogConfig immutable and pass TranslogGeneration as a ctor arg to Translog [reindex] Don't get rejected Remove redundant commit - #openTranslog() already commits in that case ...	2016-04-02 13:56:00 -04:00
Christoph Büscher	9d68a515b8	Merge pull request #17453 from cbuescher/add-xcontent-randomization Add randomization of XContentBuilder output to query tests	2016-04-01 15:02:01 +02:00
Christoph Büscher	7a1b06ce0b	Improve some test method comments.	2016-04-01 11:04:56 +02:00
Christoph Büscher	1a697a1ae6	Addressing review comments	2016-03-31 21:46:17 +02:00
Simon Willnauer	baa2d51e59	Merge pull request #17422 from s1monw/recovery_mem_buffer_access Move translog recover outside of the engine We changed the way we manage engine memory buffers to an open model where each shard can essentially have infinite memory. The indexing memory controller is responsible for moving memory to disk when it's needed. Yet, this doesn't work today when we recover from store/translog since the engine is not fully initialized such that IMC has no access to the engine, neither to its memory buffer nor can it move data to disk. The biggest issue here is that translog recovery happens inside the Engine constructor which is problematic by itself since it might take minutes and uses a not yet fully initialized engine to perform write operations on. This change detaches the translog recovery and makes it the responsibility of the caller to run it once the engine is fully constructed or skip it if not necessary.	2016-03-31 21:03:00 +02:00
Christoph Büscher	bbb6d91147	Add randomization of XContentBuilder output to query tests Currently our testing of parsing query builders is limited to the default order of the parameters that each builders toXContent() method produces. To better test real queries where the order of parameters can be different, this change adds a helper method to ESTestCase that takes a XContentBuilder and randomly shuffles the order of the fields inside an object. This is used in AbstractQueryTestCase, but it can be used in other similar places in the future.	2016-03-31 18:17:39 +02:00
Simon Willnauer	1e06139584	Move translog recover outside of the engine We changed the way we manage engine memory buffers to an open model where each shard can essentially have infinite memory. The indexing memory controller is responsible for moving memory to disk when it's needed. Yet, this doesn't work today when we recover from store/translog since the engine is not fully initialized such that IMC has no access to the engine, neither to its memory buffer nor can it move data to disk. The biggest issue here is that translog recovery happens inside the Engine constructor which is problematic by itself since it might take minutes and uses a not yet fully initialized engine to perform write operations on. This change detaches the translog recovery and makes it the responsibility of the caller to run it once the engine is fully constructed or skip it if not necessary.	2016-03-30 23:24:24 +02:00
javanna	b9f9b2e3ee	Merge branch 'master' into enhancement/discovery_node_one_getter	2016-03-30 17:22:40 +02:00
javanna	62ac7d219f	Remove DiscoveryNodes#masterNode in favour of existing DiscoveryNodes#getMasterNode	2016-03-30 15:28:32 +02:00
javanna	f8b5d1f5b0	Remove DiscoveryNodes#masterNodeId in favour of existing DiscoveryNodes#getMasterNodeId	2016-03-30 15:28:06 +02:00
javanna	2dbba45f2c	Rename static DiscoveryNode#localNode(Settings) to DiscoveryNode#isLocalNode(Settings)	2016-03-30 15:27:26 +02:00
javanna	49e952e272	Rename static DiscoveryNode#dataNode(Settings) to isDataNode	2016-03-30 15:26:41 +02:00
javanna	2230fec9ea	Rename static DiscoveryNode#masterNode(Settings) to isMasterNode	2016-03-30 15:26:10 +02:00
javanna	a8bbdff3bc	Remove DiscoveryNode#name in favour of existing DiscoveryNode#getName	2016-03-30 14:47:36 +02:00
javanna	9889f10e5e	Remove DiscoveryNode#id in favour of existing DiscoveryNode#getId	2016-03-30 14:42:15 +02:00
Camilo Diaz Repka	7be11a36cd	Refactor: replace all occurrences of ESTestCase.getRandom() with random(). Remove getRandom().	2016-03-29 23:18:05 -04:00
javanna	061f09d9a4	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 20:19:33 +02:00
Jason Tedor	c4324f9964	Merge branch 'master' into enable_acked * master: (25 commits) Replication operation that try to perform the primary phase on a replica should be retried split long line in ConvertProcessorTests add type conversion support to ConvertProcessor percolator: Make explain use the two phase iterator test: make sure we don't flush during indexing the percolator queries Added experimental annotation to the update-by-query and reindex docs Fixed bad YAML in reindex REST test: 50_routing.yaml Update-by-query rest tests: fixed bad yaml and deleted a client-dependent test Prevents exception being raised when ordering by an aggregation which wasn't collected The reindex body is now required, which changes the exception thrown by the REST test Docs: Included Nodes Task API and tidied reindex/update-by-query Rename update-by-query REST tests to update_by_query REST: The body is required in the reindex API The source parameter should not be defined in the delete-by-query REST spec Renamed update-by-query REST spec to update_by_query Fix test bug in TypeQueryBuilderTests. Add comment why it is safe to check the number of nested fields in MapperService.merge. Automatically add a sub keyword field to string dynamic mappings. #17188 Type filters should not have a performance impact when there is a single type. #17350 Add API to explain why a shard is or isn't assigned ...	2016-03-29 11:42:34 -04:00
Colin Goodheart-Smithe	ff3fd99074	Prevents exception being raised when ordering by an aggregation which wasn't collected If a terms aggregation was ordered by a metric nested in a single bucket aggregator which did not collect any documents (e.g. a filters aggregation which did not match in that term bucket) an ArrayIndexOutOfBoundsException would be thrown when the ordering code tried to retrieve the value for the metric. This change fixes all numeric metric aggregators so they return their default value when a bucket ordinal is requested which was not collected. Closes #17225	2016-03-29 13:28:03 +01:00
javanna	de5cbda8e7	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 10:48:47 +02:00
Lee Hinman	80ab366de4	Add API to explain why a shard is or isn't assigned This adds a new `/_cluster/allocation/explain` API that explains why a shard can or cannot be allocated to nodes in the cluster. Additionally, it will show where the master desires to put the shard, according to the `ShardsAllocator`. It looks like this: ``` GET /_cluster/allocation/explain?pretty { "index": "only-foo", "shard": 0, "primary": false } ``` Though, you can optionally send an empty body, which means "explain the allocation for the first unassigned shard you find". The output when a shard is unassigned looks like this: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : false }, "assigned" : false, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-03-22T20:04:23.620Z" }, "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 0.06666675, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "NO", "weight" : -1.3833332, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 2.3166666, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` And when the shard is assigned, the output looks like: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : true }, "assigned" : true, "assigned_node_id" : "Qc6VL8c5RWaw1qXZ0Rg57g", "nodes" : { 
"V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 1.4499999, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "CURRENTLY_ASSIGNED", "weight" : 0.0, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 3.6999998, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` Only "NO" decisions are returned by default, but all decisions can be shown by specifying the `?include_yes_decisions=true` parameter in the request. Resolves #14593	2016-03-28 15:21:02 -06:00
Jason Tedor	4793630eb8	Merge branch 'master' into enable_acked * master: (419 commits) Remove PROTOTYPE from ShapeBuilders Take filterNodeIds into consideration while sending tasks actions requests to nodes test: cleanup imports and method rename Remove PROTOTYPE from SortBuilders percolator: Add query extract support for the blended term query and the common terms query. Don't iterate over shard routing if it's null [TEST] Reduce size of random shapes Add some debug logging to testPrimaryRelocationWhileIndexing Order methods in IndicesClusterStateService according to execution Tidied up percolator doc annotations In cat.snapshots, repository is required Do not retrieve all indices stats when checking for cache resets Enforce `discovery.zen.minimum_master_nodes` is set when bound to a public ip #17288 Port Primary Terms to master #17044 Revert "Add debug logging for Vagrant upgrade test" Ownership for data, logs, and configs for packages add on_failure exception metadata to ingest document for verbose simulate Revert "Merge pull request #16843 from xuzha/s3-encryption" Update Format, add new settings into the setting test Update and rebase the init implementation. ...	2016-03-28 12:29:53 -04:00
javanna	a9f4982c40	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-25 20:16:40 +01:00
javanna	93ce36a198	separated attributes from node roles in DiscoveryNode Node roles are now serialized as well, they are not part of the node attributes anymore. DiscoveryNodeService takes care of dividing settings into attributes and roles. DiscoveryNode always requires to pass in attributes and roles separately.	2016-03-25 20:14:27 +01:00
Boaz Leskes	dcd2642dad	Merge branch 'master' into feature/seq_no	2016-03-25 17:26:14 +01:00
Boaz Leskes	91021e3019	merge from master	2016-03-25 15:50:48 +01:00
Tanguy Leroux	a22529cceb	Do not retrieve all indices stats when checking for cache resets	2016-03-25 13:16:12 +01:00
Boaz Leskes	fe43eef1b5	Port Primary Terms to master #17044 Primary terms are a way to make sure that operations replicated from a stale primary are rejected by shards following a newly elected primary. Original PRs adding this to the seq# feature branch #14062 , #14651 . Unlike those PRs, here we take a different approach (based on newer code in master) where the primary terms are stored in the meta data only (and not in `ShardRouting` objects). Relates to #17038 Closes #17044	2016-03-25 12:01:00 +01:00
javanna	27d4994aff	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-24 18:10:11 +01:00
Alexander Reelsen	b2573858b6	Version: Set version to 5.0.0-alpha1 Changing the version required a minor fix in the RPM building. In case of an alpha/beta version, the release will contain alpha/beta, as the RPM version cannot contain dashes/tildes.	2016-03-24 08:36:08 +01:00
Honza Král	b139f4e0bf	[TEST] Move yaml test requiring yaml, add skip:yaml Clients don't ship with yaml (de)serializer by default so this test must be optionally skipped	2016-03-23 14:50:23 +01:00
javanna	030453d320	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-23 11:25:34 +01:00
Adrien Grand	e50eeeaffb	Refactor fielddata mappings. #17148 The fielddata settings in mappings have been refactored so that: - text and string have a `fielddata` (boolean) setting that tells whether it is ok to load in-memory fielddata. It is true by default for now but the plan is to make it default to false for text fields. - text and string have a `fielddata_frequency_filter` which contains the same thing as `fielddata.filter.frequency` used to (but validated at parsing time instead of being unchecked settings) - regex fielddata filtering is not supported anymore and will be dropped from mappings automatically on upgrade. - text, string and _parent fields have an `eager_global_ordinals` (boolean) setting that tells whether to load global ordinals eagerly on refresh. - in-memory fielddata is not supported on keyword fields anymore at all. - the `fielddata` setting is not supported on fields other than text and string and will be dropped when upgrading if specified.	2016-03-23 09:48:13 +01:00
Boaz Leskes	7c8cdf4a71	merged from master	2016-03-22 19:21:28 +01:00
Simon Willnauer	1988b8b387	[TEST] Reuse EsTestCase#createAnalysisService in KuromojiAnalysisTests	2016-03-22 13:45:20 +01:00
Boaz Leskes	39ae16bc4c	merge from master	2016-03-22 11:46:26 +01:00
Simon Willnauer	33521fc27c	Detach IndexShard from node services this is the last step to remove node level service from IndexShard. This means that tests can now more easily create an IndexShard instance without starting a node and removes the dependency between IndexShard and Client/ScriptService	2016-03-22 11:02:04 +01:00
javanna	bf390a935e	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-21 17:18:23 +01:00
Boaz Leskes	2d1152ebac	Remove ClusterService interface, in favor of its only production instance #17183 We currently have a ClusterService interface, implemented by InternalClusterService and a couple of test classes. Since the decoupling of the transport service and the cluster service, one can construct a ClusterService fairly easily, so we don't need this extra indirection. Closes #17183	2016-03-21 13:55:10 +01:00
Martijn van Groningen	e3b7e5d75a	percolator: Replace percolate api with the new percolator query Also replaced the PercolatorQueryRegistry with the new PercolatorQueryCache. The PercolatorFieldMapper stores the rewritten form of each percolator query's xcontext in a binary doc values field. This makes sure that the query rewrite happens only during indexing (some queries, for example, fetch shapes or terms in remote indices) and speeds up the loading of the queries in the percolator query cache. Because the percolator now works inside the search infrastructure a number of features (sorting fields, pagination, fetch features) are available out of the box. The following feature requests are automatically implemented via this refactoring: Closes #10741 Closes #7297 Closes #13176 Closes #13978 Closes #11264 Closes #10741 Closes #4317	2016-03-21 12:21:50 +01:00
Boaz Leskes	858610d0d1	merge from master	2016-03-19 13:57:40 +01:00
Simon Willnauer	e91a141233	Prevent index level setting from being configured on a node level Today we allow all kinds of index level settings to be set on the node level, which is error prone and difficult to get right in a consistent manner. For instance if some analyzers are set up in a yaml config file some nodes might not have these analyzers and then index creation fails. Nevertheless, this change allows some selected settings to be specified on a node level, for instance: * `index.codec` which is used in a hot/cold node architecture and its value is really per node or per index * `index.store.fs.fs_lock` which is also dependent on the filesystem a node uses All other index level settings must be specified on the index level. For existing clusters the index must be closed and all settings must be updated via the API on each of the indices. Closes #16799	2016-03-17 14:42:18 +01:00
Christoph Büscher	6ddf9ae92f	Merge branch 'master' into feature-suggest-refactoring	2016-03-16 15:27:02 +01:00
Igor Motov	b10db19595	Bring back tests for missing elements in the diff-serialized cluster state We can add it back now that we improved our compression framework. Closes #11257	2016-03-16 09:13:52 -04:00
Nik Everett	7197172047	[reindex] Properly register status Without this commit fetching the status of a reindex from a node that isn't coordinating the reindex will fail. This commit properly registers reindex's status so this doesn't happen. To do so it moves all task status registration into NetworkModule and creates a method to register other statuses which the reindex plugin calls.	2016-03-16 07:40:49 -04:00
Christoph Büscher	39667b5793	Merge branch 'master' into feature-suggest-refactoring Conflicts: docs/reference/migration/migrate_5_0/java.asciidoc	2016-03-16 12:06:42 +01:00
Jason Tedor	618441aea3	Merge pull request #17088 from jasontedor/simplify-bootstrap-settings Bootstrap does not set system properties	2016-03-15 19:25:16 -04:00
Yannick Welsch	e91fd09692	Enable jdk-system-out Forbidden API checks on test sources	2016-03-15 15:03:37 +01:00
Christoph Büscher	97638c95fc	Merge branch 'master' into feature-suggest-refactoring Conflicts: docs/reference/migration/migrate_5_0.asciidoc	2016-03-14 11:13:47 +01:00
Jason Tedor	8a05c2a2be	Bootstrap does not set system properties Today, certain bootstrap properties are set and read via system properties. This action-at-distance way of managing these properties is rather confusing, and completely unnecessary. But another problem exists with setting these as system properties. Namely, these system properties are interpreted as Elasticsearch settings, not all of which are registered. This leads to Elasticsearch failing to start up if any of these special properties are set. Instead, these properties should be kept as local as possible, and passed around as method parameters where needed. This eliminates the action-at-distance way of handling these properties, and eliminates the need to register these non-setting properties. This commit does exactly that. Additionally, today we use the "-D" command line flag to set the properties, but this is confusing because "-D" is a special flag to the JVM for setting system properties. This creates confusion because some "-D" properties should be passed via arguments to the JVM (so via ES_JAVA_OPTS), and some should be passed as arguments to Elasticsearch. This commit changes the "-D" flag for Elasticsearch settings to "-E".	2016-03-13 20:09:15 -04:00
David Pilato	9acb0bb28c	Merge branch 'master' into pr/16598-register-filter-settings # Conflicts: # core/src/main/java/org/elasticsearch/cluster/service/InternalClusterService.java # core/src/main/java/org/elasticsearch/common/settings/IndexScopedSettings.java # core/src/main/java/org/elasticsearch/common/settings/Setting.java	2016-03-13 14:52:10 +01:00
Ryan Ernst	591fb8f028	Merge branch 'master' into cli-parsing	2016-03-11 10:45:05 -08:00

1579 Commits