OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	7195d1e0ff	Fix plugins service to not double bind plugin components	2016-07-11 17:05:56 -07:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Ryan Ernst	535b60cb2b	Merge pull request #19371 from rjernst/plugin_components Add components getter as bridge between guice and new plugin init world	2016-07-11 14:23:45 -07:00
Ryan Ernst	6b4d0001a2	Remove unnecessary warning suppression	2016-07-11 14:20:47 -07:00
Ryan Ernst	05ea943def	Rename local plugin componenents list for clarity	2016-07-11 14:16:23 -07:00
Ryan Ernst	99ac65931a	Plugins: Add components creator as bridge between guice and new plugin init world This change adds a createComponents() method to Plugin implementations which they can use to return already constructed componenents/services. Eventually this should be just services ("components" don't really do anything), but for now it allows any object so that preconstructed instances by plugins can still be bound to guice. Over time we should add basic services as arguments to this method, but for now I have left it empty so as to not presume what is a necessary service.	2016-07-11 14:14:06 -07:00
Simon Willnauer	048e4416e7	Move netty transport and http into a module This moves all netty code and it's dependency into a module.	2016-07-11 22:21:29 +02:00
Nik Everett	c680bd57da	Fix scroll test It was relying on unreasonably large windows crashing. Those large windows abort the request immediately now.	2016-07-11 16:04:17 -04:00
Ali Beyad	7759c23272	Fix line length formatting for ClusterStateHealthTests	2016-07-11 15:32:13 -04:00
Ali Beyad	0faf638710	Blocked allocations on primary causes RED health If the allocation decision for a primary shard was NO, this should cause the cluster health for the shard to go RED, even if the shard belongs to a newly created index or is part of cluster recovery. Relates #9126	2016-07-11 15:32:13 -04:00
Ali Beyad	417bd0cd63	Index creation does not cause the cluster health to go RED Previously, index creation would momentarily cause the cluster health to go RED, because the primaries were still being assigned and activated. This commit ensures that when an index is created or an index is being recovered during cluster recovery and it does not have any active allocation ids, then the cluster health status will not go RED, but instead be YELLOW. Relates #9126	2016-07-11 15:30:47 -04:00
Nik Everett	3ea1360625	Limit batch size when scrolling Limits the batch size from scrolling using the same setting as interactive search: `index.max_result_window`. Closes #19249	2016-07-11 15:29:45 -04:00
Simon Willnauer	47bd2f9ca5	More cleanups aroung tests that require HTTP to be enalbed. (#19363 ) this commit moves the most of the http related integ tests out into it's own `qa/smoke-test-http` project where most of the test can run against the external cluster.	2016-07-11 20:44:57 +02:00
Nik Everett	89614586e9	Migrate range, date_range, and geo_distance aggregations to NamedWriteable	2016-07-11 13:00:36 -04:00
Christoph Büscher	0d428b6ba8	Add test for GeoHashUtils#bbox()	2016-07-11 10:46:31 -05:00
Nicholas Knize	f77f79c24a	GeoBoundingBoxQueryBuilder should fail when topLeft and bottomRight are the same coordinate	2016-07-11 10:25:33 -05:00
Jason Tedor	df7ad9970b	Batch process node left and node failure Today when a node is removed the cluster (it leaves or it fails), we submit a cluster state update task. These cluster state update tasks are processed serially on the master. When nodes are removed en masse (e.g., a rack is taken down or otherwise becomes unavailable), the master will be slow to process these failures because of the resulting reroutes and publishing of each subsequent cluster state. We improve this in this commit by processing the node removals using the cluster state update task batch processing framework. Relates #19289	2016-07-11 08:30:09 -04:00
Simon Willnauer	3f3c93ec65	Add blocking socket based MockTcpTransport (#19332 ) Today we have a bunch of tests that use netty transport for several reasons these tests use it because they need to run some tcp based transport. Yet, this couples our tests tightly to the netty implementation which should be tested on it's own. This change adds a plain socket based blocking TcpTransport implementation that is used by default in tests if local transport is suppressed or if network is selected. It also adds another tcp network implementation as a showcase how the interface works.	2016-07-11 12:17:52 +02:00
Simon Willnauer	1d03a1409c	Catch assertion errors on commit and turn it into a real exception (#19357 ) Lucene IndexWriter asserts on files existing on the filesystem but some tests throw IOException explicitly on those operatiosn such that some tests trip asserts. We had this before on InternalEngine#ctor and added some logic there to catch only a specific assertions based on some excepition stack analysis. This change applies the same logic to the IndexWriter#commit part of the engine since it can hit the same issue. This also fixes a self-suppression issue in Store.java. Closes #19356	2016-07-11 11:20:56 +02:00
javanna	942e342662	Rest Client: use short performRequest methods when possible	2016-07-11 10:36:26 +02:00
Simon Willnauer	36dbe7250f	Cleanup usage of http.enabled (#19351 ) Several tests required http.enabled where it was unnecessary. We also had RestMainActionIT which tests what two of our REST tests test already so I removed it. The explicit use of http.enabled: false is also obsolet since our test do that by default.	2016-07-11 10:21:03 +02:00
Ryan Ernst	25ed93dd28	Fix test edge case for random bytes reference iter. Getting an offset to the last byte means we can only stream one byte and then we are done, we can't get another offset after it.	2016-07-10 09:11:32 -07:00
Nicholas Knize	ab8b577aea	Use GeoDistanceIT.testDuelOptimizations for bwc testing only This test is only relevant for testing the GeoDistanceBuilder with old indices (pre GeoPointV2). closes #19263	2016-07-09 19:08:06 -05:00
Ryan Ernst	3832825a87	Merge pull request #19348 from rjernst/deguice_attrs Remove CustomNodeAttributes extension point	2016-07-09 12:46:26 -07:00
Ryan Ernst	2b9d4bdf85	Plugins: Remove CustomNodeAttributes extension point The DiscoveryNodeService exists to register CustomNodeAttributes which plugins can add. This is not necessary, since plugins can already add additional attributes, and use the node attributes prefix. This change removes the DiscoveryNodeService, and converts the only consumer, the ec2 discovery plugin, to add the ec2 availability zone in additionalSettings().	2016-07-08 21:39:11 -07:00
Ryan Ernst	7e59181e58	Internal: Remove child injectors from guice This change removes the ability for guice to have child injectors (and the entire concept of parent injectors) from our fork of guice. The methodology for removing was simple: I removed createChildInjector, and continued to remove methods and members that were unused until my head was spinning. The motivation for this change is to limit what our fork of guice gives us access to, so we don't regress and start adding back more complicated uses.	2016-07-08 15:22:50 -07:00
Ryan Ernst	dea00a0b16	Merge pull request #19324 from rjernst/repository_deguice2 Add RepositoryPlugin interface for registering snapshot repositories	2016-07-08 14:38:07 -07:00
Nicholas Knize	8dd4a6473e	Remove radial restriction for GeoDistanceQuery As of lucene 6.1 GeoDistanceQuery no longer requires restricting the radial distance in GeoPointDistanceQuery. closes #17578	2016-07-08 12:13:22 -05:00
Nicholas Knize	72fa345f5e	Mute GeoDistanceIT while fixing	2016-07-08 10:02:15 -05:00
Simon Willnauer	f6ac147b1d	Add a unit test that sends random requests among 3 nodes (#19329 ) This adds a test that uses transport implementation and sends random requests to 3 different nodes, the request handlers maybe forwarding the requests to yet another node etc. until returning the response. This test basically tests that nodes are not deadlocking in a distributed fashion.	2016-07-08 14:13:36 +02:00
Martijn van Groningen	7b8ae54f0f	percolator: Also support query term extract for queries wrapped inside a FunctionScoreQuery Additionally for highlighting percolator hits, also extract percolator query from FunctionScoreQuery and DisjunctionMaxQuery	2016-07-08 10:51:48 +02:00
Simon Willnauer	1cb1373722	[TEST] Test analyzer alias works Relates to #19163	2016-07-08 10:33:12 +02:00
Ryan Ernst	e6be4af014	Plugins: Add RepositoryPlugin interface for registering snapshot repositories Repository plugins currently use a lot of custom classes like RepositoryName and RepositorySettings in order to use guice to construct repository implementations. But repositories now only really need their settings to be constructed. Anything else they need (eg a cloud client) can be constructed within the plugin, instead of via guice. This change makes repository plugins use the new pull model. It removes guice from the construction of Repository objects (no more child injectors) and also from all repository plugins.	2016-07-08 00:10:03 -07:00
Nik Everett	81fcdfcee9	Expose task information from NodeClient This exposes a method to start an action and return a task from `NodeClient`. This allows reindex to use the injected `Client` rather than require injecting `TransportAction`s	2016-07-07 18:02:09 -04:00
Nik Everett	fe0f28965a	Clean up serialization of terms aggregation results Move to NamedWriteable and remove a lot of duplication.	2016-07-07 17:01:09 -04:00
Nik Everett	7da753a4d7	Migrate sampler and missing aggregations to NamedWriteable This is another step down the path to removing aggregation's special "streams" which reimplement NamedWriteable.	2016-07-07 16:40:38 -04:00
Nik Everett	d83e1cccac	Fix checkstyle in test	2016-07-07 16:40:11 -04:00
Ryan Ernst	89d69ea5a2	Merge pull request #19292 from rjernst/repository_deguice Simplified repository api for snapshot/restore	2016-07-07 13:03:58 -07:00
Ryan Ernst	593f8bdf0c	Rename repository api methods for clarity and tweak documentation.	2016-07-07 12:54:10 -07:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Jason Tedor	d3f8329a3d	Tighten ensure atomic move cleanup This commit tightens the cleanup after possible errors while ensuring the filesystem supports atomic move. Relates #19309	2016-07-07 14:40:05 -04:00
Tanguy Leroux	3267fc4e0c	Clean up more messy tests After #13834 many tests that used Groovy scripts (for good or bad reason) in their tests have been moved in the lang-groovy module and the issue #13837 has been created to track these messy tests in order to clean them up. This commit moves more tests back in core, removes the dependency on Groovy, changes the scripts in order to use the mocked script engine, and change the tests to integration tests.	2016-07-07 17:50:23 +02:00
Clinton Gormley	ec3807d426	Added 2.3.4 version and bwc indices	2016-07-07 16:36:23 +02:00
Tanguy Leroux	b58f2eb5c2	Move back some messy tests from Groovy plugin to core This commit moves back some messy tests that have been placed in lang-groovy module in https://github.com/elastic/elasticsearch/pull/13834. It removes the dependency on Groovy plugin as well as change back the tests to integration tests (IT suffix). It also changes the current MockScriptEngine and MockScriptPlugin to make it easier to use.	2016-07-07 15:26:36 +02:00
Alexander Reelsen	71b48fb16c	Dependencies: Update to jopt-5.0 (#19278 ) The new version of jopt allows us to remove a couple of TODOs in the code. Closes #12368	2016-07-07 08:50:10 +02:00
Jason Tedor	b3105bd316	Fix modifier order in BytesStreamsTests This commit fixes an issue with the ordering of modifiers on a static nested class in BytesStreamsTests.	2016-07-06 20:11:38 -04:00
Ryan Ernst	dd7be74bcf	Plugins: Simplified repository api for snapshot/restore The api for snapshot/restore was split up between two interfaces, Repository and IndexShardRepository. There was also complex initialization and injection between the two. However, there is always a one to one relationship between the two. This change moves the IndexShardRepository api into Repository, as well as updates the API so as not to require any services to be injected for sublcasses.	2016-07-06 17:09:30 -07:00
Chris Earle	b927cfe1de	Add DeprecationRestHandler to automatically log deprecated REST calls This adds a new proxy for RestHandlers and RestControllers so that requests made to deprecated REST APIs can be automatically logged in the ES logs via the DeprecationLogger as well as via a "Warning" header (RFC-7234) for all responses.	2016-07-06 19:52:00 -04:00
Yannick Welsch	76057e6b05	Fix test issue where index is explicitly deleted during cluster state update Calling indicesService.deleteIndex() can trip an assertion if there is an ongoing cluster state applied in IndicesClusterStateService. This means that the index is possibly deleted after the failMissingShards check and before we try creating new and updated shards, tripping an assertion that non-existing shards must have shard state initializing (started in this case).	2016-07-06 16:51:22 +02:00
Jim Ferenczi	37725f640c	Add missing field type in the FieldStats response. This change adds the type of the field in the fieldstats response. It can be one of the following: * "integer" for byte, short, integer and long * "float" for float, half-float and double * "date" for date * "ip" for ip * "text" for string, keyword and text. Closes #17750	2016-07-06 15:24:09 +02:00
Ryan Ernst	f49ce8e6fe	Add basic tests for ingest plugins setup	2016-07-05 21:03:39 -07:00
Ryan Ernst	2fc41adeb5	Merge branch 'master' into ingest_plugin_api	2016-07-05 20:53:03 -07:00
Ryan Ernst	18c9e7adaf	Merge branch 'master' into unused	2016-07-05 20:46:36 -07:00
Ryan Ernst	14eefb7607	Internal: Remove guice from transport client helper classes This change removes injection for constructors of TransportClientNodesService and TransportProxyClient.	2016-07-05 19:51:03 -07:00
Igor Motov	74af0e36f3	[TEST] Fix rare OBOE in AbstractBytesReferenceTestCase	2016-07-05 16:40:03 -04:00
Nik Everett	b3c015e2bb	Reindex from remote This adds a remote option to reindex that looks like ``` curl -POST 'localhost:9200/_reindex?pretty' -d'{ "source": { "remote": { "host": "http://otherhost:9200" }, "index": "target", "query": { "match": { "foo": "bar" } } }, "dest": { "index": "target" } }' ``` This reindex has all of the features of local reindex: * Using queries to filter what is copied * Retry on rejection * Throttle/rethottle The big advantage of this version is that it goes over the HTTP API which can be made backwards compatible. Some things are different: The query field is sent directly to the other node rather than parsed on the coordinating node. This should allow it to support constructs that are invalid on the coordinating node but are valid on the target node. Mostly, that means old syntax.	2016-07-05 16:13:17 -04:00
Jason Tedor	96f283c195	Rename writeThrowable to writeException This commit renames writeThrowable to writeException. The situation here stems from the fact that the StreamOutput method for serializing Exceptions needs to accept Throwables too as Throwables can be the cause of serialized Exceptions. Yet, we do not serialize Throwables in the Error sub-hierarchy in a way that they can be deserialized into their initial type. This leads to an asymmetry in the StreamOutput method for serializing Exceptions and the StreamInput method for writing Excpetions. Namely, the former will accept Throwables but the latter will only return Exceptions. A goal with the stream methods has always been symmetry in the method names so that serialization/deserialization routines appear symmetrical in code. It is this asymmetry on the input/output types for Exceptions on StreamOutput/StreamInput that clashes with the desired symmetry of naming. Despite this, we should favor symmetry in the naming of the methods. This commit renames StreamOutput#writeThrowable to StreamOutput#writeException which leaves us with Exception StreamInput#readException and void StreamOutput#writeException(Throwable).	2016-07-05 14:37:01 -04:00
Justin Patrin	ebe616988a	Start transport client round-robin randomly This commit modifies the initial value of the transport client round-robin index to a random value so that initial requests are more likely to not all hit the same node. Relates #14143	2016-07-05 13:17:17 -04:00
LeeDr	036b8ff177	Fix stored_fields message	2016-07-05 09:11:29 -05:00
Nik Everett	3269c84c7a	Remote BucketStreams It isn't used.	2016-07-05 09:02:36 -04:00
Adrien Grand	4b0d317e63	Bump version to 5.0.0-alpha5.	2016-07-05 14:34:23 +02:00
Simon Willnauer	d08812d839	[TEST] fix test to account for internal empyt reference optimization	2016-07-05 11:23:43 +02:00
Simon Willnauer	a4ec0ac22f	Upgrade to netty 3.10.6.Final (#19235 )	2016-07-05 11:11:55 +02:00
Simon Willnauer	cbbc8790a5	Remove redundant modifier	2016-07-05 08:39:08 +02:00
Simon Willnauer	44ccf67e33	Simplify TcpTransport interface by reducing send code to a single send method (#19223 ) Due to some optimization on the netty layer we had quite some code / cruft added to the TcpTransport to allow for those optimizations. After cleaning up BytesReference we can now move this optimization into TcpTransport and have a simple send method on the implementation layer instead. This commit adds a CompositeBytesReference that also allows message headers to be written separately which simplify the header code as well since no skips are needed anymore.	2016-07-05 08:33:19 +02:00
Jason Tedor	a00a54ebda	Fix style violation in InstallPluginCommand.java This commit fixes a checkstyle violation in InstallPluginCommand.java added after renaming UserError to UserException in `f9d55be1ed`.	2016-07-04 19:45:46 -04:00
Jason Tedor	f9d55be1ed	Rename UserError The top-level class Throwable represents all errors and exceptions in Java. This hierarchy is divided into Error and Exception, the former being serious problems that applications should not try to catch and the latter representing exceptional conditions that an application might want to catch and handle. This commit renames org.elasticsearch.cli.UserError to org.elasticsearch.UserException to make its name consistent with where it falls in this hierarchy. Relates #19254	2016-07-04 19:22:29 -04:00
Jason Tedor	36b887ee7c	Throw translog corrupted exception on malformed op Today when reading a malformed operation from the translog, we throw an assertion error that is immediately caught and wrapped into a translog corrupted exception. This commit replaces this by electing to directly throw a translog corrupted exception instead. Additionally, this cleanup also addressed a double-wrapped translog corrupted exception. Namely, verifying the checksum can throw a translog corrupted exception which the existing code would catch and wrap again in a translog corrupted exception. Relates #19256	2016-07-04 19:22:12 -04:00
Ali Beyad	f5b07e438f	Fixes getting snapshot status checks for preventing duplicate entries from both current snapshots (cluster state) and repository snapshots (on storage).	2016-07-04 17:40:44 -04:00
Boaz Leskes	37c8c0fa03	Improve logging for batched cluster state updates (#19255 ) We've been slowly improving batch support in `ClusterService` so service won't need to implement this tricky logic themselves. These good changes are blessed but our logging infra didn't catch up and we now log things like: ``` [2016-07-04 21:51:22,318][DEBUG][cluster.service ] [node_sm0] processing [put-mapping [type1],put-mapping [type1]]: ``` Depending on the `source` string this can get quite ugly (mostly in the ZenDiscovery area). This PR adds some infra to improve logging, keeping the non-batched task the same. As result the above line looks like: ``` [2016-07-04 21:44:45,047][DEBUG][cluster.service ] [node_s0] processing [put-mapping[type0, type0, type0]]: execute ``` ZenDiscovery waiting on join moved from: ``` [2016-07-04 17:09:45,111][DEBUG][cluster.service ] [node_t0] processing [elected_as_master, [1] nodes joined),elected_as_master, [1] nodes joined)]: execute ``` To ``` [2016-07-04 22:03:30,142][DEBUG][cluster.service ] [node_t3] processing [elected_as_master ([3] nodes joined)[{node_t2}{R3hu3uoSQee0B6bkuw8pjw}{p9n28HDJQdiDMdh3tjxA5g}{127.0.0.1}{127.0.0.1:30107}, {node_t1}{ynYQfk7uR8qR5wKIysFlQg}{wa_OKuJHSl-Oyl9Gis-GXg}{127.0.0.1}{127.0.0.1:30106}, {node_t0}{pweq-2T4TlKPrEVAVW6bJw}{NPBSLXSTTguT1So0JsZY8g}{127.0.0.1}{127.0.0.1:30105}]]: execute ``` As a bonus, I removed all `zen-disco` prefixes to sources from that area.	2016-07-04 22:54:43 +02:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitation, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the nodes data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). I It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` paramater from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Tanguy Leroux	ed444cd276	Fix CompletionTokenStream modifier redundancy	2016-07-04 16:29:26 +02:00
Tanguy Leroux	d72134b46e	Fix CompletionSuggestSearchIT and CompletionSuggestSearch2xIT	2016-07-04 15:58:27 +02:00
Tanguy Leroux	0e7faf1005	Enable Checkstyle RedundantModifier	2016-07-04 15:22:12 +02:00
Nik Everett	a728c4bb5c	Migrate global, filter, and filters aggs to NamedWriteable Once all of these are migrated we'll be able to remove aggregation's custom "streams" which function that same as NamedWriteable. It also allows us to make most of the fields on aggregations final which is rather nice. Also starts to migrate MultiBucketAggregation.Bucket to Writeable, allowing the buckets to have immutable parts.	2016-07-04 08:53:02 -04:00
Nik Everett	c02de9227c	Migrate remaining calc aggs to NamedWriteable Once all of these are migrated we'll be able to remove aggregation's custom "streams" which function that same as NamedWriteable. It also allows us to make most of the fields on aggregations final which is rather nice.	2016-07-04 08:46:00 -04:00
Jason Tedor	3343ceeae4	Do not catch throwable Today throughout the codebase, catch throwable is used with reckless abandon. This is dangerous because the throwable could be a fatal virtual machine error resulting from an internal error in the JVM, or an out of memory error or a stack overflow error that leaves the virtual machine in an unstable and unpredictable state. This commit removes catch throwable from the codebase and removes the temptation to use it by modifying listener APIs to receive instances of Exception instead of the top-level Throwable. Relates #19231	2016-07-04 08:41:06 -04:00
Boaz Leskes	86d2e88362	re-introduce: Inline reroute with process of node join/master election (#18938 )	2016-07-04 13:06:56 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Daniel Mitterdorfer	25881c265b	Improve error checking and reporting in CBSIT	2016-07-04 10:24:15 +02:00
Daniel Mitterdorfer	4131476db1	Propagate canTripCircuitBreaker for all broadcasted actions With this commit we also propagate the `canTripCircuitBreaker` setting for the main action in TransportBroadcastByNodeAction. Previously, we set it only on the additional action added by this handler.	2016-07-04 10:24:15 +02:00
Ryan Ernst	5cf7583bde	Internal: Remove unused methods and classes This change removes a handful of classes and methods that were simply unused. Some of the classes were intermediate abstract classes that added nothing to the base class they extended.	2016-07-02 11:33:16 -07:00
Yannick Welsch	3e199b1dff	Make testAllOperationsInvoked properly clean up after itself	2016-07-02 11:54:52 +02:00
Yannick Welsch	3221c9d970	Match exception message more exactly in test assertion	2016-07-02 10:07:57 +02:00
Yannick Welsch	b4064ce43f	Make primary relocation handoff non-blocking (#19013 ) Primary relocation and indexing concurrently can currently lead to a deadlock situation as indexing operations are blocked on a (bounded) thread pool during the hand-off phase between old and new primary. This change replaces blocking of indexing operations by putting operations that cannot be executed during relocation hand-off in a queue to be executed once relocation completes. Closes #18553.	2016-07-02 09:35:54 +02:00
Yannick Welsch	50b97ba5f5	Fix test assertion matching exception message Newer versions of the URL class in JDK 9 use a different exception message when throwing a MalformedURLException due to an unknown protocol.	2016-07-02 09:27:07 +02:00
Ryan Ernst	5a66c08ae9	Merge branch 'master' into ingest_plugin_api	2016-07-01 16:27:52 -07:00
Ryan Ernst	c7b9489be8	Merge pull request #19225 from rjernst/we_dont_need_generics Internal: Remove generics from LifecycleComponent	2016-07-01 16:25:34 -07:00
Ryan Ernst	822c995367	Internal: Remove generics from LifecycleComponent The only reason for LifecycleComponent taking a generic type was so that it could return that type on its start and stop methods. However, this chaining has no practical necessity. Instead, start and stop can be void, and a whole bunch of confusing generics disappear.	2016-07-01 16:17:42 -07:00
Ali Beyad	05998224d8	Adding repository index generational files Before, a repository would maintain an index file (named 'index') per repository, that contained the current snapshots in the repository. This file was not atomically written, so repositories had to depend on listing the blobs in the repository to determine what the current snapshots are, and only rely on the index file if the repository does not support the listBlobs operation. This could cause an incorrect view of the current snapshots in the repository if any prior snapshot delete operations failed to delete snapshot metadata files. This commit introduces the atomic writing of the index file, and because atomic writes are not guaranteed if the file already exists, we write to a generational index file (index-N, where N is the current generation). We also maintain an index-latest file that contains the current generation, for those repositories that cannot list blobs. Closes #19002 Relates #18156	2016-07-01 17:52:57 -04:00
Ryan Ernst	40e1fe7d9e	Remove unused ClusterService member of NodeService	2016-07-01 13:17:12 -07:00
Ryan Ernst	f9eed91f33	Fix test oops	2016-07-01 13:00:33 -07:00
Ryan Ernst	2f12d1cb45	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:58:11 -07:00
Ryan Ernst	62397c0c3d	Fix node service to still work when http is disabled	2016-07-01 12:55:41 -07:00
Ryan Ernst	6110dcc710	Fix unit tests for http server that no longer need node service	2016-07-01 12:45:43 -07:00
Ryan Ernst	f6491047a0	Fix line lengths for ingest config utils	2016-07-01 12:36:44 -07:00
Ryan Ernst	e5caadc4f3	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:35:26 -07:00
Ryan Ernst	a200292211	Merge pull request #19218 from rjernst/http_server_without_node_service Internal: Remove cyclic dependency between HttpServer and NodeService	2016-07-01 12:32:52 -07:00
Nik Everett	de71a9abb3	Migrate value_count, percentiles, and percentile_ranks aggregations to NamedWriteable These are the first aggregations with multiple `InternalAggregation`s backing the same `AggregationBuilder`. This required a change in the register method's signature.	2016-07-01 14:48:08 -04:00
Ali Beyad	cb20776439	Includes the index UUID in the _cat/indices API and adds tests for the _cat/indices functionality. Closes #19204 Closes #19132	2016-07-01 14:45:55 -04:00
Ryan Ernst	a206c7a149	Remove unnecessary transport level bwc	2016-07-01 10:16:56 -07:00
Ryan Ernst	dbbfadeefa	Internal: Remove cyclic dependency between HttpServer and NodeService NodeService has an "service attributes" map, which is only set by HttpServer on start/stop. But the only thing it puts in this map is already available as part of the HttpServer info which is added to node info requests. This change removes the attributes map and removes the dependency in HttpServer on NodeService.	2016-07-01 09:50:54 -07:00
Ryan Ernst	76ba10bab6	Remove unnecessary optional injection of ScriptService into NodeService	2016-07-01 09:31:41 -07:00
Ryan Ernst	65c9b0b588	Merge branch 'master' into ingest_plugin_api	2016-07-01 09:26:17 -07:00
Tal Levy	01d7020ee3	Skip the execution of an empty pipeline (#19200 ) main optimization: `sourceToMap` is not called, therefore avoiding creation of Map of Maps Closes #19192.	2016-07-01 09:15:05 -07:00
Tanguy Leroux	5c7ca5cc0c	Fix checkstyle violations	2016-07-01 17:24:43 +02:00
Tanguy Leroux	0a293fad29	Remove some unused code	2016-07-01 17:01:39 +02:00
Tanguy Leroux	8c40b2b54e	Fix order of modifiers	2016-07-01 16:57:14 +02:00
Simon Willnauer	5c8164a561	Clean up BytesReference (#19196 ) BytesReference should be a really simple interface, yet it has a gazillion ways to achieve the same this. Methods like `#hasArray`, `#toBytesArray`, `#copyBytesArray` `#toBytesRef` `#bytes` are all really duplicates. This change simplifies the interface dramatically and makes implementations of it much simpler. All array access has been removed and is streamlined through a single `#toBytesRef` method. Utility methods to materialize a compact byte array has been added too for convenience.	2016-07-01 16:09:31 +02:00
Nik Everett	27e320d5ce	Migrate sum, min, and max aggs to NamedWriteable	2016-07-01 09:23:26 -04:00
Nik Everett	91b66e3cf4	Migration stats and extended stats to NamedWriteable Migrates the `stats` and `extended_stats` aggregations and pipeline aggregations from the special purpose aggregations streams to `NamedWriteable`. These are the first pipeline aggregations so this adds the infrastructure to support both streams and `NamedWriteable`s for pipeline aggregations.	2016-07-01 09:13:15 -04:00
javanna	598c36128e	Revert "Raised IOException on deleteBlob (#18815 )" This reverts commit `d24cc65cad` as it seems to be causing test failures.	2016-07-01 11:00:32 +02:00
gfyoung	d24cc65cad	Raised IOException on deleteBlob (#18815 ) Raise IOException on deleteBlob if the blob doesn't exist This commit raises an IOException on BlobContainer#deleteBlob if the blob does not exist, in conformance with the BlobContainer interface contract. Each implementation of BlobContainer now conforms to this contract (file system, S3, Azure, HDFS). This commit also contains blob container tests for each of the repository implementations. Closes #18530	2016-06-30 23:00:10 -04:00
Ryan Ernst	8275ab497b	Merge pull request #19170 from rjernst/rest_handler_client Changed rest handler interface to take NodeClient	2016-06-30 11:00:09 -07:00
Nik Everett	f5a269b029	Start migration away from aggregation streams We'll migrate to NamedWriteable so we can share code with the rest of the system. So we can work on this in multiple pull requests without breaking Elasticsearch in between the commits this change supports both old style `InternalAggregations.stream` serialization and `NamedWriteable` style serialization. As such it creates about a half dozen `// NORELEASE` comments that will have to be removed once the migration is complete. This also introduces a boolean `transportClient` flag to `SearchModule` which is used to skip inappropriate registrations for for the transport client while still registering the things it needs. In this case that means that the `InternalAggregation` subclasses are registered with the `NamedWriteableRegistry` but the `AggregationBuilder` subclasses are not. Finally, this moves aggregation registration from guice configuration time to `SearchModule` construction time. This will make it simpler to work with in the future as we further clean up Elasticsearch's extension points.	2016-06-30 12:57:34 -04:00
Boaz Leskes	09ca6d6ed2	Add a BridgePartition to be used by testAckedIndexing (#19172 ) We have long worked to capture different partitioning scenarios in our testing infra. This PR adds a new variant, inspired by the Jepsen blogs, which was forgotten far - namely a partition where one node can still see and be seen by all other nodes. It also updates the resiliency page to better reflect all the work that was done in this area.	2016-06-30 17:58:12 +02:00
Ryan Ernst	04a4bcdca0	Add comment explaining bytes reference edge case	2016-06-30 08:47:55 -07:00
Ryan Ernst	e079c83020	Fix test edge case for bytes reference	2016-06-30 08:45:54 -07:00
Ryan Ernst	c762e7aa15	Merge branch 'master' into rest_handler_client	2016-06-30 08:16:25 -07:00
Ryan Ernst	0732004ae8	Merge pull request #19177 from rjernst/ingest_factory_generic Remove generics from ingest Processor.Factory	2016-06-30 08:08:26 -07:00
Christoph Büscher	afb5e6332b	Make sure TimeIntervalRounding is monotonic for increasing dates (#19020 ) Currently there are cases when using TimeIntervalRounding#round() and date1 < date2 that round(date2) < round(date1). These errors can happen when using a non-fixed time zone and the values to be rounded are slightly after a time zone offset change (e.g. DST transition). Here is an example for the "CET" time zone with a 45 minute rounding interval. The dates to be rounded are on the left (with utc time stamp), the rounded values on the right. The error case is marked: 2011-10-30T01:40:00.000+02:00 1319931600000 \| 2011-10-30T01:30:00.000+02:00 1319931000000 2011-10-30T02:02:30.000+02:00 1319932950000 \| 2011-10-30T01:30:00.000+02:00 1319931000000 2011-10-30T02:25:00.000+02:00 1319934300000 \| 2011-10-30T02:15:00.000+02:00 1319933700000 2011-10-30T02:47:30.000+02:00 1319935650000 \| 2011-10-30T02:15:00.000+02:00 1319933700000 2011-10-30T02:10:00.000+01:00 1319937000000 \| 2011-10-30T01:30:00.000+02:00 1319931000000 * 2011-10-30T02:32:30.000+01:00 1319938350000 \| 2011-10-30T02:15:00.000+01:00 1319937300000 2011-10-30T02:55:00.000+01:00 1319939700000 \| 2011-10-30T02:15:00.000+01:00 1319937300000 2011-10-30T03:17:30.000+01:00 1319941050000 \| 2011-10-30T03:00:00.000+01:00 1319940000000 We should correct this by detecting that we are crossing a transition when rounding, and in that case pick the largest valid rounded value before the transition. This change adds this correction logic to the rounding function and adds this invariant to the randomized TimeIntervalRounding tests. Also adding the example test case from above (with corrected behaviour) for illustrative purposes.	2016-06-30 17:05:54 +02:00
Simon Willnauer	40ec639c89	Factor out abstract TCPTransport* classes to reduce the netty footprint (#19096 ) Today we have a ton of logic inside the NettyTransport* codebase. The footprint of the code that has a direct netty dependency is large and alternative implementations are pretty hard today since they need to know all about our proticol etc. This change moves most of the code into TCPTransport* baseclasses and moves all the protocol send code together. The base classes now contain the majority of the logic while NettyTransport* classes remain to implement the glue code, configuration and optimization.	2016-06-30 13:41:53 +02:00
Ryan Ernst	e4f265eb3a	Ingest: Remove generics from Processor.Factory The factory for ingest processor is generic, but that is only for the return type of the create mehtod. However, the actual consumer of the factories only cares about Processor, so generics are not needed. This change removes the generic type from the factory. It also removes AbstractProcessorFactory which only existed in order pull the optional tag from config. This functionality is moved to the caller of the factories in ConfigurationUtil, and the create method now takes the tag. This allows the covariant return of the implementation to work with tests not needing casts.	2016-06-30 02:33:54 -07:00
Martijn van Groningen	299c6fcc63	test: use the reader from the searcher (newSearcher(...) method may change the reader) instead of the reader we create in the test Closes #19151	2016-06-30 11:10:38 +02:00
Ryan Ernst	08b3b6264e	Tests pass, started removing generics from processor factory	2016-06-30 01:49:22 -07:00
Ryan Ernst	f4519c44b7	Merge branch 'master' into ingest_plugin_api	2016-06-29 22:38:23 -07:00
Ryan Ernst	c77dc4a82c	Merge pull request #19136 from rjernst/script_service_deps Scripts: Remove ClusterState from compile api	2016-06-29 22:34:40 -07:00
Ryan Ernst	865b951b7d	Internal: Changed rest handler interface to take NodeClient Previously all rest handlers would take Client in their injected ctor. However, it was only to hold the client around for runtime. Instead, this can be done just once in the HttpService which handles rest requests, and passed along through the handleRequest method. It also should always be a NodeClient, and other types of Clients (eg a TransportClient) would not work anyways (and some handlers can be simplified in follow ups like reindex by taking NodeClient).	2016-06-29 18:02:18 -07:00
Ryan Ernst	7c50de182e	Remove test for closing ingest processors, this is now handled at the plugin level	2016-06-29 16:23:16 -07:00
Ryan Ernst	172ced3e2d	Fix test bug in plugin cli progress tests	2016-06-29 15:56:36 -07:00
Nik Everett	8db43c0107	Move RestHandler registration to ActionModule and ActionPlugin `RestHandler`s are highly tied to actions so registering them in the same place makes sense. Removes the need to for plugins to check if they are in transport client mode before registering a RestHandler - `getRestHandlers` isn't called at all in transport client mode. This caused guice to throw a massive fit about the circular dependency between NodeClient and the allocation deciders. I broke the circular dependency by registering the actions map with the node client after instantiation.	2016-06-29 18:31:44 -04:00
Ryan Ernst	f1376262fe	Merge branch 'master' into ingest_plugin_api	2016-06-29 14:16:16 -07:00
Ryan Ernst	4dcb2b8024	Merge pull request #19137 from rjernst/closeable_plugins Make plugins closeable	2016-06-29 13:54:20 -07:00
Ryan Ernst	b3daf7d683	Remove unnecessary variant of detailedMessage	2016-06-29 11:25:23 -07:00
Ryan Ernst	8b533b7ca9	Internal: Deprecate ExceptionsHelper.detailedMessage This is a trappy "helper" and only hurts. See #19069	2016-06-29 11:09:35 -07:00
Jason Tedor	fc38e503e0	Clearer error when handling fractional time values In `2f638b5a23`, support for fractional time values was removed. While this change is documented, the error message presented does not give an indication that fractional inputs are not supported. This commit fixes this by detecting when the input is a time value that would successfully parse as a double but will not parse as a long and presenting a clear error message that fractional time values are not supported. Relates #19158	2016-06-29 13:36:11 -04:00
Christoph Büscher	0d81dee013	Fix key_as_string for date histogram and epoch_millis/epoch_second format When doing a `date_histogram` aggregation with `"format":"epoch_millis"` or `"format" : "epoch_second"` and using a time zone other than UTC, the `key_as_string` ouput in the response does not reflect the UTC timestamp that is used as the key. This happens because when applying the `time_zone` in DocValueFormat.DateTime to an epoch-based formatter, this adds the time zone offset to the value being formated. Instead we should adjust the added display offset to get back the utc instance in EpochTimePrinter. Closes #19038	2016-06-29 19:18:12 +02:00
Alexander Reelsen	56fa751928	Plugins: Add status bar on download (#18695 ) As some plugins are becoming big now, it is hard for the user to know, if the plugin is being downloaded or just nothing happens. This commit adds a progress bar during download, which can be disabled by using the `-q` parameter. In addition this updates to jimfs 1.1, which allows us to test the batch mode, as adding security policies are now supported due to having jimfs:// protocol support in URL stream handlers.	2016-06-29 16:44:12 +02:00
Britta Weber	6d5666553c	[TEST] mute test because it fails about 1/100 runs	2016-06-29 15:53:57 +02:00
Simon Willnauer	819fe40d61	Extract AbstractBytesReferenceTestCase (#19141 ) We have a ton of tests for PagedBytesReference but not really many for the other implementation of BytesReference. This change factors out a basic AbstractBytesReferenceTestCase that simplifies testing other implementations. It also caught a couple of bug here and there like a missing mask when reading bytes as ints in PagedBytesReference.	2016-06-29 14:45:54 +02:00
Simon Willnauer	872cdffc27	Factor out ChannelBuffer from BytesReference (#19129 ) The ChannelBuffer interface today leaks into the BytesReference abstraction which causes a hard dependency on Netty across the board. This chance moves this dependency and all BytesReference -> ChannelBuffer conversion into NettyUtlis and removes the abstraction leak on BytesReference. This change also removes unused methods on the BytesReference interface and simplifies access to internal pages.	2016-06-29 10:45:05 +02:00
Ryan Ernst	6590e77c1a	Plugins: Make plugins closeable This change allows Plugin implementions to implement Closeable when they have resources that should be released. As a first example of how this can be used, I switched over ingest plugins, which just had the geoip processor. The ingest framework had chains of closeable to support this, which is now removed.	2016-06-28 16:16:26 -07:00
Ryan Ernst	ecf6101798	Scripts: Remove ClusterState from compile api Stored scripts are pulled from the cluster state, and the current api requires passing the ClusterState on each call to compile. However, this means every user of the ScriptService needs to depend on the ClusterService. Instead, this change makes the ScriptService a ClusterStateListener. It also simplifies tests a lot, as they no longer need to create fake cluster states (except when testing stored scripts).	2016-06-28 13:20:00 -07:00
Ryan Ernst	258c3e86ab	Added IngestPlugin api, cutover common and geoip, changed ingest factory api to take ProcessorsRegistry	2016-06-28 10:52:07 -07:00
Simon Willnauer	9b9e17abf7	Cleanup Compressor interface (#19125 ) Today we have several deprecated methods, leaking netty interfaces, support for multiple compressors on the compressor interface. The netty interface can simply be replaced by BytesReference which we already have an implementation for, all the others are not used and are removed in this commit.	2016-06-28 17:51:33 +02:00
Yannick Welsch	0515791846	Fix logger usages	2016-06-28 16:51:06 +02:00
Boaz Leskes	2512594d9e	Testing infra - stablize data folder usage and clean up (#19111 ) The plan for persistent node ids ( #17811 ) is to tie the node identity to a file stored in it's data folders. As such it becomes important that nodes in our testing infra have better affinity with their data folders and that their data folders are not cleaned underneath them. The first is important because we fix the random seed used for node id generation (for reproducibility) and allowing the same node to use two different data folders causes two separate nodes to have the same id, which prevents the cluster from forming. The second is important, for example, where a full cluster restart / single node restart need to maintain node identity and wiping the data folders at the wrong moment prevents this. Concretely this commit does the following: 1) Remove previous attempts to have data folder per role using a prefix. This wasn't effective as it was using the data paths settings which are only used for part of the runs. An attempt to completely separate the paths via the home dir failed due to assumptions made by index custom path about node data folder ordinal uniqueness (see #19076) 2) Change full cluster restarts to start up nodes in the same order their were first created in, only randomly swapping nodes with the same roles. 3) Change test cluster reset methods to first shutdown the unneeded nodes and then re-start the shared nodes that were shut down, so they'll reclaim their data folders. 4) Improve data folder wiping logic and make sure it wipes only folders of "offline" nodes. 5) Add some very basic tests	2016-06-28 16:38:56 +02:00
Jim Ferenczi	6d069078d3	Fixed tests that assumed that broken settings can be updated	2016-06-28 16:14:57 +02:00
Jim Ferenczi	ef0e3db0de	Validates new dynamic settings from the current state Thanks to https://github.com/elastic/elasticsearch/pull/19088 the settings are now validated against dynamic updaters on the master. Though only the new settings are applied to the IndexService created for the validation. Because of this we cannot check the transition from one value to another in a dynamic updaters. This change creates the IndexService from the current settings and validates that the new dynamic settings can replace the current settings. This change also removes the validation of dynamic settings when an index is opened. The validation should have occurred when the settings have been updated.	2016-06-28 15:35:04 +02:00
Nik Everett	fa4844c3f4	Pull actions from plugins Instead of implementing onModule(ActionModule) to register actions, this has plugins implement ActionPlugin to declare actions. This is yet another step in cleaning up the plugin infrastructure. While I was in there I switched AutoCreateIndex and DestructiveOperations to be eagerly constructed which makes them easier to use when de-guice-ing the code base.	2016-06-28 08:36:24 -04:00
Jason Tedor	2f638b5a23	Keep input time unit when parsing TimeValues This commit modifies TimeValue parsing to keep the input time unit. This enables round-trip parsing from instances of String to instances of TimeValue and vice-versa. With this, this commit removes support for the unit "w" representing weeks, and also removes support for fractional values of units (e.g., 0.5s). Relates #19102	2016-06-27 18:41:18 -04:00
Ryan Ernst	3f2946ce6d	Fix line length in new indices module tests.	2016-06-27 11:33:22 -07:00
Ryan Ernst	33ccc5aead	Merge branch 'master' into mapper_plugin_api	2016-06-27 11:19:59 -07:00
Ryan Ernst	f17fcce3ed	Add duplicate mapper detection and tests	2016-06-27 11:17:58 -07:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Simon Willnauer	4fb1c4fe5a	Validate settings against dynamic updaters on the master (#19088 ) Today all settings are only validated against their validators that are available when settings are registered. Yet, some settings updaters have validators that are dynamic ie. their validation depends on other variables that are only available at runtime. We do not run those validators when settings are updated causing index updates to fail on the data nodes instead of on the master. Relates to #19046	2016-06-27 17:18:26 +02:00
Colin Goodheart-Smithe	108ba23073	Pass resolved extended bounds to unmapped histogram aggregator Previous to this change the unresolved extended bounds was passed into the histogram aggregator which meant extendedbounds.min and extendedbounds.max was passed through as null. This had two effects on the histogram aggregator: 1. If the histogram aggregator was unmapped across all shards, the reduce phase would not add buckets for the extended bounds and the response would contain zero buckets 2. If the histogram aggregator was not unmapped in some shards, the reduce phase might sometimes chose to reduce based on the unmapped shard response and therefore the extended bounds would be ignored. This change resolves the extended bounds in the unmapped case and solves the above two issues. Closes #19009	2016-06-27 14:07:37 +01:00
Boaz Leskes	cb0824e957	Make shard store fetch less dependent on the current cluster state, both on master and non data nodes (#19044 ) #18938 has changed the timing in which we send out to nodes to fetch their shard stores. Instead of doing this after the cluster state resulting of the node's join was published, #18938 made it be sent concurrently to the publishing processes. This revealed a couple of points where the shard store fetching is dependent of the current state of affairs of the cluster state, both on the master and the data nodes. The problem discovered were already present without #18938 but required a failure/extreme situations to make them happen.This PR tries to remove as much as possible of these dependencies making shard store fetching simpler and make the way to re-introduce #18938 which was reverted. These are the notable changes: 1) Allow TransportNodesAction (of which shard store fetching is derived) callers to supply concrete disco nodes, so it won't need the cluster state to resolve them. This was a problem because the cluster state containing the needed nodes was not yet made available through ClusterService. Note that long term we can expect the rest layer to resolve node ids to concrete nodes, making this mode the only one needed. 2) The data node relied on the cluster state to have the relevant index meta data so it can find data when custom paths are used. We now fall back to read the meta data from disk if needed. 3) The data node was relying on it's own IndexService state to indicate whether the data it has corresponds to an existing allocation. This is of course something it can not know until it got (and processed) the new cluster state from the master. This flag in the response is now removed. This is not a problem because we used that flag to protect against double assigning of a shard to the same node, but we are already protected from it by the allocation deciders. 4) I removed the redundant filterNodeIds method in TransportNodesAction - if people want to filter they can override resolveRequest.	2016-06-27 15:05:06 +02:00
Martijn van Groningen	d3cd58eb2f	Merges PR #18957 This commit fixes several NPEs caused by implicitly performing a get request for a document that exists with its _source disabled and then trying to access the source. Instead of causing an NPE the following queries will throw an exception with a "source disabled" message (similar behavior as if the document does not exist).: - GeoShape query for pre-indexed shape (throws IllegalArgumentException) - Percolate query for an existing document (throws IllegalArgumentException) A Terms query with a lookup will ignore the document if the source does not exist (same as if the document does not exist). GET and HEAD requests for the document _source will return a 404 if the source is disabled (even if the document exists).	2016-06-27 09:37:28 +02:00
Martijn van Groningen	ba90508b91	fix checkstyle issue	2016-06-27 09:00:13 +02:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins in with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing anslysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is setup. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnslysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This is means it is available at module construction time so we slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Jason Tedor	c79e27180e	Require timeout units when parsing query body Today when parsing the timeout field in a query body, if time units are supplied the parser throws a NumberFormatException. Addtionally, the parsing allows the timeout field to not specify units (it assumes milliseconds). This commit fixes this behavior by not only allowing time units to be specified but requires time units to be specified. This is consistent with the documented behavior and the behavior in 2.x. Relates #19077	2016-06-25 16:18:25 -04:00
Simon Willnauer	09c0285d9c	[TEST] Add unittest for settings update validation	2016-06-25 21:23:41 +02:00
Alex Benusovich	3ca909dfea	Fix NPEs due to disabled source This commit fixes several NPEs caused by implicitly performing a get request for a document that exists with its _source disabled and then trying to access the source. Instead of causing an NPE the following queries will throw an exception with a "source disabled" message (similar behavior as if the document does not exist).: - GeoShape query for pre-indexed shape (throws IllegalArgumentException) - Percolate query for an existing document (throws IllegalArgumentException) A Terms query with a lookup will ignore the document if the source does not exist (same as if the document does not exist). GET and HEAD requests for the document _source will return a 404 if the source is disabled (even if the document exists).	2016-06-24 22:03:03 -07:00
Ryan Ernst	6995bde710	Merge branch 'master' into mapper_plugin_api	2016-06-24 11:15:06 -07:00
Martijn van Groningen	599a548998	percolator: Don't verify candidate matches with MemoryIndex that are verified matches If we don't care about scoring then for certain candidate matches we can be certain, that if they are a candidate match, then they will always match. So verifying these queries with the MemoryIndex can be skipped.	2016-06-24 15:46:55 +02:00
Christoph Büscher	6d5b4a78fe	Make parsing of bool queries stricter Currently we don't throw an error when there is more than one query clause specified in a must/must_not/should/filter object of the bool query without using array notation, e.g.: { "bool" : { "must" : { "match" : { ... }, "match": { ... }}}} In these cases, only the first query will be parsed and further behaviour is unspecified, possibly leading to silently ignoring the rest of the query. Instead we should throw a ParsingException if we don't encounter an END_OBJECT token after having parsed the query clause.	2016-06-24 11:49:28 +02:00
Simon Willnauer	148e64d654	[TEST] Port testcase from #19035 to master	2016-06-23 22:53:54 +02:00
Lee Hinman	9a3227108b	[TEST] Add ensureGreen for IpRangeIT Resolves #18584	2016-06-23 09:58:10 -06:00
Yannick Welsch	ca6fa9ef19	Fix block checks when no indices are specified (#19047 ) Global cluster blocks were not checked if an empty set of indices were passed as argument to the block checking method. This would lead to issues where some operations are already executed on a cluster before it has recovered its cluster state.	2016-06-23 17:52:09 +02:00
Jason Tedor	7f10174362	Upgrade JNA to 4.2.2 and remove optionality This commit upgrades JNA from version 4.1.0 to 4.2.2. Additionally, this dependency is now non-optional as JNA is dual-licensed with Apache License 2.0 since JNA 4.0.0. Relates #19045	2016-06-23 09:21:40 -04:00
Adrien Grand	7ba5bceebe	Add a MultiTermAwareComponent marker interface to analysis factories. #19028 This is the same as what Lucene does for its analysis factories, and we hawe tests that make sure that the elasticsearch factories are in sync with Lucene's. This is a first step to move forward on #9978 and #18064.	2016-06-23 10:19:24 +02:00
Adrien Grand	6c8744ecb5	Attempt at fixing IndexStatsIT.testFilterCacheStats. I suspect recent failures are due to the fact that the cache disables itself when there is contention. This runs assertions in an assertBusy block since they should eventually succeed.	2016-06-23 10:16:04 +02:00
Tanguy Leroux	04da1bda0d	Move templates out of the Search API, into lang-mustache module This commit moves template support out of the Search API to its own dedicated Search Template API in the lang-mustache module. It provides a new SearchTemplateAction that can be used to render templates before it gets delegated to the usual Search API. The current REST endpoint are identical, but the Render Search Template endpoint now uses the same Search Template API with a new "simulate" option. When this option is enabled, the Search Template API only renders template and returns immediatly, without executing the search. Closes #17906	2016-06-23 09:30:53 +02:00
Boaz Leskes	4be94cdc95	revert - Inline reroute with process of node join/master election (#18938 ) There are secondary issues with async shard fetch going out to nodes before they have a cluster state published to them that need to be solved first. For example: - async fetch uses transport node action that resolves nodes based on the cluster state (but it's not yet exposed by ClusterService since we inline the reroute) - after disruption nodes will respond with an allocated shard (they didn't clean up their shards yet) which throws of decisions master side. - nodes deed the index meta data in question but they may not have if they didn't recieve the latest CS	2016-06-23 08:41:44 +02:00
Mike McCandless	d3d524568e	merge master	2016-06-22 16:23:56 -04:00
Nik Everett	6dd9cd72b9	Build valid slices in SearchSourceBuilderTests The test had a 1 in 500 chance of building and invalid slice.	2016-06-22 14:56:42 -04:00
Nik Everett	6671c0cf09	Tasks: Add completed to the mapping	2016-06-22 12:34:59 -04:00
Nik Everett	6574243077	Fail to start if plugin tries broken onModule If a plugin declares `onModule(SomethingThatIsntAModule)` then refuse to start. Before this commit we just logged a warning that flies by in the console and is easy to miss. You can't miss refusing to start!	2016-06-22 12:20:52 -04:00
Jason Tedor	6d04c1e78e	Remove duplicated read byte array methods This commit removes duplicated methods for reading byte arrays in StreamInput. One method would read a byte array by repeatedly calling StreamInput#readByte in a loop, and the other would just call StreamInput#readBytes. In this commit, we remove the former. Relates #19023	2016-06-22 11:56:04 -04:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Mike McCandless	cbc7ff3f9c	NodeService.indicesService is never null	2016-06-22 10:04:33 -04:00
Mike McCandless	52fcdf5e8d	merge master	2016-06-22 09:54:40 -04:00
Mike McCandless	1bd0482393	don't include indexing buffer in cluster stats; randomize indexing buffer in NodeInfoStreamingTests; add @Nullable annotation	2016-06-22 09:52:54 -04:00
Nik Everett	b0da4719aa	Add missing field to PersistedTaskInfo	2016-06-22 07:37:58 -04:00
Jason Tedor	9d6d8152ee	Merge pull request #19016 from jasontedor/hot-methods-redux Hot methods redux	2016-06-22 06:41:38 -04:00
javanna	490d9c8cf7	Merge branch 'master' into feature/http_client	2016-06-22 09:50:07 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Ryan Ernst	e817b5daa3	Plugins: Remove guice from Mapper plugins This changes adds a MapperPlugin interface which allows pull style retrieval of mappers and metadata mappers added by plugins. For now, I have kept the MapperRegistry, but this should be removed in the future as it is just a silly container for 2 maps which could themselves be passed around.	2016-06-21 22:50:39 -07:00
Jason Tedor	4f49a261a7	Refactor InternalEngine inner methods This commit refactors InternalEngine#innerIndex and InternalEngine#innerDelete to collapse some common logic into a single method. This has the advantage that it shrinks the bytecode size of InternalEngine#innerIndex so that it can be inlined.	2016-06-21 21:28:00 -04:00
Jason Tedor	abae58b5fb	Inline TransportSearchAction#doExecute	2016-06-21 20:48:16 -04:00
Jason Tedor	81ba43888f	Inline AbstractSearchAsyncAction#init	2016-06-21 20:48:15 -04:00
Jason Tedor	93c3a89994	Inline StreamOutput#writeGenericValue	2016-06-21 20:48:15 -04:00
Jason Tedor	af7f98205a	Inline StreamInput#readGenericValue	2016-06-21 20:48:15 -04:00
Jason Tedor	9d1ef62431	Inline ReplicationOperation#execute	2016-06-21 20:48:15 -04:00
Jason Tedor	dcd394d83f	Inline TaskManager#register	2016-06-21 20:48:09 -04:00
Nik Everett	8925400f67	Remove guice from ScriptService Makes ScriptModule just a plain class that manages building the ScriptSettings and ScriptService from plugins. When we need to bind ScriptService with guice we bind it in a lambda.	2016-06-21 16:45:45 -04:00
Tal Levy	28fd684eef	Fix ignore_failure behavior in _simulate?verbose (#18987 ) - fix it so that processors with the `ignore_failure` option do not record their exception in the response - add more tests to make empty `on_failure`. This now throws an exception	2016-06-21 13:29:53 -07:00
Simon Willnauer	c80e837606	Beef up Translog testing with random channel exceptions (#18997 ) Today we only throw random exceptions on the translog writer. This commit extends it to also throw exceptions during checkpoint writing etc to test if the correct flags are provided to open method etc.	2016-06-21 21:25:01 +02:00
Martijn van Groningen	c7710daed0	Merge pull request #19011 from martijnvg/inner_hits/index_type_id_serialization1 Also do not serialize `_index` key in search response for parent/child inner hits	2016-06-21 21:24:39 +02:00
Boaz Leskes	e9230dd889	RejectedExecutionException != EdRejectedExecutionException	2016-06-21 21:08:43 +02:00
Nik Everett	5f0292cb81	Fetch result when wait_for_completion This makes this sequence: ``` curl -XDELETE localhost:9200/source,dest?pretty for i in $( seq 1 100 ); do curl -XPOST localhost:9200/source/test -d'{"test": "test"}'; echo done curl localhost:9200/_refresh?pretty curl -XPOST 'localhost:9200/_reindex?pretty&wait_for_completion=false' -d'{ "source": { "index": "source" }, "dest": { "index": "dest" } }' curl 'localhost:9200/_tasks/Jsyd6d9wSRW-O-NiiKbPcQ:237?wait_for_completion&pretty' ``` Return task AND the response to the user. This also renames "result" to "response" in the persisted task info to line it up with how we name the objects in Elasticsearch.	2016-06-21 14:18:53 -04:00
Adrien Grand	8078c205f9	Revert "Remove `_timestamp` and `_ttl` on 5.x indices. #18980" This reverts commit `969e953645`. Docs are failing because of the removed functionality. I will fix the docs before pushing it again.	2016-06-21 19:19:49 +02:00
Martijn van Groningen	b32d9a71e4	inner_hits: Also never serialize `_index` key for parent/child inner hits as the _index is always the same of the parent search hit	2016-06-21 18:23:40 +02:00
Adrien Grand	969e953645	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-21 18:04:58 +02:00
Adrien Grand	6177c0a900	Upgrade `string` fields to `text`/`keyword` even if `include_in_all` is set. #19004 Closes #18974	2016-06-21 17:59:16 +02:00
Jason Tedor	7b68d44ddf	Read Elasticsearch manifest via URL This commit modifies reading the Elasticsearch jar manifest via the URL instead of converting the URL to an NIO path for increased portability. Relates #18999	2016-06-21 11:14:48 -04:00
javanna	886cb37efb	Merge branch 'master' into feature/http_client	2016-06-21 15:53:37 +02:00
Jim Ferenczi	881afcba60	Fixed tests that failed now that BM25 is the default similarity.	2016-06-21 15:42:42 +02:00
Martijn van Groningen	5ad2fdaa8e	inner_hits: Don't include `_id`, `_type` and `_index` keys in search response for inner hits Closes #18091	2016-06-21 14:13:38 +02:00
Jim Ferenczi	9d685f6876	Fix ut: remap default to classic similarity for indices created before 5.0.	2016-06-21 12:05:44 +02:00
Jim Ferenczi	423291b6bc	Change default similarity to BM25 The default similarity was set to `classic` which refers to TFIDF and has not been moved after the upgrade to Lucene 6. Though moving to BM25 could have some downside for queries that relies on coordination factor (match_query, multi_match_query) ? relates #18944	2016-06-21 11:29:36 +02:00
Martijn van Groningen	82f7bfad98	ingest: merged o.e.ingest.core with o.e.ingest and in ingest-common module added o.e.ingest.common package and moved all code to that package.	2016-06-21 09:24:00 +02:00
Boaz Leskes	4401517b85	Revert #18839 as it causes file leaks ``` > Throwable #1: java.lang.RuntimeException: file handle leaks: [SeekableByteChannel(/var/lib/jenkins/workspace/elastic+elasticsearch+master+g1gc/core/build/testrun/integTest/J0/temp/org.elasticsearch.search.suggest.CompletionSuggestSearch2xIT_518545A20D129C8C-001/tempDir-001/data/nodes/1/indices/4sTECv6WSJOJsw9L4CGamg/0/index/segments_1), SeekableByteChannel(/var/lib/jenkins/workspace/elastic+elasticsearch+master+g1gc/core/build/testrun/integTest/J0/temp/org.elasticsearch.search.suggest.CompletionSuggestSearch2xIT_518545A20D129C8C-001/tempDir-001/data/nodes/1/indices/4sTECv6WSJOJsw9L4CGamg/0/index/segments_1)] > at __randomizedtesting.SeedInfo.seed([518545A20D129C8C]:0) > at org.apache.lucene.mockfile.LeakFS.onClose(LeakFS.java:63) > at org.apache.lucene.mockfile.FilterFileSystem.close(FilterFileSystem.java:77) > at org.apache.lucene.mockfile.FilterFileSystem.close(FilterFileSystem.java:78) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.Exception > at org.apache.lucene.mockfile.LeakFS.onOpen(LeakFS.java:46) > at org.apache.lucene.mockfile.HandleTrackingFS.callOpenHook(HandleTrackingFS.java:81) > at org.apache.lucene.mockfile.HandleTrackingFS.newByteChannel(HandleTrackingFS.java:271) > at org.apache.lucene.mockfile.FilterFileSystemProvider.newByteChannel(FilterFileSystemProvider.java:212) > at org.apache.lucene.mockfile.HandleTrackingFS.newByteChannel(HandleTrackingFS.java:240) > at java.nio.file.Files.newByteChannel(Files.java:361) > at java.nio.file.Files.newByteChannel(Files.java:407) > at org.apache.lucene.store.SimpleFSDirectory.openInput(SimpleFSDirectory.java:77) > at org.apache.lucene.store.FilterDirectory.openInput(FilterDirectory.java:94) > at org.apache.lucene.util.LuceneTestCase.slowFileExists(LuceneTestCase.java:2695) > at org.apache.lucene.store.MockDirectoryWrapper.openInput(MockDirectoryWrapper.java:737) > at org.apache.lucene.store.FilterDirectory.openInput(FilterDirectory.java:94) > at org.elasticsearch.common.lucene.Lucene$1.doBody(Lucene.java:237) > at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:685) > at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:637) > at org.elasticsearch.common.lucene.Lucene.checkSegmentInfoIntegrity(Lucene.java:242) > at org.elasticsearch.index.store.Store$MetadataSnapshot.loadMetadata(Store.java:847) > at org.elasticsearch.index.store.Store$MetadataSnapshot.<init>(Store.java:740) > at org.elasticsearch.index.store.Store.getMetadata(Store.java:260) > at org.elasticsearch.index.store.Store.getMetadata(Store.java:240) > at org.elasticsearch.index.shard.IndexShard.doCheckIndex(IndexShard.java:1310) > at org.elasticsearch.common.util.CancellableThreads.executeIO(CancellableThreads.java:102) > at org.elasticsearch.index.shard.IndexShard.checkIndex(IndexShard.java:1288) > at org.elasticsearch.index.shard.IndexShard.internalPerformTranslogRecovery(IndexShard.java:921) > at org.elasticsearch.index.shard.IndexShard.skipTranslogRecovery(IndexShard.java:964) > at org.elasticsearch.indices.recovery.RecoveryTarget.prepareForTranslogOperations(RecoveryTarget.java:297) > at ```	2016-06-21 08:45:46 +02:00
Mike McCandless	ebc3c17c34	add indices flag to nodes info request; use boolean to express 'null' indexing buffer value on the wire	2016-06-20 14:20:23 -04:00
Simon Willnauer	9196ff255e	Add extra ctor to FilterClient to support Guice proxies just don't ask it's bad but some plugins are so involved they need this. Closes #the_issue_that_never_existed	2016-06-20 16:47:24 +02:00
Simon Willnauer	5746d96f42	Catch ClosedByInterruptException when interrupting check index	2016-06-20 14:37:13 +02:00
Adrien Grand	93415d4506	Expose MMapDirectory.preLoad(). #18880 The MMapDirectory has a switch that allows the content of files to be loaded into the filesystem cache upon opening. This commit exposes it with the new `index.store.pre_load` setting.	2016-06-20 13:42:56 +02:00
Simon Willnauer	459665914b	Detach BigArrays from Guice (#18973 ) BigArrays can be fully constructed without Guice, this change cleans up it's creation and the mocking in MockNode.	2016-06-20 13:18:19 +02:00
Simon Willnauer	9506f60504	Improve error message if a setting is not found (#18920 ) Today we only emit that the setting wasn't found unless we have some DYM suggestions. Yet, if a setting is not found at all and there are no suggestions due to typos it's likely a removed setting or the plugin that is supposed to be configured is not installed. This commit adds some info text to the exception to help the user debugging the problem before opening bugreports. Instead of emitting: `unknown setting [foo.bar]` we now emit: `unknown setting [foo.bar] please check the migration guide for removed settings and ensure that the plugin you are configuring is installed` Relates to #18663	2016-06-20 13:10:35 +02:00
Boaz Leskes	0cb4e574a9	index shard should be able to cancel check index on close. (#18839 ) If someone sets `index.shard.check_on_startup`, indexing start up time can be slow (by design, it diligently goes and checks all data). If for some reason the shard is closed in that time, the store ref is kept around and prevents a new shard copy to be allocated to this node via the shard level locks. This is especially tricky if the shard is close due to a cancelled recovery which may re-restart soon. This commit adds a cancellable threads instance to each IndexShard and perform index checking underneath it, so it can be cancelled on close.	2016-06-20 12:16:22 +02:00
Simon Willnauer	e50314bb6e	Remove NodeClientModule and PluginsModule	2016-06-20 11:53:07 +02:00
Simon Willnauer	fb713774a1	Don't create enviroment more than once	2016-06-20 11:45:52 +02:00
Simon Willnauer	cfa4689445	Share test injector creation	2016-06-20 11:38:48 +02:00
Simon Willnauer	7fea5bd8e7	Remove obsolete Modules that can simply be inlined in node creation	2016-06-20 11:28:14 +02:00
Simon Willnauer	260f38fd76	Remove VersionModule and use Version#current consistently. We pretended to be able to ackt like a different version node for so long it's time to be honest and remove this ability. It's just confusing and where needed and tested we should build dedicated extension points.	2016-06-20 10:55:52 +02:00
Tanguy Leroux	98951b1203	Compile each Groovy script in its own classloader closes #18572	2016-06-20 08:17:09 +02:00
Florian Hopf	6b46bf13f0	Throw if the local node is not set This commit adds an IllegalStateException if attempting to get the local node from the cluster service when it is not set. Relates #18963	2016-06-19 17:25:18 -04:00
Boaz Leskes	8f96dd53aa	mark ESIndexLevelReplicationTestCase as abstract so it won't fail naming conventions	2016-06-18 19:39:37 +02:00
Boaz Leskes	61b7f49ed2	ESIndexLevelReplicationTestCase.java fix line lengths	2016-06-18 19:13:50 +02:00
Boaz Leskes	14cd8a6794	Introduce Replication unit tests using real shards (#18930 ) This commit introduce unit testing infrastructure to test replication operations using real index shards. This is infra is complementary to the full integration tests and unit testing of ReplicationOperation we already have. The new ESIndexLevelReplicationTestCase base makes it easier to test and simulate failure mode that require real shards and but do not need the full blow stack of a complete node. The commit also add a simple "nothing is wrong" test plus a test that checks we don't drop docs during the various stages of recovery. For now, only single doc indexing is supported but this can be easily extended in the future.	2016-06-18 18:53:47 +02:00
Boaz Leskes	535a245354	testSingleBatchSubmission - random numbers don't have to be unique	2016-06-18 18:44:54 +02:00
Simon Willnauer	4cb2bb635b	Remove forked joda time BaseDateTime class (#18953 ) This class was forked in 0.20 to remove a volatile keyword. While there is no issue attached to the commit, no evidence of the criticality of the change nor does it seem to be correct since we set this value internally as well I think this class should be used as is from joda time even if we have to pay the price of volatile reads. We can't do 3rd party optimization in our codebase that way it just not maintainable. This was added in `2280915d3c`	2016-06-18 17:06:18 +02:00
Simon Willnauer	420dc72124	Add ClusterName#value as the default instead of it's toString method	2016-06-18 08:40:01 +02:00
Jeff Evans	e9f2548ee0	Include script field even if it value is null Include script field even if it value is null. Closes #16408.	2016-06-17 16:41:25 -04:00
Jason Tedor	d09d89f8c5	Remove only node preference This commit removes the search preference _only_node as the same functionality can be obtained by using the search preference _only_nodes. This commit also adds a test that ensures that _only_nodes will continue to support specifying node IDs. Relates #18875	2016-06-17 15:27:46 -04:00
Areek Zillur	9bca264dcd	Merge branch 'master' of https://github.com/elastic/elasticsearch	2016-06-17 11:41:23 -04:00
Areek Zillur	9356a6090f	Merge branch 'master' into enhancement/rollover_api	2016-06-17 11:35:57 -04:00
Boaz Leskes	46b40f73b7	Inline reroute with process of node join/master election (#18938 ) In the past, we had the semantics where the very first cluster state a node processed after joining could not contain shard assignment to it. This was to make sure the node cleans up local / stale shard copies before receiving new ones that might confuse it. Since then a lot of work in this area, most notably the introduction of allocation ids and #17270 . This means we don't have to be careful and just reroute in the same cluster state change where we process the join, keeping things simple and following the same pattern we have in other places.	2016-06-17 17:32:38 +02:00
Jim Ferenczi	fb2a48d0f0	Revert "Remove support for sorting terms aggregation by ascending count" This is delayed after alpha4 since Kibana relies on it.	2016-06-17 17:14:01 +02:00
Simon Willnauer	bdb6dcea3a	Cleanup ClusterService dependencies and detached from Guice (#18941 ) This change removes some unnecessary dependencies from ClusterService and cleans up ClusterName creation. ClusterService is now not created by guice anymore.	2016-06-17 17:07:19 +02:00
Areek Zillur	545ffa7801	Merge branch 'master' into enhancement/rollover_api	2016-06-17 10:33:11 -04:00
Jim Ferenczi	755721953b	Remove support for sorting terms aggregation by ascending count closes #17614	2016-06-17 15:06:49 +02:00
Adrien Grand	712e387058	Rename PipelineAggregatorBuilder to PipelineAggregationBuilder. This is a follow-up to #18377.	2016-06-17 14:35:49 +02:00
javanna	af93533a17	Merge branch 'master' into feature/http_client	2016-06-17 13:50:18 +02:00
Jim Ferenczi	529c2ca13f	Add did-you-mean for plugin cli This commit adds error messages like: `Unknown plugin xpack, did you mean [x-pack]?` Closes #18896	2016-06-17 12:17:48 +02:00
Boaz Leskes	f256769179	Simplify NodeJoinController to make use of new cluster state batching infra (#18832 ) The NodeJoinController is responsible for processing joins from nodes, both normally and during master election. For both use cases, the class processes incoming joins in batches in order to be efficient and to accumulated enough joins (i.e., >= min_master_nodes) to seal an election and ensure the new cluster state can be committed. Since the class was written, we introduced a new infrastructure to support batch changes to the cluster state at the `ClusterService` level. This commit rewrites NodeJoinController to use that infra and be simpler. The PR also introduces a new concept to ClusterService allowing to submit tasks in batches, guaranteeing that all tasks submitted in a batch will be processed together (potentially with more tasks). On top of that I added some extra safety checks to the ClusterService, around potential double submission of task objects into the queue. This is done in preparation to revive #17811	2016-06-17 09:22:15 +02:00
Adrien Grand	600cbb6ab0	Upgrade to Lucene 6.1.0. #18926	2016-06-17 09:03:00 +02:00
Areek Zillur	6adffa6b7b	Merge branch 'master' into enhancement/rollover_api	2016-06-16 17:27:32 -04:00
Ryan Ernst	8196cf01e3	Merge branch 'master' into plugin_name_api	2016-06-16 13:49:28 -07:00
Ryan Ernst	96321d7749	Remove outtdated comment referring to name/description for Plugin class	2016-06-16 10:18:10 -07:00
Alexander Kazakov	9eea1b6833	Fix flat_settings REST parameter * Get XContent params from request in Nodes rest actions * Adding test for nodes info rest api	2016-06-16 10:03:51 -04:00
Simon Willnauer	b22c526b34	Cut over settings registration to a pull model (#18890 ) Today we have a push model for registering basically anything. All our extension points are defined on modules which we pass in to plugins. This is harder to maintain and adds unnecessary dependencies on the modules itself. This change moves towards a pull model where the plugin offers a getter kind of method to get the extensions. This will also help in the future if we need to pass dependencies to the extension points which can easily be defined on the method as arguments if a pull model is used.	2016-06-16 15:52:58 +02:00
Nik Everett	5aa4769b25	Move waitForTaskCompletion into TaskManager This allows for listening for the waiting to start using MockTaskManager. This allows us to work around a race condition in the TasksIT.	2016-06-16 09:45:46 -04:00
Daniel Mitterdorfer	0faa9409b3	Force test infra to use node client in NettyHttpRequestSizeLimitIT	2016-06-16 15:36:29 +02:00
Simon Willnauer	e442f07460	Remove dead code	2016-06-16 15:29:49 +02:00
Tanguy Leroux	3c9712794e	Merge pull request #18586 from a2lin/msearch_error_fix Adding status field in _msearch error request bodies	2016-06-16 14:31:39 +02:00
Jim Ferenczi	ad232aebbe	Set collection mode to breadth_first in the terms aggregation when the cardinality of the field is unknown or smaller than the requested size. closes #9825	2016-06-16 11:33:40 +02:00
Mike McCandless	3f221bf7cb	Add total_indexing_buffer/_in_bytes to nodes info API	2016-06-16 04:39:34 -04:00
Christoph Büscher	01004c72ba	Improve TimeZoneRoundingTests error messages Currently the error messages for failing tests in the TimeZoneRoundingTests test suite are hard to read because they usually report the actual end expected date in milliseconds utc (e.g. "Expected: <1414270860000L> but: was <1414270800000L>". This makes failing tests hard to read. This change introduces a new Matcher that can be used for equality checks for long dates but reports the error both as a formated date string according to some time zone and also as the actual long values, so you get messages like "Expected: 2014-10-26T00:01:00.000+03:00 [1414270860000] but: was "2014-10-26T00:00:00.000+03:00 [1414270800000]". Also clean cleaning up some helper methods and generally simplifying a few test cases. Otherwise this change shouldn't affect either the scope of the test or anything about the rounding implementation itself.	2016-06-16 10:10:04 +02:00
Adrien Grand	9ffb2ff6ba	Expose half-floats. #18887 They have been implemented in https://issues.apache.org/jira/browse/LUCENE-7289. Ranges are implemented so that the accuracy loss only occurs at index time, which means that if you are searching for values between A and B, the query will match exactly all documents whose value rounded to the closest half-float point is between A and B.	2016-06-16 09:46:39 +02:00
Simon Willnauer	18ff051ad5	Simplify ScriptModule and script registration (#18903 ) Registering a script engine or native scripts still uses Guice today and is much more complicated than needed. This change moves to a pull based model where script plugins have to implement a dedicated interface `ScriptPlugin` and defines simple getter returning instances rather than classes.	2016-06-16 09:35:13 +02:00
Alexander Lin	7d42e7e716	Closes #18013 . Added status field to _msearch response bodies.	2016-06-16 00:25:17 -07:00
Daniel Mitterdorfer	edf010f878	Force single-node cluster in NettyHttpRequestSizeLimitIT	2016-06-16 07:30:55 +02:00
Ryan Ernst	a4503c2aed	Plugins: Remove name() and description() from api In 2.0 we added plugin descriptors which require defining a name and description for the plugin. However, we still have name() and description() which must be overriden from the Plugin class. This still exists for classpath plugins. But classpath plugins are mainly for tests, and even then, referring to classpath plugins with their class is a better idea. This change removes name() and description(), replacing the name for classpath plugins with the full class name.	2016-06-15 17:12:22 -07:00
Nik Everett	dc2d7a2a6d	Test: wait for task to start before waiting for it to finish (#18902 )	2016-06-15 18:42:48 -04:00
Tal Levy	a26260fb72	new ScriptProcessor for Ingest (#18193 ) add new ScriptProcessor for executing ES Scripts within pipelines	2016-06-15 14:57:18 -07:00
Areek Zillur	1a59a8418a	Merge branch 'master' into enhancement/shrink_request_parser	2016-06-15 15:49:40 -04:00
Nik Everett	ab2a4a0d72	Fix exception on task not found Silly protected method....	2016-06-15 15:29:42 -04:00
Nik Everett	8cc848f31c	Allow FieldStatsRequest to disable cache	2016-06-15 15:10:46 -04:00
Ryan Ernst	8de90a66a1	Relax plugin id url heuristic, since java uses single slash instead of double	2016-06-15 11:43:38 -07:00
Ryan Ernst	9c65bd4ac4	Merge pull request #18876 from rjernst/plugin_install_unknown Emit nicer error message when trying to install unknown plugin	2016-06-15 09:56:43 -07:00
Daniel Mitterdorfer	cca4529b1c	Mute NHRSLIT while investigating	2016-06-15 18:02:31 +02:00
Simon Willnauer	7df5d05c62	Simplify SubFetchPhase interface (#18881 ) This interface used to have dedicated methods to prevent calling execute methods. These methods are unnecessary as the checks can simply be done inside the execute methods itself. This simplifies the interface as well as its usage.	2016-06-15 15:49:11 +02:00
Nik Everett	e09b6d7ba1	Test: Remove and untrue assertion Task status might change between list and get.	2016-06-15 09:02:36 -04:00
Daniel Mitterdorfer	f32b700472	Exclude admin / diagnostic requests from HTTP request limiting With this commit we exclude certain HTTP requests that are needed to inspect the cluster from HTTP request limiting to ensure these commands are processed even in critical memory conditions. Relates #17951, relates #18145, closes #18833	2016-06-15 14:29:46 +02:00
javanna	ace3a7b146	Merge branch 'master' into feature/http_client	2016-06-15 11:44:46 +02:00
Simon Willnauer	0f87afe2bf	[TEST] Fix Highlighters assertion - not guice injected anymore	2016-06-15 09:26:34 +02:00
Simon Willnauer	429dd3a876	Simplify FetchSubPhase registration and detach it from Guice (#18862 ) this commit removes FetchSubPhrase registration by class to registration by instance. No Guice binding needed anymore.	2016-06-15 09:13:02 +02:00
Ryan Ernst	1ecf14cee0	Add test for plugin install heuristic	2016-06-14 23:42:49 -07:00
Ryan Ernst	6db323164e	Plugins: Emit nicer error message when trying to install unknown plugin When installing plugins, we first try the elastic download service for official plugins, then try maven coordinates, and finally try the argument as a url. This can lead to confusing error messages about unknown protocols when eg an official plugin name is mispelled. This change adds a heuristic for determining if the argument in the final case is in fact a url that we should try, and gives a simplified error message in the case it is definitely not a url. closes #17226	2016-06-14 23:42:34 -07:00
Ryan Ernst	fa77d4d885	Test: make secure mock setup work with ibm jdk	2016-06-14 18:46:43 -07:00
Jason Tedor	e96722d91c	Add search preference to prefer multiple nodes The search preference _prefer_node allows specifying a single node to prefer when routing a request. This functionality can be enhanced by permitting multiple nodes to be preferred. This commit replaces the search preference _prefer_node with the search preference _prefer_nodes which supplants the former by specifying a single node and otherwise adds functionality. Relates #18872	2016-06-14 21:34:24 -04:00
Martijn van Groningen	14d6b04944	test: don't return 0, at least one request must be added to msearch request also the maxConcurrentSearchRequests() setter only support positive values.	2016-06-14 23:29:29 +02:00
Nik Everett	f7f377791f	Test: don't use 0 size for terms aggregation It is no longer allowed.	2016-06-14 17:27:45 -04:00
Nik Everett	c8931768ba	Clean up after test failure If the test fails we properly clean up. Also add a toString implementation so we get useful results on failure.	2016-06-14 16:16:19 -04:00
Nik Everett	3032a7c653	Cache FieldStats This caches FieldStats at the field level. For one off requests or for few indicies this doesn't save anything, but when there are 30 indices, 5 shards, 1 replica, 100 parallel requests this is about twice as fast as not caching. I expect lots of usage won't see much benefit from this but pointing kibana to a cluster with many indexes and shards, will be faster. Closes #18717	2016-06-14 13:57:18 -04:00
Nik Everett	e392e0b1df	Create get task API that falls back to the .tasks index This adds a get task API that supports GET /_tasks/${taskId} and removes that responsibility from the list tasks API. The get task API supports wait_for_complation just as the list tasks API does but doesn't support any of the list task API's filters. In exchange, it supports falling back to the .results index when the task isn't running any more. Like any good GET API it 404s when it doesn't find the task. Then we change reindex, update-by-query, and delete-by-query to persist the task result when wait_for_completion=false. The leads to the neat behavior that, once you start a reindex with wait_for_completion=false, you can fetch the result of the task by using the get task API and see the result when it has finished. Also rename the .results index to .tasks.	2016-06-14 13:37:34 -04:00
Simon Willnauer	ee2ba13cce	Register Highlighter instances instead of classes (#18859 ) This change detaches highlighter registration from Guice. It's just a small step into the right direction.	2016-06-14 17:04:58 +02:00
Colin Goodheart-Smithe	d7e3f9e4eb	#18854 Remove size 0 options in aggregations Remove size 0 options in aggregations	2016-06-14 15:32:42 +01:00
Christoph Büscher	32f141223d	Merge pull request #18800 from cbuescher/fix-interval-rounding-uneven Fix invalid rounding value for TimeIntervalRounding close to DST transitions	2016-06-14 16:22:11 +02:00
Christoph Büscher	03f5aa8ea0	Don't throw IllegalInstantException to determine DST gap By taking the logic from DateTimeZone#convertLocalToUTC(long, boolean) we can avoid throwing the exception.	2016-06-14 15:36:00 +02:00
Simon Willnauer	4d78f280ed	Remove dead code and dead parameters (#18855 )	2016-06-14 15:25:44 +02:00
Christoph Büscher	5abe1f7bb2	Fix invalid rounding value for TimeIntervalRounding close to DST transition There are edge cases where rounding a date to a certain interval using a time zone with DST shifts can currently cause the rounded date to be bigger than the original date. This happens when rounding a date closely after a DST start and the rounded date falls into the DST gap. Here is an example for CET time zone, where local time is set forward by one hour at 2016-03-27T02:00:00+01:00 to 2016-03-27T03:00:00.000+02:00: The date 2016-03-27T03:01:00.000+02:00 (1459040460000) which is just after the DST change is first converted to local time (1459047660000). If we then apply interval rounding for a 14m interval in local time, this takes us to 1459047240000, which unfortunately falls into the DST gap. When converting this back to UTC, joda provides options to throw exceptions on illegal dates like this, or correct this by adjusting the date to the new time zone offset. We currently do the later, but this leads to converting this illegal date back to 2016-03-27T03:54:00.000+02:00 (1459043640000), giving us a date that is larger than the original date we wanted to round. This change fixes this by using the "strict" option of 'convertLocalToUTC()' to detect rounded dates that fall into the DST gap. If this happens, we can use the time of the DST change instead as the interval start. Even before this change, intervals around DST shifts like this can be shorter than the desired interval. This, for example, happens when the requested interval width doesn't completely fit into the remaining time span when the DST shift happens. For example, using a 14m interval in UTC+1 (CET before DST starts) leads to the following valid rounding values around the time where DST happens: 2016-03-27T01:30:00+01:00 2016-03-27T01:44:00+01:00 2016-03-27T01:58:00+01:00 2016-03-27T02:12:00+01:00 2016-03-27T02:26:00+01:00 ... while the rounding values in UTC+2 (CET after DST start) are placed like this around the same time: 2016-03-27T02:40:00+02:00 2016-03-27T02:54:00+02:00 2016-03-27T03:08:00+02:00 2016-03-27T03:22:00+02:00 ... From this we can see then when we switch from UTC+1 to UTC+2 at 02:00 the last rounding value in UTC+1 is at 01:58 and the first valid one in UTC+2 is at 03:08, so even if we decide to put all the dates in between into one rounding interval, it will only cover 10 minutes. With this change we choose to use the moment of DST shift as an aditional interval separator, leaving us with a 2min interval from [01:58,02:00) before the shift and an 8min interval from [03:00,03:08) after the shift. This change also adds tests for the above example and adds randomization to the existing TimeIntervalRounding tests.	2016-06-14 14:59:51 +02:00
Colin Goodheart-Smithe	bec621d46f	changes from review	2016-06-14 13:45:03 +01:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Boaz Leskes	7a226122e3	MasterFaultDetection can leak an exception during shutdown	2016-06-14 01:16:17 +03:00
Ryan Ernst	991c2221a1	Set next version back to alpha4	2016-06-13 09:26:45 -07:00
Simon Willnauer	7379b17e61	Revert "Make random UUIDs reproducible in tests" This reverts commit `a25b8ee1bf`.	2016-06-13 11:14:30 +02:00
Christoph Büscher	f20928b146	Remove redundant parseElementst() method in RescorePhase and SuggestPhase The default implementation in SearchPhase does the same.	2016-06-13 10:20:23 +02:00
Martijn van Groningen	3b96055b23	msearch: Cap the number of searches the msearch api will concurrently execute By default the number of searches msearch executes is capped by the number of nodes multiplied with the default size of the search threadpool. This default can be overwritten by using the newly added `max_concurrent_searches` parameter. Before the msearch api would concurrently execute all searches concurrently. If many large msearch requests would be executed this could lead to some searches being rejected while other searches in the msearch request would succeed. The goal of this change is to avoid this exhausting of the search TP. Closes #17926	2016-06-13 10:13:08 +02:00
Nik Everett	387155559e	Make TimeValue Writeable instead of Streamable Writeable is better for immutable objects like TimeValue. Switch to writeZLong which takes up less space than the original writeLong in the majority of cases. Since we expect negative TimeValues we shouldn't use writeVLong.	2016-06-10 18:24:16 -04:00
Jason Tedor	86f1bedaab	Rename NettyTransportChannel#close This commit renames the NettyTransportChannel#close method to NettyTransportChannel#release to clarify the semantics.	2016-06-10 15:26:49 -04:00
Areek Zillur	df4a959d6c	removed support for customs from create index request	2016-06-10 12:06:50 -03:00
Areek Zillur	62f98767eb	removed redundant Fields class	2016-06-10 12:02:36 -03:00
Adrien Grand	44c653f5a8	Upgrade to lucene-6.1.0-snapshot-3a57bea.	2016-06-10 16:18:12 +02:00
Jason Tedor	a25b8ee1bf	Make random UUIDs reproducible in tests Today we use a random source of UUIDs for assigning allocation IDs, cluster IDs, etc. Yet, the source of randomness for this is not reproducible in tests. Since allocation IDs end up as keys in hash maps, this means allocation decisions and not reproducible in tests and this leads to non-reproducible test failures. This commit modifies the behavior of random UUIDs so that they are reproducible under tests. The behavior for production code is not changed, we still use a true source of secure randomness but under tests we just use a reproducible source of non-secure randomness. It is important to note that there is a test, UUIDTests#testThreadedRandomUUID that relies on the UUIDs being truly random. Thus, we have to modify the setup for this test to use a true source of randomness. Thus, this is one test that will never be reproducible but it is intentionally so. Relates #18808	2016-06-10 10:18:06 -04:00
Ali Beyad	43e07c0c88	Better handling of an empty shard's segments_N file When trying to restore a snapshot of an index created in a previous version of Elasticsearch, it is possible that empty shards in the snapshot have a segments_N file that has an unsupported Lucene version and a missing checksum. This leads to issues with restoring the snapshot. This commit handles this special case by avoiding a restore of a shard that has no data, since there is nothing to restore anyway. Closes #18707	2016-06-10 09:57:09 -04:00
Nik Everett	d733fb689b	Better error message when mapping configures null Closes #18803	2016-06-10 09:43:18 -04:00
Yannick Welsch	a2c506acd3	Fix sync flush total shards statistics (#18766 )	2016-06-10 13:39:47 +02:00
Yannick Welsch	6ea89004cd	Make IndicesClusterStateService unit testable (#17270 ) Testability of ICSS is achieved by introducing interfaces for IndicesService, IndexService and IndexShard. These interfaces extract all relevant methods used by ICSS (which do not deal directly with store) and give the possibility to easily mock all the store behavior away in the tests (and cuts down on dependencies).	2016-06-10 12:47:41 +02:00
javanna	9cbfa984fa	Merge branch 'master' into feature/http_client	2016-06-10 11:18:21 +02:00
Colin Goodheart-Smithe	1d76177510	Adds aggregation profiling (not including reduce phase) Add Aggregation profiling initially only be for the shard phases (i.e. the reduce phase will not be profiled in this change) This change refactors the query profiling class to extract abstract classes where it is useful for other profiler types to share code.	2016-06-10 09:02:07 +01:00
Jim Ferenczi	439b2a96e5	Add an index setting to limit the maximum number of slices allowed in a scroll request (default to 1024).	2016-06-10 09:43:32 +02:00
Daniel Mitterdorfer	7229c91289	Remove trace logging from NettyHttpRequestSizeLimitIT With this commit we revert back to normal behavior as the underlying issue has been fixed with #18627.	2016-06-10 07:46:04 +02:00
Nik Everett	e02d9f0945	Squash a race condition in RefreshListeners It presented as listeners never being called if you refresh at the same time as the listener is added. It was caught rarely by testConcurrentRefresh. mostly this is removing code and adding a comment: ``` Note that it is not safe for us to abort early if we haven't advanced the position here because we set and read lastRefreshedLocation outside of a synchronized block. We do that so that waiting for a refresh that has already passed is just a volatile read but the cost is that any check whether or not we've advanced the position will introduce a race between adding the listener and the position check. We could work around this by moving this assignment into the synchronized block below and double checking lastRefreshedLocation in addOrNotify's synchronized block but that doesn't seem worth it given that we already skip this process early if there aren't any listeners to iterate. ```	2016-06-09 13:48:41 -04:00
Areek Zillur	41d31541a6	Allow users to override the name for the rollover index	2016-06-09 13:43:19 -04:00
gfyoung	6f222b5be1	Support flags in pattern replace char filter Works just like pattern analyzer's flags param. Closes #18362.	2016-06-09 12:39:23 -04:00
Areek Zillur	a9f24ea2dc	fail rollover request if rollover index already exists	2016-06-09 12:38:12 -04:00
Nik Everett	fb52c258fd	[test] Check if RefreshListeners was called immediately Return a boolean from RefreshListeners, true if we called the listener inline and false if we didn't, and check it in the test.	2016-06-09 12:08:36 -04:00
Areek Zillur	9027e8a719	renamed simulated mode to dry_run mode	2016-06-09 11:55:10 -04:00
javanna	cf6e713d77	Merge branch 'master' into feature/http_client	2016-06-09 17:43:45 +02:00
Nik Everett	bd276ef5f1	[test] Check for listener calling error Failing to call a refresh listener is logger at WARN but that'll cause test failure. This adds explicit assertions that there are no errors.	2016-06-09 11:26:08 -04:00
Areek Zillur	ce211119d0	Add Shrink request source parser to parse create index request body Follow up to https://github.com/elastic/elasticsearch/pull/18732#discussion_r66407196	2016-06-09 10:41:59 -04:00
javanna	437c4f210b	rename ElasticsearchResponse to Response and ElasticsearchResponseException to ResponseException	2016-06-09 14:38:32 +02:00
Areek Zillur	94a7978ef6	add documentation	2016-06-08 18:38:02 -04:00
Jason Tedor	e9017f619e	Improve performance of applyDeletedShards This commit addresses a performance issue in IndicesClusterStateService#applyDeletedShards. Namely, the current implementation is O(number of indices * number of shards). This is because of an outer loop over the indices and an inner loop over the assigned shards, all to check if a shard is in the outer index. Instead, we can group the shards by index, and then just do a map lookup for each index. Testing this on a single-node with 2500 indices, each with 2 shards, creating an index before this optimization takes 0.90s and after this optimization takes 0.19s. Relates #18788	2016-06-08 16:08:00 -04:00
Simon Willnauer	9497b704bb	[TEST] Fix NodeEnvironmentTests on Windows - use Path.resolve instead of platform dependent path seperator	2016-06-08 21:40:35 +02:00
Areek Zillur	ae3eb15caa	fix rest tests	2016-06-08 15:03:35 -04:00
Areek Zillur	0c6d19c40c	add body support for create index request	2016-06-08 14:16:06 -04:00
Nik Everett	4b21157906	Remove setRefresh It has been replaced with `setRefreshPolicy` which has support for waiting until refresh with `setRefreshPolicy(WAIT_FOR)`. Related to #1063	2016-06-08 13:50:59 -04:00
Lee Hinman	92349f70e2	Merge remote-tracking branch 'dakrone/igs-false2'	2016-06-08 10:49:20 -06:00
Lee Hinman	c637fea84b	Change the default of `include_global_state` from true to false for restores This changes the default value to be false only for restore operations. Resolves #18569	2016-06-08 10:48:36 -06:00
Nik Everett	5161afe5e3	Support optional ctor args in ConstructingObjectParser You declare them like ``` static { PARSER.declareInt(optionalConstructorArg(), new ParseField("animal")); } ``` Other than being optional they follow all of the rules of regular `constructorArg()`s. Parsing an object with optional constructor args is going to be slightly less efficient than parsing an object with all required args if some of the optional args aren't specified because ConstructingObjectParser isn't able to build the target before the end of the json object.	2016-06-08 12:38:40 -04:00
Areek Zillur	dec0dcc30b	minor cleanup	2016-06-08 11:33:55 -04:00
Simon Willnauer	bec26015b2	[TEST] add a dedicated test for empty files	2016-06-08 15:40:14 +02:00
Christoph Büscher	a2372778dd	Fix problem with TimeIntervalRounding on DST end Due to an error in our current TimeIntervalRounding, two dates can round to the same key, even when they are 1h apart when using short interval roundings (e.g. 20m) and a time zone with DST change. Here is an example for the CET time zone: On 25 October 2015, 03:00:00 clocks are turned backward 1 hour to 02:00:00 local standard time. The dates "2015-10-25T02:15:00+02:00" (1445732100000) (before DST end) and "2015-10-25T02:15:00+01:00" (1445735700000) (after DST end) are thus 1h apart, but currently they round to the same value "2015-10-25T02:00:00.000+01:00" (1445734800000). This violates an important invariant of rounding, namely that the rounded value must be less or equal to the value that is rounded. It also leads to wrong histogram bucket counts because documents in [02:00:00+02:00, 02:20:00+02:00) go to the same bucket as documents from [02:00:00+01:00, 02:20:00+01:00). The problem happens because in TimeIntervalRounding#roundKey() we need to perform the rounding operation in local time, but on converting back to UTC we don't honor the original values time zone offset. This fix changes that and adds tests both for DST start and DST end as well as a test that demonstrates what happens to bucket sizes when the dst change is not evently divisibly by the interval.	2016-06-08 13:05:52 +02:00
Jim Ferenczi	712c77264d	Fix ut: make sure that the number of slices is bigger than 1 in the SliceBuilder tests.	2016-06-08 11:51:46 +02:00
Areek Zillur	134a4e5e52	incorporate feedback	2016-06-07 22:38:47 -04:00
Lee Hinman	762bbdbd0c	Revert "Change the default of `include_global_state` from true to false." This reverts commit `052a62250c`.	2016-06-07 15:07:37 -06:00
Lee Hinman	052a62250c	Change the default of `include_global_state` from true to false. Resolves #18569	2016-06-07 15:06:20 -06:00
Nik Everett	a405c2ba99	Switch QueryBuilders to new MatchPhraseQueryBuilder It was doing deprecated things with MatchQueryBuilder.	2016-06-07 14:35:23 -04:00
Lee Hinman	32bd869b28	Merge remote-tracking branch 'dakrone/no-cluster-name-in-path'	2016-06-07 10:14:23 -06:00
Lee Hinman	feb244c14a	Remove cluster name from data path Previously Elasticsearch used $DATA_DIR/$CLUSTER_NAME/nodes for the path where data is stored, this commit changes that to be $DATA_DIR/nodes. On startup, if the old folder structure is detected it will be used. This behavior will be removed in Elasticsearch 6.0 Resolves #17810	2016-06-07 10:13:48 -06:00
Jim Ferenczi	43b419b230	rehash the docvalues in DocValuesSliceQuery using BitMixer.mix instead of the naive Long.hashCode.	2016-06-07 17:58:32 +02:00
Martijn van Groningen	f611f1c99e	ingest: Move processors from core to ingest-common module. Folded grok processor into ingest-common module. The rest tests have been moved to ingest-common module as well, because these tests don't run in the rest-api-spec module but in the distribution:integ-test-zip module and adding a test plugin there felt just wrong to me. I think this is ok. I left a tiny ingest rest test behind in that tests with an empty pipeline. Removed messy tests, these tests were already covered in the rest tests Added ingest test plugin in test infra so that each module testing integration with ingest doesn't need write its own plugin Moved reindex ingest tests to qa module Closes #18490	2016-06-07 17:32:52 +02:00
trangvh	c0da8e4060	Fix some typos (#18746 ) * Update java-doc of SearchResponse.getProfileResults() * Fix a trivial typo in Reference document	2016-06-07 16:41:39 +02:00
Jim Ferenczi	692c42b23a	Fix ut	2016-06-07 16:29:18 +02:00
Jim Ferenczi	b9030bf6fe	Add the ability to partition a scroll in multiple slices. API: ``` curl -XGET 'localhost:9200/twitter/tweet/_search?scroll=1m' -d '{ "slice": { "field": "_uid", <1> "id": 0, <2> "max": 10 <3> }, "query": { "match" : { "title" : "elasticsearch" } } } ``` <1> (optional) The field name used to do the slicing (_uid by default) <2> The id of the slice By default the splitting is done on the shards first and then locally on each shard using the _uid field with the following formula: `slice(doc) = floorMod(hashCode(doc._uid), max)` For instance if the number of shards is equal to 2 and the user requested 4 slices then the slices 0 and 2 are assigned to the first shard and the slices 1 and 3 are assigned to the second shard. Each scroll is independent and can be processed in parallel like any scroll request. Closes #13494	2016-06-07 16:21:53 +02:00
Jason Tedor	c3e3a6337e	Use method name in bootstrap check might fork test This commit modifies the bootstrap check invocations in the might fork tests to use the underlying test name when setting up the logging prefix when invoking the bootstrap checks. This is done to give clear logs in case of failure.	2016-06-07 09:33:17 -04:00
Jason Tedor	75d3b13790	Merge pull request #18756 from jasontedor/on-out-of-memory-error Bootstrap check for OnOutOfMemoryError and seccomp	2016-06-07 09:26:57 -04:00
Simon Willnauer	c72ebba5de	checkstyle have your upper L	2016-06-07 11:05:28 +02:00
Simon Willnauer	0a5e06d402	fix javadocs	2016-06-07 10:22:11 +02:00
Simon Willnauer	b2c4c323e1	Allow `_shrink` to N shards if source shards is a multiple of N (#18699 ) Today we allow to shrink to 1 shard but that might not be possible due to too many document or a single shard doesn't meet the requirements for the index. The logic can be expanded to N shards if the source index shards is a multiple of N. This guarantees that there are not hotspots created due to different number of shards being shrunk into one.	2016-06-07 10:06:41 +02:00
Jason Tedor	acc9cea8f6	Fix compilation issue in RefreshListenersTests This commit fixes a compilation issue in RefreshListenersTests that arose from code being integrated into master, and then a large pull request refactoring the handling of thread pools was later merged into master.	2016-06-06 23:26:22 -04:00
Jason Tedor	e94408c0d2	Bootstrap check for OnError and seccomp This commit adds a bootstrap check for the JVM option OnError being in use and seccomp being enabled. These two options are incompatible because OnError allows the user to specify an arbitrary program to fork when the JVM encounters an fatal error, and seccomp enables system call filters that prevents forking.	2016-06-06 22:18:44 -04:00
Jason Tedor	da74323141	Register thread pool settings This commit refactors the handling of thread pool settings so that the individual settings can be registered rather than registering the top level group. With this refactoring, individual plugins must now register their own settings for custom thread pools that they need, but a dedicated API is provided for this in the thread pool module. This commit also renames the prefix on the thread pool settings from "threadpool" to "thread_pool". This enables a hard break on the settings so that: - some of the settings can be given more sensible names (e.g., the max number of threads in a scaling thread pool is now named "max" instead of "size") - change the soft limit on the number of threads in the bulk and indexing thread pools to a hard limit - the settings names for custom plugins for thread pools can be prefixed (e.g., "xpack.watcher.thread_pool.size") - remove dynamic thread pool settings Relates #18674	2016-06-06 22:09:12 -04:00
Areek Zillur	d8d60d294e	remove put method	2016-06-06 18:54:28 -04:00
Areek Zillur	1e329099f9	enhance rollover request	2016-06-06 18:50:45 -04:00
Jason Tedor	9695caa3fb	Bootstrap check for OnOutOfMemoryError and seccomp This commit adds a bootstrap check for the JVM option OnOutOfMemoryError being in use and seccomp being enabled. These two options are incompatible because OnOutOfMemoryError allows the user to specify an arbitrary program to fork when the JVM encounters an OutOfMemoryError, and seccomp enables system call filters that prevents forking. This commit also adds support for bootstrap checks that are always enforced, whether or not Elasticsearch is in production mode.	2016-06-06 17:31:42 -04:00
Jared Carey	080946c915	Fix typo in template validation message This commit addresses a typo in a template validation message in MetaDataIndexTemplateService.java. Relates #18754	2016-06-06 17:00:13 -04:00
Areek Zillur	d96fe20e3a	add named writable registry glue	2016-06-06 16:11:46 -04:00
Areek Zillur	3a2cc22aff	simplify conditions and rollover request	2016-06-06 16:10:36 -04:00
Simon Willnauer	d594d6c07c	[TEST] ensure green before we filter allocations otherwise follow up ensureGreen() will fail	2016-06-06 18:14:05 +02:00
Nik Everett	d8056c8213	Add support for waiting until a refresh occurs This adds support for setting the refresh request parameter to `wait_for` in the `index`, `delete`, `update`, and `bulk` APIs. When `refresh=wait_for` is set those APIs will not return until their results have been made visible to search by a refresh. Also it adds a `forced_refresh` field to the response of `index`, `delete`, `update`, and to each item in a bulk response. This will be true for requests with `?refresh` or `?refresh=true` and will be true for some requests (see below) with `refresh=wait_for` but ought to otherwise always be false. `refresh=wait_for` is implemented as a list of `Tuple<Translog.Location, Consumer<Boolean>>`s in the new `RefreshListeners` class that is managed by `IndexShard`. The dynamic, index scoped `index.max_refresh_listeners` setting controls a maximum number of listeners allowed in any shard. If more than that many listeners accumulate in the engine then a refresh will be forced, the thread that adds the listener will be blocked until the refresh completes, and then the listener will be called with a `forcedRefresh` flag so it knows that it was the "straw that broke the camel's back". These listeners are only used by `refresh=wait_for` and that flag manifests itself as `forced_refresh` being `true` in the response. About half of this change comes from piping async-ness down to the appropriate layer in a way that is compatible with the ongoing with with sequence ids. Closes #1063 You can look up the winding story of all the commits here: https://github.com/elastic/elasticsearch/pull/17986 Here are the commit messages in case they are intersting to you: commit 59a753b89109828d2b8f0de05cb104fc663cf95e Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 10:18:23 2016 -0400 Replace a method reference with implementing an interface Saves a single allocation and forces more commonality between the WriteResults. commit 31f7861a85b457fb7378a6f27fa0a0c171538f68 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 10:07:55 2016 -0400 Revert "Replace static method that takes consumer with delegate class that takes an interface" This reverts commit 777e23a6592c75db0081a53458cc760f4db69507. commit 777e23a6592c75db0081a53458cc760f4db69507 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 09:29:35 2016 -0400 Replace static method that takes consumer with delegate class that takes an interface Same number of allocations, much less code duplication. commit 9b49a480ca9587a0a16ebe941662849f38289644 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 08:25:38 2016 -0400 Patch from boaz commit c2bc36524fda119fd0514415127e8901d94409c8 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:46:27 2016 -0400 Fix docs After updating to master we are actually testing them. commit 03975ac056e44954eb0a371149d410dcf303e212 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:20:11 2016 -0400 Cleanup after merge from master commit 9c9a1deb002c5bebb2a997c89fa12b3d7978e02e Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:09:14 2016 -0400 Breaking changes notes commit 1c3e64ae06c07a85f7af80534fab88279adb30b4 Merge: 9e63ad6 `f67e580` Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:00:05 2016 -0400 Merge branch 'master' into block_until_refresh2 commit 9e63ad6de52d0b28f0b6d7203721baf1ebf6f56b Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 13:21:27 2016 -0400 Test for TransportWriteAction commit 522ecb59d39b3c9e8df0d3b8df34b9e7aeaf0ce9 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:30:18 2016 -0400 Document deprecation commit 0cd67b947f58867e704a1f0e66928a6fb5a11f11 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:26:23 2016 -0400 Deprecate setRefresh(boolean) Users should use `setRefresh(RefreshPolicy)` instead. commit aeb1be3f2c501990b33fb1f8230d496035f498ef Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:12:27 2016 -0400 Remove checkstyle suppression It is fixed commit 00d09a9caa638b6f90f4896b5502dd98d8fad56e Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:08:28 2016 -0400 Improve comment commit 788164b898a6ee2878a273961230122b7386c3c9 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:01:01 2016 -0400 S/ReplicatedWriteResponse/WriteResponse/ Now it lines up with WriteRequest. commit b74cf3fe778352b140355afcaa08d3d4412d749d Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 18:27:52 2016 -0400 Preserve `?refresh` behavior `?refresh` means the same things as `?refresh=true`. commit 30f972bdaeaaa0de6fe67746cdb8628aa86f5a8c Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 17:39:05 2016 -0400 Handle hanging documents If a document is added to the index during a refresh we weren't properly firing its refresh listener. This happened because the way we detect whether a refresh makes something visible or not is imperfect. It is ok because it always errs on the side of thinking that something isn't yet visible. So when a document arrives during a refresh the refresh listeners won't think it made it into a refresh when, often, it does. The way we work around this is by telling Elasticsearch that it ought to trigger a refresh if there are any pending refresh listeners even if there aren't pending documents to update. Lucene short circuits the refresh so it doesn't take that much effort, but the refresh listeners still get the signal that a refresh has come in and they still pick up the change and notify the listener. This means that the time that a listener can wait is actually slightly longer than the refresh interval. commit d523b5702b60c7ba309fb0dcf3cd3a4798f11960 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:34:01 2016 -0400 Explain Integer.MAX_VALUE commit 4ffb7c0e954343cc1c04b3d7be2ebad66d3a016b Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:27:39 2016 -0400 Fire all refresh listeners in a single thread Rather than queueing a runnable each. commit 19606ec3bbe612095df45eba734c5b7eb2709c01 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:09:52 2016 -0400 Assert translog ordering commit 6bb4e5c75e850f4a42518f06fbc955f7ec76d245 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 13:17:44 2016 -0400 Support null RefreshListeners in InternalEngine Just skip using it. commit 74be1480d6e44af2b354ff9ea47c234d4870b6c2 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 18:02:03 2016 -0400 Move funny ShardInfo hack for bulk into bulk This should make it easier to understand because it is closer to where it matters.... commit 2b771f8dabd488e056cfdc9989608d18264ddfb0 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:39:46 2016 -0400 Pull listener out into an inner class with javadoc and stuff commit 058481ad72019c0492b03a7a4ac32a48673697d3 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:33:42 2016 -0400 Fix javadoc links commit d2123b1cabf29bce8ff561d4a4c1c1d5b42bccad Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:28:09 2016 -0400 Make more stuff final commit 8453fc4f7850f6a02fb5971c17a942a3e3fd9f7b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:26:48 2016 -0400 Javadoc commit fb16d2fc7016c1e8e1621d481e8781c7ef43326c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 16:14:48 2016 -0400 Rewrite refresh docs commit 5797d1b1c4d233c0db918c0d08c21731ddccd05e Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 15:02:34 2016 -0400 Fix forced_refresh flag It wasn't being set. commit 43ce50a1de250a9e073a2ca6cbf55c1b4c74b11b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 14:02:56 2016 -0400 Delay translog sync and flush until after refresh The sync might have occurred for us during the refresh so we have less work to do. Maybe. commit bb2739202e084703baf02cfa58f09517598cf14e Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 13:08:08 2016 -0400 Remove duplication in WritePrimaryResult and WriteReplicaResult commit 2f579f89b4867a880396f2e7fcffc508449ff2de Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 12:19:05 2016 -0400 Clean up registration of RefreshListeners commit 87ab6e60ca5ba945bf0fba84784b2bbe53506abf Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 11:28:30 2016 -0400 Shorten lock time in RefreshListeners Also use null to represent no listeners rather than an empty list. This saves allocating a new ArrayList every refresh cycle on every index. commit 0d49d9c5720dadfb67da3fa760397bf6d874601c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 10:46:18 2016 -0400 Flip relationship between RefreshListeners and Engine Now RefreshListeners comes to Engine from EngineConfig. commit b2704b8a39382953f8f91a9743e894ee289f7514 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:37:58 2016 -0400 Remove unused imports Maybe I added them? commit 04343a22647f19304d9dc716b3fac9b183227f63 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:37:52 2016 -0400 Javadoc commit da1e765678890a02d61d8a29aa433274beb5e00c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:26:35 2016 -0400 Reply with non-null Also move the fsync and flush to before the refresh listener stuff. commit 5d8eecd0d904b497844b4c81c46477bd6178ed3a Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 08:58:47 2016 -0400 Remove funky synchronization in AsyncReplicaAction commit 1ec71eea0f4e1228ae1497d982307be818ef4b65 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 08:01:14 2016 -0400 s/LinkedTransferQueue/ArrayList/ commit 7da36a4ceed2ccf7955138c3b005237fa41efcb4 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 07:46:38 2016 -0400 More cleanup for RefreshListeners commit 957e9b77007c32ee75dde152c6622bab065d5993 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 07:34:13 2016 -0400 /Consumer<Runnable>/Executor/ commit 4d8bf5d4a70dcc56150c8d8d14165cd23d308b3c Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 22:20:42 2016 -0400 explain commit 15d948a348089bb2937eec5ac4e96f3ec67dbe32 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 22:17:59 2016 -0400 Better.... commit dc28951d02973fc03b4d51913b5f96de14b75607 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 21:09:20 2016 -0400 Javadocs and compromises commit 8eebaa89c0a1ee74982fbe0d56d1485ca2ae09db Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 20:52:49 2016 -0400 Take boaz's changes to their logic conclusion and unbreak important stuff like bulk commit 7056b96ea412f275005b93e3570bcff895859ed5 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 15:49:32 2016 -0400 Patch from boaz commit 87be7eaed09a274cc6a99d1a3da81d2d7bf9dd64 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 15:49:13 2016 -0400 Revert "Move async parts of replica operation outside of the lock" This reverts commit 13807ad10b6f5ecd39f98c9f20874f9f352c5bc2. commit 13807ad10b6f5ecd39f98c9f20874f9f352c5bc2 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 22:53:15 2016 -0400 Move async parts of replica operation outside of the lock commit b8cadcef565908b276484f7f5f988fd58b38d8b6 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 16:17:20 2016 -0400 Docs commit 91149e0580233bf79c2273b419fe9374ca746648 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 15:17:40 2016 -0400 Finally! commit 1ff50c2faf56665d221f00a18d9ac88745904bf5 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 15:01:53 2016 -0400 Remove Translog#lastWriteLocation I wasn't being careful enough with locks so it wasn't right anyway. Instead this builds a synthetic Tranlog.Location when you call getWriteLocation with much more relaxed equality guarantees. Rather than being equal to the last Translog.Location returned it is simply guaranteed to be greater than the last translog returned and less than the next. commit 55596ea68b5484490c3637fbad0d95564236478b Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 14:40:06 2016 -0400 Remove listener from shardOperationOnPrimary Create instead asyncShardOperationOnPrimary which is called after all of the replica operations are started to handle any async operations. commit 3322e26211bf681b37132274ee158ae330afc28b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 17:20:02 2016 -0400 Increase default maximum number of listeners to 1000 commit 88171a8322a424e624d48960fb4c98dd43e4d671 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 16:40:57 2016 -0400 Rename test commit 179c27c4f829f2c6ded65967652cf85adaf2ae52 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 16:35:27 2016 -0400 Move refresh listeners into their own class They still live at the IndexShard level but they live on their own in RefreshListeners which interacts with IndexShard using a couple of callbacks and a registration method. This lets us test the listeners without standing up an entire IndexShard. We still test the listeners against an InternalEngine, because the interplay between InternalEngine, Translog, and RefreshListeners is complex and important to get right. commit d8926d5fc1d24b4da8ccff7e0f0907b98c583c41 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 11:02:38 2016 -0400 Move refresh listeners into IndexShard commit df91cde398eb720143a85a8c6fa19bdc3a74e07d Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 16:01:03 2016 -0400 unused import commit 066da45b08148b266e4173166662fc1b3f66ed53 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 15:54:11 2016 -0400 Remove RefreshListener interface Just pass a Translog.Location and a Consumer<Boolean> when registering. commit b971d6d3301c7522b2e7eb90d5d8dd96a77fa625 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 14:41:06 2016 -0400 Docs for setForcedRefresh commit 6c43be821eaf61141d3ec520f988aad3a96a3941 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 14:34:39 2016 -0400 Rename refresh setter and getter commit e61b7391f91263a4c4d6107bfbc2a828bbcc805c Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 22:48:09 2016 -0400 Trigger listeners even when there is no refresh Each refresh gives us an opportunity to pick up any listeners we may have left behind. commit 0c9b0477085c021f503db775640d25668e02f635 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 20:30:06 2016 -0400 REST commit 8250343240de7e63118c663a230a7a314807a754 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 19:34:22 2016 -0400 Switch to estimated count We don't need a linear time count of the number of listeners - a volatile variable is good enough to guess. It probably undercounts more than it overcounts but it isn't a huge problem. commit bd531167fe54f1bde6f6d4ddb0a8de5a7bcc18a2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 18:21:02 2016 -0400 Don't try and set forced refresh on bulk items without a response NullPointerExceptions are bad. If the entire request fails then the user has worse problems then "did these force a refresh". commit bcfded11515af5e0b3c3e36f3c2f73f5cd26512e Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 18:14:20 2016 -0400 Replace LinkedList and synchronized with LinkedTransferQueue commit 8a80cc70a76375a7593745884cb987535b37ca80 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 17:38:24 2016 -0400 Support for update commit 1f36966742f851b7328015151ef6fc8f95299af2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 15:46:06 2016 -0400 Cleanup translog tests commit 8d121bf35eb265b8a0aee9710afeb1b054a113d4 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 15:40:53 2016 -0400 Cleanup listener implementation Much more testing too! commit 2058f4a808762c4588309f21b13b677245832f2c Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:45:55 2016 -0400 Pass back information about whether we refreshed commit e445cb0cb91ebdbcfdbf566696edb2bf1c84a882 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:03:31 2016 -0400 Javadoc commit 611cbeeaeb458f4b428bfc43a1ee6652adf4baff Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:01:40 2016 -0400 Move ReplicationResponse now it is in the same package as its request commit 9919758b644fd73895fb88cd6a4909a8387eb2e2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:00:14 2016 -0400 Oh boy that wasn't working commit 247cb483c4459dea8e95e0e3bd2e4bf8d452c598 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 10:29:37 2016 -0400 Basic block_until_refresh exposed to java client and basic "is it plugged in" style tests. commit 46c855c9971cb2b748206d2afa6a2d88724be3ba Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 10:11:10 2016 -0400 Move test to own class commit a5ffd892d0a352ae7e9757f2640fc2a1fa656bf2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 07:44:25 2016 -0400 WIP commit 213bebb6ece11b85d17e44af9a54fc2e5e332d39 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 21:35:52 2016 -0400 Add refresh listeners commit a2bc7f30e6d4857a1224ef5a89909b36c8f33731 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 21:11:55 2016 -0400 Return last written location from refresh commit 85033a87551da89f36a23d4dfd5016db218e08ee Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 20:28:21 2016 -0400 Never reply to replica actions while you have the operation lock This last thing was causing periodic test failures because we were replying while we had the operation lock. Now, we probably could get away with that in most cases but the tests don't like it and it isn't a good idea to do network io while you have a lock anyway. So this prevents it. commit 1f25cf35e796835b3827b8a4110e09e5de61784c Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 19:56:18 2016 -0400 Cleanup commit 52c5f7c3f04710901f503334239a611c0e21c85a Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 19:33:00 2016 -0400 Add a listener to shard operations commit 5b142dc331214c8eef90587144f4b3f959f9eced Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 18:03:52 2016 -0400 Cleanup commit 3d22b2d7ceb473db339259452a7c4f117ce86069 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:59:55 2016 -0400 Push the listener into shardOperationOnPrimary commit 34b378943b8185451acf6350f661c0ad33b5836d Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:48:47 2016 -0400 Doc commit b42b8da968d42cc7414020c7b199606a5dcce50a Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:45:40 2016 -0400 Don't finish early if the primary finishes early We use a "fake" pending shard that we resolve when the replicas have all started. commit 0fc045b56e1e02a48c30383ac50a281d5af7e0b6 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:30:06 2016 -0400 Make performOnPrimary asyncS Instead of returning Tuple<Response, ReplicaRequest> it returns ReplicaRequest and takes a ActionListener<Response> as an argument. We call the listener immediately to preserve backwards compatibility for now. commit 80119b9a26ede96a865af45904c3ac69d5b19b59 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:51:53 2016 -0400 Factor out common code in shardOperationOnPrimary commit 0642083676702618f900fa842c08802a04c1a53e Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:32:29 2016 -0400 Factor out common code from shardOperationOnReplica commit 8bdc415fedaaa9f2d0c555590a13ec4699a7c3f7 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:23:28 2016 -0400 Create ReplicatedMutationRequest Superclass for index, delete, and bulkShard requests. commit 0f8fa846a2822c4293df32fed18c9b99660b39ff Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:10:30 2016 -0400 Create TransportReplicatedMutationAction It is the superclass of replication actions that mutate data: index, delete, and shardBulk. shardFlush and shardRefresh are replication actions but they do not extend TransportReplicatedMutationAction because they don't change the data, only shuffle it around.	2016-06-06 11:37:53 -04:00
Jason Tedor	200d76e6f0	Throw exception if using a closed transport client Today when attempting to use a closed transport client, an exception saying that no nodes are available is thrown. This is because when a transport client is closed, its internal list of nodes is cleared. But this exception is puzzling and can be made clearer. This commit changes the behavior so that attempting to execute a request using a closed transport client throws an illegal state exception. Relates #18722	2016-06-06 11:27:26 -04:00
Yannick Welsch	e3e8f10103	[TEST] fix assertion that index was fully deleted	2016-06-06 15:45:57 +02:00
Yannick Welsch	0a8afa2e72	Add back pending deletes (#18698 ) Triggering the pending deletes logic was accidentally removed in the clean up PR #18602.	2016-06-06 15:14:09 +02:00
Tanguy Leroux	4ca04d6f6c	Close SearchContext if query rewrite failed If a query failed to be rewritten and throws an exception, the SearchContext is not properly closed, skewing the ref count on the underlying Store.	2016-06-06 09:40:56 +02:00
javanna	b891c46657	[TEST] remove status matcher and hasStatus assertion All it does is checking the status code of a response, which can be done with a single line in each test	2016-06-03 23:25:17 +02:00
Jason Tedor	be0036542c	More complete exception message in settings tests This commit adds additional details to the exception message assertions in the YAML settings loader tests.	2016-06-03 16:18:46 -04:00
Areek Zillur	a30437f250	adds _rollover api	2016-06-03 16:09:24 -04:00
Jason Tedor	bbd5f26d45	Merge branch 'master' into rjernst-placeholder * master: (911 commits) [TEST] wait for yellow after setup doc tests (#18726) Fix recovery throttling to properly handle relocating non-primary shards (#18701) Fix merge stats rendering in RestIndicesAction (#18720) [TEST] mute RandomAllocationDeciderTests.testRandomDecisions Reworked docs for index-shrink API (#18705) Improve painless compile-time exceptions Adds UUIDs to snapshots Add test rethrottle test case for delete-by-query Do not start scheduled pings until transport start Adressing review comments Only filter intial recovery (post API) when shrinking an index (#18661) Add tests to check that toQuery() doesn't return null Removing handling of null lucene query where we catch this at parse time Handle empty query bodies at parse time and remove EmptyQueryBuilder Mute failing assertions in IndexWithShadowReplicasIT until fix Remove allow running as root Add upgrade-not-supported warning to alpha release notes remove unrecognized javadoc tag from matrix aggregation module set ValuesSourceConfig fields as private Adding MultiValuesSource support classes and documentation to matrix stats agg module ...	2016-06-03 13:32:03 -04:00
javanna	f17f0f9247	rename ElasticsearchResponse#getFirstHeader to getHeader	2016-06-03 18:28:31 +02:00
javanna	c48b7a7a97	[TEST] create standard RestClient at first request and reuse it A RestClient instance is now created whenever EsIntegTestCase#getRestClient is invoked for the first time. It is then kept until the cluster is cleared (depending on the cluster scope of the test). Renamed other two restClient methods to createRestClient, as that instance needs to be closed and managed in the tests.	2016-06-03 18:21:19 +02:00
javanna	23a94bb974	[TEST] create standard RestClient at first request and reuse it A RestClient instance is now created whenever EsIntegTestCase#getRestClient is invoked for the first time. It is then kept until the cluster is cleared (depending on the cluster scope of the test). Renamed other two restClient methods to createRestClient, as that instance needs to be closed and managed in the tests.	2016-06-03 18:00:54 +02:00
javanna	4fa824f891	[TEST] add a lot of forgotten try with resources to wrap ElasticsearchResponses	2016-06-03 16:42:07 +02:00
javanna	eae914ae8e	Replace rest test client with low level RestClient We still have a wrapper called RestTestClient that is very specific to Rest tests, as well as RestTestResponse etc. but all the low level bits around http connections etc. are now handled by RestClient.	2016-06-03 16:01:07 +02:00
javanna	325b723930	[TEST] add rest client test dependency and replace usage of HttpRequestBuilder with RestClient in integration tests	2016-06-03 16:01:07 +02:00
Yannick Welsch	24a7b7224b	Fix recovery throttling to properly handle relocating non-primary shards (#18701 ) Relocation of non-primary shards is realized by recovering from the primary shard. Recovery throttling wrongly equates non-primary relocation as recovering a shard from the non-primary relocation source, however. Closes #18640	2016-06-03 14:11:34 +02:00
Simon Willnauer	6c28235b03	Fix merge stats rendering in RestIndicesAction (#18720 ) give the table description: ``` table.addCell("merges.total", "sibling:pri;alias:mt,mergesTotal;default:false;text-align:right;desc:number of completed merge ops"); table.addCell("pri.merges.total", "default:false;text-align:right;desc:number of completed merge ops"); table.addCell("merges.total_docs", "sibling:pri;alias:mtd,mergesTotalDocs;default:false;text-align:right;desc:docs merged"); table.addCell("pri.merges.total_docs", "default:false;text-align:right;desc:docs merged"); table.addCell("merges.total_size", "sibling:pri;alias:mts,mergesTotalSize;default:false;text-align:right;desc:size merged"); table.addCell("pri.merges.total_size", "default:false;text-align:right;desc:size merged"); ``` this is how it should be.	2016-06-03 13:42:09 +02:00
Britta Weber	78574d248c	[TEST] mute RandomAllocationDeciderTests.testRandomDecisions we have a pr already: https://github.com/elastic/elasticsearch/pull/18701	2016-06-03 11:37:22 +02:00
Ali Beyad	b720216395	Adds UUIDs to snapshots This commit adds a UUID for each snapshot, in addition to the already existing repository and snapshot name. The addition of UUIDs will enable more robust handling of the deletion of previous snapshots and lingering files from partially failed delete operations, on top of being able to uniquely track each snapshot. Closes #18228 Relates #18156	2016-06-02 17:01:48 -04:00
Jason Tedor	f67e5807ca	Do not start scheduled pings until transport start Today, scheduled pings in NettyTransport can start before the transport is started. Instead, these pings should not be scheduled until after the transport is started. This commit modifies NettyTransport so that this is the case. Relates #18702	2016-06-02 13:12:21 -04:00
Christoph Büscher	b6f26c0c37	Merge pull request #17624 Handle empty query bodies at parse time and remove EmptyQueryBuilder.	2016-06-02 18:40:36 +02:00
Christoph Büscher	9067407cdd	Adressing review comments	2016-06-02 16:19:23 +02:00
Simon Willnauer	22dfc41521	Only filter intial recovery (post API) when shrinking an index (#18661 ) Today we use `index.routing.allocation.include._id` to filter the allocation for the shrink target index. That has the sideeffect that the user has to delete that setting / change it once the primary has been recovered (shrink is done) This PR adds a dedicated filter that can only be set internally that only filters allocation for unassigned shards.	2016-06-02 15:38:51 +02:00
Christoph Büscher	cb145aec68	Removing handling of null lucene query where we catch this at parse time	2016-06-02 11:25:56 +02:00
Christoph Büscher	359f45988f	Handle empty query bodies at parse time and remove EmptyQueryBuilder Currently we support empty query clauses like the filter in "constant_score" : { "filter" : { } } How these clauses are handled depends on the surrounding query. They later are either ignored, converted to match all or no documents or passed up further in the query hierarchy. During parsing these claues are currently represented as EmptyQueryBuilders. When not handled anywhere else, these special cases need to be checked for on the shard when building the lucene query. This is trappy, so this PR changes the parsing of compound queries. Instead of returning QueryBuilder, the core query parsing method QueryShardContext#parseInnerQueryBuilder() now return an Optional which can be empty in the case of empty query clauses. This has the advantage of forcing callers to deal with this sooner or later. When encountering empty Optionals, compound query builders now have the choice to ignore them, pass them on or rewrite to a different query, depending on context.	2016-06-02 11:25:56 +02:00
Yannick Welsch	b2724c0d08	Mute failing assertions in IndexWithShadowReplicasIT until fix	2016-06-02 11:23:44 +02:00
Jason Tedor	3ccd59592a	Remove allow running as root This commit removes the escape hatch for running Elasticsearch as root. Relates #18694	2016-06-02 05:03:28 -04:00
Nicholas Knize	54575e55ca	set ValuesSourceConfig fields as private	2016-06-01 16:39:42 -05:00
Nicholas Knize	90b8f5d0d8	Adding MultiValuesSource support classes and documentation to matrix stats agg module	2016-06-01 16:39:42 -05:00
Jason Tedor	8e2a7d0fe1	Rename boostrap.mlockall to bootstrap.memory_lock The setting bootstrap.mlockall is useful on both POSIX-like systems (POSIX mlockall) and Windows (Win32 VirtualLock). But mlockall is really a POSIX only thing so the name should not be tied POSIX. This commit renames the setting to "bootstrap.memory_lock". Relates #18669	2016-06-01 16:25:51 -04:00
Yannick Welsch	21f0d7da1d	[TEST] Increase timeout to wait on folder deletion in IndexWithShadowReplicasIT	2016-06-01 18:47:25 +02:00
Yannick Welsch	c20bf5d747	[TEST] Fix tests that rely on assumption that data dirs are removed after index deletion (#18681 ) Relates to #18602	2016-06-01 17:02:09 +02:00
Yannick Welsch	bdd1d0703d	Acknowledge index deletion requests based on standard cluster state acknowledgment (#18602 ) Index deletion requests currently use a custom acknowledgement mechanism that wait for the data nodes to actually delete the data before acknowledging the request to the client. This was initially put into place as a new index with same name could only be created if the old index was wiped as we used the index name as data folder on the data nodes. With PR #16442, we now use the index uuid as folder name which avoids collision between indices that are named the same (deleted and recreated). This allows us to get rid of the custom acknowledgment mechanism altogether and rely on the standard cluster state-based acknowledgment instead. Closes #18558	2016-06-01 15:22:55 +02:00
Adrien Grand	3b8c9d87b4	Fix StoreRecoveryTests after 6.0.1 upgrade.	2016-06-01 10:54:51 +02:00
Jun Ohtani	871aaa102f	Merge branch 'fix/validate_index_template'	2016-06-01 17:49:01 +09:00
Adrien Grand	d182e171a4	Upgrade to Lucene 6.0.1.	2016-06-01 10:31:10 +02:00
Martijn van Groningen	766789b0f0	ingest: added `ignore_failure` option to all processors If this option is enabled on a processor it silently catches any processor related failure and continues executing the rest of the pipeline. Closes #18493	2016-06-01 10:29:12 +02:00
Simon Willnauer	966bfc3f2d	Throw IllegalStateException when handshake fails due to version or cluster mismatch (#18676 ) We do throw ConnectTransportException which is logged in trace level hiding a potentially important information when an old or wrong node wants to connect. We should throw ISE and log as warn.	2016-06-01 10:28:35 +02:00
Adrien Grand	a78c7d9911	AggregatorBuilder and PipelineAggregatorBuilder do not need generics. #18368 Similar reasoning as #18133 but for the aggs API. One important change is that I moved the base PipelineAggregatorBuilder class to the o.e.s.aggregations package instead of o.e.s.aggregations.pipeline so that the create method does not need to be public.	2016-06-01 10:19:30 +02:00
Simon Willnauer	88800e8e47	Move PageCacheRecycler into BigArrays (#18666 ) PageCacheRecycler is really just an implementation detail of BigArrays. There is no need to leak this class anywhere outside of it.	2016-06-01 09:43:11 +02:00
Jun Ohtani	fd76291131	Index Template: parse and validate mappings in template when template puts Share applying template with MetaDataCreateIndexService and MetaDataIndexTemplateService Add some unit test Collapse addMappingsToMapperService and move it to MapperService Extract validateTemplate method use expectThrows in testcase Add TODO comment Closes #2415	2016-06-01 16:41:48 +09:00
Ali Beyad	f12b10c48a	Make cluster health classes immutable and have them implement Writeable instead of Streamable Closes #18673	2016-05-31 23:04:13 -04:00
Ali Beyad	dac322a32d	Fix javadoc that stated a throws clause that didn't exist.	2016-05-31 19:56:55 -04:00
Ali Beyad	0efac76f01	Clarify the semantics of the BlobContainer interface This commit clarifies the behavior that must be adhered to by any implementors of the BlobContainer interface. This is done through expanded Javadocs. Closes #18157 Closes #15580	2016-05-31 19:22:55 -04:00
Jason Tedor	e21d8b31f1	Remove thread pool from page cache recycler The page cache recycler has a dependency on thread pool that was there for historical reasons but is no longer needed. This commit removes this now unneeded dependency. Relates #18664	2016-05-31 14:51:58 -04:00
Yannick Welsch	ad093a78c5	Fix ConcurrentModificationException in ReplicaShardAllocator	2016-05-31 19:42:17 +02:00
Clinton Gormley	7df5fcd4b6	Changed version back to 5.0.0	2016-05-31 18:34:54 +02:00
Robert Muir	0373c62a57	Merge pull request #18658 from rmuir/jodaTime improve date api for expressions/painless fields	2016-05-31 12:33:22 -04:00
Yannick Welsch	58a103aca2	Remove custom iterators from RoutingNodes (#18615 ) Contains a number of cleanups related to recent changes in RoutingNodes: - PR #17821 (Immutable ShardRouting) changed RoutingNode to use a map indexed by ShardId to manage ShardRouting elements. This means that we can directly select the right ShardRouting without iterating over all elements. This lets us get rid of RoutingNodeIterator and all kind of iterations all over the place. - Second cleanup is an extension of #18390 (Expose cluster state before reroute in RoutingAllocation instead of RoutingNodes). We should not reexpose RoutingTable in RoutingNodes and only use it in the constructor. This makes it clear that the RoutingTable is only used to construct the RoutingNodes and can diverge from it afterwards (only RoutingNodes is mutable). - Remove AllocationService.applyNewNodes() (that is already done as part of construction of RoutingNodes)	2016-05-31 16:37:58 +02:00
Simon Willnauer	82645563bf	Estimate shard size for shrinked indices (#18659 ) When we shrink an index we can estimate the shards size for the primary from the source index. This is important for allocation decisions since we should try out best to ensure we have enough space on the node we shrink the index.	2016-05-31 16:33:38 +02:00
Christoph Büscher	740bdc8d99	Treat zero token in `common` terms query as MatchNoDocsQuery Currently we return `null` when the query in a common terms query has zero tokens after analysis. It would be better if query builders `toQuery()` would never return null and return a meaningful lucene query instead. Since an ExtendedCommonTermsQuery with no terms gets rewritten to a MatchNoDocsQuery later, it is enough to leave out the check for zero tokens.	2016-05-31 16:28:03 +02:00
Michael McCandless	8f0109c2a5	Merge pull request #18651 from mikemccand/remove_iw_max_memory_stat Remove index_writer_max_memory stat from segment stats	2016-05-31 09:58:55 -04:00
Robert Muir	2d1eb89aef	improve date api for expressions/painless fields	2016-05-31 09:32:33 -04:00
Adrien Grand	adf4712164	Make ip fields backward-compatible at query time. #18593 The fact that ip fields used a different doc values representation in 2.x causes issues when querying 2.x and 5.0 indices in the same request. This changes 2.x doc values on ip fields/2.x to be hidden behind binary doc values that use the same encoding as 5.0. This way the coordinating node will be able to merge shard responses that have different major versions. One known issue is that this makes sorting/aggregating slower on ip fields for indices that have been generated with elasticsearch 2.x.	2016-05-31 14:50:00 +02:00
Mike McCandless	5c525e6606	Remove index_writer_max_memory stat from segment stats	2016-05-31 06:29:29 -04:00
Simon Willnauer	867d35c084	Never trip circuit breaker in liveness request (#18627 ) We don't want transport clients to disconnect due to circuit breakers preventing the liveness request from executing. Relates to #17951	2016-05-31 11:31:52 +02:00
Simon Willnauer	502a775a7c	Add primitive to shrink an index into a single shard (#18270 ) This adds a low level primitive operations to shrink an existing index into a new index with a single shard. This primitive expects all shards of the source index to allocated on a single node. Once the target index is initializing on the shrink node it takes a snapshot of the source index shards and copies all files into the target indices data folder. An [optimization](https://issues.apache.org/jira/browse/LUCENE-7300) coming in Lucene 6.1 will also allow for optional constant time copy if hard-links are supported by the filesystem. All mappings are merged into the new indexes metadata once the snapshots have been taken on the merge node. To shrink an existing index all shards must be moved to a single node (one instance of each shard) and the index must be read-only: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_settings' -d '{ "settings" : { "index.routing.allocation.require._name" : "shrink_node_name", "index.blocks.write" : true } } ``` once all shards are started on the shrink node. the new index can be created via: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_shrink/logs_single_shard' -d '{ "settings" : { "index.codec" : "best_compression", "index.number_of_replicas" : 1 } }' ``` This API will perform all needed check before the new index is created and selects the shrink node based on the allocation of the source index. This call returns immediately, to monitor shrink progress the recovery API should be used since all copy operations are reflected in the recovery API with byte copy progress etc. The shrink operation does not modify the source index, if a shrink operation should be canceled or if the shrink failed, the target index can simply be deleted and all resources are released.	2016-05-31 10:41:44 +02:00
Ryan Ernst	c911f4d951	Merge pull request #18646 from rjernst/jarhell_evil_tests Tests: Remove unnecessary evil jarhell tests	2016-05-31 00:08:36 -07:00
Adrien Grand	9570feaf8d	Process dynamic templates in order. #18638 When calling `findTemplateBuilder(context, currentFieldName, "text", null)`, elasticsearch ignores all templates that have a `match_mapping_type` set since no dynamic type is provided (the last parameter, which is null in that case). So this should only be called _last_. Otherwise, if a path-based template matches, it will have precedence over all type-based templates. Closes #18625	2016-05-31 09:00:10 +02:00
Ryan Ernst	454de6a8f2	Tests: Remove unnecessary evil jarhell tests We have 3 evil tests for jarhell. They have been failing in java 9 because of how evil they are. The first checks the leniency we add for jarhell in the jdk itself. This is unecessary, since if the leniency wasn't there, we would already be failing all jarhell checks. The second is checking the compile version is compatible with the jdk. This is simpler since we don't need to fake the java version: we know 1.7 should be compatibile with both java 8 and 9, so we can use that as a constant. Finally the last test checks if the java version system property is broken. This is simply something we should not check, we have to trust that java specifies it correctly, and again, if it was broken, all jarhell checks would be broken.	2016-05-30 21:39:56 -07:00
Boaz Leskes	38bee27b11	testCorruptionOnNetworkLayerFinalizingRecovery should keep on trying to assign replicas In rare cases it can reach the limit set by index.allocation.max_retries	2016-05-30 21:37:34 +02:00
Simon Willnauer	f74a78940c	Improve phrase suggest test speed (#18633 ) There is no reason to read the entire marvel hero file to test the features, it might take several seconds to do so which is unnecessary. This commit also splits SearchSuggestTests into core and modules/mustache also add @Nighlty to forbidden API to make sure we don't use it since they won't run in CI these days.	2016-05-30 17:22:03 +02:00
Jason Tedor	04cae88ff4	Do not use Lucene SuppressForbidden Lucene SuppressForbidden is marked lucene.internal and should not be used outside of Lucene. This commit removes the uses of this class within Elasticsearch. Instead, org.elasticsearch.common.SuppressForbidden should be used, which was already the case in most places.	2016-05-30 10:57:33 -04:00
Boaz Leskes	a1c4086a17	No need to inject the ClusterService into IndicesService (#18639 ) It's already available where it's needed.	2016-05-30 16:55:44 +02:00
Britta Weber	fc1696822f	Fix filtering of node ids for TransportNodesAction (#18634 ) * Fix filtering of node ids for TransportNodesAction Don't mix up member variables with local variables in constructor. closes #18618	2016-05-30 15:26:20 +02:00
Yannick Welsch	148fd3f891	[TEST] mute testDeleteIndexStore test Issue: https://github.com/elastic/elasticsearch/issues/18558	2016-05-30 11:57:42 +02:00
Simon Willnauer	32e9a879b4	Use at least 2 docs to see deleted docs Closes #18623	2016-05-28 08:14:49 +02:00
Jason Tedor	df7b3c6e82	Fix translog replay multiple operations same doc Modifying the translog replay to not replay again into the translog introduced a bug for the case of multiple operations for the same doc. Namely, since we were no longer updating the version map for each operation, the second operation for a doc would be treated as a creation instead of as an update. This commit fixes this bug by placing these operations into version map. This commit includes a failing test case. Relates #18611	2016-05-27 09:43:19 -04:00
Christoph Büscher	f0020caf7c	Merge pull request #18589 : Improve TimeUnitRounding for edge cases and DST transitions	2016-05-27 15:37:20 +02:00
Simon Willnauer	98fd45dc02	Move DocStats under Engine to get more accurate numbers (#18587 ) Today we pull doc stats from an index reader which might not reflect reality. IndexWriter might have merged all deletes away but due to a missing refresh the stats are completely off. This change pulls doc stats from the IndexWriter directly instead of relying on refreshes to run regularly. Note: Buffered deletes are still not visible until the segments are flushed to disk.	2016-05-27 15:31:40 +02:00
Yannick Welsch	2b47a2643c	Only fail relocation target shard if failing source shard is a primary (#18574 ) If the relocation source fails during the relocation of a shard from one node to another, the relocation target is currently failed as well. For replica shards this is not necessary, however, as the actual shard recovery of the relocation target is done via the primary shard.	2016-05-27 15:28:57 +02:00
Christoph Büscher	29c970958c	Adding tests for derivatives on date histogram aggregation with time zones	2016-05-27 15:18:11 +02:00
Christoph Büscher	4439a86cb8	Improve TimeUnitRounding for edge cases and DST transitions Our current testing for TimeUnitRoundings rounding() and nextRoundingValue() methods that are used especially for date histograms lacked proper randomization for time zones. We only did randomized tests for fixed offset time zones (e.g. +01:00, -05:00) but didn't account for real world time zones with DST transitions. Adding those tests revealed a couple of problems with our current rounding logic. In some cases, usually happening around those transitions, rounding a value down could land on a value that itself is not a proper rounded value. Also sometimes the nextRoundingValue would not line up properly with the rounded value of all dates in the next unit interval. This change improves the current rounding logic in TimeUnitRounding in two ways: it makes sure that calling round(date) always returns a date that when rounded again won't change (making round() idempotent) by handling special cases happening during dst transitions by performing a second rounding. It also changes the nextRoundingValue() method to itself rely on the round method to make sure we always return rounded values for the unit interval boundaries. Also adding tests for randomized TimeUnitRounding that assert important basic properties the rounding values should have. For better understanding and readability a few of the pathological edge cases are also added as a special test case.	2016-05-27 15:18:11 +02:00
Jason Tedor	41710f1028	Upgrade joda-time to 2.9.4 This commit upgrades joda-time to version 2.9.4 to integrate a bug fix there into Elasticsearch. Relates #18609	2016-05-27 08:51:19 -04:00
Martijn van Groningen	0e9f3addd2	Nested inner hits shouldn't use relative paths Like on other places in the query dsl the full field name should be used. Before this change this wasn't the case for nested inner hits when source filtering was used. Highlighting has a workaround, which is now removed as the source of nested inner hits can only be refered by the full name. Closes #16653	2016-05-27 13:41:45 +02:00
Jason Tedor	cebbf0de41	Do not replay into translog on local recovery When performing a local recovery, the engine replays operations recovered from the translog into the translog again. These duplicated operations are eventually cleared after successful recovery on flush, but there is no need to play these operations into the translog at all. This commit modifies the local recovery process so as to not replay these operations into the translog. Relates #18547	2016-05-27 06:04:11 -04:00
Boaz Leskes	318a4e3ef6	Introduce dedicated master nodes in testing infrastructure (#18514 ) This PR changes the InternalTestCluster to support dedicated master nodes. The creation of dedicated master nodes can be controlled using a new `supportsMasterNodes` parameter to the ClusterScope annotation. If set to true (the default), dedicated master nodes will randomly be used. If set to false, no master nodes will be created and data nodes will also be allowed to become masters. If active, test runs will either have 1 or 3 masternodes	2016-05-27 08:44:20 +02:00
Igor Motov	fb763c1e8e	Add ability to store results for long running tasks The results of the tasks are stored in a special index .results	2016-05-26 19:49:13 -04:00
Robert Muir	3f06d9f3b8	Merge pull request #18600 from rmuir/new_script_exception replace ScriptException with a better one	2016-05-26 17:51:34 -04:00
Robert Muir	76ca4af561	move instanceof to catch block	2016-05-26 15:03:13 -04:00
Jason Tedor	d23db39445	Merge pull request #18594 from jasontedor/plugins-cleanup Plugins cleanup	2016-05-26 14:46:09 -04:00
Jason Tedor	f16f65741e	Fix when plugins directory is symlink This commit fixes an issue with the plugins directory being a symbolic link. Namely, the install plugins command attempts to always create the plugins directory just in case it does not exist. The JDK method used here guarantees that the directory is created, and an exception is not thrown if the directory could not be created because it already exists. The problem is that this JDK method does not respect symlinks so its internal existence checks fails, it proceeds to attempt to create the directory, but the directory creation fails because the symlink exists. This is documented as being not an issue. We work around this by checking if there is a symlink where we expect the plugins directory to be, and only attempt to create if not. We add a unit test that plugin installation to a symlinked plugins directory works as expected.	2016-05-26 14:10:32 -04:00
Yannick Welsch	f98ca5310c	Fix ReplicaShardAllocatorTests when unassigned reason is ALLOCATION_FAILED When mocking unassigned shards which have failed with reason ALLOCATION_FAILED we have to ensure that the failed allocation counter is strictly positive.	2016-05-26 19:01:43 +02:00
Ryan Ernst	a8a38c282a	Remove extra mostly duplicate readme file It looks like the readme was duplicated when plugins were merged back into the repo. We removed all these extra files from the plugins, this removes the remaining duplicate from core. closes #18597	2016-05-26 08:54:35 -07:00
Robert Muir	f037807117	replace ScriptException with a better one	2016-05-26 11:43:29 -04:00
Jason Tedor	d29844e597	Remove custom plugins path This commit removes the ability to specify a custom plugins path. Instead, the plugins path will always be a subdirectory called "plugins" off of the home directory.	2016-05-26 10:16:25 -04:00
Jason Tedor	0f529e10a8	Fix plugin command name in remove plugin command This commit fixes the name of the plugin command that is output when a user attempts to remove a plugin that does not exist.	2016-05-26 10:14:39 -04:00
Yannick Welsch	31b0777c91	Simplify delayed shard allocation (#18351 ) This commit simplifies the delayed shard allocation implementation by assigning clear responsibilities to the various components that are affected by delayed shard allocation: - UnassignedInfo gets a boolean flag delayed which determines whether assignment of the shard should be delayed. The flag gets persisted in the cluster state and is thus available across nodes, i.e. each node knows whether a shard was delayed-unassigned in a specific cluster state. Before, nodes other than the current master were unaware of that information. - This flag is initially set as true if the shard becomes unassigned due to a node leaving and the index setting index.unassigned.node_left.delayed_timeout being strictly positive. From then on, unassigned shards can only transition from delayed to non-delayed, never in the other direction. - The reroute step is in charge of removing the delay marker (comparing timestamp when node left to current timestamp). - A dedicated service DelayedAllocationService, reacting to cluster change events, has the responsibility to schedule reroutes to remove the delay marker. Closes #18293	2016-05-26 13:39:55 +02:00
Ryan Ernst	16d029aff7	Merge branch 'master' into staging_plugins	2016-05-25 14:47:12 -07:00
Ryan Ernst	9c15e0c56d	Merge pull request #18583 from rjernst/official_plugins Use resource files for list of modules and plugins	2016-05-25 14:42:24 -07:00
Ryan Ernst	45adab0cb8	Add test that x-pack is in official plugins list	2016-05-25 14:23:57 -07:00
Jason Tedor	9d39b05845	Remove deprecation suppression Failing the build on deprecation warnings was removed in `19b3ec88af`. This commit removes the suppressed deprecation warnings so that their use is surfaced in the build now. Relates #18582	2016-05-25 17:15:36 -04:00
Ryan Ernst	a9e7bdc54c	Plugins: use resource files for list of modules and plugins This adds modules.txt and plugins.txt to the core jar resource files, which the install plugin command statically loads, in place of the previously hardcoded lists (which have often gone out of date).	2016-05-25 13:42:24 -07:00
Ryan Ernst	478877edf7	Plugins: Changing staging property to be the hash instead of a boolean With the unified release, we will have staged releases based on a unified hash (hash of all the hashes), so using the elasticsearch hash for plugins staging will no longer work. This change makes the `es.plugins.staging` property take the staging hash it should use.	2016-05-25 12:36:31 -07:00
Jim Ferenczi	6d62f33702	Make doc_values accessible for _type `doc_values` for _type field are created but any attempt to load them throws an IAE. This PR re-enables `doc_values` loading for _type, it also enables `fielddata` loading for indices created between 2.0 and 2.1 since doc_values were disabled during that period. It also restores the old docs that gives example on how to sort or aggregate on _type field.	2016-05-25 18:56:13 +02:00
Yannick Welsch	1c59c7e349	Log warning if minimum_master_nodes is set to less than a quorum of master-eligible nodes (#15625 ) The setting minimum_master_nodes is crucial to avoid split brains in a cluster. In order to avoid data loss, it should always be configured to at least a quorum (majority) of master-eligible nodes. This commit adds a warning to the logs on the master node if the value is set to less than quorum of master-eligible nodes.	2016-05-25 16:49:08 +02:00
Tanguy Leroux	bdee8c2632	Disable XContent auto closing of object and arrays	2016-05-25 16:46:09 +02:00
Chris Earle	de6b4f35b1	Remove inaccurate Javadoc on Setting constructor The 'Setting' constructor has some outdated Javadoc that suggested that it would automatically apply 'Property.NodeScope' if no scope is supplied, but no scope is added in that case.	2016-05-25 09:28:34 -05:00
Simon Willnauer	eab3113204	Drop 1.x BWC and cut over to Writeable for Translog.Operation (#18565 ) We still maintain BWC for the translog operations back to 1.1 which is not supported in the current version anyway. This commit drops the bwc and moves the operations to the Writeable interface enforcing immutability.	2016-05-25 11:51:28 +02:00
Britta Weber	6862c48791	Merge pull request #18495 from brwe/geo-query-highlight-II skip all geo point queries in plain highlighter	2016-05-25 11:35:59 +02:00
Yannick Welsch	dee34c916c	Expand wildcards to closed indices in /_cat/indices (#18545 ) Closed indices are already displayed when no indices are explicitly selected. This commit ensures that closed indices are also shown when wildcard filtering is used. It also addresses another issue that is caused by the fact that the cat action is based internally on 3 different cluster states (one when we query the cluster state to get all indices, one when we query cluster health, and one when we query indices stats). We currently fail the cat request when the user specifies a concrete index as parameter that does not exist. The implementation works as intended in that regard. It checks this not only for the first cluster state request, but also the subsequent indices stats one. This means that if the index is deleted before the cat action has queried the indices stats, it rightfully fails. In case the user provides wildcards (or no parameter at all), however, we fail the indices stats as we pass the resolved concrete indices to the indices stats request and fail to distinguish whether these indices have been resolved by wildcards or explicitly requested by the user. This means that if an index has been deleted before the indices stats request gets to execute, we fail the overall cat request. The fix is to let the indices stats request do the resolving again and not pass the concrete indices. Closes #16419 Closes #17395	2016-05-25 10:02:14 +02:00
Tal Levy	0fa67e1538	Expose underlying processor to blame for thrown exception within CompoundProcessor (#18342 ) Fixes #17823	2016-05-24 14:25:40 -07:00
Jason Tedor	568c26a76c	Log OS and JVM on startup This commit adds the OS and JVM to the initial logline on startup. Relates #18557	2016-05-24 16:55:55 -04:00
Ali Beyad	105aee08b3	Removes multiple toXContent entry points for SnapshotInfo SnapshotInfo had a toXContent and an externalToXContent, the former for writing snapshot info to the snapshot blob and the latter for writing the snapshot info to the APIs. This commit unifies writing x-content to one method, toXContent, which distinguishes which format to write the snapshot info in based on the Params parameter. In addition, it makes use of the already existing snapshot specific params found in the BlobStoreFormat. Closes #18494	2016-05-24 15:44:22 -04:00
Britta Weber	b3a8c54928	Test for deadlock when relocating and indexing concurrently see #18553 Reproduces with gradle :core:integTest -Dtests.class=org.elasticsearch.recovery.RelocationIT -Dtests.method="testIndexAndRelocateConcurrently" -Dtests.iters=20 -Dtests.failfast=true -Dtests.logger.level=DEBUG -Dtests.seed=9BE7064B819064FA:C44B70F31707D081 -Dtests.timeoutSuite=60000!	2016-05-24 20:06:21 +02:00
Nik Everett	a93f578bf6	Move parsing of allocation commands into REST Port them to the ObjectParser. Don't let plugins register custom allocation commands	2016-05-24 11:59:05 -04:00
Jason Tedor	f210605af8	Add test for parsing fractional seconds This commit adds a test that ensures that strings containing a fractional number of seconds are correctly parsed into milliseconds. Relates #18548	2016-05-24 11:26:36 -04:00
Clinton Gormley	9c9bea9258	Set version to 5.0.0-alpha3 (#18550 ) * Set version to 5.0.0-alpha3 * Updated version in qa/backwards tests too	2016-05-24 16:46:05 +02:00
Tanguy Leroux	1f011f9dea	Remove Delete-By-Query plugin closes #18469	2016-05-24 13:28:20 +02:00
Martijn van Groningen	27cc2fe4dc	Moved the percolator from core to its own module Significant changes: * AbstractQueryTestCase has moved to the test framework module, in order for query builder tests in modules and plugins * Added support to AbstractQueryTestCase to register plugins * Lift the restriction that only one percolator could be added per index. This validation existed in MapperService, but because the percolator moved to a module it could no longer exist there. Instead of bringing it back it was removed. This validation existed since the percolator cache only supported one percolator query per document, since the percolator cache has been removed this restriction could removed as well. * While moving percolator tests to the new module, also removed a couple of tests for the deprecated percolate and mpercolate api. These APIs are now sugar APIs for bwc and rediect to the searvh and msearvh APIs. Some tests were still testing as if percolate and mpercolate API did the percolation, but this no longer the case and these tests could be removed.	2016-05-24 11:01:57 +02:00
Ryan Ernst	ef435d0099	Remove unnecessary boxing and use of deprecated Double ctors	2016-05-23 13:44:59 -07:00
Lee Hinman	bfce901edf	Merge remote-tracking branch 'dakrone/explain-add-fetch-in-progress'	2016-05-23 09:43:16 -06:00
Lee Hinman	8040ed0c16	Add whether the shard state fetch is pending to the allocation explain API If the shard state fetch is still pending, this will now return a message like: ```json { "shard" : { "index" : "i", "index_uuid" : "de1W1374T4qgvUP4a9Ieaw", "id" : 0, "primary" : false }, "assigned" : false, "shard_state_fetch_pending": true, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-04-26T16:34:53.227Z" }, "allocation_delay_ms" : 0, "remaining_delay_ms" : 0, "nodes" : { "z-CbkiELT-SoWT91HIszLA" : { "node_name" : "Brain Cell", "node_attributes" : { "testattr" : "test" }, "store" : { "shard_copy" : "NONE" }, "final_decision" : "NO", "final_explanation" : "the shard state fetch is pending", "weight" : 5.0, "decisions" : [ ] } } } ``` Adds the `shard_state_fetch_pending` field and uses the state to influence the final decision and final explanation. Relates to #17372	2016-05-23 09:42:57 -06:00
Adrien Grand	31e4c16ec3	Merge pull request #18509 from terradatum/epoch Support full range of Java Long for epoch DateTime	2016-05-23 12:27:38 +02:00
Adrien Grand	459916f5dd	Remove custom Base64 implementation. #18413 This replaces o.e.common.Base64 with java.util.Base64.	2016-05-23 11:32:42 +02:00
Tanguy Leroux	e7eb664c78	Change BlobPath.buildAsString() method	2016-05-23 10:50:40 +02:00
Adrien Grand	c5a9edf1c7	Add `Character.MODIFIER_SYMBOL` to the list of symbol categories. #18402 Closes #18388	2016-05-23 10:11:35 +02:00
Jim Ferenczi	238d390637	Fixes for _only_nodes preferences: * Handle multiple attributes/name (coma separated). * Shuffle the nodes that match the preferences. Fix #12546 Fix #12700	2016-05-23 09:44:52 +02:00
Adrien Grand	cb2cfdd9c0	Speed up named queries. #18470 Named queries have a performance bug when they are used with expensive queries that need to perform a lot of work up-front like fuzzy or range queries (including with points). The reason is that they currently re-create the weight and scorer for every hit. Instead we should create weights exactly once and use a single Scorer for all documents that are on the same segment.	2016-05-23 08:56:40 +02:00
Martijn van Groningen	c1a0929123	percolator: Add support dor MatchNoDocsQuery in query terms extract service Before the query extraction would have been aborted and the percolator query would be marked as unknown. This resulted in a situation that these queries always need to be evaluated by the memory index at search time. By adding support for this query many more percolator query candidate hits can skip the expensive memory index verification step. For example the `match` query parser returns a MatchNoDocsQuery if the query terms are removed by text analysis (lets query text only contained stop words).	2016-05-22 22:42:19 +02:00
G. Richard Bellamy	cf54903580	Support full range of Java Long for epoch DateTime Remove the arbitrary limit on epoch_millis and epoch_seconds of 13 and 10 characters, respectively. Instead allow any character combination that can be converted to a Java Long. Update the docs to reflect this change.	2016-05-22 13:08:20 -07:00
Ryan Ernst	37d36f2f4c	Merge branch 'master' into java9	2016-05-21 14:19:58 -07:00
Ryan Ernst	f01f15d3b8	Document the hack	2016-05-21 14:14:12 -07:00
Jason Tedor	ba14aca218	Refactor property placeholder use of env. vars This commit is a slight refactoring of the use of environment variables in replacing property placeholders. In commit `115f983827` the constructor for Settings.Builder was made package visible to provide a hook for tests to mock obtaining environment variables. But we do not need to go that far and can instead provide a small hook for this for tests without opening up the constructor. Thus, in this commit we refactor Settings.Builder#replacePropertyPlaceholders to a package-visible method that accepts a function providing environment variables by names. The public-visible method just delegates to this method passing in System::getenv and tests can use the package-visible method to mock the behavior they need without relying on external environment variables.	2016-05-21 17:06:56 -04:00
Ryan Ernst	41a5c0cfa1	Force java9 log4j hack in testing	2016-05-21 13:41:38 -07:00
Ryan Ernst	b26f5711d6	Fix log4j buggy java version detection	2016-05-21 13:18:12 -07:00
Ryan Ernst	1d40c4bbc1	Make java9 work again This change makes ES compile with java9 again, build 118. * There are a handful of changes due to failure to determine types during compile. * The attachment plugins which use tika needed to have tika upgraded in order to pickup fixes there for java 9. * azure discovery and s3 repository indirectly depend on jaxb, which is no longer in the default modules. They now add a jaxb dependency externally, and make JarHell allow for this package.	2016-05-21 09:41:51 -07:00
Lee Hinman	1d8441b681	Merge remote-tracking branch 'dakrone/remove-script-mode'	2016-05-20 15:22:14 -06:00
Jason Tedor	115f983827	Fix env. var placeholder test so it's reproducible This commit modifies the settings test for environment variables placeholders so that it is reproducible. The underlying issue is that the set of environment variables from system to system can vary, and this means that's we can never be sure that a failing test will be reproducible. This commit simplifies this test to not rely on external forces that could influence reproducibility. Relates #18501	2016-05-20 17:11:37 -04:00
Lee Hinman	fdfd2a2f18	Remove ScriptMode class in favor of boolean true/false This removes the ScriptMode class entirely, which was an enum with two options (ON and OFF) which essentially boiled down to true and false. Now the boolean values are used instead.	2016-05-20 15:01:30 -06:00
Jason Tedor	4c7993ea71	Netty request/response tracer should wait for send We write to Netty channels in an async fashion, but notify listeners via a transport service adapter before we are certain that the channel write succeeded. In particular, the tracer logs are implemented via a transport service adapter and this means that we can write tracer logs before a write was successful and in some cases the write might fail leading to misleading logs. This commit attaches the transport service adapters to channel writes as a listener so that the notification occurs only after a successful write. Relates #18500	2016-05-20 16:26:46 -04:00
Ali Beyad	923d90d434	Remove use of a Fields class in snapshot responses that contains x-content keys, in favor of declaring/using the keys directly. Closes #18497	2016-05-20 15:07:41 -04:00
Simon Willnauer	35e705877b	Limit retries of failed allocations per index (#18467 ) Today if a shard fails during initialization phase due to misconfiguration, broken disks, missing analyzers, not installed plugins etc. elasticsaerch keeps on trying to initialize or rather allocate that shard. Yet, in the worst case scenario this ends in an endless allocation loop. To prevent this loop and all it's sideeffects like spamming log files over and over again this commit adds an allocation decider that stops allocating a shard that failed more than N times in a row to allocate. The number or retries can be configured via `index.allocation.max_retry` and it's default is set to `5`. Once the setting is updated shards with less failures than the number set per index will be allowed to allocate again. Internally we maintain a counter on the UnassignedInfo that is reset to `0` once the shards has been started. Relates to #18417	2016-05-20 20:37:45 +02:00
Britta Weber	b493f0defe	skip all geo point queries in plain highlighter Geo queries and plain highlighter do not seem to work well together (https://issues.apache.org/jira/browse/LUCENE-7293) so we need to skip all geo related queries when we highlight. closes #17537	2016-05-20 20:07:10 +02:00
Jason Tedor	61f40156d3	Do not decode path when sending error Today when sending a REST error to a client, we send the decoded path. But decoding that path can already be the cause of the error in which case decoding it again will just throw an exception leading to us never sending an error back to the client. It would be better to send the entire raw path to the client and that is what we do in this commit. Relates #18477	2016-05-20 12:15:30 -04:00
Igor Motov	ce88d7f9ab	Fix race condition in snapshot initialization When a snapshot initialization fails, the create snapshot method may return before the snapshot metadata in the cluster state is removed. This can cause follow up snapshot-API related calls to fail due to a snapshot still running. This is causing CI failures when we try to delete indices that were participating in failed snapshot to a read-only repository. Closes #18121	2016-05-20 10:52:08 -04:00
Daniel Mitterdorfer	8b962bb234	Increase log level for NettyHttpRequestSizeLimitIT This test fails spuriosly in CI and is not reproducible locally. With this commit we temporarily increase the log level in a few packages that are suspected to reveal the cause.	2016-05-20 15:12:52 +02:00
Jason Tedor	140f9dfe5f	Fix scaling thread pool test bug This commit fixes a test bug in the scaling thread pool configuration test. In particular, the test randomization could select min and max for a thread pool configuration where both are equal to zero. This is a violation of the requirements of the ThreadPoolExecutor. With this commit, we now ensure that the max is bounded below by one.	2016-05-20 08:58:08 -04:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 for it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime, Today these changes are visible in near realtime. So updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive. Especially when there are many percolator queries then these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to how the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00
Christoph Büscher	d3fe22c990	Improve adding clauses to `span_near` and `span_or` query Currently the query builders expose the clauses of the span query as a modifiable list. Instead we should make the that getter return an unmodifiable list. Also renaming the method used to add a clause from `clause(spanQuery)` to `addClause(spanQuery)`.	2016-05-20 13:36:55 +02:00
Boaz Leskes	34ef5306d2	Snapshotting and sync could cause a dead lock TranslogWriter (#18481 ) #18360 introduced an extra lock in order to allow writes while syncing the translog. This caused a potential deadlock with snapshotting code where we first acquire the instance lock, followed by a sync (which acquires the syncLock). However, the sync logic acquires the syncLock first, followed by the instance lock. I considered solving this by not syncing the translog on snapshot - I think we can get away with just flushing it. That however will create subtleties around snapshoting and whether operations in them are persisted. I opted instead to have slightly uglier code with nest synchronized, where the scope of the change is contained to the TranslogWriter class alone.	2016-05-20 12:56:24 +02:00
Jason Tedor	c257e2c51f	Remove settings and system properties entanglement Today when parsing settings during bootstrap, we add a system property for every Elasticsearch setting. Additionally, settings can be set via system properties. This commit simplifies this situation. - settings are no longer propogated to system properties - system properties can not be used to set settings - the "es." prefix on settings is no longer required (nor permitted) - test logging has a dedicated system property (tests.logger.level) Relates #18198	2016-05-19 14:08:08 -04:00
Clinton Gormley	dc33a83231	Remove the preserve_original option from the FingerprintAnalyzer (#18471 ) The preserve_original option to the ASCIIFoldingFilter doesn't play well with the FingerprintFilter, as it ends up producing fingerprints like: "and consistent godel gödel is said sentence this yes" The goal of the OpenRefine algorithm is to product a small normalized ASCII fingerprint. There's no need to expose preserve_original.	2016-05-19 19:37:13 +02:00
Christoph Büscher	d2515727d0	Improve random DateTimeZone creation in tests We often require a random joda DateTimeZone in our tests. Currently there are a few options for generating such a random DateTimeZone from the set of available ids. Currently most random picks are not really reproducable across different jvms because they rely on order in the ids set implementation. The helper in DateProcessorFactoryTests thus performs a sort on the set of ids before random picking from the result, so I moved this to ESTestCase to make it publicly available and changed all other tests to use that method.	2016-05-19 18:12:48 +02:00
Boaz Leskes	4d6887075f	Log IndexShard.refresh logs under trace (#18435 ) We log them every second...	2016-05-19 17:12:37 +02:00
Christoph Büscher	757ccf00b2	Enforce MatchQueryBuilder#maxExpansions() to be strictly positive	2016-05-19 16:59:37 +02:00
Jeff Evans	3cf4214255	Add better error message when analyzer created without tokenizer or analyzer type (#18455 ) Closes #15492	2016-05-19 15:47:07 +02:00
Ali Beyad	fc6df23fea	Rename AggregatorBuilder and all of its subclasses to AggregationBuilder, in keeping consistent with the Java APIs. Closes #18377 Closes #18367	2016-05-19 09:25:29 -04:00
Tanguy Leroux	35d3bdab84	Add Google Cloud Storage repository plugin Closes #12880	2016-05-19 13:26:23 +02:00
Martijn van Groningen	e2691d7e5c	test: Don't generate a value of 0, because FuzzyQuery constructor does't allow that	2016-05-19 13:08:30 +02:00
Martijn van Groningen	050145f61b	parent/child: Allow adding additional child types that point to an existing parent type From 2.0 adding child types to existing types was forbidden because the`_parent` field stores the join between parent and child at index time. This is to protect from the fact that types that weren't a parent before become a parent while previously indexed documents would not have a join field. This would break the parent/child queries. The restriction was a bit too strict in the sense that also if a type was a parent type the restriction would forbid adding child types that point to a parent type (so child points already point to it). This change make sure that the restriction only applies if that type isn't a parent type already. Closes #17956	2016-05-19 11:05:17 +02:00
Simon Willnauer	d77c299cb9	Register `indices.query.bool.max_clause_count` setting (#18341 ) * Register `indices.query.bool.max_clause_count` setting This commit registers `indices.query.bool.max_clause_count` as a node level setting and removes support for its synonym setting `index.query.bool.max_clause_count`. Closes #18336	2016-05-19 10:42:35 +02:00
Simon Willnauer	2b972f1f75	FSync translog outside of the writers global lock (#18360 ) FSync translog outside of the writers global lock Today we aquire a write global lock that blocks all modification to the translog file while we fsync / checkpoint the file. Yet, we don't necessarily needt to block concurrent operations here. This can lead to a lot of blocked threads if the machine has high concurrency (lot os CPUs) but uses slow disks (spinning disks) which is absolutely unnecessary. We just need to protect from fsyncing / checkpointing concurrently but we can fill buffers and write to the underlying file in a concurrent fashion. This change introduces an additional lock that we hold while fsyncing but moves the checkpointing code outside of the writers global lock.	2016-05-19 09:40:10 +02:00
Simon Willnauer	9a9301f7d8	Remove dead BloomFilter code We don't use this class for a quite a while. lets trash it.	2016-05-18 23:00:57 +02:00
Clinton Gormley	cec9a94b96	Added version 2.3.3 with bwc indices	2016-05-18 17:33:21 +02:00
Christoph Büscher	7c665a010b	Fix TimeZoneRounding#nextRoundingValue for hour, minute and second units Currently rounding intervals obtained by nextRoundingValue() for hour, minute and second units can include an extra hour when happening at DST transitions that add an extra hour (eg CEST -> CET). This changes the rounding logic for time units smaller or equal to an hour to fix this. Closes #18326	2016-05-18 17:29:02 +02:00
markharwood	a846ff93e9	Aggregations fix: support include/exclude strings formatted for IP and date fields in terms and significant_terms aggregations. Closes #17705	2016-05-18 16:21:55 +01:00
Tanguy Leroux	27b65e90ca	Merge pull request #18443 from tlrx/fix-18433 Add missing builder.endObject() in FsInfo	2016-05-18 15:40:55 +02:00
Jason Tedor	cad0608cdb	Add GC overhead logging This commit adds simple GC overhead logging. This logging captures intervals where the JVM is spending a lot of time performing GC but it is not necessarily the case that each GC is large. For a start, this logging is simple and does not attempt to incorporate whether or not the collections were efficient (that is, we are only capturing that a lot of GC is happening, not that a lot of useless GC is happening). Relates #18419	2016-05-18 09:31:28 -04:00
Daniel Mitterdorfer	3954306af2	Merge pull request #18432 from danielmitterdorfer/fix-circuit-breaker-it Clear all caches after testing parent breaker	2016-05-18 15:23:15 +02:00
Tanguy Leroux	d7a31c8cf7	Add missing builder.endObject() in FsInfo closes #18433	2016-05-18 15:19:30 +02:00
Christoph Büscher	808ef6cec7	Fix parsing single `rescore` element in SearchSourceBuilder We are currently only parsing the array-syntax for the rescore part in SearchSourceBuilder ("rescore" : [ {...}, {...} ]) . We also need to support "rescore" : {...} Closes #18439	2016-05-18 15:08:28 +02:00
Clinton Gormley	c03dd8a290	Make the index-too-old exception more explicit (#18438 ) Closes #18418	2016-05-18 13:33:25 +02:00
Daniel Mitterdorfer	de3e7d161f	Add tests for null precondition check in BulkRequest Relates #18347 Checked with @javanna	2016-05-18 12:10:13 +02:00
Yannick Welsch	6dacac49b3	Simplify recovery logic in IndicesClusterStateService (#18405 ) - Moves recovery logic into IndexShard - Simplifies logic to cancel peer recovery of shard where recovery source node changed - Ensures routing entry is set on initialization of IndexShard	2016-05-18 10:51:57 +02:00
Daniel Mitterdorfer	c13df3b6c5	Clear all caches after testing parent breaker With this commit we clear all caches after testing the parent circuit breaker. This is necessary as caches hold on to circuit breakers internally. Additionally, due to usage of CircuitBreaker#addWithoutBreaking() in caches, it's even possible to go above the limit. As a consequence, all subsequent requests fall victim to the limit. Hence, right after the parent circuit breaker tripped, we clear all caches to reduce these circuit breakers to 0 again. We also exclude the clear caches transport request from limit check in order to ensure it will succeed. As this is typically a very small and low-volume request, it is deemed ok to exclude it. Closes #18325	2016-05-18 09:31:35 +02:00
Jason Tedor	ecce53f0df	Add I/O statistics on Linux This commit adds a variety of real disk metrics for the block devices that back Elasticsearch data paths. A collection of statistics are read from /proc/diskstats and are used to report the raw metrics for operations and read/write bytes. Relates #15915	2016-05-17 16:16:39 -04:00
Jason Tedor	584be0b3f8	Refactor JvmGcMonitorService for testing This commit refactors the JvmGcMonitorService so that it can be tested. In particular, hooks are added to verify that the JvmMonitorService correctly observes slow GC events, and that the JvmGcMonitorService logs the correct messages. Relates #18378	2016-05-17 13:05:36 -04:00
Yannick Welsch	9ba554dfd2	Expose previous cluster state only in RoutingAllocation (#18390 ) Instead of re-exposing index metadata and blocks in RoutingNodes (which is part of the cluster state before rerouting), expose it as part of the RoutingAllocation which is known to be only temporarily used during reroute.	2016-05-17 19:02:28 +02:00
$polyfractal$ polyfractal	c755a77022	[TEST] Use a reproducible source of randomness in shuffle	2016-05-17 12:55:07 -04:00
Zachary Tong	7c46b57ff2	Add a Sort ingest processor Sorts an array of values in ascending or descending order. If all elements are numerics, they will be sorted numerically. If values are strings, or mixtures of strings/numbers, the elements will be sorted lexicographically.	2016-05-17 12:06:48 -04:00
Colin Goodheart-Smithe	8c9ca8b518	Moves query profiler classes into their own package The change also renames fields and methods in the Profilers class. Note that I had to make ProfileResult a public class (it was package private before) because now classes that call it are in a different package.	2016-05-17 14:20:05 +01:00
Ali Beyad	3764789d96	Removed unused AllocationService member in TransportClusterAllocationExplainAction Closes #18381	2016-05-16 18:41:36 -04:00
Robert Muir	8d4c1befe5	Merge pull request #18364 from rmuir/nukeRunAsFloat Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-16 17:08:25 -04:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Colin Goodheart-Smithe	e37e8af5e2	Refactor of query profile classes to make way for other profile implementations	2016-05-16 16:15:50 +01:00
Colin Goodheart-Smithe	6eda9f5df6	more tests following review	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	0c449fee4a	small fix following rebase on master	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	66d0bdab0c	review comments	2016-05-16 09:07:22 +01:00
Colin Goodheart-Smithe	ab3121c871	Adds a methods to find (and dynamically create) the mappers for the parents of a field with dots in the field name	2016-05-16 09:07:22 +01:00
Robert Muir	8edf213492	Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-15 22:59:28 -04:00
Michael McCandless	0d570352dd	Merge pull request #18355 from mikemccand/iterables_flatten Iterables.flatten should not pre-cache the first iterator	2016-05-15 10:21:35 -04:00
Mike McCandless	8d7db7fd7a	remove whitespace	2016-05-14 18:50:10 -04:00
Mike McCandless	ded8b400b0	Fix concurrency bug in IMC that could lead to negative total indexing bytes	2016-05-14 18:47:26 -04:00
Mike McCandless	48dca45564	leave Iterables.flatten pre-caching the outer Iterable	2016-05-14 17:10:17 -04:00
Mike McCandless	53c2f8b4b6	improve javadocs	2016-05-14 13:46:34 -04:00
Mike McCandless	cf2af8961b	Iterables.flatten should not pre-cache the first iterator	2016-05-14 13:39:48 -04:00
Ali Beyad	d3d57da89f	Removes unused methods in the o/e/common/Strings class. Closes #18346	2016-05-14 08:08:30 -04:00
Daniel Mitterdorfer	009cf434a2	Merge pull request #18347 from danielmitterdorfer/bulk-req-precondition-check Add not-null precondition check in BulkRequest	2016-05-14 11:25:02 +02:00
Daniel Mitterdorfer	a2381640da	Add not-null precondition check in BulkRequest With this commit we add a precondition check to BulkRequest so we fail early if users pass `null` for the request object. For a more detailed discussion, see #12038. This supersedes #12038. Relates #12038.	2016-05-14 09:59:53 +02:00
Robert Muir	2028691e66	painless: improve exception stacktraces closes #18319	2016-05-13 15:40:45 -04:00
Lee Hinman	864ba8dac1	Merge remote-tracking branch 'dakrone/there-can-be-only-one2'	2016-05-13 10:28:41 -06:00
Adrien Grand	b4dec0ddbe	Remove dead code.	2016-05-13 18:27:12 +02:00
Jason Tedor	81898a2e3e	Avoid race while retiring executors Today, a race condition exists when retiring executors. Namely, if an executor is retired and then the thread pool is terminated, the retiring of the executor and the termination of the thread pool can race to remove the retired executor from the queue of retired executors. More precisely, when the executor is initially retired, it is placed on a queue of retired executors, and then removed when it is successfully shutdown. When the pool is terminated, it will also drain the queue of retired executors. This leads to a time-of-check-time-of-use race where the draining can see a retired executor on the queue but that retired executor can be removed upon successful shutdown of that executor. This leads to the draining attempting to remove an element from the queue when there is none. This commit addresses this race condition by instead safely polling the queue. Relates #18333	2016-05-13 12:26:18 -04:00
Lee Hinman	9bcdafedda	Allow only a single extension for a scripting engine Previously multiple extensions could be provided, however, this can lead to confusion with on-disk scripts (ie, "foo.js" and "foo.javascript") having different content. Only a single extension is now supported. The only language currently supporting multiple extensions was the Javascript engine ("js" and "javascript"). It now only supports the `.js` extension. Relates to #10598	2016-05-13 09:54:31 -06:00
Lee Hinman	d5b75491dc	Merge remote-tracking branch 'dakrone/remove-script-sandbox'	2016-05-13 09:50:39 -06:00
Britta Weber	e7c17fc9fb	[TEST] increase logger level until we know what is going on We have an issue for it too: https://github.com/elastic/elasticsearch/issues/18121	2016-05-13 17:37:17 +02:00
Christoph Büscher	a40c397c67	Don't allow `fuzziness` for `multi_match` types cross_fields, phrase and phrase_prefix Currently `fuzziness` is not supported for the `cross_fields` type of the `multi_match` query since it complicates the logic that blends the term queries that cross_fields uses internally. At the moment using this combination is silently ignored, which can lead to confusions. Instead we should throw an exception in this case. The same is true for phrase and phrase_prefix type. Closes #7764	2016-05-13 17:32:14 +02:00
Jason Tedor	786a6a00d9	Add test for fixed executor rejected count This commit adds a test that a fixed executors rejected count behaves as expected. In particular, we test that if we consume the executor, then stuff the executor queue, further tasks will be rejected and the rejected stats are updated appropriately. This test also asserts that if we resize the queue the rejected count is reset to zero. Relates #18301	2016-05-13 11:27:12 -04:00
Lee Hinman	efff3918d8	Remove support for mulitple languages per scripting engine	2016-05-13 09:24:31 -06:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
Adrien Grand	638da06c1d	Add back support for `ip` range aggregations. #17859 This commit adds support for range aggregations on `ip` fields. However it will only work on 5.x indices. Closes #17700	2016-05-13 17:22:01 +02:00
Clinton Gormley	f2797dbccb	Fixed grammar in index-too-old exception (#18327 )	2016-05-13 15:08:15 +02:00
Adrien Grand	61b1f4ad0b	Fix xcontent rendering of ip terms aggs. #18003 Currently terms on an ip address try to put their binary representation in the json response. With this commit, they would return a formatted ip address: ``` "buckets": [ { "key": "192.168.1.7", "doc_count": 1 } ] ```	2016-05-13 14:59:36 +02:00
Daniel Mitterdorfer	ddbfda2c68	Exclude specific transport actions from request size limit check We add support to explicitly exclude specific transport actions from the request size limit check. We also exclude the following request types currently: MasterPingRequest PingRequest	2016-05-13 14:21:24 +02:00
Britta Weber	d3efe37814	[TEST] mute test for now, we have an issue for it https://github.com/elastic/elasticsearch/issues/18325	2016-05-13 14:08:17 +02:00
Britta Weber	0d5a2f25d3	[TEST] muste test, we have an issue for it https://github.com/elastic/elasticsearch/issues/18293	2016-05-13 12:09:30 +02:00
Olivier Bourgain	df43230844	Add index name and uuid in IndexAlreadyExistsException default message Relates #18274	2016-05-12 14:32:30 -04:00
Jason Tedor	0830bd4885	Remove period in min master node check log message As most of our log messages are not sentences and do not end with periods, this commit removes a period from the end of the min master node bootstrap check log message.	2016-05-12 12:48:58 -04:00
Zachary Tong	5ee5cc25cc	Move AsciiFolding earlier in FingerprintAnalyzer filter chain Rearranges the FingerprintAnalyzer so that AsciiFolding comes earlier in the chain (after lowercasing, before stop removal, for maximum deduping power) Closes #18266	2016-05-12 09:34:15 -04:00
Robert Muir	3b66d40f7c	Merge pull request #18284 from rmuir/painless_value_aggregations _value support in painess?	2016-05-11 20:37:35 -04:00
Robert Muir	6b4e47bf96	this makes aggregations per-document _value fast (bypass hash put, hash get, etc) for painless. but i have no clue how to test it, it seems this feature never worked via REST? Should we drop the feature instead?	2016-05-11 15:39:00 -04:00
Ali Beyad	189341da10	CORS handling triggered whether User-Agent is a browser or not This commit ensures that if CORS is enabled, then Origin headers are checked regardless of whether the request came from a browser or not. In the past, we only proceeded with CORS checks if the User-Agent was a browser.	2016-05-11 15:30:15 -04:00
Ali Beyad	fced8dac72	When CORS is enabled, permit requests from the same origin as the request host, as the request is not a cross origin. Relates #18256	2016-05-11 15:24:36 -04:00
Ali Beyad	5189eb41c7	Dangling indices are not imported if a tombstone for the same index (same name and UUID) exists in the cluster state. This resolves a situation where if an index data folder was copied into a node's data directory while the node is running and that index had a tombstone in the cluster state, the index would still get imported. Closes #18250 Closes #18249	2016-05-11 12:56:19 -04:00
Adrien Grand	ce4af4be42	Remove dead code.	2016-05-11 18:38:07 +02:00
Adrien Grand	866a5459f0	Make significant terms work on fields that are indexed with points. #18031 It will keep using the caching terms enum for keyword/text fields and falls back to IndexSearcher.count for fields that do not use the inverted index for searching (such as numbers and ip addresses). Note that this probably means that significant terms aggregations on these fields will be less efficient than they used to be. It should be ok under a sampler aggregation though. This moves tests back to the state they were in before numbers started using points, and also adds a new test that significant terms aggs fail if a field is not indexed. In the long term, we might want to follow the approach that Robert initially proposed that consists in collecting all documents from the background filter in order to compute frequencies using doc values. This would also mean that significant terms aggregations do not require fields to be indexed anymore.	2016-05-11 16:52:58 +02:00
Jason Tedor	d0edd13f7b	Log setting key not setting object in IMC This commit modifies two logging statements in the IndexingMemoryController to log the key for the setting indices.memory.index_buffer_size instead of the object. Relates #18191	2016-05-11 10:37:23 -04:00
Yannick Welsch	ae01a4f7f9	Increased logging level for testDelayedAllocationChangeWithSettingTo100ms	2016-05-11 11:59:41 +02:00
Jason Tedor	5f0cc79562	Sort plugins in list plugins command This commit modifies the list plugins command to produce deterministic output by sorting the plugins by comparing paths. Relates #18260	2016-05-10 22:06:37 -04:00
Yannick Welsch	a0ffe6ea89	[TEST] Ensure creation of valid routing table An additional sanity check introduced by #17821 makes some tests fail. This check verifies that only one shard with same shard id is allocated to a node. This commit fixes a bug in ClusterStateCreationUtils which would construct a cluster state that allocated two shards with same id to the same node.	2016-05-10 19:40:19 +02:00
Yannick Welsch	7753420540	Make ShardRouting and UnassignedInfo immutable (#17821 ) This makes defensive copying of ShardRouting objects obsolete whenever we do a reroute and trashes less objects.	2016-05-10 19:11:04 +02:00
Jason Tedor	81c0b7bfa9	Log exception when join validation fails Today when join validation fails, we log a warning but do not log the exception that led to the join validation failing. This commit modifies this so that we do log this exception.	2016-05-10 11:56:57 -04:00
Yannick Welsch	ad5ce598db	Use uppercase 'L' for long literal	2016-05-10 17:32:21 +02:00
Gabriel Moskovicz	0660386976	Add plugin information for Verbose mode Relates #18051	2016-05-10 11:23:17 -04:00
Lee Hinman	a45e1cc750	Add test for NullPointerException in SQS when analyzing text produces null query	2016-05-10 08:35:48 -06:00
Lee Hinman	1c54033e92	Merge branch 'pr/18068'	2016-05-10 08:27:43 -06:00
Colin Goodheart-Smithe	319ca82510	Improving parsing of sigma param for Extended Stats Bucket Aggregatio Improving parsing of sigma param for Extended Stats Bucket Aggregation	2016-05-10 15:11:02 +01:00
Alexander Kazakov	93d1b385a4	Improving parsing of sigma param for Extended Stats Bucket Aggregation #17499	2016-05-10 14:54:54 +03:00
Daniel Mitterdorfer	8972f39a9c	Reenable CircuitBreakerServiceIT#testParentChecking()	2016-05-10 12:24:49 +02:00
Colin Goodheart-Smithe	17225d9ac1	Removes the now obsolete SearchParseElement implementations All implementations of SearchParseElement have been removed since they are no longer used now that parsing is done on the coordinating node. The SearchParseElement and FetchSubPhaseParseElement classes are not removed as currently they are needed for plugins that add a custom fetch sub phase. These will be removed in a follow up PR that will allow fetch sub phase plugins to register a parser in a different way.	2016-05-10 10:06:54 +01:00
Adrien Grand	b6556e275a	Remove code duplication in FieldsVisitor. #18218 It currently has twice the same method, once with a MapperService instance and once with a DocumentMapper. This commits only keeps the former.	2016-05-10 08:17:59 +02:00
Adrien Grand	f481492af3	Remove FieldMapper.Builder.indexName. #18219 The ability to configure index names that are different from the full name was removed in 2.0.	2016-05-10 08:17:00 +02:00
Adrien Grand	68e7ac4166	Remove old backward compatibility layer for version lookups. #18215 The current code tries to handle the case that document versions are either missing or stored in payloads rather than doc values. Since none of the 2.x releases allowed this, we can remove this logic.	2016-05-10 08:16:05 +02:00
Adrien Grand	5d8f684319	Mapping cleanups. #18180 This removes dead/duplicate code and makes the `_index` field not configurable. (Configuration used to jus be ignored, now we would throw an exception if any is provided.)	2016-05-10 08:14:18 +02:00
Jason Tedor	eed5b0be4f	Increaes timeout on Netty handshake tests This commit increases the timeout on the Netty handshake tests because the previous value could cause timeout exceptions on slow machines.	2016-05-09 19:02:54 -04:00
Johnny Lim	0970c59608	Use Strings.toString() for toString() in SearchResponse and GetResponse (#18102 )	2016-05-09 16:56:02 -04:00
Jim Ferenczi	aef78ceb13	Do not return fieldstats information for fields that exist in the mapping but not in the index.	2016-05-09 19:24:03 +02:00
Daniel Mitterdorfer	0f36c744d0	Merge remote-tracking branch 'danielmitterdorfer/eager-content-length'	2016-05-09 15:10:01 +02:00
Daniel Mitterdorfer	3a46812a33	Free bytes reserved on request breaker With this commit we free all bytes reserved on the request circuit breaker. Closes #18144	2016-05-09 15:07:41 +02:00
Daniel Mitterdorfer	000dd62318	Determine content length eagerly in HttpServer With this commit we eagerly evaluate content length in HttpServer and also pass the same value to ResourceHandlingHttpChannel. With this change it easier to reason about the content length that is freed leaving no doubt that it must be identical to the reserved amount.	2016-05-09 11:11:59 +02:00
Ryan Ernst	c740feb3a7	Mappings: Fix ip mapper to correctly serialize a null null_value We recently added correct serialization for null values, but the helper method used does not allow null. This fixes serialization to handle the null.	2016-05-06 17:07:24 -07:00
Tal Levy	c1afcb543e	add check for non-existent pipelines provided to simulate requests (#18190 ) fixes #18139	2016-05-06 13:34:22 -07:00
Jason Tedor	d0d2d2be8c	Fix exception message in lifecycle This commit fixes the exception messages for lifecycles when stopping in illegal states. Relates #18189	2016-05-06 16:14:43 -04:00
Ryan Ernst	2c3d3d91cb	Remove unnecessary nebula source or javadoc plugins	2016-05-06 13:09:48 -07:00
Ryan Ernst	3d1be071c9	Merge branch 'master' into pom_gen	2016-05-06 12:56:51 -07:00
Chris Earle	5be79ed02c	Add Failure Details to every NodesResponse Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions. - This adds a helper function to do the grouping to TransportNodesAction - Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor - Reads/writes the array to output - Also adds StreamInput and StreamOutput methods for generically reading and writing arrays	2016-05-06 14:59:43 -04:00
Chris Earle	1d5b9f1ce6	Test was not updated with #18187	2016-05-06 13:17:43 -04:00
Tanguy Leroux	0ff5652fff	Add node name to Cat Recovery closes #8041	2016-05-06 16:59:53 +02:00
Britta Weber	901c25b7ff	Merge pull request #18183 from brwe/exclude-all-but-text-from-wildcard-highlighting Exclude all but string fields from highlighting if wildcards are used…	2016-05-06 16:57:25 +02:00
Britta Weber	f270ed26c3	fix highlighing for old version indices with string fields	2016-05-06 16:27:31 +02:00
Jason Tedor	d52537dc7b	Add semicolon query string parameter delimiter This commit adds support for the semicolon character as a valid query string parameter delimiter. Relates #18186	2016-05-06 09:54:26 -04:00
Jason Tedor	e90d00ffce	Remove handshake from transport client This commit removes handshaking from the transport client. This handshaking is not needed because of the existence of the liveness check. Relates #18174	2016-05-06 09:17:18 -04:00
Nik Everett	b6698c3145	Random script fields can't overlap This causes round tripping through xcontent to fail. Closes #18166	2016-05-06 09:01:09 -04:00
Alexander Reelsen	b12a42351e	Pipeline Stats: Fix concurrent modification exception (#18177 ) Due to trying to modify a map while iterating it, a concurrent modification in the pipeline stats could be thrown. This uses an iterator to prevent this. Closes #18126	2016-05-06 15:00:41 +02:00
Britta Weber	7b69e4ef43	keyword fields should also be highlighted	2016-05-06 14:29:19 +02:00
Britta Weber	d3c5f865be	Exclude all but string fields from highlighting if wildcards are used in fieldname We should prevent highlighting if a field is anything but a text or keyword field. However, someone might implement a custom field type that has text and still want to highlight on that. We cannot know in advance if the highlighter will be able to highlight such a field and so we do the following: If the field is only highlighted because the field matches a wildcard we assume it was a mistake and do not process it. If the field was explicitly given we assume that whoever issued the query knew what they were doing and try to highlight anyway. closes #17537	2016-05-06 13:41:16 +02:00
Adrien Grand	93567a2f1b	Remove StringBuilder reuse for uid creation. #18181 This would be better handled by escape analysis.	2016-05-06 13:16:58 +02:00
Adrien Grand	e88ac11633	Add back Version.V_5_0_0. #18176 This was lost whene releasing alpha2 since the version constant got renamed.	2016-05-06 12:30:22 +02:00
Christoph Büscher	7d14728960	Add xContent shuffling to some more tests This adds some random shuffling of xContent to some more test cases. Relates to #5831	2016-05-06 10:46:39 +02:00
Adrien Grand	b91df36a62	Fix and test handling of `null_value`. #18090 This was mostly untested and had some bugs. Closes #18085	2016-05-06 09:28:23 +02:00
Adrien Grand	de8354dd7f	Allow binary sort values. #17959 The `ip` field uses a binary representation internally. This breaks when rendering sort values in search responses since elasticsearch tries to write a binary byte[] as an utf8 json string. This commit extends the `DocValueFormat` API in order to give fields a chance to choose how to render values. Closes #6077	2016-05-06 09:27:02 +02:00
Adrien Grand	7d8708716e	QueryBuilder does not need generics. #18133 QueryBuilder has generics, but those are never used: all call sites use `QueryBuilder<?>`. Only `AbstractQueryBuilder` needs generics so that the base class can contain a default implementation for setters that returns `this`.	2016-05-06 08:38:20 +02:00
Ryan Ernst	e16af604bf	Build: Add pom generation to assemble task In preparation for a unified release process, we need to be able to generate the pom files independently of trying to actually publish. This change adds back the maven-publish plugin just for that purpose. The nexus plugin still exists for now, so that we do not break snapshots, but that can be removed at a later time once snapshots are happenign through the unified tools. Note I also changed the dir jars are written into so that all our artifacts are under build/distributions.	2016-05-05 17:57:44 -07:00
Jason Tedor	1199cd8e2a	Mark IHBT#testFromAndToXContent as awaits fix This commit marks InnerHitsBuilderTests#testFromAndToXContent as awaiting a fix.	2016-05-05 20:16:57 -04:00
Robert Muir	e3ce6c9048	Painless: add fielddata accessors (.value/.values/.distance()/etc) This gives better coverage and consistency with the scripting APIs, by whitelisting the primary search scripting API classes and using them instead of only Map and List methods. For example, accessing fields can now be done with `.value` instead of `.0` because `getValue()` is whitelisted. For now, access to a document's fields in this way (loads) are fast-pathed in the code, to avoid dynamic overhead. Access to geo fields and geo distance functions is now supported. TODO: date support (e.g. whitelist ReadableDateTime methods as a start) TODO: improve docs (like expressions and groovy have for document's fields) TODO: remove fast-path hack Closes #18169 Squashed commit of the following: commit ec9f24b2424891a7429bb4c0a03f9868cba0a213 Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 17:59:37 2016 -0400 cutover to <Def> instead of <Object> here commit 9edb1550438acd209733bc36f0d2e0aecf190ecb Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 17:03:02 2016 -0400 add fast-path for docvalues field loads commit f8e38c0932fccc0cfa217516130ad61522e59fe5 Author: Robert Muir <rmuir@apache.org> Date: Thu May 5 16:47:31 2016 -0400 Painless: add fielddata accessors (.value/.values/.distance()/etc)	2016-05-05 18:38:41 -04:00
Ali Beyad	c4090a1841	Remove the Snapshot class in favor of using SnapshotInfo o/e/snapshots/Snapshot and o/e/snapshots/SnapshotInfo contain the same fields and represent the same information. Snapshot was used to maintain snapshot information to the snapshot repository, while SnapshotInfo was used to represent the snapshot information as presented through the REST layer. This removes the Snapshot class and combines all uses into the SnapshotInfo class. Closes #18167	2016-05-05 16:53:13 -04:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Jason Tedor	e11b96ca9c	Default to server VM and add client VM check Today we softly warn about running with the client VM. However, we should really refuse to start in production mode if running with the client VM as the performance of the client VM is too devastating for a server application. This commit adds an option to jvm.options to ensure that we are starting with the server VM (on all 32-bit non-Windows platforms on server-class machines (2+ CPUs, 2+ GB physical RAM) this is the default and on all 64-bit platforms this is the only option) and adds a bootstrap check for the client VM. Relates #18155	2016-05-05 10:36:21 -04:00
Jason Tedor	784c9e5fb9	Introduce node handshake This commit introduces a handshake when initiating a light connection. During this handshake, node information, cluster name, and version are received from the target node of the connection. This information can be used to immediately validate that the target node is a member of the same cluster, and used to set the version on the stream. This will allow us to extend APIs that are used during initial cluster recovery without a major version change. Relates #15971	2016-05-04 20:06:47 -04:00
Christoph Büscher	5a0cfdd6af	Change scriptFields member in InnerHitBuilder to set Adding random shuffling of xContent to InnterHitBuilderTests shows that the scriptFields are stored in order as a list internally although they are an unordered json objects in the query dsl. This changes the internal representation to a set and updates serialization accordingly.	2016-05-04 17:43:15 +02:00
Christoph Büscher	223d67df4a	Consolidate query generation in QueryShardContext Currently we have a lot of methods left in QueryShardContext that take parsers or BytesReference arguments to do some xContent parsing on the shard. While this still seems necessary in some cases (e.g. percolation, phrase suggester), the shard context should only be concerned with generating lucene queries from QueryBuilders. This change removes all of the parseX() methods in favour of two public methods toQuery(QueryBuilder) and toFilter(QueryBuilder) that either call the query builders toFilter() or toQuery() method and move all code required for parsing out to the respective callers.	2016-05-04 16:52:30 +02:00
Jim Ferenczi	8d3427b44d	Fix build: restore illegalScorer still in use in ExpressionSearchScript (2)	2016-05-04 16:48:08 +02:00
Jim Ferenczi	c4554fedad	Fix build: restore illegalScorer still in use in ExpressionSearchScript	2016-05-04 16:46:44 +02:00
Jim Ferenczi	52eb6f3830	Merge pull request #18127 from jimferenczi/breadth_first_needs_score Add the ability to use the breadth_first mode with nested aggregations (such as `top_hits`) which require access to score information.	2016-05-04 15:57:08 +02:00
Jason Tedor	78d615f320	Merge pull request #18110 from jasontedor/strings-split-as-array Remove Strings#splitStringToArray Remove arbitrary separator/wildcard from PathTrie	2016-05-04 09:38:47 -04:00
Jim Ferenczi	052191f2a2	Add the ability to use the breadth_first mode with nested aggregations (such as `top_hits`) which require access to score information. The score is recomputed lazily for each document belonging to a top bucket. Relates to #9825	2016-05-04 15:35:45 +02:00
Jason Tedor	9fe5ce9342	Remove arbitrary separator/wildcard from PathTrie PathTrie has a constructor that allows for an arbitrary separtor and wildcard, but this constructor is unused and internally we always use '/' as the separator and '*' as the wildcard. There are no tests for the case where the separator differs from the default separator and wildcard. This commit removes this constructor and now all instances of PathTrie have the default separator and wildcard.	2016-05-04 09:18:30 -04:00
Jason Tedor	2dea449949	Remove Strings#splitStringToArray This commit removes the method Strings#splitStringToArray and replaces the call sites with invocations to String#split. There are only two explanations for the existence of this method. The first is that String#split is slightly tricky in that it accepts a regular expression rather than a character to split on. This means that if s is a string, s.split(".") does not split on the character '.', but rather splits on the regular expression '.' which splits on every character (of course, this is easily fixed by invoking s.split("\\.") instead). The second possible explanation is that (again) String#split accepts a regular expression. This means that there could be a performance concern compared to just splitting on a single character. However, it turns out that String#split has a fast path for the case of splitting on a single character and microbenchmarks show that String#split has 1.5x--2x the throughput of Strings#splitStringToArray. There is a slight behavior difference between Strings#splitStringToArray and String#split: namely, the former would return an empty array in cases when the input string was null or empty but String#split will just NPE at the call site on null and return a one-element array containing the empty string when the input string is empty. There was only one place relying on this behavior and the call site has been modified accordingly.	2016-05-04 08:12:41 -04:00
Isabel Drost-Fromm	283a6d27f1	Clean up merge commit changes	2016-05-04 13:34:11 +02:00
Isabel Drost-Fromm	3bc926b7e0	Merge branch 'master' into enhancement/switch_geodistancesortbuilder_to_geovalidationmethod	2016-05-04 13:16:40 +02:00
Isabel Drost-Fromm	6f15f35819	Remove left over references to ESTestCase	2016-05-04 12:14:39 +02:00
Isabel Drost-Fromm	6b9ac46402	Merge branch 'master' into enhancement/switch_geodistancesortbuilder_to_geovalidationmethod	2016-05-04 11:30:15 +02:00
Isabel Drost-Fromm	ad8bf53bbd	Fold helper class into abstract sort test class. Folds the helper class for random object generation into the abstract sort test class. Removes a few references to ESTestCase that were not needed due to inheriting from it along the way.	2016-05-04 11:25:23 +02:00
Isabel Drost-Fromm	a8bf75983f	Merge branch 'master' into tests/switch_to_random_value_other_than_for_sort	2016-05-04 10:24:46 +02:00
Christoph Büscher	ca21aa0cb5	Make reset() in QueryShardContext private The query shard reset() method resets some internal state in the query shard context, like clearing query names, the filter flag or named queries. The problem with this method being public is that it currently (miss?) used for modifying an existing context for recursive invocatiob, but the contexts that have been reseted that way cannot be properly set back to their previous state. This PR is a step towards removing reset() entirely by first making it only be used internally in QueryShardContext. In places where reset() was used we can either create new QueryShardContexts or modify the existing context because it is discarded afterwards anyway.	2016-05-03 18:56:16 +02:00

... 11 12 13 14 15 ...

6268 Commits