OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	6e58284683	Serialize ignore_throttled also to 6.6 after backport	2018-11-06 13:50:30 +01:00
javanna	17b7d2efcb	[TEST] increase await timeout in RemoteClusterConnectionTests We have seen an improvement when we bumped the timeout from 1s to 5s, but there are still a few failures for this tests. With this commit we bump the timeout to 10 seconds hoping it will stop all the failures.	2018-11-06 13:36:22 +01:00
Jim Ferenczi	999f8f5850	Adapt Lucene BWC version Bump the Lucene version used by ES 6.6 now that the 6.x branch is upgraded to Lucene 7.6.	2018-11-06 12:15:33 +01:00
Nick Knize	a5e1f4d3a2	Upgrade to lucene-8.0.0-snapshot-31d7dfe6b1 (#35224 )	2018-11-06 11:55:23 +01:00
Simon Willnauer	833e0f8ecf	Prevent throttled indices to be searched through wildcards by default (#34354 ) Today if a wildcard, date-math expression or alias expands/resolves to an index that is search-throttled we still search it. This is likely not the desired behavior since it can unexpectedly slow down searches significantly. This change adds a new indices option that allows `search`, `count` and `msearch` to ignore throttled indices by default. Users can force expansion to throttled indices by using `ignore_throttled=true` on the rest request to expand also to throttled indices. Relates to #34352	2018-11-06 09:45:30 +01:00
David Turner	2fb3d1a465	[Zen2] Fix some rarely-failing tests (#35198 ) Recent changes have left a few Zen2 tests occasionally failing. This commit fixes them.	2018-11-05 21:54:53 +00:00
Armin Braun	216c761a5d	MINOR: Remove Dead Code in Routing (#35074 ) * MINOR: Remove Dead Code in Routing	2018-11-05 20:40:27 +01:00
Yannick Welsch	4f35eea8fe	[TEST] Fix testConcurrentTermIncreaseOnReplicaShard This test has a bug that got introduced during the refactoring of #32442. With 2 concurrent term increments, we can only assert under the operation permit that we are in the correct operation term, not that there is not already another term bump pending. Closes #34862	2018-11-05 16:18:20 +01:00
Christoph Büscher	02043a2260	[Tests] Fix rare edge case in SimpleQueryStringBuilderTests (#35201 ) If the random query string is "now" by accident _and_ we are also not setting some field names to use explicitely, then we can hit the "mapped_date" field from default test setup. This correctly leads to the query being was marked as not cacheable, but we assume and check so later. This change fixes this rare edge case by making sure we don't hit the "date" field in this rare cases. Closes #35183	2018-11-05 13:31:13 +01:00
Alexander Reelsen	409050e8de	Refactor: Remove settings from transport action CTOR (#35208 ) As settings are not used in the transport action constructor, this removes the passing of the settings in all the transport actions.	2018-11-05 13:08:18 +01:00
Boaz Leskes	28078642b3	Engine.newChangesSnapshot may cause unneeded refreshes if called concurrently (#35169 ) When the engine is asked for historical operations, we check if some of the requested operations are not yet refreshed and if so we refresh before returning the operations. The refresh check is based on capturing the local checkpoint before each refresh and comparing that value to the one requested when `newChangesSnapshot` was called. If the requested range is above the captured local checkpoint we issue a refresh. This can currently cause unneeded extra refreshes if the method is called concurrently which may cause unwanted degradation in indexing performance. This is especially relevant for CCR where we always ask for a range below the global checkpoint. That range is guaranteed to be below the local checkpoint of the shard and one refresh is enough to serve multiple changes requests. This commit fixes this by introducing a dedicated mutex to make sure the test for whether a refresh is needed actually wait for concurrents for concurrent refreshes that were caused by another change refresh. Note that this is not a big change in semantics as refreshes are serialized by lucene anyway. I also opted not to keep the synchronization to the changes snapshot request only even if in theory we can apply it to all refreshes, not matter where they come from.	2018-11-04 13:43:33 +01:00
Nhat Nguyen	855ab3fa1e	Add equals/hashCode to SeqNoStats (#35223 ) This commit adds equals/hashCode to SeqNoStats so we can verify it wholly in tests.	2018-11-02 21:31:36 -04:00
Jack Conradson	44f08717ba	[Scripting] Make Max Script Length Setting Dynamic (#35184 ) This changes the current script.max_size_in_bytes to be dynamic so it can be set through the cluster settings API. This setting is also applied to inline scripts in the compile method of ScriptService to prevent excessively long inline scripts from being compiled. The script length limit is removed from Painless as this is no longer necessary with the protection in compile.	2018-11-02 16:07:54 -07:00
Tim Brooks	0166388d74	Use single netty event loop group for transports (#35181 ) Currently we create a new netty event loop group for client connections and all server profiles. Each new group creates new threads for io processing. This means 2 * num of processors new threads for each group. A single group should be able to handle all io processing (for the transports). This also brings the netty module inline with what we do for nio. Additionally, this PR renames the worker threads to be the same for netty and nio.	2018-11-02 16:31:19 -06:00
Nhat Nguyen	d6e44129b1	TEST: Only check max_seq_no_of_updates when rollback (#35170 ) Currently, we assume that rollback always happens in the test testRestoreLocalHistoryFromTranslogOnPromotion. However, if the global checkpoint equals max_seq_no, we won't rollback. This causes the max_seq_no_of_updates assertion failed because max_seq_no_of_updates won't be advanced to the global checkpoint. With this commit, we assert max_seq_no_of_updates in two different paths.	2018-11-02 12:27:48 -04:00
Nhat Nguyen	e753e12f61	Do not alloc full buffer for small change requests (#35158 ) Today we always allocate a full buffer (1024 elements) in a LuceneChangesSnapshot even though the requesting size is smaller. With this change, we will use the requesting size as the buffer size if it's smaller than the default batch size; otherwise uses the default batch size.	2018-11-02 08:49:55 -04:00
Daniel Mitterdorfer	ccbe80c3a0	Introduce durability of circuit breaking exception With this commit we differentiate between permanent circuit breaking exceptions (which require intervention from an operator and should not be automatically retried) and transient ones (which may heal themselves eventually and should be retried). Furthermore, the parent circuit breaker will categorize a circuit breaking exception as either transient or permanent based on the categorization of memory usage of its child circuit breakers. Closes #31986 Relates #34460	2018-11-02 13:12:44 +01:00
Colin Goodheart-Smithe	fc6e1f7f3f	Merge branch 'master' into index-lifecycle	2018-11-02 10:56:35 +00:00
Andy Bristol	2a60c24043	[test] mute QueryProfilerIT.testProfileMatchesRegular	2018-11-01 16:59:06 -07:00
Tal Levy	c6c01425bb	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 11:38:42 -07:00
Julie Tibshirani	8fb3290e5c	Fix a bug in function_score queries where we use the wrong boost_mode. (#35148 )	2018-11-01 11:15:26 -07:00
Tal Levy	c3cf7dd305	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-11-01 10:13:02 -07:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Seong-hyun, Oh	9ef4788c13	Make XContentBuilder in AliasActions build `is_write_index` field (#35071 ) Make XContentBuilder in AliasesActions build `is_write_index` field	2018-10-31 14:15:46 -07:00
lipsill	d181d1bab1	Remove deprecated url parameters `_source_include` and `_source_exclude` (#35097 ) Removes `_source_include` and `_source_exclude` url parameters. These parameters have been deprecated in #33475. Closes #22792	2018-10-31 17:11:59 -04:00
Armin Braun	e6f9f0666e	NETWORKING: MockTransportService Wait for Close (#35038 ) * NETWORKING: MockTransportService Wait for Close * Make `MockTransportService` wait `30s` for close listeners to run before failing the assertion * Closes #34990	2018-10-31 21:33:49 +01:00
Andy Bristol	6492eaa84d	[test] mad tests more lenient approximation	2018-10-31 11:48:58 -07:00
Nik Everett	ca620ff4ce	Loggers: Drop last deprecated logger function (#35082 ) Drop the last function from `Loggers` that just wraps Log4j2. Relates to #32174	2018-10-31 14:38:29 -04:00
Tal Levy	d5d28420b6	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-31 10:47:07 -07:00
Luca Cavanna	ef5181c678	Allow to enable pings for specific remote clusters (#34753 ) When we connect to remote clusters, there may be a few more routers/firewalls in-between compared to when we connect to nodes in the same cluster. We've experienced cases where firewalls drop connections completely and keep-alives seem not to be enough, or they are not properly configured. With this commit we allow to enable application-level pings specifically from CCS nodes to the selected remote nodes through the new setting `cluster.remote.${clusterAlias}.transport.ping_schedule`. The new setting is similar `transport.ping_schedule` but it does not affect intra-cluster communication, pings are only sent to specific remote cluster when specifically enabled, as they are disabled by default. Relates to #34405	2018-10-31 17:32:53 +01:00
Armin Braun	3fa67c5d8a	DISCOVERY: Cleanup AbstractDisruptionTestCase (#34808 ) * DISCOVERY: Cleanup AbstractDisruptionTestCase * Make the internal test cluster manage minimum master nodes where we used the default of (nodes / 2 + 1) before * Remove use of the `NodeConfigurationSource` indirection * Relates #33675	2018-10-31 07:52:37 +01:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00
Ryan Ernst	512319cef7	Test: Filter out deprecated joda tzs in tests (#34868 ) This commit filters out usage of deprecated tzs by tests. These are tested separately and should not require checking for warnings on any test using random timezones. closes #34188	2018-10-30 11:15:34 -07:00
Vladimir Dolzhenko	be75b40a29	Fix LineLength Check Suppressions: index.mapper (#35087 ) Relates #34884	2018-10-30 18:00:14 +01:00
Tal Levy	18c72e86c5	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-30 08:09:57 -07:00
Andy Bristol	b8280ea7cc	median absolute deviation agg (#34482 ) This commit adds a new single value metric aggregation that calculates the statistic called median absolute deviation, which is a measure of variability that works on more types of data than standard deviation Our calculation of MAD is approximated using t-digests. In the collect phase, we collect each value visited into a t-digest. In the reduce phase, we merge all value t-digests, then create a t-digest of deviations using the first t-digest's median and centroids	2018-10-30 07:22:52 -07:00
Andrey Ershov	97f74c5a38	Merge branch 'master' into 'zen2' Conflicts during the merge: 1. >=140 chars line length fixed for a lot of project files and warnings for those files are no longer suppressed 2. Node name is removed from AbstractComponent, it’s no longer taken from settings, but is explicitly passed as constructor argument and there were quite a few new classes on zen2 branch that require this change 3. TransportResponseHandler interface changed (new method added) and Zen2 makes a lot of subclasses in tests 4. Deprecated way of obtaining logger was changed	2018-10-30 14:39:48 +03:00
Alan Woodward	c74232037a	Remove Accountable interface from BytesReference (#34900 )	2018-10-30 10:27:31 +00:00
Przemyslaw Gomulka	995bf0ee66	Bulk Api support for global parameters (#34528 ) Bulk Request in High level rest client should be consistent with what is possible in Rest API, therefore should support global parameters. Global parameters are passed in URL in Rest API. Some parameters are mandatory - index, type - and would fail validation if not provided before before the bulk is executed. Optional parameters - routing, pipeline. The usage of these should be consistent across sync/async execution, bulk processor and BulkRequestBuilder closes #26026	2018-10-30 09:08:12 +01:00
Ryan Ernst	5dda2b0c7a	Remove remaining line length violations in o.e.cluster (#34941 ) relates #34923, #34884	2018-10-29 19:45:35 -07:00
Tal Levy	c9e4d26a53	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-29 14:03:55 -07:00
lipsill	6df1c9e818	Deprecate `_source_include` and `_source_exclude` url parameters (#33475 ) Deprecates `_source_include` and `_source_exclude` url parameters in favor of `_source_inclues` and `_source_excludes` because those are consistent with the rest of Elasticsearch's APIs. Relates to #22792	2018-10-29 12:06:38 -04:00
Nik Everett	b093116a1e	Logging: Drop another deprecated Loggers method (#34520 ) Drop a method from `Loggers` that we deprecated because it just delegated to `LogManager`.	2018-10-29 10:05:24 -04:00
Mark Tozzi	329a94be0c	Cleanup suppressed overlength line for action.support package (#34889 ) Clean up lines over 140 characters in the `org.elasticsearch.action.support.*` packages Relates to #34884	2018-10-29 09:22:20 -04:00
Igor Motov	01c62fc06b	Fix line length for bootstrap/client/discovery/gateway files (#34905 ) Removes the checkstyle suppressions for files in org.elasticsearch.bootstrap/client/discovery/gateway packages. Relates to #34884	2018-10-26 18:13:09 -04:00
Jake Landis	11fa8d3744	Enforce 140 char line lengths for packages action.bulk/delete/explain/get/index (#34885 ) part of #34884	2018-10-26 16:14:04 -05:00
Ryan Ernst	f5200e34ad	Remove line length violations for o.e.cluster (mostly) (#34923 ) This commit removes line length violations in most of the classes under org.elasticsearch.cluster.	2018-10-26 13:37:24 -07:00
Tal Levy	d8322ca069	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-26 12:46:21 -07:00
Nik Everett	9f87fdc7ab	Drop deprecationLogger from AbstractComponent (#34859 ) Drops the `deprecationLogger` from `AbstractComponent`, moving it to places where we need it. This saves us from building a bunch of `DeprecationLogger`s that we don't need. Relates to #34488	2018-10-26 15:40:16 -04:00
Nik Everett	10295b306d	Core: Drop nodeName from AbstractComponent (#34487 ) `AbstractComponent` is trouble because its name implies that everything should extend from it. It is useful, but maybe too broadly useful. The things it offers access too, the `Settings` instance for the entire server and a logger are nice to have around, but not really needed everywhere. The `Settings` instance especially adds a fair bit of ceremony to testing without any value. This removes the `nodeName` method from `AbstractComponent` so it is more clear where we actually need the node name.	2018-10-26 15:26:14 -04:00
Armin Braun	64a044240a	MINOR: Remove Deadcode in aggregtions.support (#34323 ) * Removed methods are just unused (the exceptions being isGeoPoint() and is isFloatingPoint() but those could more efficiently be replaced by enum comparisons to simplify the code) * Remove exceptions aren't thrown	2018-10-26 20:57:57 +02:00
Jack Conradson	aefe2909c4	[Style] Remove line length violations from ingest actions (#34886 )	2018-10-26 09:15:35 -07:00
Jay Modi	a0279bc069	Responses can use Writeable.Reader interface (#34655 ) In order to remove Streamable from the codebase, Response objects need to be read using the Writeable.Reader interface which this change enables. This change enables the use of Writeable.Reader by adding the `Action#getResponseReader` method. The default implementation simply uses the existing `newResponse` method and the readFrom method. As responses are migrated to the Writeable.Reader interface, Action classes can be updated to throw an UnsupportedOperationException when `newResponse` is called and override the `getResponseReader` method. Relates #34389	2018-10-26 09:21:54 -06:00
Lee Hinman	af28d1f648	Fix line length for org.elasticsearch.common.* files (#34888 ) This removes the checkstyle suppressions for things in the `common` package. Relates to #34884	2018-10-26 08:47:39 -06:00
Jim Ferenczi	1b879ea8ac	Refactor children aggregator into a generic ParentJoinAggregator (#34845 ) This commit adds a new ParentJoinAggregator that implements a join using global ordinals in a way that can be reused by the `children` and the upcoming `parent` aggregation. This new aggregator is a refactor of the existing ParentToChildrenAggregator with two main changes: * It uses a dense bit array instead of a long array when the aggregation does not have any parent. * It uses a single aggregator per bucket if it is nested under another aggregation. For the latter case we use a `MultiBucketAggregatorWrapper` in the factory in order to ensure that each instance of the aggregator handles a single bucket. This is more inlined with the strategy we use for other aggregations like `terms` aggregation for instance since the number of buckets to handle should be low (thanks to the breadth_first strategy). This change is also required for #34210 which adds the `parent` aggregation in the parent-join module. Relates #34508	2018-10-26 16:26:45 +02:00
Gordon Brown	5c2c1f44c8	[Style] Fix line lengths in action.admin.indices (#34890 ) Clean up lines over 140 characters in the the `org.elasticsearch.action.admin.indices` packages	2018-10-26 08:01:38 -06:00
Armin Braun	db12005674	Fix LineLength Check Suppressions: index.fielddata (#34891 ) * Fix linelength suppressions in index.fielddata * Some lines that were too long were dead code => Removed them and all code that became dead because of it * Relates #34884	2018-10-26 12:56:19 +02:00
David Turner	33345d96ef	Delete flaky SettingsBasedHostProviderIT test (#34813 ) testClusterFormsByScanningPorts is flaky: sometimes in CI it's not possible to bind to any of the ports we need to in order for the port scanning to work. This change removes this test, and #34809 describes a better way to test this behaviour.	2018-10-26 07:52:31 +01:00
Tal Levy	e1fdd00420	Lowercase static final DeprecationLogger instance names (#34887 ) After discussing on the team's FixItFriday, we concluded that static final instance variables that are mutable should be lowercased. Historically, DeprecationLogger was uppercased more frequently than lowercased.	2018-10-25 21:12:19 -07:00
Tal Levy	810cd46a30	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-25 14:35:33 -07:00
Tim Brooks	cf9aff954e	Reduce channels in AbstractSimpleTransportTestCase (#34863 ) This is related to #30876. The AbstractSimpleTransportTestCase initiates many tcp connections. There are normally over 1,000 connections in TIME_WAIT at the end of the test. This is because every test opens at least two different transports that connect to each other with 13 channel connection profiles. This commit modifies the default connection profile used by this test to 6. One connection for each type, except for REG which gets 2 connections.	2018-10-25 13:37:49 -06:00
Lee Hinman	3e7042832a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-25 11:00:36 -06:00
Christophe Bismuth	70871b5af7	Check self references in metric agg after last doc collection (#33593 ) (#34001 ) * Check self references in metric agg after last doc collection (#33593) * Revert 0aff5a30c5dbad9f476be14f34b81e2d1991bb0f (#33593) * Check self refs in metric agg only once in post collection hook (#33593) * Remove unnecessary mocking (#33593)	2018-10-25 17:12:50 +01:00
Tanguy Leroux	3225b2dcd3	Add 6.6.0 version to master (#34847 ) This commit adds the 6.6.0 version constant to the master branch, and adapts the VersionTests.	2018-10-25 17:30:25 +02:00
lipsill	2b652f3242	Logging: server: clean up logging (#34593 ) Replace internal deprecated calls to `Loggers.getLogger(Class)` with direct calls to log4j `LogManager.getLogger(Class)`	2018-10-25 09:52:50 -04:00
lipsill	185c06bb7f	Logging: tests: clean up logging (#34606 ) Replace internal deprecated calls to `Loggers.getLogger(Class)` with direct calls to log4j `LogManager.getLogger(Class)`	2018-10-25 09:52:41 -04:00
Ryan Ernst	687dc1eb11	Scripting: Remove SearchScript (#34730 ) This commit removes the last non context based script class.	2018-10-24 15:03:38 -07:00
Andrey Atapin	5f588180f9	Improve IndexNotFoundException's default error message (#34649 ) This commit adds the index name to the error message when an index is not found.	2018-10-24 12:53:31 -07:00
Stéphane Campinas	04f3e67c77	Remove redundant method from RestClearScrollAction (#34268 ) The check for null argument is already done in `splitStringByCommaToArray`, hence it can be removed, which allows us to remove the whole splitScrollIds private method.	2018-10-24 21:31:29 +02:00
Mayya Sharipova	bf4d90a5dc	HLRC API for _termvectors (#33447 ) * HLRC API for _termvectors relates to #27205	2018-10-24 14:27:22 -04:00
Alpar Torok	795d57b4f9	Auto configure all test tasks (#34666 ) With this change, we apply the common test config automatically to all newly created tasks instead of opting in specifically. For plugin authors using the plugin externally this means that the configuration will be applied to their RandomizedTestingTasks as well. The purpose of the task is to simplify setup and make it easier to change projects that use the `test` task but actually run integration tests to use a task called `integTest` for clarity, but also because we may want to configure and run them differently. E.x. using different levels of concurrency.	2018-10-24 16:05:50 +03:00
Andrey Ershov	7a3cd10718	[Zen2] Change MetaDataStateFormat write semantics (#34709 ) Currently, if MetaDataStateFormat.write throws an IOExceptions if there was some problem with persisting state to disk. If an exception is thrown, loadLatestState may read either old state or new state. This is not enough for the Zen2 algorithm. In case of failure, we need to distinguish between 2 cases: storage is left in clean state or storage is left in a dirty state. If storage is left in the clean state, loadLatestState may read only old state. If storage is left in a dirty state, loadLatestState may read either old or new state. If an exception occurs when writing the manifest file to disk this distinction is important for Zen2. If storage is clean, the node can continue to be a part of the cluster and may try to accept further cluster state updates (if it fails to accept cluster state updates it will be kicked off from the cluster using different mechanism). But if storage is dirty, the node should be restarted and it will be able to start up successfully only once it successfully re-writes manifest file to disk. This commit changes MetaDataStateFormat.write signature, replacing IOException with WriteStateException, which “isDirty” method could be used to distinguish between 2 failure cases. We need to minimise the number of failures, that leave storage in a dirty state. That’s why this PR changes the algorithm that is used to store state to disk. It has the following layout: 1. For the first state location, create and fsync tmp file with state content. 2. For each extra location, copy and fsync tmp file with state content. 2. Atomically rename tmp file in the first location. 3. For each extra location, atomically rename tmp file. 4. For each location, fsync state directory. 5. Perform cleanup of old files, ignoring exceptions. If an exception occurs in steps 1-3, storage is clearly in the clean state. If an exception occurs in step 5, storage is clearly in dirty state. Exception in step 4 is questionable, there are 2 options: 1. Consider it as a failure. If the first disk fails, state disappears. So this is a failure and storage is in a dirty state. 2. Do not consider it as failure at all, ignore disk failures. This commit prefers 1st approach and MetaDataTestFormatTests.testFailRandomlyAndReadAnyState tests for disk failures.	2018-10-24 13:45:12 +03:00
Ryan Ernst	8da1c9626a	Scripting: Add back params._source access in scripted metric aggs (#34777 ) Access to special variables _source and _fields were accidentally removed in recent refactorings. This commit adds them back, along with a test. closes #33884	2018-10-23 18:07:53 -07:00
Gordon Brown	da20dfd81c	Add cluster-wide shard limit warnings (#34021 ) In a future major version, we will be introducing a soft limit on the number of shards in a cluster based on the number of nodes in the cluster. This limit will be configurable, and checked on operations which create or open shards and issue a warning if the operation would take the cluster over the limit. There is an option to enable strict enforcement of the limit, which turns the warnings into errors. In a future release, the option will be removed and strict enforcement will be the default (and only) behavior.	2018-10-23 16:35:10 -06:00
Julie Tibshirani	c5a0739381	Mute SettingsBasedHostProviderIT to avoid future test flakes.	2018-10-23 15:26:39 -07:00
Zachary Tong	299d044bfc	Collapse pipeline aggs into single package (#34658 ) - Restrict visibility of Aggregators and Factories - Move PipelineAggregatorBuilders up a level so it is consistent with AggregatorBuilders - Checkstyle line length fixes for a few classes - Minor odds/ends (swapping to method references, formatting, etc)	2018-10-23 16:01:01 -04:00
Tal Levy	62ac2fa5ec	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-23 09:43:46 -07:00
Jake Landis	89dc07bdd9	ingest: better support for conditionals with simulate?verbose (#34155 ) This commit introduces two corrections to the way simulate?verbose handles conditionals on processors. 1) Prior to this change when executing simulate?verbose for processors with conditionals that evaluate to false, that processor would still be displayed in the result set. What was displayed was correct, such that no changes to the document occurred. However, if the conditional evaluates to false, the processor should not even be displayed. 2) Prior to this change when executing simulate?verbose for pipeline processors with conditionals, the individual steps would no longer be displayed. Commit `e37e5df` addressed the issue, but failed account for a conditional on the pipeline processor. Since a pipeline processor can introduce cycles and is effectively a single processor that encapsulates multiple other processors that are potentially guarded by a single conditional, special handling is needed to for pipeline and conditional pipeline processors.	2018-10-23 11:33:48 -05:00
Zachary Tong	4dbf498721	[Rollup] Job deletion should be invoked on the allocated task (#34574 ) We should delete a job by directly talking to the allocated task and telling it to shutdown. Today we shut down a job via the persistent task framework. This is not ideal because, while the job has been removed from the persistent task CS, the allocated task continues to live until it gets the shutdown message. This means a user can delete a job, immediately delete the rollup index, and then see new documents appear in the just-deleted index. This happens because the indexer in the allocated task is still running and indexes a few more documents before getting the shutdown command. In this PR, the transport action is changed to a TransportTasksAction, and we invoke onCancelled() directly on the matching job. The race condition still exists after this PR (albeit less likely), but this was a precursor to fixing the issue and a self-contained chunk of code. A second PR will followup to fix the race itself.	2018-10-23 12:23:22 -04:00
Albert Zaharovits	11881e7b50	Empty GetAliases authorization fix (#34444 ) This fixes a bug about aliases authorization. That is, a user might see aliases which he is not authorized to see. This manifests when the user is not authorized to see any aliases and the `GetAlias` request is empty which normally is a marking that all aliases are requested. In this case, no aliases should be returned, but due to this bug, all aliases will have been returned.	2018-10-23 18:50:20 +03:00
Christoph Büscher	583f2852f0	[Test] Remove dead code from ExceptionSerializationTests (#34713 ) The `ignore` set contains entries of type Class<?>, but the check is performed on Path objects. This always returns false so is useless currently. Looking at the first commit of this test that already shows this behaviour this never excluded anything, so it can be removed.	2018-10-23 15:44:47 +02:00
Jake Landis	ad94e79350	ingest: processor stats (#34724 ) This change introduces stats per processors. Total, time, failed, current are currently supported. All pipelines will now show all top level processors that belong to it. Failure processors are not displayed, however, the time taken to execute the failure chain is part of the stats for the top level processor. The processor name is the type of the processor, ordered as defined in the pipeline. If a tag for the processor is found, then the tag is appended to the type. Pipeline processors will have the pipeline name appended to the name of the name of the processors (before the tag if one exists). If more then one pipeline is used to process the document, then each pipeline will carry its own stats. The outer most pipeline will also include the inner most pipeline stats. Conditional processors will only included in the stats if the condition evaluates to true.	2018-10-23 07:30:52 -05:00
Igor Motov	123f784e32	Tests: Add checks to GeoDistanceQueryBuilderTests (#34273 ) Adds checks for parsed geo distance query. It is a bit hack-ish since it compares with query's toString() output, but it is better than no checks. The parsed query itself has default visibility, so we cannot access it here unless we move the test to org.apache.lucene.document package. Fixes #34043	2018-10-23 07:55:41 -04:00
Armin Braun	8e155b8430	INGEST: Rename Pipeline Processor Param. (#34733 ) * `name` is more readable/ergnomic than having `pipeline` twice	2018-10-23 13:43:26 +02:00
Alexander Reelsen	83fd93b2fd	Core: Move IndexNameExpressionResolver to java time (#34507 ) This switches from joda time to java time when resolving index names using date math. This commit also removes two non registered settings from the code, which could not be used anyway. An unused method was removed as well. Relates #27330	2018-10-23 13:26:02 +02:00
Alpar Torok	0536635c44	Upgrade forbiddenapis to 2.6 (#33809 ) * Upgrade forbiddenapis to 2.6 Closes #33759 * Switch forbiddenApis back to official plugin * Remove CLI based task * Fix forbiddenApisJava9	2018-10-23 12:06:46 +03:00
Tal Levy	67bfdb16ad	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-22 13:09:37 -07:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Lee Hinman	5dd79bf58c	Make accounting circuit breaker settings dynamic (#34372 ) * Make accounting circuit breaker settings dynamic These missed the original property making them dynamic. This fixes the issue so these can now be set at any time. Resolves #34368	2018-10-22 09:55:00 -06:00
Julie Tibshirani	fbb9ac34f9	Deprecate type exists requests. (#34663 )	2018-10-22 08:46:11 -07:00
Yannick Welsch	6d6ac74a08	Zen2: Fail fast on disconnects (#34503 ) Integrates the failure detectors with the Connection lifecycle, to fail nodes as soon as: - a leader detects one of his followers disconnecting. - a follower detects its leader disconnecting.	2018-10-22 17:20:12 +02:00
Jason Tedor	0577703183	Revert "ingest: processor stats (#34202 )" This reverts commit `6567729600`.	2018-10-21 13:16:15 -04:00
Ryan Ernst	222652dfce	Scripting: Convert script fields to use script context (#34164 ) This commit removes the use of SearchScript for script fields and adds a new FieldScript.	2018-10-20 16:33:49 -07:00
Nhat Nguyen	7ab464807d	TEST: Mute testDedupByPrimaryTerm Should be fixed by #34667	2018-10-20 18:23:02 -04:00
Jake Landis	6567729600	ingest: processor stats (#34202 ) This change introduces stats per processors. Total, time, failed, current are currently supported. All pipelines will now show all top level processors that belong to it. Failure processors are not displayed, however, the time taken to execute the failure chain is part of the stats for the top level processor. The processor name is the type of the processor, ordered as defined in the pipeline. If a tag for the processor is found, then the tag is appended to the type. Pipeline processors will have the pipeline name appended to the name of the name of the processors (before the tag if one exists). If more then one pipeline is used to process the document, then each pipeline will carry its own stats. The outer most pipeline will also include the inner most pipeline stats. Conditional processors will only included in the stats if the condition evaluates to true.	2018-10-20 16:01:01 -05:00
Nhat Nguyen	d90b6730c7	CCR: Following primary should process NoOps once (#34408 ) This is a follow-up for #34288. Relates #34412	2018-10-19 21:10:13 -04:00
Jim Ferenczi	ba87c543c0	[TEST] Fix sporadic failures in CompletionSuggestSearchIT#testTiebreak Relates #34508	2018-10-20 01:05:48 +02:00
David Turner	bfd24fc030	[Zen2] Reconfigure cluster as its membership changes (#34592 ) As master-eligible nodes join or leave the cluster we should give them votes or take them away, in order to maintain the optimal level of fault-tolerance in the system. #33924 introduced the `Reconfigurator` to calculate the optimal configuration of the cluster, and in this change we add the plumbing needed to actually perform the reconfigurations needed as the cluster grows or shrinks.	2018-10-19 19:24:54 +01:00
Nhat Nguyen	bd92a28cfc	CCR: Replicate existing ops with old term on follower (#34412 ) Since #34288, we might hit deadlock if the FollowTask has more fetchers than writers. This can happen in the following scenario: Suppose the leader has two operations [seq#0, seq#1]; the FollowTask has two fetchers and one writer. 1. The FollowTask issues two concurrent fetch requests: {from_seq_no: 0, num_ops:1} and {from_seq_no: 1, num_ops:1} to read seq#0 and seq#1 respectively. 2. The second request which fetches seq#1 completes before, and then it triggers a write request containing only seq#1. 3. The primary of a follower fails after it has replicated seq#1 to replicas. 4. Since the old primary did not respond, the FollowTask issues another write request containing seq#1 (resend the previous write request). 5. The new primary has seq#1 already; thus it won't replicate seq#1 to replicas but will wait for the global checkpoint to advance at least seq#1. The problem is that the FollowTask has only one writer and that writer is waiting for seq#0 which won't be delivered until the writer completed. This PR proposes to replicate existing operations with the old primary term (instead of the current term) on the follower. In particular, when the following primary detects that it has processed an process already, it will look up the term of an existing operation with the same seq_no in the Lucene index, then rewrite that operation with the old term before replicating it to the following replicas. This approach is wait-free but requires soft-deletes on the follower. Relates #34288	2018-10-19 13:56:00 -04:00
Igor Motov	94bde37bcf	Geo: Don't flip longitude of envelopes crossing dateline (#34535 ) When a envelope that crosses the dateline is specified as a part of geo_shape query is parsed it shouldn't have its left and right points flipped. Fixes #34418	2018-10-19 13:53:54 -04:00
Jim Ferenczi	fba5d39bbb	Fix completion suggester's score tie-break (#34508 ) The shard suggestion sort uses a different tie-break than the one that is used to merge different shards responses. The former uses the internal document identifier when scores are the same whereas the latter compares the surface form first. Because of this discrepancy some suggestion outputs are linked to the wrong documents because the merge sort reorders the shard suggestions differently. This change fixes this bug by duplicating the Lucene collector in order to be able to apply the same tiebreak strategy than the merge sort. This logic will be removed when https://issues.apache.org/jira/browse/LUCENE-8529 is fixed. Closes #34378	2018-10-19 19:46:55 +02:00
Nhat Nguyen	90ca5b1fde	Fill LocalCheckpointTracker with Lucene commit (#34474 ) Today we rely on the LocalCheckpointTracker to ensure no duplicate when enabling optimization using max_seq_no_of_updates. The problem is that the LocalCheckpointTracker is not fully reloaded when opening an engine with an out-of-order index commit. Suppose the starting commit has seq#0 and seq#2, then the current LocalCheckpointTracker would return "false" when asking if seq#2 was processed before although seq#2 in the commit. This change scans the existing sequence numbers in the starting commit, then marks these as completed in the LocalCheckpointTracker to ensure the consistent state between LocalCheckpointTracker and Lucene commit.	2018-10-19 12:38:06 -04:00
David Turner	3de266e3cf	Merge branch 'master' into zen2	2018-10-19 14:30:07 +01:00
Colin Goodheart-Smithe	84ef91529c	Merge branch 'master' into index-lifecycle	2018-10-19 13:24:04 +01:00
Christophe Bismuth	3036ab1048	Don't omit default values when updating routing exclusions (#33638 ) Exclusion setting `cluster.routing.allocation.exclude._host` default value is an empty string. When an exclusion setting is sent with a null value the o.e.c.s.Setting#innerGetRaw API return an empty string (probably to avoid a NullPointerException to be raised). The o.e.c.r.a.d.FilterAllocationDecider class is developed to omit updates of default values for exclusion setting. That's why a null exclusion setting value is translated to an empty string which is equals to the exclusion default value which is configured to be ignored. A simple fix would be to not omit default values for exclusion setting and keep the NullPointerException guard. This is the purpose of this commit. Closes #32721	2018-10-19 13:57:41 +02:00
Jim Ferenczi	7b49beb9b0	Fix threshold frequency computation in Suggesters (#34312 ) The `term` and `phrase` suggesters have different options to filter candidates based on their frequencies. The `popular` mode for instance filters candidate terms that occur in less docs than the original term. However when we compute this threshold we use the total term frequency of a term instead of the document frequency. This is not inline with the actual filtering which is always based on the document frequency. This change fixes this discrepancy and clarifies the meaning of the different frequencies in use in the suggesters. It also ensures that the threshold doesn't overflow the maximum allowed value (Integer.MAX_VALUE). Closes #34282	2018-10-19 13:33:19 +02:00
markharwood	fe623acf66	Docs - removed experimental/beta markers from adjacency matrix aggregation (#34599 )	2018-10-19 09:33:59 +01:00
Daniel Mitterdorfer	dbb6fe58fa	Remove hand-coded XContent duplicate checks With this commit we cleanup hand-coded duplicate checks in XContent parsing. They were necessary previously but since we reconfigured the underlying parser in #22073 and #22225, these checks are obsolete and were also ineffective unless an undocumented system property has been set. As we also remove this escape hatch, we can remove the additional checks as well. Closes #22253 Relates #34588	2018-10-19 10:13:13 +02:00
Alexander Reelsen	e498b7d437	Core: Parse floats in epoch millis parser (#34504 ) In order to stay BWC compatible with joda time, the epoch millis date formatter needs to parse dates with a dot like `123.45`. This adds this functionality for the epoch millis parser in the same way as for the epoch seconds parser. It also adds support for scientific notations like `1.0e3` and fixes parsing of negative values for epoch seconds and epoch millis.	2018-10-19 10:02:45 +02:00
Christoph Büscher	4f7895800e	Remove unused methods in ValueType (#34624 ) The removed methods seem unused in the rest of the project.	2018-10-19 09:50:45 +02:00
David Turner	e13ce66a3c	[Zen2] Calculate optimal cluster configuration (#33924 ) We wish to commit a cluster state update after having received a response from more than half of the master-eligible nodes in the cluster. This is optimal: requiring either more or fewer votes than half harms resilience. For instance if we have three master nodes then, we want to be able to commit a cluster state after receiving responses from any two nodes; requiring responses from all three is clearly not resilient to the failure of any node, and if we could commit an update after a response from just one node then that node would be required for every commit, which is also not resilient. However, this means we must adjust the configuration (the set of voting nodes in the cluster) whenever a master-eligible node joins or leaves. The calculation of the best configuration for the cluster is the job of the Reconfigurator, introduced here.	2018-10-18 13:19:27 +01:00
Christoph Büscher	7bcf496315	[Tests] Correct map lookup in ReplicationTrackerTests (#34565 )	2018-10-18 11:23:53 +02:00
Tal Levy	09067c8942	Merge remote-tracking branch 'upstream/master' into index-lifecycle	2018-10-17 15:37:11 -07:00
Ryan Ernst	8734540345	Ensure map keys cannot be self referencing (#34569 ) This commit improves self reference checking to map keys, as well as adds it to ingest script processing.	2018-10-17 15:16:13 -07:00
Jason Tedor	9be87adb95	Increment settings version when upgrading index (#34566 ) When we upgrade an index, we set the settings version upgraded setting. This should be considered a settings change, and therefore we need to increment the settings version. This commit addresses that.	2018-10-17 18:00:17 -04:00
Nik Everett	b6aa42777a	Search: Wrap lucene classes at 140 columns (#34491 ) Applies our line length guidance for all classes in the server in `lucene` directories except `XMoreLikeThis`. The only long line in `XMoreLikeThis` says "remove this when we upgrade to Lucene 5. Given that we're on Lucene 8, this is a little terrifying and deserves another look.	2018-10-17 15:54:35 -04:00
Armin Braun	08d4bf6e84	TESTS: Remove Dead Code in Test Infra. (#34548 ) * None of this infrastructure is used * Some redundant throws and resulting catch code removed	2018-10-17 20:08:39 +01:00
Colin Goodheart-Smithe	90f7cec7a5	Merge branch 'master' into index-lifecycle	2018-10-17 18:22:23 +01:00
Simon Willnauer	b0e98cbce2	Pass the host name on as `server_name` if proxy mode is on (#34559 ) In remote cluster setup if we see a configured proxy we should set the seed nodes host name as the `server_name` to trigger SNI based routing even for seed nodes. Since remote cluster connections are plain TCP connections we have to set the host manually since the other side can't take it from the request URL like in the HTTP case. This also adds some more informative logging to remote cluster connection.	2018-10-17 19:11:50 +02:00
Andrey Ershov	51f38ddc0c	Switch MetaDataStateFormat to Lucene directory abstraction (#33989 ) Switch MetaDataStateFormat to Lucene directory abstraction This commit switches MetaDataStateFormat class to Lucene directory abstraction to make it easier to test MetaDataStateFormat for different IO failures. This commits also adds different IO failures tests to MetaDataStateFormatTests.	2018-10-17 18:17:17 +02:00
Andrey Ershov	93bb24e1f8	Merge branch 'master' into zen2	2018-10-17 14:37:53 +02:00
Armin Braun	3954d041a0	SCRIPTING: Move sort Context to its Own Class (#33717 ) * SCRIPTING: Move sort Context to its own Class	2018-10-17 10:02:44 +01:00
Tal Levy	fbe8dc014c	Merge branch 'master' into index-lifecycle	2018-10-16 13:58:53 -07:00
Simon Willnauer	a93aefb4a4	Assume that rollover datemath tests run on the same day. (#34527 ) in #28741 RolloverIT fails because we are cutting over to the next day while the test executes. We assume that this doesn't happen based on the assertions in the test. This adds a assumeTrue to ensure we are at least 5 min away form a date-flip. Closes #28741	2018-10-16 20:22:32 +02:00
David Turner	303575f742	Fix up merge of master	2018-10-16 15:29:47 +01:00
Armin Braun	ea576a8ca2	Disc: Move AbstractDisruptionTC to filebased D. (#34461 ) * Discovery: Move AbstractDisruptionTestCase to file-based discovery. * Relates #33675 * Simplify away ClusterDiscoveryConfiguration	2018-10-16 15:28:40 +01:00
David Turner	950ca3adda	Merge branch 'master' into zen2	2018-10-16 14:41:14 +01:00
Simon Willnauer	d43a1fac33	Lock down Engine.Searcher (#34363 ) `Engine.Searcher` is non-final today which makes it error prone in the case of wrapping the underlying reader or lucene `IndexSearcher` like we do in `IndexSearcherWrapper`. Yet, there is no subclass of it yet that would be dramatic to just drop on the floor. With the start of development of frozen indices this changed since in #34357 functionality was added to a subclass which would be dropped if a `IndexSearcherWrapper` is installed on an index. This change locks down the `Engine.Searcher` to prevent such a functionality trap.	2018-10-16 14:53:07 +02:00
Martijn van Groningen	a1ec91395c	Changed CCR internal integration tests to use a leader and follower cluster instead of a single cluster (#34344 ) The `AutoFollowTests` needs to restart the clusters between each tests, because it is using auto follow stats in assertions. Auto follow stats are only reset by stopping the elected master node. Extracted the `testGetOperationsBasedOnGlobalSequenceId()` test to its own test, because it just tests the shard changes api. * Renamed AutoFollowTests to AutoFollowIT, because it is an integration test. Renamed ShardChangesIT to IndexFollowingIT, because shard changes it the name of an internal api and isn't a good name for an integration test. * move creation of NodeConfigurationSource to a seperate method * Fixes issues after merge, moved assertSeqNos() and assertSameDocIdsOnShards() methods from ESIntegTestCase to InternalTestCluster, so that ccr tests can use these methods too.	2018-10-16 14:45:46 +02:00
Jason Tedor	05911fb499	Adjust settings version BWC version after backport This commit adjusts the settings version BWC version after backporting the change to the 6.x branch which currently is versioned as 6.5.0.	2018-10-16 06:38:38 -04:00
Jim Ferenczi	544de13d8e	Disallow negative query boost (#34486 ) This change disallows negative query boosts. Negative scores are not allowed in Lucene 8 so it is easier to just disallow negative boosts entirely. We should also deprecate negative boosts in 6x in order to ensure that users are aware when they'll upgrade to ES 7. Relates #33309	2018-10-16 11:31:53 +01:00
Jason Tedor	4b2052c683	Introduce index settings version (#34429 ) This commit introduces settings version to index metadata. This value is monotonically increasing and is updated on settings updates. This will be useful in cross-cluster replication so that we can request settings updates from the leader only when there is a settings update.	2018-10-16 06:22:20 -04:00
Daniel Mitterdorfer	92b2e1a209	Remove lenient boolean handling With this commit we remove some leftovers from #26389 which cleaned up lenient boolean handling. Relates #26389 Relates #22298 Relates #34467	2018-10-16 06:30:00 +02:00
Jason Tedor	55dee53046	Do not update number of replicas on no indices (#34481 ) Today when submitting an update settings request to update the number of replicas with a wildcard that does not match any indices and allow no indices is set to true, the request ends up being interpreted as updating the number of replicas for all indices. That is, consider the following sequence: PUT /test-index { "settings": { "index.number_of_replicas": 0 } } PUT /non-existent-*/_settings?expand_wildcards=open&allow_no_indices=true { "settings": { "index.number_of_replicas": 1 } } GET /test-index/_settings The latter will show that the number of replicas on test-index is now one. This is surprising, and should be considered a bug. The underlying problem here is treating no indices in the underlying methods used to update the routing table and the metadata as meaning all indices. This commit takes away this assumption. Tests that relied on this behavior have been changed to no longer rely on this. A test for this situation is added in UpdateNumberOfReplicasIT.	2018-10-15 19:49:58 -04:00
Nik Everett	23ece922c9	Core: Remove two methods from AbstractComponent (#34336 ) This removes another two methods from `AbstractComponent`. One isn't used at all and another is only used in a single class in watcher. I've moved the method that watcher uses into the single class that uses it.	2018-10-15 16:05:14 -04:00
Nik Everett	a6d1cc6ca9	Revert "Search: Fix spelling mistake in Javadoc (#34480 )" This reverts commit `4e1d7baed0`.	2018-10-15 15:42:11 -04:00
fonxian	4e1d7baed0	Search: Fix spelling mistake in Javadoc (#34480 ) "iff" -> "if".	2018-10-15 15:38:37 -04:00
Ryan Ernst	26f1d7fc94	Tests: Handle epoch date formatters edge cases (#34437 ) This commit handles cases testing withLocale and withZone when the zone and locale in question is the same as the special base case. This can happen sometimes since the locale and zoneids are randomized.	2018-10-15 12:18:18 -07:00
Jim Ferenczi	67577fca56	Fix handling of empty keyword in terms aggregation (#34457 ) Empty values on keyword fields are filtered by the `map` execution mode of the `terms` aggregation. This commit restores them as valid buckets. Closes #34434	2018-10-15 19:33:52 +01:00
Armin Braun	ebca27371c	SCRIPTING: Move Aggregation Script Context to its own class (#33820 ) * SCRIPTING: Move Aggregation Script Context to its own class	2018-10-15 17:28:05 +01:00
Colin Goodheart-Smithe	0b42eda0e3	Merge branch 'master' into index-lifecycle	2018-10-15 16:03:37 +01:00
David Turner	9bb620eece	Mute PartitionedRoutingIT#testShrinking on Windows	2018-10-15 13:18:00 +01:00
Ryan Ernst	72d818c304	Tests: Fix DateFormatter equals tests with locale (#34435 ) This commit removes randomization of locale for DateFormatter equals tests, instead using explicit locales. The test framework already randomizes locales, so the random choice of the second locale can sometimes be equal to the already chosen locale. Randomization also does not provide any extra protection, as the equality of DateFormatter does not implement equality of the locales itself. closes #34337	2018-10-14 23:54:49 +01:00
Yannick Welsch	5fbead00a3	Zen2: Add infrastructure for integration tests (#34365 ) Adds the infrastructure to run integration tests against Zen2.	2018-10-14 20:55:04 +01:00
David Turner	8b9fa55c93	Add storage-layer disruptions to CoordinatorTests (#34347 ) Today we assume the storage layer operates perfectly in CoordinatorTests, which means we are not testing that the system's invariants are preserved if the storage layer fails for some reason. This change injects (rare) storage-layer failures during the safety phase to cover these cases.	2018-10-13 14:24:15 +01:00
David Turner	d98199df14	Extend duration of fixLag() (#34364 ) Today, fixLag() waits for a new cluster state to be committed. However, it does not account for the fact that a term bump may occur, requiring a new election to take place after the cluster state is committed. This change fixes this.	2018-10-11 23:24:08 +01:00
David Turner	a32e303b0c	Account for election duration (#34362 ) Today we may schedule two elections very close together, which can cause the first election to fail even if there are no other nodes. This change adds a delay in between subsequent elections on the same node, effectively allowing time for each election to complete before scheduling the next one.	2018-10-11 15:31:08 +01:00
Jay Modi	6d99d7dafc	ListenableFuture should preserve ThreadContext (#34394 ) ListenableFuture may run a listener on the same thread that called the addListener method or it may execute on another thread after the future has completed. Whenever the ListenableFuture stores the listener for execution later, it should preserve the thread context which is what this change does.	2018-10-11 15:24:38 +01:00
Nhat Nguyen	33791ac27c	CCR: Following primary should process operations once (#34288 ) Today we rewrite the operations from the leader with the term of the following primary because the follower should own its history. The problem is that a newly promoted primary may re-assign its term to operations which were replicated to replicas before by the previous primary. If this happens, some operations with the same seq_no may be assigned different terms. This is not good for the future optimistic locking using a combination of seqno and term. This change ensures that the primary of a follower only processes an operation if that operation was not processed before. The skipped operations are guaranteed to be delivered to replicas via either primary-replica resync or peer-recovery. However, the primary must not acknowledge until the global checkpoint is at least the highest seqno of all skipped ops (i.e., they all have been processed on every replica). Relates #31751 Relates #31113	2018-10-10 15:39:57 -04:00
Simon Willnauer	34b935ae57	Improve `getRestHandlerWrapper` JavaDocs (#34376 ) Questions on how to work with `ActionPlugin#getRestHandlerWrapper()` come up in discuss forums all the time. This change adds an example to the javadoc how this method should/could be used.	2018-10-10 17:28:07 +01:00
David Turner	52a3a19551	Add low-level bootstrap implementation (#34345 ) Today we inject the initial configuration of the cluster (i.e. the set of voting nodes) at startup. In reality we must support injecting the initial configuration after startup too. This commit adds low-level support for doing so as safely as possible.	2018-10-08 15:56:48 +01:00
Yannick Welsch	49cbcaff4f	Allow excluding folder names when scanning for dangling indices (#34349 ) ES is scanning for dangling indices on every cluster state update. For this, it lists the subfolders of the indices directory to determine which extra index directories exist on the node where there's no corresponding index in the cluster state. These are potential targets for dangling index import. On certain machine types, and with large number of indices, this subfolder listing can be horribly slow. This means that every cluster state update will be slowed down by potentially hundreds of milliseconds. One of the reasons for this poor performance is that Files.isDirectory() is a relatively expensive call on some OS and JDK versions. There is no need though to do all these isDirectory calls for folders which we know we are going to discard anyhow in the next step of the dangling indices logic. This commit allows adding an exclusion predicate to the availableIndexFolders methods which can dramatically speed up this method when scanning for dangling indices.	2018-10-08 15:35:50 +02:00
David Turner	ac99d1d66d	Fix bugs in fixLag() (#34346 ) The hack to work around lag detection had some issues: - it always called runFor(), even if no lag was detected - it looked at the last-accepted state not the last-applied state, so missed some lag situations. This fixes these issues.	2018-10-08 11:33:25 +01:00
Nik Everett	06993e0c35	Logging: Make ESLoggerFactory package private (#34199 ) Since all calls to `ESLoggerFactory` outside of the logging package were deprecated, it seemed like it'd simplify things to migrate all of the deprecated calls and declare `ESLoggerFactory` to be package private. This does that.	2018-10-06 09:54:08 -04:00
David Turner	03da4f6c51	Gather votes from all nodes (#34335 ) Today we accept that some nodes may vote for the wrong master in an election. This is mostly fine because they do end up joining the correct master in the end, but the lack of a vote from every follower may prevent a future desirable reconfiguration from taking place. The solution is to hold another election in a yet-higher term in order to collect a complete set of votes. Elections are somewhat disruptive so we should think carefully about when this election should take place. One option is to wait as late as possible (on the grounds that it might not ever be necessary). This unfortunately makes it harder to predict how an apparently-smoothly-running cluster will react to nodes leaving and joining. Instead we prefer to perform the election as soon as possible in the leader's term, adding "votes from all followers" to the invariants that we expect to hold in a stable cluster. The start of a leader's term is already a somewhat disrupted time for the cluster, so performing another election at this point does not materially change the cluster's behaviour. This change implements the logic needed to trigger a new election in order to satisfy this extra stabilisation condition.	2018-10-06 07:22:04 +01:00
Daniel Mitterdorfer	7d826916b9	Adjust size of BigArrays in circuit breaker test With this commit we restore the previous behavior in `BigArraysTests#testMaxSizeExceededOnResize` but lower the sizes that are tested to the range between 256 bytes to 16 kB so the test does not produce a whole lot of garbage. The previous attempt to reduce the amount of garbage produced by that test was to properly size the array initially but it failed to account for object alignment which lead to test failures in some cases. While it would be possible to account for object alignment, we would need to open up BigArrays or directly use the underlying Lucene API which would require us to allocate an array upfront only to find its size (incl. object alignment). Instead we have fixed this issue by conservatively sizing the array initially (so the initial allocation will never trip the circuit breaker) and reduce garbage by reducing the circuit breaker's upper bound as described previously. Closes #33750 Relates #34325	2018-10-05 15:39:08 +02:00
Jim Ferenczi	5c7b52e930	Adapt bwc version after backport Relates #33587	2018-10-05 13:07:39 +02:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
David Turner	29d7d1d503	Minor housekeeping of tests (#34315 ) From experience with #34257, here are a few things that help with analysing logs from test runs. Also we prevent trying to stabilise a cluster with raised delay variability, because lowering the delay variability requires time to allow all the extra-varied-scheduled tasks to work their way out of the system.	2018-10-05 07:57:03 +01:00
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is a beneficial for it is going to also be working once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (also it was renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
Nik Everett	09aaed4fe4	Tasks: Document that status is not semvered (#34270 ) The `status` part of the tasks API reflects the internal status of a running task. In general, we do not make backwards breaking changes to the `status` but because it is internal we reserve the right to do so. I suspect we will very rarely excercise that right but it is important that we have it so we're not boxed into any particular implementation for a request. In some sense this is policy making by documentation change. In another it is clarification of the way we've always thought of this field. I also reflect the documentation change into the Javadoc in a few places. There I acknowledge Kibana's "special relationship" with Elasticsearch. Kibana parses `_reindex`'s `status` field and, because we're friends with those folks, we should talk to them before we make backwards breaking changes to it. We want to be friends with everyone but there is only so much time in the day and we don't want to make backwards breaking fields to `status` at all anyway. So we hope that breaking changes documentation should be enough for other folks. Relates to #34245.	2018-10-04 14:42:37 -04:00
Yannick Welsch	b32abcbd00	Zen2: Add Cluster State Applier (#34257 ) Adds the cluster state applier to Coordinator, and adds tests for cluster state acking.	2018-10-04 20:33:28 +02:00
Vladimir Dolzhenko	dcfe64e0e4	[CI] Fix bogus ScheduleWithFixedDelayTests.testRunnableRunsAtMostOnceAfterCancellation Closes #34004	2018-10-04 16:31:56 +02:00
Armin Braun	3ccfc3de58	SCRIPTING: Terms set query expression (#33856 ) * SCRIPTING: Add Expr. Compile for TermSetQuery Ctx. * Follow up to #33602 adding the ability to compile TermsSetQuery scripts with the expressions engine in the same way we support SearchScript in Expressions * Duplicated the code here for now to make the change less complex, the only difference to SearchScript is that `_score` and `_value` are not handled for TermsSetQuery * remove redundant check	2018-10-04 16:03:57 +02:00
Nik Everett	ab8a5563f2	Logging: Drop remaining Settings log ctor (#34149 ) Drops the last logging constructor that takes `Settings` because it is no longer needed. Watcher goes through a lot of effort to pass `Settings` to `Logger` constructors and dropping `Settings` from all of those calls allowed us to remove quite a bit of log-based ceremony from watcher.	2018-10-04 09:18:04 -04:00
David Turner	c6b0f08472	Add safety phase to CoordinatorTests (#34241 ) Today's CoordinatorTests have a limited amount of randomisation in how things are scheduled. However, to be fully confident in Zen2's liveness we require the system to stabilise after any permitted sequence of events. We can achieve this by running the system in a much more random fashion for a while, with much larger variation in when things are scheduled (simulating GC pressure and network disruption) and then continuing to assert that the system stabilises as we expect. When running randomly, we do not expect to make significant progress and merely verify that no safety property is violated. This change introduces the runRandomly() test method which implements this idea. It also fixes a handful of liveness bugs that this first version of runRandomly() exposed.	2018-10-04 07:40:26 +01:00
Jim Ferenczi	e8b986cc37	Fix sporadic failure in NestedObjectMapperTests Relates #34225	2018-10-04 07:40:46 +02:00
Nhat Nguyen	6dd716b0c4	Replace version with reader cache key in IndicesRequestCache (#34189 ) Today we use the version of a DirectoryReader as a component of the key of IndicesRequestCache. This usage is perfectly fine since the version is advanced every time a new change is made into IndexWriter. In other words, two DirectoryReaders with the same version should have the same content. However, this invariant is only guaranteed in the context of a single IndexWriter because the version is reset to the committed version value when IndexWriter is re-opened. Since #33473, each IndexShard may have more than one IndexWriter, and using the version of a DirectoryReader as a part of the cache key can cause IndicesRequestCache to return stale cached values. For example, in #27650, we rollback the engine (i.e., re-open IndexWriter), index new documents, refresh, then make a count request, but the search layer mistakenly returns the count of the DirectoryReader of the previous IndexWriter because the current DirectoryReader has the same version of the old DirectoryReader even their documents are different. This is possible because these two readers come from different IndexWriters. This commit replaces the the version with the reader cache key of IndexReader as a component of the cache key of IndicesRequestCache. Closes #27650 Relates #33473	2018-10-03 21:03:24 -04:00
David Turner	cbe1cf98c6	Merge branch 'master' into zen2	2018-10-03 22:12:56 +01:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Jim Ferenczi	ee21067a41	Add early termination support for min/max aggregations (#33375 ) This commit adds the support to early terminate the collection of a leaf in the min/max aggregator. If the query matches all documents the min and max value for a numeric field can be retrieved efficiently in the points reader. This change applies this optimization when possible.	2018-10-03 18:33:39 +02:00
Lee Hinman	90c55f5e36	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-03 09:11:28 -06:00
albendz	f09190c14d	Require combine and reduce scripts in scripted metrics aggregation (#33452 ) * Make text message not required in constructor for slack * Remove unnecessary comments in test file * Throw exception when reduce or combine is not provided; update tests * Update integration tests for scripted metrics to always include reduce and combine * Remove some old changes from previous branches * Rearrange script presence checks to be earlier in build * Change null check order in script builder for aggregated metrics; correct test scripts in IT * Add breaking change details to PR	2018-10-03 15:22:01 +01:00
Jim Ferenczi	41528c0813	Adapt bwc version after backport (bis) Relates #34225	2018-10-03 14:24:01 +02:00
Jim Ferenczi	1aa8e72be7	Adapt bwc version after backport Relates #34225	2018-10-03 12:24:07 +02:00
Jim Ferenczi	5a3e031831	Preserve the order of nested documents in the Lucene index (#34225 ) Today we reverse the initial order of the nested documents when we index them in order to ensure that parents documents appear after their children. This means that a query will always match nested documents in the reverse order of their offsets in the source document. Reversing all documents is not needed so this change ensures that parents documents appear after their children without modifying the initial order in each nested level. This allows to match children in the order of their appearance in the source document which is a requirement to efficiently implement #33587. Old indices created before this change will continue to reverse the order of nested documents to ensure backwark compatibility.	2018-10-03 11:55:30 +02:00
Colin Goodheart-Smithe	2d64e3db9a	Adds trace logging to IndicesRequestCache (#34180 ) * Adds trace logging to IndicesRequestCache This change adds trace level logging to `IndicesrrequestCache` witht eh primary aim of helping to identify the cause of teh failures in https://github.com/elastic/elasticsearch/issues/32827. The cache will log at trace level when a cache hit or miss occurs including the reader version and the cache key. Note that this change adds a `cacheKeyRenderer` whcih supplies a human readable String of the cache key since the actual cache key itself is a `BytesReference` containing the wire protocol serialised form of the request. Logging is also added for the case where a search timeout occurs and fr that reason the cache entry is invalidated. * Adds comment to remaind us to remove cacheKeyRenderer	2018-10-03 08:58:33 +01:00
David Turner	a9eae1d068	Merge branch 'master' into zen2	2018-10-03 08:36:34 +01:00
Gordon Brown	fb907706ec	Merge branch 'master' into index-lifecycle	2018-10-02 13:43:46 -06:00
Dimitrios Liappis	f12e0a8398	Add ES version 6.4.3 (#34239 ) Version bump	2018-10-02 21:15:58 +03:00
David Turner	a7ce4b31ed	Fix logging of cluster state update descriptions (#34182 ) In #28941 we changed the computation of cluster state task descriptions but this introduced a bug in which we only log the empty descriptions (rather than the non-empty ones). This change fixes that.	2018-10-02 19:08:19 +01:00
Christoph Büscher	5183ea3d68	Use OptionalInt instead of Optional<Integer> (#34220 ) Optionals containing boxed primitive types are prohibitively costly because they have two level of boxing. For Optional<Integer> the analogous OptionalInt can be used to avoid the boxing of the contained int value.	2018-10-02 15:58:07 +02:00
Jim Ferenczi	ead6ffce54	Fix cross fields mode of the query_string query (#34216 ) This change fixes a bug in the cross fields mode of the `query_string` query. The multi fields query builder must be reseted before parsing in order to clear the list of expanded fields coming from the previous text block. Closes #34215	2018-10-02 14:53:26 +02:00
Przemyslaw Gomulka	3f8cc89c9f	Completion types with multi-fields support (#34081 ) Mappings with completion type and multi-fields, were not able to index array or object format on completion fields. Only string format was supported. This is fixed by providing multiField parser with externalValueContext with already parsed object closes #15115	2018-10-02 14:32:56 +02:00
Alexander Reelsen	b1b0f3276b	Core: Add methods to get locale/timezone in DateFormatter (#34113 ) This adds some method into the `DateFormatter` interface, namely * `withLocale()` to change the locale of a date formatter * `getLocale()` * `getZone()` * `hashCode()` * `equals()` These methods will be needed for aggregations and mapping changes, where zones and locales can be specified in the mapping or in search/aggs parts of a search request.	2018-10-02 14:13:30 +02:00
David Turner	a127805b4a	[Zen2] Simulate scheduling delays (#34181 ) Today we schedule tasks (both immediate and future ones) exactly when requested. In fact it is more realistic to allow for a small amount of delay in the scheduling of tasks, and this helps to exercise more interleavings of actions and therefore to improve test coverage. This change adds to the DeterministicTaskQueue the ability to add a random delay to the scheduling of tasks. This change also provides more explicit timeouts for stabilisation in the CoordinatorTests. Using the randomised scheduling feature in the CoordinatorTests also found a situation in which we could become a leader, then a candidate, and then a leader again very quickly, causing a clash of the _BECOME_MASTER_ and _FINISH_ELECTION_ tasks. We change their behaviour to not consider these duplicates to be problematic.	2018-10-02 11:22:05 +01:00
Jim Ferenczi	aba4a59d0d	Handle terms query when detecting if a query can match nested docs (#34072 ) When nested objects are present in the mappings, we add a filter in queries to exclude them if there is no evidence that the query cannot match in this space. In 6x we visit the query in order to find a mandatory clause that can match root documents only. If we find one we can omit the nested documents filter. Currently only `term` and `range` queries are checked, this change adds the support for `terms` query to effectively remove the nested filter if a mandatory `terms` clause targets a non-nested field. Closes #34067	2018-10-02 09:30:23 +02:00
David Turner	2aff005a69	Clean up TransportMasterNodeAction (#34076 ) Mainly this fixes a warning by replacing the unchecked `new ActionListener` with the checked `new ActionListener<Response>`, and it also fixes the line length violations in this class.	2018-10-02 03:17:55 +01:00
Lee Hinman	2d9cb21490	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-01 14:10:09 -06:00
Christophe Bismuth	2923fb5b31	Disallow "enabled" attribute change for types in mapping update (#33933 ) This commit adds a check for "enabled" attribute change for types when a RestPutMappingAction is received. A MappingException is thrown when such a change is detected. Change are prevented in both ways: "false -> true" and "true -> false". Closes #33566	2018-10-01 20:49:08 +02:00
Vladimir Dolzhenko	2e2ae19b97	drop elasticsearch-translog for 7.0 (#33373 ) #32281 adds elasticsearch-shard to provide bwc version of elasticsearch-translog for 6.x; have to remove elasticsearch-translog for 7.0 Relates to #31389	2018-10-01 16:21:14 +02:00
Christoph Büscher	17e6932bf3	[Tests] Rename DocumentMapperMergeTests (#34121 ) Renaming to simply DocumentMapperTests to indicate this is where other unit tests should go. Also removing outdates Todo in DocumentMapperParserTests.	2018-10-01 10:29:19 +02:00
Jason Tedor	e2bd2028d8	Allow specifying shard changes batch sizes in bytes (#34168 ) This commit changes the shard changes requests from using a raw byte value to being able to be specified using bytes units (e.g., 4mb).	2018-09-30 14:22:22 -04:00
Martijn van Groningen	b1a27b2e6b	[CCR] Add unfollow API (#34132 ) The unfollow API changes a follower index into a regular index, so that it will accept write requests from clients. For the unfollow api to work the index follow needs to be stopped and the index needs to be closed. Closes #33931	2018-09-30 19:19:34 +02:00
Nhat Nguyen	ad61398879	CCR: Optimize indexing ops using seq_no on followers (#34099 ) This change introduces the indexing optimization using sequence numbers in the FollowingEngine. This optimization uses the max_seq_no_updates which is tracked on the primary of the leader and replicated to replicas and followers. Relates #33656	2018-09-28 20:42:26 -04:00
Ryan Ernst	47cbae9b26	Scripting: Remove ExecutableScript (#34154 ) This commit removes the legacy ExecutableScript, which was no longer used except in tests. All uses have previously been converted to script contexts.	2018-09-28 17:13:08 -07:00
Lee Hinman	6ea396a476	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-28 15:40:12 -06:00
Yannick Welsch	412face402	Move NodeRemovalClusterStateTaskExecutor out of ZenDiscovery (#34147 ) Allows this class to be cleanly shared between Zen1 and Zen2. Follow-up to #33917	2018-09-28 23:12:59 +02:00
Armin Braun	76dd3948f3	TESTS: Relax Assertion About Deleting Shard Dir (#34120 ) * TESTS: Relax Assertion About Deleting Shard Dir * Allow empty state directory to prevent test from failing * Closes #32686	2018-09-28 19:09:49 +02:00
Ryan Ernst	95977f4db9	Scripting: Add watcher script contexts (#34059 ) This commit removes the use of ExecutableScript from watcher in favor of custom script contexts for both watcher condition scripts and transform scripts.	2018-09-28 07:58:17 -07:00
Hendrik Muhs	e2f310b56c	Fix AggregationFactories.Builder equality and hash regarding order (#34005 ) Fixes the equals and hash function to ignore the order of aggregations to ensure equality after serialization and deserialization. This ensures storing configs with aggregation works properly. This also addresses a potential issue in caching when the same query contains aggregations but in different order. 1st it will not hit in the cache, 2nd cache objects which shall be equal might end up twice in the cache.	2018-09-28 13:30:50 +02:00
David Turner	980cfc69d6	Integrate FollowerChecker with Coordinator (#34075 ) This change ensures that the leader node periodically checks that its followers are healthy, and that they are removed from the cluster if not.	2018-09-28 12:29:34 +01:00
Armin Braun	c4b831645c	MINOR: Remove some deadcode in NodeEnv and Related (#34133 )	2018-09-28 12:40:20 +02:00
Alexander Reelsen	bc7d69f74a	Core: Don't rely on java time for epoch seconds formatting (#34086 ) In order to be compatible with joda time, this adds an epoch seconds formatter, that is able to parse floating point values. However joda time discards the floating point values, but still parses the data, where as this one is able to parse the whole value including milliseconds.	2018-09-28 10:53:33 +02:00
Alan Woodward	f243d75f59	Remove special-casing of Synonym filters in AnalysisRegistry (#34034 ) The synonym filters no longer need access to the AnalysisRegistry in their constructors, so we can remove the special-case code and move them to the common analysis module. This commit means that synonyms are no longer available for `server` integration tests, so several of these are either rewritten or migrated to the common analysis module as rest-spec-api tests	2018-09-28 09:02:47 +01:00
Julie Tibshirani	9cd4f70a67	Support 'string'-style queries on metadata fields when reasonable. (#34089 ) * Make sure 'ignored' and 'routing' field types inherit from StringFieldType. * Add tests for prefix and regexp queries. * Support prefix and regexp queries on _index fields.	2018-09-27 20:59:03 -07:00
Ryan Ernst	a2c941806b	Tests: Add support for custom contexts to mock scripts (#34100 ) This commit adds the ability to plug in compilation of custom contexts in mock script engine. This is needed for testing plugins which add custom contexts like watcher.	2018-09-27 12:23:59 -07:00
Jake Landis	73ee721b29	ingest: correctly measure chained pipeline stats (#33912 ) Prior to this change when a pipeline processor called another pipeline, only the stats for the first processor were recorded. The stats for the subsequent pipelines were ignored. This change properly accounts for pipelines irregardless if they are the first or subsequently called pipelines. This change moves the state of the stats from the IngestService to the pipeline itself. Cluster updates are safe since the pipelines map is atomically swapped, and if a cluster update happens while iterating over stats (now read directly from the pipeline) a slightly stale view of stats may be shown.	2018-09-27 13:54:26 -05:00
Lee Hinman	a26cc1a242	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-27 11:00:37 -06:00
Jason Tedor	899a7c7d99	Fix remote cluster seeds fallback (#34090 ) Recently we introduced the settings cluster.remote to take the place of search.remote for configuring remote cluster connections. We made this change due to the fact that we have generalized the remote cluster infrastructure to also be used within cross-cluster replication and not only cross-cluster search. For backwards compatibility, when we made this change, we allowed that cluster.remote would fallback to search.remote. Alas, the initial change for this contained a bug for handling the proxy and seeds settings. The bug for the seeds settings arose because we were manually iterating over the concrete settings only for cluster.remote seeds but not for search.remote seeds. This commit addresses this by iterating over both cluster.remote seeds and search.remote seeds. Additionally, when checking for existence of proxy settings, we have to not only check cluster.remote proxy settings, but also fallback to search.remote proxy settings. This commit addresses both issues, and adds tests for these situations.	2018-09-27 09:47:51 -04:00
Jim Ferenczi	269ae0bc15	Handle MatchNoDocsQuery in span query wrappers (#34106 ) * Handle MatchNoDocsQuery in span query wrappers This change adds a new SpanMatchNoDocsQuery query that replaces MatchNoDocsQuery in the span query wrappers. The `wildcard` query now returns MatchNoDocsQuery if the target field is not in the mapping (#34093) so we need the equivalent span query in order to be able to pass it to other span wrappers. Closes #34105	2018-09-27 14:19:08 +02:00
Christoph Büscher	cb4cdf17f0	Update MovAvgIT AwaitsFix bug url	2018-09-27 11:11:21 +02:00
Simon Willnauer	bda7bc145b	Fold EngineSearcher into Engine.Searcher (#34082 ) EngineSearcher can be easily folded into Engine.Searcher which removes a level of inheritance that is necessary for most of it's subclasses. This change folds it into Engine.Searcher and removes the dependency on ReferenceManager.	2018-09-27 09:06:04 +02:00
Armin Braun	acd80a1e07	TESTS: Enable DEBUG Logging in Flaky Test (#34091 ) * This should surface what errors are thrown on CI and in org.elasticsearch.transport.RemoteClusterConnection.ConnectHandler#collectRemoteNodes (the sequence of caught error in the last catch block and moving on to the next seed node seems to be the only path by which the errors logged in #33756 could come about) * Relates #33756	2018-09-27 06:02:24 +02:00
Nhat Nguyen	ea9b33527e	TEST: Add engine is closed as expected failure msg This commit adds "engine is closed" as an expected failure message. This change is due to #33967 in which we might access a closed engine on promotion. Relates #33967	2018-09-26 22:38:55 -04:00
Nhat Nguyen	12d94e44b8	Adjust bwc version for max_seq_no_of_updates Relates #33967 Relates #33842	2018-09-26 22:12:19 -04:00
Simon Willnauer	ae8e54493d	Build DocStats from SegmentInfos in ReadOnlyEngine (#34079 ) This change is related to #33903 that ports the DocStats simplification to the master branch. This change builds the docStats in the ReadOnlyEngine from the last committed segment infos rather than the reader. Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2018-09-27 00:16:17 +02:00
Julie Tibshirani	1d08f63eff	When creating wildcard queries, use MatchNoDocsQuery when the field type doesn't exist. (#34093 )	2018-09-26 15:08:35 -07:00
Simon Willnauer	2b730d1b9d	Mute MovAvgIT#testHoltWintersNotEnoughData Relates to #34098	2018-09-26 23:50:31 +02:00
Mayya Sharipova	80c5d30f30	XContentBuilder to handle BigInteger and BigDecimal (#32888 ) Although we allow to index BigInteger and BigDecimal into a keyword field, source filtering on these fields would fail as XContentBuilder was not able to deserialize BigInteger and BigDecimal to json. This modifies XContentBuilder to allow to handle BigInteger and BigDecimal. Closes #32395	2018-09-26 14:24:31 -04:00
Julie Tibshirani	de8bfb908f	Delegate wildcard query creation to MappedFieldType. (#34062 ) * Delegate wildcard query creation to MappedFieldType. * Disallow wildcard queries on collation fields. * Disallow wildcard queries on non-string fields.	2018-09-26 09:36:41 -07:00
Nik Everett	ddce9704d4	Logging: Drop two deprecated methods (#34055 ) This drops two deprecated methods from `ESLoggerFactory`, switching all calls to those methods to calls to methods of the same name on `LogManager`.	2018-09-26 11:20:52 -04:00
Ryan Ernst	7800b4fa91	Core: Abstract DateMathParser in an interface (#33905 ) This commits creates a DateMathParser interface, which is already implemented for both joda and java time. While currently the java time DateMathParser is not used, this change will allow a followup which will create a DateMathParser from a DateFormatter, so the caller does not need to know the internals of the DateFormatter they have.	2018-09-26 07:56:25 -07:00
Zachary Tong	25d74bd0cb	Prefer mapped aggs to lead reductions (#33528 ) Previously, unmapped aggs try to delegate reduction to a sibling agg that is mapped. That delegated agg will run the reductions, and also reduce any pipeline aggs. But because delegation comes before running pipelines, the unmapped agg _also_ tries to run pipeline aggs. This causes the pipeline to run twice, and potentially double it's output in buckets which can create invalid JSON (e.g. same key multiple times) and break when converting to maps. This fixes by sorting the list of aggregations ahead of time so that mapped aggs appear first, meaning they preferentially lead the reduction. If all aggs are unmapped, the first unmapped agg simply creates a new unmapped object and returns that for the reduction. This means that unmapped aggs no longer defer and there is no chance for a secondary execution of pipelines (or other side effects caused by deferring execution). Closes #33514	2018-09-26 10:09:31 -04:00
Nik Everett	1871e7f7e9	Search: Simply SingleFieldsVisitor (#34052 ) `SingleFieldsVisitor` is meant to load a single stored field but it manages to be quite complex to reason about because it inherits from our "basic" `FieldsVisitor` which is designed to load many fields. This breaks that inheritance and adds logic to `SingleFieldsVisitor` so it can be properly stand alone. While this amounts to more lines of code they ought to be significantly easier to reason about.	2018-09-26 09:48:15 -04:00
David Roberts	1413ace74f	Mute testSplitFromOneToN and testCreateShrinkIndexToN on Windows Relates #34080	2018-09-26 14:02:14 +01:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases were we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
David Turner	d995fc85c6	Integrate LeaderChecker with Coordinator (#34049 ) This change ensures that follower nodes periodically check that their leader is healthy, and that they elect a new leader if not.	2018-09-26 12:18:13 +01:00
Jim Ferenczi	a255880497	Add nested and object fields to field capabilities response (#33803 ) This commit adds nested and object fields to the field capabilities response. Closes #33237	2018-09-26 08:59:41 +02:00
Ryan Ernst	be8475955e	Scripting: Use ParameterMap for deprecated ctx var in update scripts (#34065 ) This commit removes the sysprop controlling whether ctx is in params for update scripts and replaces it with use of the new ParameterMap, which outputs a deprecation warning whenever params.ctx is used.	2018-09-25 22:08:02 -07:00
Nhat Nguyen	8a56369f5b	Move max_unsafe_auto_id_timestamp constant to Engine (#34025 ) We should not access InternalEngine in other classes.	2018-09-25 19:20:00 -04:00
Jim Ferenczi	0f878eff19	Add a limit for graph phrase query expansion (#34031 ) Today query parsers throw TooManyClauses exception when a query creates too many clauses. However graph phrase queries do not respect this limit. This change adds a protection against crazy expansions that can happen when building a graph phrase query. This is a temporary copy of the fix available in https://issues.apache.org/jira/browse/LUCENE-8479 but not merged yet. This logic will be removed when we integrate the Lucene patch in a future release.	2018-09-25 21:38:47 +02:00
Igor Motov	1e6780d703	Mute AckClusterUpdateSettingsIT Tracked by #33673	2018-09-25 14:16:47 -04:00
Armin Braun	0ba1855740	INGEST: Tests for Drop Processor (#33430 ) * INGEST: Tests for Drop Processor * UT for behavior of dropped callback and drop processor * Moved drop processor to `server` project to enable this test * Simple IT * Relates #32278	2018-09-25 19:29:22 +02:00
Christoph Büscher	ecc087a5bb	Remove Join utility class (#34037 ) The functionality can be replaces with String.join in new Java versions.	2018-09-25 15:25:54 +02:00
David Turner	f886eebd99	Fix CoordinatorTests some more (#34039 ) Today the `CoordinatorTests` are not completely reliable. These changes make them more so, by removing a couple of assertions that we do not expect to pass (yet).	2018-09-25 14:04:22 +01:00
David Turner	7c63f5455b	Use a threadsafe map in SearchAsyncActionTests (#33700 ) Today `SearchAsyncActionTests#testFanOutAndCollect` uses a simple `HashMap` for the `nodeToContextMap` variable, which is then accessed from multiple threads without, apparently, explicit synchronisation. This provides an explanation for the test failure identified in #29242 in which `.toString()` returns `"[]"` just before `.isEmpty` returns `false`, without any concurrent modifications. This change converts `nodeToContextMap` to a `newConcurrentMap()` so that this cannot occur. It also fixes a race condition in the detection of double-calling the subsequent search phase. Closes #29242.	2018-09-25 13:58:05 +01:00
Nhat Nguyen	5166dd0a4c	Replicate max seq_no of updates to replicas (#33967 ) We start tracking max seq_no_of_updates on the primary in #33842. This commit replicates that value from a primary to its replicas in replication requests or the translog phase of peer-recovery. With this change, we guarantee that the value of max seq_no_of_updates on a replica when any index/delete operation is performed at least the max_seq_no_of_updates on the primary when that operation was executed. Relates #33656	2018-09-25 08:07:57 -04:00
Luca Cavanna	970407c663	[DOCS] add comment to clarify cluster name resolution (#34014 ) We currently fallback to local indices whenever a remote cluster is not found, as there may still be indices / aliases with the same name. Such behaviour is lenient but needs to be kept for backwards compatibility. Clarified that in the code so we don't forget. Relates to #26247	2018-09-25 14:03:07 +02:00
Adrien Grand	612201aee0	Fix created version for similarity validation. (#33890 ) It mistakenly uses the Elasticsearch major version instead of the Lucene major version. I noticed it when backporting, it is not noticeable on master because the only two Lucene versions that are supported, 7 and 8, encode norms the same way, unlike Lucene 6.	2018-09-25 13:48:25 +02:00
Yannick Welsch	679fb698d0	Zen2: Trigger join when active master detected (#34008 ) Triggers a join when an active master is detected. In order to avoid spamming joins, deduplicates join request based on <target, join> pair. This ensures that a new join is sent whenever the term is incremented or when a new master is found. Also changes the logging of join failures from DEBUG to INFO. These join failures should be happening rarely, and can either indicate a failed election (which should be rare) or a configuration issue.	2018-09-25 09:44:35 +02:00
David Turner	1d47c9582b	Fix CoordinatorTests (#34002 ) Today the CoordinatorTests are not very reliable if two elections are scheduled concurrently. Although we expect occasional failures due to this, in fact the failures are much more common than expected due to a handful of issues. This PR fixes these issues.	2018-09-25 08:43:47 +01:00
Hendrik Muhs	bf6cf6b6d9	refactor CompositeValuesSourceParserHelper for reusage by making it public (#33945 ) refactor CompositeValuesSourceParserHelper for reusage by making it public and moving toXContent into it	2018-09-25 09:15:52 +02:00
David Turner	3af8fc74c7	Make TransportService more test-friendly (#33869 ) Today, TransportService uses System.currentTimeMillis() to get the current time to report on things like timeouts, and enqueues lambdas for future execution. However, in tests it is useful to be able to fake out the current time and to see what all these enqueued lambdas are really for. This change alters the situation so that we can obtain the time from the more easily-faked ThreadPool#relativeTimeInMillis(), and implements some friendlier toString() methods on the various Runnables so we can see what they are later.	2018-09-25 07:50:18 +01:00
David Turner	02b483c372	Logging improvements in CoordinatorTests (#33991 ) Today, we know that CoordinatorTests sometimes fail to stabilise due to an election collision. This change improves the logging that occurs when an election collision occurs so it will be easier to see if this is happening when analysing a test failure. We also wrap the call to masterService.submitStateUpdateTask() in a context that logs the node on which it runs. We also introduce the InitialJoinAccumulator instead of using a placeholder CandidateJoinAccumulator at startup, which reduces the cases to consider in CandidateJoinAccumulator.close() and tightens up the assertions we can make here.	2018-09-24 20:07:32 +01:00
Lee Hinman	243e863f6e	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-24 10:33:51 -06:00
Armin Braun	25bc8c4b5a	Fix typo `NodeEnvironment#assertPathsDoNotExist` (#33996 ) * We want to check the individual paths here one by one to get a better to interpret assertion message	2018-09-24 17:57:27 +02:00
Julie Tibshirani	8e8bd56cc7	In MatchQuery, remove a check for fragile search analyzers. (#33927 ) As far as I can tell this guard against fragile analyzers is no longer relevant, since we stopped setting special analyzers on numeric fields (3bf6f4). Instead of removing the guard completely, I opted to keep a check for untokenized + unnormalized fields to avoid going through the analysis process unnecessarily. My motivation for simplifying this check is that I'd like to add support for `split_queries_on_whitespace` to the new 'queryable object' fields. As it stands, I would have to add a dedicated instanceof check for the new mapper, which is not optimal.	2018-09-24 08:56:13 -07:00
Yannick Welsch	2e774e146d	Zen2: Update PeerFinder term on term bump (#33992 ) Ensures that the PeerFinder always uses the correct term.	2018-09-24 17:47:15 +02:00
Tim Brooks	78e483e8d8	Introduce abstract security transport testcase (#33878 ) This commit introduces an AbstractSimpleSecurityTransportTestCase for security transports. This classes provides transport tests that are specific for security transports. Additionally, it fixes the tests referenced in #33285.	2018-09-24 09:44:44 -06:00
Ignacio Vera	df333ca305	TESTS: Make score Float#NaN when there is no max score (#33997 ) * TESTS: Make score Float#NaN when there is no max score Fixes test failure due to maxScore set to Float#MinValue instead on Float#NaN. In addition the initial value for maxScore is set to Float#NEGATIVE_INFINITY so it is an illegal value. Closes #33993	2018-09-24 17:36:48 +02:00
Luca Cavanna	e389d9e296	Clarify RemoteClusterService#groupIndices behaviour (#33899 ) When executing a cross-cluster search, we need to search against all local indices (and no remote indices) in case no indices are specified. Also, if only remote indices are specified, no local indices will be queried. We previously added empty local indices whenever they were not present in the map of the grouped indices, then we would act differently later based on the extracted remote indices. Instead, we now add the empty array for local indices only in case we need to search all local indices; the entry for local indices is not added when local indices should not be searched. This way the grouped indices reflect reality and provide a better indication of what indices will be searched.	2018-09-24 11:45:33 +02:00
Christophe Bismuth	47ed6c79ee	[TEST] Add validate query tests for empty and malformed queries (#33862 ) Relates to #33095	2018-09-24 11:21:47 +02:00
Simon Willnauer	7d703c2f92	Fix AutoQueueAdjustingExecutorBuilder settings validation (#33922 ) Settings validation in AutoQueueAdjustingExecutorBuilder always checked against a default value which means that we never can change a max queue size that is lower than the default. This change adds tests and fixes this validation.	2018-09-24 07:45:50 +02:00
Nhat Nguyen	432e61c971	Adjust bwc for resync request (#33964 ) Relates #33964	2018-09-22 19:29:38 -04:00
Nhat Nguyen	f2f08dd6c5	Adjust bwc for recovery request (#33693 ) Relates #33693	2018-09-22 19:28:20 -04:00
Nhat Nguyen	e7ae2f9d36	Propagate auto_id_timestamp in primary-replica resync (#33964 ) A follow-up of #33693 to propagate max_seen_auto_id_timestamp in a primary-replica resync. Relates #33693	2018-09-22 11:40:10 -04:00
Nhat Nguyen	7944a0cb25	Track max seq_no of updates or deletes on primary (#33842 ) This PR is the first step to use seq_no to optimize indexing operations. The idea is to track the max seq_no of either update or delete ops on a primary, and transfer this information to replicas, and replicas use it to optimize indexing plan for index operations (with assigned seq_no). The max_seq_no_of_updates on primary is initialized once when a primary finishes its local recovery or peer recovery in relocation or being promoted. After that, the max_seq_no_of_updates is only advanced internally inside an engine when processing update or delete operations. Relates #33656	2018-09-22 08:02:57 -04:00
David Turner	1761b6c85c	Introduce FollowersChecker (#33917 ) It is important that the leader periodically checks that its followers are still healthy and can remain part of its cluster. If these checks fail repeatedly then the leader should remove the faulty node from the cluster. The FollowerChecker, introduced in this commit, performs these periodic checks and deals with retries.	2018-09-22 11:34:16 +01:00
Yannick Welsch	a612dd1272	Zen2: Add node id to log output of CoordinatorTests (#33929 ) With recent changes to the logging framework, the node name can no longer be injected into the logging output using the node.name setting, which means that for the CoordinatorTests (which are simulating a cluster in a fully deterministic fashion using a single thread), as all the different nodes are running under the same test thread, we are not able to distinguish which log lines are coming from which node. This commit readds logging for node ids in the CoordinatorTests, making two very small changes to DeterministicTaskQueue and TestThreadInfoPatternConverter.	2018-09-21 18:40:12 +02:00
Vladimir Dolzhenko	9c0316869b	Store: keep IndexFormatTooOldException and IndexFormatTooNewException in corruption marker (#33920 ) Closes #33916	2018-09-21 14:00:02 +02:00
Nik Everett	cac93949fe	API: Drop deprecated methods from Retry (#33925 ) We deprecated the `Retry.withBackoff` flavors with `Settings` in 6.5 because they were no longer needed. This drops them form 7.0.	2018-09-21 07:55:50 -04:00
Christoph Büscher	b654d986d7	Add OneStatementPerLineCheck to Checkstyle rules (#33682 ) This change adds the OneStatementPerLineCheck to our checkstyle precommit checks. This rule restricts the number of statements per line to one. The resoning behind this is that it is very difficult to read multiple statements on one line. People seem to mostly use it in short lambdas and switch statements in our code base, but just going through the changes already uncovered some actual problems in randomization in test code, so I think its worth it.	2018-09-21 11:52:31 +02:00
Nhat Nguyen	5f7f793f43	Propagate max_auto_id_timestamp in peer recovery (#33693 ) Today we don't store the auto-generated timestamp of append-only operations in Lucene; and assign -1 to every index operations constructed from LuceneChangesSnapshot. This looks innocent but it generates duplicate documents on a replica if a retry append-only arrives first via peer-recovery; then an original append-only arrives via replication. Since the retry append-only (delivered via recovery) does not have timestamp, the replica will happily optimizes the original request while it should not. This change transmits the max auto-generated timestamp from the primary to replicas before translog phase in peer recovery. This timestamp will prevent replicas from optimizing append-only requests if retry counterparts have been processed. Relates #33656 Relates #33222	2018-09-20 19:53:30 -04:00
Vladimir Dolzhenko	dbe6405354	mute RemoveCorruptedShardDataCommandTests.testCorruptedIndex	2018-09-20 21:30:40 +02:00
David Turner	187f787f52	[Zen2] Introduce LeaderChecker (#33024 ) It is important that follower nodes periodically check that their leader is still healthy and that they remain part of its cluster. If these checks fail repeatedly then followers should attempt to find and join a new leader, possibly electing one in the process. The LeaderChecker, introduced in this commit, performs these periodic checks and deals with retries.	2018-09-20 20:05:55 +01:00
Nhat Nguyen	76a1a863e3	TEST: stop assertSeqNos if shards movement (#33875 ) Currently, assertSeqNos assumes that the cluster is stable at the end of the test (i.e., no more shard movement). However, this assumption does not always hold. In these cases, we can stop the assertion instead of failing a test. Closes #33704	2018-09-20 13:44:26 -04:00
Christoph Büscher	28b1d41007	Fix unused import checktyle issue	2018-09-20 19:42:15 +02:00
Nhat Nguyen	002f763c48	Restore local history from translog on promotion (#33616 ) If a shard was serving as a replica when another shard was promoted to primary, then its Lucene index was reset to the global checkpoint. However, if the new primary fails before the primary/replica resync completes and we are now being promoted, we have to restore the reverted operations by replaying the translog to avoid losing acknowledged writes. Relates #33473 Relates #32867	2018-09-20 13:21:11 -04:00
Nhat Nguyen	b13a434f59	Remove wrong assert in LocalCheckpointTrackerTests It's possible for the set "seqNos" to contain only the "unFinishedSeq" in the testConcurrentReplica test. If this is the case, the call `randomValueOtherThan` won't make any progress because the predicate will never be false. This commit removes this expectation because it's incorrect and it's no longer needed as we have a dedicated test to verify the contains method. Relates #33871	2018-09-20 13:12:19 -04:00
Alan Woodward	b33c18d316	Move SoraniNormalizationFilterFactory to the common analysis plugin (#33892 ) Follow up to #25715	2018-09-20 17:31:41 +01:00
Yannick Welsch	db327818dd	[TEST] Enable DEBUG logging on testCreateShrinkIndexToN	2018-09-20 18:16:20 +02:00
Nik Everett	f963c29876	Logging: Drop Settings from some logger lookups (#33859 ) Drops `Settings` from some of the methods to lookup loggers and deprecates another logger lookup that takes `Settings` because `Settings` is no longer required to build a logger.	2018-09-20 10:42:48 -04:00
David Turner	0b4a6ae97c	Merge commit '3522b9084b611c89ec4f06c1863542883840ed0e' into zen2	2018-09-20 15:17:47 +01:00
Jake Landis	e37e5dfc04	ingest: support simulate with verbose for pipeline processor (#33839 ) * ingest: support simulate with verbose for pipeline processor This change better supports the use of simulate?verbose with the pipeline processor. Prior to this change any pipeline processors executed with simulate?verbose would not show all intermediate processors for the inner pipelines. This changes also moves the PipelineProcess and TrackingResultProcessor classes to enable instance checks and to avoid overly public classes. As well this updates the error message for when cycles are detected in pipelines calling other pipelines.	2018-09-20 08:33:07 -05:00
Simon Willnauer	3522b9084b	Introduce a `search_throttled` threadpool (#33732 ) Today all searches happen on the search threadpool which is the correct behavior in almost any case. Yet, there are exceptions where for instance searches searches should be passed through a single-thread thread-pool to reduce impact on a node. This change adds a index-private setting that allows to mark an index as throttled for searches and forks off all non-stats searcher access to this thread-pool for indices that are marked as `index.search.throttled`	2018-09-20 13:43:11 +02:00
David Turner	c041e94349	Test that transient settings beat persistent ones (#33818 ) Transient settings override persistent settings, but in fact all of the tests that run as part of `:server:test` and `:server:integTest` will pass if the precedence is changed to be the other way round. This change adds a test that verifies the precedence is as documented.	2018-09-20 11:17:19 +01:00
Tim Vernum	8d50c10208	Mute ShrinkIndexIT.testCreateShrinkIndexToN on Windows Relates: #33857	2018-09-20 18:21:15 +10:00
Daniel Mitterdorfer	b1cc58e425	Allow to clear the fielddata cache per field With this commit we clear the fielddata cache per field as it is supposed to be. Previously we retrieved the proper field from the cache but then cleared the entire cache anyway. Closes #33798 Relates #33807	2018-09-20 08:59:53 +02:00
Tim Vernum	1f1ebb4656	Add additional null check in _cat/shards The target of the func lambda may be null (e.g. in a mixed cluster where older nodes lack some of the values) Relates: #33858 / 331caba Closes #33877	2018-09-20 06:44:13 +02:00
Nhat Nguyen	05bf9dc2e8	Add contains method to LocalCheckpointTracker (#33871 ) This change adds "contains" method to LocalCheckpointTracker. One of the use cases is to check if a given operation has been processed in an engine or not by looking up its seq_no in LocalCheckpointTracker. Relates #33656	2018-09-19 20:29:36 -04:00
Gordon Brown	90de436e55	Use custom index metadata for ILM state (#33783 ) Using index settings for ILM state is fragile and exposes too much information that doesn't need to be exposed. Using custom index metadata is more resilient and allows more controlled access to internal information. As part of these changes, moves away from using defaults for ILM-related values, in favor of using null values to clearly indicate that the value is not present.	2018-09-19 14:50:48 -06:00
Nik Everett	26c4f1fb6c	Core: Default node.name to the hostname (#33677 ) Changes the default of the `node.name` setting to the hostname of the machine on which Elasticsearch is running. Previously it was the first 8 characters of the node id. This had the advantage of producing a unique name even when the node name isn't configured but the disadvantage of being unrecognizable and not being available until fairly late in the startup process. Of particular interest is that it isn't available until after logging is configured. This forces us to use a volatile read whenever we add the node name to the log. Using the hostname is available immediately on startup and is generally recognizable but has the disadvantage of not being unique when run on machines that don't set their hostname or when multiple elasticsearch processes are run on the same host. I believe that, taken together, it is better to default to the hostname. 1. Running multiple copies of Elasticsearch on the same node is a fairly advanced feature. We do it all the as part of the elasticsearch build for testing but we make sure to set the node name then. 2. That the node.name defaults to some flavor of "localhost" on an unconfigured box feels like it isn't going to come up too much in production. I expect most production deployments to at least set the hostname. As a bonus, production deployments need no longer set the node name in most cases. At least in my experience most folks set it to the hostname anyway.	2018-09-19 15:21:29 -04:00
Simon Willnauer	a92dda2e7e	Move CompletionStats into the Engine (#33847 ) By moving CompletionStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibiltiy out of IndexShard which has quiet some complexity already. Relates to #33835	2018-09-19 20:35:57 +02:00
Simon Willnauer	0fa5758bc6	Fix potential NPE in `_cat/shards/` with partial CommonStats (#33858 ) Today if we fetch common stats from a shard we might get a partial response if the shard is closed while we fetch the stats. This causes hard to track and reproduce NPEs. This change streamlines null checking to ensure we only render stats we actually received.	2018-09-19 20:34:54 +02:00
Nik Everett	3ede13a454	Test framework fall cleaning (#33423 ) Wraps all lines in our test framework at 140 characters because that is our standard line length and removes all of the checkstyle suppressions for the test framework. Drops most of `ModuleTestCase` because it isn't used and we're moving away from using guice in the way that it wants to test anyway. Also switches a few classes that extend it but don't use it to extend `ESTestCase` instead.	2018-09-19 14:34:02 -04:00
Lee Hinman	81e9150c7a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-19 09:43:26 -06:00
Simon Willnauer	6ec12bef0d	Add missing IndexShard#readAllowed() This was lost in #33835	2018-09-19 17:07:13 +02:00
Alan Woodward	5107949402	Allow TokenFilterFactories to rewrite themselves against their preceding chain (#33702 ) We currently special-case SynonymFilterFactory and SynonymGraphFilterFactory, which need to know their predecessors in the analysis chain in order to correctly analyze their synonym lists. This special-casing doesn't work with Referring filter factories, such as the Multiplexer or Conditional filters. We also have a number of filters (eg the Multiplexer) that will break synonyms when they appear before them in a chain, because they produce multiple tokens at the same position. This commit adds two methods to the TokenFilterFactory interface. * `getChainAwareTokenFilterFactory()` allows a filter factory to rewrite itself against its preceding filter chain, or to resolve references to other filters. It replaces `ReferringFilterFactory` and `CustomAnalyzerProvider.checkAndApplySynonymFilter`, and by default returns `this`. * `getSynonymFilter()` defines whether or not a filter should be applied when building a synonym list `Analyzer`. By default it returns `true`. Fixes #33609	2018-09-19 15:52:14 +01:00
Christoph Büscher	546e7361ed	[Tests] Nudge wait time in RemoteClusterServiceTests (#33853 ) This test occasionally fails in `testCollectSearchShards` waiting on what seems to be a search request to a remote cluster for one second. Given that the test fails here very rarely I suspect maybe one second is very rarely not enough so we could fix it by increasing the max wait time slightly. Closes #33852	2018-09-19 15:58:35 +02:00
Yannick Welsch	6551b4f651	Zen2: Integrate publication pipeline into Coordinator (#33771 ) Replaces the mock integration of Publication in CoordinatorTests by the real thing.	2018-09-19 13:36:11 +02:00
Yannick Welsch	10009434bf	Merge remote-tracking branch 'elastic/master' into zen2	2018-09-19 11:18:01 +02:00
Simon Willnauer	0c77f45dc6	Move DocsStats into Engine (#33835 ) By moving DocStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibility out of IndexShard which has quiet some complexity already.	2018-09-19 11:03:11 +02:00
Vladimir Dolzhenko	a3e8b831ee	add elasticsearch-shard tool (#32281 ) Relates #31389	2018-09-19 10:28:22 +02:00
Simon Willnauer	251489d59a	Cut over to unwrap segment reader (#33843 ) The fix in #33757 introduces some workaround since FilterCodecReader didn't support unwrapping. This cuts over to a more elegant fix to access the readers segment infos.	2018-09-19 10:18:03 +02:00
Jim Ferenczi	61e1df0274	Use the global doc id to generate a random score (#33599 ) This commit changes the random_score function to use the global docID of the document rather than the segment docID to generate random scores. As a result documents that have the same segment docID within the shard will generate different scores.	2018-09-19 09:28:38 +02:00
Adrien Grand	c4261bab44	Add minimal sanity checks to custom/scripted similarities. (#33564 ) Add minimal sanity checks to custom/scripted similarities. Lucene 8 introduced more constraints on similarities, in particular: - scores must not be negative, - scores must not decrease when term freq increases, - scores must not increase when norm (interpreted as an unsigned long) increases. We can't check every single case, but could at least run some sanity checks. Relates #33309	2018-09-19 09:19:13 +02:00
Ignacio Vera	7f473b683d	Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. (#33196 ) * Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. A ConstantScore query will return the iterator of its inner query. However, when profiling, the constant score query is wrapped separately from its inner query, which distorts the times emitted by the profiler. Return the iterator directly in such a case. Closes #23430	2018-09-18 23:32:16 -07:00
Lee Hinman	c87cff22b4	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-18 13:57:41 -06:00
Zachary Tong	f4cbbcf98b	Add ES version 6.4.2 (#33831 ) Version and properties files	2018-09-18 15:25:20 -04:00
Armin Braun	c6462057a1	MINOR: Remove Some Dead Code in Scripting (#33800 ) * The is default check method is not used in ScriptType * The removed vars on ExpressionSearchScript are unused	2018-09-18 20:43:31 +02:00
Simon Willnauer	9026c3ee92	Ensure realtime `_get` and `_termvectors` don't run on the network thread (#33814 ) The change in #27500 introduces this regression that causes `_get` and `_term_vector` actions to run on the network thread if the realtime flag is set. This fixes the issue by delegating to the super method forking on the corresponding threadpool.	2018-09-18 19:53:42 +02:00
Simon Willnauer	98ccd94962	Factor out a ChannelActionListener (#33819 ) We use similar / same concepts in SerachTransportService and HandledTransportAction but both duplicate the efforts with slightly different implementation details. This streamlines sending responses / exceptions back to a channel in an ActionListener with appropriate logging.	2018-09-18 19:53:26 +02:00
Jim Ferenczi	241c74efb2	upgrade to a new snapshot of Lucene 8 (7d0a7782fa) (#33812 )	2018-09-18 18:16:40 +02:00
David Turner	421f58e172	Remove discovery-file plugin (#33257 ) In #33241 we moved the file-based discovery functionality to core Elasticsearch, but preserved the `discovery-file` plugin, and support for the existing location of the `unicast_hosts.txt` file, for BWC reasons. This commit completes the removal of this plugin.	2018-09-18 12:01:16 +01:00
Yannick Welsch	758b2f9111	Zen2: Add DisruptableMockTransport (#33713 ) Adds a mock transport implementation that allows to simulate network disruptions.	2018-09-18 11:48:24 +02:00
markharwood	2fa09f062e	New plugin - Annotated_text field type (#30364 ) New plugin for annotated_text field type. Largely a copy of `text` field type but adds ability to include markdown-like syntax in the text. The “AnnotatedText” class parses text+markup and converts into plain text and AnnotationTokens. The annotation token values are injected unchanged alongside the regular text tokens to provide a form of additional indexed overlay useful in positional searches and highlighting. Annotated_text fields do not support fielddata as we want to phase this out. Also includes a new "annotated" highlighter type that retains annotations and merges in search hits as additional annotation markup. Closes #29467	2018-09-18 10:25:27 +01:00
Armin Braun	87cedef3cf	NETWORKING:Def CName in Http Publish Addr to True (#33631 ) * Follow up to #32806 setting the setting to true for 7.x	2018-09-18 10:29:02 +02:00
Armin Braun	615f494c77	MINOR: Drop Redundant Ctx. Check in ScriptService (#33782 ) * MINOR: Drop Redundant Ctx. Check in ScriptService * This check is completely redundant, the expression script engine will throw anyway (and with a similar message) for those contexts that it cannot compile. Moreover, the update context is not the only context that is not suported by the expression engine at this point so handling the update context separately here makes no sense.	2018-09-18 07:25:22 +02:00
Or Bin	a5bad4d92c	Docs: Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' (#33744 ) Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' Closes #33728	2018-09-17 15:35:54 -04:00
Lee Hinman	7ff11b4ae1	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-17 10:41:10 -06:00
Vladimir Dolzhenko	4d0bea705c	Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641 ) Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641) Closes #33596	2018-09-17 17:56:47 +02:00
Armin Braun	a654f21599	TESTS: Fix Concurent Remote Connection Updates (#33707 ) * Same fix idea as in #10666a4 to prevent background threads trying to reconnect after the tests are done from throwing `ExecutionCancelledException` and breaking the test * Closes #30714	2018-09-17 16:38:44 +02:00
David Turner	c79fbea923	[Zen2] Implement basic cluster formation (#33668 ) This PR integrates the following pieces of machinery in the Coordinator: - discovery - pre-voting - randomised election scheduling - joining (of a new master) - publication of cluster state updates Together, these things are everything needed to form a cluster. We therefore also add the start of a test suite that allows us to assert higher-level properties of the interactions between all these pieces of machinery, with as little fake behaviour as possible. We assert one such property: "a cluster successfully forms".	2018-09-17 15:00:30 +02:00
Bukhtawar	14d57c1115	Skip rebalancing when cluster_concurrent_rebalance threshold reached (#33329 ) Allows to skip shard balancing when the cluster_concurrent_rebalance threshold is already reached, which cuts down the time spent in the rebalance method of BalancedShardsAllocator.	2018-09-17 13:13:44 +02:00
Adrien Grand	b06a082725	Improve reproducibility of BigArraysTests. Close #33750	2018-09-17 11:59:15 +02:00
Christoph Büscher	1f2a90cb39	Mute DateTimeUnitTests.testConversion	2018-09-17 11:16:50 +02:00
Yannick Welsch	01b3be917a	Merge remote-tracking branch 'elastic/master' into zen2	2018-09-17 09:59:37 +02:00
Martijn van Groningen	34379887b4	Make custom index metadata completely immutable (#33735 ) Currently `IndexMetadata#getCustomData(...)` wraps the custom metadata in an unmodifiable map, but in case there is no entry for the specified key then a NPE is thrown by Collections.unmodifiableMap(...). This is not ideal in case callers like to throw an exception with a specific message. (like in the case for ccr to indicate that the follow index was not created by the create_and_follow api and therefor incompatible as follow index) I think making `DiffableStringMap` itself immutable is better then just wrapping custom metadata with `Collections.unmodifiableMap(...)` in all methods that access it. Also removed the `equals()`, `hashcode()` and to `toString()` methods of `DiffableStringMap`, because `AbstractMap` already implements these methods.	2018-09-17 07:51:34 +02:00
Ryan Ernst	3046656ab1	Scripting: Rework joda time backcompat (#33486 ) This commit switches the joda time backcompat in scripting to use augmentation over ZonedDateTime. The augmentation methods provide compatibility with the missing methods between joda's DateTime and java's ZonedDateTime. Due to getDayOfWeek returning an enum in the java API, ZonedDateTime is wrapped so that the method can return int like the joda time does. The java time api version is renamed to getDayOfWeekEnum, which will be kept through 7.x for compatibility while users switch back to getDayOfWeek once joda compatibility is removed.	2018-09-16 19:18:00 -07:00
Ryan Ernst	e5d82c3dea	Test: Fix dv date bwc tests when no docs have a value (#32798 ) This commit adds a guard around the rare case that no documents in the 10 iterations actually have any values, thus making the warning check incorrect. closes #32779	2018-09-16 11:11:51 -07:00
Lee Hinman	e6cbaa5a78	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-14 16:27:37 -06:00
Jason Tedor	a0f0d7860e	Cleanup assertions in global checkpoint listeners (#33722 ) This commit is a cleanup of the assertions in global checkpoint listeners, simplifying them and adding some messages to them in case the assertions trip.	2018-09-14 14:45:58 -04:00
Christoph Büscher	bcbbbdf660	[Tests] Fix randomization in StringTermsIT (#33678 ) It looks like the COLLECT_SEGMENT_ORDS flag should be randomized.	2018-09-14 15:52:47 +02:00
Jason Tedor	39191331d1	Only notify ready global checkpoint listeners (#33690 ) When we add a global checkpoint listener, it is also carries along with it a value that it thinks is the current global checkpoint. This value can be above the actual global checkpoint on a shard if the listener knows the global checkpoint from another shard copy (e.g., the primary), and the current shard copy is lagging behind. Today we notify the listener whenever the global checkpoint advances, regardless if it goes above the current global checkpoint known to the listener. This commit reworks this implementation. Rather than thinking of the value associated with the listener as the current global checkpoint known to the listener, we think of it as the value that the listener is waiting for the global checkpoint to advance to (inclusive). Now instead of notifying all waiting listeners when the global checkpoint advances, we only notify those that are waiting for a value not larger than the actual global checkpoint that we advanced to.	2018-09-14 09:32:03 -04:00
Adrien Grand	4f68104865	Don't count hits via the collector if the hit count can be computed from index stats. (#33701 ) This is something that we were already doing when sorting by field, which is now also done when sorting by score. As-is this change will speed up top-k `term` queries. This could work for `match_all` queries as well when we implement the `setMinCompetitiveScore` API on their Scorer.	2018-09-14 14:59:16 +02:00
David Turner	31e8781eaa	Merge branch 'master' into zen2	2018-09-14 14:28:28 +02:00
Alexander Reelsen	faa3c16241	Core: Add DateFormatter interface for java time parsing (#33467 ) The existing approach used date formatters when a format based string like `date_time\|\|epoch_millis` was used, instead of the custom code. In order to properly solve this, a new interface called `DateFormatter` has been added, which now can be implemented for custom formatters. Currently there are two implementations, one using java time and one doing the epoch_millis formatter, which simply parses a number and then converts it to a date in UTC timezone. The DateFormatter interface now also has a method to retrieve the name of the formatter pattern, which is needed for mapping changes anyway. The existing `CompoundDateTimeFormatter` class has been removed, the name was not really nice anyway. One more minor change is the fact, that the new java time using FormatDateFormatter does not try to parse the date with its printer implementation first (which might be a strict one and fail), but a printer can now be specified in addition. This saves one potential failure/exception when parsing less strict dates. If only a printer is specified, the printer will also be used as a parser.	2018-09-14 13:55:16 +02:00
Igor Motov	b8fb83d7a4	Mute ClusterDisruptionIT#testSendingShardFailure Tracked by #33704	2018-09-14 14:24:06 +04:00
Armin Braun	0b4960ff6b	SCRIPTING: Move terms_set Context to its Own Class (#33602 ) * SCRIPTING: Move terms_set Context to its Own Class * Extracted TermsSetQueryScript * Kept mechanics close to what they were with SearchScript	2018-09-14 06:21:18 +02:00
Armin Braun	040695b64e	CORE: Disable Setting Type Validation (#33660 ) (#33669 ) * Reverts setting type validation introduced in #33503	2018-09-13 20:45:48 +02:00
Jason Tedor	e4eb631b8e	Revert "Use serializable exception in GCP listeners (#33657 )" This reverts commit `6dfe54c838`.	2018-09-13 13:55:19 -04:00
Nhat Nguyen	b3071133d4	TEST: decrease logging level in the flush test Relates #31629	2018-09-13 11:18:03 -04:00
Jason Tedor	d806a0e59d	Fix race in global checkpoint listeners test This race can occur if the latch from the listener notifies the test thread and the test thread races ahead before the scheduler thread has a chance to emit the log message. This commit fixes this test by not counting down the latch until after the log message we are going to assert on has been emitted.	2018-09-13 07:00:40 -04:00
Jason Tedor	6dfe54c838	Use serializable exception in GCP listeners (#33657 ) We used TimeoutException here but that's not serializable. This commit switches to a serializable exception so that we can test for the exception type on the remote side.	2018-09-13 06:35:36 -04:00
Colin Goodheart-Smithe	8e59de3eb2	Merge branch 'master' into index-lifecycle	2018-09-13 09:46:14 +01:00
Jim Ferenczi	6ca36bba15	Fix field mapping updates with similarity (#33634 ) This change fixes a bug introduced in 6.3 that prevents fields with an explicit similarity to be updated. It also adds a test that checks this case for similarities but also for analyzers since they could suffer from the same problem. Closes #33611	2018-09-13 09:21:27 +02:00
David Turner	5a3fd8e4e7	Use file-based discovery not MockUncasedHostsProvider (#33554 ) Today we use a special unicast hosts provider, the `MockUncasedHostsProvider`, in many integration tests, to deal with the dynamic nature of the allocation of ports to nodes. However #33241 allows us to use file-based discovery to achieve the same goal, so the special test-only `MockUncasedHostsProvider` is no longer required. This change removes `MockUncasedHostProvider` and replaces it with file-based discovery in tests based on `EsIntegTestCase`.	2018-09-13 07:37:15 +02:00
Nhat Nguyen	b097eff342	Resync fails to notify on unavaiable exceptions (#33615 ) We fail to notify the resync listener if the resync replication hits a shard unavailable exception. Moreover, we no longer need to swallow these unavailable exceptions. Relates #28571 Closes #33613	2018-09-12 21:27:59 -04:00
Jason Tedor	9b8fe85edb	Remove volatile from global checkpoint listeners (#33636 ) This field does not need to be volatile because all accesses are done under a lock. This commit removes the unnecessary volatile modifier from this field.	2018-09-12 14:38:24 -04:00
Jason Tedor	c023f67c5d	Add migration note for remote cluster settings (#33632 ) The remote cluster settings search.remote.* have been renamed to cluster.remote.* and are automatically upgraded in the cluster state on gateway recovery, and on put. This commit adds a note to the migration docs for these changes.	2018-09-12 13:37:11 -04:00
Simon Willnauer	c783488e97	Add `_source`-only snapshot repository (#32844 ) This change adds a `_source` only snapshot repository that allows to wrap any existing repository as a _backend_ to snapshot only the `_source` part including live docs markers. Snapshots taken with the `source` repository won't include any indices, doc-values or points. The snapshot will be reduced in size and functionality such that it requires full re-indexing after it's successfully restored. The restore process will copy the `_source` data locally starts a special shard and engine to allow `match_all` scrolls and searches. Any other query, or get call will fail with and unsupported operation exception. The restored index is also marked as read-only. This feature aims mainly for disaster recovery use-cases where snapshot size is a concern or where time to restore is less of an issue. NOTE: The snapshot produced by this repository is still a valid lucene index. This change doesn't allow for any longer retention policies which is out of scope for this change.	2018-09-12 17:47:10 +02:00
Jason Tedor	36ba3cda7e	Enable global checkpoint listeners to timeout (#33620 ) In cross-cluster replication, we will use global checkpoint listeners to long poll for updates to a shard. However, we do not want these polls to wait indefinitely as it could be difficult to discern if the listener is still waiting for updates versus something has gone horribly wrong and cross-cluster replication is stuck. Instead, we want these listeners to timeout after some period (for example, one minute) so that they are notified and we can update status on the following side that cross-cluster replication is still active. After this, we will immediately enter back into a poll mode. To do this, we need the ability to associate a timeout with a global checkpoint listener. This commit adds this capability.	2018-09-12 10:53:22 -04:00
Nhat Nguyen	d9bbb89b26	TEST: Adjust rollback condition when shard is empty If a shard is empty, it won't rollback its engine on promotion. This commit adjusts the expectation in the rollback test. Relates #33473	2018-09-12 08:26:02 -04:00
lipsill	c92ec1c5d7	Forbid negative `weight` in Function Score Query (#33390 ) This change forbids negative `weight` in Function Score query. Negative scores are forbidden in Lucene 8.	2018-09-12 09:16:40 +02:00
Jim Ferenczi	4561c5ee83	Clarify context suggestions filtering and boosting (#33601 ) This change clarifies the documentation of the context completion suggester regarding filtering and boosting with contexts. Unlike the suggester v1, filtering on multiple contexts works as a disjunction, a suggestion matches if it contains at least one of the provided context values and boosting selects the maximum score among the matching contexts. This commit also adapts an old test that was written for the v1 suggester and commented out for version 2 because the behavior changed.	2018-09-12 08:47:32 +02:00
Jason Tedor	c74c46edc3	Upgrade remote cluster settings (#33537 ) This commit adds settings upgraders for the search.remote.* settings that can be in the cluster state to automatically upgrade these settings to cluster.remote.*. Because of the infrastructure that we have here, these settings can be upgraded when recovering the cluster state, but also when a user tries to make a dynamic update for these settings.	2018-09-12 01:14:43 -04:00
Armin Braun	94cdf0ceba	NETWORKING: http.publish_host Should Contain CNAME (#32806 ) * NETWORKING: http.publish_host Should Contain CNAME * Closes #22029	2018-09-12 06:15:36 +02:00
Jason Tedor	9752540866	Add test coverage for global checkpoint listeners This commit adds test coverage for two cases not previously covered by the existing testing. Namely, we add coverage ensuring that the executor is used to notify listeners being added that are immediately notified because the shard is closed or because the global checkpoint is already beyond what the listener knows.	2018-09-11 23:19:27 -04:00
Nhat Nguyen	743327efc2	Reset replica engine to global checkpoint on promotion (#33473 ) When a replica starts following a newly promoted primary, it may have some operations which don't exist on the new primary. Thus we need to throw those operations to align a replica with the new primary. This can be done by first resetting an engine from the safe commit, then replaying the local translog up to the global checkpoint. Relates #32867	2018-09-11 22:09:37 -04:00

... 5 6 7 8 9 ...

1995 Commits