OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	1847bbac4d	Tests: Use random analyzer only on string fields in Match/MultiMatchBuilderTests Currently we can run into test errors by accidentally using e.g. a "simple" analyzer on a numeric field which might lead to number parsing errors. While these errors are correct, we should avoid these combinations in our regular tests.	2017-04-12 11:32:48 +02:00
Ryan Ernst	1207103b6d	S3 Repository: Eagerly load static settings (#23910 ) The S3 repository has many levels of settings it looks at to create a repository, and these settings were read at repository creation time. This meant secure settings like access and secret keys had to be available after node construction. This change makes loading of all settings except repository-level settings eager, so that secure settings can be stashed, and the keystore can once again be closed after bootstrapping the node is complete.	2017-04-11 15:42:56 -07:00
Jason Tedor	b4c3bb5d21	Reject duplicate settings on the command line Today Elasticsearch and other CLI tools that rely on environment aware command leniently accept duplicate settings with the last one winning. This commit removes this leniency. Relates #24053	2017-04-11 18:30:05 -04:00
Tim Brooks	cf6b03c8f4	Wildcard cluster names for cross cluster search (#23985 ) This is related to #23893. This commit allows users to use wildcards for cluster names when executing a cross cluster search. So instead of defining every cluster such as: GET one:*,two:*,three:*/_search A user could just search: GET *:*/_search As ":" characters are currently allowed in index names, if the text up to the first ":" does not match a defined cluster name, the entire string is treated as an index name.	2017-04-11 13:56:26 -05:00
Lee Hinman	5cace8e48a	Remove shadow replicas Resolves #22024	2017-04-11 11:26:26 -06:00
Simon Willnauer	e30a275bfe	Add a dedicated TransportRemoteInfoAction for consistency (#24040 ) All our actions that are invoked from rest actions have corresponding transport actions. This adds the transport action for RestRemoteClusterInfoAction for consistency. Relates to #23969	2017-04-11 14:40:37 +02:00
Yannick Welsch	88a54f14c7	Trigger replica recovery restarts by master when primary relocation completes (#23926 ) When a primary relocation completes while there are ongoing replica recoveries, the recoveries for these replicas need to be restarted (as a new primary is in charge of replicating changes). Before this commit, the need for a recovery restart was detected by the data nodes that had the replicas, by checking on each cluster state update if the recovery process had completed before the recovery source changed. That code had a race, however, which could lead to a not-fully recovered shard exposing itself as started (see #23904). This commit takes a different approach: When the primary relocation completes and the master updates the cluster state to move the primary shard from relocating to started, it will reinitialize all initializing replica shards, by giving them a fresh allocation id. Data nodes that have the replica shard will simply detect that the allocation id changed and restart the recovery process (instead of trying to determine the need to restart based on ongoing recoveries). Note: Removal of the code in IndicesClusterStateService that checks whether the recovery source has changed will not be backported to the 5.x branch. This ensures backward compatibility for the situation where the master node is older and does not have the code changes that have been introduced in this PR. Closes #23904	2017-04-11 11:21:57 +02:00
Colin Goodheart-Smithe	0114f0061c	Removes version 2.x constants from Version (#24011 ) * Removes version 2.x constants from Version Closes #21887 * Addresses review comments	2017-04-11 08:31:22 +01:00
Simon Willnauer	f22e0dc30b	Add cross-cluster search remote cluster info API (#23969 ) This commit adds an API to discover information like seed nodes, http addresses and connection status of a configured remote cluster. Closes #23925	2017-04-11 09:24:40 +02:00
Nik Everett	16a2048416	Remove real time from tests (#24025 ) The `AsyncBulkByScrollActionTests` were brittle because they used the current time. That was a mistake. This removes the current time from the test, instead adding it to the parameters passed in to the appropriate methods. This means that we take the current time slightly earlier in all cases, but that shouldn't make a difference. Closes #24005 Example failure: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+nfs/161/consoleFull	2017-04-10 17:55:02 -04:00
Ryan Ernst	65f7a76630	Settings: Add secure file setting to keystore (#24001 ) Some systems like GCE rely on a plaintext file containing credentials. Rather than extract the information out of that credentials file and store each piece individually in the keystore, it is cleaner to just store the entire file. This commit adds support to the keystore wrapper for secure file settings. These are settings that contain an entire file that would normally be stored on the local filesystem. Retrieving the file returns an input stream to the file contents. This also adds an `add-file` command to the keystore cli. In order to support both strings and files as values for settings, the metadata format of the keystore has also been updated (with backcompat) to keep a map of setting name to type.	2017-04-10 13:10:42 -07:00
Simon Willnauer	a61fb3f708	Remove support for lucene versions without checksums (#24021 ) We are still carrying some legacy code that deals with lucene indices that don't have checksums. Yet we have not supported these indices for a while now; in fact, since version 5.0 such an index is no longer supported. This commit removes all the special handling and leniency involved.	2017-04-10 18:16:34 +02:00
Martijn van Groningen	887f3ed8dc	inner_hits: Replace `NestedChildrenQuery` with `ParentChildrenBlockJoinQuery`. Closes #24009	2017-04-10 17:36:45 +02:00
Lee Hinman	53d4d747a6	Mark IndexWithShadowReplicasIT as AwaitsFix Relates to #24007 and #23906	2017-04-10 09:32:20 -06:00
Simon Willnauer	040b86a76b	Set shard count limit to unlimited (#24012 ) Now that we have incremental reduce functions for topN and aggregations we can set the default for `action.search.shard_count.limit` to unlimited. Users can still restrict this setting, while by default we execute across all shards matching the search request's index pattern.	2017-04-10 17:09:21 +02:00
Luca Cavanna	2c545c064d	Move getProperty method out of MultiBucketsAggregation.Bucket interface (#23988 ) The getProperty method is an internal method needed to run pipeline aggregations and retrieve info by path from the aggs tree. It is not needed in the MultiBucketsAggregation.Bucket interface, which is returned to users running aggregations from the transport client. The method is moved to the InternalMultiBucketAggregation class as that's where it belongs.	2017-04-10 13:35:01 +02:00
Luca Cavanna	93f159429f	Remove getProperty method from Aggregations interface and impl (#23972 ) The `getProperty` method is an internal method needed to run pipeline aggregations and retrieve info by path from the aggs tree. It is not needed in the `Aggregations` interface, which is returned to users running aggregations from the transport client. Furthermore, the method is currently unused by pipeline aggs too, as only InternalAggregation#getProperty is used. It can then be removed.	2017-04-10 12:31:45 +02:00
Luca Cavanna	b283c8b768	Move aggs CommonFields and TYPED_KEYS_DELIMITER from InternalAggregation to Aggregation (#23987 ) These will be shared between internal objects and objects exposed through high level REST client, so they should be moved from internal classes.	2017-04-10 12:30:02 +02:00
Luca Cavanna	9db8a266e6	Un-deprecate NamedXContentRegistry.Entry constructor that takes a context (#23986 ) We deprecated this method in the past because we thought it was a temporary thing that could go away over time. We radically trimmed down the usages of a context while parsing when we got rid of the ParseFieldMatcher, but the usages that are left are legit and we will hardly get rid of them. Also, working on aggs parsing we will need a context to carry around the aggregation name that gets parsed through XContentParser#namedObject .	2017-04-10 12:28:56 +02:00
Yannick Welsch	12471c4f76	[TEST] Fix wait condition on testMultipleNodesShutdownNonMasterNodes After two nodes are being stopped and two more are joining the cluster, we first have to wait on the cluster to consist of the right nodes before waiting on green status, otherwise we might get a green status for a cluster with dead nodes.	2017-04-10 11:38:56 +02:00
Jim Ferenczi	9b3c85dd88	Deprecate _field_stats endpoint (#23914 ) _field_stats has evolved quite a lot to become a multi purpose API capable of retrieving the field capabilities and the min/max value for a field. In the meantime a more focused API called `_field_caps` has been added. This endpoint is a good replacement for _field_stats since it can retrieve the field capabilities by just looking at the field mapping (no lookup in the index structures). Also the recent improvement made to range queries makes the _field_stats API obsolete since these queries are now rewritten per shard based on the min/max found for the field. This means that a range query that does not match any document in a shard can return quickly and can be cached efficiently. For these reasons this change deprecates _field_stats. The deprecation should happen in 5.4 but we won't remove this API in 6.x yet which is why this PR is made directly to 6.0. The rest tests have also been adapted to not throw an error while this change is backported to 5.4.	2017-04-10 10:10:16 +02:00
Simon Willnauer	1f40f8a2d2	Introduce incremental reduction of TopDocs (#23946 ) This commit adds support for incremental top N reduction if the number of expected shards in the search request is high enough. The changes here also clean up more code in SearchPhaseController to make the separation between values that are the same on each search result and values that are per response. The reduced search phase result doesn't hold an arbitrary result to obtain values like `from`, `size` or sort values which is now cleanly encapsulated.	2017-04-10 09:37:52 +02:00
Boaz Leskes	b636ca79d5	Engine: version logic on replicas should not be hard coded (#23998 ) The refactoring in #23711 hardcoded version logic for replica to assume monotonic versions. Sadly that's wrong for `FORCE` and `VERSION_GTE`. Instead we should use the methods in VersionType to detect conflicts. Note - once replicas use sequence numbers for out of order delivery, this logic goes away.	2017-04-09 22:04:12 +02:00
Boaz Leskes	f0df5e64d8	InternalEngineTests: fix a potential NPE in assertOpsOnPrimary assertOpsOnPrimary may inherit a situation where the document exists but it doesn't have the last indexed value. This could cause an NPE.	2017-04-09 21:21:00 +02:00
Jason Tedor	61c5976aee	Upgrade to Log4j 2.8.2 This commit upgrades the Log4j dependencies from version 2.7 to version 2.8.2. This release includes a fix for a case where Log4j could lose exceptions in the presence of a security manager. Relates #23995	2017-04-09 07:19:16 -04:00
Jason Tedor	5c8d5677a4	Suppress ExtrasFS in plugins service tests The ExtrasFS filesystem creates extra directories when creating temp directories during tests to ensure that Lucene does not care about extra files. These extra files get in our way in the plugins service tests because some of these tests are counting only on certain directories existing. This commit suppresses the ExtrasFS filesystem for the plugins service tests, and fixes a test that was passing for the wrong reason (because of the existence of an extra directory from ExtrasFS).	2017-04-08 20:42:18 -04:00
Jason Tedor	9056e0cb49	Remove hidden file leniency from plugin service This commit removes some leniency from the plugin service which skips hidden files in the plugins directory. We really want to ensure the integrity of the plugin folder, so hasta la vista leniency. Relates #23982	2017-04-08 18:22:44 -04:00
Ryan Ernst	73b8aad9a3	Settings: Disallow secure setting to exist in normal settings (#23976 ) This commit removes the "legacy" feature of secure settings, which set up a parallel setting that was a fallback in the insecure elasticsearch.yml. This was previously used to allow the new secure setting name to be that of the old setting name, but is now not in use due to other refactorings. It is much cleaner to just have all secure settings use new setting names. If in the future we want to reuse the previous setting name, once support for the insecure settings has been removed, we can then rename the secure setting. This also adds a test for the behavior.	2017-04-07 14:18:06 -07:00
Simon Willnauer	0c465b1931	Add comment why we check for null fetch results during merge	2017-04-07 21:00:19 +02:00
Jason Tedor	457a76c1c6	Fix import order in Spawner The imports are not in alphabetical order in Spawner.java and this is a crime that is rectified by this commit.	2017-04-07 14:52:22 -04:00
Yannick Welsch	a3cceb8a00	[TEST] Fix testMultipleNodesShutdownNonMasterNodes to wait for the right nodes to rejoin the cluster This test was sporadically failing for the following reason: - 4 nodes (nodes 0, 1, 2, and 3) running with `minimum_master_nodes` set to 3 - we stop 2 nodes (node 0 and 3) - wait for cluster block to be in place on all nodes - start 2 nodes (node 4 and node 5) and do a `prepareHealth().setWaitForNodes("4")` - then do a search request The search request runs into the `ClusterBlockException` as the `prepareHealth().setWaitForNodes("4")` check succeeds on a cluster state that has nodes 1, 2, 3, and 4, i.e., only one of the two new nodes has joined the cluster and only one of the two dead nodes was removed by the master (removing the dead nodes only happens after there are again `minimum_master_nodes` nodes in the cluster). This commit fixes the issue by reusing a method from InternalTestCluster that checks that the right nodes have rejoined the cluster.	2017-04-07 15:26:21 +02:00
Jim Ferenczi	0821fa23ff	Restore special case for wildcard on _all query to rewrite to a match all query (#23967 ) This change restores the rewrite to a match all query that we used to apply on wildcard query * on the query_string parser before #23433.	2017-04-07 15:15:43 +02:00
Yannick Welsch	8522b43ce7	[TEST] Take cluster state batching into account in testNodeFailuresAreProcessedOnce The test assumes that two nodes leaving the cluster results in two cluster state updates on the master, which is invalidated by cluster state batching.	2017-04-07 14:43:38 +02:00
Christoph Büscher	4f94aa8a6a	Tests: Fix highlighter fields order in TopHitsTests (#23968 ) Shuffling xContent breaks the order of the highlighter fields in the internal list if the highlighter doesn't use the array syntax. In other tests we avoid shuffling this json level, but since this is done in the base test for aggregations we should ensure the highlight builder uses the array syntax here.	2017-04-07 14:24:32 +02:00
Luca Cavanna	e156dbaf42	Move getProperty method out of Aggregation interface (#23949 ) The `getProperty` method is an internal method needed to run pipeline aggregations and retrieve info by path from the aggs tree. It is not needed in the `Aggregation` interface, which is returned to users running aggregations from the transport client. The method is moved to the InternalAggregation class as that's where it belongs.	2017-04-07 10:55:35 +02:00
Luca Cavanna	13cf8aaa52	[TEST] fix shuffling of xContent keys (#23929 ) ESTestCase has methods to shuffle xContent keys given a builder or a parser. Shuffling wasn't actually doing what was expected but rather reordering the keys in their natural ordering, hence the output was always the same at every run. Corrected that and added tests, also fixed a couple of tests that were affected by this fix.	2017-04-07 10:20:32 +02:00
Ali Beyad	480cfe3fe0	Fixes snapshot status on failed snapshots (#23833 ) If a snapshot is taken on multiple indices, and some of them are "good" indices that don't contain any corruption or failures, and some of them are "bad" indices that contain missing shards or corrupted shards, and if the snapshot request is set to partial=false (meaning don't take a snapshot if there are any failures), then the good indices will not be snapshotted either. Previously, when getting the status of such a snapshot, a 500 error would be thrown, because the snap-*.dat blob for the shards in the good index could not be found. This commit fixes the problem by reporting shards of good indices as failed due to a failed snapshot, instead of throwing the NoSuchFileException. Closes #23716	2017-04-06 20:54:21 -04:00
Jay Modi	495bf21b46	Preserve response headers when creating an index (#23950 ) This commit preserves the response headers when creating an index and updating settings for an index. Closes #23947	2017-04-06 20:38:09 +01:00
Jim Ferenczi	042f7566e8	update Version.V_5_3_1_UNRELEASED to the latest bugfix release of Lucene:6_4_2	2017-04-06 10:03:17 +02:00
Jim Ferenczi	38009efedd	Disable graph analysis at query time for shingle and cjk filters producing tokens of different size (#23920 ) This change disables graph analysis of token streams containing a shingle or cjk filter that produces shingles or ngrams of different sizes. The graph analysis is disabled for phrase and boolean queries. Closes #23918	2017-04-06 08:55:00 +02:00
Tim Brooks	5b1fbe5e6c	Decouple BulkProcessor from client implementation (#23373 ) This commit modifies the BulkProcessor to be decoupled from the client implementation. Instead it just takes a BiConsumer<BulkRequest, ActionListener<BulkResponse>> that executes the BulkRequest.	2017-04-05 12:12:43 -05:00
Lee Hinman	0257a7b97a	Only re-parse operation if a mapping update was needed When executing an index operation on the primary shard, `TransportShardBulkAction` first parses the document, sees if there are any mapping updates that need to be applied, and then updates the mapping on the master node. It then re-parses the document to make sure that the mappings have been applied and propagated. This adds a check that skips the second parsing of the document in the event there was not a mapping update applied in the first case. Fixes a performance regression introduced in #23665	2017-04-05 09:29:44 -06:00
Adrien Grand	d5d0f140d6	The `filter` and `significant_terms` aggregations should parse the `filter` as a filter, not as a query. (#23797 ) This is important for some queries like `bool`, which are parsed differently depending on whether we want to get a query or a filter.	2017-04-05 16:46:21 +02:00
Simon Willnauer	adccdbb3cf	Simplify sorted top docs merging in SearchPhaseController (#23881 ) Today we have several code paths to merge top docs based on the number of search results returned from the shards. If there is only a single shard holding any hits we go down a different code path with quite some complexity, while if there is more than one the code is basically duplicated to save the creation of a dense array of top docs which can be large if there are many results. This commit removes the need for the dense array and in turn the justification for the optimization. This commit introduces a single code path to merge top docs.	2017-04-05 14:49:35 +02:00
Boaz Leskes	75b4f408e0	Refactor InternalEngine's index/delete flow for better clarity (#23711 ) The InternalEngine Index/Delete methods (plus satellites like version loading from Lucene) have accumulated some cruft over the years, making it hard to clearly follow the code flows for various use cases (primary indexing/recovery/replicas etc). This PR refactors those methods for better readability. The methods are broken up into smaller sub methods, albeit at the price of less code reuse. To support the refactoring I have considerably beefed up the versioning tests. This PR is a spin-off from #23543 , which made it clear this is needed.	2017-04-05 14:43:01 +02:00
Boaz Leskes	c89fdd938e	ZenDiscovery - only validate min_master_nodes values if local node is master (#23915 ) The purpose of this validation is to make sure that the master doesn't step down due to a change in master nodes, which also means that there is no way to revert an accidental change. Since we validate using the current cluster state (and not the one from which the settings come) we have to be careful and only validate if the local node is already a master. Doing so all the time causes subtle issues. For example, a node that joins a cluster has no nodes in its current cluster state. When it receives a cluster state from the master with a dynamic minimum master nodes setting in it, we must make sure we don't reject it. Closes #23695	2017-04-05 14:31:32 +02:00
Jason Tedor	24127bf416	Remove hardcoded ports from SingleNodeDiscoveryIT SingleNodeDiscoveryIT uses a hardcoded port for the purpose of binding two nodes within the limited port range that an unconfigured unicast zen ping hosts list would try to discover another node on. This commit at least removes this hardcoding for the first node to come up, although still tries to bind the second node to the limited port range after the first node has bound.	2017-04-05 08:17:33 -04:00
Luca Cavanna	318d365b12	[TEST] make sure that fromXContent doesn't rely on keys ordering (#23901 ) We shuffle the keys before we parse our responses for the high level client so that we make sure we never rely on keys ordering.	2017-04-05 11:12:34 +02:00
Jason Tedor	afd45c1432	Revert "Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 )" This reverts commit `6bfecdf921`.	2017-04-04 20:33:51 -04:00
Jay Modi	6bfecdf921	Closing a ReleasableBytesStreamOutput closes the underlying BigArray (#23572 ) This commit makes closing a ReleasableBytesStreamOutput release the underlying BigArray so that we can use try-with-resources with these streams and avoid leaking memory by not returning the BigArray. As part of this change, the ReleasableBytesStreamOutput adds protection to only release the BigArray once. In order to make some of the changes cleaner, the ReleasableBytesStream interface has been removed. The BytesStream interface is changed to an abstract class so that we can use it as a usable return type for a new method, Streams#flushOnCloseStream. This new method wraps a given stream and overrides the close method so that the stream is simply flushed and not closed. This behavior is used in the TcpTransport when compression is used with a ReleasableBytesStreamOutput as we need to close the compressed stream to ensure all of the data is written from this stream. Closing the compressed stream will try to close the underlying stream but we only want to flush so that all of the written bytes are available. Additionally, an error message method added in the BytesRestResponse did not use a builder provided by the channel and instead created its own JSON builder. This changes that method to use the channel builder and in turn the bytes stream output that is managed by the channel.	2017-04-04 17:01:30 +01:00
Jason Tedor	3136ed1490	Rename random ASCII helper methods This commit renames the random ASCII helper methods in ESTestCase. This is because this method ultimately uses the random ASCII methods from randomized runner, but these methods actually only produce random strings generated from [a-zA-Z]. Relates #23886	2017-04-04 11:04:18 -04:00
Jason Tedor	a01f77210a	Fix Javadocs for BootstrapChecks#enforceLimits This commit adds a description for a parameter that was added to BootstrapChecks#enforceLimits(BoundTransportAddress, String) without the Javadocs having been updated.	2017-04-04 09:42:19 -04:00
Jason Tedor	51b5dbffb7	Disable bootstrap checks for single-node discovery While there are use-cases where a single-node is in production, there are also use-cases for starting a single-node that binds transport to an external interface where the node is not in production (for example, for testing the transport client against a node started in a Docker container). It's tricky to balance the desire to always enforce the bootstrap checks when a node might be in production with the need for the community to perform testing in situations that would trip the bootstrap checks. This commit enables some flexibility for these users. By setting the discovery type to "single-node", we disable the bootstrap checks independently of how transport is bound. While this sounds like a hole in the bootstrap checks, the bootstrap checks can already be avoided in the single-node use-case by binding only HTTP but not transport. For users that are genuinely in production on a single-node use-case with transport bound to an external use-case, they can set the system property "es.enable.bootstrap.checks" to force running the bootstrap checks. It would be a mistake for them not to do this. Relates #23598	2017-04-04 09:39:04 -04:00
Jim Ferenczi	c14be20744	Add unit tests for the missing aggregator (#23895 ) * Add unit tests for the missing aggregator Relates #22278	2017-04-04 14:37:33 +02:00
Jim Ferenczi	a04350f0dd	Add a property to mark setting as final (#23872 ) This change adds a setting property that sets the value of a setting as final. Updating a final setting is prohibited in any context, for instance an index setting marked as final must be set at index creation and will refuse any update even if the index is closed. This change also marks the setting `index.number_of_shards` as Final and the special casing for refusing the updates on this setting has been removed.	2017-04-04 12:35:48 +02:00
Jason Tedor	71293a89bf	Introduce single-node discovery This commit adds a single node discovery type. With this discovery type, a node will elect itself as master and never form a cluster with another node. Relates #23595	2017-04-04 03:02:58 -04:00
Jason Tedor	3bd2efa177	Await termination after shutting down executors When terminating an executor service or a thread pool, we first shutdown. Then, we do a timed await termination. If the await termination fails because there are still tasks running, we then shutdown now. However, this method does not wait for actively executing tasks to terminate, so we should again wait for termination of these tasks before returning. This commit does that. Relates #23889	2017-04-04 03:01:00 -04:00
Jason Tedor	6234a49fb3	Fix initialization issue in ElasticsearchException If a test touches ElasticsearchExceptionHandle before the class initializer for ElasticsearchException has run, a circular class initialization problem can arise. Namely, the class initializer for ElasticsearchException depends on the class initializer for ElasticsearchExceptionHandle which depends on the class initializer for all the classes that extend ElasticsearchException, but these classes can not be loaded because ElasticsearchException has not finished its class initializer. There are tests that can trigger this before ElasticsearchException has been loaded due to an unlucky ordering of test execution. This commit addresses this issue by making ElasticsearchExceptionHandle private, and then exposing methods that provide the necessary values from ElasticsearchExceptionHandle. Touching these methods will force the class initializer for ElasticsearchException to run first.	2017-04-04 00:33:00 -04:00
Boaz Leskes	48b0121f60	SpecificMasterNodesIT shouldn't use autoMinMasterNodes as it tweaks the `discovery.initial_state_timeout` setting.	2017-04-03 16:23:17 +02:00
Boaz Leskes	40eb68c95a	testRestorePersistentSettings doesn't need to mess with discovery settings	2017-04-03 16:23:17 +02:00
Colin Goodheart-Smithe	8482503f9b	Adds tests for cardinality and filter aggregations Relates to #22278	2017-04-03 10:09:27 +01:00
Colin Goodheart-Smithe	cad4fcd9c9	Revert "Adds tests for cardinality and filter aggregations (#23826 )" This reverts commit `058869ed54`.	2017-04-03 09:45:16 +01:00
Colin Goodheart-Smithe	058869ed54	Adds tests for cardinality and filter aggregations (#23826 ) * Adds tests for cardinality and filter aggregations Relates to #22278 * addresses review comments	2017-04-03 09:39:03 +01:00
Jim Ferenczi	7316b663e2	Replace custom sort field with SortedSetSortField and SortedNumericSortField when possible (#23827 ) Currently for field sorting we always use a custom sort field and a custom comparator source. Though for numeric fields this custom sort field could be replaced with a standard SortedNumericSortField unless the field is nested especially since we removed the FieldData for numerics. We can also use a SortedSetSortField for string sort based on doc_values when the field is not nested. This change replaces IndexFieldData#comparatorSource with IndexFieldData#sortField that returns a Sorted{Set,Numeric}SortField when possible or a custom sort field when the field sort spec is not handled by the SortedSortFields.	2017-04-03 09:57:26 +02:00
Simon Willnauer	bdb1cabe71	Prevent nodes from joining if newer indices exist in the cluster (#23843 ) Today we prevent nodes from joining when indices exist that are too old. Yet the opposite can happen too. Since lucene / elasticsearch is not forward compatible when it comes to indices, we won't let nodes join the cluster once there are indices in the cluster state that are newer than the node's version. This prevents forward compatibility issues which we never test against. Yet, this will not prevent rolling restarts or anything like this since indices are always created with the minimum node version in the cluster such that an index can only get the version of the higher nodes once all nodes are upgraded to this version.	2017-04-03 09:52:09 +02:00
Simon Willnauer	998eeb7687	Synchronized CollapseTopFieldDocs with lucene's relatives (#23854 ) TopDocs et al. got additional parameters to incrementally reduce top docs. In order to add incremental reduction `CollapseTopFieldDocs` needs to have the same properties.	2017-04-03 09:50:44 +02:00
Jason Tedor	7082baaed9	Stricter parsing of remote node attribute This commit enables stricter parsing of the remote node attribute, instead of leniently parsing values that are not "true" as false.	2017-04-01 13:18:46 -04:00
Jason Tedor	38b3fec885	Fix cross-cluster remote node gateway attributes Remote nodes in cross-cluster search can be marked as eligible for acting as a gateway node via a remote node attribute setting. For example, if search.remote.node.attr is set to "gateway", only nodes that have node.attr.gateway set to "true" can be connected to for cross-cluster search. Unfortunately, there is a bug in the handling of these attributes due to the use of a dangerous method Boolean#getBoolean(String) which obtains the system property with specified name as a boolean. We are not looking at system properties here, but node settings. This commit fixes this situation, and adds a test. A follow-up will ban the use of Boolean#getBoolean. Relates #23863	2017-04-01 13:04:51 -04:00
Jim Ferenczi	ee68e75332	FieldCapabilitiesRequest should implement Replaceable since it accepts index patterns	2017-03-31 20:21:06 +02:00
Alexander Reelsen	f720767cbc	Cleanup: Remove unused FieldMappers class (#23851 ) This class is unused, so it can be removed.	2017-03-31 18:26:59 +02:00
Nik Everett	ba62229f47	Fix FieldCapabilities compilation in Eclipse (#23855 ) Eclipse can't deal with the generics, maybe the fixed but unreleased https://bugs.eclipse.org/bugs/show_bug.cgi?id=511750	2017-03-31 12:10:15 -04:00
Tanguy Leroux	28099162ab	Cluster stats should not render empty http/transport types (#23735 ) This commit changes the ClusterStatsNodes.NetworkTypes so that it does not print out empty field names when no Transport or HTTP type is defined: ``` { "network_types": { ... "http_types": { "": 2 } } } ``` is now rendered as: ``` { "network_types": { ... "http_types": { } } } ```	2017-03-31 17:13:27 +02:00
Simon Willnauer	135eae42b9	Cleanup SearchPhaseController interface (#23844 ) SearchPhaseController is tightly coupled to AtomicArray which makes non-dense representations of results very difficult. This commit removes the coupling and cuts over to Collection rather than List to ensure no order or random access lookup is implied.	2017-03-31 16:25:15 +02:00
Jim Ferenczi	a8250b26e7	Add FieldCapabilities (_field_caps) API (#23007 ) This change introduces a new API called `_field_caps` that allows retrieving the capabilities of specific fields. Example: ```` GET t,s,v,w/_field_caps?fields=field1,field2 ```` ... returns: ```` { "fields": { "field1": { "string": { "searchable": true, "aggregatable": true } }, "field2": { "keyword": { "searchable": false, "aggregatable": true, "non_searchable_indices": ["t"], "indices": ["t", "s"] }, "long": { "searchable": true, "aggregatable": false, "non_aggregatable_indices": ["v"], "indices": ["v", "w"] } } } } ```` In this example `field1` has the same type `text` across the requested indices `t`, `s`, `v`, `w`. Conversely `field2` is defined with two conflicting types `keyword` and `long`. Note that `_field_caps` does not treat this case as an error but rather returns the list of unique types seen for this field.	2017-03-31 15:34:46 +02:00
Colin Goodheart-Smithe	9f66b8cd38	Improves disabled fielddata error message (#23841 ) Closes #22768	2017-03-31 10:01:07 +01:00
Simon Willnauer	5badf68bd9	Add infrastructure to mark contexts as system contexts (#23830 ) Today we have no way to mark an execution as internal. This commit adds a simple thread context header that allows executing code in a system context. This allows intercepting code to make better decisions down the road when it gets to authentication.	2017-03-31 10:47:10 +02:00
Tim Brooks	5fa80a6521	Pass exception from sendMessage to listener (#23559 ) This commit changes the listener passed to sendMessage from a Runnable to an ActionListener. This change also removes IOException from the sendMessage signature. That signature is misleading as it allows implementers to assume an exception will be thrown in case of failure. That does not happen due to Netty's async nature.	2017-03-30 15:08:23 -05:00
Jason Tedor	48357e43d3	Honor update request timeout When executing an update request, the request timeout is not transferred to the index/delete request executed on behalf of the update request. This leads to update requests not timing out when they should (e.g., if not all shards are available when the request specifies wait_for_shards=all with a small timeout). This commit causes the index/delete requests to honor the update request timeout. Relates #23825	2017-03-30 14:38:34 -04:00
Christoph Büscher	b92371a4dc	Tests: Add base tests for InternalSimpleValue and InternalDerivative (#23799 ) As an addition to #22278 we should probably also have base tests for InternalSimpleValue and InternalDerivative.	2017-03-30 20:23:49 +02:00
Simon Willnauer	4125f012b9	Streamline shard index availability in all SearchPhaseResults (#23788 ) Today we have the shard target and the target request ID available in SearchPhaseResults. Yet, the coordinating node maintains a shard index to reference the request, response tuples internally which is also used in many other classes to reference back from fetch results to query results etc. Today this shard index is implicitly passed via the index in AtomicArray which causes an undesirable dependency on this interface. This commit moves the shard index into the SearchPhaseResult and removes some dependencies on AtomicArray. Further removals will follow in the future. The most important refactoring here is the removal of AtomicArray.Entry which used to be created for every element in the atomic array to maintain the shard index during result processing. This caused an unnecessary indirection, dependency and potentially thousands of unnecessary objects in every search phase.	2017-03-30 14:32:42 +02:00
Jim Ferenczi	3b559e01be	Fixed sliced search tests that rely on BytesRef.hashCode output	2017-03-30 10:14:41 +02:00
David Causse	a49e1c0062	Use a fixed seed for computing term hashCode in TermsSliceQuery (#23795 ) I think this query should not use the hashCode provided by BytesRef#hashCode(). It uses StringHelper#GOOD_FAST_HASH_SEED which is initialized in a static block to System.currentTimeMillis(). Running this query on different replicas may return inconsistent results. Using a fixed seed should guarantee that the docs are sliced consistently across replicas. Fixes #23096	2017-03-30 10:10:32 +02:00
Lee Hinman	c8081bde91	Further refactor and extend testing for `TransportShardBulkAction` This moves `updateReplicaRequest` to `createPrimaryResponse` and separates the translog updating to be a separate function so that the function purpose is more easily understood (and testable). It also separates the logic for `MappingUpdatePerformer` into two functions, `updateMappingsIfNeeded` and `verifyMappings` so they don't do too much in a single function. This allows finer-grained error testing for when a mapping fails to parse or be applied. Finally, it separates parsing and version validation for `executeIndexRequestOnReplica` into a separate method (`prepareIndexOperationOnReplica`) and adds a test for it. Relates to #23359	2017-03-29 10:56:51 -06:00
Jason Tedor	72824609df	Add lower bound for translog generation threshold The translog already occupies 43 bytes on disk when empty. If the translog generation threshold is below this, the flush thread can get stuck in an infinite loop repeatedly rolling the generation. This commit adds a lower bound on the translog generation to avoid this problem; however, we keep the lower bound small for convenience in testing. Relates #23779	2017-03-28 14:11:50 -04:00
Ali Beyad	2d3c2a4800	Adds backwards compatibility index and repository for v5.3.0	2017-03-28 14:02:07 -04:00
Ali Beyad	c675d92a56	Adds v5.3.1 to the version constants	2017-03-28 13:04:28 -04:00
Ali Beyad	2120086d82	Adds pattern keyword marker filter support (#23600 ) This commit adds support for the pattern keyword marker filter in Lucene. Previously, the keyword marker filter in Elasticsearch supported specifying a keywords set or a path to a set of keywords. This commit exposes the regular expression pattern based keyword marker filter also available in Lucene, so that any token matching the pattern specified by the `keywords_pattern` setting is excluded from being stemmed by any stemming filters. Closes #4877	2017-03-28 11:13:34 -04:00
Dimitris Athanasiou	34f116eae3	Require explicit query in _delete_by_query API (#23632 ) As the query of a search request defaults to match_all, calling _delete_by_query without an explicit query may result in deleting all data. In order to protect users against falling into that pitfall, this commit adds a check to require the explicit setting of a query. Closes #23629	2017-03-28 15:44:57 +01:00
Ali Beyad	8359dd05c9	Adds boolean similarity to Elasticsearch (#23637 ) This commit adds the boolean similarity scoring from Lucene to Elasticsearch. The boolean similarity provides a means to specify that a field should not be scored with typical full-text ranking algorithms, but rather just whether the query terms match the document or not. Boolean similarity scores a query term equal to its query boost only. Boolean similarity is available as a default similarity option and thus a field can be specified to have boolean similarity by declaring in its mapping: "similarity": "boolean" Closes #6731	2017-03-28 10:17:23 -04:00
Stuart Neivandt	3caf887632	Improve error handling for epoch format parser with time zone (#23689 ) Change the error response when using a non-UTC time zone for range queries with epoch_millis or epoch_second formats to an illegal argument exception. The goal is to provide a better explanation of why the query has failed. The current behavior is to respond with a parse exception. Closes #22621	2017-03-28 14:45:20 +02:00
Jason Tedor	742d929b56	Validate top-level keys when parsing mget requests Today, when parsing mget requests, we silently ignore keys in the top level that do not match "docs" or "ids". This commit addresses this situation by throwing an exception if any other key occurs here, and providing the names of valid keys. Relates #23746	2017-03-28 08:27:31 -04:00
Jason Tedor	4f2dfb6819	Fix serialization for plugin info This commit fixes the serialization for plugin info. Namely, the serialization incorrectly specified the backwards compatibility version as strictly after version 5.4.0, whereas it should be on or after version 5.4.0.	2017-03-27 21:04:31 -04:00
Jason Tedor	b54a9e9c83	Introduce translog generation rolling This commit introduces a maximum size for a translog generation and automatically rolls the translog when a generation exceeds the threshold into a new generation. This threshold is configurable per index and defaults to sixty-four megabytes. We introduce this constraint as sequence numbers will require keeping around more than the current generation (to ensure that we can rollback to the global checkpoint). Without keeping the size of generations under control, having to keep old generations around could consume excessive disk space. A follow-up will enable commits to trim previous generations based on the global checkpoint. Relates #23606	2017-03-27 16:43:54 -04:00
Jason Tedor	defd0452e7	Modify permissions dialog for plugins This commit modifies the handling of plugins that require special permissions to cover a case that was not previously covered. Relates #23742	2017-03-27 15:52:45 -04:00
Christoph Büscher	fc8cb417e7	FuzzyQueryBuilder should error when parsing array of values (#23762 ) Closes #23759	2017-03-27 17:02:01 +02:00
Marios Trivyzas	4f694a3312	Remove obsolete index setting `index.version.minimum_compatible`. (#23593 )	2017-03-27 15:59:48 +02:00
Jim Ferenczi	0e95c90e9f	Upgrade to Lucene 6.5.0 (#23750 )	2017-03-27 15:57:54 +02:00
Jason Tedor	a6c4234575	Add early-access check The OpenJDK project provides early-access builds of upcoming releases. These early-access builds are not suitable for production. These builds sometimes end up on systems due to aggressive packaging (e.g., Ubuntu). This commit adds a bootstrap check to ensure these early-access builds are not being used in production. Relates #23743	2017-03-24 14:52:50 -04:00
Christoph Büscher	396785ccb1	Tests: Lower expected precision for InternalAvgTests Closes #23723	2017-03-24 12:18:07 +01:00
Christoph Büscher	e7a8e69900	Test: Check that parsing SearchHit without _type/_id works (#23715 ) The hit object can be very small e.g. when using "stored_fields": ["_none_"], this adds a test that checks that we can still parse back the object. * also check type/id null	2017-03-24 10:14:52 +01:00

7863 Commits