OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Simon Willnauer	ec1afe30ea	Ensure remote cluster alias is preserved in inner hits aggs (#25627 ) We lost the cluster alias due to some special caseing in inner hits and due to the fact that we didn't pass on the alias to the shard request. This change ensures that we have the cluster alias present on the shard to ensure all SearchShardTarget reads preserve the alias. Relates to #25606	2017-07-11 11:34:06 +02:00
Jay Modi	b2901f536e	Do not search locally if remote index pattern resolves to no indices (#25436 ) This commit changes how we determine if there were any remote indices that a search should have been executed against. Previously, we used the list of remote shard iterators but if the remote index pattern resolved to no indices there would be no remote shard iterators even though the request specified remote indices. The map of remote cluster names to the original indices is used instead so that we can determine if there were remote indices even when there are no remote shard iterators. Closes #25426	2017-06-28 12:41:37 -06:00
Simon Willnauer	bc7ec68e76	Add Cross Cluster Search support for scroll searches (#25094 ) To complete the cross cluster search capabilities for all search types and function this change adds cross cluster search support for scroll searches.	2017-06-13 17:22:49 +02:00
Simon Willnauer	cf846af0e5	Fix `_field_caps` serialization in order to support cross cluster search (#24722 ) Today the `_field_caps` API doesn't implement its request serialization correctly since indices and indices options are not serialized at all. This will likely break with all transport clients etc. and if this request must be send across the network. This commit fixes this and adds correct handling if we have only remote indices to prevent the inclusion of all local indices.	2017-05-17 14:02:45 +02:00
Ryan Ernst	2a65bed243	Tests: Change rest test extension from .yaml to .yml (#24659 ) This commit renames all rest test files to use the .yml extension instead of .yaml. This way the extension used within all of elasticsearch for yaml is consistent.	2017-05-16 17:24:35 -07:00
Simon Willnauer	8356df0846	[TEST] Add a test that alias requests are dense for all indices	2017-05-04 14:29:59 +02:00
Simon Willnauer	07f106d39c	[TEST] Rollback temporarily disabled field_caps test (#24483 )	2017-05-04 14:14:22 +02:00
Simon Willnauer	14e57bf9f8	Add cross cluster support to `_field_caps` (#24463 ) To support kibana this commit adds an internal optimization to support the cross cluster syntax for indices on the `_field_caps` API. Closes #24334	2017-05-04 11:44:54 +02:00
Ryan Ernst	212f24aa27	Tests: Clean up rest test file handling (#21392 ) This change simplifies how the rest test runner finds test files and removes all leniency. Previously multiple prefixes and suffixes would be tried, and tests could exist inside or outside of the classpath, although outside of the classpath never quite worked. Now only classpath tests are supported, and only one resource prefix is supported, `/rest-api-spec/tests`. closes #20240	2017-04-18 15:07:08 -07:00
Ryan Ernst	a8017ff020	Tests: Move cluster dependencies from runner to cluster (#24142 ) After splitting integ tests into cluster configuration and the test runner task, we still have dependencies of the test runner added as deps of the cluster. This commit adds dependencies directly to the cluster, so that the runner can have other dependencies independent of what is needed for the cluster.	2017-04-17 16:02:46 -07:00
Tim Brooks	cf6b03c8f4	Wildcard cluster names for cross cluster search (#23985 ) This is related to #23893. This commit allows users to use wilcards for cluster names when executing a cross cluster search. So instead of defining every cluster such as: GET one:,two:,three:/_search A user could just search: GET :*/_search As ":" characters are currently allowed in index names, if the text up to the first ":" does not match a defined cluster name, the entire string is treated as an index name.	2017-04-11 13:56:26 -05:00
Simon Willnauer	e30a275bfe	Add a dedicated TransportRemoteInfoAction for consistency (#24040 ) All our actions that are invoked from rest actions have corresponding transport actions. This adds the transport action for RestRemoteClusterInfoAction for consistency. Relates to #23969	2017-04-11 14:40:37 +02:00
Simon Willnauer	f22e0dc30b	Add cross-cluster search remote cluster info API (#23969 ) This commit adds an API to discover information like seed nodes, http addresses and connection status of a configured remote cluster. Closes #23925	2017-04-11 09:24:40 +02:00
Luca Cavanna	cc65a94fd4	[TEST] improve yaml test sections parsing (#23407 ) Throw error when skip or do sections are malformed, such as they don't start with the proper token (START_OBJECT). That signals bad indentation, which would be ignored otherwise. Thanks (or due to) our pull parsing code, we were still able to properly parse the sections, yet other runners weren't able to. Closes #21980 * [TEST] fix indentation in matrix_stats yaml tests * [TEST] fix indentation in painless yaml test * [TEST] fix indentation in analysis yaml tests * [TEST] fix indentation in generated docs yaml tests * [TEST] fix indentation in multi_cluster_search yaml tests	2017-03-02 12:43:20 +01:00
Ryan Ernst	175bda64a0	Build: Rework integ test setup and shutdown to ensure stop runs when desired (#23304 ) Gradle's finalizedBy on tasks only ensures one task runs after another, but not immediately after. This is problematic for our integration tests since it allows multiple project's integ test clusters to be simultaneously. While this has not been a problem thus far (gradle 2.13 happened to keep the finalizedBy tasks close enough that no clusters were running in parallel), with gradle 3.3 the task graph generation has changed, and numerous clusters may be running simultaneously, causing memory pressure, and thus generally slower tests, or even failure if the system has a limited amount of memory (eg in a vagrant host). This commit reworks how integ tests are configured. It adds an `integTestCluster` extension to gradle which is equivalent to the current `integTest.cluster` and moves the rest test runner task to `integTestRunner`. The `integTest` task is then just a dummy task, which depends on the cluster runner task, as well as the cluster stop task. This means running `integTest` in one project will both run the rest tests, and shut down the cluster, before running `integTest` in another project.	2017-02-22 12:43:15 -08:00
Simon Willnauer	dc659feeb4	Add a setting to disable remote cluster connections on a node (#23005 ) Today either all nodes in the cluster connect to remote clusters of only nodes that have remote clusters configured in their node config. To allow global remote cluster configuration but restrict connections to a set of nodes in the cluster this change adds a new setting `search.remote.connect` (defaults to `true`) to allow to disable remote cluster connections on a per node basis.	2017-02-07 09:59:24 +01:00
Simon Willnauer	4c61f1d75d	Cut over to use affix setting for remote cluster configuration Instead of `search.remote.seeds.${clustername}` we now specify the seeds as: `search.remote.${clustername}.seeds` which is a real list setting compared to an unvalidated group setting before.	2017-01-11 12:38:46 +01:00
Simon Willnauer	349ea0f9b6	cut over to use : instead of \| for cross cluster search	2017-01-05 17:03:12 +01:00
Simon Willnauer	e642965804	Cleanup lots of code, add javadocs and tests	2017-01-04 17:26:00 +01:00
Simon Willnauer	422cd1ef77	Add support for proxy nodes this commit adds full support for proxy nodes on the search layer. This allows to connection only to a small set of nodes on a remote cluster to exectue the search. The nodes will proxy the request to the correct node in the cluster while the coordinting node doesn't need to be connected to the target node.	2017-01-03 17:24:32 +01:00
javanna	29d7c0d50d	[TEST] check _shards.total to make sure that cross cluster search hits the right number of shards	2016-12-02 01:38:19 +01:00
javanna	7d4b1d94b9	[TEST] test also aggregations and fix indendation	2016-12-02 01:34:11 +01:00
javanna	f9eaee3c9c	[TEST] rename local index so that the name is the same as the remote one This way we make sure that index names are disambiguated.	2016-12-02 01:02:37 +01:00
javanna	bd816f02ee	[TEST] Remove flush calls that are not needed	2016-12-02 01:00:20 +01:00
Simon Willnauer	32eeaef6cf	Add WIP support for alias filters and additional tests	2016-11-25 00:17:50 +01:00
Simon Willnauer	89a2384988	Use GString and closures to delay evaluating remote cluster URL until runtime	2016-11-24 15:44:37 +01:00
Simon Willnauer	ec86771f6e	Add a dedicated integ test project for multi-cluster-search This commit adds `qa/multi-cluster-search` which currently does a simple search across 2 clusters. This commit also adds support for IPv6 addresses and fixes an issue where all shards of the local cluster are searched when only a remote index was given.	2016-11-24 14:17:53 +01:00

28 Commits