OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	0efba0675e	[CCR] Add qa test library (#34611) * Introduced test qa lib that all CCR qa modules depend on to avoid test code duplication.	2018-10-23 23:24:32 +02:00
Martijn van Groningen	ed817fb265	[CCR] Move leader_index and leader_cluster parameters from resume follow to put follow api (#34638) As part of this change the leader index name and leader cluster name are stored in the CCR metadata in the follow index. The resume follow api will read that when a resume follow request is executed.	2018-10-23 19:37:45 +02:00
Martijn van Groningen	36baf3823d	[CCR] Auto follow pattern APIs adjustments (#34518) * Changed the resource id of auto follow patterns to be a user defined name instead of being the leader cluster alias name. * Fail when an unfollowed leader index matches with two or more auto follow patterns.	2018-10-23 15:48:51 +02:00
Jason Tedor	7af19b8f81	Migrate wait for pending tasks helper to server (#34675) In some of our X-Pack REST tests we have to wait for pending tasks to complete. We now need this functionality in ESRestTestCase for the docs tests where we run against X-Pack features. This commit moves the helper method that we have in X-Pack to ESRestTestCase, and removes duplicate logic from waiting for rollup tasks to complete.	2018-10-22 11:14:02 -04:00
Martijn van Groningen	56d4f69718	Renamed remaining leader_cluster_alias / cluster_alias to leader_cluster	2018-10-19 07:59:56 +02:00
Martijn van Groningen	44b461aff2	[CCR] Make leader cluster a required argument. (#34580) This change makes it no longer possible to follow / auto follow without specifying a leader cluster. If a local index needs to be followed then `cluster.remote.*.seeds` should point to nodes in the local cluster. Closes #34258	2018-10-19 07:41:46 +02:00
Martijn van Groningen	0d62f6102c	[CCR] Split cluster alias from leader index field into its own field in follow APIs (#34366)	2018-10-18 12:11:48 +02:00
Jason Tedor	3e067123a1	Remove dead methods from ChainIT This commit removes some unused methods from ChainIT.	2018-10-16 10:45:33 -04:00
Jason Tedor	e0b6721df4	Add dedicated test for chain replication (#34497) This commit adds a dedicated test that chain replication leader -> middle -> follow is successful.	2018-10-16 06:21:28 -04:00
Martijn van Groningen	899e48395b	[CCR] Change unfollow API's privilege scheme. (#34175) Unfollow should be allowed / disallowed on a per index level instead of cluster level. Also renamed `create_follow_index` index privilege to `manage_follow_index` privilege and include unfollow and close APIs.	2018-10-06 07:38:28 -04:00
Jason Tedor	7d57bdb3a0	Follow stats structure (#34301 ) This commit modifies the follow stats API response structure to more clearly highlight meaning of the higher level fields. In particular, previously the response had a top-level key for each index. Instead, we nest the indices under an "indices" field which is now an array. The values in this array are objects containing two fields: "index" which is the name of the follower index, and "shards" which is an array where each value in the array is the follower stats for that shard. That is, we have gone from: { "bar": [ { "shard_id": 0... }... ]... } to { "indices": [ { "index": "bar", "shards": [ { "shard_id": 0... }... ] }... }	2018-10-05 06:38:20 -04:00
Martijn van Groningen	b1a27b2e6b	[CCR] Add unfollow API (#34132) The unfollow API changes a follower index into a regular index, so that it will accept write requests from clients. For the unfollow api to work the index follow needs to be stopped and the index needs to be closed. Closes #33931	2018-09-30 19:19:34 +02:00
Martijn van Groningen	a984f8afb3	[CCR] Validate index privileges prior to following an index (#33758 ) Prior to following an index in the follow API, check whether current user has sufficient privileges in the leader cluster to read and monitor the leader index. Also check this in the create and follow API prior to creating the follow index. Also introduced READ_CCR cluster privilege that include the minimal cluster level actions that are required for ccr in the leader cluster. So a user can follow indices in a cluster, but not use the ccr admin APIs. Closes #33553 Co-authored-by: Jason Tedor <jason@tedor.me>	2018-09-28 17:51:23 +02:00
Martijn van Groningen	9129948f60	Rename CCR APIs (#34027) * Renamed CCR APIs Renamed: * `/{index}/_ccr/create_and_follow` to `/{index}/_ccr/follow` * `/{index}/_ccr/unfollow` to `/{index}/_ccr/pause_follow` * `/{index}/_ccr/follow` to `/{index}/_ccr/resume_follow` Relates to #33931	2018-09-28 08:02:20 +02:00
Martijn van Groningen	793b2a94b4	[CCR] Expose auto follow stats to monitoring (#33886)	2018-09-25 07:19:46 +02:00
Martijn van Groningen	2795ef561f	[CCR] Add get auto follow pattern api (#33849) Relates to #33007	2018-09-24 20:26:13 +02:00
Martijn van Groningen	44c7c4b166	[CCR] Add auto follow stats api (#33801 ) GET /_ccr/auto_follow/stats Returns: ``` { "number_of_successful_follow_indices": ... "number_of_failed_follow_indices": ... "number_of_failed_remote_cluster_state_requests": ... "recent_auto_follow_errors": [ ... ] } ``` Relates to #33007	2018-09-20 07:16:20 +02:00
Martijn van Groningen	013b64a07c	[CCR] Change FollowIndexAction.Request class to be more user friendly (#33810) Instead of having one constructor that accepts all arguments, all parameters should be provided via setters. Only leader and follower index are required arguments. This makes using this class in tests and transport client easier.	2018-09-19 07:18:24 +02:00
Martijn van Groningen	805a12361f	[CCR] Fail with a descriptive error if leader index does not exist (#33797) Closes #33737	2018-09-18 21:47:02 +02:00
Martijn van Groningen	9fe5a273aa	[TEST] handle failed search requests differently	2018-09-18 15:55:27 +02:00
Martijn van Groningen	47b86d6e6a	[CCR] Changed AutoFollowCoordinator to keep track of certain statistics (#33684) The following stats are being kept track of: 1) The total number of times that auto following a leader index succeeded. 2) The total number of times that auto following a leader index failed. 3) The total number of times that fetching a remote cluster state failed. 4) The most recent 256 auto follow failures per auto leader index (e.g. create_and_follow api call fails) or cluster alias (e.g. fetching remote cluster state fails). Each auto follow run now produces a result that is being used to update the stats being kept track of in AutoFollowCoordinator. Relates to #33007	2018-09-18 09:43:50 +02:00
Jason Tedor	2d81fc3873	Keep CCR REST API specification with all of X-Pack (#33743) This commit moves the CCR REST API specification out of the CCR sub-project to locate them with the rest of the REST API specifications for X-Pack.	2018-09-17 09:59:22 -04:00
Martijn van Groningen	481f8a9a07	[CCR] Make auto follow patterns work with security (#33501) Relates to #33007	2018-09-17 07:29:00 +02:00
Jason Tedor	069605bd91	Do not count shard changes tasks against REST tests (#33738) When executing CCR REST tests it is going to be expected after global checkpoint polling goes in that shard changes tasks can still be pending at the end of the test. One way to deal with this is to set a low timeout on these polls, but then that means we are not executing our REST tests with our default production settings and instead would be using an unrealistically low timeout. Alternatively, since we expect these tasks to be there, we can not count them against the test. That is what this commit does.	2018-09-16 07:32:12 -04:00
Jason Tedor	73417bf09a	Move CCR REST tests to a sub-project of ccr This commit moves these REST tests (possibly temporarily) to a sub-project of ccr. We do this (again, possibly temporarily) to keep them within the ccr sub-project yet there are changes within 6.x that prevent these from being in the top-level project (the cluster formation tasks are trying to install x-pack-ccr into the integ-test-zip). Therefore, we isolate these for now until we can understand why there are differences between 6.x and master.	2018-09-15 10:18:59 -04:00
Martijn van Groningen	4bcad95fe7	[TEST] wait for no initializing shards	2018-09-14 09:59:24 +02:00
Jason Tedor	eb715d5290	Add follower index to CCR monitoring and status (#33645) This commit adds the follower index to CCR shard follow task status, and to monitoring.	2018-09-12 17:35:06 -04:00
Martijn van Groningen	5fa81310cc	[CCR] Added history uuid validation (#33546) For correctness we need to verify whether the history uuid of the leader index shards never changes while that index is being followed. * The history UUIDs are recorded as custom index metadata in the follow index. * The follow api validates whether the current history UUIDs of the leader index shards are the same as the recorded history UUIDs. If not the follow api fails. * While a follow index is following a leader index, shard follow tasks on each shard changes api call verify whether their current history uuid is the same as the recorded history uuid. Relates to #30086 Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>	2018-09-12 19:42:00 +02:00
Martijn van Groningen	901d8035d9	[CCR] Update es monitoring mapping and change qa tests to query based on leader index (#33635) Co-authored-by: Jason Tedor <jason@tedor.me>	2018-09-12 19:36:17 +02:00
Jason Tedor	23f12e42c1	Expose CCR stats to monitoring (#33617) This commit exposes the CCR stats endpoint to monitoring collection. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2018-09-12 09:13:07 -04:00
Martijn van Groningen	74d41857c6	mute test on windows Relates #33570	2018-09-10 16:49:17 +02:00
Martijn van Groningen	c4adcee3ea	[CCR] Add create_follow_index privilege (#33559) This is a new index privilege that the user needs to have in the follow cluster. This privilege is required in addition to the `manage_ccr` cluster privilege in order to execute the create and follow api. Closes #33555	2018-09-10 13:08:20 +02:00
Jason Tedor	d1b99877fa	Remove underscore from auto-follow API (#33550) This commit removes the leading underscore from _auto_follow in the auto-follow API endpoints.	2018-09-09 14:42:49 -04:00
Jason Tedor	c67b0ba33e	Create temporary directory if needed in CCR test In the multi-cluster-with-non-compliant-license tests, we try to write out a java.policy to a temporary directory. However, if this temporary directory does not already exist then writing the java.policy file will fail. This commit ensures that the temporary directory exists before we attempt to write the java.policy file.	2018-09-09 07:16:56 -04:00
Jason Tedor	5a38c930fc	Add license checks for auto-follow implementation (#33496) This commit adds license checks for the auto-follow implementation. We check the license on put auto-follow patterns, and then for every coordination round we check that the local and remote clusters are licensed for CCR. In the case of non-compliance, we skip coordination yet continue to schedule follow-ups.	2018-09-09 07:06:55 -04:00
Martijn van Groningen	a721d09c81	[CCR] Added auto follow patterns feature (#33118) Auto Following Patterns is a cross cluster replication feature that keeps track of whether indices are being created in the leader cluster with names that match a specific pattern and, if so, automatically lets the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the cluster state of remote clusters for new leader indices that match configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to #33007 Co-authored-by: Jason Tedor <jason@tedor.me>	2018-09-06 08:01:58 +02:00
Jason Tedor	d71ced1b00	Generalize search.remote settings to cluster.remote (#33413) With features like CCR building on the CCS infrastructure, the settings prefix search.remote makes less sense as the namespace for these remote cluster settings than does a more general namespace like cluster.remote. This commit replaces these settings with cluster.remote with a fallback to the deprecated settings search.remote.	2018-09-05 20:43:44 -04:00
Alpar Torok	7f7e8fd733	Disable assemble task instead of removing it (#33348)	2018-09-04 07:32:14 +03:00
Jason Tedor	7fa8a728c4	Make CCR QA tests build again (#33113) Welp, I broke this. I merged a change to auto-discover the CCR QA tests by making :x-pack:plugin:ccr:check auto-discover the check tasks in the qa sub-project. Yet, the check tasks for these sub-projects did not depend on the necessary test tasks (as we were previously doing this directly from the ccr build file). This commit fixes this!	2018-08-24 09:48:54 -04:00
Martijn van Groningen	575f33941c	Required changes after merging in master branch.	2018-08-24 12:51:26 +07:00
Jason Tedor	b08d02e3b7	Implement CCR licensing (#33002) This commit implements licensing for CCR. CCR will require a platinum license, and administrative endpoints will be disabled when a license is non-compliant.	2018-08-20 23:33:18 -04:00
Jason Tedor	2387616c80	Remove _xpack from CCR APIs (#32563) For a new feature like CCR we will go without this extra layer of indirection. This commit replaces all /_xpack/ccr/_(\S+) endpoints by /_ccr/$1 endpoints.	2018-08-02 20:21:43 -04:00
Nhat Nguyen	cd8b80da58	Use shadow plugin in ccr/qa	2018-07-25 00:16:33 -04:00
Martijn van Groningen	815faf34fc	[CCR] Move api parameters from url to request body. (#31949) Relates to #30102	2018-07-11 10:16:43 +02:00
Martijn van Groningen	8e1ef0cff9	Rewrite shard follow node task logic (#31581) The current shard follow mechanism is complex and does not give us easy ways to have visibility into the system (e.g. why we are falling behind). The main reason why it is complex is because the current design is highly asynchronous. Also in the current model it is hard to apply backpressure other than reducing the concurrent reads from the leader shard. This PR has the following changes: * Rewrote the shard follow task to coordinate the shard follow mechanism between a leader and follow shard in a single threaded manner. This allows for better unit testing and makes it easier to add stats. * All write operations read from the shard changes api should be added to a buffer instead of directly sending it to the bulk shard operations api. This allows applying backpressure. In this PR there is a limit that controls how many write ops are allowed in the buffer after which no new reads will be performed until the number of ops is below that limit. * The shard changes api includes the current global checkpoint on the leader shard copy. This allows reading to be a more self sufficient process; instead of relying on a background thread to fetch the leader shard's global checkpoint. * Reading write operations from the leader shard (via shard changes api) is a separate step than writing the write operations (via bulk shards operations api). Whereas before a read would immediately result in a write. * The bulk shard operations api returns the local checkpoint on the follow primary shard, to keep the shard follow task up to date with what has been written. * Moved the shard follow logic that was previously in ShardFollowTasksExecutor to ShardFollowNodeTask. * Moved over the changes from #31242 to make shard follow mechanism resilient to node and shard failures. Relates to #30086	2018-07-10 16:00:55 +02:00
Simon Willnauer	5c6711b8a4	Use a `_recovery_source` if source is omitted or modified (#31106) Today if a user omits the `_source` entirely or modifies the source on indexing we have no chance to re-create the document after it has been added. This is an issue for CCR and recovery based on soft deletes which we are going to make the default. This change adds an additional recovery source if the source is disabled or modified that is only kept around until the document leaves the retention policy window. This change adds a merge policy that efficiently removes this extra source on merge for all documents that are live and not in the retention policy window anymore.	2018-06-07 07:39:28 +02:00
Jason Tedor	d230548401	Remove use of deprecated methods to perform request (#31117) The old perform request methods on the REST client have been deprecated in favor of using request-flavored methods. This commit addresses the use of these deprecated methods in the CCR test suite.	2018-06-06 05:09:55 -04:00
Martijn van Groningen	7e8cf768cf	changed persistent task name to be of similar structure as the others	2018-05-31 15:16:13 +02:00
Martijn van Groningen	e477147143	[CCR] Add create and follow api (#30602) Also renamed FollowExisting* internal names to just Follow* and fixed tests	2018-05-26 15:05:40 +02:00
Martijn van Groningen	596ec1848e	[CCR] Add validation checks that were left out of #30120 (#30463)	2018-05-16 09:46:03 +02:00
Martijn van Groningen	23204e3d09	[CCR] Fixed follow and unfollow api url path according to design. The TODOs in the rest actions were incorrect. The problem was that these rest actions used `follow_index` as first named variable in the path under which the rest actions were registered. Other candidate rest actions that also have a named variable as first element in the path (but with a different name) get resolved as rest parameters too and passed down to the rest action that actually ends up getting executed. In the case of the follow index api, an `index` parameter got passed down to `RestFollowExistingAction`, but that param was never used. This caused the follow index api call to fail, because of unused http parameters. This change doesn't fix that problem, but works around it by using `index` as named variable for the follow index (instead of `follow_index`). Relates to #30102	2018-05-16 09:07:50 +02:00
Martijn van Groningen	64b97313d5	[CCR] Make cross cluster replication work with security (#30239) If security is enabled today with ccr then the follow index api will fail because the system user does not have privileges to use the shard changes api. The reason that the system user is used is because the persistent tasks that keep the shards in sync run in the background and the user that invokes the follow index api only starts those background processes. I think it is better that the system user isn't used by the persistent tasks that keep shards in sync, but that they rather run as the same user that invoked the follow index api and use the permissions that that user has. This is what this PR does, and this is done by keeping track of security headers inside the persistent task (similar to how rollup does this). This PR also adds a cluster ccr privilege that allows a user to follow or unfollow an index. Finally, a user that wants to follow an index needs to have read and monitor privileges on the leader index and monitor and write privileges on the follow index.	2018-05-16 07:48:32 +02:00
Martijn van Groningen	bb6586dc5f	[CCR] Read changes from Lucene instead of translog (#30120) This commit adds an API to read translog snapshot from Lucene, then cut-over from the existing translog to the new API in CCR. Relates #30086 Relates #29530	2018-05-09 17:35:27 -04:00
Martijn van Groningen	5a67a0f78f	Applying changes required for ccr after moving ccr code to elasticsearch	2018-04-25 08:03:29 +02:00
Martijn van Groningen	56ca59a513	Add the ability to the follow index to follow an index in a remote cluster. The follow index api completely reuses CCS infrastructure that was exposed via: https://github.com/elastic/elasticsearch/pull/29495 This means that the leader index parameter supports the same ccs index to indicate that an index resides in a different cluster. I also added a qa module that smoke tests the cross cluster nature of ccr. The idea is that this test just verifies that ccr can read data from a remote leader index and that is it, no crazy randomization or indirectly testing other features.	2018-04-17 07:36:40 +02:00


105 Commits