OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	11935cd480	Replace Streamable w/ Writeable in BaseTasksResponse and subclasses (#36176 ) This commit replaces usages of Streamable with Writeable for the BaseTasksResponse / TransportTasksAction classes and subclasses of these classes. Note that where possible response fields were made final. Relates to #34389	2018-12-05 13:14:10 +01:00
Martijn van Groningen	b9707c29a1	[CCR] Change get autofollow patterns API response format (#36203 ) The current response format is: ``` { "pattern1": { ... }, "pattern2": { ... } } ``` The new format is: ``` { "patterns": [ { "name": "pattern1", "pattern": { ... } }, { "name": "pattern2", "pattern": { ... } } ] } ``` This format is more structured and more friendly for parsing and generating specs. This is a breaking change, but it is better to do this now while ccr is still a beta feature than later. Follow up from #36049	2018-12-05 08:41:27 +01:00
Tim Brooks	8bde608979	Register CcrRepository based on settings update (#36086 ) This commit adds an empty CcrRepository snapshot/restore repository. When a new cluster is registered in the remote cluster settings, a new CcrRepository is registered for that cluster. This is implemented using a new concept of "internal repositories". RepositoryPlugin now allows implementations to return factories for "internal repositories". The "internal repositories" are different from normal repositories in that they cannot be registered through the external repository api. Additionally, "internal repositories" are local to a node and are not stored in the cluster state. The repository will be unregistered if the remote cluster is removed.	2018-12-04 14:36:50 -07:00
Adrien Grand	0df08dd458	Set Lucene version upon index creation. (#36038 ) It is important that all shards of a given index have the same `indexCreatedVersionMajor` to Lucene, or eg. merging those shards is going to be considered illegal. At the moment, we use the latest Lucene version when creating a shard, which could cause shards to have different created versions eg. in case of forced allocation. This commit makes sure to reuse the appropriate Lucene version in order to avoid such issues. Closes #33826	2018-12-04 17:53:20 +01:00
Martijn van Groningen	6e1ff31222	[CCR] AutoFollowCoordinator should tolerate that auto follow patterns may be removed (#35945 ) AutoFollowCoordinator should take into account that after auto following an index and while updating that a leader index has been followed, that the auto follow pattern may have been removed via delete auto follow patterns api. Also fixed a bug that when a remote cluster connection has been removed, the auto follow coordinator does not die when it tries get a remote client for that cluster. Closes #35480	2018-12-04 15:55:15 +01:00
Martijn van Groningen	43773a32a4	Replace Streamable w/ Writeable in BaseTasksRequest and subclasses (#35854 ) * Replace Streamable w/ Writeable in BaseTasksRequest and subclasses This commit replaces usages of Streamable with Writeable for the BaseTasksRequest / TransportTasksAction classes and subclasses of these classes. Relates to #34389	2018-12-03 08:04:29 +01:00
Martijn van Groningen	32f7fbd9f0	[TEST] Set 'index.unassigned.node_left.delayed_timeout' to 0 in ccr tests Some tests kill nodes and otherwise it would take 60s by default for replicas to get allocated and that is longer than we wait for getting in a green state in tests. Relates to #35403	2018-11-30 11:03:36 +01:00
Martijn van Groningen	1390f366d4	[CCR] Only auto follow indices when all primary shards have started (#35814 ) This change adds an extra check that verifies that all primary shards have been started of an index that is about to be auto followed. If not all primary shards have been started for an index then the next auto follow run will try to follow to auto follow this index again. Closes #35480	2018-11-29 09:46:09 +01:00
Jason Tedor	a3186e4a32	Deprecate X-Pack centric license endpoints (#35959 ) This commit is part of our plan to deprecate and ultimately remove the use of _xpack in the REST APIs.	2018-11-28 08:24:35 -05:00
Jason Tedor	2887680acb	Avoid NPE in follower stats when no tasks metadata (#35802 ) When there is no persistent tasks metadata we could hit a null pointer exception when executing a follower stats request. This is because we inspect the persistent tasks metadata. Yet, if no tasks have been registered, this is null (as opposed to empty). We need to avoid de-referencing the persistent tasks metadata in this case. That is what this commit does, and we add a test for this situation.	2018-11-21 19:16:28 -05:00
Tim Brooks	a989b675b5	Remove NPE from IndexFollowingIT (#35717 ) Currently there is a common NPE in the IndexFollowingIT that does not indicate the test failing. This is when a cluster state listener is called and certain index metadata is not yet available. This commit checks that the metadata is not null before performing the logic that depends on the metadata.	2018-11-19 20:38:49 -07:00
Arthur Gavlyukovskiy	022726011c	Remove use of AbstractComponent in server (#35444 ) Removed extending of AbstractComponent and changed logger usage to explicit declaration. Abstract classes still have logger declaration using this.getClass() in order to show implementation class name in its logs. See #34488	2018-11-16 16:10:32 -05:00
Martijn van Groningen	0487181d0f	[TEST] Force flush to ensure multiple segments. Relates to #35333	2018-11-13 14:58:17 +01:00
Jason Tedor	3859d21661	Fix the names of CCR stats endpoints in usage API (#35438 ) This commit fixes the names of the CCR stats endpoints reported in the usage API.	2018-11-12 10:27:12 -05:00
Martijn van Groningen	ef10461caf	[TEST] Instead of ignoring the ccr downgrade to basic license qa test avoid the assertions that check the log files, because that does not work on Windows. The rest of the test is still useful and should work on Windows CI. Currently on Windows CI this qa module fails because there is just one test and that test si ignored if OS is Windows.	2018-11-12 10:17:33 +01:00
Martijn van Groningen	ae2af20ae5	[CCR] Validate remote cluster license as part of put auto follow pattern api call (#35364 ) Validate remote cluster license as part of put auto follow pattern api call in addition of validation that when auto follow coordinator starts auto following indices in the leader cluster. Also added qa module that tests what happens to ccr after downgrading to basic license. Existing active follow indices should remain to follow, but the auto follow feature should not pickup new leader indices.	2018-11-09 17:43:43 +01:00
Martijn van Groningen	807ce10f73	[TEST] Increased timeout for verifying ccr monitoring.	2018-11-09 15:40:15 +01:00
Martijn van Groningen	fba811fa3a	[TEST] increased the number of index and delete ops to make it less likely that all ops exist as soft delete docs.	2018-11-09 15:31:51 +01:00
Martijn van Groningen	83152b3835	[CCR] Get all auto follow patterns and no auto follow metadata (#35381 ) Return empty response when querying all auto follow patterns, but there is no auto follow metadata.	2018-11-09 14:24:27 +01:00
Martijn van Groningen	07a69a528b	[CCR] Rename leaderClient variables and parameters to remoteClient (#35368 )	2018-11-08 16:26:14 +01:00
Martijn van Groningen	8a85251da0	[CCR] Auto follow Coordinator fetch cluster state in system context (#35120 ) Auto follow Coordinator should fetch the leader cluster state using system context.	2018-11-08 10:48:27 +01:00
Martijn van Groningen	2f2090f562	[CCR] Adjust list of dynamic index settings that should be replicated (#35195 ) Adjust list of dynamic index settings that should be replicated and added a test that verifies whether builtin dynamic index settings are classified as replicated or non replicated (whitelisted).	2018-11-07 21:59:58 -05:00
Jason Tedor	4f4fc3b8f8	Replicate index settings to followers (#35089 ) This commit uses the index settings version so that a follower can replicate index settings changes as needed from the leader. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2018-11-07 21:20:51 -05:00
Martijn van Groningen	314b9ca44c	[CCR] Enforce auto follow pattern name restrictions (#35197 ) An auto follow pattern: * cannot start with `_` * cannot contain a `,` * can be encoded in UTF-8 * the length of UTF-8 encoded bytes is no longer than 255 bytes	2018-11-07 20:16:26 +01:00
Martijn van Groningen	e685cfe8f9	[CCR] Fail with a better error if leader index is red (#35298 ) as part of fetching history uuids from leader index.	2018-11-07 13:23:30 +01:00
Martijn van Groningen	2395e16d84	[CCR] Change resume follow api to be a master node action (#35249 ) In order to start shard follow tasks, the resume follow api already needs execute N requests to the elected master node. The pause follow API is also a master node action, which would make how both APIs execute more consistent.	2018-11-07 07:38:44 +01:00
Martijn van Groningen	a937d7f5f3	[CCR] Forgot missing return statement, Error was thrown if leader index had no soft deletes enabled, but it then continued creating the follower index. The test caught this bug, but very rarely due to timing issue. Build failure instance: ``` 1> [2018-11-05T20:29:38,597][INFO ][o.e.x.c.LocalIndexFollowingIT] [testDoNotCreateFollowerIfLeaderDoesNotHaveSoftDeletes] before test 1> [2018-11-05T20:29:38,599][INFO ][o.e.c.s.ClusterSettings ] [node_s_0] updating [cluster.remote.local.seeds] from [[]] to [["127.0.0.1:9300"]] 1> [2018-11-05T20:29:38,599][INFO ][o.e.c.s.ClusterSettings ] [node_s_0] updating [cluster.remote.local.seeds] from [[]] to [["127.0.0.1:9300"]] 1> [2018-11-05T20:29:38,609][INFO ][o.e.c.m.MetaDataCreateIndexService] [node_s_0] [leader-index] creating index, cause [api], templates [random-soft-deletes-templat e, one_shard_index_template], shards [2]/[0], mappings [] 1> [2018-11-05T20:29:38,628][INFO ][o.e.c.r.a.AllocationService] [node_s_0] Cluster health status changed from [YELLOW] to [GREEN] (reason: [shards started [[leader- index][0]] ...]). 1> [2018-11-05T20:29:38,660][INFO ][o.e.x.c.a.TransportPutFollowAction] [node_s_0] [follower-index] creating index, cause [ccr_create_and_follow], shards [2]/[0] 1> [2018-11-05T20:29:38,675][INFO ][o.e.c.s.ClusterSettings ] [node_s_0] updating [cluster.remote.local.seeds] from [["127.0.0.1:9300"]] to [[]] 1> [2018-11-05T20:29:38,676][INFO ][o.e.c.s.ClusterSettings ] [node_s_0] updating [cluster.remote.local.seeds] from [["127.0.0.1:9300"]] to [[]] 1> [2018-11-05T20:29:38,678][INFO ][o.e.x.c.LocalIndexFollowingIT] [testDoNotCreateFollowerIfLeaderDoesNotHaveSoftDeletes] after test 1> [2018-11-05T20:29:38,678][INFO ][o.e.x.c.LocalIndexFollowingIT] [testDoNotCreateFollowerIfLeaderDoesNotHaveSoftDeletes] [LocalIndexFollowingIT#testDoNotCreateFoll owerIfLeaderDoesNotHaveSoftDeletes]: cleaning up after test 1> [2018-11-05T20:29:38,678][INFO ][o.e.c.m.MetaDataDeleteIndexService] [node_s_0] [follower-index/TlWlXp0JSVasju2Kr_hksQ] deleting index 1> [2018-11-05T20:29:38,678][INFO ][o.e.c.m.MetaDataDeleteIndexService] [node_s_0] [leader-index/FQ6EwIWcRAKD8qvOg2eS8g] deleting index FAILURE 0.23s J0 \| LocalIndexFollowingIT.testDoNotCreateFollowerIfLeaderDoesNotHaveSoftDeletes <<< FAILURES! > Throwable #1: java.lang.AssertionError: > Expected: <false> > but: was <true> > at __randomizedtesting.SeedInfo.seed([7A3C89DA3BCA17DD:65C26CBF6FEF0B39]:0) > at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:20) > at org.elasticsearch.xpack.ccr.LocalIndexFollowingIT.testDoNotCreateFollowerIfLeaderDoesNotHaveSoftDeletes(LocalIndexFollowingIT.java:83) > at java.lang.Thread.run(Thread.java:748) ``` Build failure: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.5+intake/46/console	2018-11-06 16:05:35 +01:00
Martijn van Groningen	46c238d792	[CCR] Improve error when operations are missing (#35179 ) Improve error when operations are missing	2018-11-06 08:42:47 +01:00
Martijn van Groningen	cac67f8bcc	[CCR] Add extra validation to unfollow api (#35245 ) Validate whether the follow index actually exists and whether the follow index actually has custom ccr metadata.	2018-11-06 08:00:34 +01:00
Nik Everett	f72ef9b5fd	Build: Pull "skip assemble on qa" to common build (#35214 ) Pull all of the logic that we use to skip the `assemble` and `dependenciesInfo` tasks on `qa` projects into one spot in our root build file.	2018-11-05 16:16:00 -05:00
Alexander Reelsen	409050e8de	Refactor: Remove settings from transport action CTOR (#35208 ) As settings are not used in the transport action constructor, this removes the passing of the settings in all the transport actions.	2018-11-05 13:08:18 +01:00
Martijn van Groningen	ddda2d419c	[CCR] Change max_read_request_size default (#35247 ) This changes the max_read_request_size default from unlimited to 32MB.	2018-11-05 12:51:42 +01:00
Nhat Nguyen	54e1231ebd	CCR/TEST: Limit indexing docs in FollowerFailOverIT (#35228 ) The suite FollowerFailOverIT is failing because some documents are not replicated to the follower. Maybe the FollowTask is not working as expected or the background indexers eat all resources while the follower cluster is trying to reform after a failover; then CI is not fast enough to replicate all the indexed docs within 60 seconds (sometimes I see 80k docs on the leader). This commit limits the number of documents to be indexed into the leader index by the background threads so that we can eliminate the latter case. This change also replaces a docCount assertion with a docIds assertion so we can have more information if these tests fail again. Relates #33337	2018-11-03 10:00:54 -04:00
David Kyle	43cff56aec	Mute FollowerFailOverIT.testFollowIndexAndCloseNode Issue #33337	2018-11-02 15:13:09 +00:00
Nhat Nguyen	4875d6fb0b	CCR: Add NodeClosedException to retryable list (#35191 ) This change adds NodeClosedException to the retry-able exception list.	2018-11-02 07:01:46 -04:00
Martijn van Groningen	19d6cf1b9e	[CCR] Change response classes to not use from Streamable. (#35085 ) Only the response classes of get auto follow pattern, the follow and stats APIs were moved away from Streamable. The other APIs use `AcknowledgedResponse` or `BaseTasksResponse` as response class and moving that class away from Streamable is a bigger change.	2018-11-02 08:02:17 +01:00
Nhat Nguyen	dbc7c9259c	CCR/TEST: Add debug log to testFailOverOnFollower	2018-11-01 22:28:06 -04:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Martijn van Groningen	6da2fb7d5b	Change CCR API request classes to use Writeable serialization instead of Streamable (#34911 ) Only the follow stats request couldn't be changed to use Writeable serialization, because that requires changes in `TransportTasksAction` and `BaseTasksRequest` base classes.	2018-10-30 11:03:30 +01:00
Jason Tedor	cc9894d78a	Fix name of CCR stats transport action This class name should include an indication that it is for CCR. This commit adds that to the name of this class.	2018-10-29 09:50:13 -04:00
Jason Tedor	26f5c509af	Fix CCR API specification (#34963 ) This commit fixes two issues with the CCR API specification: - remove the CCR stats endpoint, it is not currently implemented - fix the documentation links	2018-10-29 09:37:13 -04:00
Martijn van Groningen	b2daaf15d1	[CCR] move tests that modify test cluster from the main test class to a dedicated class.	2018-10-29 13:45:59 +01:00
Martijn van Groningen	1801518527	[CCR] Refactor stats APIs (#34912 ) * Changed the auto follow stats to also include follow stats. * Renamed the auto follow stats api to stats api and changed its url path from `/_ccr/auto_follow/stats` `/_ccr/stats`. * Removed `/_ccr/stats` url path for the follow stats api, which makes the index parameter a required parameter. * Fixed docs.	2018-10-29 07:45:27 +01:00
Martijn van Groningen	bad5972f62	[CCR] Fix request serialization bug (#34917 ) and some parameters that were not set in tests.	2018-10-29 07:38:55 +01:00
Jason Tedor	43f6ba1c63	Fix put/resume follow request parsing (#34913 ) This commit adds some fields that were missing from put follow, and fixes a bug in resume follow.	2018-10-26 11:09:55 -04:00
Martijn van Groningen	306f1d78f8	[CCR] Retry when no index shard stats can be found (#34852 ) Index shard stats for the follower shard are fetched, when a shard follow task is started. This is needed in order to bootstap the shard follow task with the follower global checkpoint. Sometimes index shard stats are not available (e.g. during a restart) and we fail now, while it is very likely that these stats will be available some time later.	2018-10-26 15:14:24 +02:00
Nhat Nguyen	ff49e79d40	CCR: Rename follow-task parameters and stats (#34836 ) * CCR: Rename follow parameters and stats This commit renames the follow-task parameters and its stats. Below are the changes: ## Params - remote_cluster (unchanged) - leader_index (unchanged) - max_read_request_operation_count -> max_read_request_operation_count - max_batch_size -> max_read_request_size - max_write_request_operation_count (new) - max_write_request_size (new) - max_concurrent_read_batches -> max_outstanding_read_requests - max_concurrent_write_batches -> max_outstanding_write_requests - max_write_buffer_size (unchanged) - max_write_buffer_count (unchanged) - max_retry_delay (unchanged) - poll_timeout -> read_poll_timeout ## Stats - remote_cluster (unchanged) - leader_index (unchanged) - follower_index (unchanged) - shard_id (unchanged) - leader_global_checkpoint (unchanged) - leader_max_seq_no (unchanged) - follower_global_checkpoint (unchanged) - follower_max_seq_no (unchanged) - last_requested_seq_no (unchanged) - number_of_concurrent_reads -> outstanding_read_requests - number_of_concurrent_writes -> outstanding_write_requests - buffer_size_in_bytes -> write_buffer_size_in_bytes (new) - number_of_queued_writes -> write_buffer_operation_count - mapping_version -> follower_mapping_version - total_fetch_time_millis -> total_read_time_millis - total_fetch_remote_time_millis -> total_read_remote_exec_time_millis - number_of_successful_fetches -> successful_read_requests - number_of_failed_fetches -> failed_read_requests - operation_received -> operations_read - total_transferred_bytes -> bytes_read - total_index_time_millis -> total_write_time_millis [?] - number_of_successful_bulk_operations -> successful_write_requests - number_of_failed_bulk_operations -> failed_write_requests - number_of_operations_indexed -> operations_written - fetch_exception -> read_exceptions - time_since_last_read_millis -> time_since_last_read_millis * add test for max_write_request_(operation_count\|size)	2018-10-25 10:36:15 +02:00
Martijn van Groningen	6fe0e62b7a	[CCR] Added write buffer size limit (#34797 ) This limit is based on the size in bytes of the operations in the write buffer. If this limit is exceeded then no more read operations will be coordinated until the size in bytes of the write buffer has dropped below the configured write buffer size limit. Renamed existing `max_write_buffer_size` to ``max_write_buffer_count` to indicate that limit is count based. Closes #34705	2018-10-24 23:48:49 +02:00
Andrey Atapin	5f588180f9	Improve IndexNotFoundException's default error message (#34649 ) This commit adds the index name to the error message when an index is not found.	2018-10-24 12:53:31 -07:00
Nhat Nguyen	d73768f812	CCR: Do not follow if leader does not have soft-deletes (#34767 ) We should not create a follower index and abort a follow request if the leader does not have soft-deletes. Moreover, we also should not auto-follow an index if it does not have soft-deletes.	2018-10-24 11:19:39 -04:00

1 2 3 4 5 ...

261 Commits