OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	fab5e21e7d	Build: Split distributions into oss and default This commit makes x-pack a module and adds it to the default distrubtion. It also creates distributions for zip, tar, deb and rpm which contain only oss code.	2018-04-20 15:33:57 -07:00
Ryan Ernst	cb56bf49d5	Fixup build after opening x-pack	2018-04-20 15:33:46 -07:00
Ryan Ernst	2efd22454a	Migrate x-pack-elasticsearch source to elasticsearch	2018-04-20 15:29:54 -07:00
Martijn van Groningen	9b9d0f9057	Enabled licence header check and fixed unchecked casts. (#4408 )	2018-04-20 11:15:52 +02:00
Martijn van Groningen	cfd7847628	fixed issues after merging in master	2018-04-20 07:59:13 +02:00
Nhat Nguyen	f97aec7b8b	Sibling of enforce access to translog via engine Since elastic/elasticsearch#29542, we no longer expose translog instance but only provide creating translog snapshot method. This commit adapts that change in CCR branch. Relates elastic/elasticsearch#29542	2018-04-18 11:54:00 -04:00
Martijn van Groningen	56ca59a513	Add the ability to the follow index to follow an index in a remote cluster. The follow index api completely reuses CCS infrastructure that was exposed via: https://github.com/elastic/elasticsearch/pull/29495 This means that the leader index parameter support the same ccs index to indicate that an index resides in a different cluster. I also added a qa module that smoke tests the cross cluster nature of ccr. The idea is that this test just verifies that ccr can read data from a remote leader index and that is it, no crazy randomization or indirectly testing other features.	2018-04-17 07:36:40 +02:00
Martijn van Groningen	c0d42e9cd1	Fixed test	2018-04-16 10:48:46 +02:00
Martijn van Groningen	a94b38b88e	Fixed compile errors and test failures after merging master into ccr.	2018-04-13 16:35:09 +02:00
Martijn van Groningen	d77f756f5c	ccr: use indices stats api to fetch global checkpoint of the follower shards and keep track of shard follow stats inside shard follow stats' node task instead of persistent task status. By maintaining the shard follow stats inside its node task the stats update is quicker as no cluster state update is required. The stats are now transient; meaning if the task is going to run a different node then the stats are gone too. Currently only the processed global checkpoint is being tracked and this is being restored when a shard follow node task starts via the indices stats api (the reason of the first change of this change). Other stats that we may add in the future (like fetch_time, see: https://gist.github.com/s1monw/dba13daf8493bf48431b72365e110717) it is ok if we start from zero in case a shard follow task moves to another node.	2018-04-05 14:52:20 +02:00
Martijn van Groningen	d976fa44e7	Removed LocalCheckpointTracker usage.	2018-03-29 07:41:23 +02:00
Martijn van Groningen	a22a7d079d	ccr: Added maximum translog limit that a single shard changes response can return. This limit is based on the number of estimate bytes in each translog operation that fall between the minimum and maximum request sequence number. If this limit is met then the shard follow task executor will make sure that a subsequent shard changes request will be performed to fetch the remaining translog operations. This limit is needed in order to protect against returning too many translog operations in a single shard changes response. Relates to #2436	2018-03-28 15:49:57 +02:00
Martijn van Groningen	282740610b	Fixed test after merging in master branch.	2018-03-28 09:54:41 +02:00
Nhat Nguyen	51111a8106	CCR: Stop FollowExistingIndexAction after report failure (#4111 ) We check for the existence of both leader and follower index, then properly report to the caller. However, we do not return after reporting failure. This causes the caller receive exception twice: IllegalArgumentException then NullPointerException. This commit makes sure to stop the action after reporting failure.	2018-03-26 13:56:47 -04:00
Martijn van Groningen	9e4c68c389	Fixed compile and test errors after merging in master	2018-03-16 17:47:10 +01:00
Martijn van Groningen	10cfa21a68	required changes after merge master branch into ccr branch.	2018-02-22 15:03:33 +01:00
Martijn van Groningen	1a9a7ffe97	removed hack	2018-02-07 17:54:28 +01:00
Martijn van Groningen	c442d14f1d	Several changes that were required after merging master into the ccr branch.	2018-02-05 13:25:58 +01:00
Martijn van Groningen	4e818254ad	re-enabled java integration tests	2018-01-25 14:18:34 +01:00
Martijn van Groningen	05d3d2e49c	fix packages after merge	2018-01-24 09:28:42 +01:00
Jason Tedor	9b6bb2c635	Enable run task for CCR This commit enables the run task for ccr by specifying that the ccr project not be evaluated until after core is evaluated. This is important since ccr is alphabetically before core and thus Gradle evaluates it first. Relates #3665	2018-01-22 15:07:20 -05:00
Martijn van Groningen	83a82d83d0	Moved ccr source code to its own gradle module after xpack split.	2018-01-22 11:09:04 +01:00
Martijn van Groningen	7a4860452a	Update the persistent task status in the background. Follow up for #3256	2018-01-16 10:48:55 +01:00
Jason Tedor	bad5135d35	Fix checkpoints in assertion The checkpoints in the assertion message that the follower checkpoint is less than the leader checkpoint are backwards. This commit fixes this message.	2018-01-15 11:50:12 -05:00
Martijn van Groningen	18788531c8	Fixed test failure. Relates to #3256	2018-01-12 14:04:26 +01:00
Martijn van Groningen	a56b0479e1	Changed the chunk coordinator to process the chunks concurrently. (#3256 ) The shard follow task executor determines the range of translog operations between the leader shard's global checkpoint and the last know processed seqno by the current shard follow task that are missing. Then the chunks coordinator can then chunk this range up in smaller ranges if the requested range is above the configured max chunk size. If it is smaller than the entire range then the chunk coordinator has just one chuck to coordinate. Each chunk is added to a queue and is processed by the ChunkProcessor, that reads the translog ops from the leader shard and then indexes these translog ops into the follow shard. After that a new chuck is polled from the queue and the ChunkProcessor performs the same actions until there are no more chunks in the queue to process. After that the shard follow task executor will determine a new range of translog operations to process. This change changes the chunk coordinator to start polling from the chunk queue with multiple threads at the same time to handle dealing with a higher indexing load on the leader side better.	2018-01-10 15:29:11 +01:00
Martijn van Groningen	38aea9588b	Fixed compile errors after merge.	2018-01-04 18:18:46 +01:00
Martijn van Groningen	dbdefc5689	Fix issue after merging	2017-12-21 14:53:12 +01:00
Martijn van Groningen	973f126c24	Merge branch 'master' into ccr	2017-12-18 16:52:09 +01:00
Martijn van Groningen	6c06aacb57	Fixed NumberFormatException in the rest layer by only setting batch_size if it has been specified.	2017-12-07 09:34:25 +01:00
Martijn van Groningen	0dc5a89bc5	fixed compile error	2017-12-01 14:14:17 +01:00
Martijn van Groningen	cc5665bce6	pass down batch size correctly	2017-12-01 14:14:17 +01:00
Martijn van Groningen	3cd5eed91e	fixed code violations	2017-12-01 14:14:17 +01:00
Martijn van Groningen	c3f7d4f580	* Made ChunkCoordinator unittest testable * Fixed a small issue where each batch would fetch / index the previous batch last operation * Made batch size a request param on the follow existing index api request. This makes is easy to tune this param when running tests from scripts. * Changed default batch size from 256 to 1024.	2017-12-01 14:14:17 +01:00
Martijn van Groningen	2e382bf7f3	Disallow dynamic mapping updated in follow shards I forgot to configure a mapping in the follow shard shard, which caused a dynamic update (due to type auto creation), but this was ignored. Subsequent searches in follow index then failed due to a mapping missing. (The _id couldn't be fetched during fetch phase, because the mapping was missing) We should at a later stage investigate how to best solve this, but for know to avoid confusion just fail if a dynamic update happens in a follow shard.	2017-12-01 09:33:06 +01:00
Jason Tedor	be5f83a6bd	Implement translog operation bulk action This commit adds a bulk action for apply translog operations in bulk to an index. This action is then used in the persistent task for CCR to apply shard changes from a leader shard. Relates #3147	2017-11-29 07:29:56 -05:00
Jason Tedor	11aa83011c	Fix get operations between test This test was broken by an upstream change that no longer guarantees we see the operations from the upstream translog in the order they appear in that translog. As such, the assertions in this test were too strong so this commit relaxes them. Relates #3153	2017-11-29 07:16:52 -05:00
Jason Tedor	abd263fca9	Merge branch 'master' into ccr * master: [DOCS] Fixed cross cluster search docs issue (#3113) Watcher: Add transform input for chained input (#2861) [DOCS] Fix cleaning up ML resources after tests [ML] Specify ML_ORIGIN when calling the job update endpoint internally (#3110) Refactor CLI commands as logging-aware commands [ML] Stop datafeed when job fails (#3107)	2017-11-27 14:03:10 -05:00
Martijn van Groningen	e6ad1c3e9d	ccr: Added apis and persistent tasks for following an index and log changes instead of indexing.	2017-11-22 09:04:59 +01:00
Jason Tedor	8e0b34b507	Override indexing plans Operations from a leader shard will be indexed into the engine with the origin set to primary. The problem is here is that then we have primary semantics in the engine such as assertions about sequence numbers being unassigned, and we do not have correct semantics for out-of-order delivery of operations (as we should on a following engine, whether or not it is primary since the ordering is determined from the leader). This commit handles this by always using the replica plan for indexing into a following engine, whether or not the engine is for a primary shard. Relates #3000	2017-11-15 13:38:46 -05:00
Jason Tedor	913936f2b0	Maintain order of operations semantics on follower A following engine even for a primary shard needs to maintain order of operations semantics as if it were behaving like a replica. That is, rather than assuming that the order of operations presented to the engine is the de facto order of operations as is the case for a leader engine for a primary shard, a following engine must behave like all replicas behave which is that they resolve order of operations based on sequence numbers. This commit causes this to be the case for following engines. Relates #2931	2017-11-09 15:18:54 -05:00
Martijn van Groningen	bf4e18fdfc	fixed compile error	2017-11-02 15:12:35 +01:00
Jason Tedor	769349a9ab	Add following engine implementation This commit is a first step towards a following engine implementation. Future work will build on this by using this engine to execute operations on a following engine from another engine (typically a remote leader engine) that has already assigned sequence numbers to such operations. Relates #2776	2017-10-30 13:38:02 -04:00
Martijn van Groningen	41c3dc91c1	Changed the shard changes api to use get threadpool instead of management threadpool.	2017-10-17 10:02:28 +02:00
Martijn van Groningen	2663b5ef39	remove unused import	2017-10-16 10:36:22 +02:00
Jason Tedor	85c1935d9d	Increase generations in operations between test This commit sets an index setting for the size of a translog generation and increases the number of documents indexed to increase the chance of multiple generations being present when testing getting operations between two sequence numbers.	2017-10-16 08:59:18 +02:00
Jason Tedor	bcd61bfca5	Fix off-by-one error in shard changes action test This commit fixes an off-by-one error in the shard changes action test for getting operations between two sequence numbers. The off-by-one error arises because sequence numbers are indexed from zero, so if N documents are indexed then the maximum sequence number starting from zero would be N - 1.	2017-10-16 08:54:04 +02:00
Martijn van Groningen	1f495f59a1	[CCR] Add an internal api to read translog operations between a sequence number range (#2563 ) * xdcr: Add an internal api to read translog operations between a sequence number range. This api will be used later by the persistent task for the following index to pull data from the leader index. The persistent task can fetch the global checkpoint from the shard stats for each primary shard of the leader index. Based on the global checkpoint of the primary shards of the following index, the persistent task can send several calls to the internal api added in this commit to replicate changes from follow index to leader index in a batched manner.	2017-10-16 07:38:03 +02:00
Jason Tedor	3033aba67b	Introduce pluggable engine factory for CCR This commit utilizes the pluggable engine factory feature in core to introduce a pluggable engine factory for XDCR. For now this is only a skeleton implementation to proof out the pluggable engine factory concept. Future work will implement a genuine following engine for XDCR. Relates #2655	2017-10-12 10:52:33 -04:00
Jason Tedor	063a76c3db	Introduce CCR container class This commit introduces the container class for CCR functionality. Future work will expose more specific CCR functionality to the X-Pack plugin through this class. Relates #2704	2017-10-06 14:46:39 -04:00

1 2 3

150 Commits