mirror of https://github.com/apache/lucene.git
SOLR-11412: Add docs for bi-directional CDCR; split CDCR pages into multiple child pages
This commit is contained in:
parent
83753d0a2a
commit
52cefbe742
|
@ -0,0 +1,313 @@
|
||||||
|
= CDCR API
|
||||||
|
// Licensed to the Apache Software Foundation (ASF) under one
|
||||||
|
// or more contributor license agreements. See the NOTICE file
|
||||||
|
// distributed with this work for additional information
|
||||||
|
// regarding copyright ownership. The ASF licenses this file
|
||||||
|
// to you under the Apache License, Version 2.0 (the
|
||||||
|
// "License"); you may not use this file except in compliance
|
||||||
|
// with the License. You may obtain a copy of the License at
|
||||||
|
//
|
||||||
|
// http://www.apache.org/licenses/LICENSE-2.0
|
||||||
|
//
|
||||||
|
// Unless required by applicable law or agreed to in writing,
|
||||||
|
// software distributed under the License is distributed on an
|
||||||
|
// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
|
||||||
|
// KIND, either express or implied. See the License for the
|
||||||
|
// specific language governing permissions and limitations
|
||||||
|
// under the License.
|
||||||
|
|
||||||
|
The CDCR API is used to control and monitor the replication process. Control actions are performed at a collection level, i.e., by using the following base URL for API calls: `\http://localhost:8983/solr/<collection>/cdcr`.
|
||||||
|
|
||||||
|
Monitor actions are performed at a core level, i.e., by using the following base URL for API calls: `\http://localhost:8983/solr/<core>/cdcr`.
|
||||||
|
|
||||||
|
Currently, none of the CDCR API calls have parameters.
|
||||||
|
|
||||||
|
== API Entry Points
|
||||||
|
|
||||||
|
*Control*
|
||||||
|
|
||||||
|
* `<collection>/cdcr?action=STATUS`: <<CDCR STATUS,Returns the current state>> of CDCR.
|
||||||
|
* `<collection>/cdcr?action=START`: <<CDCR START,Starts CDCR>> replication
|
||||||
|
* `<collection>/cdcr?action=STOP`: <<CDCR STOP,Stops CDCR>> replication.
|
||||||
|
* `<collection>/cdcr?action=ENABLEBUFFER`: <<ENABLEBUFFER,Enables the buffering>> of updates.
|
||||||
|
* `<collection>/cdcr?action=DISABLEBUFFER`: <<DISABLEBUFFER,Disables the buffering>> of updates.
|
||||||
|
|
||||||
|
*Monitoring*
|
||||||
|
|
||||||
|
* `core/cdcr?action=QUEUES`: <<QUEUES,Fetches statistics about the queue>> for each replica and about the update logs.
|
||||||
|
* `core/cdcr?action=OPS`: <<OPS,Fetches statistics about the replication performance>> (operations per second) for each replica.
|
||||||
|
* `core/cdcr?action=ERRORS`: <<ERRORS,Fetches statistics and other information about replication errors>> for each replica.
|
||||||
|
|
||||||
|
== Control Commands
|
||||||
|
|
||||||
|
=== CDCR STATUS
|
||||||
|
|
||||||
|
`solr/<collection>/cdcr?action=STATUS`
|
||||||
|
|
||||||
|
==== CDCR Status Example
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/techproducts/cdcr?action=STATUS
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader": {
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 0
|
||||||
|
},
|
||||||
|
"status": {
|
||||||
|
"process": "stopped",
|
||||||
|
"buffer": "enabled"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== ENABLEBUFFER
|
||||||
|
|
||||||
|
`solr/<collection>/cdcr?action=ENABLEBUFFER`
|
||||||
|
|
||||||
|
==== Enable Buffer Example
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/techproducts/cdcr?action=ENABLEBUFFER
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader": {
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 0
|
||||||
|
},
|
||||||
|
"status": {
|
||||||
|
"process": "started",
|
||||||
|
"buffer": "enabled"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== DISABLEBUFFER
|
||||||
|
|
||||||
|
`solr/<collection>/cdcr?action=DISABLEBUFFER`
|
||||||
|
|
||||||
|
==== Disable Buffer Example
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/techproducts/cdcr?action=DISABLEBUFFER
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader": {
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 0
|
||||||
|
},
|
||||||
|
"status": {
|
||||||
|
"process": "started",
|
||||||
|
"buffer": "disabled"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== CDCR START
|
||||||
|
|
||||||
|
`solr/<collection>/cdcr?action=START`
|
||||||
|
|
||||||
|
==== CDCR Start Examples
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/techproducts/cdcr?action=START
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader": {
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 0
|
||||||
|
},
|
||||||
|
"status": {
|
||||||
|
"process": "started",
|
||||||
|
"buffer": "enabled"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== CDCR STOP
|
||||||
|
|
||||||
|
`solr/<collection>/cdcr?action=STOP`
|
||||||
|
|
||||||
|
==== CDCR Stop Examples
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/techproducts/cdcr?action=STOP
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader": {
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 0
|
||||||
|
},
|
||||||
|
"status": {
|
||||||
|
"process": "stopped",
|
||||||
|
"buffer": "enabled"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
|
||||||
|
== CDCR Monitoring Commands
|
||||||
|
|
||||||
|
=== QUEUES
|
||||||
|
|
||||||
|
`solr/<core>/cdcr?action=QUEUES`
|
||||||
|
|
||||||
|
==== QUEUES Response
|
||||||
|
|
||||||
|
The output is composed of a list “queues” which contains a list of (ZooKeeper) Target hosts, themselves containing a list of Target collections. For each collection, the current size of the queue and the timestamp of the last update operation successfully processed is provided. The timestamp of the update operation is the original timestamp, i.e., the time this operation was processed on the Source SolrCloud. This allows an estimate the latency of the replication process.
|
||||||
|
|
||||||
|
The “queues” object also contains information about the update logs, such as the size (in bytes) of the update logs on disk (`tlogTotalSize`), the number of transaction log files (`tlogTotalCount`) and the status of the update logs synchronizer (`updateLogSynchronizer`).
|
||||||
|
|
||||||
|
==== QUEUES Examples
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/<replica_name>/cdcr?action=QUEUES
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader":{
|
||||||
|
"status": 0,
|
||||||
|
"QTime": 1
|
||||||
|
},
|
||||||
|
"queues":{
|
||||||
|
"127.0.0.1: 40342/solr":{
|
||||||
|
"Target_collection":{
|
||||||
|
"queueSize": 104,
|
||||||
|
"lastTimestamp": "2014-12-02T10:32:15.879Z"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"tlogTotalSize":3817,
|
||||||
|
"tlogTotalCount":1,
|
||||||
|
"updateLogSynchronizer": "stopped"
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== OPS
|
||||||
|
|
||||||
|
`solr/<core>/cdcr?action=OPS`
|
||||||
|
|
||||||
|
|
||||||
|
==== OPS Response
|
||||||
|
|
||||||
|
Provides the average number of operations as a sum and broken down by adds/deletes.
|
||||||
|
|
||||||
|
==== OPS Examples
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/<replica_name>/cdcr?action=OPS
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader":{
|
||||||
|
"status":0,
|
||||||
|
"QTime":1
|
||||||
|
},
|
||||||
|
"operationsPerSecond":{
|
||||||
|
"127.0.0.1: 59661/solr":{
|
||||||
|
"Target_collection":{
|
||||||
|
"all": 297.102944952749052,
|
||||||
|
"adds": 297.102944952749052,
|
||||||
|
"deletes": 0.0
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
||||||
|
|
||||||
|
=== ERRORS
|
||||||
|
|
||||||
|
`solr/<core>/cdcr?action=ERRORS`
|
||||||
|
|
||||||
|
==== ERRORS Response
|
||||||
|
|
||||||
|
Provides the number of consecutive errors encountered by the replicator thread, the number of bad requests or internal errors since the start of the replication process, and a list of the last errors encountered ordered by timestamp.
|
||||||
|
|
||||||
|
==== ERRORS Examples
|
||||||
|
|
||||||
|
*Input*
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
http://localhost:8983/solr/<replica_name>/cdcr?action=ERRORS
|
||||||
|
----
|
||||||
|
|
||||||
|
*Output*
|
||||||
|
|
||||||
|
[source,json]
|
||||||
|
----
|
||||||
|
{
|
||||||
|
"responseHeader":{
|
||||||
|
"status":0,
|
||||||
|
"QTime":2
|
||||||
|
},
|
||||||
|
"errors": {
|
||||||
|
"127.0.0.1: 36872/solr":{
|
||||||
|
"Target_collection":{
|
||||||
|
"consecutiveErrors":3,
|
||||||
|
"bad_request":0,
|
||||||
|
"internal":3,
|
||||||
|
"last":{
|
||||||
|
"2014-12-02T11:04:42.523Z":"internal",
|
||||||
|
"2014-12-02T11:04:39.223Z":"internal",
|
||||||
|
"2014-12-02T11:04:38.22Z":"internal"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
||||||
|
----
|
|
@ -0,0 +1,157 @@
|
||||||
|
= CDCR Architecture
|
||||||
|
// Licensed to the Apache Software Foundation (ASF) under one
|
||||||
|
// or more contributor license agreements. See the NOTICE file
|
||||||
|
// distributed with this work for additional information
|
||||||
|
// regarding copyright ownership. The ASF licenses this file
|
||||||
|
// to you under the Apache License, Version 2.0 (the
|
||||||
|
// "License"); you may not use this file except in compliance
|
||||||
|
// with the License. You may obtain a copy of the License at
|
||||||
|
//
|
||||||
|
// http://www.apache.org/licenses/LICENSE-2.0
|
||||||
|
//
|
||||||
|
// Unless required by applicable law or agreed to in writing,
|
||||||
|
// software distributed under the License is distributed on an
|
||||||
|
// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
|
||||||
|
// KIND, either express or implied. See the License for the
|
||||||
|
// specific language governing permissions and limitations
|
||||||
|
// under the License.
|
||||||
|
|
||||||
|
== CDCR Architecture Overview
|
||||||
|
|
||||||
|
With CDCR, Source and Target data centers can each serve search queries when CDCR is operating. The Target data center will lag somewhat behind the Source cluster due to propagation delays.
|
||||||
|
|
||||||
|
Data changes on the Source data center are replicated to the Target data center only after they are persisted to disk. The data changes can be replicated in near real-time (with a small delay) or could be scheduled to be sent at longer intervals to the Target data center. CDCR can "bootstrap" the collection to the Target data center. Since this is a full copy of the entire index, network bandwidth should be considered. Of course both Source and Target collections may be empty to start.
|
||||||
|
|
||||||
|
Each shard leader in the Source data center will be responsible for replicating its updates to the corresponding leader in the Target data center. When receiving updates from the Source data center, shard leaders in the Target data center will replicate the changes to their own replicas as normal SolrCloud updates.
|
||||||
|
|
||||||
|
This replication model is designed to tolerate some degradation in connectivity, accommodate limited bandwidth, and support batch updates to optimize communication.
|
||||||
|
|
||||||
|
Replication supports both a new empty index and pre-built indexes. In the scenario where the replication is set up on a pre-built index in the Source cluster and nothing on the Target cluster, CDCR will replicate the _entire_ index from the Source to Target.
|
||||||
|
|
||||||
|
The directional nature of the implementation implies a "push" model from the Source collection to the Target collection. Therefore, the Source configuration must be able to "see" the ZooKeeper ensemble in the Target cluster. The ZooKeeper ensemble is provided configured in the Source's `solrconfig.xml` file.
|
||||||
|
|
||||||
|
CDCR is configured to replicate from collections in the Source cluster to collections in the Target cluster on a collection-by-collection basis. Since CDCR is configured in `solrconfig.xml` (on both Source and Target clusters), the settings can be tailored for the needs of each collection.
|
||||||
|
|
||||||
|
CDCR can be configured to replicate from one collection to a second collection _within the same cluster_. That is a specialized scenario not covered in this Guide.
|
||||||
|
|
||||||
|
=== Uni-Directional Architecture
|
||||||
|
|
||||||
|
When uni-directional updates are configured, updates and deletes are first written to the Source cluster, then forwarded to one or more Target data centers, as illustrated in this graphic:
|
||||||
|
|
||||||
|
.Uni-Directional Data Flow
|
||||||
|
image::images/cross-data-center-replication-cdcr-/CDCR_arch.png[image,width=700,height=525]
|
||||||
|
|
||||||
|
With uni-directional updates, the Target data center(s) will not propagate updates such as adds, updates, or deletes to the Source data center and updates should not be sent to any of the Target data center(s).
|
||||||
|
|
||||||
|
The data flow sequence is:
|
||||||
|
|
||||||
|
. A shard leader receives a new update that is processed by its update processor chain.
|
||||||
|
. The data update is first applied to the local index.
|
||||||
|
. Upon successful application of the data update on the local index, the data update is added to CDCR's Update Logs queue.
|
||||||
|
. After the data update is persisted to disk, the data update is sent to the replicas within the data center.
|
||||||
|
. After Step 4 is successful, CDCR reads the data update from the Update Logs and pushes it to the corresponding collection in the Target data center. This is necessary in order to ensure consistency between the Source and Target data centers.
|
||||||
|
. The leader on the Target data center writes the data locally and forwards it to all its followers.
|
||||||
|
|
||||||
|
Steps 1, 2, 3 and 4 are performed synchronously by SolrCloud; Step 5 is performed asynchronously by a background thread. Given that CDCR replication is performed asynchronously, it becomes possible to push batch updates in order to minimize network communication overhead. Also, if CDCR is unable to push the update at a given time, for example, due to a degradation in connectivity, it can retry later without any impact on the Source data center.
|
||||||
|
|
||||||
|
One implication of the architecture is that the leaders in the Source cluster must be able to "see" the leaders in the Target cluster. Since leaders may change in both Source and Target collections, all nodes in the Source cluster must be able to "see" all Solr nodes in the Target cluster. Firewalls, ACL rules, etc., must be configured to allow this.
|
||||||
|
|
||||||
|
This design works most robustly if both the Source and Target clusters have the same number of shards. There is no requirement that the shards in the Source and Target collection have the same number of replicas.
|
||||||
|
|
||||||
|
Having different numbers of shards on the Source and Target cluster is possible, but is also an "expert" configuration as that option imposes certain constraints and is not generally recommended. Most of the scenarios where having differing numbers of shards are contemplated are better accomplished by hosting multiple shards on each Solr instance.
|
||||||
|
|
||||||
|
=== Bi-Directional Architecture
|
||||||
|
|
||||||
|
When bi-directional updates are configured, either cluster can act as a Source or a Target, and that role can shift between the clusters, as illustrated in this graphic:
|
||||||
|
|
||||||
|
.Bi-Directional Data Flow
|
||||||
|
image::images/cross-data-center-replication-cdcr-/CDCR_bidir.png[image,width=700,height=525]
|
||||||
|
|
||||||
|
With bi-directional updates, indexing and querying must be done on a single cluster at a time to maintain consistency. The second cluster is used when the first cluster is down. Simplifying, one cluster can act as Source and other as Target but both roles, Source and Target, cannot be assigned to any single cluster at the same time. Failover is handled smoothly without any configuration changes. Updates sent from Source data center to Target is not propagated back to Source when bi-directional updates are configured.
|
||||||
|
|
||||||
|
The data flow sequence is similar from Step 1 to 6 above, with an additional step:
|
||||||
|
|
||||||
|
[start=7]
|
||||||
|
. When bi-directional updates are configured, the updates received from Source are flagged on Target and not forwarded further.
|
||||||
|
|
||||||
|
All the behavior(s) and constraint(s) explained in uni-directional data flow are applicable to the respective Source and Target clusters in this scenario.
|
||||||
|
|
||||||
|
== Major Components of CDCR
|
||||||
|
|
||||||
|
What follows is a discussion of the key features and components in CDCR’s architecture:
|
||||||
|
|
||||||
|
=== CDCR Configuration
|
||||||
|
|
||||||
|
In order to configure CDCR, the Source data center requires the host address of the ZooKeeper cluster associated with the Target data center. The ZooKeeper host address is the only information needed by CDCR to instantiate the communication with the Target Solr cluster. The CDCR configuration section of `solrconfig.xml` file on the Source cluster will therefore contain a list of ZooKeeper hosts. The CDCR configuration section of `solrconfig.xml` might also contain secondary/optional configuration, such as the number of CDC Replicator threads, batch updates related settings, etc.
|
||||||
|
|
||||||
|
=== CDCR Initialization
|
||||||
|
|
||||||
|
CDCR supports incremental updates to either new or existing collections. CDCR may not be able to keep up with very high volume updates, especially if there are significant communications latencies due to a slow "pipe" between the data centers. Some scenarios:
|
||||||
|
|
||||||
|
* There is an initial bulk load of a corpus followed by lower volume incremental updates. In this case, one can do the initial bulk load and then enable CDCR. See the section <<cdcr-config.adoc#initial-startup>> for more information.
|
||||||
|
* The index is being built up from scratch, without a significant initial bulk load. CDCR can be set up on empty collections and keep them synchronized from the start.
|
||||||
|
* The index is always being updated at a volume too high for CDCR to keep up. This is especially possible in situations where the connection between the Source and Target data centers is poor. This scenario is unsuitable for CDCR in its current form.
|
||||||
|
|
||||||
|
=== Inter-Data Center Communication
|
||||||
|
|
||||||
|
The CDCR REST API is the primary form of end-user communication for admin commands.
|
||||||
|
|
||||||
|
A SolrJ client is used internally for CDCR operations. The SolrJ client gets its configuration information from the `solrconfig.xml` file. Users of CDCR will not interact directly with the internal SolrJ implementation and will interact with CDCR exclusively through the REST API.
|
||||||
|
|
||||||
|
=== Updates Tracking & Pushing
|
||||||
|
|
||||||
|
CDCR replicates data updates from the Source to the Target data center by leveraging Update Logs. These logs will replace SolrCloud's transaction log.
|
||||||
|
|
||||||
|
A background thread regularly checks the Update Logs for new entries, and then forwards them to the Target data center. The thread therefore needs to keep a checkpoint in the form of a pointer to the last update successfully processed in the Update Logs. Upon acknowledgement from the Target data center that updates have been successfully processed, the Update Logs pointer is updated to reflect the current checkpoint.
|
||||||
|
|
||||||
|
This pointer must be synchronized across all the replicas. In the case where the leader goes down and a new leader is elected, the new leader will be able to resume replication from the last update by using this synchronized pointer. The strategy to synchronize such a pointer across replicas will be explained next.
|
||||||
|
|
||||||
|
If for some reason, the Target data center is offline or fails to process the updates, the thread will periodically try to contact the Target data center and push the updates while buffering updates on the Source cluster. One implication of this is that the Source Update Logs directory should be periodically monitored as the updates will continue to accumulate and will not be purged until the connection to the Target data center is restored.
|
||||||
|
|
||||||
|
=== Synchronization of Update Checkpoints
|
||||||
|
|
||||||
|
A reliable synchronization of the update checkpoints between the shard leader and shard replicas is critical to avoid introducing inconsistency between the Source and Target data centers. Another important requirement is that the synchronization must be performed with minimal network traffic to maximize scalability.
|
||||||
|
|
||||||
|
In order to achieve this, the strategy is to:
|
||||||
|
|
||||||
|
* Uniquely identify each update operation. This unique identifier will serve as pointer.
|
||||||
|
* Rely on two storages: an ephemeral storage on the Source shard leader, and a persistent storage on the Target cluster.
|
||||||
|
|
||||||
|
The shard leader in the Source cluster will be in charge of generating a unique identifier for each update operation, and will keep a copy of the identifier of the last processed updates in memory. The identifier will be sent to the Target cluster as part of the update request. On the Target data center side, the shard leader will receive the update request, store it along with the unique identifier in the Update Logs, and replicate it to the other shards.
|
||||||
|
|
||||||
|
SolrCloud already provides a unique identifier for each update operation, i.e., a “version” number. This version number is generated using a time-based lmport clock which is incremented for each update operation sent. This provides a “happened-before” ordering of the update operations that will be leveraged in (1) the initialization of the update checkpoint on the Source cluster, and in (2) the maintenance strategy of the Update Logs.
|
||||||
|
|
||||||
|
The persistent storage on the Target cluster is used only during the election of a new shard leader on the Source cluster. If a shard leader goes down on the Source cluster and a new leader is elected, the new leader will contact the Target cluster to retrieve the last update checkpoint and instantiate its ephemeral pointer. On such a request, the Target cluster will retrieve the latest identifier received across all the shards, and send it back to the Source cluster. To retrieve the latest identifier, every shard leader will look up the identifier of the first entry in its Update Logs and send it back to a coordinator. The coordinator will have to select the highest among them.
|
||||||
|
|
||||||
|
This strategy does not require any additional network traffic and ensures reliable pointer synchronization. Consistency is principally achieved by leveraging SolrCloud. The update workflow of SolrCloud ensures that every update is applied to the leader and also to any of the replicas. If the leader goes down, a new leader is elected. During the leader election, a synchronization is performed between the new leader and the other replicas. This ensures that the new leader has a consistent Update Logs with the previous leader. Having a consistent Update Logs means that:
|
||||||
|
|
||||||
|
* On the Source cluster, the update checkpoint can be reused by the new leader.
|
||||||
|
* On the Target cluster, the update checkpoint will be consistent between the previous and new leader. This ensures the correctness of the update checkpoint sent by a newly elected leader from the Target cluster.
|
||||||
|
|
||||||
|
=== Maintenance of Update Logs
|
||||||
|
|
||||||
|
The CDCR replication logic requires modification to the maintenance logic of Update Logs on the Source data center. Initially, the Update Logs acts as a fixed size queue, limited to 100 update entries by default. In CDCR, the Update Logs must act as a queue of variable size as they need to keep track of all the updates up through the last processed update by the Target data center. Entries in the Update Logs are removed only when all pointers (one pointer per Target data center) are after them.
|
||||||
|
|
||||||
|
If the communication with one of the Target data center is slow, the Update Logs on the Source data center can grow to a substantial size. In such a scenario, it is necessary for the Update Logs to be able to efficiently find a given update operation given its identifier. Given that its identifier is an incremental number, it is possible to implement an efficient search strategy. Each transaction log file contains as part of its filename the version number of the first element. This is used to quickly traverse all the transaction log files and find the transaction log file containing one specific version number.
|
||||||
|
|
||||||
|
=== Monitoring Operations
|
||||||
|
|
||||||
|
CDCR provides the following monitoring capabilities over the replication operations:
|
||||||
|
|
||||||
|
* Monitoring of the outgoing and incoming replications, with information such as the Source and Target nodes, their status, etc.
|
||||||
|
* Statistics about the replication, with information such as operations (add/delete) per second, number of documents in the queue, etc.
|
||||||
|
|
||||||
|
Information about the lifecycle and statistics will be provided on a per-shard basis by the CDC Replicator thread. The CDCR API can then aggregate this information an a collection level.
|
||||||
|
|
||||||
|
=== CDC Replicator
|
||||||
|
|
||||||
|
The CDC Replicator is a background thread that is responsible for replicating updates from a Source data center to one or more Target data centers. It is responsible for providing monitoring information on a per-shard basis. As there can be a large number of collections and shards in a cluster, we will use a fixed-size pool of CDC Replicator threads that will be shared across shards.
|
||||||
|
|
||||||
|
=== CDCR Limitations
|
||||||
|
|
||||||
|
The current design of CDCR has some limitations. CDCR will continue to evolve over time and many of these limitations will be addressed. Among them are:
|
||||||
|
|
||||||
|
* CDCR is unlikely to be satisfactory for bulk-load situations where the update rate is high, especially if the bandwidth between the Source and Target clusters is restricted. In this scenario, the initial bulk load should be performed, the Source and Target data centers synchronized and CDCR be utilized for incremental updates.
|
||||||
|
* CDCR works most robustly with the same number of shards in the Source and Target collection. The shards in the two collections may have different numbers of replicas.
|
||||||
|
* Running CDCR with the indexes on HDFS is not currently supported, see the https://issues.apache.org/jira/browse/SOLR-9861[Solr CDCR over HDFS] JIRA issue.
|
||||||
|
* Configuration files (`solrconfig.xml`, `managed-schema`, etc.) are not automatically synchronized between the Source and Target clusters. This means that when the Source schema or `solrconfig.xml` files are changed, those changes must be replicated manually to the Target cluster. This includes adding fields by the <<schema-api.adoc#schema-api,Schema API>> or <<managed-resources.adoc#managed-resources,Managed Resources>> as well as hand editing those files.
|
|
@ -0,0 +1,342 @@
|
||||||
|
= CDCR Configuration
|
||||||
|
// Licensed to the Apache Software Foundation (ASF) under one
|
||||||
|
// or more contributor license agreements. See the NOTICE file
|
||||||
|
// distributed with this work for additional information
|
||||||
|
// regarding copyright ownership. The ASF licenses this file
|
||||||
|
// to you under the Apache License, Version 2.0 (the
|
||||||
|
// "License"); you may not use this file except in compliance
|
||||||
|
// with the License. You may obtain a copy of the License at
|
||||||
|
//
|
||||||
|
// http://www.apache.org/licenses/LICENSE-2.0
|
||||||
|
//
|
||||||
|
// Unless required by applicable law or agreed to in writing,
|
||||||
|
// software distributed under the License is distributed on an
|
||||||
|
// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
|
||||||
|
// KIND, either express or implied. See the License for the
|
||||||
|
// specific language governing permissions and limitations
|
||||||
|
// under the License.
|
||||||
|
|
||||||
|
The Source and Target configurations differ in the case of the data centers being in separate clusters. "Cluster" here means separate ZooKeeper ensembles controlling disjoint Solr instances. Whether these data centers are physically separated or not is immaterial for this discussion.
|
||||||
|
|
||||||
|
As described in the section <<cdcr-architecture.adoc#cdcr-architecture,CDCR Architecture>>, two approaches are supported: uni-directional updates and bi-directional updates.
|
||||||
|
|
||||||
|
All CDCR configuration is done in the `solrconfig.xml` file. Because this is a per-collection configuration file, all CDCR configuration is done for each collection.
|
||||||
|
|
||||||
|
== Uni-Directional Updates
|
||||||
|
|
||||||
|
=== Source Configuration
|
||||||
|
|
||||||
|
Here is a sample of a Source configuration file, a section in `solrconfig.xml`. The presence of the `<replica>` section causes CDCR to use this cluster as the Source and it should not be present in the Target collections. Details about each setting are after the two examples. The source example has buffering disabled, the default is enabled:
|
||||||
|
|
||||||
|
[source,xml]
|
||||||
|
----
|
||||||
|
<updateRequestProcessorChain name="cdcr-processor-chain">
|
||||||
|
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
||||||
|
<processor class="solr.RunUpdateProcessorFactory"/>
|
||||||
|
</updateRequestProcessorChain>
|
||||||
|
|
||||||
|
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
||||||
|
<lst name="defaults">
|
||||||
|
<str name="update.chain">cdcr-processor-chain</str>
|
||||||
|
</lst>
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
||||||
|
<lst name="replica">
|
||||||
|
<str name="zkHost">10.240.18.211:2181,10.240.18.212:2181</str>
|
||||||
|
<!--
|
||||||
|
If you have chrooted your Solr information at the target you must include the chroot, for example:
|
||||||
|
<str name="zkHost">10.240.18.211:2181,10.240.18.212:2181/solr</str>
|
||||||
|
-->
|
||||||
|
<str name="source">collection1</str>
|
||||||
|
<str name="target">collection1</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="replicator">
|
||||||
|
<str name="threadPoolSize">8</str>
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
<str name="batchSize">128</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="updateLogSynchronizer">
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<!-- Modify the <updateLog> section of your existing <updateHandler>
|
||||||
|
in your config as below -->
|
||||||
|
<updateHandler class="solr.DirectUpdateHandler2">
|
||||||
|
<updateLog class="solr.CdcrUpdateLog">
|
||||||
|
<str name="dir">${solr.ulog.dir:}</str>
|
||||||
|
<!--Any parameters from the original <updateLog> section -->
|
||||||
|
</updateLog>
|
||||||
|
|
||||||
|
<!-- Other configuration options such as autoCommit should still be present -->
|
||||||
|
</updateHandler>
|
||||||
|
----
|
||||||
|
|
||||||
|
=== Target Configuration
|
||||||
|
|
||||||
|
Here is a typical Target configuration.
|
||||||
|
|
||||||
|
Target instance must configure an update processor chain that is specific to CDCR. The update processor chain must include the `CdcrUpdateProcessorFactory`. The task of this processor is to ensure that the version numbers attached to update requests coming from a CDCR Source SolrCloud are reused and not overwritten by the Target. A properly configured Target configuration looks similar to this:
|
||||||
|
|
||||||
|
[source,xml]
|
||||||
|
----
|
||||||
|
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
||||||
|
<!-- recommended for Target clusters -->
|
||||||
|
<lst name="buffer">
|
||||||
|
<str name="defaultState">disabled</str>
|
||||||
|
</lst>
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
||||||
|
<lst name="defaults">
|
||||||
|
<str name="update.chain">cdcr-processor-chain</str>
|
||||||
|
</lst>
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<updateRequestProcessorChain name="cdcr-processor-chain">
|
||||||
|
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
||||||
|
<processor class="solr.RunUpdateProcessorFactory"/>
|
||||||
|
</updateRequestProcessorChain>
|
||||||
|
|
||||||
|
<!-- Modify the <updateLog> section of your existing <updateHandler> in your
|
||||||
|
config as below -->
|
||||||
|
<updateHandler class="solr.DirectUpdateHandler2">
|
||||||
|
<updateLog class="solr.CdcrUpdateLog">
|
||||||
|
<str name="dir">${solr.ulog.dir:}</str>
|
||||||
|
<!--Any parameters from the original <updateLog> section -->
|
||||||
|
</updateLog>
|
||||||
|
|
||||||
|
<!-- Other configuration options such as autoCommit should still be present -->
|
||||||
|
|
||||||
|
</updateHandler>
|
||||||
|
----
|
||||||
|
|
||||||
|
== Bi-Directional Updates
|
||||||
|
|
||||||
|
The configurations in both Cluster 1 and 2 are identical with respective `zkHost` string specified in each cluster's `solrconfig.xml`.
|
||||||
|
|
||||||
|
TIP: Both Cluster 1 and Cluster 2 can act as Source and Target at any given point of time but a cluster cannot be both Source and Target at the same time.
|
||||||
|
|
||||||
|
=== Cluster 1 Configuration
|
||||||
|
|
||||||
|
Here is a sample of a Cluster 1 configuration file, a section in `solrconfig.xml`. Cluster 2 `zkhost` string is specified in a `CdcrRequestHandler` declaration:
|
||||||
|
|
||||||
|
[source,xml]
|
||||||
|
----
|
||||||
|
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
||||||
|
<lst name="defaults">
|
||||||
|
<str name="update.chain">cdcr-processor-chain</str>
|
||||||
|
</lst>
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<updateRequestProcessorChain name="cdcr-processor-chain">
|
||||||
|
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
||||||
|
<processor class="solr.RunUpdateProcessorFactory"/>
|
||||||
|
</updateRequestProcessorChain>
|
||||||
|
|
||||||
|
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
||||||
|
<lst name="replica">
|
||||||
|
<str name="zkHost">10.240.19.241:2181,10.240.19.242:2181</str>
|
||||||
|
<!--
|
||||||
|
If you have chrooted your Solr information at the target you must include the chroot, for example:
|
||||||
|
<str name="zkHost">10.240.19.241:2181,10.240.19.242:2181/solr</str>
|
||||||
|
-->
|
||||||
|
<str name="source">collection1</str>
|
||||||
|
<str name="target">collection1</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="replicator">
|
||||||
|
<str name="threadPoolSize">8</str>
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
<str name="batchSize">128</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="updateLogSynchronizer">
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<!-- Modify the <updateLog> section of your existing <updateHandler>
|
||||||
|
in your config as below -->
|
||||||
|
<updateHandler class="solr.DirectUpdateHandler2">
|
||||||
|
<updateLog class="solr.CdcrUpdateLog">
|
||||||
|
<str name="dir">${solr.ulog.dir:}</str>
|
||||||
|
<!--Any parameters from the original <updateLog> section -->
|
||||||
|
</updateLog>
|
||||||
|
</updateHandler>
|
||||||
|
----
|
||||||
|
|
||||||
|
=== Cluster 2 Configuration
|
||||||
|
|
||||||
|
The configuration of the 2nd cluster is identical to the configuration of Cluster 1, with the Cluster 1 `zkHost` string specified in `CdcrRequestHandler` definition:
|
||||||
|
|
||||||
|
[source,xml]
|
||||||
|
----
|
||||||
|
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
||||||
|
<lst name="defaults">
|
||||||
|
<str name="update.chain">cdcr-processor-chain</str>
|
||||||
|
</lst>
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<updateRequestProcessorChain name="cdcr-processor-chain">
|
||||||
|
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
||||||
|
<processor class="solr.RunUpdateProcessorFactory"/>
|
||||||
|
</updateRequestProcessorChain>
|
||||||
|
|
||||||
|
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
||||||
|
<lst name="replica">
|
||||||
|
<str name="zkHost">10.250.18.211:2181,10.250.18.212:2181</str>
|
||||||
|
<!--
|
||||||
|
If you have chrooted your Solr information at the target you must include the chroot, for example:
|
||||||
|
<str name="zkHost">10.250.18.211:2181,10.250.18.212:2181/solr</str>
|
||||||
|
-->
|
||||||
|
<str name="source">collection1</str>
|
||||||
|
<str name="target">collection1</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="replicator">
|
||||||
|
<str name="threadPoolSize">8</str>
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
<str name="batchSize">128</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
<lst name="updateLogSynchronizer">
|
||||||
|
<str name="schedule">1000</str>
|
||||||
|
</lst>
|
||||||
|
|
||||||
|
</requestHandler>
|
||||||
|
|
||||||
|
<!-- Modify the <updateLog> section of your existing <updateHandler>
|
||||||
|
in your config as below -->
|
||||||
|
<updateHandler class="solr.DirectUpdateHandler2">
|
||||||
|
<updateLog class="solr.CdcrUpdateLog">
|
||||||
|
<str name="dir">${solr.ulog.dir:}</str>
|
||||||
|
<!--Any parameters from the original <updateLog> section -->
|
||||||
|
</updateLog>
|
||||||
|
</updateHandler>
|
||||||
|
----
|
||||||
|
|
||||||
|
== CDCR Configuration Parameters
|
||||||
|
|
||||||
|
The configuration details, defaults and options are as follows:
|
||||||
|
|
||||||
|
=== The Replica Element
|
||||||
|
|
||||||
|
CDCR can be configured to forward update requests to one or more Target collections. A Target collection is defined with a “replica” list as follows:
|
||||||
|
|
||||||
|
`zkHost`::
|
||||||
|
The host address for ZooKeeper of the Target SolrCloud. Usually this is a comma-separated list of addresses to each node in the Target ZooKeeper ensemble. This parameter is required.
|
||||||
|
|
||||||
|
`Source`::
|
||||||
|
The name of the collection on the Source SolrCloud to be replicated. This parameter is required.
|
||||||
|
|
||||||
|
`Target`::
|
||||||
|
The name of the collection on the Target SolrCloud to which updates will be forwarded. This parameter is required.
|
||||||
|
|
||||||
|
=== The Replicator Element
|
||||||
|
|
||||||
|
The CDC Replicator is the component in charge of forwarding updates to the replicas. The replicator will monitor the update logs of the Source collection and will forward any new updates to the Target collection.
|
||||||
|
|
||||||
|
The replicator uses a fixed thread pool to forward updates to multiple replicas in parallel. If more than one replica is configured, one thread will forward a batch of updates from one replica at a time in a round-robin fashion. The replicator can be configured with a “replicator” list as follows:
|
||||||
|
|
||||||
|
`threadPoolSize`::
|
||||||
|
The number of threads to use for forwarding updates. One thread per replica is recommended. The default is `2`.
|
||||||
|
|
||||||
|
`schedule`::
|
||||||
|
The delay in milliseconds for the monitoring the update log(s). The default is `10`.
|
||||||
|
|
||||||
|
`batchSize`::
|
||||||
|
The number of updates to send in one batch. The optimal size depends on the size of the documents. Large batches of large documents can increase your memory usage significantly. The default is `128`.
|
||||||
|
|
||||||
|
=== The updateLogSynchronizer Element
|
||||||
|
|
||||||
|
Expert: Non-leader nodes need to synchronize their update logs with their leader node from time to time in order to clean deprecated transaction log files. By default, such a synchronization process is performed every minute. The schedule of the synchronization can be modified with a “updateLogSynchronizer” list as follows:
|
||||||
|
|
||||||
|
TIP: If the updateLogSynchronizer element is omitted from the Source cluster, transaction logs may accumulate on non-leaders.
|
||||||
|
|
||||||
|
`schedule`::
|
||||||
|
The delay in milliseconds for synchronizing the update logs. The default is `60000`.
|
||||||
|
|
||||||
|
=== The Buffer Element
|
||||||
|
|
||||||
|
When buffering updates, the update logs will store all the updates indefinitely. It is best to disable buffering on both the Source and Target clusters during normal operation as when buffering is enabled the Update Logs will grow without limit. Enbling buffering is intended for special maintenance periods. Buffering can be disabled at startup with a “buffer” list and the parameter “defaultState” as follows:
|
||||||
|
|
||||||
|
`defaultState`::
|
||||||
|
The state of the buffer at startup. The default is `enabled`.
|
||||||
|
|
||||||
|
[TIP]
|
||||||
|
.Buffering should be enabled only for maintenance windows
|
||||||
|
====
|
||||||
|
Buffering is designed to augment maintenance windows. The following points should be kept in mind:
|
||||||
|
|
||||||
|
* When buffering is enabled, the Update Logs will grow without limit; they will never be purged.
|
||||||
|
* During normal operation, the Update Logs will automatically accrue on the Source data center if the Target data center is unavailable; It is not necessary to enable buffering for CDCR to handle routine network disruptions.
|
||||||
|
** For this reason, monitoring disk usage on the Source data center is recommended as an additional check that the Target data center is receiving updates.
|
||||||
|
* For uni-directional updates, buffering should _not_ be enabled on the Target data center as Update Logs would accrue without limit.
|
||||||
|
* If buffering is enabled and then disabled, the Update Logs will be removed when their contents have been sent to the Target data center. This process may take some time and is triggered by additional updates the Source cluster.
|
||||||
|
** Update Log cleanup is not triggered until a new update is sent to the Source data center.
|
||||||
|
====
|
||||||
|
|
||||||
|
== Initial Startup
|
||||||
|
|
||||||
|
.CDCR Bootstrapping
|
||||||
|
[TIP]
|
||||||
|
====
|
||||||
|
Solr 6.2 added the functionality to allow CDCR to replicate the entire index from the Source to the Target data centers on first time startup as an alternative to the following procedure. For very large indexes, time should be allocated for the initial synchronization if this option is chosen.
|
||||||
|
====
|
||||||
|
|
||||||
|
This is a general approach for initializing CDCR in a production environment based upon an approach taken by the initial working installation of CDCR and generously contributed to illustrate a "real world" scenario.
|
||||||
|
|
||||||
|
|
||||||
|
* Customer uses the CDCR approach to keep a remote disaster-recovery instance available for production backup. This is a uni-directional solution.
|
||||||
|
* Customer has 26 clouds with 200 million assets per cloud (15GB indexes). Total document count is over 4.8 billion.
|
||||||
|
** Source and Target clouds were synched in 2-3 hour maintenance windows to establish the base index for the Targets.
|
||||||
|
|
||||||
|
As usual, it is good to start small. Sync a single cloud and monitor for a period of time before doing the others. You may need to adjust your settings several times before finding the right balance.
|
||||||
|
|
||||||
|
* Before starting, stop or pause the indexers. This is best done during a small maintenance window.
|
||||||
|
* Stop the SolrCloud instances at the Source.
|
||||||
|
* Upload the modified `solrconfig.xml` to ZooKeeper on both Source and Target as appropriate, see the examples above.
|
||||||
|
* Sync the index directories from the Source collection to Target collection across to the corresponding shard nodes. `rsync` works well for this.
|
||||||
|
+
|
||||||
|
For example, if there are 2 shards on collection1 with 2 replicas for each shard, copy the corresponding index directories from:
|
||||||
|
+
|
||||||
|
[width="75%",cols="45,10,45"]
|
||||||
|
|===
|
||||||
|
|shard1replica1Source |to |shard1replica1Target
|
||||||
|
|shard1replica2Source |to |shard1replica2Target
|
||||||
|
|shard2replica1Source |to |shard2replica1Target
|
||||||
|
|shard2replica2Source |to |shard2replica2Target
|
||||||
|
|===
|
||||||
|
|
||||||
|
* Start the ZooKeeper on the Target (DR) side.
|
||||||
|
* Start the SolrCloud on the Target (DR) side.
|
||||||
|
* Start the ZooKeeper on the Source side.
|
||||||
|
* Start the SolrCloud on the Source side. As a general rule, the Target (DR) side of the SolrCloud should be started before the Source side.
|
||||||
|
* Activate the CDCR on Source instance using the CDCR API:
|
||||||
|
+
|
||||||
|
[source,text]
|
||||||
|
http://host:port/solr/<collection_name>/cdcr?action=START
|
||||||
|
+
|
||||||
|
There is no need to run the `/cdcr?action=START` command on the Target.
|
||||||
|
* Disable the buffer on the Target and Source:
|
||||||
|
+
|
||||||
|
[source,text]
|
||||||
|
http://host:port/solr/collection_name/cdcr?action=DISABLEBUFFER
|
||||||
|
+
|
||||||
|
* Re-enable indexing.
|
||||||
|
|
||||||
|
== ZooKeeper Settings
|
||||||
|
|
||||||
|
With CDCR, the Target ZooKeepers will have connections from the Target clouds and the Source clouds. You may need to increase the `maxClientCnxns` setting in `zoo.cfg`.
|
||||||
|
|
||||||
|
[source,text]
|
||||||
|
----
|
||||||
|
## set numbers of connection to 800 from client
|
||||||
|
## is maxClientCnxns=0 that means no limit
|
||||||
|
maxClientCnxns=800
|
||||||
|
----
|
|
@ -0,0 +1,41 @@
|
||||||
|
= Cross Data Center Replication Operations
|
||||||
|
// Licensed to the Apache Software Foundation (ASF) under one
|
||||||
|
// or more contributor license agreements. See the NOTICE file
|
||||||
|
// distributed with this work for additional information
|
||||||
|
// regarding copyright ownership. The ASF licenses this file
|
||||||
|
// to you under the Apache License, Version 2.0 (the
|
||||||
|
// "License"); you may not use this file except in compliance
|
||||||
|
// with the License. You may obtain a copy of the License at
|
||||||
|
//
|
||||||
|
// http://www.apache.org/licenses/LICENSE-2.0
|
||||||
|
//
|
||||||
|
// Unless required by applicable law or agreed to in writing,
|
||||||
|
// software distributed under the License is distributed on an
|
||||||
|
// "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
|
||||||
|
// KIND, either express or implied. See the License for the
|
||||||
|
// specific language governing permissions and limitations
|
||||||
|
// under the License.
|
||||||
|
|
||||||
|
== Monitoring
|
||||||
|
|
||||||
|
. Network and disk space monitoring are essential. Ensure that the system has plenty of available storage to queue up changes if there is a disconnect between the Source and Target. A network outage between the two data centers can cause your disk usage to grow. Some tips:
|
||||||
|
.. Set a monitor for your disks to send alerts when the disk gets over a certain percentage (e.g., 70%).
|
||||||
|
.. Run a test. With moderate indexing, how long can the system queue changes before you run out of disk space?
|
||||||
|
. Create a simple way to check the counts between the Source and the Target.
|
||||||
|
.. Keep in mind that if indexing is running, the Source and Target may not match document for document. Set an alert to fire if the difference is greater than some percentage of the overall cloud size.
|
||||||
|
|
||||||
|
== Upgrading and Patching Production
|
||||||
|
|
||||||
|
When rolling in upgrades to your indexer or application, you should shutdown the Source and the Target. Depending on your setup, you may want to pause/stop indexing, deploy the release or patch, then re-enable indexing. Then start the Target last.
|
||||||
|
|
||||||
|
* There is no need to reissue the DISABLEBUFFERS or START commands. These are persisted.
|
||||||
|
* After starting the Target, run a simple test. Add a test document to each of the Source clouds. Then check for it on the Target.
|
||||||
|
|
||||||
|
[source,bash]
|
||||||
|
----
|
||||||
|
#send to the Source
|
||||||
|
curl http://<Source>/solr/cloud1/update -H 'Content-type:application/json' -d '[{"SKU":"ABC"}]'
|
||||||
|
|
||||||
|
#check the Target
|
||||||
|
curl "http://<Target>:8983/solr/<collection_name>/select?q=SKU:ABC&indent=true"
|
||||||
|
----
|
|
@ -1,4 +1,5 @@
|
||||||
= Cross Data Center Replication (CDCR)
|
= Cross Data Center Replication (CDCR)
|
||||||
|
:page-children: cdcr-architecture, cdcr-config, cdcr-operations, cdcr-api
|
||||||
// Licensed to the Apache Software Foundation (ASF) under one
|
// Licensed to the Apache Software Foundation (ASF) under one
|
||||||
// or more contributor license agreements. See the NOTICE file
|
// or more contributor license agreements. See the NOTICE file
|
||||||
// distributed with this work for additional information
|
// distributed with this work for additional information
|
||||||
|
@ -18,38 +19,27 @@
|
||||||
|
|
||||||
Cross Data Center Replication (CDCR) allows you to create multiple SolrCloud data centers and keep them in sync.
|
Cross Data Center Replication (CDCR) allows you to create multiple SolrCloud data centers and keep them in sync.
|
||||||
|
|
||||||
The <<solrcloud.adoc#solrcloud,SolrCloud>> architecture is designed to support <<near-real-time-searching.adoc#near-real-time-searching,Near Real Time Searching>> (NRT) searches on a Solr collection usually consisting of multiple nodes in a single data center. "CDCR" augments this model by forwarding updates from a Solr collection in one data center to a parallel Solr collection in another data center where the network latencies are greater than the SolrCloud model was designed to accommodate.
|
|
||||||
|
|
||||||
== What is CDCR?
|
== What is CDCR?
|
||||||
|
|
||||||
CDCR supports replicating data from one data center to multiple data centers. The initial version of the solution supports a uni-directional scenario where data updates are replicated from a Source data center to one or more Target data centers.
|
The <<solrcloud.adoc#solrcloud,SolrCloud>> architecture is designed to support <<near-real-time-searching.adoc#near-real-time-searching,Near Real Time (NRT)>> searches on a Solr collection that usual consists of multiple nodes in a single data center. CDCR augments this model by forwarding updates from a Solr collection in one data center to a parallel Solr collection in another data center where the network latencies are greater than the SolrCloud model was designed to accommodate.
|
||||||
|
|
||||||
The Target data center(s) will not propagate updates such as adds, updates, or deletes to the Source data center and updates should _not_ be sent to any of the Target data center(s).
|
For more information about CDCR, see the following sections:
|
||||||
|
|
||||||
Source and Target data centers can serve search queries when CDCR is operating. The Target data centers will lag somewhat behind the Source cluster due to propagation delays.
|
* *<<cdcr-architecture.adoc#cdcr-architecture,CDCR Architecture>>*: A detailed overview of how CDCR works.
|
||||||
|
* *<<cdcr-config.adoc#cdcr-config,CDCR Configuration>>*: How to set up and initialize CDCR for your cluster.
|
||||||
|
* *<<cdcr-operations.adoc#cdcr-operations,CDCR Operations>>*: Information on monitoring CDCR and upgrading your cluster when using CDCR.
|
||||||
|
* *<<cdcr-api.adoc#cdcr-api,CDCR API>>*: Reference for the CDCR API.
|
||||||
|
|
||||||
Data changes on the Source data center are replicated to the Target data center only after they are persisted to disk. The data changes can be replicated in near real-time (with a small delay) or could be scheduled to be sent at longer intervals to the Target data center. CDCR can "bootstrap" the collection to the Target data center. Since this is a full copy of the entire index, network bandwidth should be considered. Of course both Source and Target collections may be empty to start.
|
|
||||||
|
|
||||||
Each shard leader in the Source data center will be responsible for replicating its updates to the corresponding leader in the Target data center. When receiving updates from the Source data center, shard leaders in the Target data center will replicate the changes to their own replicas as normal SolrCloud updates.
|
|
||||||
|
|
||||||
This replication model is designed to tolerate some degradation in connectivity, accommodate limited bandwidth, and support batch updates to optimize communication.
|
|
||||||
|
|
||||||
Replication supports both a new empty index and pre-built indexes. In the scenario where the replication is set up on a pre-built index in the Source cluster and nothing on the Target cluster, CDCR will replicate the _entire_ index from the Source to Target. This functionality was added in Solr 6.2.
|
|
||||||
|
|
||||||
The uni-directional nature of the initial implementation implies a "push" model from the Source collection to the Target collection. Therefore, the Source configuration must be able to "see" the ZooKeeper ensemble in the Target cluster. The ZooKeeper ensemble is provided configured in the Source's `solrconfig.xml` file.
|
|
||||||
|
|
||||||
CDCR is configured to replicate from collections in the Source cluster to collections in the Target cluster on a collection-by-collection basis. Since CDCR is configured in `solrconfig.xml` (on both Source and Target clusters), the settings can be tailored for the needs of each collection.
|
|
||||||
|
|
||||||
CDCR can be configured to replicate from one collection to a second collection _within the same cluster_. That is a specialized scenario not covered in this document.
|
|
||||||
|
|
||||||
|
// Are there any terms here that are new? If not, I think we should remove this.
|
||||||
== CDCR Glossary
|
== CDCR Glossary
|
||||||
|
|
||||||
Terms used in this document include:
|
For the purposes of discussing CDCR, the following terminology is used. If you are already familiar with SolrCloud, many of these terms will already be familiar to you.
|
||||||
|
|
||||||
[glossary]
|
[glossary]
|
||||||
Node:: A JVM instance running Solr; a server.
|
Node:: A JVM instance running Solr; a server.
|
||||||
Cluster:: A set of Solr nodes managed as a single unit by a ZooKeeper ensemble hosting one or more Collections.
|
Cluster:: A set of Solr nodes managed as a single unit by a ZooKeeper ensemble hosting one or more Collections.
|
||||||
Data Center:: A group of networked servers hosting a Solr cluster. In this document, the terms _Cluster_ and _Data Center_ are interchangeable as we assume that each Solr cluster is hosted in a different group of networked servers.
|
Data Center:: A group of networked servers hosting a Solr cluster. For CDCR, the terms _Cluster_ and _Data Center_ are interchangeable as we assume that each Solr cluster is hosted in a different group of networked servers.
|
||||||
Shard:: A sub-index of a single logical collection. This may be spread across multiple nodes of the cluster. Each shard can have 1-N replicas.
|
Shard:: A sub-index of a single logical collection. This may be spread across multiple nodes of the cluster. Each shard can have 1-N replicas.
|
||||||
Leader:: Each shard has replica identified as its leader. All the writes for documents belonging to a shard are routed through the leader.
|
Leader:: Each shard has replica identified as its leader. All the writes for documents belonging to a shard are routed through the leader.
|
||||||
Replica:: A copy of a shard for use in failover or load balancing. Replicas comprising a shard can either be leaders or non-leaders.
|
Replica:: A copy of a shard for use in failover or load balancing. Replicas comprising a shard can either be leaders or non-leaders.
|
||||||
|
@ -57,665 +47,3 @@ Follower:: A convenience term for a replica that is _not_ the leader of a shard.
|
||||||
Collection:: A logical index, consisting of one or more shards. A cluster can have multiple collections.
|
Collection:: A logical index, consisting of one or more shards. A cluster can have multiple collections.
|
||||||
Update:: An operation that changes the collection's index in any way. This could be adding a new document, deleting documents or changing a document.
|
Update:: An operation that changes the collection's index in any way. This could be adding a new document, deleting documents or changing a document.
|
||||||
Update Log(s):: An append-only log of write operations maintained by each node.
|
Update Log(s):: An append-only log of write operations maintained by each node.
|
||||||
|
|
||||||
== CDCR Architecture
|
|
||||||
|
|
||||||
Here is a picture of the data flow.
|
|
||||||
|
|
||||||
.CDCR Data Flow
|
|
||||||
image::images/cross-data-center-replication-cdcr-/CDCR_arch.png[image,width=700,height=525]
|
|
||||||
|
|
||||||
Updates and deletes are first written to the Source cluster, then forwarded to the Target cluster. The data flow sequence is:
|
|
||||||
|
|
||||||
. A shard leader receives a new update that is processed by its update processor chain.
|
|
||||||
. The data update is first applied to the local index.
|
|
||||||
. Upon successful application of the data update on the local index, the data update is added to the Update Logs queue.
|
|
||||||
. After the data update is persisted to disk, the data update is sent to the replicas within the data center.
|
|
||||||
. After Step 4 is successful, CDCR reads the data update from the Update Logs and pushes it to the corresponding collection in the Target data center. This is necessary in order to ensure consistency between the Source and Target data centers.
|
|
||||||
. The leader on the Target data center writes the data locally and forwards it to all its followers.
|
|
||||||
|
|
||||||
Steps 1, 2, 3 and 4 are performed synchronously by SolrCloud; Step 5 is performed asynchronously by a background thread. Given that CDCR replication is performed asynchronously, it becomes possible to push batch updates in order to minimize network communication overhead. Also, if CDCR is unable to push the update at a given time, for example, due to a degradation in connectivity, it can retry later without any impact on the Source data center.
|
|
||||||
|
|
||||||
One implication of the architecture is that the leaders in the Source cluster must be able to "see" the leaders in the Target cluster. Since leaders may change in both Source and Target collections, which means that all nodes in the Source cluster must be able to "see" all Solr nodes in the Target cluster so firewalls, ACL rules, etc., must be configured to allow this.
|
|
||||||
|
|
||||||
The current design works most robustly if both the Source and Target clusters have the same number of shards. There is no requirement that the shards in the Source and Target collection have the same number of replicas.
|
|
||||||
|
|
||||||
Having different numbers of shards on the Source and Target cluster is possible, but is also an "expert" configuration as that option imposes certain constraints and is not generally recommended. Most of the scenarios where having differing numbers of shards are contemplated are better accomplished by hosting multiple shards on each Solr instance.
|
|
||||||
|
|
||||||
== Major Components of CDCR
|
|
||||||
|
|
||||||
There are a number of key features and components in CDCR’s architecture:
|
|
||||||
|
|
||||||
=== CDCR Configuration
|
|
||||||
|
|
||||||
In order to configure CDCR, the Source data center requires the host address of the ZooKeeper cluster associated with the Target data center. The ZooKeeper host address is the only information needed by CDCR to instantiate the communication with the Target Solr cluster. The CDCR configuration section of `solrconfig.xml` file on the Source cluster will therefore contain a list of ZooKeeper hosts. The CDCR configuration section of `solrconfig.xml` might also contain secondary/optional configuration, such as the number of CDC Replicator threads, batch updates related settings, etc.
|
|
||||||
|
|
||||||
=== CDCR Initialization
|
|
||||||
|
|
||||||
CDCR supports incremental updates to either new or existing collections. CDCR may not be able to keep up with very high volume updates, especially if there are significant communications latencies due to a slow "pipe" between the data centers. Some scenarios:
|
|
||||||
|
|
||||||
* There is an initial bulk load of a corpus followed by lower volume incremental updates. In this case, one can do the initial bulk load and then enable CDCR. See the section <<Initial Startup>> for more information.
|
|
||||||
* The index is being built up from scratch, without a significant initial bulk load. CDCR can be set up on empty collections and keep them synchronized from the start.
|
|
||||||
* The index is always being updated at a volume too high for CDCR to keep up. This is especially possible in situations where the connection between the Source and Target data centers is poor. This scenario is unsuitable for CDCR in its current form.
|
|
||||||
|
|
||||||
=== Inter-Data Center Communication
|
|
||||||
|
|
||||||
The CDCR REST API is the primary form of end-user communication for admin commands. A SolrJ client is used internally for CDCR operations. The SolrJ client gets its configuration information from the `solrconfig.xml` file. Users of CDCR will not interact directly with the internal SolrJ implementation and will interact with CDCR exclusively through the REST API.
|
|
||||||
|
|
||||||
=== Updates Tracking & Pushing
|
|
||||||
|
|
||||||
CDCR replicates data updates from the Source to the Target data center by leveraging the Update Logs.
|
|
||||||
|
|
||||||
A background thread regularly checks the Update Logs for new entries, and then forwards them to the Target data center. The thread therefore needs to keep a checkpoint in the form of a pointer to the last update successfully processed in the Update Logs. Upon acknowledgement from the Target data center that updates have been successfully processed, the Update Logs pointer is updated to reflect the current checkpoint.
|
|
||||||
|
|
||||||
This pointer must be synchronized across all the replicas. In the case where the leader goes down and a new leader is elected, the new leader will be able to resume replication from the last update by using this synchronized pointer. The strategy to synchronize such a pointer across replicas will be explained next.
|
|
||||||
|
|
||||||
If for some reason, the Target data center is offline or fails to process the updates, the thread will periodically try to contact the Target data center and push the updates while buffering updates on the Source cluster. One implication of this is that the Source Update Logs directory should be periodically monitored as the updates will continue to accumulate amd will not be purged until the connection to the Target data center is restored.
|
|
||||||
|
|
||||||
=== Synchronization of Update Checkpoints
|
|
||||||
|
|
||||||
A reliable synchronization of the update checkpoints between the shard leader and shard replicas is critical to avoid introducing inconsistency between the Source and Target data centers. Another important requirement is that the synchronization must be performed with minimal network traffic to maximize scalability.
|
|
||||||
|
|
||||||
In order to achieve this, the strategy is to:
|
|
||||||
|
|
||||||
* Uniquely identify each update operation. This unique identifier will serve as pointer.
|
|
||||||
* Rely on two storages: an ephemeral storage on the Source shard leader, and a persistent storage on the Target cluster.
|
|
||||||
|
|
||||||
The shard leader in the Source cluster will be in charge of generating a unique identifier for each update operation, and will keep a copy of the identifier of the last processed updates in memory. The identifier will be sent to the Target cluster as part of the update request. On the Target data center side, the shard leader will receive the update request, store it along with the unique identifier in the Update Logs, and replicate it to the other shards.
|
|
||||||
|
|
||||||
SolrCloud already provides a unique identifier for each update operation, i.e., a “version” number. This version number is generated using a time-based lmport clock which is incremented for each update operation sent. This provides an “happened-before” ordering of the update operations that will be leveraged in (1) the initialization of the update checkpoint on the Source cluster, and in (2) the maintenance strategy of the Update Logs.
|
|
||||||
|
|
||||||
The persistent storage on the Target cluster is used only during the election of a new shard leader on the Source cluster. If a shard leader goes down on the Source cluster and a new leader is elected, the new leader will contact the Target cluster to retrieve the last update checkpoint and instantiate its ephemeral pointer. On such a request, the Target cluster will retrieve the latest identifier received across all the shards, and send it back to the Source cluster. To retrieve the latest identifier, every shard leader will look up the identifier of the first entry in its Update Logs and send it back to a coordinator. The coordinator will have to select the highest among them.
|
|
||||||
|
|
||||||
This strategy does not require any additional network traffic and ensures reliable pointer synchronization. Consistency is principally achieved by leveraging SolrCloud. The update workflow of SolrCloud ensures that every update is applied to the leader and also to any of the replicas. If the leader goes down, a new leader is elected. During the leader election, a synchronization is performed between the new leader and the other replicas. This ensures that the new leader has a consistent Update Logs with the previous leader. Having a consistent Update Logs means that:
|
|
||||||
|
|
||||||
* On the Source cluster, the update checkpoint can be reused by the new leader.
|
|
||||||
* On the Target cluster, the update checkpoint will be consistent between the previous and new leader. This ensures the correctness of the update checkpoint sent by a newly elected leader from the Target cluster.
|
|
||||||
|
|
||||||
=== Maintenance of Update Logs
|
|
||||||
|
|
||||||
The CDCR replication logic requires modification to the maintenance logic of the Update Logs on the Source data center. Initially, the Update Logs acts as a fixed size queue, limited to 100 update entries by default. In the CDCR scenario, the Update Logs must act as a queue of variable size as they need to keep track of all the updates up through the last processed update by the Target data center. Entries in the Update Logs are removed only when all pointers (one pointer per Target data center) are after them.
|
|
||||||
|
|
||||||
If the communication with one of the Target data center is slow, the Update Logs on the Source data center can grow to a substantial size. In such a scenario, it is necessary for the Update Logs to be able to efficiently find a given update operation given its identifier. Given that its identifier is an incremental number, it is possible to implement an efficient search strategy. Each transaction log file contains as part of its filename the version number of the first element. This is used to quickly traverse all the transaction log files and find the transaction log file containing one specific version number.
|
|
||||||
|
|
||||||
=== Monitoring
|
|
||||||
|
|
||||||
CDCR provides the following monitoring capabilities over the replication operations:
|
|
||||||
|
|
||||||
* Monitoring of the outgoing and incoming replications, with information such as the Source and Target nodes, their status, etc.
|
|
||||||
* Statistics about the replication, with information such as operations (add/delete) per second, number of documents in the queue, etc.
|
|
||||||
|
|
||||||
Information about the lifecycle and statistics will be provided on a per-shard basis by the CDC Replicator thread. The CDCR API can then aggregate this information an a collection level.
|
|
||||||
|
|
||||||
=== CDC Replicator
|
|
||||||
|
|
||||||
The CDC Replicator is a background thread that is responsible for replicating updates from a Source data center to one or more Target data centers. It is responsible in providing monitoring information on a per-shard basis. As there can be a large number of collections and shards in a cluster, we will use a fixed-size pool of CDC Replicator threads that will be shared across shards.
|
|
||||||
|
|
||||||
=== CDCR Limitations
|
|
||||||
|
|
||||||
The current design of CDCR has some limitations. CDCR will continue to evolve over time and many of these limitations will be addressed. Among them are:
|
|
||||||
|
|
||||||
* CDCR is unlikely to be satisfactory for bulk-load situations where the update rate is high, especially if the bandwidth between the Source and Target clusters is restricted. In this scenario, the initial bulk load should be performed, the Source and Target data centers synchronized and CDCR be utilized for incremental updates.
|
|
||||||
* CDCR is currently only uni-directional; data is pushed from the Source cluster to the Target cluster. There is active work being done in this area to remove this limitation.
|
|
||||||
* CDCR works most robustly with the same number of shards in the Source and Target collection. The shards in the two collections may have different numbers of replicas.
|
|
||||||
* Running CDCR with the indexes on HDFS is not currently supported, see the https://issues.apache.org/jira/browse/SOLR-9861[Solr CDCR over HDFS] JIRA issue.
|
|
||||||
* Configuration files `(solrconfig.xml, schema etc.)` are not automatically synchronized between the Source and Target clusters. This means that when the Source schema or `solrconfig.xml` files are changed, those changes must be replicated manually to the Target cluster. This includes adding fields by the <<schema-api.adoc#schema-api,Schema API>> or <<managed-resources.adoc#managed-resources,Managed Resources>> as well as hand editing those files.
|
|
||||||
|
|
||||||
== CDCR Configuration
|
|
||||||
|
|
||||||
The Source and Target configurations differ in the case of the data centers being in separate clusters. "Cluster" here means separate ZooKeeper ensembles controlling disjoint Solr instances. Whether these data centers are physically separated or not is immaterial for this discussion.
|
|
||||||
|
|
||||||
=== Source Configuration
|
|
||||||
|
|
||||||
Here is a sample of a Source configuration file, a section in `solrconfig.xml`. The presence of the <replica> section causes CDCR to use this cluster as the Source and should not be present in the Target collections. Details about each setting are after the two examples. The source example has buffering disabled, the default is enabled:
|
|
||||||
|
|
||||||
[source,xml]
|
|
||||||
----
|
|
||||||
<updateRequestProcessorChain name="cdcr-processor-chain">
|
|
||||||
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
|
||||||
<processor class="solr.RunUpdateProcessorFactory"/>
|
|
||||||
</updateRequestProcessorChain>
|
|
||||||
|
|
||||||
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
|
||||||
<lst name="defaults">
|
|
||||||
<str name="update.chain">cdcr-processor-chain</str>
|
|
||||||
</lst>
|
|
||||||
</requestHandler>
|
|
||||||
|
|
||||||
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
|
||||||
<lst name="replica">
|
|
||||||
<str name="zkHost">10.240.18.211:2181,10.240.18.212:2181</str>
|
|
||||||
<!--
|
|
||||||
If you have chrooted your Solr information at the target you must include the chroot, for example:
|
|
||||||
<str name="zkHost">10.240.18.211:2181,10.240.18.212:2181/solr</str>
|
|
||||||
-->
|
|
||||||
<str name="source">collection1</str>
|
|
||||||
<str name="target">collection1</str>
|
|
||||||
</lst>
|
|
||||||
|
|
||||||
<lst name="replicator">
|
|
||||||
<str name="threadPoolSize">8</str>
|
|
||||||
<str name="schedule">1000</str>
|
|
||||||
<str name="batchSize">128</str>
|
|
||||||
</lst>
|
|
||||||
|
|
||||||
<lst name="updateLogSynchronizer">
|
|
||||||
<str name="schedule">1000</str>
|
|
||||||
</lst>
|
|
||||||
|
|
||||||
<!-- optional -->
|
|
||||||
<lst name="buffer">
|
|
||||||
<str name="defaultState">DISABLED</str>
|
|
||||||
</lst>
|
|
||||||
|
|
||||||
</requestHandler>
|
|
||||||
|
|
||||||
<!-- Modify the <updateLog> section of your existing <updateHandler>
|
|
||||||
in your config as below -->
|
|
||||||
<updateHandler class="solr.DirectUpdateHandler2">
|
|
||||||
<updateLog class="solr.CdcrUpdateLog">
|
|
||||||
<str name="dir">${solr.ulog.dir:}</str>
|
|
||||||
<!--Any parameters from the original <updateLog> section -->
|
|
||||||
</updateLog>
|
|
||||||
|
|
||||||
<!-- Other configuration options such as autoCommit should still be present -->
|
|
||||||
</updateHandler>
|
|
||||||
----
|
|
||||||
|
|
||||||
=== Target Configuration
|
|
||||||
|
|
||||||
Here is a typical Target configuration.
|
|
||||||
|
|
||||||
Target instance must configure an update processor chain that is specific to CDCR. The update processor chain must include the *CdcrUpdateProcessorFactory*. The task of this processor is to ensure that the version numbers attached to update requests coming from a CDCR Source SolrCloud are reused and not overwritten by the Target. A properly configured Target configuration looks similar to this.
|
|
||||||
|
|
||||||
[source,xml]
|
|
||||||
----
|
|
||||||
<requestHandler name="/cdcr" class="solr.CdcrRequestHandler">
|
|
||||||
<!-- recommended for Target clusters -->
|
|
||||||
<lst name="buffer">
|
|
||||||
<str name="defaultState">disabled</str>
|
|
||||||
</lst>
|
|
||||||
</requestHandler>
|
|
||||||
|
|
||||||
<requestHandler name="/update" class="solr.UpdateRequestHandler">
|
|
||||||
<lst name="defaults">
|
|
||||||
<str name="update.chain">cdcr-processor-chain</str>
|
|
||||||
</lst>
|
|
||||||
</requestHandler>
|
|
||||||
|
|
||||||
<updateRequestProcessorChain name="cdcr-processor-chain">
|
|
||||||
<processor class="solr.CdcrUpdateProcessorFactory"/>
|
|
||||||
<processor class="solr.RunUpdateProcessorFactory"/>
|
|
||||||
</updateRequestProcessorChain>
|
|
||||||
|
|
||||||
<!-- Modify the <updateLog> section of your existing <updateHandler> in your
|
|
||||||
config as below -->
|
|
||||||
<updateHandler class="solr.DirectUpdateHandler2">
|
|
||||||
<updateLog class="solr.CdcrUpdateLog">
|
|
||||||
<str name="dir">${solr.ulog.dir:}</str>
|
|
||||||
<!--Any parameters from the original <updateLog> section -->
|
|
||||||
</updateLog>
|
|
||||||
|
|
||||||
<!-- Other configuration options such as autoCommit should still be present -->
|
|
||||||
|
|
||||||
</updateHandler>
|
|
||||||
----
|
|
||||||
|
|
||||||
=== Configuration Details
|
|
||||||
|
|
||||||
The configuration details, defaults and options are as follows:
|
|
||||||
|
|
||||||
==== The Replica Element
|
|
||||||
|
|
||||||
CDCR can be configured to forward update requests to one or more Target collections. A Target collection is defined with a “replica” list as follows:
|
|
||||||
|
|
||||||
`zkHost`::
|
|
||||||
The host address for ZooKeeper of the Target SolrCloud. Usually this is a comma-separated list of addresses to each node in the Target ZooKeeper ensemble. This parameter is required.
|
|
||||||
|
|
||||||
`Source`::
|
|
||||||
The name of the collection on the Source SolrCloud to be replicated. This parameter is required.
|
|
||||||
|
|
||||||
`Target`::
|
|
||||||
The name of the collection on the Target SolrCloud to which updates will be forwarded. This parameter is required.
|
|
||||||
|
|
||||||
==== The Replicator Element
|
|
||||||
|
|
||||||
The CDC Replicator is the component in charge of forwarding updates to the replicas. The replicator will monitor the update logs of the Source collection and will forward any new updates to the Target collection.
|
|
||||||
|
|
||||||
The replicator uses a fixed thread pool to forward updates to multiple replicas in parallel. If more than one replica is configured, one thread will forward a batch of updates from one replica at a time in a round-robin fashion. The replicator can be configured with a “replicator” list as follows:
|
|
||||||
|
|
||||||
`threadPoolSize`::
|
|
||||||
The number of threads to use for forwarding updates. One thread per replica is recommended. The default is `2`.
|
|
||||||
|
|
||||||
`schedule`::
|
|
||||||
The delay in milliseconds for the monitoring the update log(s). The default is `10`.
|
|
||||||
|
|
||||||
`batchSize`::
|
|
||||||
The number of updates to send in one batch. The optimal size depends on the size of the documents. Large batches of large documents can increase your memory usage significantly. The default is `128`.
|
|
||||||
|
|
||||||
==== The updateLogSynchronizer Element
|
|
||||||
|
|
||||||
Expert: Non-leader nodes need to synchronize their update logs with their leader node from time to time in order to clean deprecated transaction log files. By default, such a synchronization process is performed every minute. The schedule of the synchronization can be modified with a “updateLogSynchronizer” list as follows:
|
|
||||||
|
|
||||||
TIP: If the updateLogSynchronizer element is omitted from the Source cluster, transaction logs may accumulate on non-leaders.
|
|
||||||
|
|
||||||
`schedule`::
|
|
||||||
The delay in milliseconds for synchronizing the update logs. The default is `60000`.
|
|
||||||
|
|
||||||
==== The Buffer Element
|
|
||||||
|
|
||||||
When buffering updates, the update logs will store all the updates indefinitely. It is best to disable buffering on both the Source and Target clusters during normal operation as when buffering is enabled the Update Logs will grow without limit. Enbling buffering is intended for special maintenance periods. Buffering can be disabled at startup with a “buffer” list and the parameter “defaultState” as follows:
|
|
||||||
|
|
||||||
`defaultState`::
|
|
||||||
The state of the buffer at startup. The default is `enabled`.
|
|
||||||
|
|
||||||
[TIP]
|
|
||||||
.Buffering is should be enabled only for maintenance windows
|
|
||||||
====
|
|
||||||
Buffering is designed to augment maintenance windows. The following points should be kept in mind:
|
|
||||||
|
|
||||||
* When buffering is enabled, the Update Logs will grow without limit; they will never be purged.
|
|
||||||
* During normal operation, the Update Logs will automatically accrue on the Source data center if the Target data center is unavailable; It is not necessary to enable buffering for CDCR to handle routine network disruptions.
|
|
||||||
** For this reason, monitoring disk usage on the Source data center is recommended as an additional check that the Target data center is receiving updates.
|
|
||||||
* Buffering should _not_ be enabled on the Target data center as Update Logs would accrue without limit.
|
|
||||||
* If buffering is enabled then disabled, the Update Logs will be removed when their contents have been sent to the Target data center. This process may take some time and is triggered by additional updates the Source cluster.
|
|
||||||
** Update Log cleanup is not triggered until a new update is sent to the Source data center.
|
|
||||||
|
|
||||||
====
|
|
||||||
|
|
||||||
== CDCR API
|
|
||||||
|
|
||||||
The CDCR API is used to control and monitor the replication process. Control actions are performed at a collection level, i.e., by using the following base URL for API calls: `\http://localhost:8983/solr/<collection>/cdcr`.
|
|
||||||
|
|
||||||
Monitor actions are performed at a core level, i.e., by using the following base URL for API calls: `\http://localhost:8983/solr/<core>/cdcr`.
|
|
||||||
|
|
||||||
Currently, none of the CDCR API calls have parameters.
|
|
||||||
|
|
||||||
=== API Entry Points (Control)
|
|
||||||
|
|
||||||
* `<collection>/cdcr?action=STATUS`: <<CDCR STATUS,Returns the current state>> of CDCR.
|
|
||||||
* `<collection>/cdcr?action=START`: <<CDCR START,Starts CDCR>> replication
|
|
||||||
* `<collection>/cdcr?action=STOP`: <<CDCR STOP,Stops CDCR>> replication.
|
|
||||||
* `<collection>/cdcr?action=ENABLEBUFFER`: <<ENABLEBUFFER,Enables the buffering>> of updates.
|
|
||||||
* `<collection>/cdcr?action=DISABLEBUFFER`: <<DISABLEBUFFER,Disables the buffering>> of updates.
|
|
||||||
|
|
||||||
=== API Entry Points (Monitoring)
|
|
||||||
|
|
||||||
* `core/cdcr?action=QUEUES`: <<QUEUES,Fetches statistics about the queue>> for each replica and about the update logs.
|
|
||||||
* `core/cdcr?action=OPS`: <<OPS,Fetches statistics about the replication performance>> (operations per second) for each replica.
|
|
||||||
* `core/cdcr?action=ERRORS`: <<ERRORS,Fetches statistics and other information about replication errors>> for each replica.
|
|
||||||
|
|
||||||
=== Control Commands
|
|
||||||
|
|
||||||
==== CDCR STATUS
|
|
||||||
|
|
||||||
`/collection/cdcr?action=STATUS`
|
|
||||||
|
|
||||||
===== CDCR Status Example
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=STATUS
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader": {
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 0
|
|
||||||
},
|
|
||||||
"status": {
|
|
||||||
"process": "stopped",
|
|
||||||
"buffer": "enabled"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== ENABLEBUFFER
|
|
||||||
|
|
||||||
`/collection/cdcr?action=ENABLEBUFFER`
|
|
||||||
|
|
||||||
===== Enable Buffer Response
|
|
||||||
|
|
||||||
The status of the process and an indication of whether the buffer is enabled.
|
|
||||||
|
|
||||||
===== Enable Buffer Example
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=ENABLEBUFFER
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader": {
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 0
|
|
||||||
},
|
|
||||||
"status": {
|
|
||||||
"process": "started",
|
|
||||||
"buffer": "enabled"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== DISABLEBUFFER
|
|
||||||
|
|
||||||
`/collection/cdcr?action=DISABLEBUFFER`
|
|
||||||
|
|
||||||
===== Disable Buffer Response
|
|
||||||
|
|
||||||
The status of CDCR and an indication that the buffer is disabled.
|
|
||||||
|
|
||||||
===== Disable Buffer Example
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=DISABLEBUFFER
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader": {
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 0
|
|
||||||
},
|
|
||||||
"status": {
|
|
||||||
"process": "started",
|
|
||||||
"buffer": "disabled"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== CDCR START
|
|
||||||
|
|
||||||
`/collection/cdcr?action=START`
|
|
||||||
|
|
||||||
===== CDCR Start Response
|
|
||||||
|
|
||||||
Confirmation that CDCR is started and the status of buffering
|
|
||||||
|
|
||||||
===== CDCR Start Examples
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=START
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader": {
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 0
|
|
||||||
},
|
|
||||||
"status": {
|
|
||||||
"process": "started",
|
|
||||||
"buffer": "enabled"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== CDCR STOP
|
|
||||||
|
|
||||||
`/collection/cdcr?action=STOP`
|
|
||||||
|
|
||||||
===== CDCR Stop Response
|
|
||||||
|
|
||||||
The status of CDCR, including the confirmation that CDCR is stopped.
|
|
||||||
|
|
||||||
===== CDCR Stop Examples
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=STOP
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader": {
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 0
|
|
||||||
},
|
|
||||||
"status": {
|
|
||||||
"process": "stopped",
|
|
||||||
"buffer": "enabled"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
|
|
||||||
=== CDCR Monitoring Commands
|
|
||||||
|
|
||||||
==== QUEUES
|
|
||||||
|
|
||||||
`/core/cdcr?action=QUEUES`
|
|
||||||
|
|
||||||
===== QUEUES Response
|
|
||||||
|
|
||||||
*Output Content*
|
|
||||||
|
|
||||||
The output is composed of a list “queues” which contains a list of (ZooKeeper) Target hosts, themselves containing a list of Target collections. For each collection, the current size of the queue and the timestamp of the last update operation successfully processed is provided. The timestamp of the update operation is the original timestamp, i.e., the time this operation was processed on the Source SolrCloud. This allows an estimate the latency of the replication process.
|
|
||||||
|
|
||||||
The “queues” object also contains information about the update logs, such as the size (in bytes) of the update logs on disk (“tlogTotalSize”), the number of transaction log files (“tlogTotalCount”) and the status of the update logs synchronizer (“updateLogSynchronizer”).
|
|
||||||
|
|
||||||
===== QUEUES Examples
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<replica_name>/cdcr?action=QUEUES
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader":{
|
|
||||||
"status": 0,
|
|
||||||
"QTime": 1
|
|
||||||
},
|
|
||||||
"queues":{
|
|
||||||
"127.0.0.1: 40342/solr":{
|
|
||||||
"Target_collection":{
|
|
||||||
"queueSize": 104,
|
|
||||||
"lastTimestamp": "2014-12-02T10:32:15.879Z"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
},
|
|
||||||
"tlogTotalSize":3817,
|
|
||||||
"tlogTotalCount":1,
|
|
||||||
"updateLogSynchronizer": "stopped"
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== OPS
|
|
||||||
|
|
||||||
`/core/cdcr?action=OPS`
|
|
||||||
|
|
||||||
===== OPS Response
|
|
||||||
|
|
||||||
The output is composed of `operationsPerSecond` which contains a list of (ZooKeeper) target hosts, themselves containing a list of Target collections. For each collection, the average number of processed operations per second since the start of the replication process is provided. The operations are further broken down into two groups: add and delete operations.
|
|
||||||
|
|
||||||
===== OPS Examples
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=OPS
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader":{
|
|
||||||
"status":0,
|
|
||||||
"QTime":1
|
|
||||||
},
|
|
||||||
"operationsPerSecond":{
|
|
||||||
"127.0.0.1: 59661/solr":{
|
|
||||||
"Target_collection":{
|
|
||||||
"all": 297.102944952749052,
|
|
||||||
"adds": 297.102944952749052,
|
|
||||||
"deletes": 0.0
|
|
||||||
}
|
|
||||||
}
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
==== ERRORS
|
|
||||||
|
|
||||||
`/core/cdcr?action=ERRORS`
|
|
||||||
|
|
||||||
===== ERRORS Response
|
|
||||||
|
|
||||||
The output is composed of a list “errors” which contains a list of (ZooKeeper) target hosts, themselves containing a list of Target collections. For each collection, information about errors encountered during the replication is provided, such as the number of consecutive errors encountered by the replicator thread, the number of bad requests or internal errors since the start of the replication process, and a list of the last errors encountered ordered by timestamp.
|
|
||||||
|
|
||||||
===== ERRORS Examples
|
|
||||||
|
|
||||||
*Input*
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
http://host:8983/solr/<collection_name>/cdcr?action=ERRORS
|
|
||||||
----
|
|
||||||
|
|
||||||
*Output*
|
|
||||||
|
|
||||||
[source,json]
|
|
||||||
----
|
|
||||||
{
|
|
||||||
"responseHeader":{
|
|
||||||
"status":0,
|
|
||||||
"QTime":2
|
|
||||||
},
|
|
||||||
"errors": {
|
|
||||||
"127.0.0.1: 36872/solr":{
|
|
||||||
"Target_collection":{
|
|
||||||
"consecutiveErrors":3,
|
|
||||||
"bad_request":0,
|
|
||||||
"internal":3,
|
|
||||||
"last":{
|
|
||||||
"2014-12-02T11:04:42.523Z":"internal",
|
|
||||||
"2014-12-02T11:04:39.223Z":"internal",
|
|
||||||
"2014-12-02T11:04:38.22Z":"internal"
|
|
||||||
}
|
|
||||||
}
|
|
||||||
}
|
|
||||||
}
|
|
||||||
}
|
|
||||||
----
|
|
||||||
|
|
||||||
== Initial Startup
|
|
||||||
|
|
||||||
.CDCR Bootstrapping
|
|
||||||
[TIP]
|
|
||||||
====
|
|
||||||
Solr 6.2, added the additional functionality to allow CDCR to replicate the entire index from the Source to the Target data centers on first time startup as an alternative to the following procedure. For very large indexes, time should be allocated for this initial synchronization if this option is chosen.
|
|
||||||
====
|
|
||||||
|
|
||||||
This is a general approach for initializing CDCR in a production environment based upon an approach taken by the initial working installation of CDCR and generously contributed to illustrate a "real world" scenario.
|
|
||||||
|
|
||||||
|
|
||||||
* Customer uses the CDCR approach to keep a remote disaster-recovery instance available for production backup. This is a uni-directional solution.
|
|
||||||
* Customer has 26 clouds with 200 million assets per cloud (15GB indexes). Total document count is over 4.8 billion.
|
|
||||||
** Source and Target clouds were synched in 2-3 hour maintenance windows to establish the base index for the Targets.
|
|
||||||
|
|
||||||
As usual, it is good to start small. Sync a single cloud and monitor for a period of time before doing the others. You may need to adjust your settings several times before finding the right balance.
|
|
||||||
|
|
||||||
* Before starting, stop or pause the indexers. This is best done during a small maintenance window.
|
|
||||||
* Stop the SolrCloud instances at the Source
|
|
||||||
* Upload the modified `solrconfig.xml` to ZooKeeper on both Source and Target as appropriate, see the examples above.
|
|
||||||
* Sync the index directories from the Source collection to Target collection across to the corresponding shard nodes. `rsync` works well for this.
|
|
||||||
+
|
|
||||||
For example, if there are 2 shards on collection1 with 2 replicas for each shard, copy the corresponding index directories from
|
|
||||||
+
|
|
||||||
[width="75%",cols="45,10,45"]
|
|
||||||
|===
|
|
||||||
|shard1replica1Source |to |shard1replica1Target
|
|
||||||
|shard1replica2Source |to |shard1replica2Target
|
|
||||||
|shard2replica1Source |to |shard2replica1Target
|
|
||||||
|shard2replica2Source |to |shard2replica2Target
|
|
||||||
|===
|
|
||||||
+
|
|
||||||
* Start the ZooKeeper on the Target (DR) side
|
|
||||||
* Start the SolrCloud on the Target (DR) side
|
|
||||||
* Start the ZooKeeper on the Source side
|
|
||||||
* Start the SolrCloud on the Source side. As a general rule, the Target (DR) side of the SolrCloud should be started before the Source side.
|
|
||||||
* Activate the CDCR on Source instance using the CDCR API: `\http://host:port/solr/collection_name/cdcr?action=START`
|
|
||||||
+
|
|
||||||
[source,text]
|
|
||||||
http://host:port/solr/<collection_name>/cdcr?action=START
|
|
||||||
+
|
|
||||||
* There is no need to run the /cdcr?action=START command on the Target
|
|
||||||
* Disable the buffer on the Target and Source
|
|
||||||
+
|
|
||||||
[source,text]
|
|
||||||
http://host:port/solr/collection_name/cdcr?action=DISABLEBUFFER
|
|
||||||
+
|
|
||||||
* Renable indexing
|
|
||||||
|
|
||||||
== Monitoring
|
|
||||||
|
|
||||||
. Network and disk space monitoring are essential. Ensure that the system has plenty of available storage to queue up changes if there is a disconnect between the Source and Target. A network outage between the two data centers can cause your disk usage to grow.
|
|
||||||
.. Tip: Set a monitor for your disks to send alerts when the disk gets over a certain percentage (e.g., 70%)
|
|
||||||
.. Tip: Run a test. With moderate indexing, how long can the system queue changes before you run out of disk space?
|
|
||||||
. Create a simple way to check the counts between the Source and the Target.
|
|
||||||
.. Keep in mind that if indexing is running, the Source and Target may not match document for document. Set an alert to fire if the difference is greater than some percentage of the overall cloud size.
|
|
||||||
|
|
||||||
== ZooKeeper Settings
|
|
||||||
|
|
||||||
With CDCR, the Target ZooKeepers will have connections from the Target clouds and the Source clouds. You may need to increase the `maxClientCnxns` setting in `zoo.cfg`.
|
|
||||||
|
|
||||||
[source,text]
|
|
||||||
----
|
|
||||||
## set numbers of connection to 200 from client
|
|
||||||
## is maxClientCnxns=0 that means no limit
|
|
||||||
maxClientCnxns=800
|
|
||||||
----
|
|
||||||
|
|
||||||
== Upgrading and Patching Production
|
|
||||||
|
|
||||||
When rolling in upgrades to your indexer or application, you should shutdown the Source (production) and the Target (DR). Depending on your setup, you may want to pause/stop indexing. Deploy the release or patch and renable indexing. Then start the Target (DR).
|
|
||||||
|
|
||||||
* There is no need to reissue the DISABLEBUFFERS or START commands. These are persisted.
|
|
||||||
* After starting the Target, run a simple test. Add a test document to each of the Source clouds. Then check for it on the Target.
|
|
||||||
|
|
||||||
[source,bash]
|
|
||||||
----
|
|
||||||
#send to the Source
|
|
||||||
curl http://<Source>/solr/cloud1/update -H 'Content-type:application/json' -d '[{"SKU":"ABC"}]'
|
|
||||||
|
|
||||||
#check the Target
|
|
||||||
curl "http://<Target>:8983/solr/<collection_name>/select?q=SKU:ABC&indent=true"
|
|
||||||
----
|
|
||||||
|
|
Binary file not shown.
After Width: | Height: | Size: 69 KiB |
Loading…
Reference in New Issue