[[rolling-upgrades]]
=== Rolling upgrades
A rolling upgrade allows an Elasticsearch cluster to be upgraded one node at
a time, with no downtime for end users. Running multiple versions of
Elasticsearch in the same cluster for any length of time beyond that required
for an upgrade is not supported, as shards will not be replicated from the
more recent version to the older version.
Consult this <<setup-upgrade,table>> to verify that rolling upgrades are
supported for your version of Elasticsearch.
To perform a rolling upgrade:
. *Disable shard allocation*
+
--
When you shut down a node, the allocation process will wait for one minute
before starting to replicate the shards that were on that node to other nodes
in the cluster, which can waste a lot of I/O. This can be avoided by disabling
allocation before shutting down a node:
[source,js]
--------------------------------------------------
PUT _cluster/settings
{
"transient": {
"cluster.routing.allocation.enable": "none"
}
}
--------------------------------------------------
// CONSOLE
// TEST[skip:indexes don't assign]
--
. *Stop non-essential indexing and perform a synced flush (Optional)*
+
--
You may happily continue indexing during the upgrade. However, shard recovery
will be much faster if you temporarily stop non-essential indexing and issue a
<<indices-synced-flush, synced-flush>> request:
[source,js]
--------------------------------------------------
POST _flush/synced
--------------------------------------------------
// CONSOLE
A synced flush request is a ``best effort'' operation. It will fail if there
are any pending indexing operations, but it is safe to reissue the request
multiple times if necessary.
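Because a synced flush fails only while indexing operations are in flight, it
can simply be retried. A minimal shell sketch, assuming Elasticsearch is
listening on `localhost:9200`:

[source,sh]
--------------------------------------------------
# Reissue the synced flush until it succeeds. The check below naively
# looks for a "failed":0 count in the JSON response and is illustrative
# only; inspect the full response in practice.
until curl -s -X POST 'localhost:9200/_flush/synced' | grep -q '"failed":0'; do
  sleep 5
done
--------------------------------------------------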
--
. [[upgrade-node]] *Stop and upgrade a single node*
+
--
Shut down one of the nodes in the cluster *before* upgrading it.
[TIP]
================================================
When using the zip or tarball packages, the `config`, `data`, `logs` and
`plugins` directories are placed within the Elasticsearch home directory by
default.
It is a good idea to place these directories in a different location so that
there is no chance of deleting them when upgrading Elasticsearch. These custom
paths can be <<path-settings,configured>> with the `CONF_DIR` environment
variable and the `path.logs` and `path.data` settings.
The <<deb,Debian>> and <<rpm,RPM>> packages place these directories in the
appropriate place for each operating system.
================================================
To upgrade using a <<deb,Debian>> or <<rpm,RPM>> package:
* Use `rpm` or `dpkg` to install the new package. All files should be
placed in their proper locations, and config files should not be
overwritten.
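For example (the package file names and versions below are purely
illustrative):

[source,sh]
--------------------------------------------------
# Debian/Ubuntu: install the new package over the old one.
# Modified config files under /etc/elasticsearch are preserved.
sudo dpkg -i elasticsearch-5.6.0.deb

# RHEL/CentOS: upgrade with rpm instead.
sudo rpm -U elasticsearch-5.6.0.rpm
--------------------------------------------------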
To upgrade using a zip or compressed tarball:
* Extract the zip or tarball to a new directory, to be sure that you don't
overwrite the `config` or `data` directories.
* Either copy the files in the `config` directory from your old installation
to your new installation, or set the environment variable
<<config-files-location,`CONF_DIR`>> to point to a custom config directory.
* Either copy the files in the `data` directory from your old installation
to your new installation, or configure the location of the data directory
in the `config/elasticsearch.yml` file, with the `path.data` setting.
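A sketch of this procedure, with illustrative paths and version numbers:

[source,sh]
--------------------------------------------------
# Extract the new version into its own directory, never over the old one.
tar -xzf elasticsearch-5.6.0.tar.gz -C /opt

# Option 1: point the new installation at the old config directory.
export CONF_DIR=/opt/elasticsearch-5.5.0/config

# Option 2: copy the old config and data directories across.
cp -r /opt/elasticsearch-5.5.0/config /opt/elasticsearch-5.6.0/
cp -r /opt/elasticsearch-5.5.0/data   /opt/elasticsearch-5.6.0/
--------------------------------------------------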
--
. *Upgrade any plugins*
+
--
Elasticsearch plugins must be upgraded when upgrading a node. Use the
`elasticsearch-plugin` script to install the correct version of any plugins
that you need.
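For example, to bring a plugin in line with the new node version
(`analysis-icu` here is just an illustration):

[source,sh]
--------------------------------------------------
# Remove the old plugin, then install the version that matches the new node.
bin/elasticsearch-plugin remove analysis-icu
bin/elasticsearch-plugin install analysis-icu
--------------------------------------------------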
--
. *Start the upgraded node*
+
--
Start the newly upgraded node and confirm that it joins the cluster by
checking the log file or the output of this request:
[source,sh]
--------------------------------------------------
GET _cat/nodes
--------------------------------------------------
// CONSOLE
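To verify that the node is running the expected version, you can request the
`version` column explicitly:

[source,sh]
--------------------------------------------------
GET _cat/nodes?h=name,version
--------------------------------------------------
// CONSOLE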
--
. *Reenable shard allocation*
+
--
Once the node has joined the cluster, reenable shard allocation to start using
the node:
[source,js]
--------------------------------------------------
PUT _cluster/settings
{
"transient": {
"cluster.routing.allocation.enable": "all"
}
}
--------------------------------------------------
// CONSOLE
--
. *Wait for the node to recover*
+
--
You should wait for the cluster to finish shard allocation before upgrading
the next node. You can check on progress with the <<cat-health,`_cat/health`>>
request:
[source,sh]
--------------------------------------------------
GET _cat/health
--------------------------------------------------
// CONSOLE
Wait for the `status` column to move from `yellow` to `green`. Status `green`
means that all primary and replica shards have been allocated.
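Rather than polling, you can block until the cluster reaches `green` (or a
timeout expires) by using the `wait_for_status` parameter of the cluster
health API. Note that, as explained below, the cluster may legitimately
remain `yellow` until more nodes have been upgraded:

[source,js]
--------------------------------------------------
GET _cluster/health?wait_for_status=green&timeout=50s
--------------------------------------------------
// CONSOLE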
[IMPORTANT]
====================================================
During a rolling upgrade, primary shards assigned to a node with the higher
version will never have their replicas assigned to a node with the lower
version, because the newer version may have a different data format which is
not understood by the older version.
If it is not possible to assign the replica shards to another node with the
higher version -- e.g. if there is only one node with the higher version in
the cluster -- then the replica shards will remain unassigned and the
cluster health will remain status `yellow`.
In this case, check that there are no initializing or relocating shards (the
`init` and `relo` columns) before proceeding.
As soon as another node is upgraded, the replicas should be assigned and the
cluster health will reach status `green`.
====================================================
Shards that have not been <<indices-synced-flush,sync-flushed>> may take some time to
recover. The recovery status of individual shards can be monitored with the
<<cat-recovery,`_cat/recovery`>> request:
[source,sh]
--------------------------------------------------
GET _cat/recovery
--------------------------------------------------
// CONSOLE
If you stopped indexing, then it is safe to resume indexing as soon as
recovery has completed.
--
. *Repeat*
+
--
When the cluster is stable and the node has recovered, repeat the above steps
for all remaining nodes.
--
[IMPORTANT]
====================================================
During a rolling upgrade the cluster will continue to operate as normal. Any
new functionality will be disabled or work in a backward compatible manner
until all nodes of the cluster have been upgraded. Once the upgrade is
completed and all nodes are on the new version, the new functionality will
become operational. Once that has happened, it is practically impossible to
go back to operating in a backward compatible mode. To protect against such a
scenario, nodes from the previous major version (e.g. 5.x) will not be allowed
to join a cluster where all nodes are of a higher major version (e.g. 6.x).
In the unlikely case of a network malfunction during upgrades, where all
remaining old nodes are isolated from the cluster, you will have to take all
old nodes offline and upgrade them before they can rejoin the cluster.
====================================================