OpenSearch

Commit Graph

Author	SHA1	Message	Date
James Rodewig	9f1f468cef	[DOCS] Document dynamic discovery settings (#61420 ) (#62002 )	2020-09-04 11:36:34 -04:00
Yannick Welsch	25404cbe3d	Provide option to allow writes when master is down (#60605 ) Elasticsearch currently blocks writes by default when a master is unavailable. The cluster.no_master_block setting allows a user to change this behavior to also block reads when a master is unavailable. This PR introduces a way to now also still allow writes when a master is offline. Writes will continue to work as long as routing table changes are not needed (as those require the master for consistency), or if dynamic mapping updates are not required (as again, these require the master for consistency). Eventually we should switch the default of cluster.no_master_block to this new mode.	2020-08-12 16:56:45 +02:00
David Turner	f44c28b595	Deprecate and ignore join timeout (#60872 ) There is no point in timing out a join attempt any more once a cluster is entirely in 7.x. Timing out and retrying with the same master is pointless, and an in-flight join attempt to one master no longer blocks attempts to join other masters. This commit deprecates this unnecessary setting and removes its effect from the joining process. Relates #60873 which removes this setting in master.	2020-08-10 13:57:41 +01:00
James Rodewig	988e8c8fc6	[DOCS] Swap `[float]` for `[discrete]` (#60134 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 12:42:33 -04:00
David Turner	8f4f844e6e	Add docs for filesystem health checks (#59134 ) Documents the feature and settings introduced in #52680. Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-07-07 14:14:58 +01:00
James Rodewig	cde5b7d2b3	[DOCS] Relocate discovery module content (#56611 ) (#57454 ) * Moves `Discovery and cluster formation` content from `Modules` to `Set up Elasticsearch`. * Combines `Adding and removing nodes` with `Adding nodes to your cluster`. Adds related redirect. * Removes and redirects the `Modules` page. * Rewrites parts of `Discovery and cluster formation` to remove `module` references and meta references to the section.	2020-06-01 14:13:13 -04:00
David Turner	8e618fdf10	Adjust docs for voting config exclusions API (#55006 ) In #50836 we deprecated the existing voting config exclusions API and added a new one. This commit adjust the docs to match.	2020-04-20 19:47:33 +01:00
David Turner	69b78f7f8a	"Adding nodes" instructions only work on localhost (#52677 ) The introductory sections of the reference manual contains some simplified instructions for adding a node to the cluster. Unfortunately they are a little too simplified and only really work for clusters running on `localhost`. If you try and follow these instructions for a distributed cluster then the new node will, confusingly, auto-bootstrap itself into a distinct one-node cluster. Multiple nodes running on localhost is a valid config, of course, but we should spell out that these instructions are really only for experimentation and that it takes a bit more work to add nodes to a distributed cluster. This commit does so. Also, the "important config" instructions for discovery say that you MUST set `discovery.seed_hosts` whereas in fact it is fine to ignore this setting and use a dynamic discovery mechanism instead. This commit weakens this statement and links to the docs for dynamic discovery mechanisms. Finally, this section is also overloaded with some technical details that are not important for this context and are adequately covered elsewhere, and completely fails to note that the default discovery port is 9300. This commit addresses this.	2020-02-27 09:18:37 +00:00
David Turner	00b9098250	Ignore timeouts with single-node discovery (#52159 ) Today we use `cluster.join.timeout` to prevent nodes from waiting indefinitely if joining a faulty master that is too slow to respond, and `cluster.publish.timeout` to allow a faulty master to detect that it is unable to publish its cluster state updates in a timely fashion. If these timeouts occur then the node restarts the discovery process in an attempt to find a healthier master. In the special case of `discovery.type: single-node` there is no point in looking for another healthier master since the single node in the cluster is all we've got. This commit suppresses these timeouts and instead lets the node wait for joins and publications to succeed no matter how long this might take.	2020-02-11 14:15:01 +00:00
David Turner	86a40f6d8b	Drop snapshot instructions for autobootstrap fix (#49755 ) The "Restore any snapshots as required" step is a trap: it's somewhere between tricky and impossible to restore multiple clusters into a single one. Also add a note about configuring discovery during a rolling upgrade to proscribe any rare cases where you might accidentally autobootstrap during the upgrade.	2019-12-02 14:33:42 +00:00
glerb	baabc21a04	[DOCS] Correct typo in Discovery docs (#48494 )	2019-11-05 08:48:43 -05:00
David Turner	ecb20ebc6c	More bootstrap docs tweaks (#47809 ) Clarifies not to set `cluster.initial_master_nodes` on nodes that are joining an existing cluster. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-10-10 09:55:30 +01:00
David Turner	5c85b0998b	Clarify that discovery ignores master-ineligibles (#44835 ) The changes in #32006 mean that the discovery process can no longer use master-ineligible nodes as a stepping-stone between master-eligible nodes. This was normally an indication of a strange and possibly-fragile configuration and was not recommended, but this commit adds a note to the breaking changes docs to note that this kind of configuration is more obviously broken in recent versions.	2019-09-12 11:07:34 +01:00
James Rodewig	e253ee6ba6	[DOCS] Change // CONSOLE comments to [source,console] (#46440 ) (#46494 )	2019-09-09 12:35:50 -04:00
David Turner	532ade7816	More logging for slow cluster state application (#45007 ) Today the lag detector may remove nodes from the cluster if they fail to apply a cluster state within a reasonable timeframe, but it is rather unclear from the default logging that this has occurred and there is very little extra information beyond the fact that the removed node was lagging. Moreover the only forewarning that the lag detector might be invoked is a message indicating that cluster state publication took unreasonably long, which does not contain enough information to investigate the problem further. This commit adds a good deal more detail to make the issues of slow nodes more prominent: - after 10 seconds (by default) we log an INFO message indicating that a publication is still waiting for responses from some nodes, including the identities of the problematic nodes. - when the publication times out after 30 seconds (by default) we log a WARN message identifying the nodes that are still pending. - the lag detector logs a more detailed warning when a fatally-lagging node is detected. - if applying a cluster state takes too long then the cluster applier service logs a breakdown of all the tasks it ran as part of that process.	2019-08-01 13:20:46 +01:00
Lisa Cawley	757c6a45a0	[DOCS] Adds discovery.type (#42823 ) Co-Authored-By: David Turner <david.turner@elastic.co>	2019-06-05 12:37:17 -07:00
David Turner	9f470c20ed	More improvements to cluster coordination docs (#42799 ) This commit addresses a few more frequently-asked questions: * clarifies that bootstrapping doesn't happen even after a full cluster restart. * removes the example that uses IP addresses, to try and further encourage the use of node names for bootstrapping. * clarifies that auto-bootstrapping might form different clusters on different hosts, and gives a process for starting again if this wasn't what you wanted. * adds the "do not stop half-or-more of the master-eligible nodes" slogan that was notably absent. * reformats one of the console examples to a narrower width	2019-06-04 08:25:41 +01:00
David Turner	15fd233ae3	Minor cluster coordination docs fixes (#42111 ) Fixes a typo and a badly-formatted warning.	2019-05-15 09:27:08 -04:00
David Turner	99b5a27ea0	Node names in bootstrap config have no ports (#41569 ) In cases where node names and transport addresses can be muddled, it is unclear that `cluster.initial_master_nodes: master-a:9300` means to look for a node called `master-a:9300` rather than a node called `master-a` with transport port `9300`. This commit adds docs to that effect.	2019-05-08 10:38:40 +01:00
David Turner	36a8c7aa0b	Add 'DO NOT TOUCH' warnings to disco settings docs (#41211 )	2019-04-16 06:26:52 +01:00
David Turner	5ef247dc91	Further clarify cluster.initial_master_nodes (#41179 ) The following phrase causes confusion: > Alternatively the IP addresses or hostnames (if node name defaults to the > host name) can be used. This change clarifies the conditions under which you can use a hostname, and adds an anchor to the note introduced in (#41137) so we can link directly to it in conversations with users.	2019-04-14 10:39:47 +01:00
David Turner	b74d02944e	Clarify initial_master_nodes must match node.name (#41137 ) ... and emphasize that this includes any trailing qualifiers.	2019-04-12 10:45:43 +01:00
Yannick Welsch	368b5482fa	Add note about cluster state diffs (#39847 ) Mentions cluster state diffs in CS publishing docs.	2019-03-11 15:40:07 +01:00
David Turner	5a3c452480	Align docs etc with new discovery setting names (#38492 ) In #38333 and #38350 we moved away from the `discovery.zen` settings namespace since these settings have an effect even though Zen Discovery itself is being phased out. This change aligns the documentation and the names of related classes and methods with the newly-introduced naming conventions.	2019-02-06 11:34:38 +00:00
David Turner	3b2a0d7959	Rename no-master-block setting (#38350 ) Replaces `discovery.zen.no_master_block` with `cluster.no_master_block`. Any value set for the old setting is now ignored.	2019-02-05 08:47:56 +00:00
David Turner	2d114a02ff	Rename static Zen1 settings (#38333 ) Renames the following settings to remove the mention of `zen` in their names: - `discovery.zen.hosts_provider` -> `discovery.seed_providers` - `discovery.zen.ping.unicast.concurrent_connects` -> `discovery.seed_resolver.max_concurrent_resolvers` - `discovery.zen.ping.unicast.hosts.resolve_timeout` -> `discovery.seed_resolver.timeout` - `discovery.zen.ping.unicast.hosts` -> `discovery.seed_addresses`	2019-02-05 08:46:52 +00:00
Yannick Welsch	ece8c659c5	Decrease leader and follower check timeout (#38298 ) Reduces the leader and follower check timeout to 3 * 10 = 30s instead of 3 * 30 = 90s, with 30s still being a very long time for a node to be completely unresponsive.	2019-02-04 15:11:12 +01:00
Yannick Welsch	504a89feaf	Step down as master when configured out of voting configuration (#37802 ) Abdicates to another master-eligible node once the active master is reconfigured out of the voting configuration, for example through the use of voting configuration exclusions. Follow-up to #37712	2019-01-29 12:43:04 +01:00
Lisa Cawley	f307847f29	[DOCS] Adds overview and API ref for cluster voting configurations (#36954 )	2019-01-07 09:11:14 -08:00
Lisa Cawley	33e9cf3892	[DOCS] Merges list of discovery and cluster formation settings (#36909 )	2018-12-21 11:24:48 -08:00
David Turner	3f5dd792b3	Remove duplicate paragraph (#36942 )	2018-12-21 16:09:35 +00:00
David Turner	1a23417aeb	[Zen2] Update documentation for Zen2 (#34714 ) This commit overhauls the documentation of discovery and cluster coordination, removing mention of the Zen Discovery module and replacing it with docs for the new cluster coordination mechanism introduced in 7.0. Relates #32006	2018-12-20 13:02:44 +00:00
Tim Brooks	47a9a8de49	Update transport docs and settings for changes (#36786 ) This is related to #36652. In 7.0 we plan to deprecate a number of settings that make reference to the concept of a tcp transport. We mostly just have a single transport type now (based on tcp). Settings should only reference tcp if they are referring to socket options. This commit updates the settings in the docs. And removes string usages of the old settings. Additionally it adds a missing remote compress setting to the docs.	2018-12-18 13:09:58 -07:00
David Turner	51cbc61135	Fix docs build after #33241 Recently-merged PR #33241 broke the docs build, and this fixes it.	2018-08-30 09:38:23 +01:00
David Turner	47859e56ac	Move file-based discovery to core (#33241 ) Today we support a static list of seed hosts in core Elasticsearch, and allow a dynamic list of seed hosts to be provided via a file using the `discovery-file` plugin. In fact the ability to provide a dynamic list of seed hosts is increasingly useful, so this change moves this functionality to core Elasticsearch to avoid the need for a plugin. Furthermore, in order to start up nodes in integration tests we currently assign a known port to each node before startup, which unfortunately sometimes fails if another process grabs the selected port in the meantime. By moving the `discovery-file` functionality into the core product we can use it to avoid this race. This change also moves the expected path to the file from `$ES_PATH_CONF/discovery-file/unicast_hosts.txt` to `$ES_PATH_CONF/unicast_hosts.txt`. An example of this file is not included in distributions. For BWC purposes the plugin still exists, but does nothing more than create the example file in the old location, and issue a warning when it is used. We also continue to support the old location for the file, but warn about its deprecation. Relates #29244 Closes #33030	2018-08-30 06:43:04 +01:00
Jason Tedor	ff3c19ed13	Move DNS cache settings to important configuration This commit moves the DNS cache settings for the JVM to the important settings section of the docs. Relates #27592	2017-11-29 18:02:26 -05:00
Christoph Büscher	0d11b9fe34	[Docs] Unify spelling of Elasticsearch (#27567 ) Removes occurences of "elasticsearch" or "ElasticSearch" in favour of "Elasticsearch" where appropriate.	2017-11-29 09:44:25 +01:00
Yannick Welsch	7791e72626	Add additional explanations around discovery.zen.ping_timeout (#27231 ) Makes it clearer that this setting should only be changed with extra care.	2017-11-02 16:52:10 +01:00
Jason Tedor	7066ec44ca	Add recommendation on unicast hosts to docs This commit adds a small note to the discovery docs to include a note that we recommend that the unicast hosts list be maintained as the list of master-eligible nodes in the cluster. Relates #25991	2017-08-01 18:15:50 +09:00
Till Backhaus	b744dc3bcc	Link to minimum master nodes docs from Zen docs This commit adds a link to the minimum master nodes section of the important settings docs from the Zen discovery docs to clarify the meaning and importance of setting minimum master nodes to a quorum of master-eligible nodes. Relates #24311	2017-04-25 16:53:05 -04:00
Iliiaz Akhmedov	688fa309bc	Changing some grammar in docs (#24164 )	2017-04-19 08:49:13 -06:00
Jason Tedor	b7995fbc0d	Fix default port for unicast zen ping hosts Today when you do not specify a port for an entry in discovery.zen.ping.unicast.hosts, the default port is the value of the setting transport.profiles.default.port and falls back to the value of transport.tcp.port if this is not set. For a node that is explicitly bound to a different port than the default port, this means that the default port will be equal to this explicitly bound port. Yet, the docs say that we fall back to 9300 here. This commit corrects the docs. Relates #22568	2017-01-11 17:10:56 -05:00
Jason Tedor	32e6fcf256	Fix markup in Zen discovery docs This commit fixes a markup issue in the Zen discovery docs where a link and its referring text were not on the same line tripping the renderer.	2016-11-23 10:02:13 -05:00
Jason Tedor	9dc65037bc	Lazy resolve unicast hosts Today we eagerly resolve unicast hosts. This means that if DNS changes, we will never find the host at the new address. Moreover, a single host failng to resolve causes startup to abort. This commit introduces lazy resolution of unicast hosts. If a DNS entry changes, there is an opportunity for the host to be discovered. Note that under the Java security manager, there is a default positive cache of infinity for resolved hosts; this means that if a user does want to operate in an environment where DNS can change, they must adjust networkaddress.cache.ttl in their security policy. And if a host fails to resolve, we warn log the hostname but continue pinging other configured hosts. When doing DNS resolutions for unicast hostnames, we wait until the DNS lookups timeout. This appears to be forty-five seconds on modern JVMs, and it is not configurable. If we do these serially, the cluster can be blocked during ping for a lengthy period of time. This commit introduces doing the DNS lookups in parallel, and adds a user-configurable timeout for these lookups. Relates #21630	2016-11-22 14:17:04 -05:00
Praveen Shukla	2e18f2e818	[DOCS] clarifies master nodes to be master eligible nodes	2016-10-24 08:58:44 -04:00
kingrhoton	1307aa7e77	clarify awkward text (#19608 )	2016-07-27 20:03:20 +02:00
kingrhoton	643ccb8cc1	[docs] Switch contraction to possesive	2016-07-26 14:01:30 -04:00
David Pilato	527a9c7f48	Deprecate discovery-azure and rename it to discovery-azure-classic As discussed at https://github.com/elastic/elasticsearch-cloud-azure/issues/91#issuecomment-229113595, we know that the current `discovery-azure` plugin only works with Azure Classic VMs / Services (which is somehow Legacy now). The proposal here is to rename `discovery-azure` to `discovery-azure-classic` in case some users are using it. And deprecate it for 5.0. Closes #19144.	2016-06-30 14:42:40 +02:00
Kyle Gochenour	b12cabd2f5	[docs] Add missing article [docs] Add missing article to zen.asciidoc	2016-05-17 11:39:47 -04:00
javanna	27d4994aff	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-24 18:10:11 +01:00

1 2

82 Commits