OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	1f0673c9bd	Default max local storage nodes to one This commit defaults the max local storage nodes to one. The motivation for this change is that a default value greather than one is dangerous as users sometimes end up unknowingly starting a second node and start thinking that they have encountered data loss. Relates #19964	2016-08-12 09:26:20 -04:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes over a restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node node is part of each log message. The PR move the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, settings `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428 ) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode it only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implemenation since netty is now a module. Yet, to make the user expericence flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just startup with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Boaz Leskes	ef33183a19	update migration docs to include removal of `netty.epollBugWorkaround`	2016-07-14 12:20:35 +02:00
Jason Tedor	ce5a382c69	Remove support for properties This commit removes support for properties syntax and config files: - removed support for elasticsearch.properties - removed support for logging.properties - removed support for properties content detection in REST APIs - removed support for properties content detection in Java API Relates #19398	2016-07-12 17:55:18 -04:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitation, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the nodes data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). I It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` paramater from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Jason Tedor	00356edd33	Clarify time units usage in docs This commit clarifies the distinction between supported time units for durations and supported time units for durations in the docs. Relates #19159	2016-06-29 17:02:15 -04:00
Jason Tedor	2f638b5a23	Keep input time unit when parsing TimeValues This commit modifies TimeValue parsing to keep the input time unit. This enables round-trip parsing from instances of String to instances of TimeValue and vice-versa. With this, this commit removes support for the unit "w" representing weeks, and also removes support for fractional values of units (e.g., 0.5s). Relates #19102	2016-06-27 18:41:18 -04:00
Damien Alexandre	fec4a18835	Rename plainless into painless in migration doc The scripting language was wrongly named.	2016-06-26 17:41:34 +02:00
Jason Tedor	245def80f0	Add note that thread pool settings are node-level This commit adds a note to the breaking changes docs that since commit `da74323141`, thread pool settings are no longer cluster-level settings and thus not dynamically updatable.	2016-06-17 15:19:52 -04:00
Lee Hinman	c637fea84b	Change the default of `include_global_state` from true to false for restores This changes the default value to be false only for restore operations. Resolves #18569	2016-06-08 10:48:36 -06:00
Lee Hinman	762bbdbd0c	Revert "Change the default of `include_global_state` from true to false." This reverts commit `052a62250c`.	2016-06-07 15:07:37 -06:00
Lee Hinman	052a62250c	Change the default of `include_global_state` from true to false. Resolves #18569	2016-06-07 15:06:20 -06:00
Jason Tedor	da74323141	Register thread pool settings This commit refactors the handling of thread pool settings so that the individual settings can be registered rather than registering the top level group. With this refactoring, individual plugins must now register their own settings for custom thread pools that they need, but a dedicated API is provided for this in the thread pool module. This commit also renames the prefix on the thread pool settings from "threadpool" to "thread_pool". This enables a hard break on the settings so that: - some of the settings can be given more sensible names (e.g., the max number of threads in a scaling thread pool is now named "max" instead of "size") - change the soft limit on the number of threads in the bulk and indexing thread pools to a hard limit - the settings names for custom plugins for thread pools can be prefixed (e.g., "xpack.watcher.thread_pool.size") - remove dynamic thread pool settings Relates #18674	2016-06-06 22:09:12 -04:00
Jason Tedor	8e2a7d0fe1	Rename boostrap.mlockall to bootstrap.memory_lock The setting bootstrap.mlockall is useful on both POSIX-like systems (POSIX mlockall) and Windows (Win32 VirtualLock). But mlockall is really a POSIX only thing so the name should not be tied POSIX. This commit renames the setting to "bootstrap.memory_lock". Relates #18669	2016-06-01 16:25:51 -04:00
Jason Tedor	c257e2c51f	Remove settings and system properties entanglement Today when parsing settings during bootstrap, we add a system property for every Elasticsearch setting. Additionally, settings can be set via system properties. This commit simplifies this situation. - settings are no longer propogated to system properties - system properties can not be used to set settings - the "es." prefix on settings is no longer required (nor permitted) - test logging has a dedicated system property (tests.logger.level) Relates #18198	2016-05-19 14:08:08 -04:00
Simon Willnauer	d77c299cb9	Register `indices.query.bool.max_clause_count` setting (#18341 ) * Register `indices.query.bool.max_clause_count` setting This commit registers `indices.query.bool.max_clause_count` as a node level setting and removes support for its synonym setting `index.query.bool.max_clause_count`. Closes #18336	2016-05-19 10:42:35 +02:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
Pius	66686040ca	Update settings.asciidoc Changed "must bet set" to "must be set" under Discovery Settings.	2016-04-26 00:15:39 -07:00
Pius	f6656aa6ff	Update settings.asciidoc Added the 2 missing ` under Request Cache Settings section.	2016-04-25 23:57:42 -07:00
Lee Hinman	5fe1916be9	Merge pull request #17924 from elastic/russcam-patch-1 Update settings.asciidoc	2016-04-24 18:25:08 -06:00
Clinton Gormley	b9978ace40	Update settings.asciidoc Asciidoc typo	2016-04-23 13:44:42 +02:00
Russ Cam	fb58ae3b4f	Update settings.asciidoc Add note for removal of index.translog.interval	2016-04-23 11:44:59 +10:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Martijn van Groningen	ba08313417	settings: Removed `action.get.realtime` setting Closes #12543	2016-04-19 17:14:23 +02:00
Simon Willnauer	8b075dbb75	Remove ability to specify arbitrary node attributes with `node.` prefix Today the basic node settings like `node.data` and `node.master` can't really be fully validated since we allow to specify custom user attributes on the node level. We have to, in order to support that, add a wildcard setting for `node.*` to let these setting pass validation. Instead we should require a more contraint prefix like `node.attr.` that defines a namespace that is reserved for user attributes. This commit adds a new namespace for attributes in `node.attr`. Closes #17280	2016-03-30 13:29:48 +02:00
javanna	19eeb68bc4	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 21:53:22 +02:00
javanna	ae34c20a62	add node.client breaking changes to migrate guide	2016-03-29 20:33:59 +02:00
spalger	ce44bbfadf	[docs] clarify where discovery.zen.minimum_master_node is required https://github.com/elastic/elasticsearch/pull/17288 added a check to enforce that the `discovery.zen.minimum_master_nodes` configuration is set when nodes have the `host`, `port`, or `bind_host` set in either `transport` or general `network` configuration sections. This was documented incorrectly as "nodes that are bound to a non-loopback interface", which lead to confusion as I set `network.host: "localhost"` and the check was still failing. This change updates the docs to detail the actual check. I think it also highlights how complex the check is and the need for a simpler solution.	2016-03-28 12:53:40 -07:00
Boaz Leskes	b8227a7222	Enforce `discovery.zen.minimum_master_nodes` is set when bound to a public ip #17288 discovery.zen.minimum_master_nodes is the single most important setting to set on a production cluster. We have no way of supplying a good default so it must be set by the user. Binding a node to a public IP (as opposed to the default local host) is a good enough indication that a node will be part of a production cluster cluster and thus it's a good tradeoff to enforce the settings. Note that nothing prevent users from setting it to 1 in a single node cluster. Closes #17288	2016-03-25 12:56:20 +01:00
Boaz Leskes	6dd164d0bd	Include pings from client nodes in master election We currently have a `discovery.zen.master_election.filter_client` setting that control whether their ping responses are ignored for master election (which is the current default). With the push to treat client nodes as normal nodes (and promote the transport/rest clients for client work), this should be changed. This commit remove this setting and it's companion `discovery.zen.master_election.filter_data` setting (currently defaulting to false) in favor of singe `discovery.zen.master_election.ignore_non_master_pings` setting with more intuitive name (defaulting to false). Resolves #17325 Closes #17329	2016-03-24 17:48:05 +01:00
Areek Zillur	e16e113691	Remove suggest threadpool In #17198, we removed suggest transport action, which used the `suggest` threadpool to execute requests. Now `suggest` threadpool is unused and suggest requests are executed on the `search` threadpool.	2016-03-23 18:01:45 -04:00
Simon Willnauer	7f16a1d9a7	Improve upgrade experience of node level index settings In 5.0 we don't allow index settings to be specified on the node level ie. in yaml files or via commandline argument. This can cause problems during upgrade if this was used extensively. For instance if analyzers where specified on a node level this might cause the index to be closed when imported (see #17187). In such a case all indices relying on this must be updated via `PUT /${index}/_settings`. Yet, this API has slightly different semantics since it overrides existing settings. To make this less painful this change adds a `preserve_existing` parameter on that API to ensure we have the same semantics as if the setting was applied on the node level. This change also adds a better error message and a change to the migration guide to ensure upgrades are smooth if index settings are specified on the node level. If a index setting is detected this change fails the node startup and prints a message like this: ``` *********************************************************************************** Found index level settings on node level configuration. Since elasticsearch 5.x index level settings can NOT be set on the nodes configuration like the elasticsearch.yaml, in system properties or command line arguments.In order to upgrade all indices the settings must be updated via the /${index}/_settings API. Unless all settings are dynamic all indices must be closed in order to apply the upgradeIndices created in the future should use index templates to set default values. Please ensure all required values are updated on all indices by executing: curl -XPUT 'http://localhost:9200/_all/_settings?preserve_existing=true' -d '{ "index.number_of_shards" : "1", "index.query.default_field" : "main_field", "index.translog.durability" : "async", "index.ttl.disable_purge" : "true" }' *********************************************************************************** ```	2016-03-21 20:12:18 +01:00
Areek Zillur	da165f425f	update migration doc for removing gateway.format setting	2016-03-16 18:48:02 -04:00
Jason Tedor	2f7e181318	Fix typo inadvertently introduced	2016-03-15 10:05:28 -04:00
Jason Tedor	8a05c2a2be	Bootstrap does not set system properties Today, certain bootstrap properties are set and read via system properties. This action-at-distance way of managing these properties is rather confusing, and completely unnecessary. But another problem exists with setting these as system properties. Namely, these system properties are interpreted as Elasticsearch settings, not all of which are registered. This leads to Elasticsearch failing to startup if any of these special properties are set. Instead, these properties should be kept as local as possible, and passed around as method parameters where needed. This eliminates the action-at-distance way of handling these properties, and eliminates the need to register these non-setting properties. This commit does exactly that. Additionally, today we use the "-D" command line flag to set the properties, but this is confusing because "-D" is a special flag to the JVM for setting system properties. This creates confusion because some "-D" properties should be passed via arguments to the JVM (so via ES_JAVA_OPTS), and some should be passed as arguments to Elasticsearch. This commit changes the "-D" flag for Elasticsearch settings to "-E".	2016-03-13 20:09:15 -04:00
Clinton Gormley	5c845f8bb5	Reworked 5.0 breaking changes docs	2016-03-13 21:17:48 +01:00

37 Commits