OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-09 06:25:07 +00:00

Author	SHA1	Message	Date
Martijn van Groningen	2fdf79d8d4	Deprecate template query. Closes #19390	2016-07-27 09:50:44 +02:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes over a restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node node is part of each log message. The PR move the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, settings `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Jun Ohtani	cebad703fe	Analyze: Specify anonymous char_filters/tokenizer/token_filters in the analyze API Add parser for anonymous char_filters/tokenizer/token_filters Using Settings in AnalyzeRequest for anonymous definition Add breaking changes document Closed #8878	2016-07-21 11:06:36 +09:00
Nik Everett	3a82c613e4	Migrate query registration from push to pull Remove `ParseField` constants used for names where there are no deprecated names and just use the `String` version of the registration method instead. This is step 2 in cleaning up the plugin interface for extending search time actions. Aggregations are next. This is breaking for plugins because those that register a new query should now implement `SearchPlugin` rather than `onModule(SearchModule)`.	2016-07-20 12:33:51 -04:00
Martijn van Groningen	e0ebf5da1c	Template cleanup: * Removed `Template` class and unified script & template parsing logic. Templates are scripts, so they should be defined as a script. Unless there will be separate template infrastructure, templates should share as much code as possible with scripts. * Removed ScriptParseException in favour for ElasticsearchParseException * Moved TemplateQueryBuilder to lang-mustache module because this query is hard coded to work with mustache only	2016-07-18 10:16:01 +02:00
Nik Everett	7aeea764ba	Remove wait_for_status=yellow from the docs It is no longer required after 687e2e12b31ed3c12ef4c411333bff9da58fc808.	2016-07-15 16:02:07 -04:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428 ) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode it only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implemenation since netty is now a module. Yet, to make the user expericence flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just startup with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Boaz Leskes	ef33183a19	update migration docs to include removal of `netty.epollBugWorkaround`	2016-07-14 12:20:35 +02:00
Clinton Gormley	1e2d0c1000	More bad asciidoc	2016-07-13 16:30:49 +02:00
Clinton Gormley	599727e38f	Fixed bad ASCIIDOC	2016-07-13 16:09:41 +02:00
Martijn van Groningen	2c3165d080	Removed deprecated 1.x script and template syntax Closes #13729	2016-07-13 15:07:36 +02:00
Jason Tedor	ce5a382c69	Remove support for properties This commit removes support for properties syntax and config files: - removed support for elasticsearch.properties - removed support for logging.properties - removed support for properties content detection in REST APIs - removed support for properties content detection in Java API Relates #19398	2016-07-12 17:55:18 -04:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Martijn van Groningen	ff5527f037	percolator: Forbid the usage or `range` queries with a range based on the current time If there are percolator queries containing `range` queries with ranges based on the current time then this can lead to incorrect results if the `percolate` query gets cached. These ranges are changing each time the `percolate` query gets executed and if this query gets cached then the results will be based on how the range was at the time when the `percolate` query got cached. The ExtractQueryTermsService has been renamed `QueryAnalyzer` and now only deals with analyzing the query (extracting terms and deciding if the entire query is a verified match) . The `PercolatorFieldMapper` is responsible for adding the right fields based on the analysis the `QueryAnalyzer` has performed, because this is highly dependent on the field mappings. Also the `PercolatorFieldMapper` is responsible for creating the percolate query.	2016-07-08 14:20:56 +02:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Jim Ferenczi	dcf6a96725	Add doc values support to the _size field in the mapper-size plugin This change activates the doc_values on the _size field for indices created after 5.0.0-alpha4. It also adds a note in the breaking changes that explain the situation and how to get around it. Closes #18334	2016-07-05 14:47:58 +02:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitation, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the nodes data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). I It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` paramater from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
David Pilato	527a9c7f48	Deprecate discovery-azure and rename it to discovery-azure-classic As discussed at https://github.com/elastic/elasticsearch-cloud-azure/issues/91#issuecomment-229113595, we know that the current `discovery-azure` plugin only works with Azure Classic VMs / Services (which is somehow Legacy now). The proposal here is to rename `discovery-azure` to `discovery-azure-classic` in case some users are using it. And deprecate it for 5.0. Closes #19144.	2016-06-30 14:42:40 +02:00
Nik Everett	8db43c0107	Move RestHandler registration to ActionModule and ActionPlugin `RestHandler`s are highly tied to actions so registering them in the same place makes sense. Removes the need to for plugins to check if they are in transport client mode before registering a RestHandler - `getRestHandlers` isn't called at all in transport client mode. This caused guice to throw a massive fit about the circular dependency between NodeClient and the allocation deciders. I broke the circular dependency by registering the actions map with the node client after instantiation.	2016-06-29 18:31:44 -04:00
Jason Tedor	00356edd33	Clarify time units usage in docs This commit clarifies the distinction between supported time units for durations and supported time units for durations in the docs. Relates #19159	2016-06-29 17:02:15 -04:00
Nik Everett	fa4844c3f4	Pull actions from plugins Instead of implementing onModule(ActionModule) to register actions, this has plugins implement ActionPlugin to declare actions. This is yet another step in cleaning up the plugin infrastructure. While I was in there I switched AutoCreateIndex and DestructiveOperations to be eagerly constructed which makes them easier to use when de-guice-ing the code base.	2016-06-28 08:36:24 -04:00
Jason Tedor	2f638b5a23	Keep input time unit when parsing TimeValues This commit modifies TimeValue parsing to keep the input time unit. This enables round-trip parsing from instances of String to instances of TimeValue and vice-versa. With this, this commit removes support for the unit "w" representing weeks, and also removes support for fractional values of units (e.g., 0.5s). Relates #19102	2016-06-27 18:41:18 -04:00
Ryan Ernst	a07a3a9333	Add migration docs for MapperPlugin	2016-06-27 11:22:07 -07:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit 2f46f53dc8feb78412e6d648751ffe97b1e35119.	2016-06-27 17:20:32 +02:00
Damien Alexandre	fec4a18835	Rename plainless into painless in migration doc The scripting language was wrongly named.	2016-06-26 17:41:34 +02:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins in with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing anslysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is setup. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnslysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This is means it is available at module construction time so we slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Clinton Gormley	5a08e36f9c	Update migrate_5_0.asciidoc Updated breaking changes to state that upgraded indices still need to be reindexed, and to mention the migration plugin	2016-06-23 13:10:50 +02:00
Tanguy Leroux	04da1bda0d	Move templates out of the Search API, into lang-mustache module This commit moves template support out of the Search API to its own dedicated Search Template API in the lang-mustache module. It provides a new SearchTemplateAction that can be used to render templates before it gets delegated to the usual Search API. The current REST endpoint are identical, but the Render Search Template endpoint now uses the same Search Template API with a new "simulate" option. When this option is enabled, the Search Template API only renders template and returns immediatly, without executing the search. Closes #17906	2016-06-23 09:30:53 +02:00
Nik Everett	02761f5fe0	Docs: migration notes for _timestamp and _ttl We aren't able to actually create an index with _timestamp enabled to test the migration, or, at least, we won't be able to after #18980 is re-merged. But the docs are still ok. Closes #19007	2016-06-22 14:43:12 -04:00
Nik Everett	6574243077	Fail to start if plugin tries broken onModule If a plugin declares `onModule(SomethingThatIsntAModule)` then refuse to start. Before this commit we just logged a warning that flies by in the console and is easy to miss. You can't miss refusing to start!	2016-06-22 12:20:52 -04:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Martijn van Groningen	5dc88ffd26	docs: added note the inner hits migrate section	2016-06-22 08:29:50 +02:00
Clinton Gormley	2f2ea0c280	Improved docs explaining the index upgrade process in breaking changes	2016-06-21 18:03:19 +02:00
Clinton Gormley	70482d1e39	Update java.asciidoc Fixed asciidoc	2016-06-21 16:02:25 +02:00
Martijn van Groningen	5ad2fdaa8e	inner_hits: Don't include `_id`, `_type` and `_index` keys in search response for inner hits Closes #18091	2016-06-21 14:13:38 +02:00
Jim Ferenczi	423291b6bc	Change default similarity to BM25 The default similarity was set to `classic` which refers to TFIDF and has not been moved after the upgrade to Lucene 6. Though moving to BM25 could have some downside for queries that relies on coordination factor (match_query, multi_match_query) ? relates #18944	2016-06-21 11:29:36 +02:00
Jason Tedor	d09d89f8c5	Remove only node preference This commit removes the search preference _only_node as the same functionality can be obtained by using the search preference _only_nodes. This commit also adds a test that ensures that _only_nodes will continue to support specifying node IDs. Relates #18875	2016-06-17 15:27:46 -04:00
Jason Tedor	245def80f0	Add note that thread pool settings are node-level This commit adds a note to the breaking changes docs that since commit da74323141a69bd06fc19a70bc072143e9f0e986, thread pool settings are no longer cluster-level settings and thus not dynamically updatable.	2016-06-17 15:19:52 -04:00
Jim Ferenczi	fb2a48d0f0	Revert "Remove support for sorting terms aggregation by ascending count" This is delayed after alpha4 since Kibana relies on it.	2016-06-17 17:14:01 +02:00
Jim Ferenczi	755721953b	Remove support for sorting terms aggregation by ascending count closes #17614	2016-06-17 15:06:49 +02:00
Jason Tedor	e96722d91c	Add search preference to prefer multiple nodes The search preference _prefer_node allows specifying a single node to prefer when routing a request. This functionality can be enhanced by permitting multiple nodes to be preferred. This commit replaces the search preference _prefer_node with the search preference _prefer_nodes which supplants the former by specifying a single node and otherwise adds functionality. Relates #18872	2016-06-14 21:34:24 -04:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Nik Everett	4b21157906	Remove setRefresh It has been replaced with `setRefreshPolicy` which has support for waiting until refresh with `setRefreshPolicy(WAIT_FOR)`. Related to #1063	2016-06-08 13:50:59 -04:00
Lee Hinman	c637fea84b	Change the default of `include_global_state` from true to false for restores This changes the default value to be false only for restore operations. Resolves #18569	2016-06-08 10:48:36 -06:00
Lee Hinman	762bbdbd0c	Revert "Change the default of `include_global_state` from true to false." This reverts commit 052a62250ca880f4e5d2746822ac58139b36efda.	2016-06-07 15:07:37 -06:00
Lee Hinman	052a62250c	Change the default of `include_global_state` from true to false. Resolves #18569	2016-06-07 15:06:20 -06:00
Lee Hinman	32bd869b28	Merge remote-tracking branch 'dakrone/no-cluster-name-in-path'	2016-06-07 10:14:23 -06:00
Lee Hinman	feb244c14a	Remove cluster name from data path Previously Elasticsearch used $DATA_DIR/$CLUSTER_NAME/nodes for the path where data is stored, this commit changes that to be $DATA_DIR/nodes. On startup, if the old folder structure is detected it will be used. This behavior will be removed in Elasticsearch 6.0 Resolves #17810	2016-06-07 10:13:48 -06:00

1 2 3 4 5 ...

547 Commits