OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-03 09:29:11 +00:00

Author	SHA1	Message	Date
Folusho Oladipo	1e7495a7fa	corrected the use of two synonymous words (#19498) Two synonyms were jointly used in the sentence (i.e. "problems" and "issues"), so I deleted one of them.	2016-07-21 12:21:12 +02:00
Jun Ohtani	cebad703fe	Analyze: Specify anonymous char_filters/tokenizer/token_filters in the analyze API Add parser for anonymous char_filters/tokenizer/token_filters Using Settings in AnalyzeRequest for anonymous definition Add breaking changes document Closed #8878	2016-07-21 11:06:36 +09:00
Nik Everett	3a82c613e4	Migrate query registration from push to pull Remove `ParseField` constants used for names where there are no deprecated names and just use the `String` version of the registration method instead. This is step 2 in cleaning up the plugin interface for extending search time actions. Aggregations are next. This is breaking for plugins because those that register a new query should now implement `SearchPlugin` rather than `onModule(SearchModule)`.	2016-07-20 12:33:51 -04:00
Adrien Grand	1ed6c5d110	Docs: Add more points to the chart that gives accuracy for the cardinality aggregation. This also adds instructions how to regenerate the chart.	2016-07-20 10:37:12 +02:00
Adrien Grand	37d5bcb264	Clarify `function_score` docs. Closes #18315	2016-07-19 10:25:48 +02:00
Nik Everett	d573541f66	Support requests_per_second=-1 to mean no throttling in reindex This is entirely on the REST level, Float.POSITIVE_INFINITY is still how you get no throttling over the transport api. Closes #19089	2016-07-18 13:05:06 -04:00
Colin Goodheart-Smithe	b717ad8eb6	Enable option to use request cache for size > 0 Previously if the size of the search request was greater than zero we would not cache the request in the request cache. This change retains the default behaviour of not caching requests with size > 0 but also allows the `request_cache=true` query parameter to enable the cache for requests with size > 0	2016-07-18 13:33:59 +01:00
Adrien Grand	398d70b567	Add `scaled_float`. #19264 This is an attempt to revive #15939 motivated by elastic/beats#1941. Half-floats are a pretty bad option for storing percentages. They would likely require 2 bytes all the time while they don't need more than one byte. So this PR exposes a new `scaled_float` type that requires a `scaling_factor` and internally indexes `value*scaling_factor` in a long field. Compared to the original PR it exposes a lower-level API so that the trade-offs are clearer and avoids any reference to fixed precision that might imply that this type is more accurate (actually it is *less* accurate). In addition to being more space-efficient for some use-cases that beats is interested in, this is also faster than `half_float` unless we can improve the efficiency of decoding half-float bits (which is currently done using software) or until Java gets first-class support for half-floats.	2016-07-18 12:36:23 +02:00
Adrien Grand	bde99bad2e	Use a static default precision for the cardinality aggregation. #19215 Today the default precision for the cardinality aggregation depends on how many parent bucket aggregations it had. The reasoning was that the more parent bucket aggregations, the more buckets the cardinality had to be computed on. And this number could be huge depending on what the parent aggregations actually are. However now that we run terms aggregations in breadth-first mode by default when there are sub aggregations, it is less likely that we have to run the cardinality aggregation on kagilions of buckets. So we could use a static default, which will be less confusing to users.	2016-07-18 11:30:41 +02:00
Martijn van Groningen	e0ebf5da1c	Template cleanup: * Removed `Template` class and unified script & template parsing logic. Templates are scripts, so they should be defined as a script. Unless there will be separate template infrastructure, templates should share as much code as possible with scripts. * Removed ScriptParseException in favour of ElasticsearchParseException * Moved TemplateQueryBuilder to lang-mustache module because this query is hard coded to work with mustache only	2016-07-18 10:16:01 +02:00
Clinton Gormley	d2f25416e4	Update node.asciidoc Typo	2016-07-17 21:31:35 +02:00
Clinton Gormley	49d0f3406c	Update node.asciidoc Master nodes must have access to a persistent data directory	2016-07-17 21:10:33 +02:00
Nik Everett	777ea124c7	Fix health docs test It failed inconsistently when there were pending tasks.	2016-07-16 07:18:11 -04:00
Nik Everett	9f78f8cc91	Convert snippets in health docs to CONSOLE This should make them easier to read and adds them to the test suite. I changed the example from a two node cluster to a single node cluster because that is what we have running in the integration tests. It is also what a user just starting out is likely to see so I think that is ok.	2016-07-15 16:31:37 -04:00
Nik Everett	7aeea764ba	Remove wait_for_status=yellow from the docs It is no longer required after 687e2e12b31ed3c12ef4c411333bff9da58fc808.	2016-07-15 16:02:07 -04:00
Clinton Gormley	6f17736eb1	Fixed asciidoc	2016-07-15 12:58:38 +02:00
Clinton Gormley	05271d58ca	Updated fielddata docs to make it easier for users with old mappings	2016-07-14 19:58:12 +02:00
Zachary Tong	c950ea0023	Record method counts while profiling (#18302) Invocation counts can be used to help judge the selectivity of individual query components in the context of the entire query. E.g. a query may not look selective when run by itself (matches most of the index), but when run in context of a full search request, is evaluated only rarely due to execution order. Since this is modifying the base timing class, it'll enrich both query and agg profiles (as well as future profile results)	2016-07-14 09:46:24 -04:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode is only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implementation since netty is now a module. Yet, to make the user experience flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just start up with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Boaz Leskes	ef33183a19	update migration docs to include removal of `netty.epollBugWorkaround`	2016-07-14 12:20:35 +02:00
Martijn van Groningen	1bc12f5214	docs: fix broken link Closes #19430	2016-07-14 11:12:47 +02:00
Tal Levy	8fd01554bc	update foreach processor to only support one applied processor. (#19402) Closes #19345.	2016-07-13 13:13:00 -07:00
Clinton Gormley	1e2d0c1000	More bad asciidoc	2016-07-13 16:30:49 +02:00
Clinton Gormley	599727e38f	Fixed bad ASCIIDOC	2016-07-13 16:09:41 +02:00
Clinton Gormley	ab7a976e49	Make Prefer Parameters admon block linkable	2016-07-13 16:02:34 +02:00
Martijn van Groningen	2c3165d080	Removed deprecated 1.x script and template syntax Closes #13729	2016-07-13 15:07:36 +02:00
Lee Hinman	95cf2407ee	Merge remote-tracking branch 'dakrone/include-cluster-info-in-explain-api'	2016-07-12 16:26:46 -06:00
Jason Tedor	ce5a382c69	Remove support for properties This commit removes support for properties syntax and config files: - removed support for elasticsearch.properties - removed support for logging.properties - removed support for properties content detection in REST APIs - removed support for properties content detection in Java API Relates #19398	2016-07-12 17:55:18 -04:00
Lee Hinman	58db63b610	Expose the ClusterInfo object in the allocation explain output This adds an optional parameter to the cluster allocation explain API that will return the cluster info object, `include_disk_info`, the output looks like: GET /_cluster/allocation/explain?include_disk_info -d' {"index": "i", "shard": 0, "primary": false}' { ... other info ... "cluster_info" : { "nodes" : { "7Uws-vL7R6WVm3ZwQA1n5A" : { "node_name" : "Kraven the Hunter", "least_available" : { "path" : "/path/to/data1", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 }, "most_available" : { "path" : "/path/to/data2", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 } } }, "shard_sizes" : { "[i][2][p]_bytes" : 0, "[i][4][p]_bytes" : 130, "[i][1][p]_bytes" : 0, "[i][3][p]_bytes" : 0, "[i][0][p]_bytes" : 130 }, "shard_paths" : { "[i][3], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=LegZLDniTVaw0Y1urv7s3g]" : "/path/to/data1/nodes/0", "[i][1], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=lAU_4vf_SKmoRdtg0ACnjQ]" : "/path/to/data1/nodes/0", "[i][2], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Aurpeuj7SeGeyPDDpCtRgg]" : "/path/to/data1/nodes/0", "[i][0], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Vgg8GlQTQ82C2j6HYBq8DQ]" : "/path/to/data1/nodes/0", "[i][4], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=t8hQlVSxQe-58fSeaXcAqg]" : "/path/to/data1/nodes/0" } } } Resolves #14405	2016-07-12 15:52:20 -06:00
Michael Sander	c493774093	Fix typo in cluster module docs This commit fixes a simple typo in the cluster module docs. Closes #19393	2016-07-12 16:32:23 -04:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Sho Minagawa	6aa598e3fb	Fix typo on analyze.asciidoc (#19354)	2016-07-11 15:49:39 +02:00
Clinton Gormley	982e01d463	Update network.asciidoc `network.publish_host` defaults to `network.host`, not `network.bind_host` Closes #19304	2016-07-08 17:13:10 +02:00
Jason Tedor	527980c995	Fix nesting of stopping docs This commit fixes errant nesting of the stopping docs due to using a section header instead of a chapter header at the top of the stopping docs.	2016-07-08 10:43:35 -04:00
Martijn van Groningen	ff5527f037	percolator: Forbid the usage of `range` queries with a range based on the current time If there are percolator queries containing `range` queries with ranges based on the current time then this can lead to incorrect results if the `percolate` query gets cached. These ranges are changing each time the `percolate` query gets executed and if this query gets cached then the results will be based on how the range was at the time when the `percolate` query got cached. The ExtractQueryTermsService has been renamed `QueryAnalyzer` and now only deals with analyzing the query (extracting terms and deciding if the entire query is a verified match). The `PercolatorFieldMapper` is responsible for adding the right fields based on the analysis the `QueryAnalyzer` has performed, because this is highly dependent on the field mappings. Also the `PercolatorFieldMapper` is responsible for creating the percolate query.	2016-07-08 14:20:56 +02:00
Glen Smith	d7099f05b9	slight clarification	2016-07-07 20:46:18 -04:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Jason Tedor	c05f818160	Fix casing of "Elasticsearch" in how-to docs	2016-07-07 12:33:27 -04:00
Adrien Grand	873661df17	Fix typo.	2016-07-07 17:49:01 +02:00
Adrien Grand	f295a218a0	Add notes about sparsity.	2016-07-07 17:47:19 +02:00
Clinton Gormley	ee86a9f634	Update field-stats.asciidoc Change use of index constraints to correctly identify any indices containing relevant docs Closes #19232	2016-07-07 14:56:40 +02:00
Nik Everett	b3c015e2bb	Reindex from remote This adds a remote option to reindex that looks like ``` curl -XPOST 'localhost:9200/_reindex?pretty' -d'{ "source": { "remote": { "host": "http://otherhost:9200" }, "index": "target", "query": { "match": { "foo": "bar" } } }, "dest": { "index": "target" } }' ``` This reindex has all of the features of local reindex: * Using queries to filter what is copied * Retry on rejection * Throttle/rethrottle The big advantage of this version is that it goes over the HTTP API which can be made backwards compatible. Some things are different: The query field is sent directly to the other node rather than parsed on the coordinating node. This should allow it to support constructs that are invalid on the coordinating node but are valid on the target node. Mostly, that means old syntax.	2016-07-05 16:13:17 -04:00
Christoph Wurm	c9da56dc80	Reword Refresh API reference (#19270)	2016-07-05 18:37:28 +02:00
Britta Weber	f36c1b4e60	Update fielddata.asciidoc	2016-07-05 16:21:52 +02:00
Jim Ferenczi	dcf6a96725	Add doc values support to the _size field in the mapper-size plugin This change activates the doc_values on the _size field for indices created after 5.0.0-alpha4. It also adds a note in the breaking changes that explain the situation and how to get around it. Closes #18334	2016-07-05 14:47:58 +02:00
Christoph Wurm	768beea6c7	Update refresh.asciidoc Fix grammar and example	2016-07-05 13:49:25 +02:00
Christoph Wurm	d1727653dd	Update shrink-index.asciidoc Fix half-finished sentence	2016-07-05 13:34:58 +02:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitations, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the node's data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. 
Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` parameter from `MetaDataStateFormat#write`, it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Clinton Gormley	f572f8cc17	Bad asciidoc link	2016-07-04 11:02:06 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
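
Note on commit 398d70b567 (Add `scaled_float`): the message says the type indexes `value*scaling_factor` in a long field and is less accurate than a plain float. A minimal Python sketch of that idea follows. It assumes the product is rounded to the nearest integer, which the commit message does not state, and the names `encode_scaled` and `decode_scaled` are hypothetical, not part of any Elasticsearch API.

```python
def encode_scaled(value: float, scaling_factor: float) -> int:
    """Store a float as an integer: value * scaling_factor.

    Rounding to the nearest integer is an assumption made for this sketch.
    """
    return round(value * scaling_factor)


def decode_scaled(stored: int, scaling_factor: float) -> float:
    """Recover an approximation of the original value."""
    return stored / scaling_factor


if __name__ == "__main__":
    factor = 100  # keeps two decimal places, e.g. for percentages
    stored = encode_scaled(0.4249, factor)
    # Digits beyond the scaling factor's resolution are lost.
    print(stored, decode_scaled(stored, factor))
```

With a scaling factor of 100, any two inputs that agree to two decimal places map to the same stored integer, which is the accuracy trade-off the commit message points out.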

2799 Commits
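
Note on commit 6861d3571e (Persistent Node Ids): the message describes a stricter join rule, where an incoming join is rejected if a node already exists with the same id but a different address. The Python sketch below illustrates only that rule, under stated assumptions: the `Node` class and `should_reject_join` function are hypothetical names for illustration and do not mirror the actual Elasticsearch classes, and the removal of unresponsive nodes mentioned in the message is not modelled.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Node:
    node_id: str   # persisted in the node's data folder, per the commit message
    address: str   # published transport address


def should_reject_join(existing: Iterable[Node], incoming: Node) -> bool:
    """Reject when a known node has the same id but a different address."""
    return any(
        n.node_id == incoming.node_id and n.address != incoming.address
        for n in existing
    )


if __name__ == "__main__":
    cluster = [Node("id-a", "10.0.0.1:9300"), Node("id-b", "10.0.0.2:9300")]
    # A copied data folder would produce the same id at a new address.
    print(should_reject_join(cluster, Node("id-a", "10.0.0.9:9300")))
```

A node rejoining with the same id and the same address passes this check, which matches the restart case the commit message is designed to support.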