OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	bdebd02d8c	Only write forced_refresh if we forced a refresh Otherwise it just adds noise to the response. Closes #19629	2016-07-29 15:00:30 -04:00
Martijn van Groningen	a91bb29585	ingest: Made the response format of the get pipeline api match with the response format of the index template api Closes #19585	2016-07-29 17:58:30 +02:00
Colin Goodheart-Smithe	3f13f02575	[DOCS] updated documentation for transport client changes Updated dependency in Java API docs and added section in breaking changes	2016-07-29 14:25:12 +01:00
Clinton Gormley	1cbe3bf2ad	Updated release notes to use breaking-java tag	2016-07-29 14:43:31 +02:00
Clinton Gormley	3922392218	Added release notes for 5.0.0-alpha5	2016-07-29 14:43:31 +02:00
Adrien Grand	dcc598c414	Make the heuristic to compute the default shard size less aggressive. The current heuristic to compute a default shard size is pretty aggressive, it returns `max(10, number_of_shards * size)` as a value for the shard size. I think making it less aggressive has the benefit that it would reduce the likelyness of running into OOME when there are many shards (yearly aggregations with time-based indices can make numbers of shards in the thousands) and make the use of breadth-first more likely/efficient. This commit replaces the heuristic with `size * 1.5 + 10`, which is enough to have good accuracy on zipfian distributions.	2016-07-29 09:59:29 +02:00
Brandon Wulf	6b7d40929c	Switch example from inclusion to exclusion. Page is explaining allocation exclusion- example should be about exclusion as well.	2016-07-28 21:54:22 -04:00
Areek Zillur	69941931c7	Merge pull request #19610 from areek/enhancement/19484 Add zero-padding to auto-generated rollover index name increment	2016-07-28 11:44:50 -04:00
Nik Everett	f159156931	[docs] Deprecate found and created (#19633 ) These parts of delete and index response have been replaced with the operation field.	2016-07-28 10:20:48 -04:00
Clinton Gormley	3c639f0673	Removed array-of-string example from search template Relates to #19643	2016-07-28 13:49:53 +02:00
markwalkom	ebf96bbc35	Update gateway.asciidoc (#19572 ) * Update gateway.asciidoc Added a note to clarify that, in cases where nodes in a cluster have different setting, the node that is the elected master takes precedence over anything else. * Update gateway.asciidoc Updated as per @bleskes's comments	2016-07-28 13:09:05 +02:00
kingrhoton	1307aa7e77	clarify awkward text (#19608 )	2016-07-27 20:03:20 +02:00
Jared McQueen	d97b3fd817	[docs] missing a comma in the terms aggregation example	2016-07-27 12:59:38 -04:00
Clinton Gormley	8315a64a33	provide code example for processors setting A simple example but was missing Closes #19567	2016-07-27 17:54:52 +02:00
Areek Zillur	4e3602a790	Add zero-padding to auto-generated rollover index name increment closes #19484	2016-07-27 10:50:47 -04:00
Isabel Drost-Fromm	2a148a5b25	Update to current format.	2016-07-27 14:30:14 +02:00
Martijn van Groningen	2fdf79d8d4	Deprecate template query. Closes #19390	2016-07-27 09:50:44 +02:00
Colin Goodheart-Smithe	3f344d3154	[DOCS] fix documentation for selecting algorithm for percentiles agg	2016-07-27 08:48:51 +01:00
Martijn van Groningen	24d7fa6d54	ingest: Change the `foreach` processor to use the `_ingest._value` ingest metadata attribute to store the current array element being processed. Closes #19592	2016-07-27 09:35:09 +02:00
kingrhoton	643ccb8cc1	[docs] Switch contraction to possesive	2016-07-26 14:01:30 -04:00
Nik Everett	3c0288ee98	Consolify term and phrase suggester docs This includes a working example of reverse filters to support correcting prefix errors.	2016-07-26 12:28:31 -04:00
Alexander Lin	8f2882a442	Add _operation field to index, update, delete responses Performing the bulk request shown in #19267 now results in the following: ``` {"_index":"test","_type":"test","_id":"1","_version":1,"_operation":"create","forced_refresh":false,"_shards":{"total":2,"successful":1,"failed":0},"status":201} {"_index":"test","_type":"test","_id":"1","_version":1,"_operation":"noop","forced_refresh":false,"_shards":{"total":2,"successful":1,"failed":0},"status":200} ```	2016-07-26 11:16:19 -04:00
Colin Goodheart-Smithe	7ed64af639	[DOCS] fix callout in buckets path docs	2016-07-26 11:33:54 +01:00
Isabel Drost-Fromm	1080df51fc	Merge branch 'master' into docs/add_console_to_search	2016-07-26 11:29:35 +02:00
Colin Goodheart-Smithe	2c12c3e628	Add _bucket_count option to buckets_path This change adds a new special path to the buckets_path syntax `_bucket_count`. This new option will return the number of buckets for a multi-bucket aggregation, which can then be used in pipeline aggregations. Closes #19553	2016-07-26 09:28:21 +01:00
Lee Hinman	1623cff6c0	Merge remote-tracking branch 'dakrone/bucket-circuit-breaker'	2016-07-25 13:37:26 -06:00
Lee Hinman	124a9fabe3	Circuit break on aggregation bucket numbers with request breaker This adds new circuit breaking with the "request" breaker, which adds circuit breaks based on the number of buckets created during aggregations. It consists of incrementing during AggregatorBase creation This also bumps the REQUEST breaker to 60% of the JVM heap now. The output when circuit breaking an aggregation looks like: ```json { "shard" : 0, "index" : "i", "node" : "a5AvjUn_TKeTNYl0FyBW2g", "reason" : { "type" : "exception", "reason" : "java.util.concurrent.ExecutionException: QueryPhaseExecutionException[Query Failed [Failed to execute main query]]; nested: CircuitBreakingException[[request] Data too large, data for [<agg [otherthings]>] would be larger than limit of [104857600/100mb]];", "caused_by" : { "type" : "execution_exception", "reason" : "QueryPhaseExecutionException[Query Failed [Failed to execute main query]]; nested: CircuitBreakingException[[request] Data too large, data for [<agg [myagg]>] would be larger than limit of [104857600/100mb]];", "caused_by" : { "type" : "circuit_breaking_exception", "reason" : "[request] Data too large, data for [<agg [otherthings]>] would be larger than limit of [104857600/100mb]", "bytes_wanted" : 104860781, "bytes_limit" : 104857600 } } } } ``` Relates to #14046	2016-07-25 11:33:37 -06:00
Isabel Drost-Fromm	00a8516780	Merge branch 'master' into docs/add_console_to_search	2016-07-25 11:54:26 +02:00
Boaz Leskes	cd596772ee	Persistent Node Names (#19456 ) With #19140 we started persisting the node ID across node restarts. Now that we have a "stable" anchor, we can use it to generate a stable default node name and make it easier to track nodes over a restarts. Sadly, this means we will not have those random fun Marvel characters but we feel this is the right tradeoff. On the implementation side, this requires a bit of juggling because we now need to read the node id from disk before we can log as the node node is part of each log message. The PR move the initialization of NodeEnvironment as high up in the starting sequence as possible, with only one logging message before it to indicate we are initializing. Things look now like this: ``` [2016-07-15 19:38:39,742][INFO ][node ] [_unset_] initializing ... [2016-07-15 19:38:39,826][INFO ][node ] [aAmiW40] node name set to [aAmiW40] by default. set the [node.name] settings to change it [2016-07-15 19:38:39,829][INFO ][env ] [aAmiW40] using [1] data paths, mounts [[ /(/dev/disk1)]], net usable_space [5.5gb], net total_space [232.6gb], spins? [unknown], types [hfs] [2016-07-15 19:38:39,830][INFO ][env ] [aAmiW40] heap size [1.9gb], compressed ordinary object pointers [true] [2016-07-15 19:38:39,837][INFO ][node ] [aAmiW40] version[5.0.0-alpha5-SNAPSHOT], pid[46048], build[473d3c0/2016-07-15T17:38:06.771Z], OS[Mac OS X/10.11.5/x86_64], JVM[Oracle Corporation/Java HotSpot(TM) 64-Bit Server VM/1.8.0_51/25.51-b03] [2016-07-15 19:38:40,980][INFO ][plugins ] [aAmiW40] modules [percolator, lang-mustache, lang-painless, reindex, aggs-matrix-stats, lang-expression, ingest-common, lang-groovy, transport-netty], plugins [] [2016-07-15 19:38:43,218][INFO ][node ] [aAmiW40] initialized ``` Needless to say, settings `node.name` explicitly still works as before. The commit also contains some clean ups to the relationship between Environment, Settings and Plugins. The previous code suggested the path related settings could be changed after the initial Environment was changed. This did not have any effect as the security manager already locked things down.	2016-07-23 22:46:48 +02:00
Folusho Oladipo	1e7495a7fa	corrected the use of two synonymous words (#19498 ) Two synonyms were jointly used in the sentence(i.e "problems" and "issues"), so I deleted one of them.	2016-07-21 12:21:12 +02:00
Jun Ohtani	cebad703fe	Analyze: Specify anonymous char_filters/tokenizer/token_filters in the analyze API Add parser for anonymous char_filters/tokenizer/token_filters Using Settings in AnalyzeRequest for anonymous definition Add breaking changes document Closed #8878	2016-07-21 11:06:36 +09:00
Nik Everett	3a82c613e4	Migrate query registration from push to pull Remove `ParseField` constants used for names where there are no deprecated names and just use the `String` version of the registration method instead. This is step 2 in cleaning up the plugin interface for extending search time actions. Aggregations are next. This is breaking for plugins because those that register a new query should now implement `SearchPlugin` rather than `onModule(SearchModule)`.	2016-07-20 12:33:51 -04:00
Adrien Grand	1ed6c5d110	Docs: Add more points to the chart that gives accuracy for the cardinality aggregation. This also adds instructions how to regenerate the chart.	2016-07-20 10:37:12 +02:00
Adrien Grand	37d5bcb264	Clarify `function_score` docs. Closes #18315	2016-07-19 10:25:48 +02:00
Nik Everett	d573541f66	Support requests_per_second=-1 to mean no throttling in reindex This is entirely on the REST level, Float.POSITIVE_INFINITY is still how you get no throttling over the transport api. Closes #19089	2016-07-18 13:05:06 -04:00
Colin Goodheart-Smithe	b717ad8eb6	Enable option to use request cache for size > 0 Previously if the size of the search request was greater than zero we would not cache the request in the request cache. This change retains the default behaviour of not caching requests with size > 0 but also allows the `request_cache=true` query parameter to enable the cache for requests with size > 0	2016-07-18 13:33:59 +01:00
Adrien Grand	398d70b567	Add `scaled_float`. #19264 This is a tentative to revive #15939 motivated by elastic/beats#1941. Half-floats are a pretty bad option for storing percentages. They would likely require 2 bytes all the time while they don't need more than one byte. So this PR exposes a new `scaled_float` type that requires a `scaling_factor` and internally indexes `valuescaling_factor` in a long field. Compared to the original PR it exposes a lower-level API so that the trade-offs are clearer and avoids any reference to fixed precision that might imply that this type is more accurate (actually it is less* accurate). In addition to being more space-efficient for some use-cases that beats is interested in, this is also faster that `half_float` unless we can improve the efficiency of decoding half-float bits (which is currently done using software) or until Java gets first-class support for half-floats.	2016-07-18 12:36:23 +02:00
Adrien Grand	bde99bad2e	Use a static default precision for the cardinality aggregation. #19215 Today the default precision for the cardinality aggregation depends on how many parent bucket aggregations it had. The reasoning was that the more parent bucket aggregations, the more buckets the cardinality had to be computed on. And this number could be huge depending on what the parent aggregations actually are. However now that we run terms aggregations in breadth-first mode by default when there are sub aggregations, it is less likely that we have to run the cardinality aggregation on kagilions of buckets. So we could use a static default, which will be less confusing to users.	2016-07-18 11:30:41 +02:00
Martijn van Groningen	e0ebf5da1c	Template cleanup: * Removed `Template` class and unified script & template parsing logic. Templates are scripts, so they should be defined as a script. Unless there will be separate template infrastructure, templates should share as much code as possible with scripts. * Removed ScriptParseException in favour for ElasticsearchParseException * Moved TemplateQueryBuilder to lang-mustache module because this query is hard coded to work with mustache only	2016-07-18 10:16:01 +02:00
Clinton Gormley	d2f25416e4	Update node.asciidoc Typo	2016-07-17 21:31:35 +02:00
Clinton Gormley	49d0f3406c	Update node.asciidoc Master nodes must have access to a persistent data directory	2016-07-17 21:10:33 +02:00
Nik Everett	777ea124c7	Fix health docs test It failed inconsistently when there were pending tasks.	2016-07-16 07:18:11 -04:00
Nik Everett	9f78f8cc91	Convert snippets in health docs to CONSOLE This should make them easier to read and adds them to the test suite I changed the example from a two node cluster to a single node cluster because that is what we have running in the integration tests. It is also what a user just starting out is likely to see so I think that is ok.	2016-07-15 16:31:37 -04:00
Nik Everett	7aeea764ba	Remove wait_for_status=yellow from the docs It is no longer required after `687e2e12b3`.	2016-07-15 16:02:07 -04:00
Clinton Gormley	6f17736eb1	Fixed asciidoc	2016-07-15 12:58:38 +02:00
Clinton Gormley	05271d58ca	Updated fielddata docs to make it easier for users with old mappings	2016-07-14 19:58:12 +02:00
Zachary Tong	c950ea0023	Record method counts while profiling (#18302 ) Invocation counts can be used to help judge the selectivity of individual query components in the context of the entire query. E.g. a query may not look selective when run by itself (matches most of the index), but when run in context of a full search request, is evaluated only rarely due to execution order Since this is modifying the base timing class, it'll enrich both query and agg profiles (as well as future profile results)	2016-07-14 09:46:24 -04:00
Simon Willnauer	5616251f22	Remove `node.mode` and `node.local` settings (#19428 ) Today `node.mode` and `node.local` serve almost the same purpose, they are a shortcut for `discovery.type` and `transport.type`. If `node.local: true` or `node.mode: local` is set elasticsearch will start in _local_ mode which means only nodes within the same JVM are discovered and a non-network based transport is used. The _local_ mode it only really used in tests or if nodes are embedded. For both, embedding and tests explicit configuration via `discovery.type` and `transport.type` should be preferred. This change removes all the usage of these settings and by-default doesn't configure a default transport implemenation since netty is now a module. Yet, to make the user expericence flawless, plugins or modules can set a `http.type.default` and `transport.type.default`. Plugins set this via `PluginService#additionalSettings()` which enforces _set-once_ which prevents node startup if set multiple times. This means that our distributions will just startup with netty transport since it's packaged as a module unless `transport.type` or `http.transport.type` is explicitly set. This change also found a bunch of bugs since several NamedWriteables were not registered if a transport client is used. Now that we don't rely on the `node.mode` leniency which is inherited instead of using explicit settings, `TransportClient` uses `AssertingLocalTransport` which detects these problems since it serializes all messages. Closes #16234	2016-07-14 13:21:10 +02:00
Boaz Leskes	ef33183a19	update migration docs to include removal of `netty.epollBugWorkaround`	2016-07-14 12:20:35 +02:00
Martijn van Groningen	1bc12f5214	docs: fix broken link Closes #19430	2016-07-14 11:12:47 +02:00
Tal Levy	8fd01554bc	update foreach processor to only support one applied processor. (#19402 ) Closes #19345.	2016-07-13 13:13:00 -07:00
Clinton Gormley	1e2d0c1000	More bad asciidoc	2016-07-13 16:30:49 +02:00
Clinton Gormley	599727e38f	Fixed bad ASCIIDOC	2016-07-13 16:09:41 +02:00
Clinton Gormley	ab7a976e49	Make Prefer Parameters admon block linkable	2016-07-13 16:02:34 +02:00
Martijn van Groningen	2c3165d080	Removed deprecated 1.x script and template syntax Closes #13729	2016-07-13 15:07:36 +02:00
Lee Hinman	95cf2407ee	Merge remote-tracking branch 'dakrone/include-cluster-info-in-explain-api'	2016-07-12 16:26:46 -06:00
Jason Tedor	ce5a382c69	Remove support for properties This commit removes support for properties syntax and config files: - removed support for elasticsearch.properties - removed support for logging.properties - removed support for properties content detection in REST APIs - removed support for properties content detection in Java API Relates #19398	2016-07-12 17:55:18 -04:00
Lee Hinman	58db63b610	Expose the ClusterInfo object in the allocation explain output This adds an optional parameter to the cluster allocation explain API that will return the cluster info object, `include_disk_info`, the output looks like: GET /_cluster/allocation/explain?include_disk_info -d' {"index": "i", "shard": 0, "primary": false}' { ... other info ... "cluster_info" : { "nodes" : { "7Uws-vL7R6WVm3ZwQA1n5A" : { "node_name" : "Kraven the Hunter", "least_available" : { "path" : "/path/to/data1", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 }, "most_available" : { "path" : "/path/to/data2", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 } } }, "shard_sizes" : { "[i][2][p]_bytes" : 0, "[i][4][p]_bytes" : 130, "[i][1][p]_bytes" : 0, "[i][3][p]_bytes" : 0, "[i][0][p]_bytes" : 130 }, "shard_paths" : { "[i][3], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=LegZLDniTVaw0Y1urv7s3g]" : "/path/to/data1/nodes/0", "[i][1], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=lAU_4vf_SKmoRdtg0ACnjQ]" : "/path/to/data1/nodes/0", "[i][2], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Aurpeuj7SeGeyPDDpCtRgg]" : "/path/to/data1/nodes/0", "[i][0], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Vgg8GlQTQ82C2j6HYBq8DQ]" : "/path/to/data1/nodes/0", "[i][4], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=t8hQlVSxQe-58fSeaXcAqg]" : "/path/to/data1/nodes/0" } } } Resolves #14405	2016-07-12 15:52:20 -06:00
Michael Sander	c493774093	Fix typo in cluster module docs This commit fixes a simple typo in the cluster module docs. Closes #19393	2016-07-12 16:32:23 -04:00
Nik Everett	8263873783	Switch search extension from push to pull Switches most search behavior extensions from push (`onModule(SearchModule)`) to pull (`implements SearchPlugin`). This effort in general gives plugin authors a much cleaner view of how to extend Elasticsearch and starts to set up portions of Elasticsearch as "the plugin API". This commit in particular does that for search-time behavior like customized suggesters, highlighters, score functions, and significance heuristics. It also switches most such customization to being done at search module construction time which is much, much easier to reason about from a testing perspective. It also helps significantly in the process of de-guice-ing Elasticsearch's startup. There are at least two major search time extensions that aren't covered in this commit that will simply have to wait for the next commit on the topic because this one has already grown large: custom aggregations and custom queries. These will likely live in the same SearchPlugin interface as well.	2016-07-11 18:49:05 -04:00
Sho Minagawa	6aa598e3fb	Fix typo on analyze.asciidoc (#19354 )	2016-07-11 15:49:39 +02:00
Clinton Gormley	982e01d463	Update network.asciidoc `network.publish_host` defaults to `network.host`, not `network.bind_host` Closes #19304	2016-07-08 17:13:10 +02:00
Jason Tedor	527980c995	Fix nesting of stopping docs This commit fixes errant nesting of the stopping docs due to using a section header instead of a chapter header at the top of the stopping docs.	2016-07-08 10:43:35 -04:00
Martijn van Groningen	ff5527f037	percolator: Forbid the usage or `range` queries with a range based on the current time If there are percolator queries containing `range` queries with ranges based on the current time then this can lead to incorrect results if the `percolate` query gets cached. These ranges are changing each time the `percolate` query gets executed and if this query gets cached then the results will be based on how the range was at the time when the `percolate` query got cached. The ExtractQueryTermsService has been renamed `QueryAnalyzer` and now only deals with analyzing the query (extracting terms and deciding if the entire query is a verified match) . The `PercolatorFieldMapper` is responsible for adding the right fields based on the analysis the `QueryAnalyzer` has performed, because this is highly dependent on the field mappings. Also the `PercolatorFieldMapper` is responsible for creating the percolate query.	2016-07-08 14:20:56 +02:00
Glen Smith	d7099f05b9	slight clarification	2016-07-07 20:46:18 -04:00
Jason Tedor	e86aa29f67	Die with dignity Today when a thread encounters a fatal unrecoverable error that threatens the stability of the JVM, Elasticsearch marches on. This includes out of memory errors, stack overflow errors and other errors that leave the JVM in a questionable state. Instead, the Elasticsearch JVM should die when these errors are encountered. This commit causes this to be the case. Relates #19272	2016-07-07 14:44:03 -04:00
Jason Tedor	c05f818160	Fix casing of "Elasticsearch" in how-to docs	2016-07-07 12:33:27 -04:00
Adrien Grand	873661df17	Fix typo.	2016-07-07 17:49:01 +02:00
Adrien Grand	f295a218a0	Add notes about sparsity.	2016-07-07 17:47:19 +02:00
Clinton Gormley	ee86a9f634	Update field-stats.asciidoc Change use of index constraints to correctly identify any indices containing relevant docs Closes #19232	2016-07-07 14:56:40 +02:00
Nik Everett	b3c015e2bb	Reindex from remote This adds a remote option to reindex that looks like ``` curl -POST 'localhost:9200/_reindex?pretty' -d'{ "source": { "remote": { "host": "http://otherhost:9200" }, "index": "target", "query": { "match": { "foo": "bar" } } }, "dest": { "index": "target" } }' ``` This reindex has all of the features of local reindex: * Using queries to filter what is copied * Retry on rejection * Throttle/rethottle The big advantage of this version is that it goes over the HTTP API which can be made backwards compatible. Some things are different: The query field is sent directly to the other node rather than parsed on the coordinating node. This should allow it to support constructs that are invalid on the coordinating node but are valid on the target node. Mostly, that means old syntax.	2016-07-05 16:13:17 -04:00
Christoph Wurm	c9da56dc80	Reword Refresh API reference (#19270 )	2016-07-05 18:37:28 +02:00
Britta Weber	f36c1b4e60	Update fielddata.asciidoc	2016-07-05 16:21:52 +02:00
Jim Ferenczi	dcf6a96725	Add doc values support to the _size field in the mapper-size plugin This change activates the doc_values on the _size field for indices created after 5.0.0-alpha4. It also adds a note in the breaking changes that explain the situation and how to get around it. Closes #18334	2016-07-05 14:47:58 +02:00
Christoph Wurm	768beea6c7	Update refresh.asciidoc Fix grammar and example	2016-07-05 13:49:25 +02:00
Christoph Wurm	d1727653dd	Update shrink-index.asciidoc Fix half-finished sentence	2016-07-05 13:34:58 +02:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitation, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the nodes data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). I It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` paramater from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Clinton Gormley	f572f8cc17	Bad asciidoc link	2016-07-04 11:02:06 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Leon Weidauer	1297a707da	non-binary gender option in term aggr. example (#19188 ) * non-binary gender option in term aggr. example * replace gender with music genre for term aggregation docs	2016-07-01 14:59:03 +02:00
javanna	62462f5d9b	[TEST] replace ResponseBodyAssertion with existing MatchAssertion We introduced a special response_body assertion to test our docs snippets. The match assertion does the same job though and can be reused and adapted where needed. ResponseBodyAssertion contains provides much better and accurate errors though, which can be now utilized in MatchAssertion so that many more REST tests can benefit from readable error messages. Each response body gets always stashed and can be retrieved for later evaluations already. Instead of providing the response body as strings that get parsed to json objects separately, then converted to maps as ResponseBodyAssertion did, we parse everything once, the json is part of the yaml test, which is supported. The only downside is that json comments cannot be used, rather yaml comments should be used (// C style vs # ). There were only two docs tests that were using comments in ingest-node.asciidoc where I went ahead and remove the comments which didn't seem that useful anyways.	2016-07-01 11:13:10 +02:00
Clinton Gormley	e1ab3f16fd	Add link to alpha4 release notes	2016-06-30 18:32:15 +02:00
jalvar08	dbf1f61c5b	Fixing typo for path.conf location (#19098 ) Changing -Ees.path.conf to -Epath.conf	2016-06-30 16:42:01 +02:00
Tanguy Leroux	5903966dc8	Merge pull request #19180 from tlrx/doc-version-number-zero-with-dbq-and-ubq [Doc] Document Update/Delete-By-Query with version number zero	2016-06-30 15:51:46 +02:00
Tanguy Leroux	dc53ce929d	Document Update/Delete-By-Query with version number zero Update-By-Query and Delete-By-Query use internal versioning to update/delete documents. But documents can have a version number equal to zero using the external versioning... making the UBQ/DBQ request fail because zero is not a valid version number and they only support internal versioning for now. Sequence numbers might help to solve this issue in the future.	2016-06-30 15:45:14 +02:00
David Pilato	535157474e	Merge branch 'pr/19144-discovery-azure-classic'	2016-06-30 15:44:28 +02:00
Clinton Gormley	b5bb27cf90	Bumped version to 5.0.0-alpha4	2016-06-30 15:20:59 +02:00
David Pilato	8a2b27076e	Merge branch 'master' into pr/19144-discovery-azure-classic # Conflicts: # plugins/discovery-azure-classic/LICENSE.txt	2016-06-30 14:46:21 +02:00
David Pilato	527a9c7f48	Deprecate discovery-azure and rename it to discovery-azure-classic As discussed at https://github.com/elastic/elasticsearch-cloud-azure/issues/91#issuecomment-229113595, we know that the current `discovery-azure` plugin only works with Azure Classic VMs / Services (which is somehow Legacy now). The proposal here is to rename `discovery-azure` to `discovery-azure-classic` in case some users are using it. And deprecate it for 5.0. Closes #19144.	2016-06-30 14:42:40 +02:00
David Pilato	8c6c00ff15	Update documentation for cat/plugins API Cat API for plugins doesn't display anymore url or jvm/site flag	2016-06-30 13:57:43 +02:00
Colin Goodheart-Smithe	0d7c11ea1d	[DOCS] put profiling performance and limitations section on same page	2016-06-30 12:28:46 +01:00
Britta Weber	57a734e641	[doc] explain avg in function_score better (#19154 ) * [doc] explain avg in function_score better	2016-06-30 11:52:53 +02:00
Nik Everett	8db43c0107	Move RestHandler registration to ActionModule and ActionPlugin `RestHandler`s are highly tied to actions so registering them in the same place makes sense. Removes the need to for plugins to check if they are in transport client mode before registering a RestHandler - `getRestHandlers` isn't called at all in transport client mode. This caused guice to throw a massive fit about the circular dependency between NodeClient and the allocation deciders. I broke the circular dependency by registering the actions map with the node client after instantiation.	2016-06-29 18:31:44 -04:00
Jason Tedor	00356edd33	Clarify time units usage in docs This commit clarifies the distinction between supported time units for durations and supported time units for durations in the docs. Relates #19159	2016-06-29 17:02:15 -04:00
Jim Ferenczi	6d2df0dc18	Fix docs example for the _id field, the field is not accessible in scripts	2016-06-29 15:25:51 +02:00
Isabel Drost-Fromm	9e155e48a5	Fix request-body search test.	2016-06-29 11:32:27 +02:00
Isabel Drost-Fromm	9f30ae3359	Merge branch 'master' into docs/add_console_to_search	2016-06-29 10:20:25 +02:00
Tanguy Leroux	4820d49120	Mustache: Add util functions to render JSON and join array values This pull request adds two util functions to the Mustache templating engine: - {{#toJson}}my_map{{/toJson}} to render a Map parameter as a JSON string - {{#join}}my_iterable{{/join}} to render any iterable (including arrays) as a comma separated list of values like `1, 2, 3`. It's also possible de change the default delimiter (comma) to something else. closes #18970	2016-06-29 09:48:58 +02:00
Nik Everett	67bfecc070	Painless: add "".replaceAll and "".replaceFirst These are useful methods in groovy that give you control over the replacements used: ``` 'the quick brown fox'.replaceAll(/[aeiou]/, m -> m.group().toUpperCase(Locale.ROOT)) ```	2016-06-28 16:39:11 -04:00
Colin Goodheart-Smithe	1aa31ec934	#19133 Added documentation for aggregation profiling Added documentation for aggregation profiling	2016-06-28 19:33:55 +01:00
Colin Goodheart-Smithe	44ee56c073	Added documentation for aggregation profiling	2016-06-28 19:33:29 +01:00
Robert Muir	6d52cec2a0	Merge pull request #19092 from rmuir/more_painless_docs cutover some docs to painless	2016-06-28 13:40:25 -04:00
Nik Everett	fa4844c3f4	Pull actions from plugins Instead of implementing onModule(ActionModule) to register actions, this has plugins implement ActionPlugin to declare actions. This is yet another step in cleaning up the plugin infrastructure. While I was in there I switched AutoCreateIndex and DestructiveOperations to be eagerly constructed which makes them easier to use when de-guice-ing the code base.	2016-06-28 08:36:24 -04:00
Clinton Gormley	fc9fa3afaf	Added release notes for 5.0.0-alpha4	2016-06-28 12:26:03 +02:00
Jason Tedor	2f638b5a23	Keep input time unit when parsing TimeValues This commit modifies TimeValue parsing to keep the input time unit. This enables round-trip parsing from instances of String to instances of TimeValue and vice-versa. With this, this commit removes support for the unit "w" representing weeks, and also removes support for fractional values of units (e.g., 0.5s). Relates #19102	2016-06-27 18:41:18 -04:00
Ryan Ernst	a07a3a9333	Add migration docs for MapperPlugin	2016-06-27 11:22:07 -07:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Robert Muir	6fc1a22977	cutover some docs to painless	2016-06-27 09:55:16 -04:00
Tanguy Leroux	453a4b9647	Fix documentation typo in How-To docs	2016-06-27 14:49:37 +02:00
Jerry Liu	1863ab95f8	fixed typo 'if' -> 'is' (#19051 )	2016-06-27 14:20:23 +02:00
Martijn van Groningen	d3cd58eb2f	Merges PR #18957 This commit fixes several NPEs caused by implicitly performing a get request for a document that exists with its _source disabled and then trying to access the source. Instead of causing an NPE the following queries will throw an exception with a "source disabled" message (similar behavior as if the document does not exist).: - GeoShape query for pre-indexed shape (throws IllegalArgumentException) - Percolate query for an existing document (throws IllegalArgumentException) A Terms query with a lookup will ignore the document if the source does not exist (same as if the document does not exist). GET and HEAD requests for the document _source will return a 404 if the source is disabled (even if the document exists).	2016-06-27 09:37:28 +02:00
Damien Alexandre	fec4a18835	Rename plainless into painless in migration doc The scripting language was wrongly named.	2016-06-26 17:41:34 +02:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins in with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing anslysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is setup. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnslysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This is means it is available at module construction time so we slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Alex Benusovich	3ca909dfea	Fix NPEs due to disabled source This commit fixes several NPEs caused by implicitly performing a get request for a document that exists with its _source disabled and then trying to access the source. Instead of causing an NPE the following queries will throw an exception with a "source disabled" message (similar behavior as if the document does not exist).: - GeoShape query for pre-indexed shape (throws IllegalArgumentException) - Percolate query for an existing document (throws IllegalArgumentException) A Terms query with a lookup will ignore the document if the source does not exist (same as if the document does not exist). GET and HEAD requests for the document _source will return a 404 if the source is disabled (even if the document exists).	2016-06-24 22:03:03 -07:00
Robert Muir	0b2baa7f63	Merge pull request #19065 from rmuir/help_painless_docs Bring painless docs closer to reality	2016-06-24 12:52:30 -04:00
Martijn van Groningen	2a196d4068	docs: update example for finding percolator where query terms couldn't be extracted successfully	2016-06-24 18:18:02 +02:00
Robert Muir	001a060c84	Bring painless docs closer to reality	2016-06-24 12:06:41 -04:00
Adrien Grand	fbad3af352	Add a how-to section to the docs. #18998 This moves the "Performance Considerations for Elasticsearch Indexing" blog post to the reference guide and adds similar recommendations for tuning disk usage and search speed.	2016-06-24 10:58:33 +02:00
Martijn van Groningen	0cae9ad30e	docs: removed obsolete information, percolator queries are not longer loaded into jvm heap memory.	2016-06-23 15:32:26 +02:00
Clinton Gormley	5a08e36f9c	Update migrate_5_0.asciidoc Updated breaking changes to state that upgraded indices still need to be reindexed, and to mention the migration plugin	2016-06-23 13:10:50 +02:00
Adrien Grand	c87ba0bfa8	Fix docs build.	2016-06-23 09:44:33 +02:00
Tanguy Leroux	04da1bda0d	Move templates out of the Search API, into lang-mustache module This commit moves template support out of the Search API to its own dedicated Search Template API in the lang-mustache module. It provides a new SearchTemplateAction that can be used to render templates before it gets delegated to the usual Search API. The current REST endpoint are identical, but the Render Search Template endpoint now uses the same Search Template API with a new "simulate" option. When this option is enabled, the Search Template API only renders template and returns immediatly, without executing the search. Closes #17906	2016-06-23 09:30:53 +02:00
David Pilato	157645fe9e	Merge pull request #18981 from elastic/doc/ingest-foreach Wrong name for values field	2016-06-22 23:14:02 +02:00
Mike McCandless	d3d524568e	merge master	2016-06-22 16:23:56 -04:00
Nik Everett	ee2a77143b	Docs: Convert aggs/misc to CONSOLE They should be more readable and tested during the build.	2016-06-22 14:52:06 -04:00
Nik Everett	02761f5fe0	Docs: migration notes for _timestamp and _ttl We aren't able to actually create an index with _timestamp enabled to test the migration, or, at least, we won't be able to after #18980 is re-merged. But the docs are still ok. Closes #19007	2016-06-22 14:43:12 -04:00
Nik Everett	6574243077	Fail to start if plugin tries broken onModule If a plugin declares `onModule(SomethingThatIsntAModule)` then refuse to start. Before this commit we just logged a warning that flies by in the console and is easy to miss. You can't miss refusing to start!	2016-06-22 12:20:52 -04:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Mike McCandless	52fcdf5e8d	merge master	2016-06-22 09:54:40 -04:00
Clinton Gormley	c7bd1a80af	Changed path.script to path.scripts in docs	2016-06-22 12:39:52 +02:00
Adrien Grand	7d63f4b8db	Fix doc build.	2016-06-22 09:34:49 +02:00
Adrien Grand	db9af54ec0	Remove `_timestamp` and `_ttl` on 5.x indices. #18980 This removes the ability to use `_timestamp` and `_ttl` on indices created on or after 5.0. Closes #18280	2016-06-22 08:35:54 +02:00
Martijn van Groningen	5dc88ffd26	docs: added note the inner hits migrate section	2016-06-22 08:29:50 +02:00
Clinton Gormley	2f2ea0c280	Improved docs explaining the index upgrade process in breaking changes	2016-06-21 18:03:19 +02:00
Clinton Gormley	70482d1e39	Update java.asciidoc Fixed asciidoc	2016-06-21 16:02:25 +02:00
Jim Ferenczi	881afcba60	Fixed tests that failed now that BM25 is the default similarity.	2016-06-21 15:42:42 +02:00
Clinton Gormley	0160d91c2c	Removed docs for precision_step - no longer used	2016-06-21 15:19:12 +02:00
Martijn van Groningen	5ad2fdaa8e	inner_hits: Don't include `_id`, `_type` and `_index` keys in search response for inner hits Closes #18091	2016-06-21 14:13:38 +02:00
Sakthipriyan Vairamani	8d5a5e500a	file is -> file name (#18994 )	2016-06-21 13:20:56 +02:00
Jim Ferenczi	423291b6bc	Change default similarity to BM25 The default similarity was set to `classic` which refers to TFIDF and has not been moved after the upgrade to Lucene 6. Though moving to BM25 could have some downside for queries that relies on coordination factor (match_query, multi_match_query) ? relates #18944	2016-06-21 11:29:36 +02:00
Mike McCandless	eecf094ac1	add indices nodes info flag to docs	2016-06-20 14:23:32 -04:00
David Pilato	cb8073e990	Wrong name for values field We wrote that the document is: ```json { "value" : ["foo", "bar", "baz"] } ``` But the processor is using a `values` field: ```json { "foreach" : { "field" : "values", "processors" : [ // ... ] } } ``` It should be `values`.	2016-06-20 18:58:41 +02:00
Nik Everett	6569d35094	Fail doc tests when any shard fails ES only sends a non-200 response all shards fail but we should fail the tests generated by docs if any of them fail. Depending on the outcome of #18978 this might be a temporary workaround.	2016-06-20 12:49:30 -04:00
Eric Sherman	0660de7472	Update snapshots.asciidoc (#18923 )	2016-06-20 16:00:59 +02:00
Adrien Grand	93415d4506	Expose MMapDirectory.preLoad(). #18880 The MMapDirectory has a switch that allows the content of files to be loaded into the filesystem cache upon opening. This commit exposes it with the new `index.store.pre_load` setting.	2016-06-20 13:42:56 +02:00
debadair	084b35c08b	Docs: Fixed code callout error.	2016-06-17 14:31:03 -07:00
Jason Tedor	d09d89f8c5	Remove only node preference This commit removes the search preference _only_node as the same functionality can be obtained by using the search preference _only_nodes. This commit also adds a test that ensures that _only_nodes will continue to support specifying node IDs. Relates #18875	2016-06-17 15:27:46 -04:00
Jason Tedor	245def80f0	Add note that thread pool settings are node-level This commit adds a note to the breaking changes docs that since commit `da74323141`, thread pool settings are no longer cluster-level settings and thus not dynamically updatable.	2016-06-17 15:19:52 -04:00
Areek Zillur	a3bd2de430	[DOCS] fix missing rollover-index link	2016-06-17 12:14:45 -04:00
Areek Zillur	9356a6090f	Merge branch 'master' into enhancement/rollover_api	2016-06-17 11:35:57 -04:00
Jim Ferenczi	fb2a48d0f0	Revert "Remove support for sorting terms aggregation by ascending count" This is delayed after alpha4 since Kibana relies on it.	2016-06-17 17:14:01 +02:00
Areek Zillur	545ffa7801	Merge branch 'master' into enhancement/rollover_api	2016-06-17 10:33:11 -04:00
Jim Ferenczi	755721953b	Remove support for sorting terms aggregation by ascending count closes #17614	2016-06-17 15:06:49 +02:00
Glen Smith	5284c5094d	grammar	2016-06-17 10:09:21 +02:00
Areek Zillur	615920df2e	update docs	2016-06-17 00:03:38 -04:00
Areek Zillur	6adffa6b7b	Merge branch 'master' into enhancement/rollover_api	2016-06-16 17:27:32 -04:00
Nik Everett	b665d8a187	Painless: Add flag support to regexes Painless: Add support for //m Painless: Add support for //s Painless: Add support for //i Painless: Add support for //u Painless: Add support for //U Painless: Add support for //l This means "literal" and is exposed for completeness sake with the java api. Painless: Add support for //c c enables Java's CANON_EQ (canonical equivalence) flag which makes unicode characters that are canonically equal match. Java's javadoc gives "a\u030A" being equal to "\u00E5". That is that the "a" code point followed by the "combining ring above" code point is equal to the "a with combining ring above" code point. Update docs and add multi-flag test Whitelist most of the Pattern class.	2016-06-16 15:00:31 -04:00
Nik Everett	8d3ef742db	Painless: =~ and ==~ operators Adds support for the find operator (=~) and the match operator (==~) to painless's regexes. Also whitelists most of the Matcher class and documents regex support in painless. The find operator (=~) returns a boolean that is the result of building a matcher on the lhs with the Pattern on the RHS and calling `find` on it. Use it like this: ``` if (ctx._source.last =~ /b/) ``` The match operator (==~) returns boolean like find but instead of calling `find` on the Matcher it calls `matches`. ``` if (ctx._source.last ==~ /[^aeiou].*[aeiou]/) ``` Finally, if you want the actual matcher you do: ``` Matcher m = /[aeiou]/.matcher(ctx._source.last) ```	2016-06-16 08:42:33 -04:00
Tanguy Leroux	3c9712794e	Merge pull request #18586 from a2lin/msearch_error_fix Adding status field in _msearch error request bodies	2016-06-16 14:31:39 +02:00
Jim Ferenczi	ad232aebbe	Set collection mode to breadth_first in the terms aggregation when the cardinality of the field is unknown or smaller than the requested size. closes #9825	2016-06-16 11:33:40 +02:00
Mike McCandless	3f221bf7cb	Add total_indexing_buffer/_in_bytes to nodes info API	2016-06-16 04:39:34 -04:00
Adrien Grand	9ffb2ff6ba	Expose half-floats. #18887 They have been implemented in https://issues.apache.org/jira/browse/LUCENE-7289. Ranges are implemented so that the accuracy loss only occurs at index time, which means that if you are searching for values between A and B, the query will match exactly all documents whose value rounded to the closest half-float point is between A and B.	2016-06-16 09:46:39 +02:00
Alexander Lin	7d42e7e716	Closes #18013 . Added status field to _msearch response bodies.	2016-06-16 00:25:17 -07:00
Jason Tedor	8caaf9ad11	Fix thread pool docs regarding dynamic settings Thread pool settings are no longer dynamically updatable since `da74323141`. This commit removes a leftover note from the thread pool module docs that incorrectly states that thread pool settings are dynamically updatable.	2016-06-15 18:25:25 -04:00
Tal Levy	a26260fb72	new ScriptProcessor for Ingest (#18193 ) add new ScriptProcessor for executing ES Scripts within pipelines	2016-06-15 14:57:18 -07:00
Areek Zillur	eb9b4437b2	update docs	2016-06-15 14:57:17 -04:00
Martijn van Groningen	a2ad5c0282	docs: fix typo Closes #18877	2016-06-15 10:56:46 +02:00
Jason Tedor	e96722d91c	Add search preference to prefer multiple nodes The search preference _prefer_node allows specifying a single node to prefer when routing a request. This functionality can be enhanced by permitting multiple nodes to be preferred. This commit replaces the search preference _prefer_node with the search preference _prefer_nodes which supplants the former by specifying a single node and otherwise adds functionality. Relates #18872	2016-06-14 21:34:24 -04:00
Nik Everett	e392e0b1df	Create get task API that falls back to the .tasks index This adds a get task API that supports GET /_tasks/${taskId} and removes that responsibility from the list tasks API. The get task API supports wait_for_complation just as the list tasks API does but doesn't support any of the list task API's filters. In exchange, it supports falling back to the .results index when the task isn't running any more. Like any good GET API it 404s when it doesn't find the task. Then we change reindex, update-by-query, and delete-by-query to persist the task result when wait_for_completion=false. The leads to the neat behavior that, once you start a reindex with wait_for_completion=false, you can fetch the result of the task by using the get task API and see the result when it has finished. Also rename the .results index to .tasks.	2016-06-14 13:37:34 -04:00
Colin Goodheart-Smithe	d7e3f9e4eb	#18854 Remove size 0 options in aggregations Remove size 0 options in aggregations	2016-06-14 15:32:42 +01:00
Vladimir Kovpak	d5f71f9e85	Updated from parameter description. (#18852 ) Not sure that my description better but origin description looks very weird, and i try to make emphasize to offset...	2016-06-14 14:33:15 +02:00
Itamar Syn-Hershko	5a9303dec2	Fixing typos (#18851 )	2016-06-14 14:22:55 +02:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Aaron Mildenstein	41810bd63c	Pluralize "index" (#18811 ) This doesn't just happen to "an index" unless you're restoring just one. It reads better this way, IMO.	2016-06-13 20:05:33 +02:00
eratio08	26aacfff72	default values for BM25 Similarity (#18778 ) assuming elasticsearch uses the lucene default values	2016-06-13 18:57:44 +02:00
Martijn van Groningen	3b96055b23	msearch: Cap the number of searches the msearch api will concurrently execute By default the number of searches msearch executes is capped by the number of nodes multiplied with the default size of the search threadpool. This default can be overwritten by using the newly added `max_concurrent_searches` parameter. Before the msearch api would concurrently execute all searches concurrently. If many large msearch requests would be executed this could lead to some searches being rejected while other searches in the msearch request would succeed. The goal of this change is to avoid this exhausting of the search TP. Closes #17926	2016-06-13 10:13:08 +02:00
Nik Everett	25fde039fd	[docs] Flow the refresh docs They were making multiple pages but that is silly. They should all be one page.	2016-06-10 10:42:56 -04:00
Jim Ferenczi	439b2a96e5	Add an index setting to limit the maximum number of slices allowed in a scroll request (default to 1024).	2016-06-10 09:43:32 +02:00
Areek Zillur	41d31541a6	Allow users to override the name for the rollover index	2016-06-09 13:43:19 -04:00
Nik Everett	a0585269be	[docs] s/lags/Flags/ Copy and paste lots an `F`.	2016-06-09 13:08:53 -04:00
Nik Everett	09cc4c449a	[docs] Pattern replace char filter now support flags	2016-06-09 12:41:20 -04:00
Areek Zillur	a9f24ea2dc	fail rollover request if rollover index already exists	2016-06-09 12:38:12 -04:00
Areek Zillur	9027e8a719	renamed simulated mode to dry_run mode	2016-06-09 11:55:10 -04:00
Britta Weber	053a615686	[TEST] wait for yellow before query execution We can remove this once https://github.com/elastic/elasticsearch/pull/18759 is in.	2016-06-09 15:11:48 +02:00
Areek Zillur	94a7978ef6	add documentation	2016-06-08 18:38:02 -04:00
Nik Everett	4b21157906	Remove setRefresh It has been replaced with `setRefreshPolicy` which has support for waiting until refresh with `setRefreshPolicy(WAIT_FOR)`. Related to #1063	2016-06-08 13:50:59 -04:00
Lee Hinman	c637fea84b	Change the default of `include_global_state` from true to false for restores This changes the default value to be false only for restore operations. Resolves #18569	2016-06-08 10:48:36 -06:00
Lee Hinman	762bbdbd0c	Revert "Change the default of `include_global_state` from true to false." This reverts commit `052a62250c`.	2016-06-07 15:07:37 -06:00
Lee Hinman	052a62250c	Change the default of `include_global_state` from true to false. Resolves #18569	2016-06-07 15:06:20 -06:00
Lee Hinman	32bd869b28	Merge remote-tracking branch 'dakrone/no-cluster-name-in-path'	2016-06-07 10:14:23 -06:00
Lee Hinman	feb244c14a	Remove cluster name from data path Previously Elasticsearch used $DATA_DIR/$CLUSTER_NAME/nodes for the path where data is stored, this commit changes that to be $DATA_DIR/nodes. On startup, if the old folder structure is detected it will be used. This behavior will be removed in Elasticsearch 6.0 Resolves #17810	2016-06-07 10:13:48 -06:00
trangvh	c0da8e4060	Fix some typos (#18746 ) * Update java-doc of SearchResponse.getProfileResults() * Fix a trivial typo in Reference document	2016-06-07 16:41:39 +02:00
Jim Ferenczi	b9030bf6fe	Add the ability to partition a scroll in multiple slices. API: ``` curl -XGET 'localhost:9200/twitter/tweet/_search?scroll=1m' -d '{ "slice": { "field": "_uid", <1> "id": 0, <2> "max": 10 <3> }, "query": { "match" : { "title" : "elasticsearch" } } } ``` <1> (optional) The field name used to do the slicing (_uid by default) <2> The id of the slice By default the splitting is done on the shards first and then locally on each shard using the _uid field with the following formula: `slice(doc) = floorMod(hashCode(doc._uid), max)` For instance if the number of shards is equal to 2 and the user requested 4 slices then the slices 0 and 2 are assigned to the first shard and the slices 1 and 3 are assigned to the second shard. Each scroll is independent and can be processed in parallel like any scroll request. Closes #13494	2016-06-07 16:21:53 +02:00
Christoph Wurm	d71894a226	Update ingest-node.asciidoc Fixed forgotten rename from `match_formats` to `formats` in documentation (changed in `dd2184ab25`)	2016-06-07 15:47:38 +02:00
Jason Tedor	75d3b13790	Merge pull request #18756 from jasontedor/on-out-of-memory-error Bootstrap check for OnOutOfMemoryError and seccomp	2016-06-07 09:26:57 -04:00
Simon Willnauer	b2c4c323e1	Allow `_shrink` to N shards if source shards is a multiple of N (#18699 ) Today we allow to shrink to 1 shard but that might not be possible due to too many document or a single shard doesn't meet the requirements for the index. The logic can be expanded to N shards if the source index shards is a multiple of N. This guarantees that there are not hotspots created due to different number of shards being shrunk into one.	2016-06-07 10:06:41 +02:00
Jason Tedor	e94408c0d2	Bootstrap check for OnError and seccomp This commit adds a bootstrap check for the JVM option OnError being in use and seccomp being enabled. These two options are incompatible because OnError allows the user to specify an arbitrary program to fork when the JVM encounters an fatal error, and seccomp enables system call filters that prevents forking.	2016-06-06 22:18:44 -04:00
Jason Tedor	da74323141	Register thread pool settings This commit refactors the handling of thread pool settings so that the individual settings can be registered rather than registering the top level group. With this refactoring, individual plugins must now register their own settings for custom thread pools that they need, but a dedicated API is provided for this in the thread pool module. This commit also renames the prefix on the thread pool settings from "threadpool" to "thread_pool". This enables a hard break on the settings so that: - some of the settings can be given more sensible names (e.g., the max number of threads in a scaling thread pool is now named "max" instead of "size") - change the soft limit on the number of threads in the bulk and indexing thread pools to a hard limit - the settings names for custom plugins for thread pools can be prefixed (e.g., "xpack.watcher.thread_pool.size") - remove dynamic thread pool settings Relates #18674	2016-06-06 22:09:12 -04:00
Jason Tedor	9695caa3fb	Bootstrap check for OnOutOfMemoryError and seccomp This commit adds a bootstrap check for the JVM option OnOutOfMemoryError being in use and seccomp being enabled. These two options are incompatible because OnOutOfMemoryError allows the user to specify an arbitrary program to fork when the JVM encounters an OutOfMemoryError, and seccomp enables system call filters that prevents forking. This commit also adds support for bootstrap checks that are always enforced, whether or not Elasticsearch is in production mode.	2016-06-06 17:31:42 -04:00
Nik Everett	d8056c8213	Add support for waiting until a refresh occurs This adds support for setting the refresh request parameter to `wait_for` in the `index`, `delete`, `update`, and `bulk` APIs. When `refresh=wait_for` is set those APIs will not return until their results have been made visible to search by a refresh. Also it adds a `forced_refresh` field to the response of `index`, `delete`, `update`, and to each item in a bulk response. This will be true for requests with `?refresh` or `?refresh=true` and will be true for some requests (see below) with `refresh=wait_for` but ought to otherwise always be false. `refresh=wait_for` is implemented as a list of `Tuple<Translog.Location, Consumer<Boolean>>`s in the new `RefreshListeners` class that is managed by `IndexShard`. The dynamic, index scoped `index.max_refresh_listeners` setting controls a maximum number of listeners allowed in any shard. If more than that many listeners accumulate in the engine then a refresh will be forced, the thread that adds the listener will be blocked until the refresh completes, and then the listener will be called with a `forcedRefresh` flag so it knows that it was the "straw that broke the camel's back". These listeners are only used by `refresh=wait_for` and that flag manifests itself as `forced_refresh` being `true` in the response. About half of this change comes from piping async-ness down to the appropriate layer in a way that is compatible with the ongoing with with sequence ids. Closes #1063 You can look up the winding story of all the commits here: https://github.com/elastic/elasticsearch/pull/17986 Here are the commit messages in case they are intersting to you: commit 59a753b89109828d2b8f0de05cb104fc663cf95e Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 10:18:23 2016 -0400 Replace a method reference with implementing an interface Saves a single allocation and forces more commonality between the WriteResults. commit 31f7861a85b457fb7378a6f27fa0a0c171538f68 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 10:07:55 2016 -0400 Revert "Replace static method that takes consumer with delegate class that takes an interface" This reverts commit 777e23a6592c75db0081a53458cc760f4db69507. commit 777e23a6592c75db0081a53458cc760f4db69507 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 09:29:35 2016 -0400 Replace static method that takes consumer with delegate class that takes an interface Same number of allocations, much less code duplication. commit 9b49a480ca9587a0a16ebe941662849f38289644 Author: Nik Everett <nik9000@gmail.com> Date: Mon Jun 6 08:25:38 2016 -0400 Patch from boaz commit c2bc36524fda119fd0514415127e8901d94409c8 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:46:27 2016 -0400 Fix docs After updating to master we are actually testing them. commit 03975ac056e44954eb0a371149d410dcf303e212 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:20:11 2016 -0400 Cleanup after merge from master commit 9c9a1deb002c5bebb2a997c89fa12b3d7978e02e Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:09:14 2016 -0400 Breaking changes notes commit 1c3e64ae06c07a85f7af80534fab88279adb30b4 Merge: 9e63ad6 `f67e580` Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 14:00:05 2016 -0400 Merge branch 'master' into block_until_refresh2 commit 9e63ad6de52d0b28f0b6d7203721baf1ebf6f56b Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 13:21:27 2016 -0400 Test for TransportWriteAction commit 522ecb59d39b3c9e8df0d3b8df34b9e7aeaf0ce9 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:30:18 2016 -0400 Document deprecation commit 0cd67b947f58867e704a1f0e66928a6fb5a11f11 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:26:23 2016 -0400 Deprecate setRefresh(boolean) Users should use `setRefresh(RefreshPolicy)` instead. commit aeb1be3f2c501990b33fb1f8230d496035f498ef Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:12:27 2016 -0400 Remove checkstyle suppression It is fixed commit 00d09a9caa638b6f90f4896b5502dd98d8fad56e Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:08:28 2016 -0400 Improve comment commit 788164b898a6ee2878a273961230122b7386c3c9 Author: Nik Everett <nik9000@gmail.com> Date: Thu Jun 2 10:01:01 2016 -0400 S/ReplicatedWriteResponse/WriteResponse/ Now it lines up with WriteRequest. commit b74cf3fe778352b140355afcaa08d3d4412d749d Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 18:27:52 2016 -0400 Preserve `?refresh` behavior `?refresh` means the same things as `?refresh=true`. commit 30f972bdaeaaa0de6fe67746cdb8628aa86f5a8c Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 17:39:05 2016 -0400 Handle hanging documents If a document is added to the index during a refresh we weren't properly firing its refresh listener. This happened because the way we detect whether a refresh makes something visible or not is imperfect. It is ok because it always errs on the side of thinking that something isn't yet visible. So when a document arrives during a refresh the refresh listeners won't think it made it into a refresh when, often, it does. The way we work around this is by telling Elasticsearch that it ought to trigger a refresh if there are any pending refresh listeners even if there aren't pending documents to update. Lucene short circuits the refresh so it doesn't take that much effort, but the refresh listeners still get the signal that a refresh has come in and they still pick up the change and notify the listener. This means that the time that a listener can wait is actually slightly longer than the refresh interval. commit d523b5702b60c7ba309fb0dcf3cd3a4798f11960 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:34:01 2016 -0400 Explain Integer.MAX_VALUE commit 4ffb7c0e954343cc1c04b3d7be2ebad66d3a016b Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:27:39 2016 -0400 Fire all refresh listeners in a single thread Rather than queueing a runnable each. commit 19606ec3bbe612095df45eba734c5b7eb2709c01 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 14:09:52 2016 -0400 Assert translog ordering commit 6bb4e5c75e850f4a42518f06fbc955f7ec76d245 Author: Nik Everett <nik9000@gmail.com> Date: Wed Jun 1 13:17:44 2016 -0400 Support null RefreshListeners in InternalEngine Just skip using it. commit 74be1480d6e44af2b354ff9ea47c234d4870b6c2 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 18:02:03 2016 -0400 Move funny ShardInfo hack for bulk into bulk This should make it easier to understand because it is closer to where it matters.... commit 2b771f8dabd488e056cfdc9989608d18264ddfb0 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:39:46 2016 -0400 Pull listener out into an inner class with javadoc and stuff commit 058481ad72019c0492b03a7a4ac32a48673697d3 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:33:42 2016 -0400 Fix javadoc links commit d2123b1cabf29bce8ff561d4a4c1c1d5b42bccad Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:28:09 2016 -0400 Make more stuff final commit 8453fc4f7850f6a02fb5971c17a942a3e3fd9f7b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 17:26:48 2016 -0400 Javadoc commit fb16d2fc7016c1e8e1621d481e8781c7ef43326c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 16:14:48 2016 -0400 Rewrite refresh docs commit 5797d1b1c4d233c0db918c0d08c21731ddccd05e Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 15:02:34 2016 -0400 Fix forced_refresh flag It wasn't being set. commit 43ce50a1de250a9e073a2ca6cbf55c1b4c74b11b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 14:02:56 2016 -0400 Delay translog sync and flush until after refresh The sync might have occurred for us during the refresh so we have less work to do. Maybe. commit bb2739202e084703baf02cfa58f09517598cf14e Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 13:08:08 2016 -0400 Remove duplication in WritePrimaryResult and WriteReplicaResult commit 2f579f89b4867a880396f2e7fcffc508449ff2de Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 12:19:05 2016 -0400 Clean up registration of RefreshListeners commit 87ab6e60ca5ba945bf0fba84784b2bbe53506abf Author: Nik Everett <nik9000@gmail.com> Date: Tue May 31 11:28:30 2016 -0400 Shorten lock time in RefreshListeners Also use null to represent no listeners rather than an empty list. This saves allocating a new ArrayList every refresh cycle on every index. commit 0d49d9c5720dadfb67da3fa760397bf6d874601c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 10:46:18 2016 -0400 Flip relationship between RefreshListeners and Engine Now RefreshListeners comes to Engine from EngineConfig. commit b2704b8a39382953f8f91a9743e894ee289f7514 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:37:58 2016 -0400 Remove unused imports Maybe I added them? commit 04343a22647f19304d9dc716b3fac9b183227f63 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:37:52 2016 -0400 Javadoc commit da1e765678890a02d61d8a29aa433274beb5e00c Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 09:26:35 2016 -0400 Reply with non-null Also move the fsync and flush to before the refresh listener stuff. commit 5d8eecd0d904b497844b4c81c46477bd6178ed3a Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 08:58:47 2016 -0400 Remove funky synchronization in AsyncReplicaAction commit 1ec71eea0f4e1228ae1497d982307be818ef4b65 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 08:01:14 2016 -0400 s/LinkedTransferQueue/ArrayList/ commit 7da36a4ceed2ccf7955138c3b005237fa41efcb4 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 07:46:38 2016 -0400 More cleanup for RefreshListeners commit 957e9b77007c32ee75dde152c6622bab065d5993 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 24 07:34:13 2016 -0400 /Consumer<Runnable>/Executor/ commit 4d8bf5d4a70dcc56150c8d8d14165cd23d308b3c Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 22:20:42 2016 -0400 explain commit 15d948a348089bb2937eec5ac4e96f3ec67dbe32 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 22:17:59 2016 -0400 Better.... commit dc28951d02973fc03b4d51913b5f96de14b75607 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 21:09:20 2016 -0400 Javadocs and compromises commit 8eebaa89c0a1ee74982fbe0d56d1485ca2ae09db Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 20:52:49 2016 -0400 Take boaz's changes to their logic conclusion and unbreak important stuff like bulk commit 7056b96ea412f275005b93e3570bcff895859ed5 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 15:49:32 2016 -0400 Patch from boaz commit 87be7eaed09a274cc6a99d1a3da81d2d7bf9dd64 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 23 15:49:13 2016 -0400 Revert "Move async parts of replica operation outside of the lock" This reverts commit 13807ad10b6f5ecd39f98c9f20874f9f352c5bc2. commit 13807ad10b6f5ecd39f98c9f20874f9f352c5bc2 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 22:53:15 2016 -0400 Move async parts of replica operation outside of the lock commit b8cadcef565908b276484f7f5f988fd58b38d8b6 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 16:17:20 2016 -0400 Docs commit 91149e0580233bf79c2273b419fe9374ca746648 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 15:17:40 2016 -0400 Finally! commit 1ff50c2faf56665d221f00a18d9ac88745904bf5 Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 15:01:53 2016 -0400 Remove Translog#lastWriteLocation I wasn't being careful enough with locks so it wasn't right anyway. Instead this builds a synthetic Tranlog.Location when you call getWriteLocation with much more relaxed equality guarantees. Rather than being equal to the last Translog.Location returned it is simply guaranteed to be greater than the last translog returned and less than the next. commit 55596ea68b5484490c3637fbad0d95564236478b Author: Nik Everett <nik9000@gmail.com> Date: Fri May 20 14:40:06 2016 -0400 Remove listener from shardOperationOnPrimary Create instead asyncShardOperationOnPrimary which is called after all of the replica operations are started to handle any async operations. commit 3322e26211bf681b37132274ee158ae330afc28b Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 17:20:02 2016 -0400 Increase default maximum number of listeners to 1000 commit 88171a8322a424e624d48960fb4c98dd43e4d671 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 16:40:57 2016 -0400 Rename test commit 179c27c4f829f2c6ded65967652cf85adaf2ae52 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 16:35:27 2016 -0400 Move refresh listeners into their own class They still live at the IndexShard level but they live on their own in RefreshListeners which interacts with IndexShard using a couple of callbacks and a registration method. This lets us test the listeners without standing up an entire IndexShard. We still test the listeners against an InternalEngine, because the interplay between InternalEngine, Translog, and RefreshListeners is complex and important to get right. commit d8926d5fc1d24b4da8ccff7e0f0907b98c583c41 Author: Nik Everett <nik9000@gmail.com> Date: Tue May 17 11:02:38 2016 -0400 Move refresh listeners into IndexShard commit df91cde398eb720143a85a8c6fa19bdc3a74e07d Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 16:01:03 2016 -0400 unused import commit 066da45b08148b266e4173166662fc1b3f66ed53 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 15:54:11 2016 -0400 Remove RefreshListener interface Just pass a Translog.Location and a Consumer<Boolean> when registering. commit b971d6d3301c7522b2e7eb90d5d8dd96a77fa625 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 14:41:06 2016 -0400 Docs for setForcedRefresh commit 6c43be821eaf61141d3ec520f988aad3a96a3941 Author: Nik Everett <nik9000@gmail.com> Date: Mon May 16 14:34:39 2016 -0400 Rename refresh setter and getter commit e61b7391f91263a4c4d6107bfbc2a828bbcc805c Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 22:48:09 2016 -0400 Trigger listeners even when there is no refresh Each refresh gives us an opportunity to pick up any listeners we may have left behind. commit 0c9b0477085c021f503db775640d25668e02f635 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 20:30:06 2016 -0400 REST commit 8250343240de7e63118c663a230a7a314807a754 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 19:34:22 2016 -0400 Switch to estimated count We don't need a linear time count of the number of listeners - a volatile variable is good enough to guess. It probably undercounts more than it overcounts but it isn't a huge problem. commit bd531167fe54f1bde6f6d4ddb0a8de5a7bcc18a2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 18:21:02 2016 -0400 Don't try and set forced refresh on bulk items without a response NullPointerExceptions are bad. If the entire request fails then the user has worse problems then "did these force a refresh". commit bcfded11515af5e0b3c3e36f3c2f73f5cd26512e Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 18:14:20 2016 -0400 Replace LinkedList and synchronized with LinkedTransferQueue commit 8a80cc70a76375a7593745884cb987535b37ca80 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 17:38:24 2016 -0400 Support for update commit 1f36966742f851b7328015151ef6fc8f95299af2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 15:46:06 2016 -0400 Cleanup translog tests commit 8d121bf35eb265b8a0aee9710afeb1b054a113d4 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 15:40:53 2016 -0400 Cleanup listener implementation Much more testing too! commit 2058f4a808762c4588309f21b13b677245832f2c Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:45:55 2016 -0400 Pass back information about whether we refreshed commit e445cb0cb91ebdbcfdbf566696edb2bf1c84a882 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:03:31 2016 -0400 Javadoc commit 611cbeeaeb458f4b428bfc43a1ee6652adf4baff Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:01:40 2016 -0400 Move ReplicationResponse now it is in the same package as its request commit 9919758b644fd73895fb88cd6a4909a8387eb2e2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 11:00:14 2016 -0400 Oh boy that wasn't working commit 247cb483c4459dea8e95e0e3bd2e4bf8d452c598 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 10:29:37 2016 -0400 Basic block_until_refresh exposed to java client and basic "is it plugged in" style tests. commit 46c855c9971cb2b748206d2afa6a2d88724be3ba Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 10:11:10 2016 -0400 Move test to own class commit a5ffd892d0a352ae7e9757f2640fc2a1fa656bf2 Author: Nik Everett <nik9000@gmail.com> Date: Mon Apr 25 07:44:25 2016 -0400 WIP commit 213bebb6ece11b85d17e44af9a54fc2e5e332d39 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 21:35:52 2016 -0400 Add refresh listeners commit a2bc7f30e6d4857a1224ef5a89909b36c8f33731 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 21:11:55 2016 -0400 Return last written location from refresh commit 85033a87551da89f36a23d4dfd5016db218e08ee Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 20:28:21 2016 -0400 Never reply to replica actions while you have the operation lock This last thing was causing periodic test failures because we were replying while we had the operation lock. Now, we probably could get away with that in most cases but the tests don't like it and it isn't a good idea to do network io while you have a lock anyway. So this prevents it. commit 1f25cf35e796835b3827b8a4110e09e5de61784c Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 19:56:18 2016 -0400 Cleanup commit 52c5f7c3f04710901f503334239a611c0e21c85a Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 19:33:00 2016 -0400 Add a listener to shard operations commit 5b142dc331214c8eef90587144f4b3f959f9eced Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 18:03:52 2016 -0400 Cleanup commit 3d22b2d7ceb473db339259452a7c4f117ce86069 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:59:55 2016 -0400 Push the listener into shardOperationOnPrimary commit 34b378943b8185451acf6350f661c0ad33b5836d Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:48:47 2016 -0400 Doc commit b42b8da968d42cc7414020c7b199606a5dcce50a Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:45:40 2016 -0400 Don't finish early if the primary finishes early We use a "fake" pending shard that we resolve when the replicas have all started. commit 0fc045b56e1e02a48c30383ac50a281d5af7e0b6 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 17:30:06 2016 -0400 Make performOnPrimary asyncS Instead of returning Tuple<Response, ReplicaRequest> it returns ReplicaRequest and takes a ActionListener<Response> as an argument. We call the listener immediately to preserve backwards compatibility for now. commit 80119b9a26ede96a865af45904c3ac69d5b19b59 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:51:53 2016 -0400 Factor out common code in shardOperationOnPrimary commit 0642083676702618f900fa842c08802a04c1a53e Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:32:29 2016 -0400 Factor out common code from shardOperationOnReplica commit 8bdc415fedaaa9f2d0c555590a13ec4699a7c3f7 Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:23:28 2016 -0400 Create ReplicatedMutationRequest Superclass for index, delete, and bulkShard requests. commit 0f8fa846a2822c4293df32fed18c9b99660b39ff Author: Nik Everett <nik9000@gmail.com> Date: Fri Apr 22 16:10:30 2016 -0400 Create TransportReplicatedMutationAction It is the superclass of replication actions that mutate data: index, delete, and shardBulk. shardFlush and shardRefresh are replication actions but they do not extend TransportReplicatedMutationAction because they don't change the data, only shuffle it around.	2016-06-06 11:37:53 -04:00
Nicholas Knize	371c73e140	refactor matrix agg documentation from modules to main agg section	2016-06-06 07:39:00 -05:00
Tanguy Leroux	a1172d816c	Implement ctx.op = "delete" on _update_by_query and _reindex closes #18043	2016-06-06 11:11:29 +02:00
Britta Weber	d55f719f8a	[TEST] wait for yellow after setup doc tests (#18726 ) * [TEST] wait for yellow after setup doc tests We have many places in the doc where we expect and index to be yellow before we execute a query. Therefore we have to always wait for yellow after setup.	2016-06-03 16:37:28 +02:00
Clinton Gormley	e6aaaf11ed	Reworked docs for index-shrink API (#18705 )	2016-06-03 09:50:51 +02:00
Simon Willnauer	22dfc41521	Only filter intial recovery (post API) when shrinking an index (#18661 ) Today we use `index.routing.allocation.include._id` to filter the allocation for the shrink target index. That has the sideeffect that the user has to delete that setting / change it once the primary has been recovered (shrink is done) This PR adds a dedicated filter that can only be set internally that only filters allocation for unassigned shards.	2016-06-02 15:38:51 +02:00
Clinton Gormley	56d86ef875	Add upgrade-not-supported warning to alpha release notes	2016-06-02 10:18:16 +02:00
Nicholas Knize	90b8f5d0d8	Adding MultiValuesSource support classes and documentation to matrix stats agg module	2016-06-01 16:39:42 -05:00
Jason Tedor	fb893a993f	Add note regarding Windows service heap size This commit adds a note regarding the difference in configuration for the Windows service heap size from any other installation of Elasticsearch. Relates #18606	2016-06-01 16:31:16 -04:00
Jason Tedor	8e2a7d0fe1	Rename boostrap.mlockall to bootstrap.memory_lock The setting bootstrap.mlockall is useful on both POSIX-like systems (POSIX mlockall) and Windows (Win32 VirtualLock). But mlockall is really a POSIX only thing so the name should not be tied POSIX. This commit renames the setting to "bootstrap.memory_lock". Relates #18669	2016-06-01 16:25:51 -04:00
Clinton Gormley	a98856663b	Update reindex.asciidoc (#18687 ) Potentially fixing some copy/paste errors # Conflicts: # docs/reference/docs/reindex.asciidoc	2016-06-01 20:16:12 +02:00
Martijn van Groningen	766789b0f0	ingest: added `ignore_failure` option to all processors If this option is enabled on a processor it silently catches any processor related failure and continues executing the rest of the pipeline. Closes #18493	2016-06-01 10:29:12 +02:00
Robert Muir	0373c62a57	Merge pull request #18658 from rmuir/jodaTime improve date api for expressions/painless fields	2016-05-31 12:33:22 -04:00
Michael McCandless	8f0109c2a5	Merge pull request #18651 from mikemccand/remove_iw_max_memory_stat Remove index_writer_max_memory stat from segment stats	2016-05-31 09:58:55 -04:00
Christoph Wurm	de7688ae5d	Update configuration.asciidoc Correct configuration of `path.conf`	2016-05-31 15:43:16 +02:00
Robert Muir	2d1eb89aef	improve date api for expressions/painless fields	2016-05-31 09:32:33 -04:00
Mike McCandless	5c525e6606	Remove index_writer_max_memory stat from segment stats	2016-05-31 06:29:29 -04:00
Clinton Gormley	85bf48b4c1	Added release notes for 5.0.0-alpha3	2016-05-31 11:51:10 +02:00
Clinton Gormley	589b6c63c6	Include shrink-index.asciidoc	2016-05-31 11:50:50 +02:00
Simon Willnauer	502a775a7c	Add primitive to shrink an index into a single shard (#18270 ) This adds a low level primitive operations to shrink an existing index into a new index with a single shard. This primitive expects all shards of the source index to allocated on a single node. Once the target index is initializing on the shrink node it takes a snapshot of the source index shards and copies all files into the target indices data folder. An [optimization](https://issues.apache.org/jira/browse/LUCENE-7300) coming in Lucene 6.1 will also allow for optional constant time copy if hard-links are supported by the filesystem. All mappings are merged into the new indexes metadata once the snapshots have been taken on the merge node. To shrink an existing index all shards must be moved to a single node (one instance of each shard) and the index must be read-only: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_settings' -d '{ "settings" : { "index.routing.allocation.require._name" : "shrink_node_name", "index.blocks.write" : true } } ``` once all shards are started on the shrink node. the new index can be created via: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_shrink/logs_single_shard' -d '{ "settings" : { "index.codec" : "best_compression", "index.number_of_replicas" : 1 } }' ``` This API will perform all needed check before the new index is created and selects the shrink node based on the allocation of the source index. This call returns immediately, to monitor shrink progress the recovery API should be used since all copy operations are reflected in the recovery API with byte copy progress etc. The shrink operation does not modify the source index, if a shrink operation should be canceled or if the shrink failed, the target index can simply be deleted and all resources are released.	2016-05-31 10:41:44 +02:00
Jason Tedor	37a3588c37	Fix min. master nodes links in boostrap check docs This commit fixes two links to the minimum master nodes configuration section of the docs in the bootstrap check docs.	2016-05-29 08:01:16 -04:00
Clinton Gormley	e35bd11581	Update bootstrap-checks.asciidoc Fixed asciidoc	2016-05-29 11:56:02 +02:00
Clinton Gormley	b2f2d38ebb	Update configuring.asciidoc Minor asciidoc changes	2016-05-29 11:50:36 +02:00
Jason Tedor	46162a40e7	Additional bootstrap check doc fixes This commit fixes some additional poorly-formatted internal and external links in the bootstrap check docs.	2016-05-27 10:58:13 -04:00
Jason Tedor	123e40726e	Fix bootstrap check docs This commit fixes some incorrect links in the bootstrap check docs.	2016-05-27 09:19:49 -04:00
Martijn van Groningen	0e9f3addd2	Nested inner hits shouldn't use relative paths Like on other places in the query dsl the full field name should be used. Before this change this wasn't the case for nested inner hits when source filtering was used. Highlighting has a workaround, which is now removed as the source of nested inner hits can only be refered by the full name. Closes #16653	2016-05-27 13:41:45 +02:00
Jason Tedor	82713bab6d	Add bootstrap check docs This commit adds documentation for the bootstrap checks and provides either links or inline guidance for setting the necessary settings to pass the bootstrap checks. Relates #18605	2016-05-27 06:03:35 -04:00
Boaz Leskes	318a4e3ef6	Introduce dedicated master nodes in testing infrastructure (#18514 ) This PR changes the InternalTestCluster to support dedicated master nodes. The creation of dedicated master nodes can be controlled using a new `supportsMasterNodes` parameter to the ClusterScope annotation. If set to true (the default), dedicated master nodes will randomly be used. If set to false, no master nodes will be created and data nodes will also be allowed to become masters. If active, test runs will either have 1 or 3 masternodes	2016-05-27 08:44:20 +02:00
Jason Tedor	7c3715e009	Remove a leftover "es." prefix from docs This commit removes a leftover usage of the "es." in the CLI syntax in the zip/targz docs.	2016-05-26 15:03:42 -04:00
Jason Tedor	d23db39445	Merge pull request #18594 from jasontedor/plugins-cleanup Plugins cleanup	2016-05-26 14:46:09 -04:00
Jason Tedor	d29844e597	Remove custom plugins path This commit removes the ability to specify a custom plugins path. Instead, the plugins path will always be a subdirectory called "plugins" off of the home directory.	2016-05-26 10:16:25 -04:00
Mike McCandless	dbe0b42140	Document the hard limits from #15585 on index and bulk thread pool sizes	2016-05-26 09:40:22 -04:00
Tal Levy	edfbdf2748	add ability to specify multiple grok patterns (#18074 ) - now you can specify a list of grok patterns to match your field with and the first one to successfully match wins. - only non-null captures will be inserted into your matched document. Fixes #17903.	2016-05-25 12:20:39 -07:00
Jim Ferenczi	6d62f33702	Make doc_values accessible for _type `doc_values` for _type field are created but any attempt to load them throws an IAE. This PR re-enables `doc_values` loading for _type, it also enables `fielddata` loading for indices created between 2.0 and 2.1 since doc_values were disabled during that period. It also restores the old docs that gives example on how to sort or aggregate on _type field.	2016-05-25 18:56:13 +02:00
Nik Everett	2a2730405e	Add wait for yellow to doc snippet so it runs cleanly Found by http://build-us-00.elastic.co/job/es_core_master_window-2008/3866/console	2016-05-24 12:15:52 -04:00
Nik Everett	a93f578bf6	Move parsing of allocation commands into REST Port them to the ObjectParser. Don't let plugins register custom allocation commands	2016-05-24 11:59:05 -04:00
Nik Everett	72eb621bce	Docs: Replace [source,json] with [source,js] The syntax highlighter only supports [source,js]. Also adds a check to the rest test generator that runs during the build that'll fail the build if it sees `[source,json]`.	2016-05-24 11:17:27 -04:00
Tanguy Leroux	1f011f9dea	Remove Delete-By-Query plugin closes #18469	2016-05-24 13:28:20 +02:00
Isabel Drost-Fromm	d76f87155a	Merge pull request #18544 from MaineC/docs/add_autosense_to_query_dsl Add back doc execution to query dsl.	2016-05-24 12:47:21 +02:00
Isabel Drost-Fromm	4c02e97bcd	Add back doc execution to query dsl. Relates to #18211 This reverts commit `20aafb1196`.	2016-05-24 12:43:41 +02:00
Isabel Drost-Fromm	ea3320e171	Merge pull request #18424 from MaineC/docs/add_console_to_highlighting Docs/add console to highlighting	2016-05-24 12:14:36 +02:00
Martijn van Groningen	27cc2fe4dc	Moved the percolator from core to its own module Significant changes: * AbstractQueryTestCase has moved to the test framework module, in order for query builder tests in modules and plugins * Added support to AbstractQueryTestCase to register plugins * Lift the restriction that only one percolator could be added per index. This validation existed in MapperService, but because the percolator moved to a module it could no longer exist there. Instead of bringing it back it was removed. This validation existed since the percolator cache only supported one percolator query per document, since the percolator cache has been removed this restriction could removed as well. * While moving percolator tests to the new module, also removed a couple of tests for the deprecated percolate and mpercolate api. These APIs are now sugar APIs for bwc and rediect to the searvh and msearvh APIs. Some tests were still testing as if percolate and mpercolate API did the percolation, but this no longer the case and these tests could be removed.	2016-05-24 11:01:57 +02:00
Lee Hinman	bfce901edf	Merge remote-tracking branch 'dakrone/explain-add-fetch-in-progress'	2016-05-23 09:43:16 -06:00
Lee Hinman	8040ed0c16	Add whether the shard state fetch is pending to the allocation explain API If the shard state fetch is still pending, this will now return a message like: ```json { "shard" : { "index" : "i", "index_uuid" : "de1W1374T4qgvUP4a9Ieaw", "id" : 0, "primary" : false }, "assigned" : false, "shard_state_fetch_pending": true, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-04-26T16:34:53.227Z" }, "allocation_delay_ms" : 0, "remaining_delay_ms" : 0, "nodes" : { "z-CbkiELT-SoWT91HIszLA" : { "node_name" : "Brain Cell", "node_attributes" : { "testattr" : "test" }, "store" : { "shard_copy" : "NONE" }, "final_decision" : "NO", "final_explanation" : "the shard state fetch is pending", "weight" : 5.0, "decisions" : [ ] } } } ``` Adds the `shard_state_fetch_pending` field and uses the state to influence the final decision and final explanation. Relates to #17372	2016-05-23 09:42:57 -06:00
Isabel Drost-Fromm	1e91123555	Merge branch 'master' into docs/add_console_to_search	2016-05-23 15:27:19 +02:00
Adrien Grand	31e4c16ec3	Merge pull request #18509 from terradatum/epoch Support full range of Java Long for epoch DateTime	2016-05-23 12:27:38 +02:00
Martijn van Groningen	e714a04c67	docs: fix typo	2016-05-22 22:50:31 +02:00
Martijn van Groningen	c1a0929123	percolator: Add support dor MatchNoDocsQuery in query terms extract service Before the query extraction would have been aborted and the percolator query would be marked as unknown. This resulted in a situation that these queries always need to be evaluated by the memory index at search time. By adding support for this query many more percolator query candidate hits can skip the expensive memory index verification step. For example the `match` query parser returns a MatchNoDocsQuery if the query terms are removed by text analysis (lets query text only contained stop words).	2016-05-22 22:42:19 +02:00
G. Richard Bellamy	cf54903580	Support full range of Java Long for epoch DateTime Remove the arbitrary limit on epoch_millis and epoch_seconds of 13 and 10 characters, respectively. Instead allow any character combination that can be converted to a Java Long. Update the docs to reflect this change.	2016-05-22 13:08:20 -07:00
Simon Willnauer	35e705877b	Limit retries of failed allocations per index (#18467 ) Today if a shard fails during initialization phase due to misconfiguration, broken disks, missing analyzers, not installed plugins etc. elasticsaerch keeps on trying to initialize or rather allocate that shard. Yet, in the worst case scenario this ends in an endless allocation loop. To prevent this loop and all it's sideeffects like spamming log files over and over again this commit adds an allocation decider that stops allocating a shard that failed more than N times in a row to allocate. The number or retries can be configured via `index.allocation.max_retry` and it's default is set to `5`. Once the setting is updated shards with less failures than the number set per index will be allowed to allocate again. Internally we maintain a counter on the UnassignedInfo that is reset to `0` once the shards has been started. Relates to #18417	2016-05-20 20:37:45 +02:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 for it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime, Today these changes are visible in near realtime. So updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive. Especially when there are many percolator queries then these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to how the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00

... 3 4 5 6 7 ...

3036 Commits