OpenSearch

Commit Graph

Author	SHA1	Message	Date
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
mzbeck	63a0436764	Update update-settings.asciidoc (#31378 )	2018-07-26 17:16:53 +02:00
Sohaib Iftikhar	c55d11f8b5	rest-high-level: added get cluster settings (#31706 ) Relates to #27205	2018-07-02 13:25:17 -04:00
Nik Everett	73549281e8	Docs: Use the default distribution to test docs (#31251 ) This switches the docs tests from the `oss-zip` distribution to the `zip` distribution so they have xpack installed and configured with the default basic license. The goal is to be able to merge the `x-pack/docs` directory into the `docs` directory, marking the x-pack docs with some kind of marker. This is the first step in that process. This also enables `-Dtests.distribution` support for the `docs` directory so you can run the tests against the `oss-zip` distribution with something like ``` ./gradlew -p docs check -Dtests.distribution=oss-zip ``` We can set up Jenkins to run both. Relates to #30665	2018-06-18 12:06:42 -04:00
Van0SS	d5e8a5cd69	REST high-level client: add Cluster Health API (#29331 ) Relates to #27205	2018-06-12 13:34:06 +02:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Nik Everett	d20e8e2bb4	Docs: Use task_id in examples of tasks (#30436 ) We had been using `task_id:1` or `taskId:1` because it is parses as a valid task identifier but the `:1` part is confusing. This replaces those examples with `task_id` which matches the response from the list tasks API. Closes #28314	2018-05-08 14:23:32 -04:00
Chris Earle	725a5af2c6	_cluster/state should always return cluster_uuid (#30143 ) Currently, the only way to get the REST response for the `/_cluster/state` call to return the `cluster_uuid` is to request the `metadata` metrics, which is one of the most expensive response structures. However, external monitoring agents will likely want the `cluster_uuid` to correlate the response with other API responses whether or not they want cluster metadata.	2018-04-30 10:16:11 -04:00
David Turner	5e4d0b4510	Minor tweaks to reroute documentation (#30246 ) Add yet another warning about data loss to the introductory paragraph about the unsafe commands. Also move this paragraph next to the details of the unsafe commands, below the section on the `retry_failed` flag. Be more specific about how to use the URI parameters and in-body flags. Clarify statements about when rebalancing takes place (i.e. it respects settings) Resolves #16113.	2018-04-30 13:09:03 +01:00
Nik Everett	f4ed902698	CCS: Drop http address from remote cluster info (#29568 ) They are expensive to fetch and no longer needed by Kibana so they shouldn't be needed by anyone else either. Closes #29207	2018-04-27 14:19:00 -04:00
Jason Tedor	d99d0fa669	Add distribution type to startup scripts This commit adds the distribution type to the startup scripts so that we can discern from log output and the main response the type of the distribution (deb/rpm/tar/zip).	2018-04-20 15:34:01 -07:00
Jason Tedor	0045111ce2	Deprecate the suggest metrics (#29627 ) The suggest stats were folded into the search stats as part of the indices stats API in 5.0.0. However, the suggest metric remained as a synonym for the search metric for BWC reasons. This commit deprecates usage of the suggest metric on the indices stats API. Similarly, due to the changes to fold the suggest stats into the search stats, requesting the suggest index metric on the indices metric on the nodes stats API has produced an empty object as the response since 5.0.0. This commit deprecates this index metric on the indices metric on the nodes stats API.	2018-04-20 09:47:38 -04:00
Boaz Leskes	602145a562	Add a note about using the `retry_failed` flag before accepting data loss (#29160 )	2018-03-20 18:11:45 +01:00
Jason Tedor	105dcb544c	Enable selecting adaptive selection stats The node stats API enables filtlering the top-level stats for only desired top-level stats. Yet, this was never enabled for adaptive replica selection stats. This commit enables this. We also add setting these stats on the request builder, and fix an inconsistent name in a setter. Relates #28721	2018-02-19 16:56:36 -05:00
Igor Motov	c75ac319a6	Add ability to associate an ID with tasks (#27764 ) Adds support for capturing the X-Opaque-Id header from a REST request and storing it's value in the tasks that this request started. It works for all user-initiated tasks (not only search). Closes #23250 Usage: ``` $ curl -H "X-Opaque-Id: imotov" -H "foo:bar" "localhost:9200/_tasks?pretty&group_by=parents" { "tasks" : { "7qrTVbiDQKiZfubUP7DPkg:6998" : { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6998, "type" : "transport", "action" : "cluster:monitor/tasks/lists", "start_time_in_millis" : 1513029940042, "running_time_in_nanos" : 266794, "cancellable" : false, "headers" : { "X-Opaque-Id" : "imotov" }, "children" : [ { "node" : "V-PuCjPhRp2ryuEsNw6V1g", "id" : 6088, "type" : "netty", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 67785, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } }, { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6999, "type" : "direct", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 98754, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } } ] } } } ```	2018-01-12 15:34:17 -05:00
Sandeep Kanabar	7e0fc8a112	[Docs] Correct spelling in update-settings.asciidoc (#27808 )	2017-12-14 10:16:50 +01:00
Clinton Gormley	e1aa6e2cda	Fix cluster usage docs test #27611 broke the docs tests because $node_name in the URL doesn't (#27616)seem to be replaced. Changing this to a * to match all nodes seems to fix the test	2017-12-01 16:55:10 +01:00
佛陀.RML	756e170674	[Docs] Fix order of nodes usage example (#27611 )	2017-12-01 10:42:42 +01:00
Nhat Nguyen	46b508d6c9	Add wait_for_no_initializing_shards to cluster health API (#27489 ) This adds a new option to the cluster health request allowing to wait until there is no initializing shards. Closes #25623	2017-11-23 15:09:58 -05:00
Luca Cavanna	29450de7b5	Cross Cluster Search: make remote clusters optional (#27182 ) Today Cross Cluster Search requires at least one node in each remote cluster to be up once the cross cluster search is run. Otherwise the whole search request fails despite some of the data (either local and/or remote) is available. This happens when performing the _search/shards calls to find out which remote shards the query has to be executed on. This scenario is different from shard failures that may happen later on when the query is actually executed, in case e.g. remote shards are missing, which is not going to fail the whole request but rather yield partial results, and the _shards section in the response will indicate that. This commit introduces a boolean setting per cluster called search.remote.$cluster_alias.skip_if_disconnected, set to false by default, which allows to skip certain clusters if they are down when trying to reach them through a cross cluster search requests. By default all clusters are mandatory. Scroll requests support such setting too when they are first initiated (first search request with scroll parameter), but subsequent scroll rounds (_search/scroll endpoint) will fail if some of the remote clusters went down meanwhile. The search API response contains now a new _clusters section, similar to the _shards section, that gets returned whenever one or more clusters were disconnected and got skipped: "_clusters" : { "total" : 3, "successful" : 2, "skipped" : 1 } Such section won't be part of the response if no clusters have been skipped. The per cluster skip_unavailable setting value has also been added to the output of the remote/info API.	2017-11-21 11:41:47 +01:00
David Roberts	a292740b9e	Add cgroup memory usage/limit to OS stats on Linux (#26166 ) This change adds cgroup memory usage/limit to the OS stats section of the node stats on Linux. This information is useful because in Docker containers the standard node stats report the host memory limit, not taking account of extra restrictions that may have been applied to the container. The original idea was to store these values as Long, truncating any values outside the range of long. However, this meant that in the relatively common case of no limit being applied, users would not see the same value in the OS stats as they see by querying Linux directly. So instead the values are stored as String. This change places a burden on consumers of the strings to convert the strings to numbers and decide what to do about extremely large values, but there will be very few consumers and they would need to have a policy for dealing with "no limit" in any case.	2017-10-03 12:08:36 +01:00
Peter Dyson	1f9e0fd0dd	[Docs] improved description for fs.total.available_in_bytes (#26657 )	2017-09-18 16:56:19 +10:00
Tanguy Leroux	643eb286dc	[Docs] Convert remaining code snippets in docs (#26422 ) This commit converts the last remaining code snippets so that they are now testable.	2017-08-30 12:11:10 +02:00
Tanguy Leroux	db54c4dc7c	[Docs] Convert more doc snippets (#26404 ) This commit converts some remaining doc snippets so that they are now testable.	2017-08-30 09:30:36 +02:00
Tanguy Leroux	f95dec797d	[Docs] Convert more doc snippets (#26359 ) This commit converts some remaining doc snippets so that they are now testable.	2017-08-28 11:23:09 +02:00
Jason Tedor	fd18e3239a	Remove mention of http_address in nodes info docs This commit removes an outdated reference to http_address in the nodes info docs. This information is available in the http object for each node in the nodes info API response. Relates #25980	2017-07-31 22:04:16 +09:00
Daniel Mitterdorfer	db90455afd	Update plugin-related output in reference docs (#25897 ) The example output for node info and cluster stats was outdated w.r.t. to the information that is shown for plugins. With this commit we updated the example output and update the explanation of the respective fields.	2017-07-28 11:27:54 +02:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Colin Goodheart-Smithe	779fb9a1c0	Adds nodes usage API to monitor usages of actions (#24169 ) * Adds nodes usage API to monitor usages of actions The nodes usage API has 2 main endpoints /_nodes/usage and /_nodes/{nodeIds}/usage return the usage statistics for all nodes and the specified node(s) respectively. At the moment only one type of usage statistics is available, the REST actions usage. This records the number of times each REST action class is called and when the nodes usage api is called will return a map of rest action class name to long representing the number of times each of the action classes has been called. Still to do: * [x] Create usage service to store usage statistics * [x] Record usage in REST layer * [x] Add Transport Actions * [x] Add REST Actions * [x] Tests * [x] Documentation * Rafactors UsageService so counts are done by the handlers * Fixing up docs tests * Adds a name to all rest actions * Addresses review comments	2017-06-02 08:46:38 +01:00
Clinton Gormley	1b0c93b07c	Documented the level parameter to nodes stats Closes #24999	2017-06-01 12:11:21 +02:00
propulkit	25516868fe	TCorrecting api name (#24924 ) As per REST request signature for reroute, API has no underscore.	2017-05-29 13:58:31 +02:00
Simon Willnauer	f22e0dc30b	Add cross-cluster search remote cluster info API (#23969 ) This commit adds an API to discover information like seed nodes, http addresses and connection status of a configured remote cluster. Closes #23925	2017-04-11 09:24:40 +02:00
Christoph Wurm	2720fc0b43	Clarify task cancellation command (#23667 ) Makes it explicit that the node_id has to be included when canceling a task.	2017-03-30 20:21:21 +02:00
Florian Hopf	09753d6a86	Fix typo in allocation explain API docs This commit addresses a simple typo in the application explain API docs. Relates #23669	2017-03-21 08:41:54 -04:00
Ali Beyad	577d2a6a1d	Adds cluster state size to /_cluster/state response (#23440 ) This commit adds the size of the cluster state to the response for the get cluster state API call (GET /_cluster/state). The size that is returned is the size of the full cluster state in bytes when compressed. This is the same size of the full cluster state when serialized to transmit over the network. Specifying the ?human flag displays the compressed size in a more human friendly manner. Note that even if the cluster state request filters items from the cluster state (so a subset of the cluster state is returned), the size that is returned is the compressed size of the entire cluster state. Closes #3415	2017-03-02 14:20:29 -05:00
Lee Hinman	6c9b89b882	[TEST] Fix incorrect test cluster name in cluster health doc tests	2017-02-22 17:18:11 -07:00
Lee Hinman	5443f7d625	Console-ify curl statements for allocation explain API docs (#23190 ) * Console-ify curl statements for allocation explain API docs Relates to #23001 * Fix tests * Remove exclusion from build.gradle * Call out index creation in prose * Add console back and skip test	2017-02-15 17:18:07 -07:00
Ryan Ernst	c91848e6a7	Docs: Consoleify cluster and indices settings docs (#23030 ) relates #23001	2017-02-10 14:57:43 -08:00
Nik Everett	0e98c9107a	Docs: CONSOLEify some more docs These need to be CONSOLEified now because we're starting to require Content-Type headers and they didn't have any. * cluster/reroute: Marked as CONSOLE but skipped because the docs build runs with a single node. * docs/bulk: Marked as NOTCONSOLE because the snippets describe either examples or `curl` commands. Fixed the `curl` command to include the `Content-Type` header. * query-dsl/terms-query: Marked as CONSOLE. * search/request/rescore: Marked as CONSOLE. Fixed deprecated syntax. Relates #23001 Relates #18160	2017-02-07 16:49:01 -05:00
Clinton Gormley	f5a0d18c4c	Docs: Cluster allocation explain should be on one page	2017-01-26 11:38:19 +01:00
Ali Beyad	26f92f8482	Cluster allocation explain API documentation (#22436 ) This commit updates the cluster allocation explain API documentation to explain the new request parameters and response formats, and gives examples of the explain API responses under various scenarios.	2017-01-10 08:55:39 -06:00
Nik Everett	923820c6c9	Document the `detailed` parameter of tasks API (#22425 ) Provides an example of using is and an example return description and explains that we've added descriptions for some tasks but not even close to all of them. And that we expect to change the descriptions as we learn more. Closes #22407 * Fix example Getting a single task is always detailed, no need to specify. * Rewrite like imotov wants it	2017-01-06 10:24:52 -05:00
Jason Tedor	41ffb008ad	Fix doc bug for cgroup cpuacct usage metric This commit fixes a silly doc bug where the field that represents the total CPU time consumed by all tasks in the same cgroup was mistakenly reported as "usage" instead of "usage_nanos". Relates #21029	2016-12-15 23:22:54 -05:00
Clinton Gormley	cfabc95f59	Fixed bad asciidoc ID in node stats	2016-11-15 17:39:15 +00:00
Jason Tedor	f5ac0e5076	Remove lenient stats parsing Today when parsing a stats request, Elasticsearch silently ignores incorrect metrics. This commit removes lenient parsing of stats requests for the nodes stats and indices stats APIs. Relates #21417	2016-11-15 12:17:26 -05:00
Jason Tedor	aec09a76d6	Clarify requesting all stats in node stats docs This commit clarifies how to explicitly obtain all stats from the node stats API.	2016-11-08 13:47:15 -05:00
Igor Motov	17ad88d539	Makes search action cancelable by task management API Long running searches now can be cancelled using standard task cancellation mechanism.	2016-10-25 12:27:34 -10:00
Jason Tedor	900ee0536e	Strengthen handling of unavailable cgroup stats On some systems, cgroups will be available but not configured. And in some cases, cgroups will be configured, but not for the subsystems that we are expecting (e.g., cpu and cpuacct). This commit strengthens the handling of cgroup stats on such systems. Relates #21094	2016-10-24 16:36:51 -04:00
Jason Tedor	3d642ab0eb	Add basic cgroup CPU metrics This commit adds basic cgroup CPU metrics to the node stats API. Relates #21029	2016-10-24 08:26:56 -04:00
Jason Tedor	51d53791fe	Remove lenient URL parameter parsing Today when parsing a request, Elasticsearch silently ignores incorrect (including parameters with typos) or unused parameters. This is bad as it leads to requests having unintended behavior (e.g., if a user hits the _analyze API and misspell the "tokenizer" then Elasticsearch will just use the standard analyzer, completely against intentions). This commit removes lenient URL parameter parsing. The strategy is simple: when a request is handled and a parameter is touched, we mark it as such. Before the request is actually executed, we check to ensure that all parameters have been consumed. If there are remaining parameters yet to be consumed, we fail the request with a list of the unconsumed parameters. An exception has to be made for parameters that format the response (as opposed to controlling the request); for this case, handlers are able to provide a list of parameters that should be excluded from tripping the unconsumed parameters check because those parameters will be used in formatting the response. Additionally, some inconsistencies between the parameters in the code and in the docs are corrected. Relates #20722	2016-10-04 12:45:29 -04:00
Tanguy Leroux	656596c2a9	[DOC] Remove obsolete node names from documentation Funny node names have been removed in #19456 and replaced by UUID. This commit removes these obsolete node names and replace them by real UUIDs in the documentation. closes #20065	2016-09-19 11:56:28 +02:00
Lee Hinman	fd3392aef8	[DOCS] Mark cluster allocation explain API as experimental in docs	2016-09-06 11:29:33 -06:00
Nik Everett	5cff2a046d	Remove most of the need for `// NOTCONSOLE` and be much more stingy about what we consider a console candidate. * Add `// CONSOLE` to check-running * Fix version in some snippets * Mark groovy snippets as groovy * Fix versions in plugins * Fix language marker errors * Fix language parsing in snippets This adds support for snippets who's language is written like `[source, txt]` and `["source","js",subs="attributes,callouts"]`. This also makes language required for snippets which is nice because then we can be sure we can grep for snippets in a particular language.	2016-09-06 10:32:54 -04:00
javanna	5f299ff46f	add mem section back to cluster stats The mem section was buggy in cluster stats and removed. It is now added back with the same structure as in node stats, containing total memory, available memory, used memory and percentages. All the values are the sum of all the nodes across the cluster (or at least the ones that we were able to get the values from).	2016-09-01 11:26:03 +02:00
Ali Beyad	4641254ea6	Parameter improvements to Cluster Health API wait for shards (#20223 ) * Params improvements to Cluster Health API wait for shards Previously, the cluster health API used a strictly numeric value for `wait_for_active_shards`. However, with the introduction of ActiveShardCount and the removal of write consistency level for replication operations, `wait_for_active_shards` is used for write operations to represent values for ActiveShardCount. This commit moves the cluster health API's usage of `wait_for_active_shards` to be consistent with its usage in the write operation APIs. This commit also changes `wait_for_relocating_shards` from a numeric value to a simple boolean value `wait_for_no_relocating_shards` to set whether the cluster health operation should wait for all relocating shards to complete relocation. * Addresses code review comments * Don't be lenient if `wait_for_relocating_shards` is set	2016-08-31 11:58:19 -04:00
Nik Everett	777ea124c7	Fix health docs test It failed inconsistently when there were pending tasks.	2016-07-16 07:18:11 -04:00
Nik Everett	9f78f8cc91	Convert snippets in health docs to CONSOLE This should make them easier to read and adds them to the test suite I changed the example from a two node cluster to a single node cluster because that is what we have running in the integration tests. It is also what a user just starting out is likely to see so I think that is ok.	2016-07-15 16:31:37 -04:00
Lee Hinman	58db63b610	Expose the ClusterInfo object in the allocation explain output This adds an optional parameter to the cluster allocation explain API that will return the cluster info object, `include_disk_info`, the output looks like: GET /_cluster/allocation/explain?include_disk_info -d' {"index": "i", "shard": 0, "primary": false}' { ... other info ... "cluster_info" : { "nodes" : { "7Uws-vL7R6WVm3ZwQA1n5A" : { "node_name" : "Kraven the Hunter", "least_available" : { "path" : "/path/to/data1", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 }, "most_available" : { "path" : "/path/to/data2", "total_bytes" : 165999570944, "used_bytes" : 118180614144, "free_bytes" : 47818956800, "free_disk_percent" : 28.80667493781158, "used_disk_percent" : 71.19332506218842 } } }, "shard_sizes" : { "[i][2][p]_bytes" : 0, "[i][4][p]_bytes" : 130, "[i][1][p]_bytes" : 0, "[i][3][p]_bytes" : 0, "[i][0][p]_bytes" : 130 }, "shard_paths" : { "[i][3], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=LegZLDniTVaw0Y1urv7s3g]" : "/path/to/data1/nodes/0", "[i][1], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=lAU_4vf_SKmoRdtg0ACnjQ]" : "/path/to/data1/nodes/0", "[i][2], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Aurpeuj7SeGeyPDDpCtRgg]" : "/path/to/data1/nodes/0", "[i][0], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=Vgg8GlQTQ82C2j6HYBq8DQ]" : "/path/to/data1/nodes/0", "[i][4], node[7Uws-vL7R6WVm3ZwQA1n5A], [P], s[STARTED], a[id=t8hQlVSxQe-58fSeaXcAqg]" : "/path/to/data1/nodes/0" } } } Resolves #14405	2016-07-12 15:52:20 -06:00
Mike McCandless	eecf094ac1	add indices nodes info flag to docs	2016-06-20 14:23:32 -04:00
Mike McCandless	3f221bf7cb	Add total_indexing_buffer/_in_bytes to nodes info API	2016-06-16 04:39:34 -04:00
Nik Everett	e392e0b1df	Create get task API that falls back to the .tasks index This adds a get task API that supports GET /_tasks/${taskId} and removes that responsibility from the list tasks API. The get task API supports wait_for_complation just as the list tasks API does but doesn't support any of the list task API's filters. In exchange, it supports falling back to the .results index when the task isn't running any more. Like any good GET API it 404s when it doesn't find the task. Then we change reindex, update-by-query, and delete-by-query to persist the task result when wait_for_completion=false. The leads to the neat behavior that, once you start a reindex with wait_for_completion=false, you can fetch the result of the task by using the get task API and see the result when it has finished. Also rename the .results index to .tasks.	2016-06-14 13:37:34 -04:00
Mike McCandless	5c525e6606	Remove index_writer_max_memory stat from segment stats	2016-05-31 06:29:29 -04:00
Lee Hinman	bfce901edf	Merge remote-tracking branch 'dakrone/explain-add-fetch-in-progress'	2016-05-23 09:43:16 -06:00
Lee Hinman	8040ed0c16	Add whether the shard state fetch is pending to the allocation explain API If the shard state fetch is still pending, this will now return a message like: ```json { "shard" : { "index" : "i", "index_uuid" : "de1W1374T4qgvUP4a9Ieaw", "id" : 0, "primary" : false }, "assigned" : false, "shard_state_fetch_pending": true, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-04-26T16:34:53.227Z" }, "allocation_delay_ms" : 0, "remaining_delay_ms" : 0, "nodes" : { "z-CbkiELT-SoWT91HIszLA" : { "node_name" : "Brain Cell", "node_attributes" : { "testattr" : "test" }, "store" : { "shard_copy" : "NONE" }, "final_decision" : "NO", "final_explanation" : "the shard state fetch is pending", "weight" : 5.0, "decisions" : [ ] } } } ``` Adds the `shard_state_fetch_pending` field and uses the state to influence the final decision and final explanation. Relates to #17372	2016-05-23 09:42:57 -06:00
Simon Willnauer	35e705877b	Limit retries of failed allocations per index (#18467 ) Today if a shard fails during initialization phase due to misconfiguration, broken disks, missing analyzers, not installed plugins etc. elasticsaerch keeps on trying to initialize or rather allocate that shard. Yet, in the worst case scenario this ends in an endless allocation loop. To prevent this loop and all it's sideeffects like spamming log files over and over again this commit adds an allocation decider that stops allocating a shard that failed more than N times in a row to allocate. The number or retries can be configured via `index.allocation.max_retry` and it's default is set to `5`. Once the setting is updated shards with less failures than the number set per index will be allowed to allocate again. Internally we maintain a counter on the UnassignedInfo that is reset to `0` once the shards has been started. Relates to #18417	2016-05-20 20:37:45 +02:00
Jason Tedor	ecce53f0df	Add I/O statistics on Linux This commit adds a variety of real disk metrics for the block devices that back Elasticsearch data paths. A collection of statistics are read from /proc/diskstats and are used to report the raw metrics for operations and read/write bytes. Relates #15915	2016-05-17 16:16:39 -04:00
Clinton Gormley	3f594089c2	Renamed all AUTOSENSE snippets to CONSOLE (#18210 )	2016-05-09 15:42:23 +02:00
Nik Everett	4b1c116461	Generate and run tests from the docs Adds infrastructure so `gradle :docs:check` will extract tests from snippets in the documentation and execute the tests. This is included in `gradle check` so it should happen on CI and during a normal build. By default each `// AUTOSENSE` snippet creates a unique REST test. These tests are executed in a random order and the cluster is wiped between each one. If multiple snippets chain together into a test you can annotate all snippets after the first with `// TEST[continued]` to have the generated tests for both snippets joined. Snippets marked as `// TESTRESPONSE` are checked against the response of the last action. See docs/README.asciidoc for lots more. Closes #12583. That issue is about catching bugs in the docs during build. This catches some bugs in the docs during build which is a good start.	2016-05-05 13:58:03 -04:00
Lee Hinman	5648253d45	Add documentation for shard store output in allocation explain API Relates to #17689	2016-05-03 09:51:15 -06:00
Igor Motov	81c59cae18	Add _cat/tasks Adds new _cat endpoint that lists all tasks	2016-04-07 09:28:21 -06:00
Igor Motov	f599ac5d5a	Expose whether a task is cancellable in the _tasks list API Closes #17369	2016-04-05 19:16:08 -06:00
Daniel Mitterdorfer	930ce1bfec	Add up-to-date example of cluster stats API output	2016-03-31 14:41:37 +02:00
Igor Motov	e073b0c75d	Add ability to group tasks by common parent By default, tasks are grouped by node. However, task execution in elasticsearch can be quite complex and an individual task that runs on a coordinating node can have many subtasks running on other nodes in the cluster. This commit makes it possible to list task grouped by common parents instead of by node. When this option is enabled all subtask are grouped under the coordinating node task that started all subtasks in the group. To group tasks by common parents, use the following syntax: GET /tasks?group_by=parents	2016-03-30 17:50:27 -04:00
javanna	061f09d9a4	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 20:19:33 +02:00
Igor Motov	c356b30cff	Update task management docs to reflect the latest changes in the interface Brings docs in line with new list task syntax and adds task cancellation API docs.	2016-03-29 12:26:37 -04:00
javanna	8fc9dbbb99	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 14:27:04 +02:00
Clinton Gormley	978b24327e	Docs: Included Nodes Task API and tidied reindex/update-by-query	2016-03-29 13:51:11 +02:00
javanna	de5cbda8e7	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 10:48:47 +02:00
Lee Hinman	80ab366de4	Add API to explain why a shard is or isn't assigned This adds a new `/_cluster/allocation/explain` API that explains why a shard can or cannot be allocated to nodes in the cluster. Additionally, it will show where the master desires to put the shard, according to the `ShardsAllocator`. It looks like this: ``` GET /_cluster/allocation/explain?pretty { "index": "only-foo", "shard": 0, "primary": false } ``` Though, you can optionally send an empty body, which means "explain the allocation for the first unassigned shard you find". The output when a shard is unassigned looks like this: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : false }, "assigned" : false, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-03-22T20:04:23.620Z" }, "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 0.06666675, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "NO", "weight" : -1.3833332, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 2.3166666, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` And when the shard is assigned, the output looks like: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : true }, "assigned" : true, "assigned_node_id" : "Qc6VL8c5RWaw1qXZ0Rg57g", "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 1.4499999, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "CURRENTLY_ASSIGNED", "weight" : 0.0, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 3.6999998, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` Only "NO" decisions are returned by default, but all decisions can be shown by specifying the `?include_yes_decisions=true` parameter in the request. Resolves #14593	2016-03-28 15:21:02 -06:00
javanna	bf390a935e	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-21 17:18:23 +01:00
Robin Clarke	046212035c	Clarification about precedence of settings Closes #14559	2016-03-10 14:29:51 +01:00
Martijn van Groningen	2fa33d5c47	Added ingest statistics to node stats API The ingest stats include the following statistics: * `ingest.total.count`- The total number of document ingested during the lifetime of this node * `ingest.total.time_in_millis` - The total time spent on ingest preprocessing documents during the lifetime of this node * `ingest.total.current` - The total number of documents currently being ingested. * `ingest.total.failed` - The total number ingest preprocessing operations failed during the lifetime of this node Also these stats are returned on a per pipeline basis.	2016-03-10 13:21:43 +01:00
Martijn van Groningen	82d01e4315	Added ingest info to node info API, which contains a list of available processors. Internally the put pipeline API uses this information in node info API to validate if all specified processors in a pipeline exist on all nodes in the cluster.	2016-03-07 14:44:50 +01:00
javanna	9c4a5bbe7e	adapt cluster stats api to node.client setting removal The cluster stats api now returns counts for each node role. The `master_data`, `master_only`, `data_only` and `client` fields have been removed from the response in favour of `master`, `data`, `ingest` and `coordinating_only`. The same node can have multiple roles, hence contribute to multiple roles counts. Every node is implicitly a coordinating node, so whenever a node has no explicit roles, it will be counted as coordinating only.	2016-03-05 10:55:19 +01:00
Clinton Gormley	4e5316591a	Update stats.asciidoc Renamed filter_cache->query_cache and removed id_cache Closes #16626	2016-01-26 13:48:46 +01:00
Yannick Welsch	d5b691b68e	Extend reroute with an option to force assign stale primary shard copies Closes #15708	2016-01-19 12:07:01 +01:00
Jason Tedor	df598e8129	Modify load average formats This commit modifies the load_average in the node stats API response to be an object containing the one-minute, five-minute and fifteen-minute load averages as fields (if those values are available). Additionally, this commit modifies the cat nodes API response to format the one-minute, five-minute and fifteen-minute load averages as null if any of the respective values are not available.	2016-01-18 11:41:34 -05:00
Jason Tedor	1de2081ed3	Reintroduce five-minute and fifteen-minute load averages on Linux This commit reintroduces the five-minute and fifteen-minute load stats on Linux, and changes the format of the load_average field back to an array.	2016-01-11 23:42:47 -05:00
Simon Willnauer	6ea266a89c	Merge branch 'master' into settings_prototype	2015-12-15 16:33:01 +01:00
Felipe Forbeck	708abcc59a	Added desc for parameter <local>	2015-12-11 22:26:33 -02:00
Simon Willnauer	ce417540c5	apply review from @clintongormley	2015-12-09 12:24:40 +01:00
Simon Willnauer	2e27ee393f	add rest API to reset settings	2015-12-08 14:39:16 +01:00
Jason Tedor	6872d545ac	Add system CPU percent to OS stats This commit adds the system CPU percent reflecting the recent CPU usage for the whole system.	2015-11-17 13:48:46 -05:00
xuzha	fb1d8bb149	Add os.allocated_processors Current processors setting is not reflected in nodes info API ("os.available_processors"). Add os.allocated_processors to shows actual number of processors that we are using.	2015-11-03 09:50:17 -08:00
xuzha	97ecd7bf5a	Expose pending cluster state queue size in node stats Add 3 stats about the queue: total queue size, number of committed cluster states, and number of pending cluster states.	2015-10-28 10:59:15 -07:00
Tanguy Leroux	db7aecab4d	update list of available os stats os cpu information is no longer exposed through the nodes stats api	2015-08-31 17:03:45 +02:00
Tanguy Leroux	8e052f0da2	Make platform specific assumptions in OS & Process probes tests	2015-08-17 14:47:23 +02:00
Andrey Fadeyev	081fb1a899	Fixes #11571 - update "Cluster Stats" documentation with valid example	2015-08-13 12:09:31 +02:00
Tanguy Leroux	03c327ff12	Expose ClassloadingMXBean in Node Stats Closes #12738	2015-08-12 14:29:13 +02:00
Clinton Gormley	db541d6fbe	Docs: Add warning about allow_primary to the cluster reroute docs Closes #12503	2015-08-07 12:03:19 +02:00

1 2 3 4 5

214 Commits