OpenSearch

Commit Graph

Author	SHA1	Message	Date
Mike McCandless	5c525e6606	Remove index_writer_max_memory stat from segment stats	2016-05-31 06:29:29 -04:00
Simon Willnauer	502a775a7c	Add primitive to shrink an index into a single shard (#18270 ) This adds a low level primitive operations to shrink an existing index into a new index with a single shard. This primitive expects all shards of the source index to allocated on a single node. Once the target index is initializing on the shrink node it takes a snapshot of the source index shards and copies all files into the target indices data folder. An [optimization](https://issues.apache.org/jira/browse/LUCENE-7300) coming in Lucene 6.1 will also allow for optional constant time copy if hard-links are supported by the filesystem. All mappings are merged into the new indexes metadata once the snapshots have been taken on the merge node. To shrink an existing index all shards must be moved to a single node (one instance of each shard) and the index must be read-only: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_settings' -d '{ "settings" : { "index.routing.allocation.require._name" : "shrink_node_name", "index.blocks.write" : true } } ``` once all shards are started on the shrink node. the new index can be created via: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_shrink/logs_single_shard' -d '{ "settings" : { "index.codec" : "best_compression", "index.number_of_replicas" : 1 } }' ``` This API will perform all needed check before the new index is created and selects the shrink node based on the allocation of the source index. This call returns immediately, to monitor shrink progress the recovery API should be used since all copy operations are reflected in the recovery API with byte copy progress etc. The shrink operation does not modify the source index, if a shrink operation should be canceled or if the shrink failed, the target index can simply be deleted and all resources are released.	2016-05-31 10:41:44 +02:00
Yannick Welsch	dee34c916c	Expand wildcards to closed indices in /_cat/indices (#18545 ) Closed indices are already displayed when no indices are explicitly selected. This commit ensures that closed indices are also shown when wildcard filtering is used. It also addresses another issue that is caused by the fact that the cat action is based internally on 3 different cluster states (one when we query the cluster state to get all indices, one when we query cluster health, and one when we query indices stats). We currently fail the cat request when the user specifies a concrete index as parameter that does not exist. The implementation works as intended in that regard. It checks this not only for the first cluster state request, but also the subsequent indices stats one. This means that if the index is deleted before the cat action has queried the indices stats, it rightfully fails. In case the user provides wildcards (or no parameter at all), however, we fail the indices stats as we pass the resolved concrete indices to the indices stats request and fail to distinguish whether these indices have been resolved by wildcards or explicitly requested by the user. This means that if an index has been deleted before the indices stats request gets to execute, we fail the overall cat request. The fix is to let the indices stats request do the resolving again and not pass the concrete indices. Closes #16419 Closes #17395	2016-05-25 10:02:14 +02:00
Tal Levy	b40628d4e8	fix simulate spec test for new exception handling (#18564 )	2016-05-24 22:32:50 -07:00
Tanguy Leroux	1f011f9dea	Remove Delete-By-Query plugin closes #18469	2016-05-24 13:28:20 +02:00
Martijn van Groningen	27cc2fe4dc	Moved the percolator from core to its own module Significant changes: * AbstractQueryTestCase has moved to the test framework module, in order for query builder tests in modules and plugins * Added support to AbstractQueryTestCase to register plugins * Lift the restriction that only one percolator could be added per index. This validation existed in MapperService, but because the percolator moved to a module it could no longer exist there. Instead of bringing it back it was removed. This validation existed since the percolator cache only supported one percolator query per document, since the percolator cache has been removed this restriction could removed as well. * While moving percolator tests to the new module, also removed a couple of tests for the deprecated percolate and mpercolate api. These APIs are now sugar APIs for bwc and rediect to the searvh and msearvh APIs. Some tests were still testing as if percolate and mpercolate API did the percolation, but this no longer the case and these tests could be removed.	2016-05-24 11:01:57 +02:00
spalger	be2ba53fca	[rest api spec] fix doc urls	2016-05-20 13:48:16 -07:00
Spencer	332f6ffe59	[rest api spec] fix url for reindex api docs	2016-05-20 12:22:56 -07:00
Simon Willnauer	35e705877b	Limit retries of failed allocations per index (#18467 ) Today if a shard fails during initialization phase due to misconfiguration, broken disks, missing analyzers, not installed plugins etc. elasticsaerch keeps on trying to initialize or rather allocate that shard. Yet, in the worst case scenario this ends in an endless allocation loop. To prevent this loop and all it's sideeffects like spamming log files over and over again this commit adds an allocation decider that stops allocating a shard that failed more than N times in a row to allocate. The number or retries can be configured via `index.allocation.max_retry` and it's default is set to `5`. Once the setting is updated shards with less failures than the number set per index will be allowed to allocate again. Internally we maintain a counter on the UnassignedInfo that is reset to `0` once the shards has been started. Relates to #18417	2016-05-20 20:37:45 +02:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 for it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime, Today these changes are visible in near realtime. So updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive. Especially when there are many percolator queries then these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to how the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00
Tanguy Leroux	a01ecb20ea	Port Delete By Query to Reindex infrastructure closes #16883	2016-05-19 16:07:50 +02:00
markharwood	a846ff93e9	Aggregations fix: support include/exclude strings formatted for IP and date fields in terms and significant_terms aggregations. Closes #17705	2016-05-18 16:21:55 +01:00
Tanguy Leroux	d7a31c8cf7	Add missing builder.endObject() in FsInfo closes #18433	2016-05-18 15:19:30 +02:00
Christoph Büscher	de321fb159	Disabling nodes.stats/30_discovery.yaml rest test This is tracked in issue #18433, so temporarily disabling the tests.	2016-05-18 13:02:55 +02:00
$polyfractal$ polyfractal	978c1e3e36	[TEST] Add missing sort processor to test Also fixes the naming of the sort REST test to follow the numbering convention	2016-05-17 14:52:17 -04:00
Zachary Tong	7c46b57ff2	Add a Sort ingest processor Sorts an array of values in ascending or descending order. If all elements are numerics, they will be sorted numerically. If values are strings, or mixtures of strings/numbers, the elements will be sorted lexicographically.	2016-05-17 12:06:48 -04:00
Chris Earle	9303ffbf82	Adding REST tests to ensure key_as_string behavior stays consistent	2016-05-13 11:48:49 -04:00
Adrien Grand	61b1f4ad0b	Fix xcontent rendering of ip terms aggs. #18003 Currently terms on an ip address try to put their binary representation in the json response. With this commit, they would return a formatted ip address: ``` "buckets": [ { "key": "192.168.1.7", "doc_count": 1 } ] ```	2016-05-13 14:59:36 +02:00
Lee Hinman	1c54033e92	Merge branch 'pr/18068'	2016-05-10 08:27:43 -06:00
Jim Ferenczi	aef78ceb13	Do not return fieldstats information for fields that exist in the mapping but not in the index.	2016-05-09 19:24:03 +02:00
Tanguy Leroux	0ff5652fff	Add node name to Cat Recovery closes #8041	2016-05-06 16:59:53 +02:00
Nik Everett	230697c202	[reindex] Switch throttle to Float.POSITIVE_INFITINTY/"unlimited" All other values are errors. Add java test for throttling. We had a REST test but it only ran against one node so it didn't catch serialization errors. Add Simple round trip test for rethrottle request	2016-05-04 16:14:32 -04:00
Martijn van Groningen	7aca1389e2	ingest: Add `date_index_name` processor. Closes #17814	2016-04-29 17:20:48 +02:00
David Pilato	2232a7cdf3	Merge branch 'pr/cat-size-time-units'	2016-04-29 15:09:14 +02:00
Jim Ferenczi	573c4f3ed1	Extend field stats: * Add isSearchable and isAggregatable (collapsed to true if any of the instances of that field are searchable or aggregatable). * Accept wildcards in field names. * Add a section named conflicts for fields with the same name but with incompatible types (instead of throwing an exception).	2016-04-27 16:51:53 +02:00
Alexander Kazakov	a8a33a1a94	Row-centric output for _cat/fielddata	2016-04-27 13:29:02 +03:00
Martijn Laarman	98a37a1e54	Remove trailing / in rest spec for ingest.simulate (#17976 ) The url that takes an id has a trailing forward slash, not really an error but as its the only url in the whole spec that does this it triggered my OCD :)	2016-04-26 11:03:17 +02:00
Nik Everett	1c2e84ba46	Fail request if rescore window > 10,000 The setting is named `index.max_rescore_window` and defaults to `index.max_result_window` which defaults to 10,000.	2016-04-22 11:10:01 -04:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Martijn van Groningen	dd2184ab25	ingest: Streamline option naming for several processors: * `rename` processor, renamed `to` to `target_field` * `date` processor, renamed `match_field` to `field` and renamed `match_formats` to `formats` * `geoip` processor, renamed `source_field` to `field` and renamed `fields` to `properties` * `attachment` processor, renamed `source_field` to `field` and renamed `fields` to `properties` Closes #17835	2016-04-21 13:40:43 +02:00
Jun Ohtani	9eb242a5fe	Analyze API : Rename filters/token_filters/char_filter to filter/token_filter/char_filter Closes #15189	2016-04-21 18:05:11 +09:00
Martijn van Groningen	40c22fc654	percolator: removed .percolator type instead a field of type `percolator` should be configured before indexing percolator queries * Added an extra `field` parameter to the `percolator` query to indicate what percolator field should be used. This must be an existing field in the mapping of type `percolator`. * The `.percolator` type is now forbidden. (just like any type that starts with a `.`) This only applies for new indices created on 5.0 and later. Indices created on previous versions the .percolator type is still allowed to exist. The new `percolator` field type isn't active in such indices and the `PercolatorQueryCache` knows how to load queries from these legacy indices. The `PercolatorQueryBuilder` will not enforce that the `field` parameter is of type `percolator`.	2016-04-19 11:20:31 +02:00
David Pilato	5e1f26c22a	Add support for documented byte/size units and for micros as a time unit in _cat API We advertise in our documentation that byte units are like `kb`, `mb`... But we actually only support the simple notation `k` or `m`. This commit adds support for the documented form and keeps the non documented options to avoid any breaking change. It also adds support for `micros`, `nanos` and `d` as a time unit in `_cat` API. Remove the support for `b` as a SizeValue unit. Actually, for numbers, when using raw numbers without unit, there is no text to add/parse after the number. For example, you don't write `10` as `10b`. We support option like `size=` in `_cat` API which means that we want to display raw data without unit (singles). Documentation updated accordingly. Add test for the empty size option. Fix missing TimeValues options for some cat APIs	2016-04-15 20:55:41 +02:00
jaymode	293ede3237	Enable installing the rest-api-spec artifact Running `gradle install` on the rest-api-spec fails because there is no available install task. This change applies the nexus plugin so that we can install and should also enable publishing as part of the uploadArchives task.	2016-04-13 11:26:47 -04:00
Jun Ohtani	048d273408	Cat: cat health supports ts=0 option If ts=0, cat health disable epoch and timestamp Be Constant String timestamp and epoch Move timestamp and epoch to Table Add rest-api test and test Closes #10109	2016-04-13 18:08:30 +09:00
Nik Everett	7908759949	Make aggregation registration more like query registration Creates a class to hold behavior common to these `ParseFieldRegistry`s. Also renames some weirdly named methods on StreamInput.	2016-04-12 14:50:44 -04:00
Igor Motov	81c59cae18	Add _cat/tasks Adds new _cat endpoint that lists all tasks	2016-04-07 09:28:21 -06:00
Jimmy Jones	f157dae053	Disallow unquoted field names, fix testcases using unquoted JSON	2016-04-06 14:37:15 -06:00
Spencer	7037670aeb	[REST API] set correct default value The correct default value for the `expand_wildcards` parameter to `indices.get_alias` is `all` as of all `f4d75f0212`	2016-04-05 09:19:21 -07:00
Mpdreamz	dd0c99bb65	index is a required url part	2016-04-04 14:14:44 +02:00
Mpdreamz	2944869ab9	Document task id's as string in the rest spec	2016-04-01 11:04:15 +02:00
Martijn Laarman	dfe2c0ff0a	Remove deprecated indices.get_aliases This has been deprecated since the first release of Elasticsearch 1.0	2016-03-31 14:46:53 +02:00
Igor Motov	e073b0c75d	Add ability to group tasks by common parent By default, tasks are grouped by node. However, task execution in elasticsearch can be quite complex and an individual task that runs on a coordinating node can have many subtasks running on other nodes in the cluster. This commit makes it possible to list task grouped by common parents instead of by node. When this option is enabled all subtask are grouped under the coordinating node task that started all subtasks in the group. To group tasks by common parents, use the following syntax: GET /tasks?group_by=parents	2016-03-30 17:50:27 -04:00
Nik Everett	78ab6c5b7f	[reindex] Dynamic throttle! This allows the user to update the reindex throttle on the fly, with changes that speed up the throttling being applied immediately and changes that slow down the throttling being applied during the next batch. This means that if a user throttles reindex in such a way that it tries to sleep for 16 years and then realizes that they've done something wrong then they can change the throttle and reindex will wake up again. We don't apply slow downs immediately so we never get in danger of losing the scan context. Also, if reindex is canceled while it is sleeping (how it honor throttling) then it'll immediately wake up and cancel itself.	2016-03-30 16:40:42 -04:00
Isabel Drost-Fromm	132f96b6ba	Merge pull request #17403 from elastic/docs/remove-rest-api-utils-reference Remove reference to utils for generating REST docs	2016-03-30 21:18:41 +02:00
Adrien Grand	068c788ec8	Disable fielddata on text fields by defaults. #17386 `text` fields will have fielddata disabled by default. Fielddata can still be enabled on an existing index by setting `fielddata=true` in the mappings.	2016-03-30 14:35:32 +02:00
Isabel Drost-Fromm	72d1ed65a4	Remove reference to utils for generating REST docs This removes the reference to a no longer existing utils directory that used to be there for generating docs and tests from Java source.	2016-03-30 13:36:51 +02:00
javanna	8fc9dbbb99	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 14:27:04 +02:00
Clinton Gormley	b87beeb05f	Rename update-by-query REST tests to update_by_query	2016-03-29 13:13:49 +02:00
Clinton Gormley	647437ce56	REST: The body is required in the reindex API	2016-03-29 11:45:20 +02:00
Clinton Gormley	97606850e8	Renamed update-by-query REST spec to update_by_query	2016-03-29 11:45:20 +02:00
javanna	de5cbda8e7	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 10:48:47 +02:00
Lee Hinman	80ab366de4	Add API to explain why a shard is or isn't assigned This adds a new `/_cluster/allocation/explain` API that explains why a shard can or cannot be allocated to nodes in the cluster. Additionally, it will show where the master desires to put the shard, according to the `ShardsAllocator`. It looks like this: ``` GET /_cluster/allocation/explain?pretty { "index": "only-foo", "shard": 0, "primary": false } ``` Though, you can optionally send an empty body, which means "explain the allocation for the first unassigned shard you find". The output when a shard is unassigned looks like this: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : false }, "assigned" : false, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-03-22T20:04:23.620Z" }, "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 0.06666675, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "NO", "weight" : -1.3833332, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 2.3166666, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` And when the shard is assigned, the output looks like: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : true }, "assigned" : true, "assigned_node_id" : "Qc6VL8c5RWaw1qXZ0Rg57g", "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 1.4499999, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "CURRENTLY_ASSIGNED", "weight" : 0.0, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 3.6999998, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` Only "NO" decisions are returned by default, but all decisions can be shown by specifying the `?include_yes_decisions=true` parameter in the request. Resolves #14593	2016-03-28 15:21:02 -06:00
Nik Everett	0e6141e675	Replace is_true: took with took >= 0 This prevents tests from failing on machines that can finish the request less than half a millisecond.	2016-03-28 13:03:48 -04:00
javanna	a685148268	[TEST] expand REST tests to check for roles in nodes info, nodes stats and tasks list response	2016-03-25 22:53:21 +01:00
javanna	a9f4982c40	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-25 20:16:40 +01:00
Clinton Gormley	30d78f4be0	In cat.snapshots, repository is required Closes #17216	2016-03-25 14:23:52 +01:00
javanna	27d4994aff	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-24 18:10:11 +01:00
Areek Zillur	e16e113691	Remove suggest threadpool In #17198, we removed suggest transport action, which used the `suggest` threadpool to execute requests. Now `suggest` threadpool is unused and suggest requests are executed on the `search` threadpool.	2016-03-23 18:01:45 -04:00
Areek Zillur	91dd9b3301	Merge suggest stats into search stats	2016-03-23 16:37:56 -04:00
Honza Král	b139f4e0bf	[TEST] Move yaml test requiring yaml, add skip:yaml Clients don't ship with yaml (de)serializer by default so this test must be optionally skipped	2016-03-23 14:50:23 +01:00
javanna	030453d320	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-23 11:25:34 +01:00
Honza Král	f8e84f0bbb	[TEST] fix incorrect indent in ingest/70_bulk.yaml	2016-03-22 20:53:23 +01:00
Honza Král	ca4b8667bb	[TEST] Move yaml test requiring header, add skip:headers	2016-03-22 20:53:23 +01:00
Nik Everett	da96b6e41d	[reindex] Add thottling support The throttle is applied when starting the next scroll request so that its timeout can include the throttle time.	2016-03-22 12:34:14 -04:00
javanna	eebd0cfccd	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-22 10:34:40 +01:00
Simon Willnauer	7f16a1d9a7	Improve upgrade experience of node level index settings In 5.0 we don't allow index settings to be specified on the node level ie. in yaml files or via commandline argument. This can cause problems during upgrade if this was used extensively. For instance if analyzers where specified on a node level this might cause the index to be closed when imported (see #17187). In such a case all indices relying on this must be updated via `PUT /${index}/_settings`. Yet, this API has slightly different semantics since it overrides existing settings. To make this less painful this change adds a `preserve_existing` parameter on that API to ensure we have the same semantics as if the setting was applied on the node level. This change also adds a better error message and a change to the migration guide to ensure upgrades are smooth if index settings are specified on the node level. If a index setting is detected this change fails the node startup and prints a message like this: ``` *********************************************************************************** Found index level settings on node level configuration. Since elasticsearch 5.x index level settings can NOT be set on the nodes configuration like the elasticsearch.yaml, in system properties or command line arguments.In order to upgrade all indices the settings must be updated via the /${index}/_settings API. Unless all settings are dynamic all indices must be closed in order to apply the upgradeIndices created in the future should use index templates to set default values. Please ensure all required values are updated on all indices by executing: curl -XPUT 'http://localhost:9200/_all/_settings?preserve_existing=true' -d '{ "index.number_of_shards" : "1", "index.query.default_field" : "main_field", "index.translog.durability" : "async", "index.ttl.disable_purge" : "true" }' *********************************************************************************** ```	2016-03-21 20:12:18 +01:00
javanna	4077a2c9e1	adapt cluster stats REST tests after merge	2016-03-21 18:24:12 +01:00
javanna	bf390a935e	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-21 17:18:23 +01:00
Martijn van Groningen	e3b7e5d75a	percolator: Replace percolate api with the new percolator query Also replaced the PercolatorQueryRegistry with the new PercolatorQueryCache. The PercolatorFieldMapper stores the rewritten form of each percolator query's xcontext in a binary doc values field. This make sure that the query rewrite happens only during indexing (some queries for example fetch shapes, terms in remote indices) and the speed up the loading of the queries in the percolator query cache. Because the percolator now works inside the search infrastructure a number of features (sorting fields, pagination, fetch features) are available out of the box. The following feature requests are automatically implemented via this refactoring: Closes #10741 Closes #7297 Closes #13176 Closes #13978 Closes #11264 Closes #10741 Closes #4317	2016-03-21 12:21:50 +01:00
Clinton Gormley	0543d46c1d	Fixed regex in cat.recovery REST tes The time column should accept integer ms or floating point seconds	2016-03-16 17:22:00 +01:00
Simon Willnauer	121e7c8ca4	Add infrastructure to run REST tests on a multi-version cluster This change adds the infrastructure to run the rest tests on a multi-node cluster that users 2 different minor versions of elasticsearch. It doesn't implement any dedicated BWC tests but rather leverages the existing REST tests. Since we don't have a real version to test against, the tests uses the current version until the first minor / RC is released to ensure the infrastructure works. Relates to #14406 Closes #17072	2016-03-13 10:52:39 +01:00
Jason Tedor	f465d98eb3	Add raw recovery progress to cat recovery API This commit adds fields bytes_recovered and files_recovered to the cat recovery API. These fields, respectively, indicate the total number of bytes and files recovered. Additionally, for consistency, some totals fields and translog recovery fields have been renamed. Closes #17064	2016-03-11 08:27:09 -05:00
Nik Everett	b8d931d23c	[reindex] Timeout if sub-requests timeout Sadly, it isn't easy to simulate a timeout during an integration test, you just have to cause one. Groovy's sleep should do the job.	2016-03-10 13:05:23 -05:00
Martijn van Groningen	0bbb84c19a	test: 'Test bulk request with default pipeline' may get run first and then the total ingest count for pipeline1 is 2.	2016-03-10 15:18:08 +01:00
Martijn van Groningen	2fa33d5c47	Added ingest statistics to node stats API The ingest stats include the following statistics: * `ingest.total.count`- The total number of document ingested during the lifetime of this node * `ingest.total.time_in_millis` - The total time spent on ingest preprocessing documents during the lifetime of this node * `ingest.total.current` - The total number of documents currently being ingested. * `ingest.total.failed` - The total number ingest preprocessing operations failed during the lifetime of this node Also these stats are returned on a per pipeline basis.	2016-03-10 13:21:43 +01:00
Nik Everett	6d0efae713	Teach list tasks api to wait for tasks to finish _wait_for_completion defaults to false. If set to true then the API will wait for all the tasks that it finds to stop running before returning. You can use the timeout parameter to prevent it from waiting forever. If you don't set a timeout parameter it'll default to 30 seconds. Also adds a log message to rest tests if any tasks overrun the test. This is just a log (instead of failing the test) because lots of tasks are run by the cluster on its own and they shouldn't cause the test to fail. Things like fetching disk usage from the other nodes, for example. Switches the request to getter/setter style methods as we're going that way in the Elasticsearch code base. Reindex is all getter/setter style. Closes #16906	2016-03-08 11:53:57 -05:00
Jun Ohtani	071d578953	Analysis : Allow string explain param in JSON Move some test methods from AnalylzeActionIT to RestAnalyzeActionTest Allow string explain param if it can parse Fix wrong param name in rest-api-spec Closes #16925	2016-03-08 16:19:02 +09:00
Martijn van Groningen	82d01e4315	Added ingest info to node info API, which contains a list of available processors. Internally the put pipeline API uses this information in node info API to validate if all specified processors in a pipeline exist on all nodes in the cluster.	2016-03-07 14:44:50 +01:00
javanna	9c4a5bbe7e	adapt cluster stats api to node.client setting removal The cluster stats api now returns counts for each node role. The `master_data`, `master_only`, `data_only` and `client` fields have been removed from the response in favour of `master`, `data`, `ingest` and `coordinating_only`. The same node can have multiple roles, hence contribute to multiple roles counts. Every node is implicitly a coordinating node, so whenever a node has no explicit roles, it will be counted as coordinating only.	2016-03-05 10:55:19 +01:00
javanna	f786e9866c	adapt _cat/nodes to node.client removal _cat/nodes used to return `c` for client node or `d` for data node as part of the node.role column. This commit changes it to return `m` for master eligible, `d` for data and/or `i` for ingest. A node with no explicit roles will be a coordinating only node and marked with `-`. A node can obviously have multiple roles. The master column has been adapted to return only whether a node is the current master (`*`) or not (`-`).	2016-03-05 10:55:19 +01:00
Nik Everett	4d6cb34417	[reindex] Add ingest support	2016-03-04 10:05:13 -05:00
Clinton Gormley	30669f63e8	Document required settings when running the REST test suite	2016-03-04 13:50:40 +01:00
Simon Willnauer	5008694ba1	Remove support for legacy checksums Elasticsearch 5.0 doesn't support indices wiht legacy checksums anymore. The last time we write legacy checksums was in 1.3.0 which was based on lucene 4.9 already which means that all files have CRC32 checksums. All indices that Elasticsearch can read today must be written with lucene version >= 4.8 anyway so we can drop this layer of backwards compatibility entirely. Since we are close to upgrading to Lucene 6.0 we should get rid of this in a more contiained change than the lucene upgrade.	2016-03-03 22:58:18 +01:00
Adrien Grand	fc0cc4a6bb	Fix field_stats tests to use text/keyword instead of string.	2016-03-03 16:24:02 +01:00
Clinton Gormley	6b27de3f8c	Fixed REST test to not rely on dynamic mapping	2016-03-03 14:38:10 +01:00
Clinton Gormley	ce7fccb287	Fixed bad YAML in REST tests	2016-03-03 14:38:06 +01:00
Martijn van Groningen	75387001df	Added `ingest_took` to bulk response to indicate how much time was spent on ingest preprocessing. The `ingest_took` is separate from `took`, which keeps track how much time is spent on indexing/deleting/updating. The `ingest_took` is only visible in the rest response if at least for one bulk item has ingest enabled.	2016-03-01 18:24:26 +01:00
Nik Everett	c7c8bb357a	Merge pull request #16861 from nik9000/reindex_is_ready Reindex required some parsing changes for search requests to support differing defaults from the regular search api.	2016-03-01 10:02:48 -05:00
Spencer	3f80feb899	[REST_API_SPEC] remove invalid use of catch: param `catch: param` is designed to catch errors generated by client-side validation logic when users don't supply valid parameters to an API request. This test though is testing the server-side validation of pipeline aggregations, and so a "param" catch is invalid. Instead we will just test for a parse_exception error type using a regex.	2016-02-29 09:27:36 -07:00
Nik Everett	c38119bae9	Merge branch 'master' into feature/reindex	2016-02-26 16:59:54 -05:00
Igor Motov	d6af669776	Combine node name and task id into single string task id This commit changes the URL for task operations from `/_tasks/{nodeId}/{taskId}` to `/_tasks/{taskId}`, where `{taskId}` has a form of nodeid:id	2016-02-24 12:44:12 -08:00
Simon Willnauer	354aae2fec	Merge pull request #16770 from s1monw/http_on_cat Expose http address in cat/nodes and cat/nodeattrs APIs We expose a lot of information like IP address and port but never expose the http address/ip:port in the CAT API. It's nice to have it there too since otherwise json parsing is required to get this information	2016-02-22 14:20:33 -08:00
David Pilato	a0a6eff0d0	Fix test for [cat/recovery] Make recovery time a TimeValue() Related to #16743	2016-02-22 13:37:11 -08:00
Simon Willnauer	3c15200f6f	Expose http address in cat/nodes and cat/nodeattrs APIs We expose a lot of information like IP address and port but never expose the http address/ip:port in the CAT API. It's nice to have it there too since otherwise json parsing is required to get this information	2016-02-22 13:22:54 -08:00
Lee Hinman	99052c3fef	Limit the accepted length of the _id Elasticsearch should reject ids that are this long, to ensure a document always remains retrievable for clients that impose a maximum URI length Closes #16034	2016-02-22 12:34:18 -07:00
Spencer	31847c1e9d	[REST API] use a block literal for request bodies	2016-02-20 12:55:23 -08:00
Spencer	a859595dcd	[REST API] use a block literal for request bodies	2016-02-20 12:53:39 -08:00
Adrien Grand	4f8895eae3	Add a text field. This new field is intended to replace analyzed string fields.	2016-02-15 10:43:44 +01:00
Jason Tedor	3bbd1c129e	Remove host from cat nodes API As the host and ip fields are always equal by design, the host field in the cat nodes API is redundant and should be removed. Closes #16656	2016-02-14 09:21:32 -05:00
Nik Everett	821a20f582	Merge branch 'master' into feature/reindex	2016-02-11 17:41:05 -05:00
Nik Everett	18808b7576	Move reindex from a plugin to a module	2016-02-11 17:39:49 -05:00
Adrien Grand	bc47c577d2	Add a new `keyword` field. The `keyword` field is intended to replace `not_analyzed` string fields. It is indexed and has doc values by default, and doesn't support enabling term vectors. Although it doesn't support setting an analyzer for now, there are plans for it to support basic normalization in the future such as case folding.	2016-02-11 18:19:53 +01:00
Igor Motov	99a7d8e41f	Add task cancellation mechanism Only tasks that extend CancellableTask can be cancelled using this mechanism. If a cancellable task has children it can elect to cancel all child tasks as well. In this case a special ban parent request is sent to all nodes. This request does two things: 1) it prevents any tasks with the banned parent task from being started, and 2) it cancels all currently running tasks that have the banned task as a parent. The ban is lifted as soon as the coordinating node notifies all other nodes that the cancelled task has finished executing. If the coordinating node leaves the cluster before it has a chance to lift its bans, all bans set by this coordinating node are automatically removed. As an option a task can elect to automatically cancel all child tasks if their parent task was running on a node that just left the cluster. This option makes sense for cancellable heavy tasks that have no side-effects and only return results to the coordinating node. With the coordinating node gone, it doesn't make sense to run such tasks any longer since their results will be most likely discarded.	2016-02-09 22:30:57 -05:00
Yannick Welsch	0d11443aba	Fix filters and null parameters in _aliases command Closes #16549 Closes #16547	2016-02-09 21:43:42 +01:00
Andrej Kazakov	7f2b369dfd	Use Accept header field in cat API The cat API previously used the Content-Type header field for determining the media type of the response. This is in opposition to the HTTP spec which specifies the Accept header field for this purpose. This commit replaces the use of the Content-Type header field with the Accept header field in the cat API. Closes #14421	2016-02-05 06:28:39 -05:00
Martijn van Groningen	7a6adfd93a	ingest: Added foreach processor. This processor is useful when all elements of a json array need to be processed in the same way. This avoids that a processor needs to be defined for each element in an array. Also it is very likely that it is unknown how many elements are inside an json array.	2016-02-04 23:44:01 +01:00
Simon Willnauer	450ee70038	Remove DFS support from TermVector API Retrieving distributed DF for TermVectors is beside it's esotheric justification a very slow process and can cause serious load on the cluster. We also don't have nearly enough testing for this stuff and given the complexity we should remove it rather than carrying it around.	2016-02-04 16:20:24 +01:00
Yannick Welsch	4937531a17	Remove obsolete version in ShardRouting Closes #16243	2016-02-04 15:50:25 +01:00
Tal Levy	9e7e2ab10b	remove DeDotProcessor from Ingest	2016-02-02 14:16:01 -08:00
Tal Levy	3191fc7347	Merge pull request #16355 from talevy/fix_ingest_exception revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:11:24 -08:00
Tal Levy	0a1580eefa	revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:08:22 -08:00
Greg Marzouka	e7fc98a33f	Remove detect_noop from REST spec Unless this should be supported as a query string parameter instead, right now it only works when specified in the body.	2016-02-02 15:32:14 -05:00
Tal Levy	fca442f4d1	Introduce Pipeline Factory Error Responses in Node Ingest When there is an exception thrown during pipeline creation within Rest calls (in put pipeline, and simulate) We now return a structured error response to the user with details around which processor's configuration is the cause of the issue, or which configuration property is misconfigured, etc.	2016-01-29 13:37:27 -08:00
Jim Ferenczi	1343d6cbd1	Remove search_after from the query string param of the rest api spec. Handle null values in search_after. Ensure that the cluster is green after each index creation in the integ tests.	2016-01-27 19:21:01 +01:00
javanna	8006e5cd15	[TEST] re-enable and merge cluster settings REST tests We used to have a disabled test around cluster put settings as it left cluster settings behind without a way to remove them. That has been in fixed in the cluster put settings api, so the test can be re-enabled.	2016-01-27 17:37:42 +01:00
Jim Ferenczi	aea7660e37	Add search_after parameter in the Search API. The search_after parameter provides a way to efficiently paginate from one page to the next. This parameter accepts an array of sort values, those values are then used by the searcher to sort the top hits from the first document that is greater to the sort values. This parameter must be used in conjunction with the sort parameter, it must contain exactly the same number of values than the number of fields to sort on. NOTE: A field with one unique value per document should be used as the last element of the sort specification. Otherwise the sort order for documents that have the same sort values would be undefined. The recommended way is to use the field `_uuid` which is certain to contain one unique value for each document. Fixes #8192	2016-01-27 09:42:58 +01:00
Tal Levy	ff0e8272cb	[ingest] update test to verify that documents are deep-copied between verbose results	2016-01-26 14:12:42 -08:00
Martijn van Groningen	df0be87b18	Merge pull request #16049 from elastic/feature/ingest Merge feature/ingest branch into master branch. This adds the ingest feature to ES that allows to preprocess document before indexing on an ingest node. By default a node is an ingest node. Documents are preprocessed via a pipeline. A pipeline consists out of one or more processors Each processor makes one or more modifications to a document processed. There are many types of processors available out-of-the-box that are designed to make a specific change to a document being processed. In a cluster many pipeline can be configured via dedicated pipeline APIs. An new option on the bulk and index APIs allows to control what pipeline is picked for preprocessing. If no pipeline is specified then the ingest feature is skipped and no preprocessing takes place.	2016-01-26 13:41:13 +01:00
Martijn van Groningen	8b02f214c4	percolator: The percolate api shouldn't add field mappings for unmapped fields inside the document being percolated to the mapping. Closes #15751	2016-01-26 10:26:46 +01:00
javanna	36d98478bf	Merge branch 'master' into feature/ingest	2016-01-25 18:01:09 +01:00
Ryan Ernst	df24019261	Merge pull request #16038 from rjernst/remove_site_plugin Plugins: Remove site plugins	2016-01-21 12:32:22 -08:00
Tal Levy	3a6c2d008e	rename processor_tag to tag	2016-01-21 09:05:42 -08:00
Martijn van Groningen	602a0f183e	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-19 22:01:38 +01:00
Tal Levy	4ef85eda36	add default separator test to dedot rest test	2016-01-18 09:25:36 -08:00
Simon Willnauer	9562fb76bc	expose default settings via rest API	2016-01-18 12:48:47 +01:00
Simon Willnauer	13e5547537	Add REST tests for reset index settings and for listing defaults.	2016-01-18 10:02:37 +01:00
Simon Willnauer	dc05669fd9	replace unsupported setting translog.disable_flush with a high value of translog.flush_threshold_size	2016-01-18 09:23:35 +01:00
Ryan Ernst	3b78267c71	Plugins: Remove site plugins Site plugins used to be used for things like kibana and marvel, but there is no longer a need since kibana (and marvel as a kibana plugin) uses node.js. This change removes site plugins, as well as the flag for jvm plugins. Now all plugins are jvm plugins.	2016-01-16 22:45:37 -08:00
Tal Levy	9f48df9736	Add on_failure support for verbose _simulate execution and introduce optional processor_tag to Processors	2016-01-15 14:56:20 -08:00
Tal Levy	1754eece66	introduce DeDotProcessor fixes #15944.	2016-01-15 11:35:18 -08:00
javanna	9c06736dbd	Merge branch 'master' into feature/ingest	2016-01-15 10:11:56 +01:00
javanna	07a82d0c09	make get alias expand to open and closed indices by default This change affects get alias, get aliases as well as cat aliases. They all return closed indices too by default. get alias and get aliases also allow to return open indices only through the `expand_wildcards` option (set it to `open`). Closes #14982	2016-01-14 10:40:31 +01:00
Martijn van Groningen	f3883343cb	Move the pipeline configuration from the dedicated index to the cluster state. Closes #15842	2016-01-13 22:59:36 +01:00
javanna	ea8065aa3d	Merge branch 'master' into feature/ingest	2016-01-12 18:28:42 +01:00
Jason Tedor	1de2081ed3	Reintroduce five-minute and fifteen-minute load averages on Linux This commit reintroduces the five-minute and fifteen-minute load stats on Linux, and changes the format of the load_average field back to an array.	2016-01-11 23:42:47 -05:00
javanna	90743d8db0	add REST test for bulk api integration with ingest	2016-01-11 19:04:34 +01:00
javanna	ae69d46f92	move processors that have no deps to core, also move to core rest spec and tests and set node.inget to true by default	2016-01-08 10:39:39 +01:00
Adrien Grand	67d233cecd	Remove warmers and the warmer API. Warmers are now barely useful and will be removed in 3.0. Note that this only removes the warmer API and query-based warmers. We still have warmers internally for eg. global ordinals. Close #15607	2016-01-07 09:57:07 +01:00
Martijn van Groningen	2d6adf6428	Percolator refactoring: * Added percolator field mapper that extracts the query terms and indexes these terms with the percolator query. * At percolate time these extracted terms are used to query percolator queries that are like to be evaluated. This can significantly cut down the time it takes to percolate. Whereas before all percolator queries were evaluated if they matches with the document being percolated. * Changes made to percolator queries are no longer immediately visible, a refresh needs to happen before the changes are visible. * By default the percolate api only returns upto 10 matches instead of returning all matching percolator queries. * Made percolate more modular, so that it is easier to add unit tests. * Added unit tests for the percolator. Closes #12664 Closes #13646	2016-01-06 16:08:10 +01:00
Igor Motov	a89dba27c2	Task Management: Add framework for registering and communicating with tasks Adds task manager class and enables all activities to register with the task manager. Currently, the immutable Transport*Activity class represents activity itself shared across all requests. This PR adds and an additional structure Task that keeps track of currently running requests and can be used to communicate with these requests using TransportTaskAction. Related to #15117	2016-01-05 12:24:43 -05:00
Adrien Grand	6d3c9b074c	Remove support for the `multi_field` type. It is officially unsupported since version 1.0.	2015-12-30 12:03:15 +01:00
Lee Hinman	482843e27b	Fix build to run correctly on FreeBSD This adds the required changes/checks so that the build can run on FreeBSD. There are a few things that differ between FreeBSD and Linux: - CPU probes return -1 for CPU usage - `hot_threads` cannot be supported on FreeBSD From OpenJDK's `os_bsd.cpp`: ```c++ bool os::is_thread_cpu_time_supported() { #ifdef __APPLE__ return true; #else return false; #endif } ``` So this API now returns (for each FreeBSD node): ``` curl -s localhost:9200/_nodes/hot_threads ::: {Devil Hunter Gabriel}{q8OJnKCcQS6EB9fygU4R4g}{127.0.0.1}{127.0.0.1:9300} hot_threads is not supported on FreeBSD ``` - multicast fails in native `join` method - known bug: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=193246 Which causes: ``` 1> Caused by: java.net.SocketException: Invalid argument 1> at java.net.PlainDatagramSocketImpl.join(Native Method) 1> at java.net.AbstractPlainDatagramSocketImpl.join(AbstractPlainDatagramSocketImpl.java:179) 1> at java.net.MulticastSocket.joinGroup(MulticastSocket.java:323) 1> at org.elasticsearch.plugin.discovery.multicast.MulticastChannel$Plain.buildMulticastSocket(MulticastChannel.java:309) ``` So these tests are skipped on FreeBSD. Resolves #15562	2015-12-22 12:36:04 -07:00
Simon Willnauer	6ea266a89c	Merge branch 'master' into settings_prototype	2015-12-15 16:33:01 +01:00
Jun Ohtani	fab44398d9	Analysis: Add detail response support add explain option fix char_filter bug Closes #11076 #15257	2015-12-10 23:10:51 +09:00
Robert Muir	e454fadc22	Merge branch 'master' into shave_mustache	2015-12-10 07:58:24 -05:00
Yannick Welsch	bef0bedba9	Add support to _aliases endpoint to specify multiple indices and aliases in one action Closes #15305	2015-12-09 19:08:27 +01:00
Robert Muir	a6e1655fe9	fix integ tests	2015-12-09 00:30:32 -05:00
Jim Ferenczi	23aeaa88b2	Fixes random failures of org.apache.elasticsearch.test.rest.RestIT RestTable: ignores right padding for the last cell of a column.	2015-12-08 20:52:24 +01:00
Myll	73a3c326c9	_cat APIs: remove space at the end of a line Fixes #9464	2015-12-08 15:03:59 +01:00

1 2 3 4 5 ...

1117 Commits