OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	7908759949	Make aggregation registration more like query registration Creates a class to hold behavior common to these `ParseFieldRegistry`s. Also renames some weirdly named methods on StreamInput.	2016-04-12 14:50:44 -04:00
Igor Motov	81c59cae18	Add _cat/tasks Adds new _cat endpoint that lists all tasks	2016-04-07 09:28:21 -06:00
Jimmy Jones	f157dae053	Disallow unquoted field names, fix testcases using unquoted JSON	2016-04-06 14:37:15 -06:00
Spencer	7037670aeb	[REST API] set correct default value The correct default value for the `expand_wildcards` parameter to `indices.get_alias` is `all` as of all `f4d75f0212`	2016-04-05 09:19:21 -07:00
Mpdreamz	dd0c99bb65	index is a required url part	2016-04-04 14:14:44 +02:00
Mpdreamz	2944869ab9	Document task id's as string in the rest spec	2016-04-01 11:04:15 +02:00
Martijn Laarman	dfe2c0ff0a	Remove deprecated indices.get_aliases This has been deprecated since the first release of Elasticsearch 1.0	2016-03-31 14:46:53 +02:00
Igor Motov	e073b0c75d	Add ability to group tasks by common parent By default, tasks are grouped by node. However, task execution in elasticsearch can be quite complex and an individual task that runs on a coordinating node can have many subtasks running on other nodes in the cluster. This commit makes it possible to list task grouped by common parents instead of by node. When this option is enabled all subtask are grouped under the coordinating node task that started all subtasks in the group. To group tasks by common parents, use the following syntax: GET /tasks?group_by=parents	2016-03-30 17:50:27 -04:00
Nik Everett	78ab6c5b7f	[reindex] Dynamic throttle! This allows the user to update the reindex throttle on the fly, with changes that speed up the throttling being applied immediately and changes that slow down the throttling being applied during the next batch. This means that if a user throttles reindex in such a way that it tries to sleep for 16 years and then realizes that they've done something wrong then they can change the throttle and reindex will wake up again. We don't apply slow downs immediately so we never get in danger of losing the scan context. Also, if reindex is canceled while it is sleeping (how it honor throttling) then it'll immediately wake up and cancel itself.	2016-03-30 16:40:42 -04:00
Isabel Drost-Fromm	132f96b6ba	Merge pull request #17403 from elastic/docs/remove-rest-api-utils-reference Remove reference to utils for generating REST docs	2016-03-30 21:18:41 +02:00
Adrien Grand	068c788ec8	Disable fielddata on text fields by defaults. #17386 `text` fields will have fielddata disabled by default. Fielddata can still be enabled on an existing index by setting `fielddata=true` in the mappings.	2016-03-30 14:35:32 +02:00
Isabel Drost-Fromm	72d1ed65a4	Remove reference to utils for generating REST docs This removes the reference to a no longer existing utils directory that used to be there for generating docs and tests from Java source.	2016-03-30 13:36:51 +02:00
javanna	8fc9dbbb99	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 14:27:04 +02:00
Clinton Gormley	b87beeb05f	Rename update-by-query REST tests to update_by_query	2016-03-29 13:13:49 +02:00
Clinton Gormley	647437ce56	REST: The body is required in the reindex API	2016-03-29 11:45:20 +02:00
Clinton Gormley	97606850e8	Renamed update-by-query REST spec to update_by_query	2016-03-29 11:45:20 +02:00
javanna	de5cbda8e7	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 10:48:47 +02:00
Lee Hinman	80ab366de4	Add API to explain why a shard is or isn't assigned This adds a new `/_cluster/allocation/explain` API that explains why a shard can or cannot be allocated to nodes in the cluster. Additionally, it will show where the master desires to put the shard, according to the `ShardsAllocator`. It looks like this: ``` GET /_cluster/allocation/explain?pretty { "index": "only-foo", "shard": 0, "primary": false } ``` Though, you can optionally send an empty body, which means "explain the allocation for the first unassigned shard you find". The output when a shard is unassigned looks like this: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : false }, "assigned" : false, "unassigned_info" : { "reason" : "INDEX_CREATED", "at" : "2016-03-22T20:04:23.620Z" }, "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 0.06666675, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "NO", "weight" : -1.3833332, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 2.3166666, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` And when the shard is assigned, the output looks like: ``` { "shard" : { "index" : "only-foo", "index_uuid" : "KnW0-zELRs6PK84l0r38ZA", "id" : 0, "primary" : true }, "assigned" : true, "assigned_node_id" : "Qc6VL8c5RWaw1qXZ0Rg57g", "nodes" : { "V-Spi0AyRZ6ZvKbaI3691w" : { "node_name" : "Susan Storm", "node_attributes" : { "bar" : "baz" }, "final_decision" : "NO", "weight" : 1.4499999, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] }, "Qc6VL8c5RWaw1qXZ0Rg57g" : { "node_name" : "Slipstream", "node_attributes" : { "bar" : "baz", "foo" : "bar" }, "final_decision" : "CURRENTLY_ASSIGNED", "weight" : 0.0, "decisions" : [ { "decider" : "same_shard", "decision" : "NO", "explanation" : "the shard cannot be allocated on the same node id [Qc6VL8c5RWaw1qXZ0Rg57g] on which it already exists" } ] }, "PzdyMZGXQdGhqTJHF_hGgA" : { "node_name" : "The Symbiote", "node_attributes" : { }, "final_decision" : "NO", "weight" : 3.6999998, "decisions" : [ { "decider" : "filter", "decision" : "NO", "explanation" : "node does not match index include filters [foo:\"bar\"]" } ] } } } ``` Only "NO" decisions are returned by default, but all decisions can be shown by specifying the `?include_yes_decisions=true` parameter in the request. Resolves #14593	2016-03-28 15:21:02 -06:00
Nik Everett	0e6141e675	Replace is_true: took with took >= 0 This prevents tests from failing on machines that can finish the request less than half a millisecond.	2016-03-28 13:03:48 -04:00
javanna	a685148268	[TEST] expand REST tests to check for roles in nodes info, nodes stats and tasks list response	2016-03-25 22:53:21 +01:00
javanna	a9f4982c40	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-25 20:16:40 +01:00
Clinton Gormley	30d78f4be0	In cat.snapshots, repository is required Closes #17216	2016-03-25 14:23:52 +01:00
javanna	27d4994aff	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-24 18:10:11 +01:00
Areek Zillur	e16e113691	Remove suggest threadpool In #17198, we removed suggest transport action, which used the `suggest` threadpool to execute requests. Now `suggest` threadpool is unused and suggest requests are executed on the `search` threadpool.	2016-03-23 18:01:45 -04:00
Areek Zillur	91dd9b3301	Merge suggest stats into search stats	2016-03-23 16:37:56 -04:00
Honza Král	b139f4e0bf	[TEST] Move yaml test requiring yaml, add skip:yaml Clients don't ship with yaml (de)serializer by default so this test must be optionally skipped	2016-03-23 14:50:23 +01:00
javanna	030453d320	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-23 11:25:34 +01:00
Honza Král	f8e84f0bbb	[TEST] fix incorrect indent in ingest/70_bulk.yaml	2016-03-22 20:53:23 +01:00
Honza Král	ca4b8667bb	[TEST] Move yaml test requiring header, add skip:headers	2016-03-22 20:53:23 +01:00
Nik Everett	da96b6e41d	[reindex] Add thottling support The throttle is applied when starting the next scroll request so that its timeout can include the throttle time.	2016-03-22 12:34:14 -04:00
javanna	eebd0cfccd	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-22 10:34:40 +01:00
Simon Willnauer	7f16a1d9a7	Improve upgrade experience of node level index settings In 5.0 we don't allow index settings to be specified on the node level ie. in yaml files or via commandline argument. This can cause problems during upgrade if this was used extensively. For instance if analyzers where specified on a node level this might cause the index to be closed when imported (see #17187). In such a case all indices relying on this must be updated via `PUT /${index}/_settings`. Yet, this API has slightly different semantics since it overrides existing settings. To make this less painful this change adds a `preserve_existing` parameter on that API to ensure we have the same semantics as if the setting was applied on the node level. This change also adds a better error message and a change to the migration guide to ensure upgrades are smooth if index settings are specified on the node level. If a index setting is detected this change fails the node startup and prints a message like this: ``` *********************************************************************************** Found index level settings on node level configuration. Since elasticsearch 5.x index level settings can NOT be set on the nodes configuration like the elasticsearch.yaml, in system properties or command line arguments.In order to upgrade all indices the settings must be updated via the /${index}/_settings API. Unless all settings are dynamic all indices must be closed in order to apply the upgradeIndices created in the future should use index templates to set default values. Please ensure all required values are updated on all indices by executing: curl -XPUT 'http://localhost:9200/_all/_settings?preserve_existing=true' -d '{ "index.number_of_shards" : "1", "index.query.default_field" : "main_field", "index.translog.durability" : "async", "index.ttl.disable_purge" : "true" }' *********************************************************************************** ```	2016-03-21 20:12:18 +01:00
javanna	4077a2c9e1	adapt cluster stats REST tests after merge	2016-03-21 18:24:12 +01:00
javanna	bf390a935e	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-21 17:18:23 +01:00
Martijn van Groningen	e3b7e5d75a	percolator: Replace percolate api with the new percolator query Also replaced the PercolatorQueryRegistry with the new PercolatorQueryCache. The PercolatorFieldMapper stores the rewritten form of each percolator query's xcontext in a binary doc values field. This make sure that the query rewrite happens only during indexing (some queries for example fetch shapes, terms in remote indices) and the speed up the loading of the queries in the percolator query cache. Because the percolator now works inside the search infrastructure a number of features (sorting fields, pagination, fetch features) are available out of the box. The following feature requests are automatically implemented via this refactoring: Closes #10741 Closes #7297 Closes #13176 Closes #13978 Closes #11264 Closes #10741 Closes #4317	2016-03-21 12:21:50 +01:00
Clinton Gormley	0543d46c1d	Fixed regex in cat.recovery REST tes The time column should accept integer ms or floating point seconds	2016-03-16 17:22:00 +01:00
Simon Willnauer	121e7c8ca4	Add infrastructure to run REST tests on a multi-version cluster This change adds the infrastructure to run the rest tests on a multi-node cluster that users 2 different minor versions of elasticsearch. It doesn't implement any dedicated BWC tests but rather leverages the existing REST tests. Since we don't have a real version to test against, the tests uses the current version until the first minor / RC is released to ensure the infrastructure works. Relates to #14406 Closes #17072	2016-03-13 10:52:39 +01:00
Jason Tedor	f465d98eb3	Add raw recovery progress to cat recovery API This commit adds fields bytes_recovered and files_recovered to the cat recovery API. These fields, respectively, indicate the total number of bytes and files recovered. Additionally, for consistency, some totals fields and translog recovery fields have been renamed. Closes #17064	2016-03-11 08:27:09 -05:00
Nik Everett	b8d931d23c	[reindex] Timeout if sub-requests timeout Sadly, it isn't easy to simulate a timeout during an integration test, you just have to cause one. Groovy's sleep should do the job.	2016-03-10 13:05:23 -05:00
Martijn van Groningen	0bbb84c19a	test: 'Test bulk request with default pipeline' may get run first and then the total ingest count for pipeline1 is 2.	2016-03-10 15:18:08 +01:00
Martijn van Groningen	2fa33d5c47	Added ingest statistics to node stats API The ingest stats include the following statistics: * `ingest.total.count`- The total number of document ingested during the lifetime of this node * `ingest.total.time_in_millis` - The total time spent on ingest preprocessing documents during the lifetime of this node * `ingest.total.current` - The total number of documents currently being ingested. * `ingest.total.failed` - The total number ingest preprocessing operations failed during the lifetime of this node Also these stats are returned on a per pipeline basis.	2016-03-10 13:21:43 +01:00
Nik Everett	6d0efae713	Teach list tasks api to wait for tasks to finish _wait_for_completion defaults to false. If set to true then the API will wait for all the tasks that it finds to stop running before returning. You can use the timeout parameter to prevent it from waiting forever. If you don't set a timeout parameter it'll default to 30 seconds. Also adds a log message to rest tests if any tasks overrun the test. This is just a log (instead of failing the test) because lots of tasks are run by the cluster on its own and they shouldn't cause the test to fail. Things like fetching disk usage from the other nodes, for example. Switches the request to getter/setter style methods as we're going that way in the Elasticsearch code base. Reindex is all getter/setter style. Closes #16906	2016-03-08 11:53:57 -05:00
Jun Ohtani	071d578953	Analysis : Allow string explain param in JSON Move some test methods from AnalylzeActionIT to RestAnalyzeActionTest Allow string explain param if it can parse Fix wrong param name in rest-api-spec Closes #16925	2016-03-08 16:19:02 +09:00
Martijn van Groningen	82d01e4315	Added ingest info to node info API, which contains a list of available processors. Internally the put pipeline API uses this information in node info API to validate if all specified processors in a pipeline exist on all nodes in the cluster.	2016-03-07 14:44:50 +01:00
javanna	9c4a5bbe7e	adapt cluster stats api to node.client setting removal The cluster stats api now returns counts for each node role. The `master_data`, `master_only`, `data_only` and `client` fields have been removed from the response in favour of `master`, `data`, `ingest` and `coordinating_only`. The same node can have multiple roles, hence contribute to multiple roles counts. Every node is implicitly a coordinating node, so whenever a node has no explicit roles, it will be counted as coordinating only.	2016-03-05 10:55:19 +01:00
javanna	f786e9866c	adapt _cat/nodes to node.client removal _cat/nodes used to return `c` for client node or `d` for data node as part of the node.role column. This commit changes it to return `m` for master eligible, `d` for data and/or `i` for ingest. A node with no explicit roles will be a coordinating only node and marked with `-`. A node can obviously have multiple roles. The master column has been adapted to return only whether a node is the current master (`*`) or not (`-`).	2016-03-05 10:55:19 +01:00
Nik Everett	4d6cb34417	[reindex] Add ingest support	2016-03-04 10:05:13 -05:00
Clinton Gormley	30669f63e8	Document required settings when running the REST test suite	2016-03-04 13:50:40 +01:00
Simon Willnauer	5008694ba1	Remove support for legacy checksums Elasticsearch 5.0 doesn't support indices wiht legacy checksums anymore. The last time we write legacy checksums was in 1.3.0 which was based on lucene 4.9 already which means that all files have CRC32 checksums. All indices that Elasticsearch can read today must be written with lucene version >= 4.8 anyway so we can drop this layer of backwards compatibility entirely. Since we are close to upgrading to Lucene 6.0 we should get rid of this in a more contiained change than the lucene upgrade.	2016-03-03 22:58:18 +01:00
Adrien Grand	fc0cc4a6bb	Fix field_stats tests to use text/keyword instead of string.	2016-03-03 16:24:02 +01:00
Clinton Gormley	6b27de3f8c	Fixed REST test to not rely on dynamic mapping	2016-03-03 14:38:10 +01:00
Clinton Gormley	ce7fccb287	Fixed bad YAML in REST tests	2016-03-03 14:38:06 +01:00
Martijn van Groningen	75387001df	Added `ingest_took` to bulk response to indicate how much time was spent on ingest preprocessing. The `ingest_took` is separate from `took`, which keeps track how much time is spent on indexing/deleting/updating. The `ingest_took` is only visible in the rest response if at least for one bulk item has ingest enabled.	2016-03-01 18:24:26 +01:00
Nik Everett	c7c8bb357a	Merge pull request #16861 from nik9000/reindex_is_ready Reindex required some parsing changes for search requests to support differing defaults from the regular search api.	2016-03-01 10:02:48 -05:00
Spencer	3f80feb899	[REST_API_SPEC] remove invalid use of catch: param `catch: param` is designed to catch errors generated by client-side validation logic when users don't supply valid parameters to an API request. This test though is testing the server-side validation of pipeline aggregations, and so a "param" catch is invalid. Instead we will just test for a parse_exception error type using a regex.	2016-02-29 09:27:36 -07:00
Nik Everett	c38119bae9	Merge branch 'master' into feature/reindex	2016-02-26 16:59:54 -05:00
Igor Motov	d6af669776	Combine node name and task id into single string task id This commit changes the URL for task operations from `/_tasks/{nodeId}/{taskId}` to `/_tasks/{taskId}`, where `{taskId}` has a form of nodeid:id	2016-02-24 12:44:12 -08:00
Simon Willnauer	354aae2fec	Merge pull request #16770 from s1monw/http_on_cat Expose http address in cat/nodes and cat/nodeattrs APIs We expose a lot of information like IP address and port but never expose the http address/ip:port in the CAT API. It's nice to have it there too since otherwise json parsing is required to get this information	2016-02-22 14:20:33 -08:00
David Pilato	a0a6eff0d0	Fix test for [cat/recovery] Make recovery time a TimeValue() Related to #16743	2016-02-22 13:37:11 -08:00
Simon Willnauer	3c15200f6f	Expose http address in cat/nodes and cat/nodeattrs APIs We expose a lot of information like IP address and port but never expose the http address/ip:port in the CAT API. It's nice to have it there too since otherwise json parsing is required to get this information	2016-02-22 13:22:54 -08:00
Lee Hinman	99052c3fef	Limit the accepted length of the _id Elasticsearch should reject ids that are this long, to ensure a document always remains retrievable for clients that impose a maximum URI length Closes #16034	2016-02-22 12:34:18 -07:00
Spencer	31847c1e9d	[REST API] use a block literal for request bodies	2016-02-20 12:55:23 -08:00
Spencer	a859595dcd	[REST API] use a block literal for request bodies	2016-02-20 12:53:39 -08:00
Adrien Grand	4f8895eae3	Add a text field. This new field is intended to replace analyzed string fields.	2016-02-15 10:43:44 +01:00
Jason Tedor	3bbd1c129e	Remove host from cat nodes API As the host and ip fields are always equal by design, the host field in the cat nodes API is redundant and should be removed. Closes #16656	2016-02-14 09:21:32 -05:00
Nik Everett	821a20f582	Merge branch 'master' into feature/reindex	2016-02-11 17:41:05 -05:00
Nik Everett	18808b7576	Move reindex from a plugin to a module	2016-02-11 17:39:49 -05:00
Adrien Grand	bc47c577d2	Add a new `keyword` field. The `keyword` field is intended to replace `not_analyzed` string fields. It is indexed and has doc values by default, and doesn't support enabling term vectors. Although it doesn't support setting an analyzer for now, there are plans for it to support basic normalization in the future such as case folding.	2016-02-11 18:19:53 +01:00
Igor Motov	99a7d8e41f	Add task cancellation mechanism Only tasks that extend CancellableTask can be cancelled using this mechanism. If a cancellable task has children it can elect to cancel all child tasks as well. In this case a special ban parent request is sent to all nodes. This request does two things: 1) it prevents any tasks with the banned parent task from being started, and 2) it cancels all currently running tasks that have the banned task as a parent. The ban is lifted as soon as the coordinating node notifies all other nodes that the cancelled task has finished executing. If the coordinating node leaves the cluster before it has a chance to lift its bans, all bans set by this coordinating node are automatically removed. As an option a task can elect to automatically cancel all child tasks if their parent task was running on a node that just left the cluster. This option makes sense for cancellable heavy tasks that have no side-effects and only return results to the coordinating node. With the coordinating node gone, it doesn't make sense to run such tasks any longer since their results will be most likely discarded.	2016-02-09 22:30:57 -05:00
Yannick Welsch	0d11443aba	Fix filters and null parameters in _aliases command Closes #16549 Closes #16547	2016-02-09 21:43:42 +01:00
Andrej Kazakov	7f2b369dfd	Use Accept header field in cat API The cat API previously used the Content-Type header field for determining the media type of the response. This is in opposition to the HTTP spec which specifies the Accept header field for this purpose. This commit replaces the use of the Content-Type header field with the Accept header field in the cat API. Closes #14421	2016-02-05 06:28:39 -05:00
Martijn van Groningen	7a6adfd93a	ingest: Added foreach processor. This processor is useful when all elements of a json array need to be processed in the same way. This avoids that a processor needs to be defined for each element in an array. Also it is very likely that it is unknown how many elements are inside an json array.	2016-02-04 23:44:01 +01:00
Simon Willnauer	450ee70038	Remove DFS support from TermVector API Retrieving distributed DF for TermVectors is beside it's esotheric justification a very slow process and can cause serious load on the cluster. We also don't have nearly enough testing for this stuff and given the complexity we should remove it rather than carrying it around.	2016-02-04 16:20:24 +01:00
Yannick Welsch	4937531a17	Remove obsolete version in ShardRouting Closes #16243	2016-02-04 15:50:25 +01:00
Tal Levy	9e7e2ab10b	remove DeDotProcessor from Ingest	2016-02-02 14:16:01 -08:00
Tal Levy	3191fc7347	Merge pull request #16355 from talevy/fix_ingest_exception revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:11:24 -08:00
Tal Levy	0a1580eefa	revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:08:22 -08:00
Greg Marzouka	e7fc98a33f	Remove detect_noop from REST spec Unless this should be supported as a query string parameter instead, right now it only works when specified in the body.	2016-02-02 15:32:14 -05:00
Tal Levy	fca442f4d1	Introduce Pipeline Factory Error Responses in Node Ingest When there is an exception thrown during pipeline creation within Rest calls (in put pipeline, and simulate) We now return a structured error response to the user with details around which processor's configuration is the cause of the issue, or which configuration property is misconfigured, etc.	2016-01-29 13:37:27 -08:00
Jim Ferenczi	1343d6cbd1	Remove search_after from the query string param of the rest api spec. Handle null values in search_after. Ensure that the cluster is green after each index creation in the integ tests.	2016-01-27 19:21:01 +01:00
javanna	8006e5cd15	[TEST] re-enable and merge cluster settings REST tests We used to have a disabled test around cluster put settings as it left cluster settings behind without a way to remove them. That has been in fixed in the cluster put settings api, so the test can be re-enabled.	2016-01-27 17:37:42 +01:00
Jim Ferenczi	aea7660e37	Add search_after parameter in the Search API. The search_after parameter provides a way to efficiently paginate from one page to the next. This parameter accepts an array of sort values, those values are then used by the searcher to sort the top hits from the first document that is greater to the sort values. This parameter must be used in conjunction with the sort parameter, it must contain exactly the same number of values than the number of fields to sort on. NOTE: A field with one unique value per document should be used as the last element of the sort specification. Otherwise the sort order for documents that have the same sort values would be undefined. The recommended way is to use the field `_uuid` which is certain to contain one unique value for each document. Fixes #8192	2016-01-27 09:42:58 +01:00
Tal Levy	ff0e8272cb	[ingest] update test to verify that documents are deep-copied between verbose results	2016-01-26 14:12:42 -08:00
Martijn van Groningen	df0be87b18	Merge pull request #16049 from elastic/feature/ingest Merge feature/ingest branch into master branch. This adds the ingest feature to ES that allows to preprocess document before indexing on an ingest node. By default a node is an ingest node. Documents are preprocessed via a pipeline. A pipeline consists out of one or more processors Each processor makes one or more modifications to a document processed. There are many types of processors available out-of-the-box that are designed to make a specific change to a document being processed. In a cluster many pipeline can be configured via dedicated pipeline APIs. An new option on the bulk and index APIs allows to control what pipeline is picked for preprocessing. If no pipeline is specified then the ingest feature is skipped and no preprocessing takes place.	2016-01-26 13:41:13 +01:00
Martijn van Groningen	8b02f214c4	percolator: The percolate api shouldn't add field mappings for unmapped fields inside the document being percolated to the mapping. Closes #15751	2016-01-26 10:26:46 +01:00
javanna	36d98478bf	Merge branch 'master' into feature/ingest	2016-01-25 18:01:09 +01:00
Ryan Ernst	df24019261	Merge pull request #16038 from rjernst/remove_site_plugin Plugins: Remove site plugins	2016-01-21 12:32:22 -08:00
Tal Levy	3a6c2d008e	rename processor_tag to tag	2016-01-21 09:05:42 -08:00
Martijn van Groningen	602a0f183e	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-19 22:01:38 +01:00
Tal Levy	4ef85eda36	add default separator test to dedot rest test	2016-01-18 09:25:36 -08:00
Simon Willnauer	9562fb76bc	expose default settings via rest API	2016-01-18 12:48:47 +01:00
Simon Willnauer	13e5547537	Add REST tests for reset index settings and for listing defaults.	2016-01-18 10:02:37 +01:00
Simon Willnauer	dc05669fd9	replace unsupported setting translog.disable_flush with a high value of translog.flush_threshold_size	2016-01-18 09:23:35 +01:00
Ryan Ernst	3b78267c71	Plugins: Remove site plugins Site plugins used to be used for things like kibana and marvel, but there is no longer a need since kibana (and marvel as a kibana plugin) uses node.js. This change removes site plugins, as well as the flag for jvm plugins. Now all plugins are jvm plugins.	2016-01-16 22:45:37 -08:00
Tal Levy	9f48df9736	Add on_failure support for verbose _simulate execution and introduce optional processor_tag to Processors	2016-01-15 14:56:20 -08:00
Tal Levy	1754eece66	introduce DeDotProcessor fixes #15944.	2016-01-15 11:35:18 -08:00
javanna	9c06736dbd	Merge branch 'master' into feature/ingest	2016-01-15 10:11:56 +01:00
javanna	07a82d0c09	make get alias expand to open and closed indices by default This change affects get alias, get aliases as well as cat aliases. They all return closed indices too by default. get alias and get aliases also allow to return open indices only through the `expand_wildcards` option (set it to `open`). Closes #14982	2016-01-14 10:40:31 +01:00
Martijn van Groningen	f3883343cb	Move the pipeline configuration from the dedicated index to the cluster state. Closes #15842	2016-01-13 22:59:36 +01:00
javanna	ea8065aa3d	Merge branch 'master' into feature/ingest	2016-01-12 18:28:42 +01:00
Jason Tedor	1de2081ed3	Reintroduce five-minute and fifteen-minute load averages on Linux This commit reintroduces the five-minute and fifteen-minute load stats on Linux, and changes the format of the load_average field back to an array.	2016-01-11 23:42:47 -05:00
javanna	90743d8db0	add REST test for bulk api integration with ingest	2016-01-11 19:04:34 +01:00
javanna	ae69d46f92	move processors that have no deps to core, also move to core rest spec and tests and set node.inget to true by default	2016-01-08 10:39:39 +01:00
Adrien Grand	67d233cecd	Remove warmers and the warmer API. Warmers are now barely useful and will be removed in 3.0. Note that this only removes the warmer API and query-based warmers. We still have warmers internally for eg. global ordinals. Close #15607	2016-01-07 09:57:07 +01:00
Martijn van Groningen	2d6adf6428	Percolator refactoring: * Added percolator field mapper that extracts the query terms and indexes these terms with the percolator query. * At percolate time these extracted terms are used to query percolator queries that are like to be evaluated. This can significantly cut down the time it takes to percolate. Whereas before all percolator queries were evaluated if they matches with the document being percolated. * Changes made to percolator queries are no longer immediately visible, a refresh needs to happen before the changes are visible. * By default the percolate api only returns upto 10 matches instead of returning all matching percolator queries. * Made percolate more modular, so that it is easier to add unit tests. * Added unit tests for the percolator. Closes #12664 Closes #13646	2016-01-06 16:08:10 +01:00
Igor Motov	a89dba27c2	Task Management: Add framework for registering and communicating with tasks Adds task manager class and enables all activities to register with the task manager. Currently, the immutable Transport*Activity class represents activity itself shared across all requests. This PR adds and an additional structure Task that keeps track of currently running requests and can be used to communicate with these requests using TransportTaskAction. Related to #15117	2016-01-05 12:24:43 -05:00
Adrien Grand	6d3c9b074c	Remove support for the `multi_field` type. It is officially unsupported since version 1.0.	2015-12-30 12:03:15 +01:00
Lee Hinman	482843e27b	Fix build to run correctly on FreeBSD This adds the required changes/checks so that the build can run on FreeBSD. There are a few things that differ between FreeBSD and Linux: - CPU probes return -1 for CPU usage - `hot_threads` cannot be supported on FreeBSD From OpenJDK's `os_bsd.cpp`: ```c++ bool os::is_thread_cpu_time_supported() { #ifdef __APPLE__ return true; #else return false; #endif } ``` So this API now returns (for each FreeBSD node): ``` curl -s localhost:9200/_nodes/hot_threads ::: {Devil Hunter Gabriel}{q8OJnKCcQS6EB9fygU4R4g}{127.0.0.1}{127.0.0.1:9300} hot_threads is not supported on FreeBSD ``` - multicast fails in native `join` method - known bug: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=193246 Which causes: ``` 1> Caused by: java.net.SocketException: Invalid argument 1> at java.net.PlainDatagramSocketImpl.join(Native Method) 1> at java.net.AbstractPlainDatagramSocketImpl.join(AbstractPlainDatagramSocketImpl.java:179) 1> at java.net.MulticastSocket.joinGroup(MulticastSocket.java:323) 1> at org.elasticsearch.plugin.discovery.multicast.MulticastChannel$Plain.buildMulticastSocket(MulticastChannel.java:309) ``` So these tests are skipped on FreeBSD. Resolves #15562	2015-12-22 12:36:04 -07:00
Simon Willnauer	6ea266a89c	Merge branch 'master' into settings_prototype	2015-12-15 16:33:01 +01:00
Jun Ohtani	fab44398d9	Analysis: Add detail response support add explain option fix char_filter bug Closes #11076 #15257	2015-12-10 23:10:51 +09:00
Robert Muir	e454fadc22	Merge branch 'master' into shave_mustache	2015-12-10 07:58:24 -05:00
Yannick Welsch	bef0bedba9	Add support to _aliases endpoint to specify multiple indices and aliases in one action Closes #15305	2015-12-09 19:08:27 +01:00
Robert Muir	a6e1655fe9	fix integ tests	2015-12-09 00:30:32 -05:00
Jim Ferenczi	23aeaa88b2	Fixes random failures of org.apache.elasticsearch.test.rest.RestIT RestTable: ignores right padding for the last cell of a column.	2015-12-08 20:52:24 +01:00
Myll	73a3c326c9	_cat APIs: remove space at the end of a line Fixes #9464	2015-12-08 15:03:59 +01:00
Simon Willnauer	8502926327	fuck you linefeed	2015-12-08 14:39:16 +01:00
Simon Willnauer	2e27ee393f	add rest API to reset settings	2015-12-08 14:39:16 +01:00
Lee Hinman	f709b7283f	Remove `GET` option for /_forcemerge POST should be used to indicate this is not just a retrieval operation. Resolves #15165	2015-12-03 13:56:15 -07:00
Jim Ferenczi	e182072b6f	Merge pull request #15017 from jimferenczi/fields_option Refuse to load fields from _source when using the `fields` option and support wildcards.	2015-11-30 11:01:21 +01:00
Jim Ferenczi	731833cfc6	Fixes #14489 Do not to load fields from _source when using the `fields` option. Non stored (non existing) fields are ignored by the fields visitor when using the `fields` option. Fixes #10783 Support * wildcard to retrieve stored fields when using the `fields` option. Supported pattern styles are "xxx", "xxx", "xxx" and "xxx*yyy".	2015-11-30 11:00:32 +01:00
Clinton Gormley	27dac8dc2c	REST spec: Added the verbose flag to indices.segments Relates to #9111	2015-11-30 07:41:29 +01:00
Jayson Minard	815c53e6b4	body attribute was at wrong nesting level	2015-11-26 14:34:02 -03:00
Lee Hinman	a25b407aeb	Add support for headers in REST tests This adds support for arbitrary headers sent with each REST request, it will allow us to test things like different xcontent-encoding (see 50_with_headers.yaml for what this looks like). Headers are specified at the same level as `catch`, so a request would look like: ```yaml - do: headers: Content-Type: application/yaml get: index: test_1 type: _all id: 1 ```	2015-11-24 08:25:02 -07:00
Martijn van Groningen	48771f1a76	field stats: Added `min_value_as_string` and `max_value_as_string` response elements for all number based fields. The existing `min_value` and `max_value` will return the values as numbers instead. Closes #14404	2015-11-23 08:48:28 +01:00
Xu Zhang	2e6d72de27	Catch exception when reading corrupted snapshot. Single corrupted snapshot file shouldn't prevent listing all other snapshot in repository.	2015-11-18 21:43:46 -08:00
Jason Tedor	185027a0ff	Update REST tests to reflect changes to cat nodes default response This commit updates the cat nodes REST test to include the CPU percent that was recently added to the default output of the cat nodes response.	2015-11-17 14:51:03 -05:00
Jason Tedor	95c4846e58	Fix race condition in cat shards test This commit fixes a test bug in the cat shards REST test. In particular, there was a race condition in the test that would cause the test to sometimes fail. The race condition is that some of the shards would go to state STARTED after the sync flush was issued. These shards would (correctly) show up in the output as having state started but without a sync_id. However, the expected output was written to only look for shards that have state STARTED and a sync_id, or shards that are still INITIALIZING or are UNASSIGNED and (of course) do not have a sync_id. The best approach here is to just simplify the test.	2015-11-13 12:16:22 -05:00
Jason Tedor	99abb76c78	Fix cat shards test bug	2015-11-13 09:31:44 -05:00
Jason Tedor	a9ab35a487	Add sync_id to cat shards API This commit adds the ability to get the sync_id from the cat shards API. Closes #14705	2015-11-13 05:13:08 -05:00
Areek Zillur	dd1c687ace	Completion Suggester V2 The completion suggester provides auto-complete/search-as-you-type functionality. This is a navigational feature to guide users to relevant results as they are typing, improving search precision. It is not meant for spell correction or did-you-mean functionality like the term or phrase suggesters. The completions are indexed as a weighted FST (finite state transducer) to provide fast Top N prefix-based searches suitable for serving relevant results as a user types. closes #10746	2015-11-07 17:46:27 -05:00
Yannick Welsch	825d0c64e6	Add duration field to /_cat/snapshots Closes #14385	2015-11-04 10:34:00 +01:00
javanna	b56bbf62dd	Validate query api: move query parsing on the coordinating node Similarly to what we did with the search api, we can now also move query parsing on the coordinating node for the validate query api. Given that the explain api is a single shard operation (compared to search which is instead a broadcast operation), this doesn't change a lot in how the api works internally. The main benefit is that we can simplify the java api by requiring a structured query object to be provided rather than a bytes array that will get parsed on the data node. Previously if you specified a QueryBuilder it would be serialized in json format and would get reparsed on the data node, while now it doesn't go through parsing anymore (as expected), given that after the query-refactoring we are able to properly stream queries natively. Note that the WrapperQueryBuilder can be used from the java api to provide a query as a string, in that case the actual parsing of the inner query will happen on the data node. Relates to #10217 Closes #14384	2015-11-02 11:21:20 +01:00
Ryan Ernst	542522531a	Build: Remove maven pom files and supporting ant files This change removes the leftover pom files. A couple files were left for reference, namely in qa tests that have not yet been migrated (vagrant and multinode). The deb and rpm assemblies also still exist for reference when finishing their setup in gradle. See #13930	2015-10-29 23:53:49 -07:00
Lee Hinman	3b5058017e	Merge branch 'remove-optimize-rest'	2015-10-29 15:18:03 -06:00
Ryan Ernst	c86100f636	Switch build system to Gradle See #13930	2015-10-29 11:40:19 -07:00
xuzha	97ecd7bf5a	Expose pending cluster state queue size in node stats Add 3 stats about the queue: total queue size, number of committed cluster states, and number of pending cluster states.	2015-10-28 10:59:15 -07:00
Lee Hinman	3a458af0b7	Remove /_optimize REST API endpoint The `/_optimize` endpoint was deprecated in 2.1.0 and can now be removed entirely.	2015-10-27 10:17:16 -06:00
javanna	dc900a08a6	Remove "query" query and fix related parsing bugs We have two types of parse methods for queries: one for the inner query, to be used once the parser is positioned within the query element, and one for the whole query source, including the query element that wraps the actual query. With the search refactoring we ended up using the former in count, cat count and delete by query, whereas we should have used the former. It ends up working properly given that we have a registered (deprecated) query called "query", which used to allow to wrap a filter into a query, but this has the following downsides: 1) prevents us from removing the deprecated "query" query 2) we end up supporting a top level query that is not wrapped within a query element (pre 1.0 syntax iirc that shouldn't be supported anymore) This commit finally removes the "query" query and fixes the related parsing bugs. We also had some tests that were providing queries in the wrong format, those have been fixed too. Closes #13326 Closes #14304	2015-10-27 14:54:30 +01:00
Yannick Welsch	5959058719	Unique repository names for rest tests	2015-10-26 21:02:25 +01:00
Yannick Welsch	2f10300a03	Merge pull request #14247 from ywelsch/feature/cat-snapshots Add cat API for repositories and snapshots	2015-10-26 18:40:39 +01:00
Yannick Welsch	ca75b7b6ce	Add cat API for repositories and snapshots Closes #14247 Closes #13919	2015-10-26 18:37:10 +01:00
javanna	75cedca0da	Remove search exists api Closes #13682 Closes #13911	2015-10-21 17:39:32 +02:00
Lee Hinman	9ea4909035	Add Force Merge API, deprecate Optimize API This adds an API for force merging lucene segments. The `/_optimize` API is now deprecated and replaced by the `/_forcemerge` API, which has all the same flags and action, just a different name.	2015-10-20 09:00:24 -06:00
Spencer	ff9999876c	[REST tests] prevent backtracking in cat.nodeattrs	2015-10-19 09:36:19 -07:00
Colin Goodheart-Smithe	4557d1b560	Merge branch 'master' into feature/search-request-refactoring # Conflicts: # plugins/lang-groovy/src/test/java/org/elasticsearch/messy/tests/FunctionScoreTests.java	2015-10-08 11:39:34 +01:00
Igor Motov	a358f34276	Expose nodes operation timeout in REST API Currently it's not possible to specify a timeout for nodes operations (such as node info, node stats, cluster stats and hot threads) via REST-based APIs.	2015-10-07 18:07:59 -04:00
Robert	331d2d9955	Update update.json Missing spec for [detect_noop](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-update.html?q=update%20a#_literal_detect_noop_literal) parameter.	2015-10-07 16:13:31 -04:00
Boaz Leskes	bcb3fab6ac	Engine: Remove Engine.Create The `_create` API is handy way to specify an index operation should only be done if the document doesn't exist. This is currently implemented in explicit code paths all the way down to the engine. However, conceptually this is no different than any other versioned operation - instead of requiring a document is on a specific version, we require it to be deleted (or non-existent). This PR removes Engine.Create in favor of a slight extension in the VersionType logic. There are however a couple of side effects: - DocumentAlreadyExistsException is removed and VersionConflictException is used instead (with an improved error message) - Update will reject version parameters if the upsert option is used (it doesn't compute anyway). - Translog.Create is also removed infavor of Translog.Index (that's OK because their binary format was the same, so we can just read Translog.Index of the translog file) Closes #13955	2015-10-07 12:37:34 +02:00
Greg Marzouka	9200c446a1	Merge pull request #13946 from elastic/fix/indices-get-restspec Add options for indices.get feature	2015-10-06 14:28:11 -04:00
javanna	1915c74e93	Merge branch 'master' into feature/search-request-refactoring	2015-10-06 16:21:58 +02:00

1 2 3 4 5 ...

1082 Commits