OpenSearch

Commit Graph

Author	SHA1	Message	Date
Enrico Zimuel	a12be619f6	Fix URL documentation in API specs (#44487 )	2019-07-24 10:33:15 -07:00
James Rodewig	d46545f729	[DOCS] Update anchors and links for Elasticsearch API relocation (#44500 )	2019-07-19 09:18:23 -04:00
Przemyslaw Gomulka	e23ecc5838	JSON logging refactoring and X-Opaque-ID support backport(#41354 ) (#44178 ) This is a refactor to current JSON logging to make it more open for extensions and support for custom ES log messages used inDeprecationLogger IndexingSlowLog , SearchSLowLog We want to include x-opaque-id in deprecation logs. The easiest way to have this as an additional JSON field instead of part of the message is to create a custom DeprecatedMessage (extends ESLogMEssage) These messages are regular log4j messages with a text, but also carry a map of fields which can then populate the log pattern. The logic for this lives in ESJsonLayout and ESMessageFieldConverter. Similar approach can be used to refactor IndexingSlowLog and SearchSlowLog JSON logs to contain fields previously only present as escaped JSON string in a message field. closes #41350 backport #41354	2019-07-12 16:53:27 +02:00
Tanguy Leroux	b977f019b8	Expose translog stats in ReadOnlyEngine (#43752 ) (#43823 ) Backport of #43752 for 7.x.	2019-07-02 13:39:00 +02:00
Zachary Tong	1e47ea5f18	Update rare_term version skips, fix SetBackedScalingCuckooFilter javadoc	2019-07-01 10:52:06 -04:00
Zachary Tong	ea1794832f	Add RareTerms aggregation (#35718 ) This adds a `rare_terms` aggregation. It is an aggregation designed to identify the long-tail of keywords, e.g. terms that are "rare" or have low doc counts. This aggregation is designed to be more memory efficient than the alternative, which is setting a terms aggregation to size: LONG_MAX (or worse, ordering a terms agg by count ascending, which has unbounded error). This aggregation works by maintaining a map of terms that have been seen. A counter associated with each value is incremented when we see the term again. If the counter surpasses a predefined threshold, the term is removed from the map and inserted into a cuckoo filter. If a future term is found in the cuckoo filter we assume it was previously removed from the map and is "common". The map keys are the "rare" terms after collection is done.	2019-07-01 10:30:02 -04:00
Alan Woodward	81dbcfb268	Wildcard intervals (#43691 ) This commit adds a wildcard intervals source, similar to the prefix. It also changes the term parameter in prefix to read prefix, to bring it in to line with the pattern parameter in wildcard. Closes #43198	2019-06-28 14:04:03 +01:00
Alan Woodward	76d0edd1a4	Add prefix intervals source (#43635 ) This commit adds a prefix intervals source, allowing you to search for intervals that contain terms starting with a given prefix. The source can make use of the index_prefixes mapping option. Relates to #43198	2019-06-26 16:22:12 +01:00
Yu	c88f2f23a5	Make Recovery API support `detailed` params (#29076 ) Properly forwards the `detailed` parameter to show the recovery stats details. Closes #28910	2019-06-21 09:05:33 +02:00
Martijn Laarman	8b1b9f8ab9	Introduce stability description to the REST API specification (#38413 ) (#43278 ) * introduce state to the REST API specification * change state over to stability * CCR is no GA updated to stable * SQL is now GA so marked as stable * Introduce `internal` as state for API's, marks stable in terms of lifetime but unstable in terms of guarantees on its output format since it exposes internal representations * make setting a wrong stability value, or not setting it at all an error that causes the YAML test suite to fail * update spec files to be explicit about their stability state * Document the fact that stability needs to be defined Otherwise the YAML test runner will fail (with a nice exception message) * address check style violations * update rest spec unit tests to include stability * found one more test spec file not declaring stability, made sure stability appears after documentation everywhere * cluster.state is stable, mark response in some way to denote its a key value format that can be changed during minors * mark data frame API's as beta * remove internal and private as states for an API * removed the wrong enum values in the Stability Enum in the previous commit (cherry picked from commit 61c34bbd92f8f7e5f22fa411c6b682b0ebd8a99d)	2019-06-17 16:57:13 +02:00
Ryan Ernst	5be0fb32f8	Move painless context api spec to test local (#43122 ) The painless context api is internal and currently meant only for use in generating docs. This commit moves the spec file for the api so that it is only used by the test for this api, and not externally by any clients building from the public rest spec.	2019-06-12 08:19:45 -07:00
Luca Cavanna	e538592652	Update max_concurrent_shard_request parameter docs (#42227 ) Some of the docs were outdated as they did not mention that the limit is not per node. Also, The default value changed. Relates to #31206	2019-06-12 11:25:03 +02:00
Martijn Laarman	cb7ce865b7	remove path from rest-api-spec (#41452 ) (#43084 ) (cherry picked from commit f5fde1d0843d2f0f53d3b9a15b9cfc8b94471ab7)	2019-06-11 12:52:36 +02:00
Alan Woodward	8e23e4518a	Move construction of custom analyzers into AnalysisRegistry (#42940 ) Both TransportAnalyzeAction and CategorizationAnalyzer have logic to build custom analyzers for index-independent analysis. A lot of this code is duplicated, and it requires the AnalysisRegistry to expose a number of internal provider classes, as well as making some assumptions about when analysis components are constructed. This commit moves the build logic directly into AnalysisRegistry, reducing the registry's API surface considerably.	2019-06-10 14:33:25 +01:00
Henning Andersen	dea935ac31	Reindex max_docs parameter name (#42942 ) Previously, a reindex request had two different size specifications in the body: * Outer level, determining the maximum documents to process * Inside the source element, determining the scroll/batch size. The outer level size has now been renamed to max_docs to avoid confusion and clarify its semantics, with backwards compatibility and deprecation warnings for using size. Similarly, the size parameter has been renamed to max_docs for update/delete-by-query to keep the 3 interfaces consistent. Finally, all 3 endpoints now support max_docs in both body and URL. Relates #24344	2019-06-07 12:16:36 +02:00
Gordon Brown	6eb4600e93	Add custom metadata to snapshots (#41281 ) Adds a metadata field to snapshots which can be used to store arbitrary key-value information. This may be useful for attaching a description of why a snapshot was taken, tagging snapshots to make categorization easier, or identifying the source of automatically-created snapshots.	2019-06-05 17:30:31 -06:00
Przemyslaw Gomulka	0ce7e7a4d8	Enable tests failing due to java-joda warnings (#42693 ) Tests were failing in mixed cluster after more broad warnings were introduced in 6.x These tests were using `yyyy-MM-dd` pattern which is now warning about the change of `y` to `u`. However, using predefined pattern `strict_date` which uses the same format prevents the warning from being generate and allow smooth upgrade/work in mixed cluster. relates #42679	2019-05-31 09:42:32 +02:00
James Baiera	41208b7041	Muting prefilter shard tests as they are breaking in BWC testing See #42679	2019-05-29 14:12:38 -04:00
Tanguy Leroux	6bec876682	Improve Close Index Response (#39687 ) This changes the `CloseIndexResponse` so that it reports closing result for each index. Shard failures or exception are also reported per index, and the global acknowledgment flag is computed from the index results only. The response looks like: ``` { "acknowledged" : true, "shards_acknowledged" : true, "indices" : { "docs" : { "closed" : true } } } ``` The response reports shard failures like: ``` { "acknowledged" : false, "shards_acknowledged" : false, "indices" : { "docs-1" : { "closed" : true }, "docs-2" : { "closed" : false, "shards" : { "1" : { "failures" : [ { "shard" : 1, "index" : "docs-2", "status" : "BAD_REQUEST", "reason" : { "type" : "index_closed_exception", "reason" : "closed", "index_uuid" : "JFmQwr_aSPiZbkAH_KEF7A", "index" : "docs-2" } } ] } } }, "docs-3" : { "closed" : true } } } ``` Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2019-05-24 21:57:55 -04:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (e.g. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
Zachary Tong	072a9bdf55	Fix FiltersAggregation NPE when `filters` is empty (#41459 ) If `keyedFilters` is null it assumes there are unkeyed filters...which will NPE if the unkeyed filters was actually empty. This refactors to simplify the filter assignment a bit, adds an empty check and tidies up some formatting.	2019-05-20 10:04:21 -04:00
Tomas Della Vedova	4e9bf3f18a	Remove deprecated _source_exclude and _source_include from get API spec (#42188 ) Support for these parameters was removed in #35097. The spec were left outdated.	2019-05-20 12:40:45 +02:00
Russ Cam	8f838198fa	Remove parent query string parameter (#41098 ) This commit removes the deprecated parent query string parameter. The routing parameter should be used instead.	2019-05-20 12:08:02 +02:00
jaymode	7c6d7997db	Fix skip version in indices open test	2019-05-01 15:18:26 -06:00
Jason Tedor	7f3ab4524f	Bump 7.x branch to version 7.2.0 This commit adds the 7.2.0 version constant to the 7.x branch, and bumps BWC logic accordingly.	2019-05-01 13:38:57 -04:00
Jim Ferenczi	6184efaff6	Handle unmapped fields in _field_caps API (#34071 ) (#41426 ) Today the `_field_caps` API returns the list of indices where a field is present only if this field has different types within the requested indices. However if the request is an index pattern (or an alias, or both...) there is no way to infer the indices if the response contains only fields that have the same type in all indices. This commit changes the response to always return the list of indices in the response. It also adds a way to retrieve unmapped field in a specific section per field called `unmapped`. This section is created for each field that is present in some indices but not all if the parameter `include_unmapped` is set to true in the request (defaults to false).	2019-04-25 18:13:48 +02:00
Zachary Tong	ec5dd0594f	Disallow null/empty or duplicate composite sources (#41359 ) Adds some validation to prevent duplicate source names from being used in the composite agg. Also refactored to use a ConstructingObjectParser and removed the private ctor and setter for sources, making it mandatory.	2019-04-24 13:23:31 -04:00
Martijn Laarman	85b9dc18a7	fix #35262 define deprecations of API's as a whole and urls (#39063 ) * fix #35262 define deprecations of API's as a whole and urls * document hot threads deprecated paths * deprecate scroll_id as part of the URL, documented only as part of the body which is a safer behaviour as well * use version numbers up to patch version * rest spec parser picks up deprecated paths as paths too (cherry picked from commit 7e06023e7603b7584bfd9ee4e8a1ccd82c208ce7)	2019-04-23 14:28:36 +02:00
Jim Ferenczi	8f73e1e883	Fix unmapped field handling in the composite aggregation (#41280 ) The `composite` aggregation maps unknown fields as numerics, this means that any `after` value that is set on a query with an unmapped field on some indices will fail if the provided value is not numeric. This commit changes the default value source to use keyword instead in order to be able to parse any type of after values.	2019-04-18 23:08:13 +02:00
Zachary Tong	f19b052e03	Better error messages when pipelines reference incompatible aggs (#40068 ) Pipelines require single-valued agg or a numeric to be returned. If they don't get that, they throw an exception. Unfortunately, this exception text is very confusing to users because it usually arises from pathing "through" multiple terms aggs. The final target is a numeric, but it's the intermediary aggs that cause the problem. This commit adds the current agg name to the exception message so the user knows which "level" is the issue.	2019-04-15 10:35:53 -04:00
Nik Everett	c379206c1e	Fix some documentation urls in rest-api-spec (#40618 ) (#41145 ) Fixes some documentation urls in the rest-api-spec. Some of these URLs pointed to 404s and a few others pointed to deprecated documentation when we have better documentation now. I'm not consistent about `master` vs `current` because we're not consistent in other places and I think we should solve all of those at once with something a little more automatic.	2019-04-12 10:11:14 -04:00
Nhat Nguyen	9c36ab4ab4	Adjust bwc version for flush parameter validation Relates to #40213	2019-04-11 18:02:48 -04:00
Jason Tedor	24446ceae0	Add packaging to cluster stats response (#41048 ) This commit adds a packaging_types field to the cluster stats response that outlines the build flavors and types present in a cluster.	2019-04-10 13:47:19 -04:00
Adrien Grand	7c27e5f243	Revert "Mute failing test" This reverts commit `1af2b2bfe6`.	2019-04-04 15:55:09 +02:00
Nhat Nguyen	2756a3936b	Reject illegal flush parameters (#40213 ) This change rejects an illegal combination of flush parameters where force is true, but wait_if_ongoing is false. This combination is trappy and should be forbidden. Closes #36342	2019-04-04 09:02:31 -04:00
Alpar Torok	1af2b2bfe6	Mute failing test Tracked in #40838	2019-04-04 14:54:21 +03:00
Adrien Grand	670e76669c	Fix alias resolution runtime complexity. (#40263 ) (#40788 ) A user reported that the same query that takes ~900ms when querying an index pattern only takes ~50ms when only querying indices that have matches. The query is a date range query and we confirmed that the `can_match` phase works as expected. I was able to reproduce this issue locally with a single node: with 900 1-shard indices, a query to an index pattern that matches all indices runs in ~90ms while a query to the only index that has matches runs in 0-1ms. This ended up not being related to the `can_match` phase but to the cost of resolving aliases when querying an index pattern that matches lots of indices. In that case, we first resolve the index pattern to a list of concrete indices and then for each concrete index, we check whether it was matched through an alias, meaning we might have to apply alias filters. Unfortunately this second per-index operation runs in linear time with the number of matched concrete indices, which means that alias resolution runs in O(num_indices^2) overall. So queries get exponentially slower as an index pattern matches more indices. I reorganized alias resolution into a one-step operation that runs in linear time with the number of matches indices, and then a per-index operation that runs in linear time with the number of aliases of this index. This makes alias resolution run is O(num_indices * num_aliases_per_index) overall instead. When testing the scenario described above, the `took` went down from ~90ms to ~10ms. It is still more than the 0-1ms latency that one gets when only querying the single index that has data, but still much better than what we had before. Closes #40248	2019-04-04 11:40:42 +02:00
David Turner	5a2ba34174	Get node ID from nodes info in REST tests (#40052 ) (#40532 ) We discussed recently that the cluster state API should be considered "internal" and therefore our usual cast-iron stability guarantees do not hold for this API. However, there are a good number of REST tests that try to identify the master node. Today they call `GET /_cluster/state` API and extract the master node ID from the response. In fact many of these tests just want an arbitary node ID (or perhaps a data node ID) so an alternative is to call `GET _nodes` or `GET _nodes/data:true` and obtain a node ID from the keys of the `nodes` map in the response. This change adds the ability for YAML-based REST tests to extract an arbitrary key from a map so that they can obtain a node ID from the nodes info API instead of using the master node ID from the cluster state API. Relates #40047.	2019-03-27 23:08:10 +00:00
Andy Bristol	23395a9b9f	search as you type fieldmapper (#35600 ) Adds the search_as_you_type field type that acts like a text field optimized for as-you-type search completion. It creates a couple subfields that analyze the indexed terms as shingles, against which full terms are queried, and a prefix subfield that analyze terms as the largest shingle size used and edge-ngrams, against which partial terms are queried Adds a match_bool_prefix query type that creates a boolean clause of a term query for each term except the last, for which a boolean clause with a prefix query is created. The match_bool_prefix query is the recommended way of querying a search as you type field, which will boil down to term queries for each shingle of the input text on the appropriate shingle field, and the final (possibly partial) term as a term query on the prefix field. This field type also supports phrase and phrase prefix queries however	2019-03-27 13:29:13 -07:00
Nhat Nguyen	b9f96a8e1f	Expose external refreshes through the stats API (#38643 ) Right now, the stats API only provides refresh metrics regarding internal refreshes. This isn't very useful and somewhat misleading for cluster administrators since the internal refreshes are not indicative of documents being available for search. In this PR I added a new metric for collecting external refreshes as they occur and exposing them through the stats API. Now, calling an endpoint for stats will yield external refresh metrics as well. Relates #36712	2019-03-24 22:21:00 -04:00
Mayya Sharipova	49a7c6e0e8	Expose proximity boosting (#39385 ) (#40251 ) Expose DistanceFeatureQuery for geo, date and date_nanos types Closes #33382	2019-03-20 09:24:41 -04:00
Simon Willnauer	235f57989f	Return cached segments stats if `include_unloaded_segments` is true (#39698 ) Today we don't return segments stats for closed indices which makes it hard to tell how much memory such an index would require. With this change we return the statistics if requested by setting `include_unloaded_segments` to true on the rest request. Relates to #39512	2019-03-20 12:08:41 +01:00
Jason Tedor	86d1d03c37	Remove cluster state size (#40109 ) This commit removes the cluster state size field from the cluster state response, and drops the backwards compatibility layer added in 6.7.0 to continue to support this field. As calculation of this field was expensive and had dubious value, we have elected to remove this field.	2019-03-15 17:16:25 -04:00
Jack Conradson	b57af6c401	Add a Painless Context REST API (#39382 ) This PR adds an internal REST API for querying context information about Painless whitelists. Commands include the following: GET /_scripts/painless/_context -- retrieves a list of contexts GET /_scripts/painless/_context?context=%name% retrieves all available information about the API for this specific context	2019-03-14 12:42:12 -07:00
Jason Tedor	24973cf464	Adjust BWC version on cluster state size response This work has been backported all the way now, so this commit adjusts the BWC version. Relates #40016	2019-03-14 09:42:12 -04:00
Jason Tedor	9181668edf	Stop returning cluster state size by default (#40016 ) Computing the compressed size of the cluster state on every invocation of cluster:monitor/state action is expensive, and the value of this field is dubious anyway. Therefore we want to remove computing this field. As a first step, we stop computing and return this field by default. To avoid breaking users, we will give them a system property to use to tide them over until the next major release when we will actually remove this field. This comes with a deprecation warning too, and the backport to the appropriate minor will also include a note in the migration guide. There will be a follow-up to remove this field in the next major version.	2019-03-14 08:57:55 -04:00
Ioannis Kakavas	87ec511684	Mute locale dependent mapping tests (#39996 )	2019-03-13 17:07:20 +02:00
Tal Levy	6c52da54c8	fix index refresh in test within 20_mix_typeless_typeful (#39198 ) (#39804 ) the test "Implicitly create a typeless ... typed template" fails occasionally because the index operation hasn't propogated to update the index mapping in time for the following assertion about a dynamically mapped field "bar". error failed with: ``` field [test-1.mappings.my_type.properties.bar] doesn't have a true value Expected: not null but: was null ``` refreshing the index should resolve this timing issue.	2019-03-08 12:15:32 -08:00
Martijn Laarman	af4e740500	Document scroll param on reindex.json (#38615 ) The Reindex API also exposes `scroll` as a querystring parameter.	2019-03-07 18:14:29 +01:00
Jim Ferenczi	160dc29f0e	Handle total hits equal to track_total_hits (#37907 ) This change ensures that a total hits equal to the value set for track_total_hits is not considered as a lower bound.	2019-03-05 16:28:48 +01:00
Simon Willnauer	d112c89041	Allow inclusion of unloaded segments in stats (#39512 ) Today we have no chance to fetch actual segment stats for segments that are currently unloaded. This is relevant in the case of frozen indices. This allows to monitor how much memory a frozen index would use if it was unfrozen.	2019-03-05 14:02:20 +01:00
Tanguy Leroux	e005eeb0b3	Backport support for replicating closed indices to 7.x (#39506 )(#39499 ) Backport support for replicating closed indices (#39499) Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888	2019-03-01 14:48:26 +01:00
Tal Levy	0b676f07f6	mute failing test in 20_mix_typeless_typeful awaits fix in #39198	2019-02-20 16:12:07 -08:00
Adrien Grand	c28b6fb9b6	Reenable test in `indices.put_mapping/20_mix_typeless_typeful.yml`. (#39056 ) (#39057 ) This test had been disabled because of test failures, but it only affected the 6.x branch. The fix for 6.x is at #39054. On master/7.x/7.0 we can reenable the test as-is.	2019-02-20 11:33:50 +01:00
Lee Hinman	41ac6f9c55	Revert "Mute failing test 20_mix_typless_typefull (#38781 )" (#38912 ) (#39141 ) Backport of #38912 This reverts commit b91e0589fe1efdaa5061a75a3674a5cc8706b703. This should be fixed by #38873 Resolves #38711	2019-02-19 16:14:31 -07:00
Alan Woodward	ab4d5f404f	Add overlapping, before, after filters to intervals query (#38999 ) Lucene recently added `overlapping`, `before` and `after` filters to the intervals package. This commit exposes them in elasticsearch.	2019-02-18 15:06:24 +00:00
Alpar Torok	12eac6ad4b	Mute failing test (#38781 ) Tracking #38711	2019-02-12 15:57:57 +02:00
Tanguy Leroux	510829f9f7	TransportVerifyShardBeforeCloseAction should force a flush (#38401 ) This commit changes the `TransportVerifyShardBeforeCloseAction` so that it always forces the flush of the shard. It seems that #37961 is not sufficient to ensure that the translog and the Lucene commit share the exact same max seq no and global checkpoint information in case of one or more noop operations have been made. The `BulkWithUpdatesIT.testThatMissingIndexDoesNotAbortFullBulkRequest` and `FrozenIndexTests.testFreezeEmptyIndexWithTranslogOps` test this trivial situation and they both fail 1 on 10 executions. Relates to #33888	2019-02-06 13:22:54 +01:00
Boaz Leskes	033ba725af	Remove support for internal versioning for concurrency control (#38254 ) Elasticsearch has long [supported](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html#index-versioning) compare and set (a.k.a optimistic concurrency control) operations using internal document versioning. Sadly that approach is flawed and can sometime do the wrong thing. Here's the relevant excerpt from the resiliency status page: > When a primary has been partitioned away from the cluster there is a short period of time until it detects this. During that time it will continue indexing writes locally, thereby updating document versions. When it tries to replicate the operation, however, it will discover that it is partitioned away. It won’t acknowledge the write and will wait until the partition is resolved to negotiate with the master on how to proceed. The master will decide to either fail any replicas which failed to index the operations on the primary or tell the primary that it has to step down because a new primary has been chosen in the meantime. Since the old primary has already written documents, clients may already have read from the old primary before it shuts itself down. The version numbers of these reads may not be unique if the new primary has already accepted writes for the same document We recently [introduced](https://www.elastic.co/guide/en/elasticsearch/reference/6.x/optimistic-concurrency-control.html) a new sequence number based approach that doesn't suffer from this dirty reads problem. This commit removes support for internal versioning as a concurrency control mechanism in favor of the sequence number approach. Relates to #1078	2019-02-05 20:53:35 +01:00
Julie Tibshirani	3ce7d2c9b6	Make sure to reject mappings with type _doc when include_type_name is false. (#38270 ) `CreateIndexRequest#source(Map<String, Object>, ... )`, which is used when deserializing index creation requests, accidentally accepts mappings that are nested twice under the type key (as described in the bug report #38266). This in turn causes us to be too lenient in parsing typeless mappings. In particular, we accept the following index creation request, even though it should not contain the type key `_doc`: ``` PUT index?include_type_name=false { "mappings": { "_doc": { "properties": { ... } } } } ``` There is a similar issue for both 'put templates' and 'put mappings' requests as well. This PR makes the minimal changes to detect and reject these typed mappings in requests. It does not address #38266 generally, or attempt a larger refactor around types in these server-side requests, as I think this should be done at a later time.	2019-02-05 10:52:32 -08:00
Tal Levy	ae47c025e2	add basic REST test for geohash_grid (#37996 )	2019-02-05 09:44:47 -08:00
Boaz Leskes	12657fda44	`if_seq_no` and `if_primary_term` parameters aren't wired correctly in REST Client's CRUD API (#38411 )	2019-02-05 18:05:56 +01:00
Yogesh Gaikwad	fe36861ada	Add support for API keys to access Elasticsearch (#38291 ) X-Pack security supports built-in authentication service `token-service` that allows access tokens to be used to access Elasticsearch without using Basic authentication. The tokens are generated by `token-service` based on OAuth2 spec. The access token is a short-lived token (defaults to 20m) and refresh token with a lifetime of 24 hours, making them unsuitable for long-lived or recurring tasks where the system might go offline thereby failing refresh of tokens. This commit introduces a built-in authentication service `api-key-service` that adds support for long-lived tokens aka API keys to access Elasticsearch. The `api-key-service` is consulted after `token-service` in the authentication chain. By default, if TLS is enabled then `api-key-service` is also enabled. The service can be disabled using the configuration setting. The API keys:- - by default do not have an expiration but expiration can be configured where the API keys need to be expired after a certain amount of time. - when generated will keep authentication information of the user that generated them. - can be defined with a role describing the privileges for accessing Elasticsearch and will be limited by the role of the user that generated them - can be invalidated via invalidation API - information can be retrieved via a get API - that have been expired or invalidated will be retained for 1 week before being deleted. The expired API keys remover task handles this. Following are the API key management APIs:- 1. Create API Key - `PUT/POST /_security/api_key` 2. Get API key(s) - `GET /_security/api_key` 3. Invalidate API Key(s) `DELETE /_security/api_key` The API keys can be used to access Elasticsearch using `Authorization` header, where the auth scheme is `ApiKey` and the credentials, is the base64 encoding of API key Id and API key separated by a colon. Example:- ``` curl -H "Authorization: ApiKey YXBpLWtleS1pZDphcGkta2V5" http://localhost:9200/_cluster/health ``` Closes #34383	2019-02-05 14:21:57 +11:00
Mayya Sharipova	641704464d	Deprecate types in rollover index API (#38039 ) Relates to #35190	2019-02-04 16:07:45 -05:00
Alexander Reelsen	87f3579125	Add nanosecond field mapper (#37755 ) This adds a dedicated field mapper that supports nanosecond resolution - at the price of a reduced date range. When using the date field mapper, the time is stored as milliseconds since the epoch in a long in lucene. This field mapper stores the time in nanoseconds since the epoch - which means its range is much smaller, ranging roughly from 1970 to 2262. Note that aggregations will still be in milliseconds. However docvalue fields will have full nanosecond resolution Relates #27330	2019-02-04 11:31:16 +01:00
Boaz Leskes	f6e06a2b19	Adapt minimum versions for seq# powered operations in Watch related requests and UpdateRequest (#38231 ) After backporting #37977, #37857 and #37872	2019-02-01 20:37:16 -05:00
Julie Tibshirani	c2e9d13ebd	Default include_type_name to false in the yml test harness. (#38058 ) This PR removes the temporary change we made to the yml test harness in #37285 to automatically set `include_type_name` to `true` in index creation requests if it's not already specified. This is possible now that the vast majority of index creation requests were updated to be typeless in #37611. A few additional tests also needed updating here. Additionally, this PR updates the test harness to set `include_type_name` to `false` in index creation requests when communicating with 6.x nodes. This mirrors the logic added in #37611 to allow for typeless document write requests in test set-up code. With this update in place, we can remove many references to `include_type_name: false` from the yml tests.	2019-02-01 11:44:13 -08:00
Nhat Nguyen	f5f3cb8f4c	AwaitsFix PUT mapping with _doc on an index that has types (#38204 ) Tracked at #38202	2019-02-01 12:00:43 -05:00
Adrien Grand	2229e7231e	Enable bw tests for #37871 and #38032 . (#38167 ) Mixed-version clusters tests had been disabled initially since they wouldn't work until the functionality would be backported.	2019-02-01 13:55:51 +01:00
Jim Ferenczi	b7308aa03c	Don't load global ordinals with the `map` execution_hint (#37833 ) The terms aggregator loads the global ordinals to retrieve the cardinality of the field to aggregate on. This information is then used to select the strategy to use for the aggregation (breadth_first or depth_first). However this should be avoided if the execution_hint is explicitly set to map since this mode doesn't really need the global ordinals. Since we still need the cardinality of the field this change picks the maximum cardinality in the segments as an estimation of the total cardinality to select the strategy to use (breadth_first or depth_first). This estimation is only used if the execution hint is set to map, otherwise the global ordinals are still used to retrieve the accurate cardinality. Closes #37705	2019-02-01 09:35:46 +01:00
Yuri Astrakhan	f3cde06a1d	geotile_grid implementation (#37842 ) Implements `geotile_grid` aggregation This patch refactors previous implementation https://github.com/elastic/elasticsearch/pull/30240 This code uses the same base classes as `geohash_grid` agg, but uses a different hashing algorithm to allow zoom consistency. Each grid bucket is aligned to Web Mercator tiles.	2019-01-31 19:11:30 -05:00
Luca Cavanna	622fb7883b	Introduce ability to minimize round-trips in CCS (#37828 ) With #37566 we have introduced the ability to merge multiple search responses into one. That makes it possible to expose a new way of executing cross-cluster search requests, that makes CCS much faster whenever there is network latency between the CCS coordinating node and the remote clusters. The coordinating node can now send a single search request to each remote cluster, which gets reduced by each one of them. from + size results are requested to each cluster, and the reduce phase in each cluster is non final (meaning that buckets are not pruned and pipeline aggs are not executed). The CCS coordinating node performs an additional, final reduction, which produces one search response out of the multiple responses received from the different clusters. This new execution path will be activated by default for any CCS request unless a scroll is provided or inner hits are requested as part of field collapsing. The search API accepts now a new parameter called ccs_minimize_roundtrips that allows to opt-out of the default behaviour. Relates to #32125	2019-01-31 15:12:14 +01:00
Adrien Grand	a536fa7755	Treat put-mapping calls with `_doc` as a top-level key as typed calls. (#38032 ) Currently the put-mapping API assumes that because the type name is `_doc` then it is dealing with a typeless put-mapping call. Yet we still allow running the put-mapping API in a typed fashion with `_doc` as a type name. The current logic triggers surprising errors when doing a typed put-mapping call with `_doc` as a type name on an index that has a type already. This is a bit of a corner-case, but is more important on 6.x due to the fact that using the index API with `_doc` as a type name triggers typed calls to the put-mapping API with `_doc` as a type name.	2019-01-31 13:57:42 +01:00
Adrien Grand	3c439d3b92	Fix test bug when testing the merging of mappings and templates. (#38021 ) This test performs a typed index call when it actually means to run a typeless index call.	2019-01-31 13:52:09 +01:00
David Turner	81c443c9de	Deprecate minimum_master_nodes (#37868 ) Today we pass `discovery.zen.minimum_master_nodes` to nodes started up in tests, but for 7.x nodes this setting is not required as it has no effect. This commit removes this setting so that nodes are started with more realistic configurations, and deprecates it.	2019-01-30 20:09:15 +00:00
Colin Goodheart-Smithe	21e392e95e	Removes typed calls from YAML REST tests (#37611 ) This PR attempts to remove all typed calls from our YAML REST tests. The PR adds include_type_name: false to create index requests that use a mapping and also to put mapping requests. It also removes _type from index requests where they haven't already been removed. The PR ignores tests named *_with_types.yml since this are specifically testing typed API behaviour. The change also includes changing the test harness to add the type _doc to index, update, get and bulk requests that do not specify the document type when the test is running against a mixed 7.x/6.x cluster.	2019-01-30 16:32:58 +00:00
Adrien Grand	c8af0f4bfa	Use mappings to format doc-value fields by default. (#30831 ) Doc-value fields now return a value that is based on the mappings rather than the script implementation by default. This deprecates the special `use_field_mapping` docvalue format which was added in #29639 only to ease the transition to 7.x and it is not necessary anymore in 7.0.	2019-01-30 10:31:51 +01:00
Adrien Grand	b63b50b945	Give precedence to index creation when mixing typed templates with typeless index creation and vice-versa. (#37871 ) Currently if you mix typed templates and typeless index creation or typeless templates and typed index creation then you will end up with an error because Elasticsearch tries to create an index that has multiple types: `_doc` and the explicit type name that you used. This commit proposes to give precedence to the index creation call so that the type from the template will be ignored if the index creation call is typeless while the template is typed, and the type from the index creation call will be used if there is a typeless template. This is consistent with the fact that index creation already "wins" if a field is defined differently in the index creation call and in a template: the definition from the index creation call is used in such cases. Closes #37773	2019-01-30 10:28:24 +01:00
Albert Zaharovits	d05a4b9d14	Get Aliases with wildcard exclusion expression (#34230 ) This commit adds the code in the HTTP layer that will parse exclusion wildcard expressions. The existing code issues 404s for wildcards as well as explicit indices. But, in general, in an expression with exclude wildcards (-...*) following other include wildcards, there is no way to tell if the include wildcard produced no results or they were subsequently excluded. Therefore, the proposed change is breaking the behavior of 404s for wildcards. Specifically, no 404s will be returned for wildcards, even if they are not followed by exclude wildcards or the exclude wildcards could not possibly exclude what has previously been included. Only explicitly requested aliases will be called out as missing.	2019-01-29 18:56:20 +02:00
Boaz Leskes	65a9b61a91	Add Seq# based optimistic concurrency control to UpdateRequest (#37872 ) The update request has a lesser known support for a one off update of a known document version. This PR adds an a seq# based alternative to power these operations. Relates #36148 Relates #10708	2019-01-29 09:18:05 -05:00
Jim Ferenczi	cb451edb01	Allow nested fields in the composite aggregation (#37178 ) This changes adds the support to handle `nested` fields in the `composite` aggregation. A `nested` aggregation can be used as parent of a `composite` aggregation in order to target `nested` fields in the `sources`. Closes #28611	2019-01-25 14:00:39 +01:00
Boaz Leskes	9d87ca567a	300_sequence_numbers should not rely on 7.0 total hits structure	2019-01-24 23:41:10 +01:00
Boaz Leskes	af2f4c8f73	enable bwc tests and bump versions after backporting https://github.com/elastic/elasticsearch/pull/37639	2019-01-24 20:55:55 +01:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Boaz Leskes	52ba407931	Expose sequence number and primary terms in search responses (#37639 ) Users may require the sequence number and primary terms to perform optimistic concurrency control operations. Currently, you can get the sequence number via the `docvalues_fields` API but the primary term is not accessible because it is maintained by the `SeqNoFieldMapper` and the infrastructure can't find it. This commit adds a dedicated sub fetch phase to return both numbers that is connected to a new `seq_no_primary_term` parameter.	2019-01-23 09:01:58 +01:00
Tomas Della Vedova	257f3eff22	Add note about how the body is referenced (#33935 )	2019-01-22 09:07:48 +01:00
Julie Tibshirani	8da7a27f3b	Deprecate types in the put mapping API. (#37280 ) From #29453 and #37285, the `include_type_name` parameter was already present and defaulted to false. This PR makes the following updates: - Add deprecation warnings to `RestPutMappingAction`, plus tests in `RestPutMappingActionTests`. - Add a typeless 'put mappings' method to the Java HLRC, and deprecate the old typed version. To do this cleanly, I opted to create a new `PutMappingRequest` object that differs from the existing server one.	2019-01-18 12:28:31 -08:00
Christoph Büscher	2f0e0b2426	Allow indices.get_mapping response parsing without types (#37492 ) This change adds deprecation warning to the indices.get_mapping API in case the "inlcude_type_name" parameter is set to "true" and changes the parsing code in GetMappingsResponse to parse the type-less response instead of the one containing types. As a consequence the HLRC client doesn't need to force "include_type_name=true" any more and the GetMappingsResponseTests can be adapted to the new format as well. Also removing some "include_type_name" parameters in yaml test and docs where not necessary.	2019-01-18 09:33:36 +01:00
Julie Tibshirani	1a1dbf705f	Make sure to use the resolved type in DocumentMapperService#extractMappings. (#37451 ) * Pull out a shared method MapperService#resolveDocumentType. * Make sure to resolve the type when extracting the mappings. Addresses #36811.	2019-01-15 07:32:47 -08:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Christoph Büscher	bb6d8784e7	Switch indices.get rest after backport of `include_type_name` (#37351 ) With the `include_type_name` available now for indices.get on 6.x after the backport, the corresponsing yaml test can include anything from 6.7 on. Also changing the RestGetIndicesActionTests base test class.	2019-01-11 17:24:12 +01:00
Julie Tibshirani	a433c4012c	Support include_type_name in the field mapping and index template APIs. (#37210 ) * Add include_type_name to the get field mappings API. * Make sure the API specification lists include_type_name as a boolean. * Add include_type_name to the get index templates API. * Add include_type_name to the put index templates API.	2019-01-10 09:24:08 -08:00
Christoph Büscher	c149bb8cc2	Support 'include_type_name' in RestGetIndicesAction (#37149 ) This change adds support for the 'include_type_name' parameter for the indices.get API. This parameter, which defaults to `false` starting in 7.0, changes the response to not include the indices type names any longer. If the parameter is set in the request, we additionally emit a deprecation warning since using the parameter should be only temporarily necessary while adapting to the new response format and we will remove it with the next major version.	2019-01-09 14:17:17 +01:00
Mayya Sharipova	ec32e66088	Deprecate reference to _type in lookup queries (#37016 ) Relates to #35190	2019-01-08 18:46:41 -08:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Jim Ferenczi	78ba1889cf	Replace the TreeMap in the composite aggregation (#36675 ) The `composite` aggregation uses a TreeMap to keep track of the best buckets. This ensures a log(n) time cost to insert new buckets but also to retrieve buckets that are already present in the map. In order to speed up the retrieval of buckets this change replaces the TreeMap with a priority queue and a HashMap. The insertion cost is still log(n) but the retrieval of buckets through the HashMap is now done in constant time. This optimization can bring significant improvement since each document needs to check if its associated buckets are already present in the current best buckets.	2019-01-03 09:51:35 +01:00
Alan Woodward	7a0047744d	`query_string` should use indexed prefixes (#36895 ) The QueryStringQueryBuilder does not currently delegate to the field mapper's prefixQuery method, so does not use indexed prefixes. This commit corrects this. It also fixes a bug where a query a* would not match the word a if indexed prefixes were used with a minchar setting of 2.	2019-01-02 20:12:24 +00:00
Ryan Ernst	cfc0a47232	Core: Deprecate negative epoch timestamps (#36793 ) Negative timestamps are currently supported in joda time. These are dates before epoch. However, it doesn't really make sense to have a negative timestamp, since this is a modern format. Any dates before epoch can be represented with normal date formats, like ISO8601. Additionally, implementing negative epoch timestamp parsing in java time has an edge case which would more than double the code required. This commit deprecates use of negative epoch timestamps.	2018-12-20 00:17:06 -08:00
Julie Tibshirani	ecb822c666	Deprecate the document create endpoint. (#36863 )	2018-12-19 15:20:20 -08:00
Alan Woodward	dd540ef618	Use index-prefix fields for terms of length min_chars - 1 (#36703 ) The default index_prefix settings will index prefixes of between 2 and 5 characters in length. Currently, if a prefix search falls outside of this range at either end we fall back to a standard prefix expansion, which is still very expensive for single character prefixes. However, we have an option here to use a wildcard expansion rather than a prefix expansion, so that a query of a* gets remapped to a? against the _index_prefix field - likely to be a very small set of terms, and certain to be much smaller than a* against the whole index. This commit adds this extra level of mapping for any prefix term whose length is one less than the min_chars parameter of the index_prefixes field.	2018-12-19 08:55:05 +00:00
Mayya Sharipova	f884b2b1cd	Deprecate types in index API (#36575 ) * Deprecate types in index API - deprecate type-based constructors of IndexRequest - update tests to use typeless IndexRequest constructors - no yaml tests as they have been already added in #35790 Relates to #35190	2018-12-18 08:53:49 -05:00
Boaz Leskes	5f76f39386	Rename seq# powered optimistic concurrency control parameters to ifSeqNo/ifPrimaryTerm (#36757 ) This PR renames the parameters previously introduce to the following: ### URL Parameters ``` PUT twitter/_doc/1?if_seq_no=501&if_primary_term=1 { "user" : "kimchy", "post_date" : "2009-11-15T14:12:12", "message" : "trying out Elasticsearch" } DELETE twitter/_doc/1?if_seq_no=501&if_primary_term=1 ``` ### Bulk API ``` POST _bulk { "index" : { "_index" : "test", "_type" : "_doc", "_id" : "1", "if_seq_no": 501, "if_primary_term": 1 } } { "field1" : "value1" } { "delete" : { "_index" : "test", "_type" : "_doc", "_id" : "2", "if_seq_no": 501, "if_primary_term": 1 } } ``` ### Java API ``` IndexRequest.ifSeqNo(long seqNo) IndexRequest.ifPrimaryTerm(long primaryTerm) DeleteRequest.ifSeqNo(long seqNo) DeleteRequest.ifPrimaryTerm(long primaryTerm) ``` Relates #36148 Relates #10708	2018-12-18 14:35:18 +01:00
Boaz Leskes	9087c98a5a	Expose Sequence Number based Optimistic Concurrency Control in the rest layer (#36721 ) Relates #36148 Relates #10708	2018-12-18 10:56:02 +01:00
Christoph Büscher	2f5300e3a6	Deprecate types in get_source and exist_source (#36426 ) This change adds a new untyped endpoint `{index}/_source/{id}` for both the GET and the HEAD methods to get the source of a document or check for its existance. It also adds deprecation warnings to RestGetSourceAction that emit a warning when the old deprecated "type" parameter is still used. Also updating documentation and tests where appropriate. Relates to #35190	2018-12-18 00:57:42 +01:00
Julie Tibshirani	ccd1beb9b3	Deprecate types in update requests. (#36181 ) The following updates were made: * Add deprecation warnings to `RestUpdateAction`, plus a test in `RestUpdateActionTests`. * Deprecate relevant methods on the Java HLRC requests/ responses. * Add HLRC integration tests for the typed APIs. * Update documentation (for both the REST API and Java HLRC). * Fix failing integration tests. Because of an earlier PR, the REST yml tests were already updated (one version without types, and another legacy version that retains types).	2018-12-14 10:47:27 -08:00
Alan Woodward	09bf93dc2a	Add intervals query (#36135 ) * Add IntervalQueryBuilder with support for match and combine intervals * Add relative intervals * feedback * YAML test - broekn * yaml test; begin to add block source * Add block; make disjunction its own source * WIP * Extract IntervalBuilder and add tests for it * Fix eq/hashcode in Disjunction * New yaml test * checkstyle * license headers * test fix * YAML format * YAML formatting again * yaml tests; javadoc * Add OR test -> requires fix from LUCENE-8586 * Add docs * Re-do API * Clint's API * Delete bash script * doc fixes * imports * docs * test fix * feedback * comma * docs fixes * Tidy up doc references to old rule	2018-12-14 15:14:00 +00:00
Julie Tibshirani	eb733f404a	Fix the mixed cluster REST test explain/11_basic_with_types.	2018-12-11 11:03:26 -08:00
Yannick Welsch	d8e3d97a7d	Add discovery types to cluster stats (#36442 ) Adds information about the used discovery types to the cluster stats, similar as we have for the network types.	2018-12-11 10:35:58 +01:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Julie Tibshirani	99f89cd3b4	Deprecate types in get, exists, and multi get. (#35930 ) For each API, the following updates were made: - Add deprecation warnings to `RestAction`, plus tests in `RestActionTests`. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 17:22:48 -08:00
Julie Tibshirani	3f3cde41d3	Deprecate types in termvector and mtermvector requests. (#36182 ) * Add deprecation warnings to `RestTermVectorsAction`, plus tests in `RestTermVectorsActionTests`. * Deprecate relevant methods on the Java HLRC requests/ responses. * Update documentation (for both the REST API and Java HLRC). * For each REST yml test, create one version without types, and another legacy version that retains types (called *_with_types.yml).	2018-12-06 10:23:15 -08:00
jaymode	54dcc273a8	Revert "Test: mute failing mtermvector rest test" This reverts commit `5ba0ed9c8f` as this test was already fixed by `4c2e0d76f3`. Closes #36281	2018-12-05 13:38:37 -07:00
jaymode	5ba0ed9c8f	Test: mute failing mtermvector rest test See #36281	2018-12-05 12:52:46 -07:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Adrien Grand	d24b40f688	Make typeless APIs usable with indices whose type name is different from `_doc` (#35790 ) This commit makes `document`, `update`, `explain`, `termvectors` and `mapping` typeless APIs work on indices that have a type whose name is not `_doc`. Unfortunately, this needs to be a bit of a hack since I didn't want calls with random type names to see documents with the type name that the user had chosen upon type creation. The `explain` and `termvectors` do not support being called without a type for now so the test is just using `_doc` as a type for now, we will need to fix tests later but this shouldn't require further changes server-side since passing `_doc` as a type name is what typeless APIs do internally anyway. Relates #35190	2018-12-04 19:22:17 +01:00
Julie Tibshirani	0e1ddfd825	Deprecate types in document delete requests. (#36087 ) * Make sure to use _doc as a type name in the CRUD HLRC tests. * Deprecate types in document delete requests.	2018-11-30 15:11:29 -08:00
Jim Ferenczi	8a7f3f75f3	Add support for rest_total_hits_as_int (#36051 ) The support for rest_total_hits_as_int has already been merged to 6x in #35848 so this change adds this new option to master. The plan was to add this new option as part of #35848 but we've decided to wait a few days before merging this breaking change so this commit just handles the new option as a noop exactly like 6x for now. This will allow users to migrate to this parameter before #35848 is merged. Relates #33028	2018-11-29 18:36:16 +01:00
Martijn van Groningen	7624734f14	Added wait_for_metadata_version parameter to cluster state api. (#35535 ) The `wait_for_metadata_version` parameter will instruct the cluster state api to only return a cluster state until the metadata's version is equal or greater than the version specified in `wait_for_metadata_version`. If the specified `wait_for_timeout` has expired then a timed out response is returned. (a response with no cluster state and wait for timed out flag set to true) In the case metadata's version is equal or higher than `wait_for_metadata_version` then the api will immediately return. This feature is useful to avoid external components from constantly polling the cluster state to whether somethings have changed in the cluster state's metadata.	2018-11-26 08:50:08 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Julie Tibshirani	c6a0904e0e	Deprecate types in count and msearch. (#35421 ) * Deprecate types in count requests. * Move RestCountAction to the 'search' package. * Deprecate types in multi search requests. * Add tests for types deprecation in the _search endpoint.	2018-11-16 13:04:43 -08:00
Julie Tibshirani	ec53288fc0	Remove include_type_name from the relevant APIs. (#35192 ) We've decided that the bulk, delete, get, index, update, and search APIs should not contain this request parameter, and we will instead accept both typed and typeless calls.	2018-11-06 14:33:48 -08:00
Simon Willnauer	833e0f8ecf	Prevent throttled indices to be searched through wildcards by default (#34354 ) Today if a wildcard, date-math expression or alias expands/resolves to an index that is search-throttled we still search it. This is likely not the desired behavior since it can unexpectedly slow down searches significantly. This change adds a new indices option that allows `search`, `count` and `msearch` to ignore throttled indices by default. Users can force expansion to throttled indices by using `ignore_throttled=true` on the rest request to expand also to throttled indices. Relates to #34352	2018-11-06 09:45:30 +01:00
lipsill	d181d1bab1	Remove deprecated url parameters `_source_include` and `_source_exclude` (#35097 ) Removes `_source_include` and `_source_exclude` url parameters. These parameters have been deprecated in #33475. Closes #22792	2018-10-31 17:11:59 -04:00
Andy Bristol	010845b233	[test] dont skip last version for mad	2018-10-31 11:36:25 -07:00
Nik Everett	70e939ee38	Search: Test deprecated field in mixed cluster Update the test for the deprecated `_source_exclude` parameter to run against 6.6.0+ now that we've backported the deprecation. Relates to #33475	2018-10-30 11:59:09 -04:00
Andy Bristol	b8280ea7cc	median absolute deviation agg (#34482 ) This commit adds a new single value metric aggregation that calculates the statistic called median absolute deviation, which is a measure of variability that works on more types of data than standard deviation Our calculation of MAD is approximated using t-digests. In the collect phase, we collect each value visited into a t-digest. In the reduce phase, we merge all value t-digests, then create a t-digest of deviations using the first t-digest's median and centroids	2018-10-30 07:22:52 -07:00
Spencer	6169e9345e	[REST-API] fix url for reload_secure_settings	2018-10-29 12:09:53 -07:00
lipsill	6df1c9e818	Deprecate `_source_include` and `_source_exclude` url parameters (#33475 ) Deprecates `_source_include` and `_source_exclude` url parameters in favor of `_source_inclues` and `_source_excludes` because those are consistent with the rest of Elasticsearch's APIs. Relates to #22792	2018-10-29 12:06:38 -04:00
Emily S	eaa05fe762	[TEST] Minor updates to rest api spec tests (#34551 )	2018-10-23 19:09:09 +02:00
Julie Tibshirani	fbb9ac34f9	Deprecate type exists requests. (#34663 )	2018-10-22 08:46:11 -07:00
Christoph Büscher	a1c441f78a	HLRC: Add throttling for update & delete-by-query (#33951 ) This change adds throttling to the update-by-query and delete-by-query cases similar to throttling for reindex. This mostly means additional methods on the client class itself, since the request hits the same RestHandler, just with slightly different endpoints, and also the return values are similar.	2018-10-02 21:44:15 +02:00
Przemyslaw Gomulka	3f8cc89c9f	Completion types with multi-fields support (#34081 ) Mappings with completion type and multi-fields, were not able to index array or object format on completion fields. Only string format was supported. This is fixed by providing multiField parser with externalValueContext with already parsed object closes #15115	2018-10-02 14:32:56 +02:00
Russ Cam	94df8ec358	Rest-Api-Spec: Correct spelling in filter_path description (#33154 ) This commit corrects the spelling of response in the description of filter_path in the REST API spec.	2018-09-28 11:20:51 +02:00
Mayya Sharipova	80c5d30f30	XContentBuilder to handle BigInteger and BigDecimal (#32888 ) Although we allow to index BigInteger and BigDecimal into a keyword field, source filtering on these fields would fail as XContentBuilder was not able to deserialize BigInteger and BigDecimal to json. This modifies XContentBuilder to allow to handle BigInteger and BigDecimal. Closes #32395	2018-09-26 14:24:31 -04:00
Adrien Grand	3c2841d493	REST test for typeless APIs. (#33934 ) This commit duplicates REST tests for the - `indices.create` - `indices.put_mapping` - `indices.get_mapping` - `index` - `get` - `delete` - `update` - `bulk` APIs, so that we both test them when used without types (include_type_name=false) and with types, mostly for mixed-version cluster tests. Given a suite called `X_test_name.yml`, I first copied it to `(X+1)_test_name_with_types.yml` and then changed `X_test_name.yml` to set `include_type_name=false` on every API that supports it. Relates #15613	2018-09-26 17:11:37 +02:00
Jim Ferenczi	a255880497	Add nested and object fields to field capabilities response (#33803 ) This commit adds nested and object fields to the field capabilities response. Closes #33237	2018-09-26 08:59:41 +02:00
Tomas Della Vedova	7bed915499	[DOCS] Fixed list formatting (#32963 )	2018-09-18 17:05:10 +02:00
Jim Ferenczi	5f495c18df	Adapt skip version for doc_values format deprecation This commit fixes bwc rest tests for the doc_values format deprecation in search. The message of the deprecation changed in 6.4.1 so the bwc test should not check against 6.4.0.	2018-09-14 10:22:11 +02:00
Colin Goodheart-Smithe	624b84f897	Improves doc values format deprecation message (#33576 ) * Improves doc values format deprecation message This changes the deprecation message when doc values fields do not supply a format form logging a deprecation warning for each offending field individually to logging a single message which lists all offending fields Closes #33572 * Updates YAML test with new deprecation message Also adds a test to ensure multiple deprecation warnings are collated into one message * Condenses collection of fields without format check Moves the collection of fields that don't have a format to a separate loop and moves the logging of the deprecation warning to be next to it at the expesnse of looping through the field list twice * fixes typo * Fixes test	2018-09-11 14:32:43 +01:00
Jim Ferenczi	dbc7102c86	Fix inner hits retrieval when stored fields are disabled (_none_) (#33018 ) Now that types are unique per mapping we can retrieve the document mapper without referencing the type. This fixes an NPE when stored fields are disabled. For 6x we'll need a different fix since mappings can still have multiple types. Relates #32941	2018-09-04 16:25:52 +02:00
Albert Zaharovits	fbe609d589	Reload Secure Settings REST specs & docs (#32990 ) This is a minimal REST API spec and docs for the REST handler for the `_nodes/reload_secure_settings endpoint`. Relates #29135	2018-08-26 14:49:32 +03:00
Mayya Sharipova	6f1ee76443	Revert "Do NOT allow termvectors on nested fields (#32728 )" This reverts commit `fdff8f3db0`.	2018-08-24 10:12:16 -04:00
Mayya Sharipova	fdff8f3db0	Do NOT allow termvectors on nested fields (#32728 ) Requesting _termvectors on a nested field or any sub-fields of a nested field returns empty results. Closes #21625	2018-08-23 16:46:47 -04:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to Nan rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.Nan` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with lucene which set `maxScore` to `Float.Nan` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
Simon Willnauer	ffb1a5d5b7	Expose `max_concurrent_shard_requests` in `_msearch` (#33016 ) Today `_msearch` doesn't allow modifying the `max_concurrent_shard_requests` per sub search request. This change adds support for setting this parameter on all sub-search requests in an `_msearch`. Relates to #31877	2018-08-22 08:45:08 +02:00
Julie Tibshirani	99c3d8ab26	Fix a bug in the BWC tests where we added a field alias in unsupported versions.	2018-08-08 15:24:23 -07:00
Julie Tibshirani	0281846c6b	Adjust the test version now that the field alias bug has been backported.	2018-08-07 16:48:41 -07:00
Julie Tibshirani	d7183f8f3d	Make sure that field collapsing supports field aliases. (#32648 )	2018-08-07 16:20:09 -07:00
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
Zachary Tong	7c43da280b	[TEST] add version skip to weighted_avg tests Added in 6.4, so we don't want this running on pre-6.4 BWC tests	2018-07-23 19:50:26 -04:00
Zachary Tong	6ba144ae31	Add WeightedAvg metric aggregation (#31037 ) Adds a new single-value metrics aggregation that computes the weighted average of numeric values that are extracted from the aggregated documents. These values can be extracted from specific numeric fields in the documents. When calculating a regular average, each datapoint has an equal "weight"; it contributes equally to the final value. In contrast, weighted averages scale each datapoint differently. The amount that each datapoint contributes to the final value is extracted from the document, or provided by a script. As a formula, a weighted average is the `∑(value * weight) / ∑(weight)` A regular average can be thought of as a weighted average where every value has an implicit weight of `1`. Closes #15731	2018-07-23 18:33:15 -04:00
Boaz Leskes	f7da55aa3c	Rest test - allow for snapshots to take 0 milliseconds Sometimes we get lucky and things are fast :)	2018-07-19 11:32:36 +02:00
Armin Braun	6de1f96cad	Fix BwC Tests looking for UUID Pre 6.4 (#32158 ) * UUID field was added for #31791 and only went into 6.4 and 7.0 * Fixes #32119	2018-07-18 15:32:36 +02:00
Armin Braun	4b5071f2d0	Add Index UUID to `/_stats` Response (#31871 ) * Add "uuid" field to each index's section in the `/_stats` response * closes #31791	2018-07-17 06:50:21 +02:00
Jim Ferenczi	44f0c1df39	Unmute field collapsing rest tests BWC tests can run now that master and 6x branch are aligned. Closes #32055	2018-07-16 12:03:28 +02:00
Christoph Büscher	466235288e	Mute failing tests Relates to #32055	2018-07-13 23:09:07 +02:00
Mayya Sharipova	80492cacfc	Add second level of field collapsing (#31808 ) * Put second level collapse under inner_hits Closes #24855	2018-07-13 11:40:03 -04:00
Tal Levy	68a8d13828	add support for is_write_index in put-alias body parsing (#31674 ) * add support for is_write_index in put-alias body parsing The Rest Put-Alias Action does separate parsing of the alias body to construct the IndicesAliasesRequest. This extra parsing was missed in #30703. * test flag was not just ignored by the parser * disable backcompat tests	2018-07-09 16:03:17 -04:00
Jim Ferenczi	584fa261cc	Remove the ability to index or query context suggestions without context (#31007 ) This is a follow up of #30712 that removes the ability to index or query and context enabled completion field without context. Relates #30712	2018-07-09 16:01:01 +02:00
Alan Woodward	d3a8094d35	Remove obsolete parameters from analyze rest spec (#31795 ) This commit also fixes a typo in the analyze high-level client documentation.	2018-07-06 09:05:34 +01:00
Sohaib Iftikhar	09e8ac8167	REST high-level client: add get index API (#31703 ) Also added master_timeout parameter for the indices.get spec Relates to #27205	2018-07-05 13:52:25 -04:00
Julie Tibshirani	1a54bca712	Mute 'Test typed keys parameter for suggesters' as we await a fix.	2018-06-29 12:57:54 -07:00
Nik Everett	232c71b6bf	QA: Create xpack yaml features (#31403 ) This creates a YAML test "features" that indices if the cluster being tested has xpack installed (`xpack`) or if it does not have xpack installed (`no_xpack`). It uses those features to centralize skipping a few tests that fail if xpack is installed. The plan is to use this in a followup to skip docs tests that require xpack when xpack is not installed. We plan to use the declaration of required license level on the docs page to generate the required `skip`. Closes #30933.	2018-06-26 09:26:48 -04:00
Luca Cavanna	0352d88621	Get Mapping API to honour allow_no_indices and ignore_unavailable (#31507 ) Get Mapping currently throws index not found exception (and returns 404 status code) from the REST layer whenever an index was specified and no indices have been returned. We should not have this logic in the REST layer though as only our index resolver should decide whether we need to throw exceptions or not based on provided indices and corresponding indices options. Closes #31485	2018-06-22 09:57:32 +02:00
Vladimir Dolzhenko	04e4e44409	Add get stored script and delete stored script to high level REST API (#31355 ) Add get stored script and delete stored script to high level REST API Relates to #27205	2018-06-19 14:21:11 +02:00
Tal Levy	3b70e943eb	add is-write-index flag to aliases (#30942 ) This commit adds the is-write-index flag for aliases. It allows requests to set the flag, and responses to display the flag. It does not validate and/or affect any indexing/getting/updating behavior of Elasticsearch -- this will be done in a follow-up PR.	2018-06-15 08:45:29 -07:00
Nik Everett	856936c286	REST Client: NodeSelector for node attributes (#31296 ) Add a `NodeSelector` so that users can filter the nodes that receive requests based on node attributes. I believe we'll need this to backport #30523 and we want it anyway. I also added a bash script to help with rebuilding the sniffer parsing test documents.	2018-06-15 08:04:54 -04:00
Nik Everett	0d9b78834f	LLClient: Support host selection (#30523 ) Allows users of the Low Level REST client to specify which hosts a request should be run on. They implement the `NodeSelector` interface or reuse a built in selector like `NOT_MASTER_ONLY` to chose which nodes are valid. Using it looks like: ``` Request request = new Request("POST", "/foo/_search"); RequestOptions options = request.getOptions().toBuilder(); options.setNodeSelector(NodeSelector.NOT_MASTER_ONLY); request.setOptions(options); ... ``` This introduces a new `Node` object which contains a `HttpHost` and the metadata about the host. At this point that metadata is just `version` and `roles` but I plan to add node attributes in a followup. The canonical way to get this metadata is to use the `Sniffer` to pull the information from the Elasticsearch cluster. I've marked this as "breaking-java" because it breaks custom implementations of `HostsSniffer` by renaming the interface to `NodesSniffer` and by changing it from returning a `List<HttpHost>` to a `List<Node>`. It shouldn't break anyone else though. Because we expect to find it useful, this also implements `host_selector` support to `do` statements in the yaml tests. Using it looks a little like: ``` --- "example test": - skip: features: host_selector - do: host_selector: version: " - 7.0.0" # same syntax as skip apiname: something: true ``` The `do` section parses the `version` string into a host selector that uses the same version comparison logic as the `skip` section. When the `do` section is executed it passed the off to the `RestClient`, using the `ElasticsearchHostsSniffer` to sniff the required metadata. The idea is to use this in mixed version tests to target a specific version of Elasticsearch so we can be sure about the deprecation logging though we don't currently have any examples that need it. We do, however, have at least one open pull request that requires something like this to properly test it. Closes #21888	2018-06-11 17:07:27 -04:00
Simon Willnauer	f825a530b8	Limit the number of concurrent requests per node (#31206 ) With `max_concurrent_shard_requests` we used to throttle / limit the number of concurrent shard requests a high level search request can execute per node. This had several problems since it limited the number on a global level based on the number of nodes. This change now throttles the number of concurrent requests per node while still allowing concurrency across multiple nodes. Closes #31192	2018-06-11 08:49:18 +02:00
Martijn van Groningen	07a57cc131	Move number of language analyzers to analysis-common module (#31143 ) The following analyzers were moved from server module to analysis-common module: `snowball`, `arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `chinese`, `cjk`, `czech`, `danish`, `dutch`, `english`, `finnish`, `french`, `galician` and `german`. Relates to #23658	2018-06-08 08:58:46 +02:00
Jim Ferenczi	b30aa3137d	Reject long regex in query_string (#31136 ) This change applies the existing `index.max_regex_length` to regex queries produced by the `query_string` query. Relates #28344	2018-06-07 09:29:26 +02:00
Lee Hinman	b22a055bcf	Add get mappings support to high-level rest client (#30889 ) This adds support for the get mappings API to the high level rest client. Relates to #27205	2018-06-04 14:31:08 -06:00
Jim Ferenczi	f94a75778c	Fix index prefixes to work with span_multi (#31066 ) * Fix index prefixes to work with span_multi Text fields that use `index_prefixes` can rewrite `prefix` queries into `term` queries internally. This commit fix the handling of this rewriting in the `span_multi` query. This change also copies the index options of the text field into the prefix field in order to be able to run positional queries. This is mandatory for `span_multi` to work but this could also be useful to optimize `match_phrase_prefix` queries in a follow up. Note that this change can only be done on indices created after 6.3 since we set the index options to doc only in this version. Fixes #31056	2018-06-04 21:48:56 +02:00
Alan Woodward	0427339ab0	Index phrases (#30450 ) Specifying `index_phrases: true` on a text field mapping will add a subsidiary [field]._index_phrase field, indexing two-term shingles from the parent field. The parent analysis chain is re-used, wrapped with a FixedShingleFilter. At query time, if a phrase match query is executed, the mapping will redirect it to run against the subsidiary field. This should trade faster phrase querying for a larger index and longer indexing times. Relates to #27049	2018-06-04 08:50:35 +01:00
Michael Basnight	d826cb36c3	Remove version read/write logic in Verify Response (#30879 ) Since master will always communicate with a >=6.4 node, the logic for checking if the node is 6.4 and conditionally reading and writing based on that can be removed from master. This logic will stay in 6.x as it is the bridge to the cleaner response in master. This also unmutes the failing test due to this bwc change. Closes #30807	2018-05-31 12:10:01 -05:00
Jim Ferenczi	0f5e570184	Deprecates indexing and querying a context completion field without context (#30712 ) This change deprecates completion queries and documents without context that target a context enabled completion field. Querying without context degrades the search performance considerably (even when the number of indexed contexts is low). This commit targets master but the deprecation will take place in 6.x and the functionality will be removed in 7 in a follow up. Closes #29222	2018-05-31 16:09:48 +02:00
Jim Ferenczi	f582418ada	Fix missing option serialization after backport Relates #29465	2018-05-30 12:55:31 +02:00
Jim Ferenczi	e33d107f84	Add missing_bucket option in the composite agg (#29465 ) This change adds a new option to the composite aggregation named `missing_bucket`. This option can be set by source and dictates whether documents without a value for the source should be ignored. When set to true, documents without a value for a field emits an explicit `null` value which is then added in the composite bucket. The `missing` option that allows to set an explicit value (instead of `null`) is deprecated in this change and will be removed in a follow up (only in 7.x). This commit also changes how the big arrays are allocated, instead of reserving the provided `size` for all sources they are created with a small intial size and they grow depending on the number of buckets created by the aggregation: Closes #29380	2018-05-30 09:48:40 +02:00
Alan Woodward	67905c85a5	Rename index_prefix to index_prefixes (#30932 ) This commit also adds index_prefixes tests to TextFieldMapperTests to ensure that cloning and wire-serialization work correctly	2018-05-30 08:32:31 +01:00
Martijn van Groningen	544822c78b	Moved keyword tokenizer to analysis-common module (#30642 ) Relates to #23658	2018-05-29 19:22:28 +02:00
Vladimir Dolzhenko	b55b079a90	Include size of snapshot in snapshot metadata #18543 , bwc clean up (#30890 )	2018-05-26 21:20:44 +02:00
Vladimir Dolzhenko	81eb8ba0f0	Include size of snapshot in snapshot metadata (#29602 ) Include size of snapshot in snapshot metadata Adds difference of number of files (and file sizes) between prev and current snapshot. Total number/size reflects total number/size of files in snapshot. Closes #18543	2018-05-25 21:04:50 +02:00
Tom Callahan	36fbb4cb48	Harmonize include_defaults tests (#30700 ) This PR breaks the include_defaults functionality of the get settings API into its own test, which is skipped for mixed-mode clusters containing pre-6.4 nodes.	2018-05-25 09:41:16 -04:00
David Roberts	40534ccabc	[TEST] Mute {p0=snapshot.get_repository/10_basic/Verify created repository} YAML test Issue is #30807	2018-05-25 12:58:02 +01:00
Michael Basnight	e1ffbeb824	Fix bad version check writing Repository nodes (#30846 ) The writeTo method of VerifyRepositoryResponse incorrectly used its local version to determine what it was receiving, rather than the sender's version. This fixes a bug that ocassionally happened when a 6.4 master node sent data to a 7.0 client, causing the number of bytes to be improperly read. This also unmutes the test. Closes #30807	2018-05-24 19:21:57 -05:00
Julie Tibshirani	f55b09bae4	Update the version checks around ip_range bucket keys, now that the change was backported.	2018-05-24 12:04:18 -07:00
Julie Tibshirani	638a719370	Ensure that ip_range aggregations always return bucket keys. (#30701 )	2018-05-24 08:55:14 -07:00
Adrien Grand	405eb7a751	Change serialization version of doc-value fields. Relates #29639	2018-05-23 18:34:05 +02:00
Adrien Grand	a19df4ab3b	Add a `format` option to `docvalue_fields`. (#29639 ) This commit adds the ability to configure how a docvalue field should be formatted, so that it would be possible eg. to return a date field formatted as the number of milliseconds since Epoch. Closes #27740	2018-05-23 14:39:04 +02:00
Colin Goodheart-Smithe	483b25330b	Mustes {p0=snapshot.get_repository/10_basic/*} YAML test This is awaiting a fix for https://github.com/elastic/elasticsearch/issues/30807	2018-05-23 11:32:58 +01:00
olcbean	af8ad8d172	Add more yaml tests for get alias API (#29513 )	2018-05-22 11:48:28 +02:00
Jason Tedor	d68c44b76c	Default copy settings to true and deprecate on the REST layer (#30598 ) This commit defaults the copy_settings REST parameter to the shrink and split APIs to true, and deprecates the parameter.	2018-05-18 10:12:08 -04:00
Zachary Tong	d120fb222c	[TEST] Adjust version skips for movavg/movfn tests Since the MovFn PR was backported to 6.x, we can adjust the version skip numbers in master to correctly match 6.3.99 instead of 6.4.0	2018-05-17 18:07:52 +00:00
Mayya Sharipova	3dfa93ef7c	Improve explanation in rescore (#30629 ) Currently in a rescore request if window_size is smaller than the top N documents returned (N=size), explanation of scores could be incorrect for documents that were a part of topN and not part of rescoring. This PR corrects this, but saving in RescoreContext docIDs of documents for which rescoring was applied, and adding rescoring explanation only for these docIDs. Closes #28725	2018-05-17 07:09:18 -04:00
Tanguy Leroux	2ac1f9fe89	Fix _cluster/state to always return cluster_uuid (#30656 ) Since #30143, the Cluster State API should always returns the current cluster_uuid in the response body, regardless of the metrics filters. This is not exactly true as it is returned only if metadata metrics and no specific indices are requested. This commit fixes the behavior to always return the cluster_uuid and add new test.	2018-05-17 10:58:25 +02:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594 ) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
Simon Willnauer	b50cf3c6b0	Side-step pending deletes check (#30571 ) When we split/shrink an index we open several IndexWriter instances causeing file-deletes to be pending on windows. This subsequently fails when we open an IW to bootstrap the index history due to pending deletes. This change sidesteps the check since we know our history goes forward in terms of files and segments. Closes #30416	2018-05-15 11:51:54 +02:00
Jason Tedor	4e33443690	Adjust versions for resize copy settings (#30578 ) Now that the change to deprecate copy settings and disallow it being explicitly set to false is backported, this commit adjusts the BWC versions in master.	2018-05-14 16:41:25 -04:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Martijn van Groningen	7b95470897	Moved tokenizers to analysis common module (#30538 ) The following tokenizers were moved: classic, edge_ngram, letter, lowercase, ngram, path_hierarchy, pattern, thai, uax_url_email and whitespace. Left keyword tokenizer factory in server module, because normalizers directly depend on it.This should be addressed on a follow up change. Relates to #23658	2018-05-14 07:55:01 +02:00
Jason Tedor	593fdd40ed	Deprecate not copy settings and explicitly disallow (#30404 ) We want copying settings to be the default behavior. This commit deprecates not copying settings, and disallows explicitly not copying settings. This gives users a transition path to the future default behavior.	2018-05-13 10:30:05 -04:00
Tal Levy	34f92df2d3	AwaitsFix IntegTestZipClientYamlTestSuiteIT#indices.split tests there are two tests that have failed multiple times in one day on windows CI. This commit AwaitsFixes them until their timeout issues are resolved. tracking here: https://github.com/elastic/elasticsearch/issues/30503	2018-05-09 18:25:25 -07:00
Boaz Leskes	9f5fe49cec	Disable REST default settings testing until #29229 is back-ported That PR changed the execution path of index settings default to be on the master until the PR is back-ported the old master will not return default settings.	2018-05-07 13:30:14 +02:00
tomcallahan	0a93956194	Add Get Settings API support to java high-level rest client (#29229 ) This PR adds support for the Get Settings API to the java high-level rest client. Furthermore, logic related to the retrieval of default settings has been moved from the rest layer into the transport layer and now default settings may be retrieved consistency via both the rest API and the transport API.	2018-05-04 11:14:28 -04:00
Adrien Grand	bcdf3d5c61	Post backport of #29658 .	2018-05-02 11:43:50 +02:00
Adrien Grand	231a63fdf8	Remove useless version checks in REST tests. (#30165 ) Many tests are added with a version check so that they do not run against a version that doesn't have the feature yet. Master is 7.0, so all tests that do not run against 6.0+ can be removed and the version check can be removed on all tests that always run on 6.0+.	2018-05-02 11:34:15 +02:00
Adrien Grand	7358946bda	Add a new `_ignored` meta field. (#29658 ) This adds a new `_ignored` meta field which indexes and stores fields that have been ignored at index time because of the `ignore_malformed` option. It makes malformed documents easier to identify by using `exists` or `term(s)` queries on the `_ignored` field. Closes #29494	2018-05-02 10:47:02 +02:00
Jason Tedor	5de6f4ff7b	Adjust copy settings on resize BWC version This commit adjusts the BWC version for copy settings on resize operations after the behavior was backported to 6.x.	2018-05-01 08:49:16 -04:00
Jason Tedor	50535423ff	Allow copying source settings on resize operation (#30255 ) Today when an index is created from shrinking or splitting an existing index, the target index inherits almost none of the source index settings. This is surprising and a hassle for operators managing such indices. Given this is the default behavior, we can not simply change it. Instead, we start by introducing the ability to copy settings. This flag can be set on the REST API or on the transport layer and it has the behavior that it copies all settings from the source except non-copyable settings (a property of a setting introduced in this change). Additionally, settings on the request will always override. This change is the first step in our adventure: - this flag is added here in 7.0.0 and immediately deprecated - this flag will be backported to 6.4.0 and remain deprecated - then, we will remove the ability to set this flag to false in 7.0.0 - finally, in 8.0.0 we will remove this flag and the only behavior will be for settings to be copied	2018-05-01 08:48:19 -04:00
Chris Earle	421bd9bd7a	_cluster/state Skip Test for pre-6.4, not pre-7.0 (#30264 ) This updates the skip section for the new `_cluster/state` responses to include 6.4+ now that it has been backported.	2018-04-30 14:53:48 -04:00
Chris Earle	725a5af2c6	_cluster/state should always return cluster_uuid (#30143 ) Currently, the only way to get the REST response for the `/_cluster/state` call to return the `cluster_uuid` is to request the `metadata` metrics, which is one of the most expensive response structures. However, external monitoring agents will likely want the `cluster_uuid` to correlate the response with other API responses whether or not they want cluster metadata.	2018-04-30 10:16:11 -04:00
Julie Tibshirani	f5978d6d33	In the field capabilities API, remove support for providing fields in the request body. (#30185 )	2018-04-27 16:14:11 -07:00
Alexander Reelsen	e1a16a6018	REST: Remove GET support for clear cache indices (#29525 ) Clearing the cache indices can be done via GET and POST. As GET should only support read only operations, this removes the support for using GET for clearing the indices caches.	2018-04-27 08:41:36 +02:00
Jason Tedor	c12c2a6cc9	Rename the bulk thread pool to write thread pool (#29593 ) This commit renames the bulk thread pool to the write thread pool. This is to better reflect the fact that the underlying thread pool is used to execute any document write request (single-document index/delete/update requests, and bulk requests). With this change, we add support for fallback settings thread_pool.bulk.* which will be supported until 7.0.0. We also add a system property so that the display name of the thread pool remains as "bulk" if needed to avoid breaking users.	2018-04-19 08:18:58 -04:00
Martijn van Groningen	8afa7c174f	Added painless execute api. (#29164 ) Added an api that allows to execute an arbitrary script and a result to be returned. ``` POST /_scripts/painless/_execute { "script": { "source": "params.var1 / params.var2", "params": { "var1": 1, "var2": 1 } } } ``` Relates to #27875	2018-04-19 09:33:34 +02:00
Jason Tedor	2b47d67d95	Remove the index thread pool (#29556 ) Now that single-document indexing requests are executed on the bulk thread pool the index thread pool is no longer needed. This commit removes this thread pool from Elasticsearch.	2018-04-18 09:18:08 -04:00
Adrien Grand	d223bcf7ab	Add the `include_type_name` option to the search and document APIs. (#29506 ) This commit add the `include_type_name` option to the `index`, `update`, `delete`, `get`, `bulk` and `search` APIs. When set to `false`, the response will omit the `_type` in the response. This option doesn't work if the endpoint contains a type. For instance, the following call would succeed: ``` GET index/_doc/1?include_type_name=false ``` But the following one would fail: ``` GET index/some_type/1?include_type_name=false ``` Relates #15613	2018-04-17 11:29:08 +02:00
olcbean	b3e3b80f1b	REST high-level client: add support for Indices Update Settings API [take 2] (#29327 ) Relates to #27205	2018-04-16 21:39:11 +02:00
Ke Li	0bfb59dcf2	Using ObjectParser in UpdateRequest (#29293 ) CRUD: Parsing changes for UpdateRequest (#29293) Use `ObjectParser` to parse `UpdateRequest` so we reject unknown fields and drop support for the `_fields` parameter because it was deprecated in 5.x.	2018-04-16 08:39:35 -04:00
Adrien Grand	553c718d66	Make index APIs work without types. (#29479 ) Unlike the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, the index APIs do not need the `include_type_name` option, they can work work with and without types withouth knowing whether types are being used. Internally, `_doc` is used as a type if no type is provided, like for the `indices.put_mapping` API.	2018-04-13 09:08:45 +02:00
Adrien Grand	ebd6b5b7ba	Deprecate filtering on `_type`. (#29468 ) As indices are only allowed to have one type now, and types are going away in the future, we should deprecate filtering by `_type`. Relates #15613	2018-04-13 09:07:51 +02:00
Nhat Nguyen	067fbb8ecd	Backport periodic flush count to v6.3.0 Relates #29360	2018-04-11 17:14:28 -04:00
Nhat Nguyen	4e6a8900a3	Add periodic flush count to flush stats (#29360 ) Currently, a flush stats contains only the total flush which is the sum of manual flush (via API) and periodic flush (async triggered when the uncommitted translog size is exceeded the flush threshold). Sometimes, it's useful to know these two numbers independently. This commit tracks and returns a periodic flush count in a flush stats.	2018-04-11 11:15:33 -04:00
Adrien Grand	6a6c0ea5e6	Add an `include_type_name` option. (#29453 ) This adds an `include_type_name` option to the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, which defaults to `true`. When set to `false`, then mappings will be returned directly in the body of the `indices.get_mapping` API, without keying them by the type name, the `indices.create` will expect mappings directly under the `mappings` key, and the `indices.put_mapping` will use `_doc` as a type name and fail if a `type` is provided explicitly. Relates #15613	2018-04-11 15:54:16 +02:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Adrien Grand	6949c888bf	Remove useless skip. Relates #29353	2018-04-11 09:40:28 +02:00
Adrien Grand	a091d950a7	Deprecate slicing on `_uid`. (#29353 ) Deprecate slicing on `_uid`. `_id` should be used instead on 6.x.	2018-04-10 14:28:30 +02:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Christoph Büscher	570f1d9ac7	Add indices options support to _rank_eval (#29386 ) Currently the ranking evaluation API doesn't support many of the standard parameters of the search API. Some of these make sense, like adding support for the common indices options parameters, which this change adds.	2018-04-06 16:23:19 +02:00
Jason Tedor	8fdca6a89a	Align cat thread pool info to thread pool config (#29195 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to the configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size A previous change addressed this for the nodes info API. This commit changes the display of thread pool info in the cat thread pool API too to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-04-03 17:27:26 -04:00
Johnny Marnell	4db6fc9a08	Reindex: Fix error in delete-by-query rest spec (#29318 )	2018-04-03 11:22:04 -04:00
Mayya Sharipova	e70cd35bda	Revert "REST high-level client: add support for Indices Update Settings API (#28892 )" (#29323 ) This reverts commit `b67b5b1bbd`.	2018-03-30 16:26:46 -07:00
olcbean	b67b5b1bbd	REST high-level client: add support for Indices Update Settings API (#28892 ) Relates to #27205	2018-03-30 10:53:29 +02:00
Jim Ferenczi	dd77d7fd0a	#28745 : remove extra option in the composite rest tests `allow_partial_search_results` is not needed for these tests.	2018-03-26 14:32:59 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Nhat Nguyen	8cfe619f03	Revert "[TEST] Mute index synced flush rest tests" The BWC issue was fixed. This reverts commit `f8830b7b43`.	2018-03-21 16:51:52 -04:00
David Roberts	f8830b7b43	[TEST] Mute index synced flush rest tests Awaiting fix of #29162	2018-03-21 10:09:21 -04:00
Luca Cavanna	ff09c82319	REST high-level client: add clear cache API (#28866 ) * REST high-level client: add clear cache API Relates to #27205 Also Closes #26947 (rest-spec were outdated)	2018-03-20 10:39:36 +01:00
olcbean	3d81497f25	REST: Clear Indices Cache API remove deprecated url params (#29068 ) By the time the master branch is released the deprecated url parameters in the `/_cache/clear` API will have been deprecated for a couple of minor releases. Since master will be the next major release we are fine with removing these parameters.	2018-03-14 16:37:50 -04:00
Nik Everett	3e0e6444cf	Mark field_data as deprecated in /_cache/clear It was deprecated in #28943.	2018-03-14 11:39:35 -04:00
Nik Everett	8f0da37c5e	Revert "REST tests: Increase version to skip to include 6.3" This reverts commit `9f2c4df94d` which isn't needed now that the backport is complete.	2018-03-13 20:22:32 -04:00
Alexander Reelsen	9f2c4df94d	REST tests: Increase version to skip to include 6.3 In order to fix the tests, the correct version needs to be skipped until the backport is done. Relates #28943	2018-03-13 17:00:01 -07:00
Paul Sanwald	6dae955b6a	Document and test date_range "missing" support (#28983 ) * Add a REST integration test that documents date_range support Add a test case that exercises date_range aggregations using the missing option. Addresses #17597 * Test cleanup and correction Adding a document with a null date to exercise `missing` option, update test name to something reasonable. * Update documentation to explain how the "missing" parameter works for date_range aggregations. * Wrap lines at 80 chars in docs. * Change format of test to YAML for readability.	2018-03-13 12:58:30 -07:00
olcbean	edc57f6f34	REST: deprecate `field_data` in Clear Cache API (#28943 ) We call it `fielddata` everywhere else in the code and API so we may as well be consistent.	2018-03-13 15:16:27 -04:00
olcbean	3cf599be4c	REST api specs : remove unsupported `wait_for_merge` param (#28959 )	2018-03-13 12:12:52 +01:00
Spencer	9709d89daf	[rest-api-spec] update doc link for /_rank_eval	2018-03-08 16:06:23 -07:00
Jim Ferenczi	bc8b3fc71c	Revert "Rescore collapsed documents (#28521 )" This reverts commit `f057fc294a`. The rescorer does not resort the collapsed values inside the top docs during rescoring. For this reason the Lucene rescorer is not compatible with collapsing. Relates #27243	2018-03-08 11:20:29 +01:00
Sergey Galkin	f057fc294a	Rescore collapsed documents (#28521 ) This change adds the ability to rescore collapsed documents.	2018-03-04 13:39:50 -08:00
Andrew Odendaal	f207aacac0	Missing `timeout` parameter from the REST API spec JSON files (#28328 ) Closes #28200	2018-03-02 14:47:06 +01:00
olcbean	25834e2d26	REST api spec: remove a common param from nodes.usage.json (#28835 ) "human" is a common parameter defined in `_common.json` It should not be repeated again for a concrete api. Closes #28226	2018-03-01 10:58:38 +01:00
olcbean	ec0a1b36c9	REST api spec: remove unsupported parameter `parent_node` (#28841 )	2018-03-01 10:57:25 +01:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Nhat Nguyen	30598627ce	Fix BWC issue of the translog last modified age stats We added a rest test for the translog last modified age without a version check. This causes BWC failed because the stats are not available in the old versions. Relates #28613	2018-02-16 14:55:20 -05:00
Justin Wyer	5aeb479ffd	Add translog file age to Translog Stats (#28613 ) Expose the age of translog files in the translog stats. This is useful to reason about your translog retention policy. Closes #28189	2018-02-16 16:23:33 +01:00
Boaz Leskes	beb55d148a	Simplify the Translog constructor by always expecting an existing translog (#28676 ) Currently the Translog constructor is capable both of opening an existing translog and creating a new one (deleting existing files). This PR separates these two into separate code paths. The constructors opens files and a dedicated static methods creates an empty translog.	2018-02-15 09:24:09 +01:00
Ke Li	fc406c9a5a	Upgrade t-digest to 3.2 (#28295 ) (#28305 )	2018-02-15 08:23:20 +00:00
Jim Ferenczi	37e938f9de	Fix indices.sort rest test Set the number of replicas to 0 in order to avoid race condition during the test Fixes #24416	2018-02-12 09:03:55 +01:00
Martijn van Groningen	2a35b4ee2b	Use right skip versions. Closes #27570	2018-02-06 12:22:42 +01:00
markharwood	77d2dd203e	Search - add allow_partial_search_results flag with default setting false (#28440 ) Adds allow_partial_search_results flag to search requests with default setting = true. When false, will error if search either timeouts, has partial errors or has missing shards rather than returning partial search results. A cluster-level setting provides a default for search requests with no flag. Closes #27435	2018-01-31 15:51:29 +00:00
Alan Woodward	424ecb3c7d	Add ability to index prefixes on text fields (#28290 ) This adds the ability to index term prefixes into a hidden subfield, enabling prefix queries to be run without multitermquery rewrites. The subfield reuses the analysis chain of its parent text field, appending an EdgeNGramTokenFilter. It can be configured with minimum and maximum ngram lengths. Query terms with lengths outside this min-max range fall back to using prefix queries against the parent text field. The mapping looks like this: "my_text_field" : { "type" : "text", "analyzer" : "english", "index_prefix" : { "min_chars" : 1, "max_chars" : 10 } } Relates to #27049	2018-01-30 08:26:56 +00:00
Jim Ferenczi	95c45aeb5d	Adapt bwc version after backport #28358	2018-01-25 09:26:10 +01:00
Jim Ferenczi	c26d4ac6c1	Always return the after_key in composite aggregation response (#28358 ) This change adds the `after_key` of a composite aggregation directly in the response. It is redundant when all buckets are not filtered/removed by a pipeline aggregation since in this case the `after_key` is always the last bucket in the response. Though when using a pipeline aggregation to filter composite buckets, the `after_key` can be lost if the last bucket is filtered. This commit fixes this situation by always returning the `after_key` in a dedicated section.	2018-01-25 09:15:27 +01:00
Jim Ferenczi	b10d166190	Adapt bwc version after backport #28310	2018-01-24 09:17:30 +01:00
Jim Ferenczi	19cfc25873	Adds the ability to specify a format on composite date_histogram source (#28310 ) This commit adds the ability to specify a date format on the `date_histogram` composite source. If the format is defined, the key for the source is returned as a formatted date. Closes #27923	2018-01-23 15:14:49 +01:00
Adrien Grand	700d9ecc95	Remove the `update_all_types` option. (#28288 ) This option is not useful in 7.x since no indices may have more than one type anymore.	2018-01-22 12:03:07 +01:00
Tanguy Leroux	1f66672d6f	[Test] Fix indices.stats/13_fields Remove the wait_for_active_shards: all added by commit 6256c since it does not work when the cluster has 1 node only.	2018-01-17 15:05:07 +01:00
Tanguy Leroux	6256c330c0	[Test] Wait for no relocating shards in indices.stats/13_fields tests MixedClusterClientYamlTestSuiteIT sometimes fails when executing the indices.stats/13_fields/* REST tests. It does not reproduce locally but the execution logs show that it failed when a shard is relocating during the set up execution. This commit change the set up so that it now waits for all shards to be active before executing the tests. closes #26732, #27146	2018-01-17 13:35:29 +01:00
Igor Motov	c75ac319a6	Add ability to associate an ID with tasks (#27764 ) Adds support for capturing the X-Opaque-Id header from a REST request and storing it's value in the tasks that this request started. It works for all user-initiated tasks (not only search). Closes #23250 Usage: ``` $ curl -H "X-Opaque-Id: imotov" -H "foo:bar" "localhost:9200/_tasks?pretty&group_by=parents" { "tasks" : { "7qrTVbiDQKiZfubUP7DPkg:6998" : { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6998, "type" : "transport", "action" : "cluster:monitor/tasks/lists", "start_time_in_millis" : 1513029940042, "running_time_in_nanos" : 266794, "cancellable" : false, "headers" : { "X-Opaque-Id" : "imotov" }, "children" : [ { "node" : "V-PuCjPhRp2ryuEsNw6V1g", "id" : 6088, "type" : "netty", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 67785, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } }, { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6999, "type" : "direct", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 98754, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } } ] } } } ```	2018-01-12 15:34:17 -05:00
Clinton Gormley	f15c2044c4	Fixed the cat.health REST test to accept 4ms, not just 4.0ms (#28186 )	2018-01-12 11:19:57 +01:00
kel	6c963379d9	Ignore null value for range field (#27845 ) (#28116 ) Currently when adding a document with a `null` value for a range field, the range field mapper raises an error. Instead we should ignore null like we do eg. with numbers or geo points. Closes #27845	2018-01-10 19:10:06 +01:00
Simon Willnauer	47a506d2db	Bump compat version for local depdendent test to 6.2.0	2018-01-04 16:49:31 +01:00
Simon Willnauer	b68f7ed8c3	Pass `java.locale.providers=COMPAT` to Java 9 onwards (#28080 ) Java 9 added some enhancements to the internationalization support that impact our date parsing support. To ensure flawless BWC and consistent behavior going forward Java 9 runtimes requrie the system property `java.locale.providers=COMPAT` to be set. Closes #10984	2018-01-04 16:43:51 +01:00
Mayya Sharipova	dcde895f49	Introduce limit to the number of terms in Terms Query (#27968 ) - Introduce index level settings to control the maximum number of terms that can be used in a Terms Query - Throw an error if a request exceeds this max number Closes #18829	2017-12-28 17:36:29 -05:00
Jim Ferenczi	0b2c8c835e	Fix composite aggregation when after term is missing in the shard (#27936 ) This change fixes a bug when a keyword term in the `after` key is not present in the shard. In this case the global ord of the document values are compared with the insertion point of the `after` keyword and values that are equal to the insertion point should be considered "after" the top value.	2017-12-26 09:58:49 +01:00
olcbean	7f2f59eb85	delete `operation_threading` from the rest specs (#27940 )	2017-12-21 13:09:11 -08:00
Mayya Sharipova	cbd271e497	Limit the analyzed text for highlighting (#27934 ) * Limit the analyzed text for highlighting - Introduce index level settings to control the max number of character to be analyzed for highlighting - Throw an error if analysis is required on a larger text Closes #27517	2017-12-21 10:19:58 -05:00
Jim Ferenczi	c753b82ca8	Adapt scroll rest test after backport. relates #27842	2017-12-21 09:31:56 +01:00
Colin Goodheart-Smithe	4cbbe3ed93	Fixes DocStats to not report index size < -1 (#27863 ) Previously to this change when DocStats are added together (for example when adding the index size of all primary shards for an index) we naively added the `totalSizeInBytes` together. This worked most of the time but not when the index size on one or multiple shards was reported to be `-1` (no value). This change improves the logic by considering if the current value or the value to be added is `-1`: * If the current and new value are both `-1` the value remains at `-1` * If the current value is `-1` and the new value is not `-1`, current value is changed to be equal to the new value * If the current value is not `-1` and the new value is `-1` the new value is ignored and the current value is not changed * If both the current and new values are not `-1` the current value is changed to be equal to the sum of the current and new values. The change also re-enables the failing rollover YAML test that was failing due to this bug.	2017-12-20 14:45:09 +00:00
Stuart Cam	e458c6b762	timestamp and ttl in index requests (#27888 ) timestamp and ttl are not accepted anymore as parameters of index/update requests.	2017-12-20 10:43:21 +11:00
Christoph Büscher	05aa1a6033	[Tests] Remove redudant rest test added in #27900 The removed rest test doesn't really test the the issue reported in #27841 and adds nothing on top of the unit test.	2017-12-19 20:04:37 +01:00
Christoph Büscher	fb2fd4e8ee	Fix preserving FiltersAggregationBuilder#keyed field on rewrite (#27900 ) Currently FiltersAggregationBuilder#doRewrite creates a new FiltersAggregationBuilder which doesn't correctly copy the original "keyed" field if a non-keyed filter gets rewritten. This can cause rendering bugs of the output aggregations like the one reported in #27841. Closes #27841	2017-12-19 19:56:12 +01:00
kel	7a27a2770b	Reject scroll query if size is 0 (#22552 ) (#27842 )	2017-12-18 10:38:41 +01:00
Jim Ferenczi	55b71a871b	Adapt rest test after backport. Relates #27833	2017-12-18 10:36:44 +01:00
Jason Tedor	75c0cd0672	Move range field mapper back to core This commit moves the range field mapper back to core so that we can remove the compile-time dependency of percolator on mapper-extras which compilcates dependency management for the percolator client JAR, and modules should not be intertwined like this anyway. Relates #27854	2017-12-17 14:27:10 -05:00
kel	f5e0932c8d	Add version support for inner hits in field collapsing (#27822 ) (#27833 ) Add version support for inner hits in field collapsing	2017-12-15 18:00:40 +01:00
Christoph Büscher	52cb6c8ef2	Merge branch 'master' into rankeval	2017-12-07 14:22:46 +01:00
Jim Ferenczi	caea6b70fa	Add a new cluster setting to limit the total number of buckets returned by a request (#27581 ) This commit adds a new dynamic cluster setting named `search.max_buckets` that can be used to limit the number of buckets created per shard or by the reduce phase. Each multi bucket aggregator can consume buckets during the final build of the aggregation at the shard level or during the reduce phase (final or not) in the coordinating node. When an aggregator consumes a bucket, a global count for the request is incremented and if this number is greater than the limit an exception is thrown (TooManyBuckets exception). This change adds the ability for multi bucket aggregator to "consume" buckets in the global limit, the default is 10,000. It's an opt-in consumer so each multi-bucket aggregator must explicitly call the consumer when a bucket is added in the response. Closes #27452 #26012	2017-12-06 09:15:28 +01:00
Christoph Büscher	bbec33d35c	Merge branch 'master' into rankeval	2017-12-04 12:57:19 +01:00
Mayya Sharipova	c6b73239ae	Limit the number of tokens produced by _analyze (#27529 ) Add an index level setting `index.analyze.max_token_count` to control the number of generated tokens in the _analyze endpoint. Defaults to 10000. Throw an error if the number of generated tokens exceeds this limit. Closes #27038	2017-11-30 11:54:39 -05:00
Tanguy Leroux	41f73e0acf	Fix version for include_global_state in Snapshot Status API It also adds a Rest test. Related #26853	2017-11-30 11:33:01 +01:00
Christoph Büscher	35688f6441	Merge branch 'master' into rankeval	2017-11-29 15:24:06 +01:00
Martijn van Groningen	cb1204774b	Include the _index, _type and _id to nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know to what documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
Nhat Nguyen	8d6bfe53bb	Remove workaround in translog rest test (#27530 ) Relates #25623 and `a6db0ea908`	2017-11-27 09:41:30 -05:00
Christoph Büscher	5661b1c3df	Merge branch 'master' into rankeval	2017-11-24 16:25:05 +01:00
Nhat Nguyen	06d35f4f01	Backport wait_for_initialiazing_shards to cluster health API Relates #27489	2017-11-24 09:56:16 -05:00
Nhat Nguyen	46b508d6c9	Add wait_for_no_initializing_shards to cluster health API (#27489 ) This adds a new option to the cluster health request allowing to wait until there is no initializing shards. Closes #25623	2017-11-23 15:09:58 -05:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Mayya Sharipova	57e4d10007	Limit the number of nested documents (#27405 ) Add an index level setting `index.mapping.nested_objects.limit` to control the number of nested json objects that can be in a single document across all fields. Defaults to 10000. Throw an error if the number of created nested documents exceed this limit during the parsing of a document. Closes #26962	2017-11-22 10:16:28 -05:00
Jim Ferenczi	90d2ead14a	Adapt rest test BWC version after backport Relates #26800	2017-11-21 15:45:02 +01:00

... 4 5 6 7 8 ...

1307 Commits