OpenSearch

Commit Graph

Author	SHA1	Message	Date
Martijn van Groningen	07a57cc131	Move number of language analyzers to analysis-common module (#31143 ) The following analyzers were moved from server module to analysis-common module: `snowball`, `arabic`, `armenian`, `basque`, `bengali`, `brazilian`, `bulgarian`, `catalan`, `chinese`, `cjk`, `czech`, `danish`, `dutch`, `english`, `finnish`, `french`, `galician` and `german`. Relates to #23658	2018-06-08 08:58:46 +02:00
Jim Ferenczi	b30aa3137d	Reject long regex in query_string (#31136 ) This change applies the existing `index.max_regex_length` to regex queries produced by the `query_string` query. Relates #28344	2018-06-07 09:29:26 +02:00
Lee Hinman	b22a055bcf	Add get mappings support to high-level rest client (#30889 ) This adds support for the get mappings API to the high level rest client. Relates to #27205	2018-06-04 14:31:08 -06:00
Jim Ferenczi	f94a75778c	Fix index prefixes to work with span_multi (#31066 ) * Fix index prefixes to work with span_multi Text fields that use `index_prefixes` can rewrite `prefix` queries into `term` queries internally. This commit fix the handling of this rewriting in the `span_multi` query. This change also copies the index options of the text field into the prefix field in order to be able to run positional queries. This is mandatory for `span_multi` to work but this could also be useful to optimize `match_phrase_prefix` queries in a follow up. Note that this change can only be done on indices created after 6.3 since we set the index options to doc only in this version. Fixes #31056	2018-06-04 21:48:56 +02:00
Alan Woodward	0427339ab0	Index phrases (#30450 ) Specifying `index_phrases: true` on a text field mapping will add a subsidiary [field]._index_phrase field, indexing two-term shingles from the parent field. The parent analysis chain is re-used, wrapped with a FixedShingleFilter. At query time, if a phrase match query is executed, the mapping will redirect it to run against the subsidiary field. This should trade faster phrase querying for a larger index and longer indexing times. Relates to #27049	2018-06-04 08:50:35 +01:00
Michael Basnight	d826cb36c3	Remove version read/write logic in Verify Response (#30879 ) Since master will always communicate with a >=6.4 node, the logic for checking if the node is 6.4 and conditionally reading and writing based on that can be removed from master. This logic will stay in 6.x as it is the bridge to the cleaner response in master. This also unmutes the failing test due to this bwc change. Closes #30807	2018-05-31 12:10:01 -05:00
Jim Ferenczi	0f5e570184	Deprecates indexing and querying a context completion field without context (#30712 ) This change deprecates completion queries and documents without context that target a context enabled completion field. Querying without context degrades the search performance considerably (even when the number of indexed contexts is low). This commit targets master but the deprecation will take place in 6.x and the functionality will be removed in 7 in a follow up. Closes #29222	2018-05-31 16:09:48 +02:00
Jim Ferenczi	f582418ada	Fix missing option serialization after backport Relates #29465	2018-05-30 12:55:31 +02:00
Jim Ferenczi	e33d107f84	Add missing_bucket option in the composite agg (#29465 ) This change adds a new option to the composite aggregation named `missing_bucket`. This option can be set by source and dictates whether documents without a value for the source should be ignored. When set to true, documents without a value for a field emits an explicit `null` value which is then added in the composite bucket. The `missing` option that allows to set an explicit value (instead of `null`) is deprecated in this change and will be removed in a follow up (only in 7.x). This commit also changes how the big arrays are allocated, instead of reserving the provided `size` for all sources they are created with a small intial size and they grow depending on the number of buckets created by the aggregation: Closes #29380	2018-05-30 09:48:40 +02:00
Alan Woodward	67905c85a5	Rename index_prefix to index_prefixes (#30932 ) This commit also adds index_prefixes tests to TextFieldMapperTests to ensure that cloning and wire-serialization work correctly	2018-05-30 08:32:31 +01:00
Martijn van Groningen	544822c78b	Moved keyword tokenizer to analysis-common module (#30642 ) Relates to #23658	2018-05-29 19:22:28 +02:00
Vladimir Dolzhenko	b55b079a90	Include size of snapshot in snapshot metadata #18543 , bwc clean up (#30890 )	2018-05-26 21:20:44 +02:00
Vladimir Dolzhenko	81eb8ba0f0	Include size of snapshot in snapshot metadata (#29602 ) Include size of snapshot in snapshot metadata Adds difference of number of files (and file sizes) between prev and current snapshot. Total number/size reflects total number/size of files in snapshot. Closes #18543	2018-05-25 21:04:50 +02:00
Tom Callahan	36fbb4cb48	Harmonize include_defaults tests (#30700 ) This PR breaks the include_defaults functionality of the get settings API into its own test, which is skipped for mixed-mode clusters containing pre-6.4 nodes.	2018-05-25 09:41:16 -04:00
David Roberts	40534ccabc	[TEST] Mute {p0=snapshot.get_repository/10_basic/Verify created repository} YAML test Issue is #30807	2018-05-25 12:58:02 +01:00
Michael Basnight	e1ffbeb824	Fix bad version check writing Repository nodes (#30846 ) The writeTo method of VerifyRepositoryResponse incorrectly used its local version to determine what it was receiving, rather than the sender's version. This fixes a bug that ocassionally happened when a 6.4 master node sent data to a 7.0 client, causing the number of bytes to be improperly read. This also unmutes the test. Closes #30807	2018-05-24 19:21:57 -05:00
Julie Tibshirani	f55b09bae4	Update the version checks around ip_range bucket keys, now that the change was backported.	2018-05-24 12:04:18 -07:00
Julie Tibshirani	638a719370	Ensure that ip_range aggregations always return bucket keys. (#30701 )	2018-05-24 08:55:14 -07:00
Adrien Grand	405eb7a751	Change serialization version of doc-value fields. Relates #29639	2018-05-23 18:34:05 +02:00
Adrien Grand	a19df4ab3b	Add a `format` option to `docvalue_fields`. (#29639 ) This commit adds the ability to configure how a docvalue field should be formatted, so that it would be possible eg. to return a date field formatted as the number of milliseconds since Epoch. Closes #27740	2018-05-23 14:39:04 +02:00
Colin Goodheart-Smithe	483b25330b	Mustes {p0=snapshot.get_repository/10_basic/*} YAML test This is awaiting a fix for https://github.com/elastic/elasticsearch/issues/30807	2018-05-23 11:32:58 +01:00
olcbean	af8ad8d172	Add more yaml tests for get alias API (#29513 )	2018-05-22 11:48:28 +02:00
Jason Tedor	d68c44b76c	Default copy settings to true and deprecate on the REST layer (#30598 ) This commit defaults the copy_settings REST parameter to the shrink and split APIs to true, and deprecates the parameter.	2018-05-18 10:12:08 -04:00
Zachary Tong	d120fb222c	[TEST] Adjust version skips for movavg/movfn tests Since the MovFn PR was backported to 6.x, we can adjust the version skip numbers in master to correctly match 6.3.99 instead of 6.4.0	2018-05-17 18:07:52 +00:00
Mayya Sharipova	3dfa93ef7c	Improve explanation in rescore (#30629 ) Currently in a rescore request if window_size is smaller than the top N documents returned (N=size), explanation of scores could be incorrect for documents that were a part of topN and not part of rescoring. This PR corrects this, but saving in RescoreContext docIDs of documents for which rescoring was applied, and adding rescoring explanation only for these docIDs. Closes #28725	2018-05-17 07:09:18 -04:00
Tanguy Leroux	2ac1f9fe89	Fix _cluster/state to always return cluster_uuid (#30656 ) Since #30143, the Cluster State API should always returns the current cluster_uuid in the response body, regardless of the metrics filters. This is not exactly true as it is returned only if metadata metrics and no specific indices are requested. This commit fixes the behavior to always return the cluster_uuid and add new test.	2018-05-17 10:58:25 +02:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594 ) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
Simon Willnauer	b50cf3c6b0	Side-step pending deletes check (#30571 ) When we split/shrink an index we open several IndexWriter instances causeing file-deletes to be pending on windows. This subsequently fails when we open an IW to bootstrap the index history due to pending deletes. This change sidesteps the check since we know our history goes forward in terms of files and segments. Closes #30416	2018-05-15 11:51:54 +02:00
Jason Tedor	4e33443690	Adjust versions for resize copy settings (#30578 ) Now that the change to deprecate copy settings and disallow it being explicitly set to false is backported, this commit adjusts the BWC versions in master.	2018-05-14 16:41:25 -04:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Martijn van Groningen	7b95470897	Moved tokenizers to analysis common module (#30538 ) The following tokenizers were moved: classic, edge_ngram, letter, lowercase, ngram, path_hierarchy, pattern, thai, uax_url_email and whitespace. Left keyword tokenizer factory in server module, because normalizers directly depend on it.This should be addressed on a follow up change. Relates to #23658	2018-05-14 07:55:01 +02:00
Jason Tedor	593fdd40ed	Deprecate not copy settings and explicitly disallow (#30404 ) We want copying settings to be the default behavior. This commit deprecates not copying settings, and disallows explicitly not copying settings. This gives users a transition path to the future default behavior.	2018-05-13 10:30:05 -04:00
Tal Levy	34f92df2d3	AwaitsFix IntegTestZipClientYamlTestSuiteIT#indices.split tests there are two tests that have failed multiple times in one day on windows CI. This commit AwaitsFixes them until their timeout issues are resolved. tracking here: https://github.com/elastic/elasticsearch/issues/30503	2018-05-09 18:25:25 -07:00
Boaz Leskes	9f5fe49cec	Disable REST default settings testing until #29229 is back-ported That PR changed the execution path of index settings default to be on the master until the PR is back-ported the old master will not return default settings.	2018-05-07 13:30:14 +02:00
tomcallahan	0a93956194	Add Get Settings API support to java high-level rest client (#29229 ) This PR adds support for the Get Settings API to the java high-level rest client. Furthermore, logic related to the retrieval of default settings has been moved from the rest layer into the transport layer and now default settings may be retrieved consistency via both the rest API and the transport API.	2018-05-04 11:14:28 -04:00
Adrien Grand	bcdf3d5c61	Post backport of #29658 .	2018-05-02 11:43:50 +02:00
Adrien Grand	231a63fdf8	Remove useless version checks in REST tests. (#30165 ) Many tests are added with a version check so that they do not run against a version that doesn't have the feature yet. Master is 7.0, so all tests that do not run against 6.0+ can be removed and the version check can be removed on all tests that always run on 6.0+.	2018-05-02 11:34:15 +02:00
Adrien Grand	7358946bda	Add a new `_ignored` meta field. (#29658 ) This adds a new `_ignored` meta field which indexes and stores fields that have been ignored at index time because of the `ignore_malformed` option. It makes malformed documents easier to identify by using `exists` or `term(s)` queries on the `_ignored` field. Closes #29494	2018-05-02 10:47:02 +02:00
Jason Tedor	5de6f4ff7b	Adjust copy settings on resize BWC version This commit adjusts the BWC version for copy settings on resize operations after the behavior was backported to 6.x.	2018-05-01 08:49:16 -04:00
Jason Tedor	50535423ff	Allow copying source settings on resize operation (#30255 ) Today when an index is created from shrinking or splitting an existing index, the target index inherits almost none of the source index settings. This is surprising and a hassle for operators managing such indices. Given this is the default behavior, we can not simply change it. Instead, we start by introducing the ability to copy settings. This flag can be set on the REST API or on the transport layer and it has the behavior that it copies all settings from the source except non-copyable settings (a property of a setting introduced in this change). Additionally, settings on the request will always override. This change is the first step in our adventure: - this flag is added here in 7.0.0 and immediately deprecated - this flag will be backported to 6.4.0 and remain deprecated - then, we will remove the ability to set this flag to false in 7.0.0 - finally, in 8.0.0 we will remove this flag and the only behavior will be for settings to be copied	2018-05-01 08:48:19 -04:00
Chris Earle	421bd9bd7a	_cluster/state Skip Test for pre-6.4, not pre-7.0 (#30264 ) This updates the skip section for the new `_cluster/state` responses to include 6.4+ now that it has been backported.	2018-04-30 14:53:48 -04:00
Chris Earle	725a5af2c6	_cluster/state should always return cluster_uuid (#30143 ) Currently, the only way to get the REST response for the `/_cluster/state` call to return the `cluster_uuid` is to request the `metadata` metrics, which is one of the most expensive response structures. However, external monitoring agents will likely want the `cluster_uuid` to correlate the response with other API responses whether or not they want cluster metadata.	2018-04-30 10:16:11 -04:00
Julie Tibshirani	f5978d6d33	In the field capabilities API, remove support for providing fields in the request body. (#30185 )	2018-04-27 16:14:11 -07:00
Alexander Reelsen	e1a16a6018	REST: Remove GET support for clear cache indices (#29525 ) Clearing the cache indices can be done via GET and POST. As GET should only support read only operations, this removes the support for using GET for clearing the indices caches.	2018-04-27 08:41:36 +02:00
Jason Tedor	c12c2a6cc9	Rename the bulk thread pool to write thread pool (#29593 ) This commit renames the bulk thread pool to the write thread pool. This is to better reflect the fact that the underlying thread pool is used to execute any document write request (single-document index/delete/update requests, and bulk requests). With this change, we add support for fallback settings thread_pool.bulk.* which will be supported until 7.0.0. We also add a system property so that the display name of the thread pool remains as "bulk" if needed to avoid breaking users.	2018-04-19 08:18:58 -04:00
Martijn van Groningen	8afa7c174f	Added painless execute api. (#29164 ) Added an api that allows to execute an arbitrary script and a result to be returned. ``` POST /_scripts/painless/_execute { "script": { "source": "params.var1 / params.var2", "params": { "var1": 1, "var2": 1 } } } ``` Relates to #27875	2018-04-19 09:33:34 +02:00
Jason Tedor	2b47d67d95	Remove the index thread pool (#29556 ) Now that single-document indexing requests are executed on the bulk thread pool the index thread pool is no longer needed. This commit removes this thread pool from Elasticsearch.	2018-04-18 09:18:08 -04:00
Adrien Grand	d223bcf7ab	Add the `include_type_name` option to the search and document APIs. (#29506 ) This commit add the `include_type_name` option to the `index`, `update`, `delete`, `get`, `bulk` and `search` APIs. When set to `false`, the response will omit the `_type` in the response. This option doesn't work if the endpoint contains a type. For instance, the following call would succeed: ``` GET index/_doc/1?include_type_name=false ``` But the following one would fail: ``` GET index/some_type/1?include_type_name=false ``` Relates #15613	2018-04-17 11:29:08 +02:00
olcbean	b3e3b80f1b	REST high-level client: add support for Indices Update Settings API [take 2] (#29327 ) Relates to #27205	2018-04-16 21:39:11 +02:00
Ke Li	0bfb59dcf2	Using ObjectParser in UpdateRequest (#29293 ) CRUD: Parsing changes for UpdateRequest (#29293) Use `ObjectParser` to parse `UpdateRequest` so we reject unknown fields and drop support for the `_fields` parameter because it was deprecated in 5.x.	2018-04-16 08:39:35 -04:00
Adrien Grand	553c718d66	Make index APIs work without types. (#29479 ) Unlike the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, the index APIs do not need the `include_type_name` option, they can work work with and without types withouth knowing whether types are being used. Internally, `_doc` is used as a type if no type is provided, like for the `indices.put_mapping` API.	2018-04-13 09:08:45 +02:00
Adrien Grand	ebd6b5b7ba	Deprecate filtering on `_type`. (#29468 ) As indices are only allowed to have one type now, and types are going away in the future, we should deprecate filtering by `_type`. Relates #15613	2018-04-13 09:07:51 +02:00
Nhat Nguyen	067fbb8ecd	Backport periodic flush count to v6.3.0 Relates #29360	2018-04-11 17:14:28 -04:00
Nhat Nguyen	4e6a8900a3	Add periodic flush count to flush stats (#29360 ) Currently, a flush stats contains only the total flush which is the sum of manual flush (via API) and periodic flush (async triggered when the uncommitted translog size is exceeded the flush threshold). Sometimes, it's useful to know these two numbers independently. This commit tracks and returns a periodic flush count in a flush stats.	2018-04-11 11:15:33 -04:00
Adrien Grand	6a6c0ea5e6	Add an `include_type_name` option. (#29453 ) This adds an `include_type_name` option to the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, which defaults to `true`. When set to `false`, then mappings will be returned directly in the body of the `indices.get_mapping` API, without keying them by the type name, the `indices.create` will expect mappings directly under the `mappings` key, and the `indices.put_mapping` will use `_doc` as a type name and fail if a `type` is provided explicitly. Relates #15613	2018-04-11 15:54:16 +02:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Adrien Grand	6949c888bf	Remove useless skip. Relates #29353	2018-04-11 09:40:28 +02:00
Adrien Grand	a091d950a7	Deprecate slicing on `_uid`. (#29353 ) Deprecate slicing on `_uid`. `_id` should be used instead on 6.x.	2018-04-10 14:28:30 +02:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Christoph Büscher	570f1d9ac7	Add indices options support to _rank_eval (#29386 ) Currently the ranking evaluation API doesn't support many of the standard parameters of the search API. Some of these make sense, like adding support for the common indices options parameters, which this change adds.	2018-04-06 16:23:19 +02:00
Jason Tedor	8fdca6a89a	Align cat thread pool info to thread pool config (#29195 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to the configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size A previous change addressed this for the nodes info API. This commit changes the display of thread pool info in the cat thread pool API too to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-04-03 17:27:26 -04:00
Johnny Marnell	4db6fc9a08	Reindex: Fix error in delete-by-query rest spec (#29318 )	2018-04-03 11:22:04 -04:00
Mayya Sharipova	e70cd35bda	Revert "REST high-level client: add support for Indices Update Settings API (#28892 )" (#29323 ) This reverts commit `b67b5b1bbd`.	2018-03-30 16:26:46 -07:00
olcbean	b67b5b1bbd	REST high-level client: add support for Indices Update Settings API (#28892 ) Relates to #27205	2018-03-30 10:53:29 +02:00
Jim Ferenczi	dd77d7fd0a	#28745 : remove extra option in the composite rest tests `allow_partial_search_results` is not needed for these tests.	2018-03-26 14:32:59 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Nhat Nguyen	8cfe619f03	Revert "[TEST] Mute index synced flush rest tests" The BWC issue was fixed. This reverts commit `f8830b7b43`.	2018-03-21 16:51:52 -04:00
David Roberts	f8830b7b43	[TEST] Mute index synced flush rest tests Awaiting fix of #29162	2018-03-21 10:09:21 -04:00
Luca Cavanna	ff09c82319	REST high-level client: add clear cache API (#28866 ) * REST high-level client: add clear cache API Relates to #27205 Also Closes #26947 (rest-spec were outdated)	2018-03-20 10:39:36 +01:00
olcbean	3d81497f25	REST: Clear Indices Cache API remove deprecated url params (#29068 ) By the time the master branch is released the deprecated url parameters in the `/_cache/clear` API will have been deprecated for a couple of minor releases. Since master will be the next major release we are fine with removing these parameters.	2018-03-14 16:37:50 -04:00
Nik Everett	3e0e6444cf	Mark field_data as deprecated in /_cache/clear It was deprecated in #28943.	2018-03-14 11:39:35 -04:00
Nik Everett	8f0da37c5e	Revert "REST tests: Increase version to skip to include 6.3" This reverts commit `9f2c4df94d` which isn't needed now that the backport is complete.	2018-03-13 20:22:32 -04:00
Alexander Reelsen	9f2c4df94d	REST tests: Increase version to skip to include 6.3 In order to fix the tests, the correct version needs to be skipped until the backport is done. Relates #28943	2018-03-13 17:00:01 -07:00
Paul Sanwald	6dae955b6a	Document and test date_range "missing" support (#28983 ) * Add a REST integration test that documents date_range support Add a test case that exercises date_range aggregations using the missing option. Addresses #17597 * Test cleanup and correction Adding a document with a null date to exercise `missing` option, update test name to something reasonable. * Update documentation to explain how the "missing" parameter works for date_range aggregations. * Wrap lines at 80 chars in docs. * Change format of test to YAML for readability.	2018-03-13 12:58:30 -07:00
olcbean	edc57f6f34	REST: deprecate `field_data` in Clear Cache API (#28943 ) We call it `fielddata` everywhere else in the code and API so we may as well be consistent.	2018-03-13 15:16:27 -04:00
olcbean	3cf599be4c	REST api specs : remove unsupported `wait_for_merge` param (#28959 )	2018-03-13 12:12:52 +01:00
Spencer	9709d89daf	[rest-api-spec] update doc link for /_rank_eval	2018-03-08 16:06:23 -07:00
Jim Ferenczi	bc8b3fc71c	Revert "Rescore collapsed documents (#28521 )" This reverts commit `f057fc294a`. The rescorer does not resort the collapsed values inside the top docs during rescoring. For this reason the Lucene rescorer is not compatible with collapsing. Relates #27243	2018-03-08 11:20:29 +01:00
Sergey Galkin	f057fc294a	Rescore collapsed documents (#28521 ) This change adds the ability to rescore collapsed documents.	2018-03-04 13:39:50 -08:00
Andrew Odendaal	f207aacac0	Missing `timeout` parameter from the REST API spec JSON files (#28328 ) Closes #28200	2018-03-02 14:47:06 +01:00
olcbean	25834e2d26	REST api spec: remove a common param from nodes.usage.json (#28835 ) "human" is a common parameter defined in `_common.json` It should not be repeated again for a concrete api. Closes #28226	2018-03-01 10:58:38 +01:00
olcbean	ec0a1b36c9	REST api spec: remove unsupported parameter `parent_node` (#28841 )	2018-03-01 10:57:25 +01:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Nhat Nguyen	30598627ce	Fix BWC issue of the translog last modified age stats We added a rest test for the translog last modified age without a version check. This causes BWC failed because the stats are not available in the old versions. Relates #28613	2018-02-16 14:55:20 -05:00
Justin Wyer	5aeb479ffd	Add translog file age to Translog Stats (#28613 ) Expose the age of translog files in the translog stats. This is useful to reason about your translog retention policy. Closes #28189	2018-02-16 16:23:33 +01:00
Boaz Leskes	beb55d148a	Simplify the Translog constructor by always expecting an existing translog (#28676 ) Currently the Translog constructor is capable both of opening an existing translog and creating a new one (deleting existing files). This PR separates these two into separate code paths. The constructors opens files and a dedicated static methods creates an empty translog.	2018-02-15 09:24:09 +01:00
Ke Li	fc406c9a5a	Upgrade t-digest to 3.2 (#28295 ) (#28305 )	2018-02-15 08:23:20 +00:00
Jim Ferenczi	37e938f9de	Fix indices.sort rest test Set the number of replicas to 0 in order to avoid race condition during the test Fixes #24416	2018-02-12 09:03:55 +01:00
Martijn van Groningen	2a35b4ee2b	Use right skip versions. Closes #27570	2018-02-06 12:22:42 +01:00
markharwood	77d2dd203e	Search - add allow_partial_search_results flag with default setting false (#28440 ) Adds allow_partial_search_results flag to search requests with default setting = true. When false, will error if search either timeouts, has partial errors or has missing shards rather than returning partial search results. A cluster-level setting provides a default for search requests with no flag. Closes #27435	2018-01-31 15:51:29 +00:00
Alan Woodward	424ecb3c7d	Add ability to index prefixes on text fields (#28290 ) This adds the ability to index term prefixes into a hidden subfield, enabling prefix queries to be run without multitermquery rewrites. The subfield reuses the analysis chain of its parent text field, appending an EdgeNGramTokenFilter. It can be configured with minimum and maximum ngram lengths. Query terms with lengths outside this min-max range fall back to using prefix queries against the parent text field. The mapping looks like this: "my_text_field" : { "type" : "text", "analyzer" : "english", "index_prefix" : { "min_chars" : 1, "max_chars" : 10 } } Relates to #27049	2018-01-30 08:26:56 +00:00
Jim Ferenczi	95c45aeb5d	Adapt bwc version after backport #28358	2018-01-25 09:26:10 +01:00
Jim Ferenczi	c26d4ac6c1	Always return the after_key in composite aggregation response (#28358 ) This change adds the `after_key` of a composite aggregation directly in the response. It is redundant when all buckets are not filtered/removed by a pipeline aggregation since in this case the `after_key` is always the last bucket in the response. Though when using a pipeline aggregation to filter composite buckets, the `after_key` can be lost if the last bucket is filtered. This commit fixes this situation by always returning the `after_key` in a dedicated section.	2018-01-25 09:15:27 +01:00
Jim Ferenczi	b10d166190	Adapt bwc version after backport #28310	2018-01-24 09:17:30 +01:00
Jim Ferenczi	19cfc25873	Adds the ability to specify a format on composite date_histogram source (#28310 ) This commit adds the ability to specify a date format on the `date_histogram` composite source. If the format is defined, the key for the source is returned as a formatted date. Closes #27923	2018-01-23 15:14:49 +01:00
Adrien Grand	700d9ecc95	Remove the `update_all_types` option. (#28288 ) This option is not useful in 7.x since no indices may have more than one type anymore.	2018-01-22 12:03:07 +01:00
Tanguy Leroux	1f66672d6f	[Test] Fix indices.stats/13_fields Remove the wait_for_active_shards: all added by commit 6256c since it does not work when the cluster has 1 node only.	2018-01-17 15:05:07 +01:00
Tanguy Leroux	6256c330c0	[Test] Wait for no relocating shards in indices.stats/13_fields tests MixedClusterClientYamlTestSuiteIT sometimes fails when executing the indices.stats/13_fields/* REST tests. It does not reproduce locally but the execution logs show that it failed when a shard is relocating during the set up execution. This commit change the set up so that it now waits for all shards to be active before executing the tests. closes #26732, #27146	2018-01-17 13:35:29 +01:00
Igor Motov	c75ac319a6	Add ability to associate an ID with tasks (#27764 ) Adds support for capturing the X-Opaque-Id header from a REST request and storing it's value in the tasks that this request started. It works for all user-initiated tasks (not only search). Closes #23250 Usage: ``` $ curl -H "X-Opaque-Id: imotov" -H "foo:bar" "localhost:9200/_tasks?pretty&group_by=parents" { "tasks" : { "7qrTVbiDQKiZfubUP7DPkg:6998" : { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6998, "type" : "transport", "action" : "cluster:monitor/tasks/lists", "start_time_in_millis" : 1513029940042, "running_time_in_nanos" : 266794, "cancellable" : false, "headers" : { "X-Opaque-Id" : "imotov" }, "children" : [ { "node" : "V-PuCjPhRp2ryuEsNw6V1g", "id" : 6088, "type" : "netty", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 67785, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } }, { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6999, "type" : "direct", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 98754, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } } ] } } } ```	2018-01-12 15:34:17 -05:00
Clinton Gormley	f15c2044c4	Fixed the cat.health REST test to accept 4ms, not just 4.0ms (#28186 )	2018-01-12 11:19:57 +01:00
kel	6c963379d9	Ignore null value for range field (#27845 ) (#28116 ) Currently when adding a document with a `null` value for a range field, the range field mapper raises an error. Instead we should ignore null like we do eg. with numbers or geo points. Closes #27845	2018-01-10 19:10:06 +01:00
Simon Willnauer	47a506d2db	Bump compat version for local depdendent test to 6.2.0	2018-01-04 16:49:31 +01:00
Simon Willnauer	b68f7ed8c3	Pass `java.locale.providers=COMPAT` to Java 9 onwards (#28080 ) Java 9 added some enhancements to the internationalization support that impact our date parsing support. To ensure flawless BWC and consistent behavior going forward Java 9 runtimes requrie the system property `java.locale.providers=COMPAT` to be set. Closes #10984	2018-01-04 16:43:51 +01:00
Mayya Sharipova	dcde895f49	Introduce limit to the number of terms in Terms Query (#27968 ) - Introduce index level settings to control the maximum number of terms that can be used in a Terms Query - Throw an error if a request exceeds this max number Closes #18829	2017-12-28 17:36:29 -05:00
Jim Ferenczi	0b2c8c835e	Fix composite aggregation when after term is missing in the shard (#27936 ) This change fixes a bug when a keyword term in the `after` key is not present in the shard. In this case the global ord of the document values are compared with the insertion point of the `after` keyword and values that are equal to the insertion point should be considered "after" the top value.	2017-12-26 09:58:49 +01:00
olcbean	7f2f59eb85	delete `operation_threading` from the rest specs (#27940 )	2017-12-21 13:09:11 -08:00
Mayya Sharipova	cbd271e497	Limit the analyzed text for highlighting (#27934 ) * Limit the analyzed text for highlighting - Introduce index level settings to control the max number of character to be analyzed for highlighting - Throw an error if analysis is required on a larger text Closes #27517	2017-12-21 10:19:58 -05:00
Jim Ferenczi	c753b82ca8	Adapt scroll rest test after backport. relates #27842	2017-12-21 09:31:56 +01:00
Colin Goodheart-Smithe	4cbbe3ed93	Fixes DocStats to not report index size < -1 (#27863 ) Previously to this change when DocStats are added together (for example when adding the index size of all primary shards for an index) we naively added the `totalSizeInBytes` together. This worked most of the time but not when the index size on one or multiple shards was reported to be `-1` (no value). This change improves the logic by considering if the current value or the value to be added is `-1`: * If the current and new value are both `-1` the value remains at `-1` * If the current value is `-1` and the new value is not `-1`, current value is changed to be equal to the new value * If the current value is not `-1` and the new value is `-1` the new value is ignored and the current value is not changed * If both the current and new values are not `-1` the current value is changed to be equal to the sum of the current and new values. The change also re-enables the failing rollover YAML test that was failing due to this bug.	2017-12-20 14:45:09 +00:00
Stuart Cam	e458c6b762	timestamp and ttl in index requests (#27888 ) timestamp and ttl are not accepted anymore as parameters of index/update requests.	2017-12-20 10:43:21 +11:00
Christoph Büscher	05aa1a6033	[Tests] Remove redudant rest test added in #27900 The removed rest test doesn't really test the the issue reported in #27841 and adds nothing on top of the unit test.	2017-12-19 20:04:37 +01:00
Christoph Büscher	fb2fd4e8ee	Fix preserving FiltersAggregationBuilder#keyed field on rewrite (#27900 ) Currently FiltersAggregationBuilder#doRewrite creates a new FiltersAggregationBuilder which doesn't correctly copy the original "keyed" field if a non-keyed filter gets rewritten. This can cause rendering bugs of the output aggregations like the one reported in #27841. Closes #27841	2017-12-19 19:56:12 +01:00
kel	7a27a2770b	Reject scroll query if size is 0 (#22552 ) (#27842 )	2017-12-18 10:38:41 +01:00
Jim Ferenczi	55b71a871b	Adapt rest test after backport. Relates #27833	2017-12-18 10:36:44 +01:00
Jason Tedor	75c0cd0672	Move range field mapper back to core This commit moves the range field mapper back to core so that we can remove the compile-time dependency of percolator on mapper-extras which compilcates dependency management for the percolator client JAR, and modules should not be intertwined like this anyway. Relates #27854	2017-12-17 14:27:10 -05:00
kel	f5e0932c8d	Add version support for inner hits in field collapsing (#27822 ) (#27833 ) Add version support for inner hits in field collapsing	2017-12-15 18:00:40 +01:00
Christoph Büscher	52cb6c8ef2	Merge branch 'master' into rankeval	2017-12-07 14:22:46 +01:00
Jim Ferenczi	caea6b70fa	Add a new cluster setting to limit the total number of buckets returned by a request (#27581 ) This commit adds a new dynamic cluster setting named `search.max_buckets` that can be used to limit the number of buckets created per shard or by the reduce phase. Each multi bucket aggregator can consume buckets during the final build of the aggregation at the shard level or during the reduce phase (final or not) in the coordinating node. When an aggregator consumes a bucket, a global count for the request is incremented and if this number is greater than the limit an exception is thrown (TooManyBuckets exception). This change adds the ability for multi bucket aggregator to "consume" buckets in the global limit, the default is 10,000. It's an opt-in consumer so each multi-bucket aggregator must explicitly call the consumer when a bucket is added in the response. Closes #27452 #26012	2017-12-06 09:15:28 +01:00
Christoph Büscher	bbec33d35c	Merge branch 'master' into rankeval	2017-12-04 12:57:19 +01:00
Mayya Sharipova	c6b73239ae	Limit the number of tokens produced by _analyze (#27529 ) Add an index level setting `index.analyze.max_token_count` to control the number of generated tokens in the _analyze endpoint. Defaults to 10000. Throw an error if the number of generated tokens exceeds this limit. Closes #27038	2017-11-30 11:54:39 -05:00
Tanguy Leroux	41f73e0acf	Fix version for include_global_state in Snapshot Status API It also adds a Rest test. Related #26853	2017-11-30 11:33:01 +01:00
Christoph Büscher	35688f6441	Merge branch 'master' into rankeval	2017-11-29 15:24:06 +01:00
Martijn van Groningen	cb1204774b	Include the _index, _type and _id to nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know to what documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
Nhat Nguyen	8d6bfe53bb	Remove workaround in translog rest test (#27530 ) Relates #25623 and `a6db0ea908`	2017-11-27 09:41:30 -05:00
Christoph Büscher	5661b1c3df	Merge branch 'master' into rankeval	2017-11-24 16:25:05 +01:00
Nhat Nguyen	06d35f4f01	Backport wait_for_initialiazing_shards to cluster health API Relates #27489	2017-11-24 09:56:16 -05:00
Nhat Nguyen	46b508d6c9	Add wait_for_no_initializing_shards to cluster health API (#27489 ) This adds a new option to the cluster health request allowing to wait until there is no initializing shards. Closes #25623	2017-11-23 15:09:58 -05:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Mayya Sharipova	57e4d10007	Limit the number of nested documents (#27405 ) Add an index level setting `index.mapping.nested_objects.limit` to control the number of nested json objects that can be in a single document across all fields. Defaults to 10000. Throw an error if the number of created nested documents exceed this limit during the parsing of a document. Closes #26962	2017-11-22 10:16:28 -05:00
Jim Ferenczi	90d2ead14a	Adapt rest test BWC version after backport Relates #26800	2017-11-21 15:45:02 +01:00
Christoph Büscher	d979ccace9	Merge branch 'master' into rankeval	2017-11-21 14:11:02 +01:00
Jim Ferenczi	6319424e4a	Move composite aggregation to core (#27474 ) This change removes the module named aggs-composite and adds the `composite` aggs as a core aggregation. This allows other plugins to use this new aggregation and simplifies the integration in the HL rest client.	2017-11-21 13:31:01 +01:00
Simon Willnauer	8aba7c8bbe	Fix test BWC version after backport Relates to #27468	2017-11-21 12:31:04 +01:00
Simon Willnauer	ea35abca28	Protect shard splitting from illegal target shards (#27468 ) While we have an assertion that checks if the number of routing shards is a multiple of the number of shards we need a real hard exception that checks this way earlier. This change adds a check and test that is executed before we create the index. Relates to #26931	2017-11-21 12:09:45 +01:00
Luca Cavanna	29450de7b5	Cross Cluster Search: make remote clusters optional (#27182 ) Today Cross Cluster Search requires at least one node in each remote cluster to be up once the cross cluster search is run. Otherwise the whole search request fails despite some of the data (either local and/or remote) is available. This happens when performing the _search/shards calls to find out which remote shards the query has to be executed on. This scenario is different from shard failures that may happen later on when the query is actually executed, in case e.g. remote shards are missing, which is not going to fail the whole request but rather yield partial results, and the _shards section in the response will indicate that. This commit introduces a boolean setting per cluster called search.remote.$cluster_alias.skip_if_disconnected, set to false by default, which allows to skip certain clusters if they are down when trying to reach them through a cross cluster search requests. By default all clusters are mandatory. Scroll requests support such setting too when they are first initiated (first search request with scroll parameter), but subsequent scroll rounds (_search/scroll endpoint) will fail if some of the remote clusters went down meanwhile. The search API response contains now a new _clusters section, similar to the _shards section, that gets returned whenever one or more clusters were disconnected and got skipped: "_clusters" : { "total" : 3, "successful" : 2, "skipped" : 1 } Such section won't be part of the response if no clusters have been skipped. The per cluster skip_unavailable setting value has also been added to the output of the remote/info API.	2017-11-21 11:41:47 +01:00
Zachary Tong	196dbf3357	Add YAML REST tests for filters bucket agg (#27128 ) Related to #26220	2017-11-20 16:44:30 -05:00
Simon Willnauer	28e5cf933f	Bump test version after backport Relates to #27455	2017-11-20 16:54:59 +01:00
Simon Willnauer	720e96e288	Ensure nested documents have consistent version and seq_ids (#27455 ) Today we index dummy values for seq_ids and version on nested documents. This is on the one hand trappy since users can request these values via inner hits and on the other hand not necessarily good for compression since the dummy value will likely not compress well when seqIDs are lowish. This change ensures that we share the same field values for all documents in a nested block. This won't have any overhead, in-fact it might be more efficient since we even reduce the work needed slightly.	2017-11-20 16:50:08 +01:00
Mayya Sharipova	858b2c7cb8	Standardize underscore requirements in parameters (#27414 ) Stardardize underscore requirements in parameters across different type of requests: _index, _type, _source, _id keep their underscores params like version and retry_on_conflict will be without underscores Throw an error if older versions of parameters are used BulkRequest, MultiGetRequest, TermVectorcRequest, MoreLikeThisQuery were changed Closes #26886	2017-11-17 15:31:52 -05:00
Yannick Welsch	3b963dcfe5	Stop skipping REST test after backport of #27056	2017-11-16 16:08:10 +01:00
kel	6b817489f3	Fix default value of ignore_unavailable for snapshot REST API (#27056 ) The default value for ignore_unavailable did not match what was documented when using the REST APIs for snapshot creation and restore. This commit sets the default value of ignore_unavailable to false, the way it is documented and ensures it's the same when using either REST API or transport client. Closes #25359	2017-11-16 16:03:09 +01:00
Clinton Gormley	1caa5c8e32	Rest test fixes (#27354 ) * REST: Rename ingest.processor.grok to ingest.processor_grok * REST: Rename remote.info to cluster.remote_info * REST: Fixed bad YAML comments * REST: Force dummy scripts to be strings, not numbers * REST: Fix bad YAML in search/110_field_collapsing.yml * REST: Adjust percentile tests to work with Perl number handling	2017-11-14 11:14:14 +01:00
Jim Ferenczi	29331f1127	Fail queries with scroll that explicitely set request_cache (#27342 ) Queries that create a scroll context cannot use the cache. They modify the search context during their execution so using the cache can lead to duplicate result for the next scroll query. This change fails the entire request if the request_cache option is explictely set on a query that creates a scroll context (`scroll=1m`) and make sure internally that we never use the cache for these queries when the option is not explicitely used. For 6.x a deprecation log will be printed instead of failing the entire request and the request_cache hint will be ignored (forced to false).	2017-11-10 16:02:06 +01:00
Boaz Leskes	ace446f335	Update shrink's bwc version to 6.1.0 and enabled bwc tests	2017-11-07 15:35:46 +01:00
olcbean	7f593a26a3	Setting url parts as required to reflect the code base (#27263 )	2017-11-06 09:58:27 -07:00
Nick Lang	09294a9b9a	keys in aggs percentiles need to be in quotes. (#26905 ) Languages which are stronger typed will failed when comparing these results	2017-11-06 17:45:04 +01:00
Russ Cam	a0bdedb143	Align routing param type with search.json (#26958 ) Relates https://github.com/elastic/elasticsearch-net/issues/2869	2017-11-06 17:34:22 +01:00
Simon Willnauer	bd7efa908a	Add ability to split shards (#26931 ) This change adds a new `_split` API that allows to split indices into a new index with a power of two more shards that the source index. This API works alongside the `_shrink` API but doesn't require any shard relocation before indices can be split. The split operation is conceptually an inverse `_shrink` operation since we initialize the index with a _syntetic_ number of routing shards that are used for the consistent hashing at index time. Compared to indices created with earlier versions this might produce slightly different shard distributions but has no impact on the per-index backwards compatibility. For now, the user is required to prepare an index to be splittable by setting the `index.number_of_routing_shards` at index creation time. The setting allows the user to prepare the index to be splittable in factors of `index.number_of_routing_shards` ie. if the index is created with `index.number_of_routing_shards: 16` and `index.number_of_shards: 2` it can be split into `4, 8, 16` shards. This is an intermediate step until we can make this the default. This also allows us to safely backport this change to 6.x. The `_split` operation is implemented internally as a DeleteByQuery on the lucene level that is executed while the primary shards execute their initial recovery. Subsequent merges that are triggered due to this operation will not be executed immediately. All merges will be deferred unti the shards are started and will then be throttled accordingly. This change is intended for the 6.1 feature release but will not support pre-6.1 indices to be split unless these indices have been shrunk before. In that case these indices can be split backwards into their original number of shards.	2017-11-06 11:37:55 +01:00
olcbean	e440e23ad1	Fix inconsistencies in the rest api specs for `tasks` (#27163 ) modify parameters names to reflect the changes done in the code base	2017-11-06 10:11:25 +01:00

1 2 3 4 5 ...

1746 Commits