OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	5dd476feb5	Skip bad request REST test on pre-6.0 This commit adds a skip for the bad request REST test on pre-6.0 nodes. Previously, a request for /_(.*) where $1 is not an existing endpoint would return a 404. This is because the request would be treated as a get index request for an index named _$1. However, an index can never start with "_" so logic was added to detect this and return a 400 instead as this should be treated as a bad request. During the mixed-cluster BWC tests, a node running pre-6.0 code will still return a 404 though. Therefore, this test needs to be skipped in such a mixed-cluster scenario.	2017-09-18 06:46:10 -04:00
Michael Basnight	296c239611	Add check for invalid index in WildcardExpressionResolver (#26409) This commit adds validation to the resolving of indexes in the wildcard expression resolver. It no longer throws a 404 Not Found when resolving invalid indices. It throws a 400 instead, as it is an invalid index. This was the behavior of 5.x.	2017-09-15 17:00:41 -05:00
Michael Basnight	f385e0cf26	Add bad_request to the rest-api-spec catch params (#26539) This adds another request to the catch params. It also makes sure that the generic request param does not allow 400 either.	2017-09-14 14:24:03 -05:00
Michael Basnight	e69c39a60f	Add missing catch arguments to the rest api spec (#26536)	2017-09-14 11:05:46 -05:00
Christoph Büscher	2eaf7534f3	[Tests] Removing skipping tests in search rest tests After backporting the script_field soft limit to the 6.x branches, this test can now also run in a mixed cluster. Relates to #26598	2017-09-13 18:21:15 +02:00
Adrien Grand	93da7720ff	Move non-core mappers to a module. (#26549) Today we have all non-plugin mappers in core. I'd like to start moving those that neither map to json datatypes nor are very frequently used like `date` or `ip` to a module. This commit creates a new module called `mappers-extra` and moves the `scaled_float` and `token_count` mappers to it. I'd like to eventually move `range` fields there but it's more complicated due to their intimate relationship with range queries. Relates #10368	2017-09-13 17:58:53 +02:00
Christoph Büscher	027c555c9b	Add soft limit on allowed number of script fields in request (#26598) Requesting too many script_fields in a search request can be costly because of script execution. This change introduces a soft limit on the number of script fields that are allowed per request. The setting can be changed per index using the index.max_script_fields setting. Relates to #26390	2017-09-13 17:22:16 +02:00
Christoph Büscher	d2cfad6187	[Tests] Remove skip tests in search/30_limits.yml After backporting the related change to the 6.x branches, this test can now also be run in a mixed cluster. Relates to #26574	2017-09-13 13:15:31 +02:00
Christoph Büscher	e00db235bc	Add a soft limit for the number of requested doc-value fields (#26574) Requesting too many docvalue_fields in a search request can potentially be costly because it might incur a per-field per-document seek. This change introduces a soft limit on the number of fields that can be retrieved. The setting can be changed per index using the `index.max_docvalue_fields_search` setting. Relates to #26390	2017-09-13 11:57:06 +02:00
Jim Ferenczi	c62b0192d0	#26496: Set the correct bwc version after backport to 6.x	2017-09-11 13:09:44 +02:00
Jim Ferenczi	d68d8c9cef	Expose duplicate removal in the completion suggester (#26496) This change exposes the duplicate removal option added in Lucene for the completion suggester with a new option called `skip_duplicates` (defaults to false). This commit also adapts the custom suggest collector to handle deduplication when multiple contexts match the input. Closes #23364	2017-09-07 17:11:01 +02:00
Jim Ferenczi	b41c44370e	#26448: Fix rest test not cleaning the settings	2017-09-07 14:11:04 +02:00
Jim Ferenczi	6f69b25f61	Restore bwc skip test because backport is missing	2017-09-07 12:38:41 +02:00
Jim Ferenczi	1b3b1c0647	Set bwc versions for scroll keep alive rest tests (backported to 6.1)	2017-09-07 12:34:33 +02:00
Jim Ferenczi	0c799eedc5	Add upper limit for scroll expiry (#26448) This change adds a dynamic cluster setting named `search.max_keep_alive`. It is used as an upper limit for scroll expiry time in scroll queries and defaults to 1 hour. This change also ensures that the existing setting `search.default_keep_alive` is always smaller than `search.max_keep_alive`. Relates #11511 * check style * add skip for bwc * iter * Add a maximum throttle wait time of 1h for reindex * review * remove empty line	2017-09-06 10:06:48 +02:00
Christoph Büscher	4ff12c9a0b	Throw exception in scroll requests using `from` (#26235) The `from` search parameter cannot really be used in scrolled searches. This commit adds a check for this case to the SearchRequest#validate() method so we can report it as an error rather than silently ignoring it. Closes #9373	2017-08-21 15:12:34 +02:00
Zachary Tong	9f295b4ba8	Add REST tests for percentiles agg (#26266) There's a separate test for each TDigest and HDR Histo, to keep things clean. Related to #26220	2017-08-18 10:31:59 -04:00
Zachary Tong	67e003da0d	Add REST tests for value_count, stats, extended_stats and cardinality aggs (#26239) * Add REST tests for value_count, stats, extended_stats and cardinality aggs Also updates the document type of other agg REST tests to `doc` Related to #26220	2017-08-17 10:33:01 -04:00
Zachary Tong	2978b5df8b	Add REST tests for avg/min/max/sum metric aggs (#26225) Adds some REST tests for avg/min/max/sum metric aggregations Related to #26220	2017-08-16 09:26:38 -04:00
Yannick Welsch	01f6851691	Serialize and expose timeout of acknowledged requests in REST layer (#26189) Due to the weird way of structuring the serialization code in AcknowledgedRequest, many request types forgot to properly serialize the request timeout, for example "index deletion", "index rollover", "index shrink", "putting pipeline", and other requests. This means that if those requests were not directly sent to the master node, the acknowledgement timeout information would be lost (and the default used instead). Some requests also don't properly expose the timeout mechanism in the REST layer, such as put / delete stored script. This commit fixes all that.	2017-08-16 07:43:05 +08:00
Luca Cavanna	14ba36977e	[TEST] prevent yaml tests from using raw requests (#26044) Raw requests are supported only by the java yaml test runner and were introduced to test docs snippets. Some yaml tests ended up using them (see #23497) which causes failures for other language clients. This commit migrates those yaml tests to Java tests that send requests through the Java low-level REST client, and also moves the ability to send raw requests to a special client that's only available when testing docs snippets. Closes #25694	2017-08-07 11:02:16 +02:00
Boaz Leskes	e11cbed534	Adding a refresh listener to a recovering shard should be a noop (#26055) When `refresh=wait_for` is set on an indexing request, we register a listener on the shards that are called during the next refresh. During the recover translog phase, when the engine is open, we have a window of time when indexing operations succeed and they can add their listeners. Those listeners will only be called when the recovery finishes as we do not refresh during recoveries (unless the indexing buffer is full). Next to being a bad user experience, it can also cause deadlocks with an ongoing peer recovery that may wait for those operations to mark the replica in sync (details below). To fix this, this PR changes refresh listeners to be a noop when the shard is not yet serving reads (implicitly covering the recovery period). It doesn't matter anyway. Deadlock with recovery: When finalizing a peer recovery we mark the peer as "in sync". To do so we wait until the peer's local checkpoint is at least as high as the global checkpoint. If an operation with `refresh=wait_for` is added as a listener on that peer during recovery, it is not completed from the perspective of the primary. The primary then may wait for it to complete before advancing the local checkpoint for that peer. Since that peer is not considered in sync, the global checkpoint on the primary can be higher, causing a deadlock. Operation waits for recovery to finish and a refresh to happen. Recovery waits on the operation.	2017-08-04 19:51:15 +02:00
Jason Tedor	f154e53b37	Correctly reset persistent settings in teardown In the refresh REST tests we set up some persistent settings for debug logging. In the teardown, we try to restore the logging level back to info via another persistent setting but this is a mistake because other tests check if there are no persistent settings. To fix this, we remove the persistent setting that we added.	2017-08-01 19:35:37 +09:00
Jason Tedor	e1ef3d5cc2	Add debug logging for refresh REST tests We are chasing a test failure in the "refresh=wait_for waits until changes are visible in search" test yet the logs currently give us no indication what is happening. This commit adds debug logging for this test, and cleans up this logging in a teardown section. We can remove this additional logging after we chase the test failure down.	2017-08-01 18:54:52 +09:00
Zachary Tong	caef6cc128	[TEST] Move version skip to setup in Indices.GetMapping#70_legacy_multi_type (#25816) Since the setup attempts to create an index with two types, and the setup runs before any test, this will fail on versions 6.0+ before it has a chance to check the skip in each individual test. Moving to the setup resolves this issue.	2017-07-21 11:53:48 -04:00
Simon Willnauer	0e3ad522a2	Rewrite search requests on the coordinating nodes (#25814) This change rewrites search requests on the coordinating node before we send requests to the individual shards. This will reduce the rewrite load and object creation for each rewrite on the executing nodes and will fetch resources only once instead of N times once per shard for queries like `terms` query with index lookups. (among percolator and geo-shape) Relates to #25791	2017-07-21 09:38:38 +02:00
Jack Conradson	9f7463e796	remove lang url parameter from stored script requests (#25779) Also has updates to ScriptMetaData for allowing the old namespace format to be loaded all the way back through 5.0; however, it will throw an exception if two scripts share the same id but different languages.	2017-07-20 08:51:08 -07:00
Luca Cavanna	5c5d723b86	Improve error message when aliases are not supported (#25728) With #23997 and #25268 we have changed put alias, delete alias, update aliases and delete index to not accept aliases. Instead concrete indices should be provided as their index parameter. This commit improves the error message in case aliases are provided, from an IndexNotFoundException (404 status code) with "no such index" message, to an IllegalArgumentException (400 status code) with "The provided expression [alias] matches an alias, specify the corresponding concrete indices instead." message. Note that there is no specific error message for the case where wildcard expressions match one or more aliases. In fact, aliases are simply ignored when expanding wildcards for such APIs. An error is thrown only when the expression ends up matching no indices at all, and allow_no_indices is set to false. In that case the error is still the generic "404 - no such index".	2017-07-18 15:40:17 +02:00
Luca Cavanna	0d8b753325	IndexClosedException to return 400 rather than 403 (#25752) 403 can be confused with security. If an API doesn't support working against closed indices and closed indices are referred to in a request, that is a bad request, hence 400 is more appropriate.	2017-07-18 10:26:32 +02:00
Christoph Büscher	a6e3d356ed	Change parsing of numeric `to` and `from` parameters in `date_range` aggregation (#25376) Currently the `to` and `from` parameters in the `date_range` aggregation are not parsed with the correct date field format from the mappings or the aggregation if the argument is numeric, but are always treated as a long value specifying `epoch_millis`. This leads to problems e.g. when the format is `epoch_second`, but the `to` and `from` are currently treated as millis. With this change, we interpret these parameters according to the `format` of the target field. If the `format` in the mappings is not compatible with numeric input values, a compatible `format` (e.g. `epoch_millis`, `epoch_second`) must be specified in the `date_range` aggregation itself, otherwise an error is thrown. Closes #17920	2017-07-18 09:45:28 +02:00
Jason Tedor	e9aa60dc9d	Skip shrink ignores template mapping in BWC tests This commit reverts some changes to the shrink API ignore template mapping REST test in favor of simply skipping the test for BWC purposes. The complexity here is due to deprecations and lacking the infrastructure to gracefully handle a situation like this.	2017-07-17 20:32:18 +09:00
Colin Goodheart-Smithe	7a401cd1d2	[TEST] skips shrink source mapping rest test This change skips the rest test in `rest-api-spec/test/indices.shrink/20_source_mapping.yml` as it currently fails: if we don’t expect the deprecation warning, the normal rest tests fail because they get a warning they don’t expect, but if we do expect the deprecation warning, the mixed cluster tests fail because they don’t get a warning which they expected.	2017-07-17 12:24:07 +01:00
Jason Tedor	b1f8b75ac3	Fix warnings in shrink ignore templates test This commit fixes an issue with the REST test that the shrink API ignores templates. The problem is that we have to use a BWC version of the API (for the BWC tests) but this raises deprecation warnings. This commit adds an expectation for these deprecation warnings.	2017-07-17 18:25:37 +09:00
Simon Willnauer	2da79f2b5e	[TEST] Use 5.x compatible API in shrink tests	2017-07-17 09:45:49 +02:00
Jason Tedor	5b25b5d80a	Fix comment on shrink indices test This commit fixes a comment on a shrink indices test; the comment is wrong because the fix in question was applied starting 5.6.0.	2017-07-17 16:28:09 +09:00
Jason Tedor	fd98f7abc2	Adjust skip version for shrink index test This commit adjusts the skip version for a shrink index test that ensures that a shrunken index ignores templates; the version can be adjusted after the fix was backported targeting 5.6.0 and later. Relates #25380	2017-07-17 12:56:12 +09:00
Simon Willnauer	ccda0441e1	Bump BWC versions after #25658 backport to 5.6	2017-07-15 11:34:16 +02:00
Ryan Ernst	072402463b	Scripting: Remove search template actions (#25717) The dedicated search template put/get/delete actions are deprecated in 5.6. This commit removes them from 6.0.	2017-07-14 23:12:05 -07:00
Luca Cavanna	7930b8a720	Fix indices options parsing from REST in delete index API (#25709) When parsing indices options from REST, we parse the optional parameters that are supported at REST (ignore_unavailable, allow_no_indices and expand_wildcards) and we provide the API default values for all the other (internal) options so that they are set to the new indices options while parsing. The `ignoreAliases` option was forgotten though, which means that whenever you pass in any index option at REST to the delete index API, you get to delete aliases like it was supported before (as ignoreAliases gets set to false like in all the other APIs). Added unit tests for IndicesOptions parsing from REST parameters, and yaml tests for the delete index API.	2017-07-14 10:39:44 +02:00
Colin Goodheart-Smithe	11477a608f	Removes FieldStats API (#25628) * Removes FieldStats API * iter * iter	2017-07-13 11:56:46 +01:00
Sergey Galkin	e2bfb35f4a	Shrunk indices should ignore templates A shrunk index should ignore anything from templates and instead take its mappings, aliases, and settings from the original index, plus any new settings and aliases passed in with the shrink request. This commit causes this to be the case. Relates #25380	2017-07-12 18:27:38 -04:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658) Today if we search across a large number of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range, i.e. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is no document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal a single dashboard in Kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the Kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of Elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Adrien Grand	f9fbce84b6	Optimize the order of bytes in uuids for better compression. (#24615) Flake ids organize bytes in such a way that ids are ordered. However, we do not need that property and could reorganize bytes in an order that would better suit Lucene's terms dict instead. Some synthetic tests suggest that this change decreases the disk footprint of the `_id` field by about 50% in many cases (see `UUIDTests.testCompression`). For instance, when simulating the indexing of 10M docs at a rate of 10k docs per second, the current uid generator used 20.2 bytes per document on average, while this new generator which only puts bytes in a different order uses 9.6 bytes per document on average. We had already explored this idea in #18209 but the attempt to share long common prefixes had had a bad impact on indexing speed. This time I have been more careful about putting discriminant bytes early in the `_id` in a way that preserves indexing speed on par with today, while still allowing for better compression.	2017-07-11 17:28:23 +02:00
Simon Willnauer	98c91a3bd0	Limit the number of concurrent shard requests per search request (#25632) This is a protection mechanism to prevent a single search request from hitting a large number of shards in the cluster concurrently. If a search is executed against all indices in the cluster this can easily overload the cluster causing rejections etc. which is not necessarily desirable. Instead this PR adds a per request limit of `max_concurrent_shard_requests` that throttles the number of concurrent initial phase requests to `256` by default. This limit can be increased per request and protects single search requests from overloading the cluster. Subsequent PRs can introduce additional improvements, i.e. limiting this on a `_msearch` level, making defaults a factor of the number of nodes or sort shards iters such that we gain the best concurrency across nodes.	2017-07-11 16:23:10 +02:00
Boaz Leskes	a6db0ea908	Run Translog retention yaml tests with no replicas Initializing replicas changes the translog retention logic and confuses the test. Switch to the solution suggested in https://github.com/elastic/elasticsearch/issues/25623, if implemented	2017-07-10 11:06:57 +02:00
olcbean	2ba9fd2aec	Remove deprecated created and found from index, delete and bulk (#25516) The created and found fields in index and delete responses became obsolete after the introduction of the result field in index, update and delete responses (#19566). After deprecating the created and found fields in 5.x (#19633), now they are removed. Fixes #19630	2017-07-07 13:58:46 -04:00
Jim Ferenczi	31614c3ddb	Remove deprecated fielddata_fields from search request (#25566) ... and inner_hits	2017-07-06 13:02:28 +02:00
Colin Goodheart-Smithe	41abccf6c5	Adds rewrite phase to aggregations (#25495) * Adds rewrite phase to aggregations This change adds aggregations to the rewrite performed by the `SearchSourceBuilder`. This means that `AggregationBuilder`s are able to implement a `rewrite()` method where they can return a new `AggregationBuilder` which is functionally the same but in a more primitive form. This is exactly analogous to the rewrite done by the `QueryBuilder`s. The first aggregations to implement the rewrite are the filter and filters aggregations so they can rewrite the filters they contain. Closes #17676 * Removes rewrite from PipelineAggregationBuilder Rewrite is based on shard level information. Since pipeline aggregations are run in the reduce phase it doesn’t make sense to rewrite them on the shards. In fact eventually we shouldn’t be transporting them to the shards at all and should be retaining them on the coordinating node for execution in the reduce phase * Addresses review comments * addresses more review comments * Fixed imports	2017-07-04 16:47:48 +01:00
Jun Ohtani	6894ef6057	[Analysis] Support normalizer in request param (#24767) * [Analysis] Support normalizer in request param Support normalizer param Support custom normalizer with char_filter/filter param Closes #23347	2017-07-04 19:16:56 +09:00
Jason Tedor	6ae4497c13	Adjust BWC version on bad allocation request test This commit adjusts the BWC version on the bad cluster allocation explain request test as changing the API to respond with a bad request status instead of an internal server error status was backported to 5.x to be included in 5.6.0. Relates #25503	2017-06-30 18:05:58 -04:00


1408 Commits