OpenSearch

Commit Graph

Author	SHA1	Message	Date
Colin Goodheart-Smithe	11477a608f	Removes FieldStats API (#25628 ) * Removes FieldStats API * iter * iter	2017-07-13 11:56:46 +01:00
Sergey Galkin	e2bfb35f4a	Shrunk indices should ignore templates A shrunk index should ignore anything from templates and instead take its mappings, aliases, and settings from the original index, plus any new settings and aliases passed in with the shrink request. This commit causes this to be the case. Relates #25380	2017-07-12 18:27:38 -04:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Adrien Grand	f9fbce84b6	Optimize the order of bytes in uuids for better compression. (#24615 ) Flake ids organize bytes in such a way that ids are ordered. However, we do not need that property and could reorganize bytes in an order that would better suit Lucene's terms dict instead. Some synthetic tests suggest that this change decreases the disk footprint of the `_id` field by about 50% in many cases (see `UUIDTests.testCompression`). For instance, when simulating the indexing of 10M docs at a rate of 10k docs per second, the current uid generator used 20.2 bytes per document on average, while this new generator which only puts bytes in a different order uses 9.6 bytes per document on average. We had already explored this idea in #18209 but the attempt to share long common prefixes had had a bad impact on indexing speed. This time I have been more careful about putting discriminant bytes early in the `_id` in a way that preserves indexing speed on par with today, while still allowing for better compression.	2017-07-11 17:28:23 +02:00
Simon Willnauer	98c91a3bd0	Limit the number of concurrent shard requests per search request (#25632 ) This is a protection mechanism to prevent a single search request from hitting a large number of shards in the cluster concurrently. If a search is executed against all indices in the cluster this can easily overload the cluster causing rejections etc. which is not necessarily desirable. Instead this PR adds a per request limit of `max_concurrent_shard_requests` that throttles the number of concurrent initial phase requests to `256` by default. This limit can be increased per request and protects single search requests from overloading the cluster. Subsequent PRs can introduces addiontional improvemetns ie. limiting this on a `_msearch` level, making defaults a factor of the number of nodes or sort shards iters such that we gain the best concurrency across nodes.	2017-07-11 16:23:10 +02:00
Boaz Leskes	a6db0ea908	Run Translog retention yaml tests with no replicas Initializing replicas change the translog retention logic and confuses the test. Switch to the solution suggested in https://github.com/elastic/elasticsearch/issues/25623, if implemented	2017-07-10 11:06:57 +02:00
olcbean	2ba9fd2aec	Remove deprecated created and found from index, delete and bulk (#25516 ) The created and found fields in index and delete responses became obsolete after the introduction of the result field in index, update and delete responses (#19566). After deprecating the created and found fields in 5.x (#19633), now they are removed. Fixes #19630	2017-07-07 13:58:46 -04:00
Jim Ferenczi	31614c3ddb	Remove deprecated fielddata_fields from search request (#25566 ) ... and inner_hits	2017-07-06 13:02:28 +02:00
Colin Goodheart-Smithe	41abccf6c5	Adds rewrite phase to aggregations (#25495 ) * Adds rewrite phase to aggregations This change adds aggregations to the rewrite performed by the `SearchSourceBuilder`. This means that `AggregationBuilder`s are able to implement a `rewrite()` method where they can return a new `AggregationBuilder` which is functionally the same but in a more primitive form. This is exactly analogous to the rewrite done by the `QueryBuilder`s. The first aggregation to implement the rewrite are the filter and filters aggregations so they can rewrite the filters they contain. Closes #17676 * Removes rewrite from PipelineAggregationBuilder Rewrite is based on shard level information. Since pipeline aggregation are run in the reduce phase it doesn’t make sense to rewrite them on the shards. In fact eventually we shouldn’t be transporting them to the shards at all and should be retaining them on the coordinating node for execution in the reduce phase * Addresses review comments * addresses more review comments * Fixed imports	2017-07-04 16:47:48 +01:00
Jun Ohtani	6894ef6057	[Analysis] Support normalizer in request param (#24767 ) * [Analysis] Support normalizer in request param Support normalizer param Support custom normalizer with char_filter/filter param Closes #23347	2017-07-04 19:16:56 +09:00
Jason Tedor	6ae4497c13	Adjust BWC version on bad allocation request test This commit adjusts the BWC version on the bad cluster allocation explain request test as changing the API to respond with a bad request status instead of an internal server error status was backported to 5.x to be included in 5.6.0. Relates #25503	2017-06-30 18:05:58 -04:00
Jason Tedor	c70c440050	Adjust status on bad allocation explain requests When a user requests a cluster allocation explain in a situation where it does not make sense (for example, there are no unassigned shards), we should consider this a bad request instead of a server error. Yet, today by throwing an illegal state exception, these are treated as server errors. This commit adjusts these so that they throw illegal argument exceptions and are treated as bad requests. Relates #25503	2017-06-30 17:50:20 -04:00
Glen Smith	1dd28808d5	Fix typo in name of test This commit fixes a typo in the name of a REST test. Relates #25451	2017-06-30 12:51:37 -04:00
Ali Beyad	b18bfd6062	Output all empty snapshot info fields if in verbose mode (#25455 ) In #24477, a less verbose option was added to retrieve snapshot info via GET /_snapshot/{repo}/{snapshots}. The point of adding this less verbose option was so that if the repository is a cloud based one, and there are many snapshots for which the snapshot info needed to be retrieved, then each snapshot would require reading a separate snapshot metadata file to pull out the necessary information. This can be costly (performance and cost) on cloud based repositories, so a less verbose option was added that only retrieves very basic information about each snapshot that is all available in the index-N blob - requiring only one read! In order to display this less verbose snapshot info appropriately, logic was added to not display those fields which could not be populated. However, this broke integrators (e.g. ECE) that required these fields to be present, even if empty. This commit is to return these fields in the response, even if empty, if the verbose option is set.	2017-06-28 17:37:56 -05:00
Andreas Gebhardt	a156ccd80e	Expand `/_cat/nodes` to return information about hard drive (#21775 ) Expand `/_cat/nodes` with already present information about available disk space `diskAvail` (alias: `d`, `disk`) by: * `diskTotal` (alias `dt`): total disk space * `diskUsed` (alias `du`): used disk space (`diskTotal - diskAvail`) * `diskUsedPercent` (alias `dup`): used disk space percentage Note: The available disk space is the number of bytes available to the node's Java virtual machine. The size might be smaller than the real one. That means the used disk space (percentage) is larger. Closes #21679	2017-06-28 18:20:20 +02:00
Simon Willnauer	4e4a104f4a	Remove remaining `index.mapper.single_type` setting usage from tests (#25388 ) This change removes the remaining explicitly specified `index.mapper.single_type` settings from tests in order to allow the removal of the setting. This is the already approved part of #25375 broken out to simplfiy reviews on	2017-06-25 12:25:41 +02:00
Boaz Leskes	d963882053	Enable a long translog retention policy by default (#25294 ) #25147 added the translog deletion policy but didn't enable it by default. This PR enables a default retention of 512MB (same maximum size of the current translog) and an age of 12 hours (i.e., after 12 hours all translog files will be deleted). This increases to chance to have an ops based recovery, even if the primary flushed or the replica was offline for a few hours. In order to see which parts of the translog are committed into lucene the translog stats are extended to include information about uncommitted operations. Views now include all translog ops and guarantee, as before, that those will not go away. Snapshotting a view allows to filter out generations that are not relevant based on a specific sequence number. Relates to #10708	2017-06-22 17:08:14 +02:00
Spencer	c5b79cd460	[rest-api-spec/indices.refresh] Remove old params Fixes #25234	2017-06-21 13:44:27 -07:00
Lee Hinman	50bac63210	[TEST] Add skip for 5.x BWC tests for custom filter in analyze API Resolves #25316	2017-06-20 09:25:03 -06:00
Jun Ohtani	62d1969595	Parse synonyms with the same analysis chain (#8049 ) * [Analysis] Parse synonyms with the same analysis chain Synonym Token Filter / Synonym Graph Filter tokenize synonyms with whatever tokenizer and token filters appear before it in the chain. Close #7199	2017-06-20 21:50:33 +09:00
Jim Ferenczi	68deda6d03	FastVectorHighlighter should not cache the field query globally (#25197 ) This commit removes the global caching of the field query and replaces it with a caching per field. Each field can use a different `highlight_query` and the rewriting of some queries (prefix, automaton, ...) depends on the targeted field so the query used for highlighting must be unique per field. There might be a small performance penalty when highlighting multiple fields since the query needs to be rewritten once per highlighted field with this change. Fixes #25171	2017-06-15 00:33:01 +02:00
Boaz Leskes	43f4ae5a7b	Indices.rollover/10_basic should refresh to make the doc visible in lucene stats	2017-06-13 23:37:15 +02:00
Boaz Leskes	d3c97615c1	Adapt skip version in rest-api-spec/test/indices.rollover/20_max_doc_condition.yml The relevant change was backported.	2017-06-13 14:46:15 +02:00
Sergey Galkin	1c95cbc4e8	Rollover max docs should only count primaries (#24977 ) max_doc condition for index rollover should use document count only from primary shards Fixes #24217	2017-06-13 14:30:46 +02:00
Spencer	88591fecac	[docs] include two cluster doc pages missing from index (#25180 ) * [docs] include two cluster doc pages missing from index * [rest-api-spec] update link to remote-info docs	2017-06-12 12:33:56 -07:00
Jason Tedor	725f6b6983	Change BWC versions on get mapping 404s This commit changes the BWC versions on the get mapping 404s now that this API returning 404s when a type is missing is supported since 5.5.0. Relates #23192	2017-06-11 16:59:12 -04:00
Jason Tedor	dcf57f296e	Fix get mappings HEAD requests Get mappings HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get mappings HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23192	2017-06-11 14:58:56 -04:00
Jim Ferenczi	5cdbebec94	Test: remove faling test that relies on merge order	2017-06-10 11:55:41 +02:00
Jason Tedor	8a45c3105f	Change BWC versions on create index response This commit changes the BWC versions on the create index response now that the index name in the response is supported since 5.6.0. Relates #25139	2017-06-09 13:52:08 -04:00
Sergey Novikov	7c8657df0e	Return the index name on a create index response This commit modifies the create index response so that it includes the index name. Relates #25139	2017-06-09 13:47:47 -04:00
Tal Levy	a771912a22	Add Ingest-Processor specific Rest Endpoints & Add Grok endpoint (#25059 ) This PR enables Ingest plugins to leverage processor-scoped REST endpoints. First of which being the Grok endpoint that retrieves Grok Patterns for users to retrieve all the built-in patterns. Example usage: Kibana Grok Autocomplete!	2017-06-08 15:24:35 -07:00
Lee Hinman	119f8ed9f0	Correctly enable _all for older 5.x indices When we disabled `_all` by default for indices created in 6.0, we missed adding a layer that would handle the situation where `_all` was not enabled in 5.x and then the cluster was updated to 6.0, this means that when the cluster was updated the `_all` field would be disabled for 5.x indices and field values would not be added to the `_all` field. This adds a compatibility layer for 5.x indices where we treat the default enabled value for the `_all` field to be `true` if unset on 5.x indices. Resolves #25068	2017-06-08 14:37:44 -06:00
Lee Hinman	050b7cd0f9	Include empty mappings in GET /{index}/_mappings requests (#25118 ) Previously this would output: ``` GET /test-1/_mappings { } ``` And after this change: ``` GET /test-1/_mappings { "test-1": { "mappings": {} } } ``` To bring parity back to the REST output after #24723. Relates to #25090	2017-06-08 10:57:04 -06:00
Lee Hinman	5b2ab96364	Return index name and empty map for /{index}/_alias with no aliases Previously in #24723 we changed the `_alias` API to not go through the `RestGetIndicesAction` endpoint, instead creating a `RestGetAliasesAction` that did the same thing. This changes the formatting so that it matches the old formatting of the endpoint, before: ``` GET /test-1/_alias { } ``` And after this change: ``` GET /test-1/_alias { "test-1": { "aliases": {} } } ``` This is related to #25090	2017-06-08 10:03:03 -06:00
Jim Ferenczi	36a5cf8f35	Automatically early terminate search query based on index sorting (#24864 ) This commit refactors the query phase in order to be able to automatically detect queries that can be early terminated. If the index sort matches the query sort, the top docs collection is early terminated on each segment and the computing of the total number of hits that match the query is delegated to a simple TotalHitCountCollector. This change also adds a new parameter to the search request called `track_total_hits`. It indicates if the total number of hits that match the query should be tracked. If false, queries sorted by the index sort will not try to compute this information and and will limit the collection to the first N documents per segment. Aggregations are not impacted and will continue to see every document even when the index sort matches the query sort and `track_total_hits` is false. Relates #6720	2017-06-08 12:10:46 +02:00
Simon Willnauer	d57641a747	Skip rest tests that use mutiple types in pure 6.x clusters (#24965 ) This change skips rest tests that use mutlitple types if the cluster is a pure 6.x cluster. This allows all indics to be created with a version less than 6.0 and that means we can safely use the `mapping.single_type` setting. Relates to #24961	2017-06-07 15:00:17 +02:00
Jason Tedor	5a0b159cb7	Modify skips for get missing alises tests Previous work modified the status code on the get aliases API when an alias is missing so that these requests 404 now. This change was also backported to 5.5 so we can adjust the skips to skip everything before 5.5.0.	2017-06-06 15:35:30 -04:00
Jason Tedor	e03c4938c5	GET aliases should 404 if aliases are missing Previously the HEAD and GET aliases endpoints were misaigned in behavior. The HEAD verb would 404 if any aliases are missing while the GET verb would not if any aliases existed. When HEAD was aligned with GET, this broke the previous usage of HEAD to serve as an existence check for aliases. It is the behavior of GET that is problematic here though, if any alias is missing the request should 404. This commit addresses this by modifying the behavior of GET to behave in this way. This fixes the behavior for HEAD to also 404 when aliases are missing. Relates #25043	2017-06-06 14:37:29 -04:00
Jim Ferenczi	4a8759ef4c	Collapse inner hits rest test should not skip 5.x Relates https://github.com/elastic/elasticsearch/pull/24517	2017-06-06 09:33:56 +02:00
jaymode	e98d5676b3	Test: update missing body tests to run against versions >= 5.5.0 This updates the missing body tests to run against versions >= 5.5.0 after backporting the change to the 5.x branch. See #23497	2017-06-05 14:26:07 -06:00
Alex Benusovich	5463294ec4	Fixed NPEs caused by requests without content. (#23497 ) REST handlers that require a body will throw an an ElasticsearchParseException "request body required". REST handlers that require a body OR source param will throw an ElasticsearchParseException "request body or source param required". Replaced asserts in BulkRequest parsing code with a more descriptive IllegalArgumentException if the line contains an empty object. Updated bulk REST test to verify an empty action line is rejected properly. Updated BulkRequestTests with randomized testing for an empty action line. Used try-with-resouces for XContentParser in AbstractBulkByQueryRestHandler.	2017-06-05 09:08:14 -06:00
Lee Hinman	134b0d594e	[TEST] Skip wildcard expansion test due to breaking change Relates to #24723	2017-06-02 20:48:52 -06:00
Lee Hinman	a32d1b91fa	Remove comma-separated feature parsing for GetIndicesAction This removes the parsing of things like `GET /idx/_aliases,_mappings`, instead, a user must choose between retriving all index metadata with `GET /idx`, or only a specific form such as `GET /idx/_settings`. Relates to (and is a prerequisite of) #24437	2017-06-02 14:43:38 -06:00
Colin Goodheart-Smithe	779fb9a1c0	Adds nodes usage API to monitor usages of actions (#24169 ) * Adds nodes usage API to monitor usages of actions The nodes usage API has 2 main endpoints /_nodes/usage and /_nodes/{nodeIds}/usage return the usage statistics for all nodes and the specified node(s) respectively. At the moment only one type of usage statistics is available, the REST actions usage. This records the number of times each REST action class is called and when the nodes usage api is called will return a map of rest action class name to long representing the number of times each of the action classes has been called. Still to do: * [x] Create usage service to store usage statistics * [x] Record usage in REST layer * [x] Add Transport Actions * [x] Add REST Actions * [x] Tests * [x] Documentation * Rafactors UsageService so counts are done by the handlers * Fixing up docs tests * Adds a name to all rest actions * Addresses review comments	2017-06-02 08:46:38 +01:00
Ryan Ernst	8d88b94372	Scripting: Add optional context parameter to put stored script requests (#25014 ) This commit adds an optional `context` url parameter to the put stored script request. When a context is specified, the script is compiled against that context before storing, as a validation the script will work when used in that context.	2017-06-01 17:53:48 -07:00
Jim Ferenczi	47cf7825dd	Move BWC version to 5.5 after backport Relates to #24517	2017-05-26 14:57:07 +02:00
Matt Weber	601a61a91c	Support Multiple Collapse Inner Hits Support multiple named inner hits on a field collapsing request.	2017-05-26 13:23:57 +02:00
markharwood	a64937db7a	Test fix - rest test missing version skip for new 6.0 significant_text agg	2017-05-24 17:05:02 +01:00
markharwood	b7197f5e21	SignificantText aggregation - like significant_terms, but for text (#24432 ) * SignificantText aggregation - like significant_terms but doesn’t require fielddata=true, recommended used with `sampler` agg to limit expense of tokenizing docs and takes optional `filter_duplicate_text`:true setting to avoid stats skew from repeated sections of text in search results. Closes #23674	2017-05-24 13:46:43 +01:00
Ryan Ernst	463fe2f4d4	Scripting: Remove file scripts (#24627 ) This commit removes file scripts, which were deprecated in 5.5. closes #21798	2017-05-17 14:42:25 -07:00

1 2 3 4 5 ...

1369 Commits