OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	41ea8fdcec	Picks offset source for the unified highlighter directly from the es mapping (#25747 ) This commit changes how the offset source is picked for each field using the es mapping rather than the underlying Lucene field infos. It's mandatory for large mappings where field infos retrieval can be costly (the global field infos is merged for each highlighted field in every hit by the Lucene impl). Fixes #25699	2017-07-17 19:10:46 +02:00
Lee Hinman	610ba7e427	Register data node stats from info carried back in search responses (#25430 ) * Register data node stats from info carried back in search responses This is part of #24915, where we now calculate the EWMA of service time for tasks in the search threadpool, and send that as well as the current queue size back to the coordinating node. The coordinating node now tracks this information for each node in the cluster. This information will be used in the future for determining the best replica a search request should be routed to. This change has no user-visible difference. * Move response time timing into ResponseListenerWrapper * Move ResponseListenerWrapper to ActionListener instead of SearchActionListener Also removes the logger * Move `requestIndex` back to private * De-guice-ify ResponseCollectorService \o/ * Undo all changes to SearchQueryThenFetchAsyncAction * Remove unneeded response collector from TransportSearchAction * Undo all changes to SearchDfsQueryThenFetchAsyncAction * Completely rewrite the inside of ResponseCollectorService's record keeping * Documentation and cleanups for ResponseCollectorService * Add unit test for collection of queue size and service time * Fix Guice construction error * Add basic unit tests for ResponseCollectorService * Fix version constant for the master merge * Fix test compilation after master merge * Add a test for node removal on cluster changed event * Remove integration test as there are now unit tests * Rename ResponseListenerWrapper -> SearchExecutionStatsCollector * Fix line-length * Make classes private and final where appropriate * Pass nodeId into SearchExecutionStatsCollector and use only ActionListener * Get nodeId from connection so searchShardTarget can be private * Remove threadpool from SearchContext, get it from IndexShard instead * Add missing import * Use BiFunction for responseWrapper rather than passing in collector service	2017-07-17 11:04:51 -06:00
Simon Willnauer	cb4eebcd6a	Make `index` in TermsLookup mandatory (#25753 ) This change removes the leniency of having a `null` index to fetch terms from in 6.0 onwards. This feature will be deprecated in the 5.x series and 6.0 nodes will require the index to be set. Closes #25750	2017-07-17 18:50:30 +02:00
Simon Willnauer	9ff259c260	Use concrete version for BWC checks in SearchTransportService (#25748 ) We used to compare against the min compatible version, which is misleading since it might move over time. Since we backported the `can_match` API entirely, it's better to compare against a version constant.	2017-07-17 18:49:50 +02:00
Clinton Gormley	25a89e613a	Broke recipes into separate pages	2017-07-17 18:21:39 +02:00
Boaz Leskes	c0751c8650	debug logging to TruncateTranslogIT To see what data paths are used.	2017-07-17 17:18:05 +02:00
Adrien Grand	949db39fad	Fix reproducibility of UUIDTests. Closes #25714	2017-07-17 15:43:28 +02:00
Adrien Grand	78a6c3427b	Optimize `terms` queries on `ip` addresses to use a `PointInSetQuery` whenever possible. (#25669 ) We can't do it in the general case because of prefix queries, but I believe this is mostly used in query strings and not in explicit `terms` queries. Closes #25667	2017-07-17 15:39:01 +02:00
Adrien Grand	264088f1c4	Deprecate the `_default_` mapping. (#25652 ) Now that indices cannot have types anymore, this feature does not buy anything anymore. Closes #25500	2017-07-17 15:37:59 +02:00
Jason Tedor	e9aa60dc9d	Skip shrink ignores template mapping in BWC tests This commit reverts some changes to the shrink API ignore template mapping REST test in favor of simply skipping the test for BWC purposes. The complexity here is due to deprecations and lacking the infrastructure to gracefully handle a situation like this.	2017-07-17 20:32:18 +09:00
Colin Goodheart-Smithe	7a401cd1d2	[TEST] skips shrink source mapping rest test This change skips the rest test in `rest-api-spec/test/indices.shrink/20_source_mapping.yml` as it currently fails either way: if we don’t expect the deprecation warning, the normal rest tests fail because they get a warning they don’t expect; but if we do expect the deprecation warning, the mixed cluster tests fail because they don’t get a warning which they expected.	2017-07-17 12:24:07 +01:00
Jason Tedor	b1f8b75ac3	Fix warnings in shrink ignore templates test This commit fixes an issue with the REST test that the shrink API ignores templates. The problem is that we have to use a BWC version of the API (for the BWC tests) but this raises deprecation warnings. This commit adds an expectation for these deprecation warnings.	2017-07-17 18:25:37 +09:00
Boaz Leskes	7739aad1aa	Add testing around recovery to TruncateTranslogIT	2017-07-17 10:48:26 +02:00
Jason Tedor	f121cd3beb	Fix pre-6.0 response to unknown replication actions When sending replica requests for replication operations, we skip sending the request to pre-6.0 nodes for operations that such nodes would not be aware of (e.g., the background global checkpoint sync, or the primary/replica resync) since they would not know what to do with these requests. Yet, we simulate that we received responses from these nodes. Today, this is done by simulating that they sent us that their local checkpoint is unassigned sequence number. However, for pre-6.0 nodes we have introduced a special local checkpoint used in the global checkpoint tracker for such nodes and that is what we should use here too. This commit fixes this issue. Relates #25744	2017-07-17 17:47:48 +09:00
Simon Willnauer	2da79f2b5e	[TEST] Use 5.x compatible API in shrink tests	2017-07-17 09:45:49 +02:00
Jason Tedor	5b25b5d80a	Fix comment on shrink indices test This commit fixes a comment on a shrink indices test; the comment is wrong because the fix in question was applied starting 5.6.0.	2017-07-17 16:28:09 +09:00
Martijn van Groningen	8003171a0c	Move more token filters to analysis-common module The following token filters were moved: arabic_normalization, german_normalization, hindi_normalization, indic_normalization, persian_normalization, scandinavian_normalization, serbian_normalization, sorani_normalization, cjk_width and cjk_width Relates to #23658	2017-07-17 08:29:44 +02:00
Glen Smith	e9dfb2a215	Fix another simulate example in ingest docs When simulating an ingest pipeline against an existing pipeline, the _source field is required to wrap each doc. This commit fixes another example in the docs that is missing this. Relates #25743, relates `e3a0c11239`	2017-07-17 15:17:42 +09:00
Glen Smith	e3a0c11239	Fix simulate example in ingest docs When simulating an ingest pipeline against an existing pipeline, the _source field is required to wrap each doc. This commit fixes an example in the docs that is missing this. Relates #25742	2017-07-17 14:17:41 +09:00
Jason Tedor	fd98f7abc2	Adjust skip version for shrink index test This commit adjusts the skip version for a shrink index test that ensures that a shrunken index ignores templates; the version can be adjusted after the fix was backported targeting 5.6.0 and later. Relates #25380	2017-07-17 12:56:12 +09:00
Simon Willnauer	8364279b98	Prevent skipping shards if a suggest builder is present (#25739 ) Even if the query part can rewrite to match none we can't skip the suggest execution since it might yield results. Relates to #25658	2017-07-16 19:06:47 +02:00
Simon Willnauer	ccda0441e1	Bump BWC versions after #25658 backport to 5.6	2017-07-15 11:34:16 +02:00
Boaz Leskes	a6bea1bf97	testMockFailToSendNoConnectRule should wait for connection close to bubble up and disconnect the node #25521 changed channel closing to be handled async on anything but transport stop. This means it may take a while before calling `connection.close()` and the node being removed from the `connectedNodes` list (but the connection is immediately unusable). Fixes #25686	2017-07-15 09:28:17 +02:00
Ryan Ernst	072402463b	Scripting: Remove search template actions (#25717 ) The dedicated search template put/get/delete actions are deprecated in 5.6. This commit removes them from 6.0.	2017-07-14 23:12:05 -07:00
javanna	2c38e93e96	[DOCS] Added note to high level client docs on version The alpha2 docs is built out of master which may make users think that the high level client was already released as part of alpha2 which it was not. This note should clarify that the client will be released with 6.0.0-beta1	2017-07-15 07:50:25 +02:00
Ryan Ernst	b1762d69b5	Setup: Change default heap to 1G (#25695 ) This commit changes the default heap size to 1 GB. Experimenting with elasticsearch is often done on laptops, and 1 GB is much friendlier to laptop memory. It does put more pressure on the gc, but the tradeoff is a smaller default footprint. Users running in production can (and should) adjust the heap size as necessary for their usecase.	2017-07-14 09:38:08 -07:00
Christoph Büscher	5387ed00d2	[Docs] Adding suggestion sections to high level client docs (#25724 ) This adds a section about how to add suggestions to the SearchSourceBuilder and how to retrieve them from a SearchResponse.	2017-07-14 18:33:28 +02:00
Yannick Welsch	8f0b357651	Let primary own its replication group (#25692 ) Currently replication and recovery are both coordinated through the latest cluster state available on the ClusterService as well as through the GlobalCheckpointTracker (to have consistent local/global checkpoint information), making it difficult to understand the relation between recovery and replication, and requiring some tricky checks in the recovery code to coordinate between the two. This commit makes the primary the single owner of its replication group, which simplifies the replication model and allows to clean up corner cases we have in our recovery code. It also reduces the dependencies in the code, so that neither RecoverySourceXXX nor ReplicationOperation need access to the latest state on ClusterService anymore. Finally, it gives us the property that in-sync shard copies won't receive global checkpoint updates which are above their local checkpoint (relates #25485).	2017-07-14 13:52:53 +02:00
Christoph Büscher	f809a12493	[Docs] Adding aggregation sections to high level client docs (#25707 ) This adds a section about how to add aggregations to the SearchSourceBuilder and how to retrieve them from a SearchResponse to the documentation for the high level rest client.	2017-07-14 12:47:47 +02:00
Bodecker DellaMaria	4f0dc5bf32	Mark filtered query example as not to be used (#25661 ) The Filtered Query has been deprecated in favour of the Bool Query with a filter context. However, this deleted page for the Filtered Query is often ranked highly in search results when searching for documentation on "filtered queries". Often people just copy the first code snippet they see, which in this case is the INCORRECT syntax (the correct syntax follows). I think reordering the examples would help avoid a lot of confusion (I have seen people make this same mistake 3 times now) Adding a comment to indicate that the first example shouldn't be used	2017-07-14 11:45:21 +02:00
Martijn van Groningen	c8777c4c2e	docs: Updated reference docs that `document_type` is deprecated	2017-07-14 11:07:46 +02:00
Luca Cavanna	7930b8a720	Fix indices options parsing from REST in delete index API (#25709 ) When parsing indices options from REST, we parse the optional parameters that are supported at REST (ignore_unavailable, allow_no_indices and expand_wildcards) and we provide the API default values for all the other (internal) options so that they are set to the new indices options while parsing. The `ignoreAliases` option was forgotten though, which means that whenever you pass in any index option at REST to the delete index API, you get to delete aliases like it was supported before (as ignoreAliases gets set to false like in all the other APIs). Added unit tests for IndicesOptions parsing from REST parameters, and yaml tests for the delete index API.	2017-07-14 10:39:44 +02:00
Antonio Matarrese	afd9a1c1b1	[DOCS] Explain mapping explosion (#25654 )	2017-07-14 09:47:41 +02:00
Martijn van Groningen	9040f4498e	test: wait for index to be green before running all checks	2017-07-13 21:49:37 +02:00
Neil Rickards	5189bd14f1	[Docs] Fix typo in pattern-tokenizer.asciidoc (#25626 )	2017-07-13 18:43:48 +02:00
Jim Ferenczi	fe383b7c27	More clarifications on the unified highlighter being the new default (#25668 ) * More clarifications on the unified highlighter being the new default	2017-07-13 15:38:58 +02:00
Jim Ferenczi	13da3eb53e	Refactor QueryStringQuery for 6.0 (#25646 ) This change refactors the query_string query to analyze the query text around logical operators of the query string the same way as a match_query/multi_match_query. It also adds a type parameter that can be used to change the way multi-field queries are built, the same way as a multi_match query does. Now that these queries share the same behavior regarding text analysis, some parameters are obsolete and have been deprecated: split_on_whitespace: This setting is now ignored with a deprecation notice if it is used explicitly. With this PR the query_string always splits on logical operators. It simplifies the understanding of the other parameters that can have different meanings depending on the value of split_on_whitespace. auto_generate_phrase_queries: This setting is now ignored with a deprecation notice if it is used explicitly. This setting only makes sense when the parser splits on whitespace. use_dismax: This setting is now ignored with a deprecation notice if it is used explicitly. The tie_breaker parameter is sufficient to handle best_fields/most_fields. Fixes #25574	2017-07-13 15:32:17 +02:00
Igor Motov	6125f535ae	mget with an alias shouldn't ignore alias routing (#25697 ) Closes #25696	2017-07-13 09:27:37 -04:00
Simon Willnauer	0e5d324c36	Prevent `can_match` requests from sending to incompatible nodes (#25705 ) With cross cluster search we can potentially proxy `can_match` requests to nodes that don't have the endpoint. This might not cause any problem from a functional perspective but will cause ugly error messages on the target node. This commit will cause an IAE if we try to talk to an incompatible node via a proxy. Relates to #25704	2017-07-13 14:59:41 +02:00
Colin Goodheart-Smithe	11477a608f	Removes FieldStats API (#25628 ) * Removes FieldStats API * iter * iter	2017-07-13 11:56:46 +01:00
Martijn van Groningen	a85b22b298	test: put template api is deprecated, so take warnings into account Relates to #25702	2017-07-13 11:39:53 +02:00
Martijn van Groningen	02fad9ac8c	docs: updated java client api to take this into account too to take into account the p/c queries are in parent-join module Closes #25624	2017-07-13 11:24:22 +02:00
Luca Cavanna	ec66d655b5	Rename client artifacts (#25693 ) It was brought up that our current client artifacts have generic names like 'rest' that may cause conflicts with other artifacts. This commit renames: - rest -> elasticsearch-rest-client - sniffer -> elasticsearch-rest-client-sniffer - rest-high-level -> elasticsearch-rest-high-level-client A couple of small changes are also preparing the high level client for its first release. Closes #20248	2017-07-13 09:44:25 +02:00
Christoph Büscher	97c4c43fb7	Make slop optional when parsing `span_near` query (#25677 ) The slop parameter defaults to 0 in the Lucene SpanNearQuery, so we can set it to this default value also and don't have to require it being specified in the query when using the Rest API. Leaving `slop` a ctor arg in the Java API as it should normally be specified and we can keep it `final` that way. Closes #25642	2017-07-13 09:21:49 +02:00
Simon Willnauer	02e9ad6d6f	Register correct response for `can_match` proxy response Relates to #25658 Closes #25698	2017-07-13 08:33:56 +02:00
Deb Adair	ded9f55263	[DOCS] Incorporated feedback on the highlighting changes.	2017-07-12 16:36:33 -07:00
Ryan Ernst	70b2897bdf	Scripting: Deprecate stored search template apis (#25437 ) This commit deprecates the PUT, GET and DELETE search template apis. Instead, the stored script api should be used. closes #24596	2017-07-12 16:07:28 -07:00
Sergey Galkin	e2bfb35f4a	Shrunk indices should ignore templates A shrunk index should ignore anything from templates and instead take its mappings, aliases, and settings from the original index, plus any new settings and aliases passed in with the shrink request. This commit causes this to be the case. Relates #25380	2017-07-12 18:27:38 -04:00
Simon Willnauer	b7bc790428	Use a non default port range in MockTransportService We already use a per JVM port range in MockTransportService. Yet, it's possible that if we are executing in the JVM with ordinal 0 that other clusters reuse ports from the mock transport service and some tests try to simulate disconnects etc. By using a non-default port range (starting at 10300) we prevent internal test clusters from reusing any of the mock impls ports Relates to #25301	2017-07-12 22:29:21 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large number of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range, i.e. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is no document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
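
The pre-filter phase described in commit e81804cfa4 can be illustrated with a rough sketch. This is not the Elasticsearch implementation: the `Shard` class, the `can_match` and `shards_to_query` functions, and the use of a per-shard timestamp range as the matching criterion are all assumptions made for illustration. Only the threshold name `pre_filter_shard_size` and its default of 128 come from the commit message.

```python
from dataclasses import dataclass
from typing import List

# Default threshold named in the commit message.
PRE_FILTER_SHARD_SIZE = 128


@dataclass(frozen=True)
class Shard:
    """Hypothetical shard summary: the range of timestamps it holds."""
    name: str
    min_ts: int
    max_ts: int


def can_match(shard: Shard, query_from: int, query_to: int) -> bool:
    """Cheap check: could any document in this shard fall inside the range?

    True may be a false positive (the shard might still return 0 hits),
    but False means no match is possible.
    """
    return shard.max_ts >= query_from and shard.min_ts <= query_to


def shards_to_query(
    shards: List[Shard],
    query_from: int,
    query_to: int,
    pre_filter_shard_size: int = PRE_FILTER_SHARD_SIZE,
) -> List[Shard]:
    """Return the shards the real search should hit."""
    # Pre-filtering only runs when the shard count exceeds the threshold.
    if len(shards) <= pre_filter_shard_size:
        return list(shards)
    # Fan out the cheap check and drop shards that cannot match.
    return [s for s in shards if can_match(s, query_from, query_to)]
```

With 200 hypothetical daily shards and a query covering only the last three of them, the sketch would skip the other 197 shards; with 128 or fewer shards it queries all of them unchanged.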

28236 Commits, All Branches