OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	cc865cbc96	Add unit tests for stats and extended stats aggregations (#23287 ) Add tests for InternalStats, InternalExtendedStats and StatsAggregator/ExtendedStatsAggregator Relates #22278	2017-02-21 15:14:54 +01:00
Simon Willnauer	f933f80902	First step towards incremental reduction of query responses (#23253 ) Today all query results are buffered up until we received responses of all shards. This can hold on to a significant amount of memory if the number of shards is large. This commit adds a first step towards incrementally reducing aggregations results if a, per search request, configurable amount of responses are received. If enough query results have been received and buffered all so-far received aggregation responses will be reduced and released to be GCed.	2017-02-21 13:02:48 +01:00
Tanguy Leroux	39ed76c58b	Add parsing method to bulk response (#23234 ) This commit adds the `fromXContent()` parsing method to BulkResponse.	2017-02-21 10:49:40 +01:00
Tanguy Leroux	c88eb00b83	Add javadoc for DocWriteResponse.Builders (#23267 )	2017-02-21 10:19:01 +01:00
Martin Scholz	24bf18b610	Upgrade HDRHistogram to 2.1.9 (#23254 )	2017-02-21 08:50:26 +01:00
Martin Scholz	3e292d5245	Migrate TermsQuery to TermInSetQuery (#23229 )	2017-02-21 08:49:43 +01:00
Jim Ferenczi	1ff5b318be	Fix for IpRangeAggregatorTests#testRanges Handle null from/to ranges. Closes #23272	2017-02-20 21:16:14 +01:00
Jason Tedor	4c2bd5feab	Introduce sequence-number-aware translog Today, the relationship between Lucene and the translog is rather simple: every document not in Lucene is guaranteed to be in the translog. We need a stronger guarantee from the translog though, namely that it can replay all operations after a certain sequence number. For this to be possible, the translog has to be made sequence-number aware. As a first step, we introduce the min and max sequence numbers into the translog so that each generation knows the possible range of operations contained in the generation. This will enable future work to keep around all generations containing operations after a certain sequence number (e.g., the global checkpoint). Relates #22822	2017-02-20 15:05:24 -05:00
Jason Tedor	15f5810774	Mark IP range aggregator test as awaits fix This test reliably fails with the seed 4AC319F8A6B0329B.	2017-02-20 14:42:16 -05:00
Christoph Büscher	ea7deace5d	Adding fromXContent to Suggest and Suggestion class (#23226 ) A follow up to #23202, this adds parsing from xContent and tests to the four Suggestion implementations and the top level suggest element to be used later when parsing the entire SearchResponse.	2017-02-20 15:45:10 +01:00
Christoph Büscher	ea9d51114c	Tests: Add unit test for InternalChildren (#23261 ) Relates to #22278	2017-02-20 14:02:56 +01:00
Jim Ferenczi	76d6b872dd	Add unit tests for GeoBoundsAggregator/InternalGeoBounds (#23259 ) * Add unit tests for GeoBoundsAggregator/InternalGeoBounds Relates #22278	2017-02-20 12:04:30 +01:00
Jim Ferenczi	69b1463f7c	Add unit tests for BinaryRangeAggregator/InternalBinaryRange (#23255 ) * Add unit tests for BinaryRangeAggregator/InternalBinaryRange Relates #22278	2017-02-20 11:55:48 +01:00
Tanguy Leroux	872412f645	[Tests] Cleans up DocWriteResponse parsing tests (#23233 ) This commit cleans up some parsing tests added from the High Level Rest Client: IndexResponseTests, DeleteResponseTests, UpdateResponseTests, BulkItemResponseTests. These tests are now more uniform with the other test-from-to-XContent tests we have: they now shuffle the XContent fields before parsing, the asserting method for parsed objects does not use a Map<String, Object> anymore, and buggy equals/hashCode methods in ShardInfo and ShardInfo.Failure have been removed.	2017-02-20 09:45:33 +01:00
Nik Everett	d9c37ce195	Adds unit test for sampler aggregation Relates to #22278	2017-02-17 16:16:04 -05:00
Nik Everett	d1de9574ea	Checkstyle: Fix link lengths in sampler aggregation	2017-02-17 15:03:57 -05:00
Jay Modi	b234644035	Enforce Content-Type requirement on the rest layer and remove deprecated methods (#23146 ) This commit enforces the requirement of Content-Type for the REST layer and removes the deprecated methods in transport requests and their usages. While doing this, it turns out that there are many places where *Entity classes are used from the apache http client libraries and many of these usages did not specify the content type. The methods that do not specify a content type explicitly have been added to forbidden apis to prevent more of these from entering our code base. Relates #19388	2017-02-17 14:45:41 -05:00
Adrien Grand	3bd1d46fc7	Add unit tests for terms aggregation objects. (#23149 ) Relates #22278	2017-02-17 18:01:40 +01:00
javanna	578853f264	Remove stale comment about setting routing before parent Order does not matter anymore since we merged #15371	2017-02-17 17:10:53 +01:00
Yuhao Bi	576e698613	Minor fix of _cat output (#23211 ) (#23213 ) One line was missing a trailing "\n"	2017-02-17 10:46:20 +01:00
Jason Tedor	00a8b8799f	Fix control group pattern The file /proc/self/cgroup lists the control groups to which the process belongs. This file is a colon separated list of three fields: 1. a hierarchy ID number 2. a comma-separated list of hierarchies 3. the pathname of the control group in the hierarchy The regex pattern for this contains a bug for the second field. It allows one or two entries in the comma-separated list, but not more. This commit fixes the pattern to allow one or more entries in the comma-separated list. Relates #23219	2017-02-16 15:31:18 -05:00
Christoph Büscher	268d15ec4c	Adding fromXContent to Suggestion.Entry and subclasses (#23202 ) This adds parsing from xContent to Suggestion.Entry and its subclasses for Terms-, Phrase- and CompletionSuggestion.Entry.	2017-02-16 17:59:55 +01:00
markharwood	1cd1ff6010	Test fix - faulty assumptions about when exceptions are thrown in relation to number of failing shards. (#23205 ) Search exceptions are thrown only when all shards report failure. Fix changes assertion logic to reflect this. Closes #23203	2017-02-16 13:48:17 +00:00
Jason Tedor	0a5917d182	Fix get HEAD requests Get HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23186	2017-02-15 13:07:29 -05:00
Christoph Büscher	458ca09e70	Fix checkstyle issue with modifier order in DocWriteResponse	2017-02-15 17:53:39 +01:00
Tanguy Leroux	e8d669f50c	Add parsing methods to BulkItemResponse (#22859 ) This commit adds a parsing method to the BulkItemResponse class. In order to do that, the way DocWriteResponses are parsed has to be changed: ConstructingObjectParser/ObjectParser is removed in favor of a simpler and more readable way to parse these objects. DocWriteResponse now provides the parseInnerToXContent() method that can be used by subclasses (IndexResponse, UpdateResponse and DeleteResponse) to parse the current token/field and potentially update a DocWriteResponseBuilder. The DocWriteResponseBuilder is a simple POJO used to contain parsed values. It can be passed around from one parsing method to another parsing method. For example, this is what is done in IndexResponse: an IndexResponseBuilder is created in IndexResponse.fromXContent(), it gets passed to IndexResponse.parseXContentFields() that parses fields specific to IndexResponse (like "created") and updates the context, delegating to DocWriteResponse.parseInnerToXContent() the parsing of any other field. Once all XContent is parsed, IndexResponse.fromXContent() uses the method IndexResponseBuilder.build() to create the new instance of IndexResponse. This behavior allows reusing parsing code among the class hierarchy while keeping the current behavior. It also allows other objects like BulkItemResponse to reuse the same parsing code to parse DocWriteResponses. Finally, IndexResponseTests, UpdateResponseTests and DeleteResponseTests have been updated to introduce some random shuffling of fields before the XContent is parsed in order to ensure that the parsing code does not rely on field order.	2017-02-15 17:33:10 +01:00
Christoph Büscher	b963144254	Add xcontent parsing to completion suggestion option (#23071 ) This adds parsing from xContent to the CompletionSuggestion.Entry.Option. The completion suggestion option also inlines the xContent rendering of the contained SearchHit, so in order to reuse the SearchHit parser this also changes the way SearchHit is parsed from using a loop-based parser to using a ConstructingObjectParser that creates an intermediate map representation and then later uses this output to create either a single SearchHit or use it with additional fields defined in the parser for the completion suggestion option.	2017-02-15 16:52:17 +01:00
Jim Ferenczi	3c26754f87	Add BWC index for new released version 5.2.1	2017-02-15 11:14:37 +01:00
Jim Ferenczi	f1aaa71a7f	Create version constants for next bug fix version v5.2.2	2017-02-15 11:13:09 +01:00
Ryan Ernst	048c87d8a5	Improve setting deprecation message (#23156 ) This change modifies the deprecation log message emitted when a setting is found which is deprecated. The new message indicates docs for the deprecated settings can be found in the breaking changes docs for the next major version. closes #22849	2017-02-14 21:33:13 -08:00
Jason Tedor	6ac1cb660b	Cleanup RestGetIndicesAction.java This commit is just a code cleanup of RestGetIndicesAction.java. For example, we remove an unnecessary class, remove some unnecessary local variables, and simplify some code flow. Relates #23129	2017-02-14 16:51:27 -05:00
Jason Tedor	673754b1d5	Fix get source HEAD requests Get source HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get source HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23151	2017-02-14 16:37:22 -05:00
Martijn van Groningen	cab43707dc	[percolator] Removed old 2.x bwc logic.	2017-02-14 22:17:17 +01:00
Areek Zillur	e178dc5493	Add request version asserting during replica operation (#23167 )	2017-02-14 15:40:55 -05:00
Simon Willnauer	a7a3729596	Add ExpandSearchPhase as a successor for the FetchSearchPhase (#23165 ) Now that we have more flexible search phases we should move the rather hacky integration of the collapse feature as a real search phase that can be tested and used by itself. This commit adds a new ExpandSearchPhase including a unittest for the phase. It's integrated into the fetch phase as an optional successor.	2017-02-14 17:14:17 +01:00
Adrien Grand	8d6a41f671	Nested queries should avoid adding unnecessary filters when possible. (#23079 ) When nested objects are present in the mappings, many queries get deoptimized due to the need to exclude documents that are not in the right space. For instance, a filter is applied to all queries that prevents them from matching non-root documents (`+: -_type:__`). Moreover, a filter is applied to all child queries of `nested` queries in order to make sure that the child query only matches child documents (`_type:__nested_path`), which is required by `ToParentBlockJoinQuery` (the Lucene query behing Elasticsearch's `nested` queries). These additional filters slow down `nested` queries. In 1.7-, the cost was somehow amortized by the fact that we cached filters very aggressively. However, this has proven to be a significant source of slow downs since 2.0 for users of `nested` mappings and queries, see #20797. This change makes the filtering a bit smarter. For instance if the query is a `match_all` query, then we need to exclude nested docs. However, if the query is `foo: bar` then it may only match root documents since `foo` is a top-level field, so no additional filtering is required. Another improvement is to use a `FILTER` clause on all types rather than a `MUST_NOT` clause on all nested paths when possible since `FILTER` clauses are more efficient. Here are some examples of queries and how they get rewritten: ``` "match_all": {} ``` This query gets rewritten to `ConstantScore(+:* -_type:__)` on master and `ConstantScore(_type:AutomatonQuery {\norg.apache.lucene.util.automaton.Automaton@4371da44})` with this change. The automaton is the complement of `_type:__` so it matches the same documents, but is faster since it is now a positive clause. Simplistic performance testing on a 10M index where each root document has 5 nested documents on average gave a latency of 420ms on master and 90ms with this change applied. 
``` "term": { "foo": { "value": "0" } } ``` This query is rewritten to `+foo:0 #(ConstantScore(+: -_type:__))^0.0` on master and `foo:0` with this change: we do not need to filter nested docs out since the query cannot match nested docs. While doing performance testing in the same conditions as above, response times went from 250ms to 50ms. ``` "nested": { "path": "nested", "query": { "term": { "nested.foo": { "value": "0" } } } } ``` This query is rewritten to `+ToParentBlockJoinQuery (+nested.foo:0 #_type:__nested) #(ConstantScore(+:* -_type:__))^0.0` on master and `ToParentBlockJoinQuery (nested.foo:0)` with this change. The top-level filter (`-_type:__`) could be removed since `nested` queries only match documents of the parent space, as well as the child filter (`#_type:__nested`) since the child query may only match nested docs since the `nested` object has both `include_in_parent` and `include_in_root` set to `false`. While doing performance testing in the same conditions as above, response times went from 850ms to 270ms.	2017-02-14 16:05:19 +01:00
Adrien Grand	a969dad43e	Integrate IndexOrDocValuesQuery. (#23119 ) This gives Lucene the choice to use index/point-based queries or doc-values-based queries depending on which one is more efficient. This commit integrates this feature for: - long/integer/short/byte/double/float/half_float/scaled_float ranges, - date ranges, - geo bounding box queries, - geo distance queries.	2017-02-14 15:57:12 +01:00
Jun Ohtani	12bbe6e660	Merge pull request #23161 from johtani/support_keyword_to_analyze_api [Analyze]Support Keyword type in Analyze API	2017-02-14 23:22:32 +09:00
Christoph Büscher	abc8cd6c5f	Remove unused sourceAsBytes field in SearchHit	2017-02-14 14:08:38 +01:00
Simon Willnauer	aef0665ddb	Detach SearchPhases from AbstractSearchAsyncAction (#23118 ) Today all search phases are inner classes of AbstractSearchAsyncAction or one of its subclasses. This makes unit testing of these classes practically impossible. This commit extracts `DfsQueryPhase` and `FetchSearchPhase` out of the code that composes the actual query execution types and moves most of the fan-out and collect code into an `InitialSearchPhase` class that can be used to build initial search phases (phases that retry on shards). This will make modification to these classes simpler and makes it easy to compose or add new search phases down the road if additional roundtrips are required.	2017-02-14 12:34:25 +01:00
Jun Ohtani	34ebb88650	[Analyze]Support Keyword type in Analyze API Add comment and clarify	2017-02-14 17:56:36 +09:00
Jun Ohtani	4d823d69f4	[Analyze]Support Keyword type in Analyze API	2017-02-14 16:41:16 +09:00
Jason Tedor	5343b87502	Handle bad HTTP requests When Netty decodes a bad HTTP request, it marks the decoder result on the HTTP request as a failure, and reroutes the request to GET /bad-request. This leads to puzzling responses when a bad request is sent to Elasticsearch (if an index named "bad-request" does not exist then it produces an index not found exception and otherwise responds with the index settings for the index named "bad-request"). This commit addresses this by inspecting the decoder result on the HTTP request and dispatching the request to a bad request handler preserving the initial cause of the bad request and providing an error message to the client. Relates #23153	2017-02-13 17:39:25 -05:00
Jay Modi	61e383813d	Make the version of the remote node accessible on a transport channel (#23019 ) This commit adds a new method to the TransportChannel that provides access to the version of the remote node that the response is being sent on and that the request came from. This is helpful for serialization of data attached as headers.	2017-02-13 15:15:57 -05:00
Lee Hinman	b42d47770c	Fix total disk bytes returning negative value (#23093 ) * Fix total disk bytes returning negative value This adds a workaround for JDK-8162520 - https://bugs.openjdk.java.net/browse/JDK-8162520 Some filesystems can be so large that they return a negative value for their free/used/available disk bytes due to being larger than `Long.MAX_VALUE`. This adds protection for our `FsProbe` implementation and adds a test that it does the right thing.	2017-02-13 11:20:15 -07:00
jaymode	d8d03f45c2	Fix communication with 5.3.0 nodes This commit fixes communication with 5.3.0 nodes to send XContentType to these nodes since #22691 was backported to the 5.3 branch.	2017-02-13 13:15:51 -05:00
Jason Tedor	9dff5e2af7	Properly encode location header Today when trying to encode the location header to ASCII, we rely on the Java URI API. This API requires a proper URI which blows up whenever the URI contains, for example, a space (which can happen if the type, ID, or routing contain a space). This commit addresses this issue by properly encoding the URI. Additionally, we remove the need to create a URI simplifying the code flow. Relates #23133	2017-02-13 09:34:52 -05:00
Tanguy Leroux	de94c1253a	Expose WriteRequest.RefreshPolicy string representation (#23106 ) This commit changes the RefreshPolicy enum so that string representation are exposed. This will help the high level rest client to simply use refreshPolicy.getValue() to get the corresponding parameter value of a given refresh policy.	2017-02-13 10:49:46 +01:00
Boaz Leskes	29ea3059fc	Allow a cluster state applier to register an observer and wait for a better state (#23132 ) #21817 introduced the notion of a cluster state applier and banned those from sampling the cluster state directly (as it is not applied yet). Testing has exposed one exceptional use case - if the appliers want to spawn off a follow up it may require waiting for a specific new cluster state (for example, the shard started action, called by the IndicesClusterStateService, may run into trouble connecting to the master and wait for a new master to be elected). This requires creating an observer which, in turn, samples the cluster state. An example failure can be seen at https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+periodic/1701/console This commit allows creating an observer from a cluster state applier. The observer is adapted to exclude any potential old cluster state in its logic.	2017-02-12 14:58:22 +02:00
Jason Tedor	0f21ed5b70	Fix template HEAD requests Template HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for template HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23130	2017-02-11 18:30:16 -05:00
Lee Hinman	13446937a5	Remove action.allow_id_generation setting (#23120 ) This was an undocumented and unsettable setting that allowed id generation. Resolves #23088	2017-02-10 14:04:40 -07:00
Jim Ferenczi	1ba73d9797	Fix GraphQuery expectation after Lucene upgrade to 6.5 (#23117 ) GraphQueries are now generated as simple clauses in BooleanQuery. So for instance a multi terms synonym will generate a GraphQuery but only for the side paths, the other part of the query will not be impacted. This means that we cannot apply `minimum_should_match` or `cutoff_frequency` on GraphQuery anymore (only ES 5.3 does that because we generate all possible paths if a query has at least one multi terms synonym). Starting in 5.4 multi terms synonym will now be treated as a single term when `minimum_should_match` is computed and will be ignored when `cutoff_frequency` is set. Fixes #23102	2017-02-10 18:20:00 +01:00
sabi0	09c7c5c82f	Limit IndexRequest toString() length (#22832 ) Limits the length of `IndexRequest#toString` which also limits the size of the task description generated for `IndexRequest`s. If the document being written is larger than 2kb we skip logging the _source entirely. This is because truncating the source is tricky and it isn't worth it.	2017-02-10 10:42:08 -05:00
Sebastian	976da87e8f	Fix some Javadoc typos (#23111 )	2017-02-10 15:53:30 +01:00
Jason Tedor	a6158398dd	Fix index HEAD requests Index HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for index HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23112	2017-02-10 09:44:01 -05:00
Jason Tedor	7ac44656df	Fix alias HEAD requests Alias HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for alias HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23094	2017-02-10 09:19:35 -05:00
Adrien Grand	709cc9ba65	Upgrade to lucene-6.5.0-snapshot-f919485. (#23087 )	2017-02-10 15:08:47 +01:00
Jay Modi	7018b6ac6f	Add BulkProcessor methods with XContentType parameter (#23078 ) This commit adds methods to the BulkProcessor that accept bytes and a XContentType to avoid content type detection. The methods that do not accept XContentType with bytes have been deprecated by this commit. Relates #22691	2017-02-10 08:59:37 -05:00
Jason Tedor	4f2b4724be	Cleanup RestGetAliasesAction.java This commit is just a code cleanup of RestGetAliasesAction.java. For example, we remove an unnecessary class, simplify a convenience method, and simplify some code flow. Relates #23095	2017-02-10 08:37:05 -05:00
Tanguy Leroux	e2e5937455	Use `typed_keys` parameter to prefix suggester names by type in search responses (#23080 ) This pull request reuses the typed_keys parameter added in #22965, but this time it applies it to suggesters. When set to true, the suggester names in the search response will be prefixed with a prefix that reflects their type.	2017-02-10 10:53:38 +01:00
Boaz Leskes	e0c8a6a3eb	Relax WaitActiveShardCountIT check of exception messages So it wouldn't depend on BulkShardRequest.toString()	2017-02-09 23:14:09 +02:00
Areek Zillur	990918a655	fix failing tests for BulkShardRequest.tostring	2017-02-09 15:34:22 -05:00
Boaz Leskes	033defee9a	fix BulkShardRequestTests after changes to BulkShardRequest.toString	2017-02-09 21:05:21 +02:00
Boaz Leskes	cd1cb41603	Move EvilPeerRecoveryIT to a unit test in RecoveryDuringReplicationTests (#22900 ) EvilPeerRecoveryIT checks a scenario where recovery is happening while there are ongoing indexing operations that have already been assigned a seq#. This is fairly hard to achieve and the test goes through a couple of hoops via the plugin infra to achieve that. This PR extends the unit tests infra to allow for those hoops to happen in unit tests. This allows the test to be moved to RecoveryDuringReplicationTests Relates to #22484	2017-02-09 20:14:03 +02:00
Jim Ferenczi	94087b3274	Removes ExpandCollapseSearchResponseListener, search response listeners and blocking calls This change removes the SearchResponseListener that was used by the ExpandCollapseSearchResponseListener to expand collapsed hits. The removal of SearchResponseListener is not a breaking change because it was never released. This change also replaces the blocking call in ExpandCollapseSearchResponseListener by a single asynchronous multi search request. The parallelism of the expand request can be set via CollapseBuilder#max_concurrent_group_searches Closes #23048	2017-02-09 18:06:10 +01:00
Boaz Leskes	33915aefd8	Improve BulkShardRequest.toString when it has only 1 internal request Now that we use bulk for single item indexing, this is often the case. Having an indicator of the id of the indexed document helps debugging. It now looks like this `BulkShardRequest to [[test][0]] containing [index {[test][type][AVojzy9ZxfWASZ-ysmN7], source[{"auto":true}]}]`	2017-02-09 18:59:49 +02:00
Luca Cavanna	90ea778c17	Cluster allocation explain to never return empty response body (#23054 ) Empty response bodies should only be sent for HEAD requests, otherwise we should always send back info about the exception that was thrown. Removed some manual exception handling in the REST action that should be rather bubbled up and handled by our rest action infra like every other rest action does.	2017-02-09 17:46:39 +01:00
Luca Cavanna	9f60924ed5	Remove redundant reads of human flag (#23074 ) The human flag is centrally handled in RestChannel, no need to have Rest actions manually read it and set it to the builder	2017-02-09 14:58:01 +01:00
Christoph Büscher	b85fa54ee7	Tests: Renaming InternalSearchHitsTests to SearchHitsTests The class under test changed its name from InternalSearchHit(s) to just SearchHit(s), renaming the tests accordingly.	2017-02-09 14:17:21 +01:00
Tanguy Leroux	3553522328	Add parameter to prefix aggs name with type in search responses (#22965 ) This pull request adds a new parameter to the REST Search API named `typed_keys`. When set to true, the aggregation names in the search response will be prefixed with a prefix that reflects the internal type of the aggregation. Here is a simple example: ``` GET /_search?typed_keys { "aggs": { "tweets_per_user": { "terms": { "field": "user" } } }, "size": 0 } ``` And the response: ``` { "aggs": { "sterms:tweets_per_user": { ... } } } ``` This parameter is intended to make life easier for REST clients that could parse back the prefix and could detect the type of the aggregation to parse. It could also be implemented for suggesters.	2017-02-09 11:19:04 +01:00
Simon Willnauer	e02d5563f4	Harden ops counting in AbstractSearchAsyncAction (#23045 ) Today we account for too many responses with an `IllegalStateException` in `AbstractSearchAsyncAction`. While this is something that should never happen, we should rather assert that we always have less than or equal to the number of expected ops when waiting for responses.	2017-02-09 09:30:13 +01:00
Luca Cavanna	b5f5356c4a	Remove getDefaultScriptingLanguage from QueryParseContext (#23043 ) The method is not needed anymore, was needed only when we supported setting a legacy default lang, which was removed with #21607 Relates to #21607	2017-02-09 09:03:26 +01:00
Nik Everett	f7071325c4	Fix generics on LeafDocLookup (#23060 ) All the warnings were upsetting me. This doesn't change behavior.	2017-02-08 18:59:24 -05:00
Christoph Büscher	e09f3ecbb3	Add xcontent parsing to suggestion options (#23018 ) This adds parsing from xContent to Suggestion.Entry.Option and TermSuggestion.Entry.Option.	2017-02-08 19:03:12 +01:00
Jay Modi	7f3769c745	Remove ldjson support and document ndjson for bulk/msearch (#23049 ) This commit removes support for the `application/x-ldjson` Content-Type header as this was only used in the first draft of the spec and had very little uptake. Additionally, the docs for bulk and msearch have been updated to specifically call out ndjson and mention that the newline character may be preceded by a carriage return. Finally, the bulk request handling of the carriage return has been improved to remove this character from the source. Closes #23025	2017-02-08 11:55:50 -05:00
Simon Willnauer	df932ef68f	Fix line len	2017-02-08 16:41:41 +01:00
Simon Willnauer	d45761e488	Fork off a search thread before sending back fetched responses This is just a temporary fix until #23048 is fixed. FieldCollapsing is executing blocking calls on a network thread which causes potential deadlocks and trips assertions. Relates to #23048	2017-02-08 15:27:08 +01:00
Simon Willnauer	ecb01c15b9	Fold InternalSearchHits and friends into their interfaces (#23042 ) We have a bunch of interfaces that have only a single implementation for 6 years now. These interfaces are pretty useless from a SW development perspective and only add unnecessary abstractions. They also require lots of casting in many places where we expect that there is only one concrete implementation. This change removes the interfaces, makes all of the classes final and removes the duplicate `foo` `getFoo` accessors in favor of `getFoo` from these classes.	2017-02-08 14:40:08 +01:00
Simon Willnauer	2d6d871f5c	Raise a phase failure if fetch phase gets rejected	2017-02-08 12:52:18 +01:00
Boaz Leskes	0161edae10	MasterFaultDetection can start after the initial cluster state has been processed and the NodeConnectionService connect to the new master (#23037 ) After the first cluster state from a new master is processed, NodeConnectionService guarantees we connect to the new master. This removes the need to explicitly connect to the master in the MasterFaultDetection code making it simpler and bypasses the assertion triggered due to the blocking operation on the cluster state thread. Relates to #22828	2017-02-08 13:49:06 +02:00
Simon Willnauer	a8b376670c	Separate reduce (aggs, suggest and profile) from merging fetched hits (#23017 ) Today we carry on all search results including aggs, suggest and profile results until we have successfully fetched all hits for the search request. This can potentially hold on to a large amount of memory if there are heavy aggregations involved. With this change aggs and profiles are entirely consumed and released for GC before the fetch phase is executing. This is a first step towards reducing results on-the-fly if the number of non-empty responses is large.	2017-02-08 10:11:51 +01:00
Yannick Welsch	9154686623	Remove legacy primary shard allocation mode based on versions (#23016 ) Elasticsearch v5.0.0 uses allocation IDs to safely allocate primary shards whereas prior versions of ES used a version-based mode instead. Elasticsearch v5 still has support for version-based primary shard allocation as it needs to be able to load 2.x shards. ES v6 can drop the legacy support.	2017-02-08 10:00:55 +01:00
Boaz Leskes	a512ab32fb	Increase time out tolerance in NoMasterNodeIT. see https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-intake/746/console	2017-02-08 08:50:26 +02:00
Lee Hinman	b3c27a7fdd	Disallow include_in_all for 6.0+ indices Since `_all` is now deprecated and cannot be set for new indices, we should also disallow any field that has the `include_in_all` parameter set. Resolves #22923	2017-02-07 19:31:51 -07:00
Tim Brooks	fcc568fd8d	Add methods requiring connect to forbidden apis (#22964 ) This is related to #22116. This commit adds calls that require SocketPermission connect to forbidden APIs. The following calls are now forbidden: - java.net.URL#openStream() - java.net.URLConnection#connect() - java.net.URLConnection#getInputStream() - java.net.Socket#connect(java.net.SocketAddress) - java.net.Socket#connect(java.net.SocketAddress, int) - java.nio.channels.SocketChannel#open(java.net.SocketAddress) - java.nio.channels.SocketChannel#connect(java.net.SocketAddress)	2017-02-07 14:41:50 -06:00
Boaz Leskes	ba06c14a97	TransportService.connectToNode should validate remote node ID (#22828 ) #22194 gave us the ability to open low level temporary connections to remote node based on their address. With this use case out of the way, actual full blown connections should validate the node on the other side, making sure we speak to who we think we speak to. This helps in case where multiple nodes are started on the same host and a quick node restart causes them to swap addresses, which in turn can cause confusion down the road.	2017-02-07 22:11:32 +02:00
Tim Brooks	adc1184dd0	Fix broken test in FileSystemUtilsTests Commit `ee84ce09d7` changed an exception message without changing the corresponding test. This commit fixes the related test.	2017-02-07 12:50:07 -06:00
Tim Brooks	ee84ce09d7	Allow openFileURLStream(URL) to open jars This is related to #23020. There are some cases where this method might be called with a URL to a file inside a jar. This commit allows this method to read URLs with a protocol of 'jar:/'.	2017-02-07 11:42:27 -06:00
Ryan Ernst	470ad1ae4a	Settings: Add secure settings validation on startup (#22894 ) Secure settings from the elasticsearch keystore were not yet validated. This change improves support in Settings so that secure settings more seamlessly blend in with normal settings, allowing the existing settings validation to work. Note that the setting names are still not validated (yet) when using the elasticsearch-keystore tool.	2017-02-07 09:34:41 -08:00
Tim Brooks	27b7d9bd8d	Add FileSystemUtil method to read 'file:/' URLs (#23020 ) As part of #22116 we are going to forbid usage of the API java.net.URL#openStream(). However, in a number of places across the codebase we use this method to read files from the local filesystem. This commit introduces a helper method openFileURLStream(URL url) to read files from URLs. It validates the URL to ensure that only file:/ URLs are read. Additionally, this commit removes the unneeded method FileSystemUtil.newBufferedReader(URL, Charset). This method used the openStream() method, which will soon be forbidden. Instead we use Files.newBufferedReader(Path, Charset).	2017-02-07 10:24:22 -06:00
Jay Modi	c898e8ab83	Add support for newline delimited JSON Content-Type (#22947 ) This commit adds support for the newline delimited JSON Content-Type, which is how the bulk, multi-search, and multi-search template APIs expect data to be formatted. The `elasticsearch-js` client has also been using this content type for these types of requests. Closes #22943	2017-02-07 09:20:06 -05:00
Simon Willnauer	dc659feeb4	Add a setting to disable remote cluster connections on a node (#23005 ) Today either all nodes in the cluster connect to remote clusters or only nodes that have remote clusters configured in their node config. To allow global remote cluster configuration but restrict connections to a set of nodes in the cluster, this change adds a new setting `search.remote.connect` (defaults to `true`) that allows disabling remote cluster connections on a per-node basis.	2017-02-07 09:59:24 +01:00
Nik Everett	0d6e622242	Make dates be ReadableDateTimes in scripts (#22948 ) Instead of longs. If you want millis since epoch you can call doc.date_field.value.millis. Relates to #22875	2017-02-06 16:44:56 -05:00
Nicholas Knize	1c9fdfd1b3	Remove GeoPointFieldMapper abstraction In order to support the evolving GeoPoint encodings in Lucene 5 and 6, ES 2.x and 5.x implement an abstraction layer over the GeoPointFieldMapper classes. As of 5.x the geo_point field mapper settled on using Lucene's more performant LatLonPoint field type and deprecated all other encodings. In 6.0 all encodings except LatLonPoint have been removed, rendering this abstraction layer useless. This commit removes the abstraction layer and renames the LatLonPointFieldMapper back to GeoPointFieldMapper to maintain consistency with ES field naming.	2017-02-06 14:17:21 -06:00
Christoph Büscher	033f03109f	[Tests] Adding tests for AvgAggregator and InternalAvg (#23000 )	2017-02-06 20:05:40 +01:00
Ali Beyad	42a9f95fde	This commit changes the exception type thrown when trying to (#22921 ) create a snapshot with a name that already exists in the repository. Instead of throwing a SnapshotCreateException, which results in a generic 500 status code, a duplicate snapshot name will throw an InvalidSnapshotNameException, which will result in a 400 status code (bad request).	2017-02-06 11:39:59 -06:00
Adrien Grand	eb26e1a292	Add unit tests to histogram aggregations. (#22961 )	2017-02-06 18:18:21 +01:00
Simon Willnauer	f09c4e1cdb	Expose `search.highlight.term_vector_multi_value` as a node level setting (#22999 ) This setting was missed in the great settings refactoring and should be exposed via node level settings.	2017-02-06 18:17:34 +01:00
Simon Willnauer	7513c6e4eb	Remove QUERY_AND_FETCH search type (#22996 ) `QUERY_AND_FETCH` has been treated as an internal optimization for 2 major versions. This commit removes the search type and its implementation details and folds the optimization in the case of a single shard into the search controller such that every search with a single shard (non DFS) will receive this optimization.	2017-02-06 17:10:03 +01:00
Boaz Leskes	5e7d22357f	Connect to new nodes concurrently (#22984 ) When a node receives a new cluster state from the master, it opens up connections to any new node in the cluster state. That has always been done serially on the cluster state thread, but it has been a long-standing TODO to do this concurrently, which is done by this PR. This is a spin-off of #22828, where an extra handshake is done whenever connecting to a node, which may slow down connecting. Also, the handshake is done in a blocking fashion, which triggers assertions w.r.t. blocking requests on the cluster state thread. Instead of adding an exception, I opted to implement concurrent connections, which both sidesteps the assertion and compensates for the extra handshake.	2017-02-06 16:32:41 +01:00

7646 Commits