OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	5421a743a7	Move SearchLookup into FetchContext (#62549 ) FetchSubPhase#getProcessor currently takes a SearchLookup parameter. This however is only needed by a couple of subphases, and will almost certainly change in future as we want to simplify how fetch phases retrieve values for individual hits. To future-proof against further signature changes, this commit moves the SearchLookup reference into FetchContext instead.	2020-09-17 17:39:02 +01:00
Alan Woodward	e3e3aef3d8	Load version metadata even when stored fields are disabled (#62533 ) Currently we throw an error if stored fields are disabled, but hit version metadata is requested on a search. This doesn't make much sense, as the version information is stored in docvalues and so has no connection with stored fields. This commit removes the link between the two, allowing version metadata to be loaded even when stored fields are disabled in a request. Fixes #62456	2020-09-17 17:39:02 +01:00
Alan Woodward	91e2330529	Warn on badly-formed null values for date and IP field mappers (#62487 ) In #57666 we changed when null_value was parsed for ip and date fields. Previously, the null value was stored as a string, and parsed into a date or InetAddress whenever a document containing a null value was encountered. Now, the values are parsed when the mappings are built, which means that bad values are detected up front; if you try and add a mapping with a badly-parsed ip or date for a null_value, the mapping will be rejected. This causes problems for upgrades in the case when you have a badly-formed null_value in a pre-7.9 cluster. This commit fixes the upgrade case by changing the logic to only logging a warning on the badly formed value, replicating the earlier behaviour. Fixes #62363	2020-09-17 16:38:08 +01:00
Ignacio Vera	901000891a	Fix test error in InternalCardinalityTests#testEqualsAndHashcode (#62542 ) (#62554 ) Make sure the the new HLL++ is different to the original one	2020-09-17 17:09:13 +02:00
Alan Woodward	63afc61b08	Introduce FetchContext (#62357 ) We currently pass a SearchContext around to share configuration among FetchSubPhases. With the introduction of runtime fields, it would be useful to start storing some state on this context to be shared between different subphases (for example, stored fields or search lookups can be loaded lazily but referred to by many different subphases). However, SearchContext is a very large and unwieldy class, and adding more methods or state here feels like a bridge too far. This commit introduces a new FetchContext class that exposes only those methods on SearchContext that are required for fetch phases. This reduces the API surface area for fetch phases considerably, and should give us some leeway to add further state.	2020-09-17 09:57:43 +01:00
Adrien Grand	e0a4a94985	Speed up merging when source is disabled. (#62443 ) (#62474 ) The CodecReader wrapper we use to remove the `_recovery_source` field doesn't override `StoredFieldsreader#getMergeInstance`, which has the undesired side-effect of preventing the wrapped stored fields reader from optimizing merging.	2020-09-17 10:53:31 +02:00
David Turner	62dcc5b1ae	Suppress stack in VersionConflictEngineException (#62433 ) `VersionConflictEngineException` is thrown on the hot path for updates, but stack traces are expensive to compute and transport and rarely useful for this kind of exception. This commit avoids computing the stack trace for these exceptions.	2020-09-17 09:40:07 +01:00
Adrien Grand	9a8225bbc1	Upgrade to lucene-8.7.0-snapshot-9cd3af50f80. (#62450 ) (#62476 ) This new snapshot contains the following JIRAs that we're interested in: - [LUCENE-9525](https://issues.apache.org/jira/browse/LUCENE-9525) Better handling of small documents. This should improve retrieval times when documents are less than ~1kB. - [LUCENE-9510](https://issues.apache.org/jira/browse/LUCENE-9510) Faster flushes when index sorting is enabled by not compressing the temporary files that store stored fields and term vectors.	2020-09-17 10:28:20 +02:00
Armin Braun	5112c17319	Add WARN Logging on Slow Transport Message Handling (#62444 ) (#62521 ) Add simple WARN logging on slow inbound TCP messages.	2020-09-17 10:12:20 +02:00
David Turner	14aec44cd8	Log if recovery affected by disconnect (#62437 ) Today we only emit `DEBUG` logs if the source disconnects from the target during a recovery. This deserves to be noisier by default since it should be rare and may help users identify other problems with their network or with their shard movements. This commit promotes this message to `INFO`. There's no need for `WARN` since these days we will normally resume the recovery where it left off.	2020-09-17 08:22:40 +01:00
Ignacio Vera	2d3ca9c155	Introduce a sparse HyperLogLogPlusPlus class for cloning and serializing low cardinality buckets (#62480 ) (#62520 ) Reduces the memory footprint of an HLL++ structure that uses Linear counting when cloning or deserialising the data structure.	2020-09-17 08:54:50 +02:00
Julie Tibshirani	e1da558206	Remove unused test search context for significant_terms.	2020-09-16 14:27:11 -07:00
Jay Modi	5da922064f	LocalNodeMasterListener is a regular listener (#62485 ) This commit makes the LocalNodeMasterListener interface extend the ClusterStateListener interface and use a default implementation for detecting whether the local node master status changed. Backport of #62422	2020-09-16 11:42:53 -06:00
Tanguy Leroux	8a2e9e66d4	Wait for relocations and disk threshold monitor in DiskThresholdDeciderIT (#62358 ) (#62467 ) Closes #62326	2020-09-16 17:40:20 +02:00
Armin Braun	f6a8599cf8	Don't Start Redundant ConsistentSettingsService (#62283 ) (#62428 ) The consistent settings service is only used in tests so far. No need to start it unless it's actually used.	2020-09-16 09:43:04 +02:00
Ignacio Vera	f3ed641fc7	Adds bucketOrd back to cardinality algorithms (#62389 ) (#62427 )	2020-09-16 08:41:57 +02:00
Nik Everett	24a24d050a	Implement fields fetch for runtime fields (backport of #61995 ) (#62416 ) This implements the `fields` API in `_search` for runtime fields using doc values. Most of that implementation is stolen from the `docvalue_fields` fetch sub-phase, just moved into the same API that the `fields` API uses. At this point the `docvalue_fields` fetch phase looks like a special case of the `fields` API. While I was at it I moved the "which doc values sub-implementation should I use for fetching?" question from a bunch of `instanceof`s to a method on `LeafFieldData` so we can be much more flexible with what is returned and we're not forced to extend certain classes just to make the fetch phase happy. Relates to #59332	2020-09-15 20:24:10 -04:00
Nik Everett	0a7f335215	Speed up writeVInt (backport of #62345 ) (#62419 ) This speeds up `StreamOutput#writeVInt` quite a bit which is nice because it is very commonly called when serializing aggregations. Well, when serializing anything. All "collections" serialize their size as a vint. Anyway, I was examining the serialization speeds of `StringTerms` and this saves about 30% of the write time for that. I expect it'll be useful other places.	2020-09-15 17:14:08 -04:00
Nik Everett	771a8893a6	Add more debugging information for cardinality agg (#62317 ) (#62397 ) This adds two extra bits of info to the profiler: 1. Count of the number of different types of collectors. This lets us figure out if we're using the optimization for segment ordinals. It adds a few more similar counters just for good measure. 2. Profiles the `getLeafCollector` and `postCollection` methods. These are non-trivial for some aggregations, like cardinality.	2020-09-15 13:21:11 -04:00
Armin Braun	ffbc64bd10	Log WARN on Response Deserialization Failure (#62368 ) (#62388 ) We never see this exception in the logs even though it's pretty severe. All we might see is an exception about a transport message not having been read fully from the logic that follows this code. Technically we should probably bubble up the exception but that's a bigger change and needs some carefully reasoning, this change for the time being at least simplifies tracking down deserialization issues in responses.	2020-09-15 18:27:39 +02:00
Adrien Grand	6db8afefc2	Upgrade to lucene-8.7.0-snapshot-cdfdc1e0851. (#62376 ) Upgrade to a new Lucene snapshot that (at least partially) addresses the indexing rate regression when index sorting is enabled. Backport of #62334.	2020-09-15 17:48:07 +02:00
Alan Woodward	f89fa421e2	Remove unnecessary IndexSearcher field on HitContext (#62378 ) FastVectorHighlighter uses the top-level reader to rewrite queries against, which it gets via an IndexSearcher field on HitContext. However, we can already access this top-level reader via HitContext's existing LeafReaderContext field. This commit removes the unnecessary field and constructor parameter, and changes the implementation of topLevelReader to go via ReaderUtils and the leaf reader context.	2020-09-15 15:46:14 +01:00
Christoph Büscher	0ca9829867	Muting CoordinatorTests#testLogsMessagesIfPublicationDelayed	2020-09-15 15:40:51 +02:00
Albert Zaharovits	aeed1c05b0	Ensure authz operation overrides transient authz headers (#61621 ) AuthorizationService#authorize uses the thread context to carry the result of the authorisation as transient headers. The listener argument to the `authorize` method must necessarily observe the header values. This PR makes it so that the authorisation transient headers (`_indices_permissions` and `_authz_info`, but NOT `_originating_action_name`) of the child action override the ones of the parent action. Co-authored-by: Tim Vernum tim@adjective.org	2020-09-15 16:37:38 +03:00
Armin Braun	eae6a3b18e	Fix testMappingVersionAfterDynamicMappingUpdate (#62352 ) (#62360 ) There is a race in this test where the index request will return once the dynamic mapping update has been observed by the cluster state observer internally used by the indexing but not hit all state appliers and thus isn't showing up as the applied state returned by `clusterService.state()` yet.	2020-09-15 11:59:22 +02:00
Alan Woodward	a68f7077c7	Rationalise fetch phase exceptions (#62230 ) We have a special FetchPhaseExecutionException which contains some useful information about which shard and doc a fetch phase has failed in. However, this is not used in many places - currently only the ExplainPhase and the highlighters throw one, and the FetchPhase itself catches IOExceptions and just passes them to the ExceptionsHelper with no extra context. This commit changes FetchPhase to throw FetchPhaseExecutionException if it encounters problems in any of its subphases, and removes the special handling from the explain and highlight phases. It also removes the need to pass shard ids around when building HitContext objects.	2020-09-15 09:28:19 +01:00
Alan Woodward	8089210815	Some small cleanups in TermVectorsService (#62292 ) We removed the use of aggregated stats from term vectors back in #16452, but there is a bunch of dead code left here which can be stripped out.	2020-09-15 09:01:49 +01:00
Ignacio Vera	3536f7f7c2	Initialize BitArray storage as number of bits (#62327 ) (#62354 )	2020-09-15 08:34:22 +02:00
Armin Braun	c81a076f5a	Improve Efficiency of ClusterApplierService Iteration (#62282 ) (#62350 ) The complexity of removing a timeout listener was `O(n)` which means that in case of many queued up CS update tasks (such as in the case of an avalanche of dynamic mapping updates) we're dealing with quadratic complexity for timing out N tasks which was observed to be an issue in practice. This PR makes the complexity of timing out a task `O(1)` and generally simplifies the iteration logic of listeners and applies to be a little more efficient and inline better.	2020-09-15 05:59:48 +02:00
Julie Tibshirani	f56ce4f39b	Fix failure in InnerHitBuilderTests around 'fields' option. (#62344 ) The case InnerHitBuilderTests#testEqualsAndHashcode creates a copy of the object by serializing + deserializing it, then applies a modification. If the 'fields' list is empty, then deserializing it results in Collections.emptyList. Because this is immutable, then modifying it can throw an UnsupportedOperationException. This PR takes the same approach as for docvalue_fields, where we create a new list instead of trying to add to an empty one.	2020-09-14 15:39:03 -07:00
Julie Tibshirani	4a19bdb2ea	Support the 'fields' option in inner_hits and top_hits. (#62337 ) This PR adds support for the 'fields' option in the following places: * Anytime `inner_hits` is used, for both fetching nested/ child docs and field collapsing * The `top_hits` aggregation Addresses #61949.	2020-09-14 11:51:45 -07:00
David Turner	9acd2fd1fd	Minor cleanups to BytesReferenceStreamInput (#62302 ) Followup to #61681: - reuse the current iterator in `reset()` if possible - simply some integer-overflow-avoidance in `skip()` - clarify some comments - address some IntelliJ warnings	2020-09-14 17:02:27 +01:00
Christoph Büscher	e2eada2498	Fix disabling `allow_leading_wildcard` (#62300 ) (#62318 ) Disabling the `query_string` queries `allow_leading_wildcard` parameter didn't work after a change probably introduced in #60959 because the various field types `wildcardQuery` don't check the leading characters like QueryParserBase#getWildcardQuery does. This PR adds the missing check also before calling the field types wildcard generating method. Closes #62267	2020-09-14 17:13:17 +02:00
Alan Woodward	5358cee29c	Cut over more mapping tests to MapperServiceTestCase (#62312 ) Shaves a few more seconds off the build.	2020-09-14 16:00:37 +01:00
Armin Braun	95766da345	Save Some Allocations when Working with ClusterState (#62060 ) (#62303 ) Just a number of obvious spots where we were allocating duplicate empty structures or otherwise inefficient that I found while investigating snapshot cluster state update performance.	2020-09-14 15:09:54 +02:00
Armin Braun	875af1c976	Remove Dead Variable in BlobStoreIndexShardSnapshots. (#62285 ) (#62295 ) This was never used. Co-authored-by: Howard <danielhuang@tencent.com>	2020-09-14 13:40:39 +02:00
Luca Cavanna	53bf057a53	[TEST] avoid double null check in TransportSearchActionTests	2020-09-11 10:10:09 +02:00
Nhat Nguyen	aafb2cb812	Support point in time cross cluster search (#61827 ) This commit integrates point in time into cross cluster search. Relates #61062 Closes #61790	2020-09-10 19:25:48 -04:00
Nhat Nguyen	808c8689ac	Always include the matching node when resolving point in time (#61658 ) If shards are relocated to new nodes, then searches with a point in time will fail, although a pit keeps search contexts open. This commit solves this problem by reducing info used by SearchShardIterator and always including the matching nodes when resolving a point in time. Closes #61627	2020-09-10 19:25:48 -04:00
Nhat Nguyen	035f0638f4	Support point in time in async_search (#61560 ) This commit integrates point in time into async search and ensures that it works correctly with security enabled. Relates #61062	2020-09-10 19:25:48 -04:00
Nhat Nguyen	063a6d047c	Release search context when scroll keep_alive is too large (#62179 ) Previously, we close related search contexts if the keep_alive of a scroll is too large. But we accidentally change this behavior in #62061.	2020-09-10 19:25:48 -04:00
Nhat Nguyen	2eb1e8bc84	Make keep alive of point in time optional in search (#62184 ) A search request should not be required to extend the keep_alive of a point in time. This change makes that parameter optional.	2020-09-10 19:25:48 -04:00
Jim Ferenczi	3fc35aa76e	Shard Search Scroll failures consistency (#62061 ) Today some uncaught shard failures such as RejectedExecutionException skips the release of shard context and let subsequent scroll requests access the same shard context again. Depending on how the other shards advanced, this behavior can lead to missing data since scrolls always move forward. In order to avoid hidden data loss, this commit ensures that we always release the context of shard search scroll requests whenever a failure occurs locally. The shard search context will no longer exist in subsequent scroll requests which will lead to consistent shard failures in the responses. This change also modifies the retry tests of the reindex feature. Reindex retries scroll search request that contains a shard failure and move on whenever the failure disappears. That is not compatible with how scrolls work and can lead to missing data as explained above. That means that reindex will now report scroll failures when search rejection happen during the operation instead of skipping document silently. Finally this change removes an old TODO that was fulfilled with #61062.	2020-09-10 19:25:48 -04:00
Jim Ferenczi	4d528e91a1	Ensure validation of the reader context is executed first (#61831 ) This change makes sure that reader context is validated (`SearchOperationListener#validateReaderContext) before any other operation and that it is correctly recycled or removed at the end of the operation. This commit also fixes a race condition bug that would allocate the security reader for scrolls more than once. Relates #61446 Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co>	2020-09-10 19:25:48 -04:00
Luca Cavanna	44bd4a6004	Fix point in time toXContent impl (#62080 ) PointInTimeBuilder is a ToXContentObject yet it does not print out a whole object (it is rather a fragment). Also, when it is printed out as part of SearchSourceBuilder, an error is thrown because pit should be wrapped into its own object. This commit fixes this and adds tests for it.	2020-09-10 19:25:47 -04:00
Nhat Nguyen	3d69b5c41e	Introduce point in time APIs in x-pack basic (#61062 ) This commit introduces a new API that manages point-in-times in x-pack basic. Elasticsearch pit (point in time) is a lightweight view into the state of the data as it existed when initiated. A search request by default executes against the most recent point in time. In some cases, it is preferred to perform multiple search requests using the same point in time. For example, if refreshes happen between search_after requests, then the results of those requests might not be consistent as changes happening between searches are only visible to the more recent point in time. A point in time must be opened before being used in search requests. The `keep_alive` parameter tells Elasticsearch how long it should keep a point in time around. ``` POST /my_index/_pit?keep_alive=1m ``` The response from the above request includes a `id`, which should be passed to the `id` of the `pit` parameter of search requests. ``` POST /_search { "query": { "match" : { "title" : "elasticsearch" } }, "pit": { "id": "46ToAwMDaWR4BXV1aWQxAgZub2RlXzEAAAAAAAAAAAEBYQNpZHkFdXVpZDIrBm5vZGVfMwAAAAAAAAAAKgFjA2lkeQV1dWlkMioGbm9kZV8yAAAAAAAAAAAMAWICBXV1aWQyAAAFdXVpZDEAAQltYXRjaF9hbGw_gAAAAA==", "keep_alive": "1m" } } ``` Point-in-times are automatically closed when the `keep_alive` is elapsed. However, keeping point-in-times has a cost; hence, point-in-times should be closed as soon as they are no longer used in search requests. ``` DELETE /_pit { "id" : "46ToAwMDaWR4BXV1aWQxAgZub2RlXzEAAAAAAAAAAAEBYQNpZHkFdXVpZDIrBm5vZGVfMwAAAAAAAAAAKgFjA2lkeQV1dWlkMioGbm9kZV8yAAAAAAAAAAAMAWIBBXV1aWQyAAA=" } ``` #### Notable works in this change: - Move the search state to the coordinating node: #52741 - Allow searches with a specific reader context: #53989 - Add the ability to acquire readers in IndexShard: #54966 Relates #46523 Relates #26472 Co-authored-by: Jim Ferenczi <jimczi@apache.org>	2020-09-10 19:25:47 -04:00
Armin Braun	e0a81f7d14	Speed up Version Checks (#62216 ) (#62253 ) The `fromId` method would show up in profiling and JIT analysis as not-inlinable because it's too large in the contexts it's used in in many cases and was consuming a surprising amount of cycles for computing the min compat versions. -> extract cold path from `fromId` to make JIT happy and cache minimumg compatible versions to fields.	2020-09-10 22:57:06 +02:00
Armin Braun	25db5acb0d	Simplify TimeValue Serialization (#62023 ) (#62248 ) This can be done without map lookups => less code and much smaller methods => better inlining potentially.	2020-09-10 20:16:21 +02:00
Armin Braun	7b941a18e9	Optimize Snapshot Shard Status Update Handling (#62070 ) (#62219 ) Avoiding a number of noop updates that were observed to cause trouble (as in needless noop CS publishing) which can become an issue when working with a large number of concurrent snapshot operations. Also this sets up some simplifications made in the clone snapshot branch.	2020-09-10 16:29:16 +02:00
Ignacio Vera	c8981ea93d	upgrade to lucene-8.7.0-snapshot-b313618cc1d (#62213 ) (#62222 )	2020-09-10 16:23:18 +02:00

1 2 3 4 5 ...

5351 Commits