OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nik Everett	0a7f335215	Speed up writeVInt (backport of #62345 ) (#62419 ) This speeds up `StreamOutput#writeVInt` quite a bit which is nice because it is very commonly called when serializing aggregations. Well, when serializing anything. All "collections" serialize their size as a vint. Anyway, I was examining the serialization speeds of `StringTerms` and this saves about 30% of the write time for that. I expect it'll be useful other places.	2020-09-15 17:14:08 -04:00
Lisa Cawley	6320967546	[DOCS] Minor typo in ML API (#62414 )	2020-09-15 13:20:55 -07:00
Costin Leau	b2e85d5639	SQL: Do not resolve self-referencing aliases (#62382 ) Prevent the analyzer for trying to resolve aliases on expressions that reference themselves (or fields within themselves) as that causes infinite recursion. Fix #62296 (cherry picked from commit 021d27815b03e92e02859bc9c0c8eec78f30c72e)	2020-09-15 20:53:28 +03:00
Armin Braun	9ac4ee9c44	Increase Flaky Timeout in testIlmHistoryIndexCanRollover (#62353 ) (#62402 ) This busy assert easily takes about 5s on a very fast work station so the default of 10s is not sufficient here at all.	2020-09-15 19:50:45 +02:00
Nik Everett	771a8893a6	Add more debugging information for cardinality agg (#62317 ) (#62397 ) This adds two extra bits of info to the profiler: 1. Count of the number of different types of collectors. This lets us figure out if we're using the optimization for segment ordinals. It adds a few more similar counters just for good measure. 2. Profiles the `getLeafCollector` and `postCollection` methods. These are non-trivial for some aggregations, like cardinality.	2020-09-15 13:21:11 -04:00
James Rodewig	2a7de79a1b	[DOCS] Remove redundant index.blocks.read_only_allow_delete setting (#62392 ) (#62399 )	2020-09-15 12:43:48 -04:00
William Brafford	af64e46065	Add logstash system index APIs (#53350 ) (#62347 ) We want Logstash indices to be system indices, but the logstash service will still need to be able to manage its indices. This PR adds special system index APIs to the logstash plugin so that logstash can manage its pipelines without direct access to the underlying indices. * Add logstash module with dedicated logstash APIs * merge with x-pack plugin * add system index access allowance * Break out serialization tests into distinct classes * Log failures for partial multiget failure * Move LogstashSystemIndexIT to javaRestTest task Co-authored-by: William Brafford <william.brafford@elastic.co> Co-authored-by: Jay Modi <jaymode@users.noreply.github.com>	2020-09-15 12:42:14 -04:00
Armin Braun	ffbc64bd10	Log WARN on Response Deserialization Failure (#62368 ) (#62388 ) We never see this exception in the logs even though it's pretty severe. All we might see is an exception about a transport message not having been read fully from the logic that follows this code. Technically we should probably bubble up the exception but that's a bigger change and needs some carefully reasoning, this change for the time being at least simplifies tracking down deserialization issues in responses.	2020-09-15 18:27:39 +02:00
Armin Braun	98f525f8a7	Faster Azure Blob InputStream (#61812 ) (#62387 ) Building our own that should perform better than the one in the SDK. Also, as a result saving a HEAD call for each ranged read on Azure.	2020-09-15 18:27:22 +02:00
Adam Locke	5dc0de04fb	[DOCS] Clarifying remote clusters based on feedback from Support (#62335 ) (#62394 ) * Clarifying remote clusters based on feedback from Support. * Apply suggestions from code review * Making additional editorial changes.	2020-09-15 12:02:43 -04:00
Adrien Grand	6db8afefc2	Upgrade to lucene-8.7.0-snapshot-cdfdc1e0851. (#62376 ) Upgrade to a new Lucene snapshot that (at least partially) addresses the indexing rate regression when index sorting is enabled. Backport of #62334.	2020-09-15 17:48:07 +02:00
Alan Woodward	f89fa421e2	Remove unnecessary IndexSearcher field on HitContext (#62378 ) FastVectorHighlighter uses the top-level reader to rewrite queries against, which it gets via an IndexSearcher field on HitContext. However, we can already access this top-level reader via HitContext's existing LeafReaderContext field. This commit removes the unnecessary field and constructor parameter, and changes the implementation of topLevelReader to go via ReaderUtils and the leaf reader context.	2020-09-15 15:46:14 +01:00
Fernando Briano	7dd073c243	Wraps timestamp values in quotes in runtime fields YAML tests. (#62155 )	2020-09-15 15:24:57 +01:00
Christoph Büscher	0ca9829867	Muting CoordinatorTests#testLogsMessagesIfPublicationDelayed	2020-09-15 15:40:51 +02:00
Albert Zaharovits	aeed1c05b0	Ensure authz operation overrides transient authz headers (#61621 ) AuthorizationService#authorize uses the thread context to carry the result of the authorisation as transient headers. The listener argument to the `authorize` method must necessarily observe the header values. This PR makes it so that the authorisation transient headers (`_indices_permissions` and `_authz_info`, but NOT `_originating_action_name`) of the child action override the ones of the parent action. Co-authored-by: Tim Vernum tim@adjective.org	2020-09-15 16:37:38 +03:00
Armin Braun	76f56c1264	Add Missing NamedWritable Registration for ExecuteEnrichPolicyStatus (#62364 ) (#62374 ) This was missing and caused nodes to drop out of the cluster on serialization failures when ever one tried to get an enrich policy task by name. The test in here is a little dirty but I figured it would be nice to have an actual reproducer for the issue and I couldn't find any infrastructure to nicely time the tasks so I put this on top of existing test infra.	2020-09-15 15:24:15 +02:00
István Zoltán Szabó	d0c7b0a3a8	[DOCS] Removes init_script line from example Painless aggregation. (#62367 ) (#62379 )	2020-09-15 15:13:23 +02:00
Costin Leau	03d2395183	EQL: Use Point In Time inside sequences (#62276 ) Use the newly introduced PIT API to have a consistent view of the data while doing sequence matching, which involves multiple calls, aka repeatable reads and thus avoid race conditions or any in-flight updates on the data. (cherry picked from commit daa72fc3c71fd36afb55278021ff6bbc591ef148)	2020-09-15 15:40:03 +03:00
Tanguy Leroux	faf96c175e	Abort non-fully consumed S3 input stream (#62167 ) (#62370 ) Today when an S3RetryingInputStream is closed the remaining bytes that were not consumed are drained right before closing the underlying stream. In some contexts it might be more efficient to not consume the remaining bytes and just drop the connection. This is for example the case with snapshot backed indices prewarming, where there is not point in reading potentially large blobs if we know the cache file we want to write the content of the blob as already been evicted. Draining all bytes here takes a slot in the prewarming thread pool for nothing.	2020-09-15 14:33:37 +02:00
Jim Ferenczi	4eea602d2d	Add a snapshot test module to delay shard aggregations (#62082 ) (#62359 ) This change adds an aggregation that can be used to delay the query phase execution on shards with a configurable time: { "aggs": { "delay": { "shard_delay": { "value": "30s" }, "aggs": { "host": { "terms": { "field": "hostname" } } } } } } This test module is built on top of #61954 so the aggregation will be available only within snapshots since this module is not meant to be used in production. Closes #54159	2020-09-15 13:52:38 +02:00
Martijn van Groningen	3ed60df59d	Re-enable resolve index multi cluster test (#62365 ) Backport of #62361 to 7.x branch. This test was fine and shouldn't have been muted. The test case class should have preserved data streams as part of #62205 Closes #62210	2020-09-15 13:26:52 +02:00
David Kyle	717259a049	Revert "[ML] Add debug logging of notification messages to upgrade test (#62342 )" This reverts commit `c50899dd8f`.	2020-09-15 11:07:23 +01:00
Armin Braun	eae6a3b18e	Fix testMappingVersionAfterDynamicMappingUpdate (#62352 ) (#62360 ) There is a race in this test where the index request will return once the dynamic mapping update has been observed by the cluster state observer internally used by the indexing but not hit all state appliers and thus isn't showing up as the applied state returned by `clusterService.state()` yet.	2020-09-15 11:59:22 +02:00
Alan Woodward	a68f7077c7	Rationalise fetch phase exceptions (#62230 ) We have a special FetchPhaseExecutionException which contains some useful information about which shard and doc a fetch phase has failed in. However, this is not used in many places - currently only the ExplainPhase and the highlighters throw one, and the FetchPhase itself catches IOExceptions and just passes them to the ExceptionsHelper with no extra context. This commit changes FetchPhase to throw FetchPhaseExecutionException if it encounters problems in any of its subphases, and removes the special handling from the explain and highlight phases. It also removes the need to pass shard ids around when building HitContext objects.	2020-09-15 09:28:19 +01:00
Rory Hunter	84ac72dced	Fix log4j config for :qa:os tests (#62234 ) The log4j config in :qa:os was broken because it referenced an appender plugin that is not on that project's classpath. Resolve this by adding a dedicated logging config and removing the copy step.	2020-09-15 09:17:19 +01:00
Alan Woodward	8089210815	Some small cleanups in TermVectorsService (#62292 ) We removed the use of aggregated stats from term vectors back in #16452, but there is a bunch of dead code left here which can be stripped out.	2020-09-15 09:01:49 +01:00
David Kyle	c50899dd8f	[ML] Add debug logging of notification messages to upgrade test (#62342 ) For #61908	2020-09-15 08:24:13 +01:00
Ignacio Vera	3536f7f7c2	Initialize BitArray storage as number of bits (#62327 ) (#62354 )	2020-09-15 08:34:22 +02:00
Armin Braun	c81a076f5a	Improve Efficiency of ClusterApplierService Iteration (#62282 ) (#62350 ) The complexity of removing a timeout listener was `O(n)` which means that in case of many queued up CS update tasks (such as in the case of an avalanche of dynamic mapping updates) we're dealing with quadratic complexity for timing out N tasks which was observed to be an issue in practice. This PR makes the complexity of timing out a task `O(1)` and generally simplifies the iteration logic of listeners and applies to be a little more efficient and inline better.	2020-09-15 05:59:48 +02:00
Lee Hinman	6b2af30a62	[7.x] Add "synthetics--" templates for synthetics fleet data (#62193 ) (#62346 ) * Add "synthetics--" templates for synthetics fleet data For the Elastic Agent we currently have `logs` and `metrics`, however, synthetic data doesn't belong with those and thus we should have a place for it to live. This would be data reported from heartbeat and under the 'monitoring' category. This commit adds a composable index template for `synthetics--` indices similar to the work in #56709 and #57629. Resolves #61665	2020-09-14 17:14:34 -06:00
Julie Tibshirani	f56ce4f39b	Fix failure in InnerHitBuilderTests around 'fields' option. (#62344 ) The case InnerHitBuilderTests#testEqualsAndHashcode creates a copy of the object by serializing + deserializing it, then applies a modification. If the 'fields' list is empty, then deserializing it results in Collections.emptyList. Because this is immutable, then modifying it can throw an UnsupportedOperationException. This PR takes the same approach as for docvalue_fields, where we create a new list instead of trying to add to an empty one.	2020-09-14 15:39:03 -07:00
Julie Tibshirani	9332a9c74b	Add the fields option to the search API docs. (#62260 )	2020-09-14 13:44:44 -07:00
Julie Tibshirani	4a19bdb2ea	Support the 'fields' option in inner_hits and top_hits. (#62337 ) This PR adds support for the 'fields' option in the following places: * Anytime `inner_hits` is used, for both fetching nested/ child docs and field collapsing * The `top_hits` aggregation Addresses #61949.	2020-09-14 11:51:45 -07:00
David Roberts	3d5c13f559	[ML] Add an assertion on annotations mappings to upgrade test (#62331 ) The annotations index is not covered by the comparison between mappings and templates, as it does not use an index template. This commit adds an assertion on annotations index mappings that will fail if the mappings are not upgraded as expected. Backport of #62325	2020-09-14 18:46:35 +01:00
James Rodewig	ec335c7c34	[DOCS] Fix capitalization for several headings (#62324 ) (#62329 )	2020-09-14 12:35:15 -04:00
David Turner	9acd2fd1fd	Minor cleanups to BytesReferenceStreamInput (#62302 ) Followup to #61681: - reuse the current iterator in `reset()` if possible - simply some integer-overflow-avoidance in `skip()` - clarify some comments - address some IntelliJ warnings	2020-09-14 17:02:27 +01:00
David Roberts	e4275f3749	[ML] Use utility thread pool for memory estimation (#62314 ) The job comms thread pool is intended for the long-running job processes that do anomaly detection or data frame analytics and count towards job count and memory limits. This commit moves the short-lived memory estimation processes to the ML utility thread pool. Although this doesn't matter in most cases, at the limits of scale it could mean that memory estimations would get in the way of starting jobs, or would queue up for an excessive period of time while waiting for jobs to finish.	2020-09-14 16:47:12 +01:00
Lee Hinman	bf9651c635	[7.x] Add "content" tier as new "data_content" role (#62247 ) (#62322 ) Similar to the work in #60994 where we introduced the `data_hot`, `data_warm`, etc node roles. This introduces a new `data_content` node role to be used for the Content tier. Currently this tier is not used anywhere, but subsequent work will use this tier. Relates to #60848	2020-09-14 09:42:57 -06:00
Benjamin Trent	13c193a9fc	[Enrich] add logging for when there are search/bulk failures on _execute (#62313 ) (#62320 ) When calling `_execute` there is a chance that there will be bulk indexing failures or search failures. These will result in the call failing overall. But, no information is provided for troubleshooting the failure. This commit adds logging to indicate the number of failures, and new debug level logging so that failure details can be determined if necessary. closes https://github.com/elastic/elasticsearch/issues/60491	2020-09-14 11:20:13 -04:00
Christoph Büscher	e2eada2498	Fix disabling `allow_leading_wildcard` (#62300 ) (#62318 ) Disabling the `query_string` queries `allow_leading_wildcard` parameter didn't work after a change probably introduced in #60959 because the various field types `wildcardQuery` don't check the leading characters like QueryParserBase#getWildcardQuery does. This PR adds the missing check also before calling the field types wildcard generating method. Closes #62267	2020-09-14 17:13:17 +02:00
Alan Woodward	5358cee29c	Cut over more mapping tests to MapperServiceTestCase (#62312 ) Shaves a few more seconds off the build.	2020-09-14 16:00:37 +01:00
James Rodewig	f4dfdc9d59	[DOCS] Fix typo in rollup groups docs (#62269 ) (#62316 ) Co-authored-by: AndyHunt66 <andrew.hunt@elastic.co>	2020-09-14 10:42:58 -04:00
Varun Sharma	65ec94f8a3	[DOCS] Fix node roles typo (#62307 ) (#62306 )	2020-09-14 10:17:30 -04:00
James Rodewig	3ab28e84c6	[DOCS] EQL: Update keyword family field types (#62254 ) (#62310 ) Updates several keyword/constant keyword references to use any field type in the keyword family.	2020-09-14 09:51:34 -04:00
James Rodewig	af13c9802d	[7.x] [DOCS] Add PIT to search after docs (#61593 ) (#62101 )	2020-09-14 09:13:23 -04:00
Armin Braun	95766da345	Save Some Allocations when Working with ClusterState (#62060 ) (#62303 ) Just a number of obvious spots where we were allocating duplicate empty structures or otherwise inefficient that I found while investigating snapshot cluster state update performance.	2020-09-14 15:09:54 +02:00
Tanguy Leroux	9e38dd0254	Deprecate Repository Stats API (#62297 ) (#62308 ) This commit deprecates the Repository Stats API added in 7.8.0 as an experimental API behind a feature flag. The goal is to deprecate this API in 7.10.0 and remove it in a follow up PR in 8.0.0. This API is now superseded by the Repositories Metering API.	2020-09-14 14:57:38 +02:00
Armin Braun	875af1c976	Remove Dead Variable in BlobStoreIndexShardSnapshots. (#62285 ) (#62295 ) This was never used. Co-authored-by: Howard <danielhuang@tencent.com>	2020-09-14 13:40:39 +02:00
David Roberts	d8288526d9	[ML] Add null checks for C++ log handler (#62238 ) It has been observed that if the normalizer process fails to connect to the JVM then this causes a null pointer exception as the JVM tries to close the native process object. The accessors and close methods of the native process class that access the C++ log handler should not assume that it connected correctly.	2020-09-14 11:28:26 +01:00
Martijn van Groningen	c88f4174ec	Fix resolve index data streams yaml test. (#62221 ) Closes #62190	2020-09-14 08:43:58 +02:00

1 2 3 4 5 ...

53661 Commits All Branches Search

53661 Commits

All Branches