OpenSearch

Commit Graph

Author	SHA1	Message	Date
Armin Braun	9be36865ef	Speed up XContent Collection Parsing (#61442 ) (#61617 ) 1. Get rid of the capturing lambda on the hot path that inlines very badly 2. Remove as many bounds checks as possible, thereby reducing method size and improving inlining	2020-08-27 14:15:46 +02:00
Rene Groeschke	ae90ae343a	Fixing jvm options for testcluster for older versions (#61620 ) * Fixing jvm options for testcluster for older versions	2020-08-27 14:06:14 +02:00
Rene Groeschke	c4a0c603f0	Fix normalizing build output for windows (#61622 ) (#61625 ) - handle windows paths better Fixes #61612	2020-08-27 14:05:22 +02:00
David Kyle	49a5afc6c1	[ML] Increase wait for templates timeout in tests (#61623 ) (#61628 )	2020-08-27 12:57:12 +01:00
David Turner	411965d392	Allow background cluster state update in tests (#61455 ) Today the `CoordinatorTests` run the publication process as a single atomic action; however in production it appears possible that another master may be elected, publish its state, then fail, then we win another election, all in between the time we sampled our previous cluster state and started to publish the one we first thought of. This violates the `assertClusterStateConsistency()` assertion that verifies the cluster state update event matches the states we actually published and applied. This commit adjusts the tests to run the publication process more asynchronously so as to allow time for this behaviour to occur. This should eventually result in a reproduction of the failure in #61437 that will let us analyse what's really going on there and help us fix it.	2020-08-27 11:22:58 +01:00
David Kyle	25e811ced7	Rewrite Inference yml tests for better clean up (#61180 ) (#61555 ) Inference processors asynchronously usage write stats to the .ml-stats index after they used. In tests the write can leak into the next test causing failures depending on which test follows. This change waits for the usage stats docs to be written at the end of the test	2020-08-27 11:16:26 +01:00
David Turner	f6055dc9b2	Suppress noisy SSL exceptions (#61359 ) If a TLS-protected connection closes unexpectedly then today we often emit a `WARN` log, typically one of the following: io.netty.handler.codec.DecoderException: javax.net.ssl.SSLHandshakeException: Insufficient buffer remaining for AEAD cipher fragment (2). Needs to be more than tag size (16) io.netty.handler.codec.DecoderException: javax.net.ssl.SSLException: Received close_notify during handshake We typically only report unexpectedly-closed connections at `DEBUG` level, but these two messages don't follow that rule and generate a lot of noise as a result. This commit adjusts the logging to report these two exceptions at `DEBUG` level only.	2020-08-27 10:59:39 +01:00
David Turner	b866aaf81c	Use int for number of parts in blob store (#61618 ) Today we use `long` to represent the number of parts of a blob. There's no need for this extra range, it forces us to do some casting elsewhere, and indeed when snapshotting we iterate over the parts using an `int` which would be an infinite loop in case of overflow anyway: for (int i = 0; i < fileInfo.numberOfParts(); i++) { This commit changes the representation of the number of parts of a blob to an `int`.	2020-08-27 10:54:03 +01:00
Ioannis Kakavas	aac9eb6b64	Kerberos doc kibana link (#61466 ) (#61619 ) Add a note in Kerberos documentation that Kibana requires a configuration change too, and link to that documentation page.	2020-08-27 12:42:52 +03:00
Ioannis Kakavas	3640ff1ff2	Add SAML AuthN request signing tests (#61582 ) - Add a unit test for our signing code - Change SAML IT to use signed authentication requests for Shibboleth to consume Backport of #48444	2020-08-27 10:41:56 +03:00
David Turner	5df74cc888	Replace Math.toIntExact with toIntBytes (#61604 ) We convert longs to ints using `Math.toIntExact` in places where we're sure there will be no overflow, but this doesn't explain the intent of these conversions very well. This commit introduces a dedicated method for these conversions, and adds an assertion that we never overflow.	2020-08-27 08:28:54 +01:00
David Turner	e14d9c9514	Introduce cache index for searchable snapshots (#61595 ) If a searchable snapshot shard fails (e.g. its node leaves the cluster) we want to be able to start it up again on a different node as quickly as possible to avoid unnecessarily blocking or failing searches. It isn't feasible to fully restore such shards in an acceptably short time. In particular we would like to be able to deal with the `can_match` phase of a search ASAP so that we can skip unnecessary waiting on shards that may still be warming up but which are not required for the search. This commit solves this problem by introducing a system index that holds much of the data required to start a shard. Today() this means it holds the contents of every file with size <8kB, and the first 4kB of every other file in the shard. This system index acts as a second-level cache, behind the first-level node-local disk cache but in front of the blob store itself. Reading chunks from the index is slower than reading them directly from disk, but faster than reading them from the blob store, and is also replicated and accessible to all nodes in the cluster. () the exact heuristics for what we should put into the system index are still under investigation and may change in future. This second-level cache is populated when we attempt to read a chunk which is missing from both levels of cache and must therefore be read from the blob store. We also introduce `SearchableSnapshotsBlobStoreCacheIntegTests` which verify that we do not hit the blob store more than necessary when starting up a shard that we've seen before, whether due to a node restart or because a snapshot was mounted multiple times. Backport of #60522 Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2020-08-27 06:38:32 +01:00
Ryan Ernst	e60c74240a	Add base precommit task to all java projects (#61439 ) This commit adds java compilation to the base precommit task, and adds that to the java plugin. This further reduces dependence on the build plugin.	2020-08-26 17:21:00 -07:00
Lisa Cawley	6d6f5d4acc	[DOCS] Per-partition categorization (#61506 )	2020-08-26 17:10:01 -07:00
James Rodewig	580ef8eb0c	[DOCS] Document static field cache settings (#61424 ) (#61606 )	2020-08-26 17:29:15 -04:00
Jason Tedor	9840fd1485	Add Lucene 8.6.0 memory leak as a known issue (#61603 ) This commit adds a note to the known issues docs that Lucene 8.6.0 contains a memory leak that manifests in Elasticsearch as a slow memory leak.	2020-08-26 15:45:14 -04:00
James Rodewig	462754e4e6	[DOCS] Reorg field data types page (#61117 ) (#61599 )	2020-08-26 14:24:09 -04:00
James Rodewig	8a6ecd5bfc	[DOCS] Fix EQL syntax admon	2020-08-26 13:39:42 -04:00
James Rodewig	20053bfd8c	[DOCS] Remove dupe EQl fn/pipe TOC	2020-08-26 12:45:09 -04:00
Jay Modi	34c4fc3b91	Remove tasks module to define tasks system index (#61588 ) This commit removes the tasks module that only existed to define the tasks result index, `.tasks`, as a system index. The definition for the tasks results system index descriptor is moved to the `SystemIndices` class with a check that no other plugin or module attempts to define an entry with the same source. Additionally, this change also makes the pattern for the tasks result index a wildcard pattern since we will need this when the index is upgraded (reindex to new name and then alias that to .tasks). Backport of #61540	2020-08-26 09:48:23 -06:00
David Turner	f2dc664228	Remove dead code in EsExecutors (#61574 ) Removes a couple of unused methods.	2020-08-26 16:08:36 +01:00
Dimitris Athanasiou	3ed65eb418	[7.x][ML] Recover data frame extraction search from latest sort key (#61544 ) (#61572 ) If a search failure occurs during data frame extraction we catch the error and retry once. However, we retry another search that is identical to the first one. This means we will re-fetch any docs that were already processed. This may result either to training a model using duplicate data or in the case of outlier detection to an error message that the process received more records than it expected. This commit fixes this issue by tracking the latest doc's sort key and then using that in a range query in case we restart the search due to a failure. Backport of #61544 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-08-26 17:54:00 +03:00
Benjamin Trent	a6e7a3d65f	[7.x] [ML] write warning if configured memory limit is too low for analytics job (#61505 ) (#61528 ) Backports the following commits to 7.x: [ML] write warning if configured memory limit is too low for analytics job (#61505) Having `_start` fail when the configured memory limit is too low can be frustrating. We should instead warn the user that their job might not run properly if their configured limit is too low. It might be that our estimate is too high, and their configured limit works just fine.	2020-08-26 10:35:38 -04:00
Przemyslaw Gomulka	9f566644af	Do not create two loggers for DeprecationLogger backport(#58435 ) (#61530 ) DeprecationLogger's constructor should not create two loggers. It was taking parent logger instance, changing its name with a .deprecation prefix and creating a new logger. Most of the time parent logger was not needed. It was causing Log4j to unnecessarily cache the unused parent logger instance. depends on #61515 backports #58435	2020-08-26 16:04:02 +02:00
Rene Groeschke	3a8cfdc1f5	Extract distribution archive checks into plugin (7.x backport) (#61567 ) - Added test coverage - Removes build script cluttering - Splits archive building and archive checking logic - only rely on boost for now for ML licenses(tbd) - Use Gradle build-in untar and unzip support * Handle dynamic versions in func tests assertions	2020-08-26 15:04:12 +02:00
Rene Groeschke	fac66a7528	Rework test cluster distribution handling (#61407 ) (#61566 ) Driven by this issue https://github.com/elastic/elasticsearch/pull/60969#issuecomment-674962158 we apply some rework on how we handle distributions in our test cluster setups: - If no custom modules, plugins or extra jar files are declared we do not create a cluster specific distro folder and use the origin distribution folder instead. - If a custom distribution folder is required, we fallback to file copy when hard linking is not supported	2020-08-26 15:03:52 +02:00
Ioannis Kakavas	283eaabc71	[7.x] Refactor SamlAuthenticationIT (#57162 ) (#61568 ) Refactor the tests to not require a mock HTTP Server. This has been the cause of flakiness and removing it doesn't affect the logical coverage of this suite. The "fake UI" is now simulated by an http client that makes the necessary requests to Elasticsearch APIs.	2020-08-26 15:34:56 +03:00
James Rodewig	4701832879	[DOCS] Add 7.9 breaking change for built-in templates (#61549 ) (#61558 )	2020-08-26 08:10:59 -04:00
Przemysław Witek	11c2710e7f	[7.x] [ML] Do not mark the DFA job as FAILED when a failure occurs after the node is shutdown (#61331 ) (#61526 )	2020-08-26 09:53:13 +02:00
lcawl	5fa839b906	[DOCS] Fix typo in update anomaly detection job API	2020-08-25 17:13:38 -07:00
Igor Motov	f70a59971a	[7.x] Add rate aggregation (#61369 ) (#61554 ) Adds a new rate aggregation that can calculate a document rate for buckets of a date_histogram. Closes #60674	2020-08-25 17:39:00 -04:00
debadair	82585107aa	updated shard limit doc (#56496 ) (#61509 ) * updated shard limit doc As the documentation was not so clear. I have updated saying this limit includes open indices with unassigned primaries and replicas count towards the limit. * [DOCS] Incorporated edits. Co-authored-by: Deb Adair <debadair@elastic.co> Co-authored-by: gadekishore <50092970+gadekishore@users.noreply.github.com>	2020-08-25 14:24:47 -07:00
James Rodewig	e0843571c4	[DOCS] Fix typo in search your data docs	2020-08-25 17:01:08 -04:00
Nik Everett	87cf81e179	Migrate some more mapper test cases (#61507 ) (#61552 ) Migrate some more mapper test cases from `ESSingleNodeTestCase` to `MapperTestCase`.	2020-08-25 15:27:26 -04:00
markharwood	8b56441d2b	Search - add case insensitive support for regex queries. (#59441 ) (#61532 ) Backport to add case insensitive support for regex queries. Forks a copy of Lucene’s RegexpQuery and RegExp from Lucene master. This can be removed when 8.7 Lucene is released. Closes #59235	2020-08-25 17:18:59 +01:00
James Rodewig	e3d23c34ab	[DOCS] Document static HTTP settings (#61429 ) (#61536 )	2020-08-25 11:27:05 -04:00
James Rodewig	5ad0ce49e1	[DOCS] Remove response params for #61428 (#61524 ) (#61534 )	2020-08-25 11:17:56 -04:00
Brandon Morelli	fade7408cd	[DOCS] Fix link to quartz crontrigger tutorial (#61531 )	2020-08-25 10:49:00 -04:00
Przemyslaw Gomulka	f3f7d25316	Header warning logging refactoring backport(#55941 ) (#61515 ) Splitting DeprecationLogger into two. HeaderWarningLogger - responsible for adding a response warning headers and ThrottlingLogger - responsible for limiting the duplicated log entries for the same key (previously deprecateAndMaybeLog). Introducing A ThrottlingAndHeaderWarningLogger which is a base for other common logging usages where both response warning header and logging throttling was needed. relates #55699 relates #52369 backports #55941	2020-08-25 16:35:54 +02:00
Costin Leau	bff3c7470e	EQL: Replace SearchHit in response with Event (#61428 ) (#61522 ) The building block of the eql response is currently the SearchHit. This is a problem since it is tied to an actual search, and thus has scoring, highlighting, shard information and a lot of other things that are not relevant for EQL. This becomes a problem when doing sequence queries since the response is not generated from one search query and thus there are no SearchHits to speak of. Emulating one is not just conceptually incorrect but also problematic since most of the data is missed or made-up. As such this PR introduces a simple class, Event, that maps nicely to the terminology while hiding the ES internals (the use of SearchHit or GetResult/GetResponse depending on the API used). Fix #59764 Fix #59779 Co-authored-by: Igor Motov <igor@motovs.org> (cherry picked from commit 997376fbe6ef2894038968842f5e0635731ede65)	2020-08-25 17:32:42 +03:00
Armin Braun	f22ddf822e	Some Optimizations around BytesArray (#61183 ) (#61511 ) * Faster `equals` for `BytesArray` which is nice since with this change we use it for the search cache * Lighter `StreamInput` for `BytesArray` that should save memory and some indirection relative to the one on the abstract bytes reference * Lighter `writeTo` implementation * Build a `BytesArray` instead of a PagedBytesReference whenever possible to save indirection and memory	2020-08-25 07:13:39 +02:00
Armin Braun	806dfcfcf7	Speed up Compression Logic by Pooling Resources (#61358 ) (#61495 ) This is mostly motivated by the performance issues we are seeing around the GET mappings REST API which (in case of a large number of indices) will create decompressing streams in a hot loop which takes a significant amount of time for the system calls involved in instantiating deflaters and inflaters. Also, this fixes a leaked deflater when deserializing cached repository data.	2020-08-25 04:01:55 +02:00
Armin Braun	16b932c1dc	Remove Potentially Expensive Use of BytesReference.toBytesRef (#61415 ) (#61503 ) This method might have materialize all the bytes in a reference into a fresh `byte[]`. Using the stream is much safer and only trivially more expensive + in most cases we now run the fast path via `BytesArray` anyway.	2020-08-24 23:58:21 +02:00
David Kyle	539cf914bc	[ML] handle new model metadata stream from native process (#59725 ) (#61251 ) This adds the serialization handling for the new model_metadata object from the native process. Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2020-08-24 15:52:13 -04:00
James Rodewig	2400098a52	[DOCS] Fix typo in profile API docs (#61445 ) (#61501 ) Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com> Co-authored-by: shashikumarec088 <shashikumarec088@gmail.com>	2020-08-24 15:30:18 -04:00
Nhat Nguyen	baa685c2d9	Fix anchor doc for msearch cancellation paragraph Relates #61418	2020-08-24 15:14:17 -04:00
Nhat Nguyen	f34d3efae7	Add cancellation doc for multi search (#61418 ) Relates #61337	2020-08-24 15:14:05 -04:00
Nhat Nguyen	d47bbbafe0	Cancel multisearch when http connection closed (#61399 ) Relates #61337	2020-08-24 15:12:54 -04:00
Nhat Nguyen	23a0f8b617	Detect and optimize noop of update index settings (#61348 ) This optimization is more relevant in the context of CCR. When a node in the follower cluster leaves, we reallocate the shard-follow tasks on that node to other nodes. The new tasks will overwhelm the follower cluster with many put-mapping, update-settings requests, although most of them are noop. This change detects and optimizes the noop update-settings requests.	2020-08-24 15:08:53 -04:00
James Rodewig	439fa46735	[DOCS] Remove collapsible sections in EQL fn docs (#61498 ) (#61499 )	2020-08-24 14:41:27 -04:00

1 2 3 4 5 ...

53382 Commits All Branches Search

53382 Commits

All Branches