OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	d091c12e0c	[7.x] Generalize AsyncTwoPhaseIndexer first phase (#61739 ) (#62482 ) Current implementations of the indexer are using aggregations. Thus each search step executes a search action. However, we can generalize that to allow for any action that returns a `SearchResponse`. This commit abstracts the search phase from the search action. Backport of #61739	2020-09-17 11:57:22 +03:00
Adrien Grand	e0a4a94985	Speed up merging when source is disabled. (#62443 ) (#62474 ) The CodecReader wrapper we use to remove the `_recovery_source` field doesn't override `StoredFieldsreader#getMergeInstance`, which has the undesired side-effect of preventing the wrapped stored fields reader from optimizing merging.	2020-09-17 10:53:31 +02:00
David Turner	62dcc5b1ae	Suppress stack in VersionConflictEngineException (#62433 ) `VersionConflictEngineException` is thrown on the hot path for updates, but stack traces are expensive to compute and transport and rarely useful for this kind of exception. This commit avoids computing the stack trace for these exceptions.	2020-09-17 09:40:07 +01:00
Adrien Grand	9a8225bbc1	Upgrade to lucene-8.7.0-snapshot-9cd3af50f80. (#62450 ) (#62476 ) This new snapshot contains the following JIRAs that we're interested in: - [LUCENE-9525](https://issues.apache.org/jira/browse/LUCENE-9525) Better handling of small documents. This should improve retrieval times when documents are less than ~1kB. - [LUCENE-9510](https://issues.apache.org/jira/browse/LUCENE-9510) Faster flushes when index sorting is enabled by not compressing the temporary files that store stored fields and term vectors.	2020-09-17 10:28:20 +02:00
Armin Braun	5112c17319	Add WARN Logging on Slow Transport Message Handling (#62444 ) (#62521 ) Add simple WARN logging on slow inbound TCP messages.	2020-09-17 10:12:20 +02:00
David Turner	14aec44cd8	Log if recovery affected by disconnect (#62437 ) Today we only emit `DEBUG` logs if the source disconnects from the target during a recovery. This deserves to be noisier by default since it should be rare and may help users identify other problems with their network or with their shard movements. This commit promotes this message to `INFO`. There's no need for `WARN` since these days we will normally resume the recovery where it left off.	2020-09-17 08:22:40 +01:00
Luca Cavanna	26388fe22e	Runtime fields: rename fielddata and mapped field type classes (#62483 ) With this commit we rename all of the fielddata, doc_values and mapped field type classes for runtime fields to not start with the Script prefix but rather their runtime type (e.g. Boolean) and only then Script	2020-09-17 09:14:30 +02:00
Ignacio Vera	2d3ca9c155	Introduce a sparse HyperLogLogPlusPlus class for cloning and serializing low cardinality buckets (#62480 ) (#62520 ) Reduces the memory footprint of an HLL++ structure that uses Linear counting when cloning or deserialising the data structure.	2020-09-17 08:54:50 +02:00
James Rodewig	2fd28d0944	[DOCS] Fix broken link to EC hot warm profile (#62514 ) (#62516 )	2020-09-16 22:21:26 -04:00
James Rodewig	fd26976423	[DOCS] Fix outdated cross doc link (#62504 ) (#62506 ) Co-authored-by: Kelly Murphy <kelly.murphy@elastic.co>	2020-09-16 17:52:21 -04:00
Julie Tibshirani	e1da558206	Remove unused test search context for significant_terms.	2020-09-16 14:27:11 -07:00
Costin Leau	ceaf96061c	EQL: Fetch sequence documents using Point-In-Time (#62469 ) To preserve the PIT semantics, the retrieval of results has moved from using multi-get to using an idsQuery. (cherry picked from commit 1c2362fcf2be62ce568b3772924abce7331ef23c)	2020-09-17 00:12:19 +03:00
Jay Modi	5da922064f	LocalNodeMasterListener is a regular listener (#62485 ) This commit makes the LocalNodeMasterListener interface extend the ClusterStateListener interface and use a default implementation for detecting whether the local node master status changed. Backport of #62422	2020-09-16 11:42:53 -06:00
Jack Conradson	6fac8478ef	Backport all Painless compiler extensibility changes (#61755 ) This backport incorporates all the changes to improve compiler extensibility. The reason for this backport is the changes are now required to support runtime fields.	2020-09-16 09:32:00 -07:00
Luca Cavanna	1e352fdb7f	Runtime fields: rename script classes (#62448 ) With this commit we rename the script classes used for each mapped field type used for runtime fields. The new naming is a shorter version of the previous one: from e.g. BooleanScriptFieldScrip to BooleanScript . We also move such classes to the existing mapper package.	2020-09-16 18:00:06 +02:00
James Rodewig	f347f0207f	[DOCS] EQL: Use consistent string notation (#62472 ) (#62477 )	2020-09-16 11:43:37 -04:00
Tanguy Leroux	8a2e9e66d4	Wait for relocations and disk threshold monitor in DiskThresholdDeciderIT (#62358 ) (#62467 ) Closes #62326	2020-09-16 17:40:20 +02:00
James Rodewig	e92b237dd5	[DOCS] EQL: Clarify wildcard operator	2020-09-16 11:05:29 -04:00
James Rodewig	ed072404ff	[DOCS] EQL: Make operator refs consistent	2020-09-16 11:03:48 -04:00
James Rodewig	65bb679c56	[DOCS] EQL: Move comparison operator defs	2020-09-16 10:54:31 -04:00
James Rodewig	9b10d0b3af	[DOCS] EQL: Add xrefs to EQL intro	2020-09-16 10:44:01 -04:00
James Rodewig	6394629b99	[DOCS] Document `toJSON` function for role query (#62257 ) (#62462 )	2020-09-16 10:38:56 -04:00
James Rodewig	61ea9af25b	[DOCS] Document static/dynamic security settings (#62181 ) (#62460 )	2020-09-16 09:50:58 -04:00
James Rodewig	484e74ccaa	[DOCS] Update range field type docs (#62112 ) (#62454 ) Co-authored-by: Wylie Conlon <william.conlon@elastic.co>	2020-09-16 09:33:04 -04:00
Christoph Büscher	f8634e5bea	Muting SimpleSecurityNetty4ServerTransportTests	2020-09-16 15:14:08 +02:00
Benjamin Trent	341eeae6e7	[ML] fixes testWatchdog test verifying matcher is interrupted on timeout (#62391 ) (#62447 ) Constructing the timout checker FIRST and THEN registering the watcher allows the test to have a race condition. The timeout value could be reached BEFORE the matcher is added. To prevent the matcher never being interrupted, a new timedOut value is added to the watcher thread entry. Then when a new matcher is registered, if the thread was previously timedout, we interrupt the matcher immediately. closes #48861	2020-09-16 09:13:22 -04:00
Nik Everett	a68e47ab1f	Check for runtime field loops in queries (backport of #61927 ) (#62420 ) We were checking for loops in queries before, but we had an "off by one" error where we wouldn't notice the "top level" runtime field when detecting a loop. So the error message would be wrong. I also caught a few bugs with query generation caused by missing `@Override` annotations and fixed a few of them. There is a bug with `regexp` queries with match options that I'm not fixing in this PR but will get to later. Relates to #59332	2020-09-16 09:03:49 -04:00
Lyudmila Fokina	167172a057	Update authc failure headers on license change (#61734 ) (#62442 ) Backport of #61734	2020-09-16 14:37:03 +02:00
Benjamin Trent	8d89a28126	[ML] unmuting test for testTooManyPartitions memory check on windows (#62393 ) (#62405 ) This commit unmutes the windows check for testTooManyPartitions test. The assertion has since changed to include a soft_limit check. This coupled with changes over the past years means the test should be enabled again. related to: #32033	2020-09-16 07:03:10 -04:00
Christoph Büscher	6a016fb755	Muting LogstashSystemIndexIT.testPipelineCRUD	2020-09-16 11:04:41 +02:00
Armin Braun	f6a8599cf8	Don't Start Redundant ConsistentSettingsService (#62283 ) (#62428 ) The consistent settings service is only used in tests so far. No need to start it unless it's actually used.	2020-09-16 09:43:04 +02:00
Ignacio Vera	f3ed641fc7	Adds bucketOrd back to cardinality algorithms (#62389 ) (#62427 )	2020-09-16 08:41:57 +02:00
Hendrik Muhs	8566e9e3e7	[Transform] Make pivot validation sub-agg aware (#62381 ) With the addition of sub aggregations like filter, the validation could fail if 2 sub aggs use the same output name. This change makes validation sub-agg aware. fixes #57814	2020-09-16 07:55:58 +02:00
Yang Wang	a11dfbe031	Oidc additional client auth types (#58708 ) (#62289 ) The OpenID Connect specification defines a number of ways for a client (RP) to authenticate itself to the OP when accessing the Token Endpoint. We currently only support `client_secret_basic`. This change introduces support for 2 additional authentication methods, namely `client_secret_post` (where the client credentials are passed in the body of the POST request to the OP) and `client_secret_jwt` where the client constructs a JWT and signs it using the the client secret as a key. Support for the above, and especially `client_secret_jwt` in our integration tests meant that the OP we use ( Connect2id server ) should be able to validate the JWT that we send it from the RP. Since we run the OP in docker and it listens on an ephemeral port we would have no way of knowing the port so that we can configure the ES running via the testcluster to know the "correct" Token Endpoint, and even if we did, this would not be the Token Endpoint URL that the OP would think it listens on. To alleviate this, we run an ES single node cluster in docker, alongside the OP so that we can configured it with the correct hostname and port within the docker network. Co-authored-by: Ioannis Kakavas <ioannis@elastic.co>	2020-09-16 14:29:09 +10:00
Nik Everett	24a24d050a	Implement fields fetch for runtime fields (backport of #61995 ) (#62416 ) This implements the `fields` API in `_search` for runtime fields using doc values. Most of that implementation is stolen from the `docvalue_fields` fetch sub-phase, just moved into the same API that the `fields` API uses. At this point the `docvalue_fields` fetch phase looks like a special case of the `fields` API. While I was at it I moved the "which doc values sub-implementation should I use for fetching?" question from a bunch of `instanceof`s to a method on `LeafFieldData` so we can be much more flexible with what is returned and we're not forced to extend certain classes just to make the fetch phase happy. Relates to #59332	2020-09-15 20:24:10 -04:00
James Rodewig	f94ae7ae26	[DOCS] Add static/dynamic type to audit logging settings (#61235 ) (#62417 )	2020-09-15 17:34:24 -04:00
debadair	139a2988a0	[DOCS] Make it more obvious that the Java API is deprecated. (#62398 )	2020-09-15 14:24:45 -07:00
Nik Everett	e5ad3a41f1	Check for runtime field loops in queries (backport of #61927 ) (#62421 ) We were checking for loops in queries before, but we had an "off by one" error where we wouldn't notice the "top level" runtime field when detecting a loop. So the error message would be wrong. I also caught a few bugs with query generation caused by missing `@Override` annotations and fixed a few of them. There is a bug with `regexp` queries with match options that I'm not fixing in this PR but will get to later. Relates to #59332	2020-09-15 17:24:19 -04:00
Nik Everett	0a7f335215	Speed up writeVInt (backport of #62345 ) (#62419 ) This speeds up `StreamOutput#writeVInt` quite a bit which is nice because it is very commonly called when serializing aggregations. Well, when serializing anything. All "collections" serialize their size as a vint. Anyway, I was examining the serialization speeds of `StringTerms` and this saves about 30% of the write time for that. I expect it'll be useful other places.	2020-09-15 17:14:08 -04:00
Lisa Cawley	6320967546	[DOCS] Minor typo in ML API (#62414 )	2020-09-15 13:20:55 -07:00
Costin Leau	b2e85d5639	SQL: Do not resolve self-referencing aliases (#62382 ) Prevent the analyzer for trying to resolve aliases on expressions that reference themselves (or fields within themselves) as that causes infinite recursion. Fix #62296 (cherry picked from commit 021d27815b03e92e02859bc9c0c8eec78f30c72e)	2020-09-15 20:53:28 +03:00
Armin Braun	9ac4ee9c44	Increase Flaky Timeout in testIlmHistoryIndexCanRollover (#62353 ) (#62402 ) This busy assert easily takes about 5s on a very fast work station so the default of 10s is not sufficient here at all.	2020-09-15 19:50:45 +02:00
Nik Everett	771a8893a6	Add more debugging information for cardinality agg (#62317 ) (#62397 ) This adds two extra bits of info to the profiler: 1. Count of the number of different types of collectors. This lets us figure out if we're using the optimization for segment ordinals. It adds a few more similar counters just for good measure. 2. Profiles the `getLeafCollector` and `postCollection` methods. These are non-trivial for some aggregations, like cardinality.	2020-09-15 13:21:11 -04:00
James Rodewig	2a7de79a1b	[DOCS] Remove redundant index.blocks.read_only_allow_delete setting (#62392 ) (#62399 )	2020-09-15 12:43:48 -04:00
William Brafford	af64e46065	Add logstash system index APIs (#53350 ) (#62347 ) We want Logstash indices to be system indices, but the logstash service will still need to be able to manage its indices. This PR adds special system index APIs to the logstash plugin so that logstash can manage its pipelines without direct access to the underlying indices. * Add logstash module with dedicated logstash APIs * merge with x-pack plugin * add system index access allowance * Break out serialization tests into distinct classes * Log failures for partial multiget failure * Move LogstashSystemIndexIT to javaRestTest task Co-authored-by: William Brafford <william.brafford@elastic.co> Co-authored-by: Jay Modi <jaymode@users.noreply.github.com>	2020-09-15 12:42:14 -04:00
Armin Braun	ffbc64bd10	Log WARN on Response Deserialization Failure (#62368 ) (#62388 ) We never see this exception in the logs even though it's pretty severe. All we might see is an exception about a transport message not having been read fully from the logic that follows this code. Technically we should probably bubble up the exception but that's a bigger change and needs some carefully reasoning, this change for the time being at least simplifies tracking down deserialization issues in responses.	2020-09-15 18:27:39 +02:00
Armin Braun	98f525f8a7	Faster Azure Blob InputStream (#61812 ) (#62387 ) Building our own that should perform better than the one in the SDK. Also, as a result saving a HEAD call for each ranged read on Azure.	2020-09-15 18:27:22 +02:00
Adam Locke	5dc0de04fb	[DOCS] Clarifying remote clusters based on feedback from Support (#62335 ) (#62394 ) * Clarifying remote clusters based on feedback from Support. * Apply suggestions from code review * Making additional editorial changes.	2020-09-15 12:02:43 -04:00
Adrien Grand	6db8afefc2	Upgrade to lucene-8.7.0-snapshot-cdfdc1e0851. (#62376 ) Upgrade to a new Lucene snapshot that (at least partially) addresses the indexing rate regression when index sorting is enabled. Backport of #62334.	2020-09-15 17:48:07 +02:00
Alan Woodward	f89fa421e2	Remove unnecessary IndexSearcher field on HitContext (#62378 ) FastVectorHighlighter uses the top-level reader to rewrite queries against, which it gets via an IndexSearcher field on HitContext. However, we can already access this top-level reader via HitContext's existing LeafReaderContext field. This commit removes the unnecessary field and constructor parameter, and changes the implementation of topLevelReader to go via ReaderUtils and the leaf reader context.	2020-09-15 15:46:14 +01:00

... 6 7 8 9 10 ...

53999 Commits All Branches Search

53999 Commits

All Branches