OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-18 19:05:06 +00:00

Author	SHA1	Message	Date
debadair	7d20d32a8c	Update node.asciidoc (#59201 ) (#59479 ) TIP block was missing due to the lack of line break prior to the "TIP" Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>	2020-07-13 16:51:14 -07:00
Tim Brooks	623df95a32	Adding indexing pressure stats to node stats API (#59467 ) We have recently added internal metrics to monitor the amount of indexing occurring on a node. These metrics introduce back pressure to indexing when memory utilization is too high. This commit exposes these stats through the node stats API.	2020-07-13 17:23:42 -06:00
Mark Vieira	dc7d4c615c	Ensure fixture runtime dependencies are built before starting containers (#59474 )	2020-07-13 15:58:01 -07:00
Nik Everett	81cba796e6	Add microbenchmark for LongKeyedBucketOrds (#58608 ) (#59459 ) I've always been confused by the strange behavior that I saw when working on #57304. Specifically, I saw switching from a bimorphic invocation to a monomorphic invocation to give us a 7%-15% performance bump. This felt bonkers to me. And, it also made me wonder whether it'd be worth looking into doing it everywhere. It turns out that, no, it isn't needed everywhere. This benchmark shows that a bimorphic invocation like: ``` LongKeyedBucketOrds ords = new LongKeyedBucketOrds.ForSingle(); ords.add(0, 0); <------ this line ``` is 19% slower than a monomorphic invocation like: ``` LongKeyedBucketOrds.ForSingle ords = new LongKeyedBucketOrds.ForSingle(); ords.add(0, 0); <------ this line ``` But only when the reference is mutable. In the example above, if `ords` is never changed then both perform the same. But if the `ords` reference is assigned twice then we start to see the difference: ``` immutable bimorphic avgt 10 6.468 ± 0.045 ns/op immutable monomorphic avgt 10 6.756 ± 0.026 ns/op mutable bimorphic avgt 10 9.741 ± 0.073 ns/op mutable monomorphic avgt 10 8.190 ± 0.016 ns/op ``` So the conclusion from all this is that we've done the right thing: `auto_date_histogram` is the only aggregation in which `ords` isn't final and it is the only aggregation that forces monomorphic invocations. All other aggregations use an immutable bimorphic invocation. Which is fine. Relates to #56487	2020-07-13 17:22:46 -04:00
James Rodewig	db89764539	[DOCS] Add data streams to rollup APIs (#59423 ) (#59465 )	2020-07-13 16:57:40 -04:00
Lee Hinman	81bdb20b8a	Fix license header for DataStreamRestIT	2020-07-13 14:41:29 -06:00
Tim Brooks	68d56fa7db	Implement rejections in `WriteMemoryLimits` (#59451 ) This commit adds rejections when the indexing memory limits are exceeded for primary or coordinating operations. The amount of bytes allow for indexing is controlled by a new setting `indexing_limits.memory.limit`.	2020-07-13 14:34:50 -06:00
James Rodewig	a1cf955dbd	[DOCS] Clarify that passwords are not preserved for `kibana_system` user (#59449 ) (#59460 )	2020-07-13 16:34:11 -04:00
Mark Tozzi	eb0b28dd1d	Move getPointReaderOrNull into AggregatorBase (#58769 ) (#59455 )	2020-07-13 16:31:33 -04:00
Lee Hinman	bf1a60130d	[7.x] Add telemetery for data streams (#59433 ) (#59454 ) This commit adds data stream info to the `/_xpack` and `/_xpack/usage` APIs. Currently the usage is pretty minimal, returning only the number of data streams and the number of indices currently abstracted by a data stream: ``` ... "data_streams" : { "available" : true, "enabled" : true, "data_streams" : 3, "indices_count" : 17 } ... ```	2020-07-13 14:30:11 -06:00
Adam Locke	aa260636e5	Indicating that the size parameter defaults to 10. (#59438 ) (#59461 )	2020-07-13 16:27:20 -04:00
Armin Braun	64c5f70a2d	Remove Needless Context Switches on Loading RepositoryData (#56935 ) (#59452 ) We don't need to switch to the generic or snapshot pool for loading cached repository data (i.e. most of the time in normal operation). This makes `executeConsistentStateUpdate` less heavy if it has to retry and lowers the chance of having to retry in the first place. Also, this change allowed simplifying a few other spots in the codebase where we would fork off to another pool just to load repository data.	2020-07-13 21:38:29 +02:00
Jake Landis	665b7b7bd8	Convert modules to use yamlRestTest (#59089 ) (#59446 ) This commit moves the modules REST tests to the newly introduced yamlRestTest source set. A few tests have also been re-named to include the correct IT suffix. Without changing the names, the testing conventions task would fail since now that the YAML tests are no longer present pacify the convention. These tests have moved to the internalClusterTest source set. related: #56841	2020-07-13 13:53:05 -05:00
Armin Braun	bde92fc5fc	Remove Needless Context Switch From Snapshot Finalization (#56871 ) (#59443 ) No need to do any switch to the `SNAPSHOT` pool here, the blob store repo handles all its writes async on the `SNAPSHOT` pool so we're just needlessly context-switching to enqueue those tasks there. Also cleaned up the source only repository (the only override to `finalizeSnapshot`) to make it clear that no IO is happening there and we don't need to run it on the `SNAPSHOT` pool either.	2020-07-13 20:11:07 +02:00
Armin Braun	31be3a3645	More Efficient Snapshot State Handling (#56669 ) (#59430 ) Follow up to #56365. Instead of redundantly checking snapshots for completion over and over, just track the completed snapshots in the CS updates that complete them instead of looping over the smae snapshot entries over and over. Also, in the batched snapshot shard status updates, only check for completion of a snapshot entry if it isn't already finalizing.	2020-07-13 18:58:04 +02:00
James Rodewig	d293e1ae36	[DOCS] Add data streams to reload search analyzers API (#59422 ) (#59437 )	2020-07-13 12:50:47 -04:00
James Rodewig	0a7664e190	[DOCS] Add data streams to validate query API (#59420 ) (#59436 )	2020-07-13 12:50:34 -04:00
homersimpsons	f95658d1f8	[DOCS] MatchQuery: `transpositions` to `fuzzy_transpositions` (#59371 )	2020-07-13 12:37:30 -04:00
Christos Soulios	3868bcc7b8	[7.x] Histogram integration on Histogram field type (#59431 ) Backports #58930 to 7.x Implements histogram aggregation over histogram fields as requested in #53285.	2020-07-13 19:36:33 +03:00
Dimitris Athanasiou	a7895ff458	[7.x][ML] Remove unused member var from ExtractedFieldsDetector (#59395 ) (#59406 ) Removes member variable `index` from `ExtractedFieldsDetector` as it is not used. Backport of #59395 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-13 19:10:43 +03:00
Igor Motov	1acb4aeba9	EQL: Prepare for release (#59331 ) (#59426 ) Enables eql setting in release builds. Relates #51613	2020-07-13 11:54:32 -04:00
Henning Andersen	adf6083dd0	Enhance real memory circuit breaker with G1 GC (#58674 ) (#59394 ) Using G1 GC, Elasticsearch can rarely trigger that heap usage goes above the real memory circuit breaker limit and stays there for an extended period. This situation will persist until the next young GC. The circuit breaking itself hinders that from occurring in a timely manner since it breaks all request before real work is done. This commit gently nudges G1 to do a young GC and then double checks that heap usage is still above the real memory circuit breaker limit before throwing the circuit breaker exception. Related to #57202	2020-07-13 17:41:09 +02:00
Martijn van Groningen	b1b7bf3912	Make data streams a basic licensed feature. (#59392 ) Backport of #59293 to 7.x branch. * Create new data-stream xpack module. * Move TimestampFieldMapper to the new module, this results in storing a composable index template with data stream definition only to work with default distribution. This way data streams can only be used with default distribution, since a data stream can currently only be created if a matching composable index template exists with a data stream definition. * Renamed `_timestamp` meta field mapper to `_data_stream_timestamp` meta field mapper. * Add logic to put composable index template api to fail if `_data_stream_timestamp` meta field mapper isn't registered. So that a more understandable error is returned when attempting to store a template with data stream definition via the oss distribution. In a follow up the data stream transport and rest actions can be moved to the xpack data-stream module.	2020-07-13 17:26:46 +02:00
Yang Wang	cc9166a5ea	Mute failed 120_api_key_auth test till #59425 is addressed.	2020-07-14 01:10:36 +10:00
Yang Wang	edf27cd765	Adjust BWC versions for API key auth test. API key realm name is not available in authentication metadata prior to v7.5. The issue is tracked at #59425	2020-07-14 00:38:42 +10:00
David Roberts	b5e8250a4e	[ML] Drive categorization warning notifications from annotations (#59393 ) With the introduction of per-partition categorization the old logic for creating a job notification for categorization status "warn" does not work. However, the C++ code is already writing annotations for categorization status "warn" that take into account whether per-partition categorization is being used and which partition(s) the warnings relate to. Therefore, this change alters the Java results processor to create notifications based on the annotations the C++ writes. (It is arguable that we don't need both annotations and notifications, but they show up in different ways in the UI: only annotations are visible in results and only notifications set the warning symbol in the jobs list. This means it's best to have both.) Backport of #59377	2020-07-13 15:28:57 +01:00
Dan Hermann	c228532ebd	Update docs for delete data stream API to show that multiple names are supported	2020-07-13 09:11:25 -05:00
James Rodewig	27a87c9d0c	[DOCS] Update snapshot/restore and SLM docs for data streams (#58513 ) (#59403 ) Updates the existing snapshot/restore and SLM docs to make them aware of data streams.	2020-07-13 09:26:51 -04:00
Alan Woodward	bd01fd107c	Revert "Migrate CompletionFieldMapper to parametrized format (#59291 )" This reverts commit 19ba6c39d2749d964075703e40ec55d796dcf4dc.	2020-07-13 14:16:09 +01:00
David Kyle	054d5236d4	Mute RegressionIT failure (#59414 ) For #59413	2020-07-13 14:12:19 +01:00
James Rodewig	2629a95e14	[DOCS] EQL: Document `until` keyword support (#59320 ) (#59408 )	2020-07-13 09:05:47 -04:00
James Rodewig	85101fa487	[DOCS] Add data streams to searchable snapshot API docs (#59325 ) (#59409 )	2020-07-13 09:05:27 -04:00
James Rodewig	a357ec59f2	[DOCS] Add data streams to index APIs (#59329 ) (#59410 )	2020-07-13 09:05:03 -04:00
James Rodewig	35a78b88ab	[DOCS] Add data streams to ILM explain API (#59343 ) (#59411 )	2020-07-13 09:04:42 -04:00
James Rodewig	896d0ffd9b	[DOCS] EQL: Prepare docs for release (#59259 ) (#59407 ) Changes: * Swaps the `dev` admonitions for `experimental` admonitions * Removes `ifdef` statements preventing the docs from appearing in released branches	2020-07-13 09:04:15 -04:00
James Rodewig	9d5c091f7a	[DOCS] Add data streams to EQL search docs (#58611 ) (#59404 )	2020-07-13 09:03:55 -04:00
James Rodewig	39bcc4a1a7	[DOCS] Add ingest pipeline ex to data stream docs (#58343 ) (#59402 )	2020-07-13 09:03:36 -04:00
Yang Wang	a84469742c	Improve role cache efficiency for API key roles (#58156 ) (#59397 ) This PR ensure that same roles are cached only once even when they are from different API keys. API key role descriptors and limited role descriptors are now saved in Authentication#metadata as raw bytes instead of deserialised Map<String, Object>. Hashes of these bytes are used as keys for API key roles. Only when the required role is not found in the cache, they will be deserialised to build the RoleDescriptors. The deserialisation is directly from raw bytes to RoleDescriptors without going through the current detour of "bytes -> Map -> bytes -> RoleDescriptors".	2020-07-13 22:58:11 +10:00
Armin Braun	4e574a7136	Remove Dead Code from Closed Index Snapshot Logic (#56764 ) (#59398 ) The code path for closed indices is dead code here ever since #39644 because `shards(currentState, indexIds, ...)` does not set `MISSING` on a closed index's shard that is assigned any longer. Before that change it would always set `MISSING` for a closed index's shard even it was assigned. => simplified the code accordingly.	2020-07-13 14:49:16 +02:00
David Turner	3fb9dccc22	Fix FSHealthServiceTests on Windows (#59387 ) In #52680 we introduced a new health check mechanism. This commit fixes up some related test failures on Windows caused by erroneously assuming that all paths begin with `/`. Closes #59380	2020-07-13 12:43:45 +01:00
Alan Woodward	19ba6c39d2	Migrate CompletionFieldMapper to parametrized format (#59291 ) This adds some optional extra configuration to Parameter: * custom serialization (to handle analyzers) * deprecated parameter names * parameter validation	2020-07-13 12:43:15 +01:00
Armin Braun	08b54feaaf	Remove Snapshot INIT Step (#55918 ) (#59374 ) With #55773 the snapshot INIT state step has become obsolete. We can set up the snapshot directly in one single step to simplify the state machine. This is a big help for building concurrent snapshots because it allows us to establish a deterministic order of operations between snapshot create and delete operations since all of their entries now contain a repository generation. With this change simple queuing up of snapshot operations can and will be added in a follow-up.	2020-07-13 13:41:09 +02:00
Kartika Prasad	8ab0c1b4a0	Update indexing-speed.asciidoc (#59347 ) typo fix	2020-07-13 12:19:43 +01:00
Dylan Mann	42035a5df3	Update ICU Analyzer Documentation (#59305 )	2020-07-13 12:15:13 +01:00
Alan Woodward	c810a4a12e	Continue to accept unused 'universal' params in <8.0 indexes (#59381 ) We have a number of parameters which are universally parsed by almost all mappers, whether or not they make sense. Migrating the binary and boolean mappers to the new style of declaring their parameters explicitly has meant that these universal parameters stopped being accepted, which would break existing mappings. This commit adds some extra logic to ParametrizedFieldMapper that checks for the existence of these universal parameters, and issues a warning on 7x indexes if it finds them. Indexes created in 8.0 and beyond will throw an error. Fixes #59359	2020-07-13 11:15:56 +01:00
István Zoltán Szabó	cdf6a054c6	[DOCS] Fixes getting time features example in Painless in Transforms (#59379 )	2020-07-13 10:57:59 +02:00
David Kyle	7dcd943e1d	Mute FsHealthServiceTests testFailsHealthOnIOException (#59382 ) For #59380	2020-07-13 09:48:07 +01:00
David Roberts	2f9d4a1c7a	[DOCS] Adds extra ml-cpp PRs to release notes (#59354 ) Following the rebuild of 7.8.1 two extra ml-cpp PRs will now be released in 7.8.1.	2020-07-13 09:36:21 +01:00
Armin Braun	483386136d	Move all Snapshot Master Node Steps to SnapshotsService (#56365 ) (#59373 ) This refactoring has three motivations: 1. Separate all master node steps during snapshot operations from all data node steps in code. 2. Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in `SnapshotsService` so that operations for each shard can be linearized efficiently (i.e. without having to inspect the full snapshot state for all shards on every cluster state update, allowing us to track more in memory and only fall back to inspecting the full CS on master failover like we do in the snapshot shards service). * This PR already contains some best effort examples of this, but obviously this could be way improved upon still (just did not want to do it in this PR for complexity reasons) 3. Make the `SnapshotsService` less expensive on the CS thread for large snapshots	2020-07-12 22:19:07 +02:00
David Kyle	a6a27b76dc	Fix broken links to aggregation javadoc (#59083 ) (#59319 ) Fixes links from the Java High Level Rest Client to the aggregations java docs	2020-07-11 13:28:03 +01:00

... 2 3 4 5 6 ...

52786 Commits