OpenSearch

Commit Graph

Author	SHA1	Message	Date
Yang Wang	a84469742c	Improve role cache efficiency for API key roles (#58156 ) (#59397 ) This PR ensure that same roles are cached only once even when they are from different API keys. API key role descriptors and limited role descriptors are now saved in Authentication#metadata as raw bytes instead of deserialised Map<String, Object>. Hashes of these bytes are used as keys for API key roles. Only when the required role is not found in the cache, they will be deserialised to build the RoleDescriptors. The deserialisation is directly from raw bytes to RoleDescriptors without going through the current detour of "bytes -> Map -> bytes -> RoleDescriptors".	2020-07-13 22:58:11 +10:00
Armin Braun	4e574a7136	Remove Dead Code from Closed Index Snapshot Logic (#56764 ) (#59398 ) The code path for closed indices is dead code here ever since #39644 because `shards(currentState, indexIds, ...)` does not set `MISSING` on a closed index's shard that is assigned any longer. Before that change it would always set `MISSING` for a closed index's shard even it was assigned. => simplified the code accordingly.	2020-07-13 14:49:16 +02:00
David Turner	3fb9dccc22	Fix FSHealthServiceTests on Windows (#59387 ) In #52680 we introduced a new health check mechanism. This commit fixes up some related test failures on Windows caused by erroneously assuming that all paths begin with `/`. Closes #59380	2020-07-13 12:43:45 +01:00
Alan Woodward	19ba6c39d2	Migrate CompletionFieldMapper to parametrized format (#59291 ) This adds some optional extra configuration to Parameter: * custom serialization (to handle analyzers) * deprecated parameter names * parameter validation	2020-07-13 12:43:15 +01:00
Armin Braun	08b54feaaf	Remove Snapshot INIT Step (#55918 ) (#59374 ) With #55773 the snapshot INIT state step has become obsolete. We can set up the snapshot directly in one single step to simplify the state machine. This is a big help for building concurrent snapshots because it allows us to establish a deterministic order of operations between snapshot create and delete operations since all of their entries now contain a repository generation. With this change simple queuing up of snapshot operations can and will be added in a follow-up.	2020-07-13 13:41:09 +02:00
Kartika Prasad	8ab0c1b4a0	Update indexing-speed.asciidoc (#59347 ) typo fix	2020-07-13 12:19:43 +01:00
Dylan Mann	42035a5df3	Update ICU Analyzer Documentation (#59305 )	2020-07-13 12:15:13 +01:00
Alan Woodward	c810a4a12e	Continue to accept unused 'universal' params in <8.0 indexes (#59381 ) We have a number of parameters which are universally parsed by almost all mappers, whether or not they make sense. Migrating the binary and boolean mappers to the new style of declaring their parameters explicitly has meant that these universal parameters stopped being accepted, which would break existing mappings. This commit adds some extra logic to ParametrizedFieldMapper that checks for the existence of these universal parameters, and issues a warning on 7x indexes if it finds them. Indexes created in 8.0 and beyond will throw an error. Fixes #59359	2020-07-13 11:15:56 +01:00
István Zoltán Szabó	cdf6a054c6	[DOCS] Fixes getting time features example in Painless in Transforms (#59379 )	2020-07-13 10:57:59 +02:00
David Kyle	7dcd943e1d	Mute FsHealthServiceTests testFailsHealthOnIOException (#59382 ) For #59380	2020-07-13 09:48:07 +01:00
David Roberts	2f9d4a1c7a	[DOCS] Adds extra ml-cpp PRs to release notes (#59354 ) Following the rebuild of 7.8.1 two extra ml-cpp PRs will now be released in 7.8.1.	2020-07-13 09:36:21 +01:00
Armin Braun	483386136d	Move all Snapshot Master Node Steps to SnapshotsService (#56365 ) (#59373 ) This refactoring has three motivations: 1. Separate all master node steps during snapshot operations from all data node steps in code. 2. Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in `SnapshotsService` so that operations for each shard can be linearized efficiently (i.e. without having to inspect the full snapshot state for all shards on every cluster state update, allowing us to track more in memory and only fall back to inspecting the full CS on master failover like we do in the snapshot shards service). * This PR already contains some best effort examples of this, but obviously this could be way improved upon still (just did not want to do it in this PR for complexity reasons) 3. Make the `SnapshotsService` less expensive on the CS thread for large snapshots	2020-07-12 22:19:07 +02:00
David Kyle	a6a27b76dc	Fix broken links to aggregation javadoc (#59083 ) (#59319 ) Fixes links from the Java High Level Rest Client to the aggregations java docs	2020-07-11 13:28:03 +01:00
Dan Hermann	e01d73c737	[7.x] Data stream admin actions are now index-level actions	2020-07-10 14:36:18 -05:00
Dan Hermann	7fa9cf601b	Data stream support for rollup search	2020-07-10 11:13:34 -05:00
Rene Groeschke	68dd431bc9	Fix deprecated unsave project outputs resolution (#59088 ) (#59356 ) - Fixes how libs in distribution are resolved - Required minor rework on common repository setup to allow distribution projects to resolve thirdparty artifacts - Use Default configurations when resolving tools for distribution packaging - Related to #57920	2020-07-10 17:16:47 +02:00
Stuart Tettemer	4c04fd1e05	Scripting: Unlimited compilation rate for ingest (#59268 ) * `ingest` and `processor_conditional` default to unlimited compilation rate Refs: #50152	2020-07-09 16:34:47 -05:00
James Rodewig	1402f787f8	[DOCS] Add data streams to field caps API docs (#59326 ) (#59340 )	2020-07-09 16:54:33 -04:00
James Rodewig	41345d4dd3	[DOCS] Add data streams to clear cache API docs (#59324 ) (#59339 )	2020-07-09 16:54:04 -04:00
James Rodewig	77e227bf9b	[DOCS] Document custom routing support for data streams (#59323 ) (#59338 )	2020-07-09 16:52:30 -04:00
James Rodewig	ef74a68bcc	[DOCS] Document index aliases do not support data streams (#59321 ) (#59337 )	2020-07-09 16:51:58 -04:00
Alan Woodward	4b9cbfca64	Remove test backported in error	2020-07-09 21:45:41 +01:00
Stuart Tettemer	94e213dd5f	Scripting: Per context stats in `script` in _nodes/stats (#59266 ) Updated `_nodes/stats`: * Update `script` in `_node/stats` to include stats per context: ``` "script": { "compilations": 1, "cache_evictions": 0, "compilation_limit_triggered": 0, "contexts":[ { "context": "aggregation_selector", "compilations": 0, "cache_evictions": 0, "compilation_limit_triggered": 0 }, ``` Refs: #50152 Backport: #59625	2020-07-09 15:30:50 -05:00
Alan Woodward	f4caadd239	MappedFieldType no longer requires equals/hashCode/clone (#59212 ) With the removal of mapping types and the immutability of FieldTypeLookup in #58162, we no longer have any cause to compare MappedFieldType instances. This means that we can remove all equals and hashCode implementations, and in addition we no longer need the clone implementations which were required for equals/hashcode testing. This greatly simplifies implementing new MappedFieldTypes, which will be particularly useful for the runtime fields project.	2020-07-09 21:05:10 +01:00
Lisa Cawley	54483394ae	[DOCS] Clarify subscription requirements (#58958 ) (#59307 )	2020-07-09 12:24:45 -07:00
Dan Hermann	c7e977701a	Data stream support for async search	2020-07-09 13:12:04 -05:00
Dan Hermann	c26d2b5fa5	Data stream support for indices shard stores API	2020-07-09 13:11:45 -05:00
Dan Hermann	34c50c045c	Data stream support for rank eval API	2020-07-09 13:11:29 -05:00
Dan Hermann	b9fb12924b	Data stream support for EQL search	2020-07-09 13:10:44 -05:00
James Rodewig	fca722cee1	[DOCS] Add x-pack tag to data stream docs (#59241 ) (#59299 )	2020-07-09 13:12:38 -04:00
Dimitris Athanasiou	b2243337d8	[7.x][ML] Data frame analytics max_num_threads setting (#59254 ) (#59308 ) This adds a setting to data frame analytics jobs called `max_number_threads`. The setting expects a positive integer. When used the user specifies the max number of threads that may be used by the analysis. Note that the actual number of threads used is limited by the number of processors on the node where the job is assigned. Also, the process may use a couple more threads for operational functionality that is not the analysis itself. This setting may also be updated for a stopped job. More threads may reduce the time it takes to complete the job at the cost of using more CPU. Backport of #59254 and #57274	2020-07-09 19:15:46 +03:00
Nik Everett	28ef997953	Improve vwh's distant bucket handling (#59094 ) (#59248 ) This modifies the `variable_width_histogram`'s distant bucket handling to: 1. Properly handle integer overflows 2. Recalculate the average distance when new buckets are added on the ends. This should slow down the rate at which we build extra buckets as we build more of them. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-09 12:14:46 -04:00
Przemko Robakowski	c870d6e570	[7.x] Restart tests with data streams (#58330 ) (#59303 ) * Restart tests with data streams (#58330)	2020-07-09 17:52:20 +02:00
Costin Leau	d9c1e531db	EQL: Introduce until functionality (#59292 ) Sequences now support until conditional, which prevents a match from occurring if the until matches a document while doing look-ups. Thus a sequence must complete before the until condition matches - if any document within the sequence occurs at, or after, the until hit, the sequence is discarded. (cherry picked from commit 1ba1b9f0661aee655aa48cf9475ac61aaee2bfda)	2020-07-09 17:12:01 +03:00
Dimitris Athanasiou	d07b11b86b	[7.x][ML] Perform test inference on java (#58877 ) (#59298 ) Since we are able to load the inference model and perform inference in java, we no longer need to rely on the analytics process to be performing test inference on the docs that were not used for training. The benefit is that we do not need to send test docs and fit them in memory of the c++ process. Backport of #58877 Co-authored-by: Dimitris Athanasiou <dimitris@elastic.co> Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2020-07-09 16:30:49 +03:00
David Kyle	86555ec163	Remove unused function InferenceIndexConstants.mapping() (#59146 ) (#59158 ) InferenceIndexConstants.mapping() is broken and unused.	2020-07-09 14:28:53 +01:00
Andrei Stefan	d187b531ed	EQL: Give a name to all toml tests and enforce the naming of new tests (#59283 ) (#59295 ) (cherry picked from commit c8ffe3c9237d3cdd90331795b8e37517155b7e91)	2020-07-09 16:20:29 +03:00
Rory Hunter	5debd09808	Dangling indices documentation (#58751 ) Part of #48366. Add documentation for the dangling indices API added in #58176. Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Adam Locke <adam.locke@elastic.co>	2020-07-09 14:02:23 +01:00
David Kyle	dbb9c802b1	Better error message when the model cannot be parsed due to its size (#59166 ) (#59209 ) The actual cause can be lost in a long list of parse exceptions this surfaces the cause when the problem is size.	2020-07-09 13:43:46 +01:00
David Kyle	c5443f78ce	Add Inference Pipeline aggregation to HLRC (#59086 ) (#59250 ) Adds InferencePipelineAggregationBuilder to the HLRC duplicating the server side classes	2020-07-09 13:38:45 +01:00
David Turner	d56fc72ee5	Fix node health-check-related test failures (#59277 ) In #52680 we introduced a new health check mechanism. This commit fixes up some sporadic related test failures, and improves the behaviour of the `FollowersChecker` slightly in the case that no retries are configured. Closes #59252 Closes #59172	2020-07-09 12:46:12 +01:00
David Turner	c80a9e2ec2	Skip unnecessary directory iteration (#59007 ) Today `NodeEnvironment#findAllShardIds` enumerates the index directories in each data path in order to find one with a specific name. Since we already know the name of the folder we seek we can construct the path directly and avoid this directory listing. This commit does that.	2020-07-09 11:56:41 +01:00
Daniel Mitterdorfer	10ef4d2140	Mute testMaxRestoreBytesPerSecIsUsed (#59289 ) Relates #59287	2020-07-09 12:52:17 +02:00
Alan Woodward	67a27e2b9d	Add declarative parameters to FieldMappers (#58663 ) The FieldMapper infrastructure currently has a bunch of shared parameters, many of which are only applicable to a subset of the 41 mapper implementations we ship with. Merging, parsing and serialization of these parameters are spread around the class hierarchy, with much repetitive boilerplate code required. It would be much easier to reason about these things if we could declare the parameter set of each FieldMapper directly in the implementing class, and share the parsing, merging and serialization logic instead. This commit is a first effort at introducing a declarative parameter style. It adds a new FieldMapper subclass, ParametrizedFieldMapper, and refactors two mappers, Boolean and Binary, to use it. Parameters are declared on Builder classes, with the declaration including the parameter name, whether or not it is updateable, a default value, how to parse it from mappings, and how to extract it from another mapper at merge time. Builders have a getParameters method, which returns a list of the declared parameters; this is then used for parsing, merging and serialization. Merging is achieved by constructing a new Builder from the existing Mapper, and merging in values from the merging Mapper; conflicts are all caught at this point, and if none exist then a new, merged, Mapper can be built from the Builder. This allows all values on the Mapper to be final. Other mappers can be gradually migrated to this new style, and once they have all been refactored we can merge ParametrizedFieldMapper and FieldMapper entirely.	2020-07-09 11:43:21 +01:00
Daniel Mitterdorfer	daa48329ec	[TEST] Mute FollowerFailOverIT.testFailOverOnFollower (#58659 ) (#59286 ) Relates #58534 Co-authored-by: Dimitris Athanasiou <dimitris@elastic.co>	2020-07-09 12:38:36 +02:00
Albert Zaharovits	2b7456db7f	Improve auditing of API key authentication #58928 1. Add the `apikey.id`, `apikey.name` and `authentication.type` fields to the `access_granted`, `access_denied`, `authentication_success`, and (some) `tampered_request` audit events. The `apikey.id` and `apikey.name` are present only when authn using an API Key. 2. When authn with an API Key, the `user.realm` field now contains the effective realm name of the user that created the key, instead of the synthetic value of `_es_api_key`.	2020-07-09 13:26:18 +03:00
Dimitris Athanasiou	d323f8d698	[ML] Add REST spec for the update data frame analytics endpoint (#59253 ) (#59281 ) Closes #59148 Backport of #59253	2020-07-09 13:12:21 +03:00
Ignacio Vera	1ad00d1ceb	Add Support in geo_match enrichment policy for any type of geometry (#59276 ) geo_match enrichment works currently only with points. This change adds the ability to use any type of geometry.	2020-07-09 11:41:41 +02:00
Andrei Stefan	c0e0bca84c	Remove search_after and implicit_join_key_field (#59232 ) (#59280 ) (cherry picked from commit 6ede6c59eff321b9fedad30e19508b9e4f788b54)	2020-07-09 12:34:01 +03:00
Bogdan Pintea	acfff7b896	Add sample versions of standard deviation and variance funcs (#59093 ) (#59274 ) * Add sample versions of standard deviation and variance functions (#59093) * Add STDDEV_SAMP, VAR_SAMP This commit adds the sampling variations of the standard deviation and variance agg functions. (cherry picked from commit 8b29817b49e386215f29cb5b3356d0183fd5d9de) * Fix: workaround for lack of Map#of() in Java8 Replace Map#of() with a HashMap static init.	2020-07-09 10:17:13 +02:00

1 2 3 4 5 ...

52649 Commits All Branches Search

52649 Commits

All Branches