OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-07 13:38:49 +00:00

Author	SHA1	Message	Date
Jack Conradson	2b838d1ea6	fix expression type for null safe operator (#63367 ) An invalid void expression type from a null safe operator caused ClassFormatError for the script Map x= ['0': 0]; x?.0 > 1. This change sets and propagates the correct expression type for the null safe operator to be written out.	2020-10-06 16:02:56 -07:00
Rene Groeschke	1d0f46cc35	Fix openjdk download on mac (#63309 ) - the wrong platform classifier (mac) was used when downloading openjdk on mac. This fixes it to use osx closes #63353	2020-10-06 15:04:15 -07:00
James Rodewig	26a157da7b	[DOCS] Update snowball links (#63351 ) (#63355 )	2020-10-06 16:21:37 -04:00
Gordon Brown	5c8b0662df	Deprecate REST access to System Indices (#63274 ) (Original #60945 ) This PR adds deprecation warnings when accessing System Indices via the REST layer. At this time, these warnings are only enabled for Snapshot builds by default, to allow projects external to Elasticsearch additional time to adjust their access patterns. Deprecation warnings will be triggered by all REST requests which access registered System Indices, except for purpose-specific APIs which access System Indices as an implementation detail a few specific APIs which will continue to allow access to system indices by default: - `GET _cluster/health` - `GET {index}/_recovery` - `GET _cluster/allocation/explain` - `GET _cluster/state` - `POST _cluster/reroute` - `GET {index}/_stats` - `GET {index}/_segments` - `GET {index}/_shard_stores` - `GET _cat/[indices,aliases,health,recovery,shards,segments]` Deprecation warnings for accessing system indices take the form: ``` this request accesses system indices: [.some_system_index], but in a future major version, direct access to system indices will be prevented by default ```	2020-10-06 13:41:40 -06:00
James Rodewig	3e548592b6	[DOCS] Update link to Snowball documentation (#63305 ) (#63348 ) The current link points to an obsolete site, which is no longer maintained. Co-authored-by: Stefan Walter <67258699+rd-stefan-walter@users.noreply.github.com>	2020-10-06 13:41:06 -04:00
Lee Hinman	dd99125193	[7.x] Add assert that raw and readable xcontent field names are different (#63332 ) (#63343 ) This adds asserts that will catch the case where we accidentally provide the same raw and readable field name in xcontent.	2020-10-06 11:32:41 -06:00
Adam Locke	4307b1d607	[DOCS] Updating permissions language for RPM install packages (#63277 ) (#63344 ) * Updating permissions language for RPM install packages. * Fix typo	2020-10-06 13:09:50 -04:00
Lisa Cawley	8f76c89cd3	[7.x][DOCS] Add feature_importance_baseline to get trained model API (#63279 ) (#63336 ) Co-authored-by: Benjamin Trent <ben.w.trent@gmail.com>	2020-10-06 10:08:34 -07:00
Adam Locke	4f314eeb9c	Updating certificate location instructions. (#63334 ) (#63340 )	2020-10-06 12:51:22 -04:00
Tanguy Leroux	87076c32e2	Determine shard size before allocating shards recovering from snapshots (#61906 ) (#63337 ) Determines the shard size of shards before allocating shards that are recovering from snapshots. It ensures during shard allocation that the target node that is selected as recovery target will have enough free disk space for the recovery event. This applies to regular restores, CCR bootstrap from remote, as well as mounting searchable snapshots. The InternalSnapshotInfoService is responsible for fetching snapshot shard sizes from repositories. It provides a getShardSize() method to other components of the system that can be used to retrieve the latest known shard size. If the latest snapshot shard size retrieval failed, the getShardSize() returns ShardRouting.UNAVAILABLE_EXPECTED_SHARD_SIZE. While we'd like a better way to handle such failures, returning this value allows to keep the existing behavior for now. Note that this PR does not address an issues (we already have today) where a replica is being allocated without knowing how much disk space is being used by the primary. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-10-06 18:37:05 +02:00
Julie Tibshirani	733e89d7ed	Make sure that IdFieldType#isAggregatable is accurate. (#62903 ) Before, it always returned 'true' even when the setting "indices.id_field_data.enabled" was false. Fixes #62897.	2020-10-06 09:33:44 -07:00
Igor Motov	2405162c39	Mute RegressionIT.testAliasFields test (#63339 ) It fails quite frequently in 7.x. Relates to #63268	2020-10-06 12:18:12 -04:00
David Kyle	ea32b4ab82	[ML] Audit message when nightly maintenance times out (#63252 ) (#63330 ) During deletion of old ml data set the delete by query timeout to 8 hours and audit a job message when the nightly maintenance task times out.	2020-10-06 16:19:37 +01:00
Hendrik Muhs	058c55da6a	[Transform] disallow field and script being empty for group sources (#63313 ) fail validation earlier when field and script are both missing in a group source	2020-10-06 16:59:02 +02:00
Rene Groeschke	09f7cff612	Fix syncing expanded distributions in gradle build (#63285 ) (#63328 )	2020-10-06 16:51:12 +02:00
István Zoltán Szabó	a3a373b67f	[DOCS] Adds delta and offset parameters to Evaluate DFA API docs (#63317 ) (#63329 )	2020-10-06 16:49:08 +02:00
Dan Hermann	7a59ae8fa2	[7.x] Allow_duplicates option for append processor (#61916 ) (#63257 )	2020-10-06 09:03:47 -05:00
Rene Groeschke	a3252af5c0	Cleanup on integtest distribution setup (7.x backport) (#63189 ) * Cleanup on integtest distribution setup (#62937) - Simplify build task and archive base name calculation - Move integ test zip project only setup into integ test zip build script * Fix merge	2020-10-06 15:58:42 +02:00
Rene Groeschke	8144106ace	Build local unreleased bwc versions more efficient for tests (7.x backport) (#63188 ) * Wire local unreleased bwc versions more efficient for tests (#62473) For testing against the local distribution we already avoid the packaging/unpackaging cycle of es distributions when setting up test clusters. This PR adopts the usage of the expanded created distributions for unreleased bwc versions (versions that are checkout from a branch and build from source in the :distribution:bwc:minor / :distribution:bwc:bugfix). This makes the setup of bwc based cross version tests a bit faster by avoiding the unpackaging overhead. We still assemble both in the bwcBuild tasks atm which will be addressed in a later issue. This reworks the :distribution:bwc project: - Convert all the custom logic from build script logic (groovy) into gradle binary plugins (java) - Tried to make the bwc setup logic a bit more readable - Add basic functional test coverage for the bwc logic this PR tweaked. - Extracted a general internal BWC Git plugin out of the bwc setup plugin to improve maintenance and testability - Changed the InternalDistributionPlugin to resolve the extracted distro instead on relying on unpacking the distribution archive * Fix java8 incompatibility * Fix extension calculation for 6.8.* distribution	2020-10-06 15:58:13 +02:00
Yang Wang	abf9b885b4	Bulk invalidate API keys using a list of IDs (#63224 ) (#63320 ) Add a new ids field to the API of invalidating API keys so that it supports bulk invalidation with a list of IDs. Note the existing id field is kept as is and it is an error if both id and ids are specified.	2020-10-07 00:49:21 +11:00
Yang Wang	bbfa2f1303	Fix test failure due to missing client action	2020-10-07 00:45:30 +11:00
Armin Braun	a8dbab23a5	Increase Timeout in testDynamicRestoreThrottling (#63300 ) (#63324 ) Even if we increase the limit it might not take effect straight away if a thread is blocked on a long wait in `org.elasticsearch.index.snapshots.blobstore.RateLimitingInputStream#maybePause`. Let's increase the limit a little and see if that deals with the remaining failures for good and stop burning cycles busy asserting a future completion. Closes #63246	2020-10-06 15:27:05 +02:00
Benjamin Trent	a72d7cc76a	[ML] prefer secondary auth headers on data frame analytics _explain (#63281 ) (#63323 ) We should prefer secondary auth headers when calling _explain	2020-10-06 09:15:29 -04:00
Luca Cavanna	ca68298e89	Remove MapperService argument from IndexFieldData.Builder#build (#63197 ) (#63311 ) MapperService carries a lot of weight and is only used to determine if loading of field data for the id field is enabled, which can be done in a different way.	2020-10-06 15:04:23 +02:00
Yang Wang	7969fbb4ab	Cache API key doc to reduce traffic to the security index (#59376 ) (#63319 ) Getting the API key document form the security index is the most time consuing part of the API Key authentication flow (>60% if index is local and >90% if index is remote). This traffic is now avoided by caching added with this PR. Additionally, we add a cache invalidator registry so that clearing of different caches will be managed in a single place (requires follow-up PRs).	2020-10-06 23:49:23 +11:00
Armin Braun	2aa80f9ee3	Dry up Searchable Snapshots ITs (#63190 ) (#63321 ) Just a few spots where we can dry up these tests using the snapshot test infrastructure in core that I found while studying the existing searchable snapshot tests.	2020-10-06 14:41:11 +02:00
Christoph Büscher	82096d3971	Enable SourceLookup to leverage sequential stored fields reader (#63035 ) (#63316 ) In #62509 we already plugged faster sequential access for stored fields in the fetch phase. This PR now adds using the potentially better field reader also in SourceLookup. Rally exeriments are showing that this speeds up e.g. when runtime fields that are using "_source" are added e.g. via "docvalue_fields" or are used in queries or aggs. Closes #62621	2020-10-06 14:34:39 +02:00
Mayya Sharipova	bea0ead08a	Fix fields retrieval on unsinged_long field (#63310 ) This fixes fields retrieval on unsigned_long field 1) For docvalue_fields a custom UnsignedLongLeafFieldData::getLeafValueFetcher is implemented that correctly retrieves doc values. 2) For stored fields, an error was fixed in UnsignedLongFieldMapper how stored values were stored. Before they were incorrectly stored in the shifted format, now they are stored as original values in String format. Relates to #60050 Backport for #63119	2020-10-06 06:37:31 -04:00
David Kyle	8f4ef40f78	[ML] Auditor ensures template is installed before writes (#63286 ) The ML auditors should not write if the latest template is not present. Instead a PUT template request is made and the writes queued up	2020-10-06 11:20:37 +01:00
Alan Woodward	7405af8060	Convert TypeFieldType to a constant field type (#63214 ) In 6x and 7x, indexes can have only one type, which means that we can rework all queries against the type field to use a ConstantFieldType. This has already been done in master with the removal of the TypeFieldMapper, but we still need that class in 7x to deal with nested documents. This commit leaves TypeFieldMapper in place, but refactors TypeFieldType to extend ConstantFieldType and consolidates deprecation warnings within that class. It also incidentally removes the requirement to pass a MapperService to IndexFieldData.Builder#build, which should allow #63197 to be backported.	2020-10-06 10:27:37 +01:00
Armin Braun	d7f6812d78	Improve Snapshot Abort Efficiency (#62173 ) (#63297 ) There is no need to let snapshots that haven't yet written anything to the repo finalize with `FAILED`. When we still had the `INIT` state we would also just remove these snapshots from the state without any further action. This is not just a theoretical optimization. Currently, the situation of having a lot of queued up snapshots is fairly complicated to resolve when all the queued shards move to aborted since it is now necessary to execute tasks on the `SNAPSHOT` pool (that might be very busy) to remove the snapshot from the CS (including a number of redundant CS updates and repo writes for finalizing these snapshots before deleting them right away after).	2020-10-06 05:14:25 +02:00
Nhat Nguyen	25fbc01459	Retry CCR shard follow task when no seed node left (#63225 ) If the connection between clusters is disconnected or the leader cluster is offline, then CCR shard-follow tasks can stop with "no seed node left". CCR should retry on this error.	2020-10-05 21:56:56 -04:00
Armin Braun	5c3a4c13dd	Clone Snapshot API (#61839 ) (#63291 ) Snapshot clone API. Complete except for some TODOs around documentation (and adding HLRC support). backport of #61839, #63217, #63037	2020-10-06 01:52:25 +02:00
Ryan Ernst	25f8a3ba42	Switch bundled jdk back to Oracle JDK (#63288 ) (#63290 ) We switched to adoptopenjdk from oracle jdk to rely on the notarization found in adoptopnejdk on MacOS. However, that notarization still had issues, and we currently do our own notarization of the entire distribution, including the jdk. The recent bump to jdk 15 has revealed openjdk to be lax in maintaining support for older systems. Since the notarization is no longer an issue, this PR moves the bundled jdk back to Oracle, in order to continue supporting those older systems affected by adoptopenjdk 15. relates #62709	2020-10-05 16:31:10 -07:00
Armin Braun	e91936512a	Refactor SnapshotsInProgress State Transitions (#60517 ) (#63266 ) The copy constructors previously used were hard to read and the exact state changes were not obvious at all. Refactored those into a number of named constructors instead, added additional assertions and moved the snapshot abort logic into `SnapshotsInProgress`.	2020-10-06 00:03:42 +02:00
Armin Braun	860791260d	Implement Shard Snapshot Clone Logic (#62771 ) (#63260 ) First part of the snapshot clone logic that implements the snapshot clone functionality on the repository level.	2020-10-05 22:55:52 +02:00
Costin Leau	d027e24b31	EQL: Remove match functions (#63275 ) Since match (for matching regex) is not currently in use remove it for now. Close #63263 (cherry picked from commit 6abd531cf457f3c5686f59709647bed3276e3c6b)	2020-10-05 23:30:41 +03:00
Costin Leau	6856306dcf	EQL: Remove wildcard functionality from : (#63276 ) Restrict : operator to only case insensitive matching on strings Close #63262 (cherry picked from commit bc02e77150cdd85594dfac4f03d8aeb85aaddbb3)	2020-10-05 23:30:41 +03:00
Lisa Cawley	22aea11016	[DOCS] Add experimental tag to rollup APIs (#63206 )	2020-10-05 13:22:11 -07:00
James Rodewig	df0861348c	[DOCS] Document static/dynamic watcher settings (#62218 ) (#63282 )	2020-10-05 15:50:01 -04:00
James Rodewig	a8bf9a6a91	[DOCS] Make EQL case-sensitive by default (#63270 ) (#63280 )	2020-10-05 15:49:48 -04:00
Andrei Stefan	76bba601ab	Remove case_sensitive request option (#63218 ) (#63244 ) Make EQL case sensitive by default and adapt some of the string functions Remove the case sensitive option from Between string function Add case_insensitive option to term and wildcard queries usage (cherry picked from commit 7550e0664c8c2f1f13519036c759b1e76345551f)	2020-10-05 22:04:42 +03:00
Nhat Nguyen	1a6837883a	Upgrade to Lucene-8.7.0-snapshot-77396dbf339 (#63222 ) Includes LUCENE-9554, which exposes the pendingNumDocs from IndexWriter.	2020-10-05 14:39:30 -04:00
Lisa Cawley	ce23c38e96	[DOCS] Add find file structure API to HLRC docs (#63212 ) (#63261 )	2020-10-05 11:37:44 -07:00
Nik Everett	7f07deb8d8	Skip broken test In #63242 we changed how we build `nextRoundingValue` to, well, be correct. But the old `org.elasticsearch.common.rounding.Rounding` implementation didn't get the fix. Which is fine, because it doesn't that method on that implementation doesn't receive any use outside of tests. In fact, it is entirely removed in master. Anyway, now that the two implementation produce different values we really can't go around asserting that they produce the same values now can we? Well, we were! This skips that assertion if we know `nextRoundingValue` is implemented differently. Closes #63256	2020-10-05 14:25:53 -04:00
Stuart Tettemer	791a9d5102	Scripting: enable regular expressions by default (#63029 ) (#63272 ) * Setting `script.painless.regex.enabled` has a new option, `use-factor`, the default. This defaults to using regular expressions but limiting the complexity of the regular expressions. In addition to `use-factor`, the setting can be `true`, as before, which enables regular expressions without limiting them. `false` totally disables regular expressions, which was the old default. * New setting `script.painless.regex.limit-factor`. This limits regular expression complexity by limiting the number characters a regular expression can consider based on input length. The default is `6`, so a regular expression can consider `6` * input length number of characters. With input `foobarbaz` (length `9`), for example, the regular expression can consider `54` (`6 * 9`) characters. This reduces the impact of exponential backtracking in Java's regular expression engine. * add `@inject_constant` annotation to whitelist. This annotation signals that a compiler settings will be injected at the beginning of a whitelisted method. The format is `argnum=settingname`: `1=foo_setting 2=bar_setting`. Argument numbers must start at one and must be sequential. * Augment `Pattern.split(CharSequence)` `Pattern.split(CharSequence, int)`, `Pattern.splitAsStream(CharSequence)` `Pattern.matcher(CharSequence)` to take the value of `script.painless.regex.limit-factor` as a an injected parameter, limiting as explained above when this setting is in use. Fixes: #49873 Backport of: 93f29a4	2020-10-05 13:17:47 -05:00
Jack Conradson	d134b4f70b	Make location final in IRNode (#63078 ) This change makes Location a final member of IRNode as opposed to possibly changing it. This ensures that all ir nodes have a Location for error information upon creation that cannot be updated so each node can be tracked as where it came from originally.	2020-10-05 10:16:31 -07:00
Armin Braun	cf75abb021	Optimize XContentParserUtils.ensureExpectedToken (#62691 ) (#63253 ) We only ever use this with `XContentParser` no need to make it inline worse by forcing the lambda and hence dynamic callsite here. => Extraced the exception formatting code path that is likely very cold to a separate method and removed the lambda usage in hot loops by simplifying the signature here.	2020-10-05 19:08:32 +02:00
James Rodewig	f4ddb43240	[DOCS] Clarify `allow_no_indices` def (#63209 ) (#63258 )	2020-10-05 13:00:53 -04:00
Armin Braun	51d0ed1bf3	Prepare Snapshot Shard State Update Logic For Clone Logic (#62617 ) (#63255 ) Small refactoring to shorten the diff with the clone logic in #61839: * Since clones will create a different kind of shard state update that isn't the same request sent by the snapshot shards service (and cannot be the same request because we have no `ShardId`) base the shard state updates on a different class that can be extended to be general enough to accomodate shard clones as well. * Make the update executor a singleton (can't make it an inline lambda as that would break CS update batching because the executor is used as a map key but this change still makes it crystal clear that there's no internal state to the executor) * Make shard state update responses a singleton (can't use TransportResponse.Empty because we need an action response but still it makes it clear that there's no actual response with content here)	2020-10-05 18:54:01 +02:00

1 2 3 4 5 ...

54001 Commits