OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-19 19:35:02 +00:00

Author	SHA1	Message	Date
Hendrik Muhs	faadb388da	mute mixed continuous transforms upgrade test (#56198 ) mute transform upgrade test, see #56196	2020-05-05 14:40:50 +02:00
David Turner	40ea0eabd9	Forbid snapshot access on applier thread (#56044 ) This commit strengthens the assertion about which threads may access a blob store to exclude the cluster applier thread, since we no longer need to do so. Relates #50999	2020-05-05 13:27:55 +01:00
Dimitris Athanasiou	2d7899c83c	[7.x][ML] Adjust DF Analytics process phases (#56107 ) (#56177 ) As of elastic/ml-cpp#1179, the analytics process reports phases depending on the analysis type. This commit adjusts the phases of current analyses from `analyzing` to the following: - outlier_detection: [`computing_outlier`] - regression/classification: [`feature_selection`, `coarse_parameter_search`, `fine_tuning_parameters`, `final_training`] Backport of #56107	2020-05-05 15:00:07 +03:00
Dimitris Athanasiou	75dadb7a6d	[7.x][ML] Add loss_function to regression (#56118 ) (#56187 ) Adds parameters `loss_function` and `loss_function_parameter` to regression. Backport of #56118	2020-05-05 14:59:51 +03:00
Jason Tedor	c38388c506	Fix compiling in TransportValidateQueryActionTests This arose after a backport where we do not have the nicities of the Java 11 diamond operator. This commit fixes it by adding the proper type parameter.	2020-05-05 07:36:40 -04:00
Jason Tedor	410eb29937	Fix validate query listener invocation bug (#56157 ) When the index we are validating a query does not exist, we try to send back a response letting the client know that the index does not exist. Yet, we accidentally fallthrough into the case that the validation failed for some other reason. This means that we end up notifying the channel twice. Sometimes the notification occurs after the failure has been written out and the channel closed (so the second invocation leads to a silent failed to write to a closed channel issue), and sometimes the response does end up in the channel, creating garbled responses to the client. This commit fixes that issue by avoiding the fallthrough.	2020-05-05 07:26:02 -04:00
Hendrik Muhs	e177a38504	[7.x][Transform] add throttling (#56007 ) (#56184 ) add throttling to transform, throttling will slow down search requests by delaying the execution based on a documents per second metric. fixes #54862	2020-05-05 13:09:02 +02:00
Andrei Dan	f569405fde	Enable simulate API tests in 7.8 (#55946 ) As #55686 was backported the simulate index template api is no available in 7.8.	2020-05-05 11:28:00 +01:00
Marios Trivyzas	363e994171	SQL: Fix DATETIME_PARSE behaviour regarding timezones (#56158 ) (#56182 ) Previously, when the timezone was missing from the datetime string and the pattern, UTC was used, instead of the session defined timezone. Moreover, if a timezone was included in the datetime string and the pattern then this timezone was used. To have a consistent behaviour the resulting datetime will always be converted to the session defined timezone, e.g.: ``` SELECT DATETIME_PARSE('2020-05-04 10:20:30.123 +02:00', 'HH:mm:ss dd/MM/uuuu VV') AS datetime; ``` with `time_zone` set to `-03:00` will result in ``` 2020-05-04T05:20:40.123-03:00 ``` Follows: #54960 (cherry picked from commit 8810ed03a209cc8fe1bad309a81e85b56a39da27)	2020-05-05 12:08:39 +02:00
Tanguy Leroux	f717830563	Use workers to warm cache parts (#55793 ) (#56181 ) Today the cache prewarming introduced in #55322 works by enqueuing altogether the files parts to warm in the searchable_snapshots thread pool. In order to make this fairer among concurrent warmings, this commit starts workers that concurrently polls file parts to warm from a queue, warms the part and then immediately schedule another warming execution. This should leave more room for concurrent shard warming to sneak in and be executed. Relates #55322	2020-05-05 11:48:06 +02:00
Tanguy Leroux	35622747fd	Add Minio tests for searchable snapshots (#56112 ) (#56179 ) This commit adds QA tests for searchable snapshot on MinIO, similarly to what already exist for S3, GCS and Azure.	2020-05-05 11:40:06 +02:00
Marios Trivyzas	cc21468559	SQL: Fix issue with date range queries and timezone (#56115 ) (#56174 ) Previously, the timezone parameter was not passed to the RangeQuery and as a results queries that use the ES date math notation (now, now-1d, now/d, now/h, now+2h, etc.) were using the UTC timezone and not the one passed through the "timezone"/"time_zone" JDBC/REST params. As a consequence, the date math defined dates were always considered in UTC and possibly led to incorrect results for queries like: ``` SELECT * FROM t WHERE date BETWEEN now-1d/d AND now/d ``` Fixes: #56049 (cherry picked from commit 300f010c0b18ed0f10a41d5e1606466ba0a3088f)	2020-05-05 10:54:23 +02:00
Théophile Helleboid - chtitux	8a23da429a	Docs fix node_id spec for secure settings reload API (#55712 ) Fix docs typo for the `node_id` parameter in the secure settings reload API.	2020-05-05 11:21:02 +03:00
Dimitris Athanasiou	6061aa3db4	[7.x][ML] Fix race condition updating reindexing progress (#56135 ) (#56146 ) In #55763 I thought I could remove the flag that marks reindexing was finished on a data frame analytics task. However, that exposed a race condition. It is possible that between updating reindexing progress to 100 because we have called `DataFrameAnalyticsManager.startAnalytics()` and a call to the _stats API which updates reindexing progress via the method `DataFrameAnalyticsTask.updateReindexTaskProgress()` we end up overwriting the 100 with a lower progress value. This commit fixes this issue by bringing back the help of a `isReindexingFinished` flag as it was prior to #55763. Closes #56128 Backport of #56135	2020-05-05 10:48:42 +03:00
Albert Zaharovits	e8763bad41	Let realms gracefully terminate the authN chain (#55623 ) AuthN realms are ordered as a chain so that the credentials of a given user are verified in succession. Upon the first successful verification, the user is authenticated. Realms do however have the option to cut short this iterative process, when the credentials don't verify and the user cannot exist in any other realm. This mechanism is currently used by the Reserved and the Kerberos realm. This commit improves the early termination operation by allowing realms to gracefully terminate authentication, as if the chain has been tried out completely. Previously, early termination resulted in an authentication error which varies the response body compared to the failed authentication outcome where no realm could verify the credentials successfully. Reserved users are hence denied authentication in exactly the same way as other users are when no realm can validate their credentials.	2020-05-05 10:11:49 +03:00
István Zoltán Szabó	9bcc975bd1	[DOCS] Simplifies footnote text in DFA APIs (#56105 ) Co-authored-by: Lisa Cawley <lcawley@elastic.co>	2020-05-05 09:05:08 +02:00
Nhat Nguyen	60d097e262	Avoid copying file chunks in peer covery (#56072 ) (#56172 ) A follow-up of #55353 to avoid copying file chunks before sending them to the network layer. Relates #55353	2020-05-04 23:39:34 -04:00
Ryan Ernst	39ba06cbb2	Add dummy file for new client example snippets location (#56152 ) This file is added simply to ensure the new directory exists, so it can be added to the docs configuration.	2020-05-04 15:48:56 -07:00
Lee Hinman	8fa14b333d	[7.x] Validate non-negative priorities for V2 index templates (#56139 ) (#56163 ) Backports the following commits to 7.x: - Validate non-negative priorities for V2 index templates (#56139)	2020-05-04 16:19:13 -06:00
James Rodewig	922a80c3f4	[DOCS] Add collapsible sections to search API response (#55887 )	2020-05-04 16:57:10 -04:00
Martijn van Groningen	2ac32db607	Move includeDataStream flag from IndicesOptions to IndexNameExpressionResolver.Context (#56151 ) Backport of #56034. Move includeDataStream flag from an IndicesOptions to IndexNameExpressionResolver.Context as a dedicated field that callers to IndexNameExpressionResolver can set. Also alter indices stats api to support data streams. The rollover api uses this api and otherwise rolling over data stream does no longer work. Relates to #53100	2020-05-04 22:38:33 +02:00
Dan Hermann	9892813842	[7.x] Delay warning about missing x-pack (#56142 ) * Delay warning about missing x-pack (#54265) Currently, when monitoring is enabled in a freshly-installed cluster, the non-master nodes log a warning message indicating that master may not have x-pack installed. The message is often printed even when the master does have x-pack installed but takes some time to setup the local exporter for monitoring. This commit adds the local exporter setting `wait_master.timeout` which defaults to 30 seconds. The setting configures the time that the non-master nodes should wait for master to setup monitoring. After the time elapses, they log a message to the user about possible missing x-pack installation on master. The logging of this warning was moved from `resolveBulk()` to `openBulk()` since `resolveBulk()` is called only on cluster updates and the message might not be logged until a new cluster update occurs. Closes #40898	2020-05-04 14:16:18 -05:00
Lee Hinman	3cefe192a2	[7.x] Remove Index Templates V2 feature flag (#56123 ) (#56141 ) Backports the following commits to 7.x: - Remove Index Templates V2 feature flag (#56123)	2020-05-04 13:15:51 -06:00
Armin Braun	75d4a4def4	Fix potential NPEin Netty4Transport.stopInternal (#56080 ) (#56129 ) Closes #56068	2020-05-04 19:38:21 +02:00
Lisa Cawley	b816ab0c18	[DOCS] Synchs and links hyperparameter descriptions (#56131 )	2020-05-04 10:37:26 -07:00
Benjamin Trent	6c26de444d	[ML] reduce InferenceProcessor.Factory log spam by not parsing pipelines (#56020 ) (#56126 ) If there are ill-formed pipelines, or other pipelines are not ready to be parsed, `InferenceProcessor.Factory::accept(ClusterState)` logs warnings. This can be confusing and cause log spam. It might lead folks to think there an issue with the inference processor. Also, they would see logs for the inference processor even though they might not be using the inference processor. Leading to more confusion. Additionally, pipelines might not be parseable in this method as some processors require the new cluster state metadata before construction (e.g. `enrich` requires cluster metadata to be set before creating the processor). closes https://github.com/elastic/elasticsearch/issues/55985	2020-05-04 13:32:01 -04:00
Martijn van Groningen	6d03081560	Add auto create action (#56122 ) Backport of #55858 to 7.x branch. Currently the TransportBulkAction detects whether an index is missing and then decides whether it should be auto created. The coordination of the index creation also happens in the TransportBulkAction on the coordinating node. This change adds a new transport action that the TransportBulkAction delegates to if missing indices need to be created. The reasons for this change: * Auto creation of data streams can't occur on the coordinating node. Based on the index template (v2) either a regular index or a data stream should be created. However if the coordinating node is slow in processing cluster state updates then it may be unaware of the existence of certain index templates, which then can load to the TransportBulkAction creating an index instead of a data stream. Therefor the coordination of creating an index or data stream should occur on the master node. See #55377 * From a security perspective it is useful to know whether index creation originates from the create index api or from auto creating a new index via the bulk or index api. For example a user would be allowed to auto create an index, but not to use the create index api. The auto create action will allow security to distinguish these two different patterns of index creation. This change adds the following new transport actions: AutoCreateAction, the TransportBulkAction redirects to this action and this action will actually create the index (instead of the TransportCreateIndexAction). Later via #55377, can improve the AutoCreateAction to also determine whether an index or data stream should be created. The create_index index privilege is also modified, so that if this permission is granted then a user is also allowed to auto create indices. This change does not yet add an auto_create index privilege. A future change can introduce this new index privilege or modify an existing index / write index privilege. Relates to #53100	2020-05-04 19:10:09 +02:00
Julie Tibshirani	6b5cf1b031	For constant_keyword, make sure exists query handles missing values. (#55757 ) It's possible for a constant_keyword to have a 'null' value before any documents are seen that contain a value for the field. In this case, no documents have a value for the field, and 'exists' queries should return no documents.	2020-05-04 09:41:52 -07:00
Ross Wolf	6da686c7e0	EQL: Add match function implementation (#55182 ) * EQL: Add Match function * EQL: Add note about character classes * EQL: QueryFolderFailTests.java * EQL: Add match() fail tests * EQL: Add match tests and fix alias * EQL: Add match verifier failure tests * EQL: Reorder query folder fail tests	2020-05-04 09:34:20 -06:00
James Rodewig	4faf5a7916	[DOCS] Reformat `porter_stem` token filter (#56053 ) Makes the following changes to the `porter_stem` token filter docs: * Rewrites description and adds a Lucene link * Adds detailed analyze example * Adds an analyzer example	2020-05-04 10:39:17 -04:00
Armin Braun	e8ef44ce78	Allow Bulk Snapshot Deletes to Abort (#56009 ) (#56111 ) Making use of #55773 to simplify snapshot state machine. 1. Deletes with no in-progress snapshot now add the delete entry to the cluster state right away instead of doing a second CS update after the fist update was a NOOP. 2. If a bulk delete matches in-progress as well as completed snapshots, abort the in-progress snapshot and then move on to delete from the repository.	2020-05-04 16:21:00 +02:00
Dimitris Athanasiou	76fa5a2397	[7.x][ML] Improve cleanup for DF Analytics HLRC tests (#56101 ) (#56109 ) Adds the step of stopping all data frame analytics before deleting them to the cleanup of the corresponding HLRC tests. Closes #56097 Backport of #56101	2020-05-04 16:08:08 +03:00
Andrei Stefan	5d1bc6c89c	EQL: reject queries that use a nested field or a sub-field of a nested field (#56108 ) * Reject queries that act on nested fields or fields with nested field types in their hierarchy (#55721) (cherry picked from commit 2a024461cd9da821112953d4c6e565ea622c678b)	2020-05-04 15:50:31 +03:00
bellengao	722de7dd98	[Docs] Fix typo in match-bool-prefix-query doc (#56077 )	2020-05-04 14:19:23 +02:00
bellengao	40f99119ae	[Docs] Fix typo in getting-started-slm doc (#56075 )	2020-05-04 14:18:00 +02:00
Przemysław Witek	44f5a8ccd3	Use snapshot's latest result time rather than snapshot's creation time when creating an annotation (#56093 ) (#56103 )	2020-05-04 12:36:12 +02:00
markharwood	e197b6c45b	Analysis enhancement - add preserve_original setting in ngram-token-filter (#55432 ) (#56100 ) Authored-by: Amit Khandelwal <amitmbm87@gmail.com>	2020-05-04 11:31:28 +01:00
Christos Soulios	c65f828cb7	[7.x] Histogram field type support for ValueCount and Avg aggregations (#56099 ) Backports #55933 to 7.x Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 13:23:02 +03:00
Armin Braun	0860d1dc74	Remove Dead Code in SLM Delete Handling (#56081 ) (#56098 ) The delete response is always acknowledged. No need to handle anything else.	2020-05-04 12:22:06 +02:00
Armin Braun	e01b999ef0	Add Functionality to Consistently Read RepositoryData For CS Updates (#55773 ) (#56091 ) Using optimistic locking, add the ability to run a repository state update task with a consistent view of the current repository data. Allows for a follow-up to remove the snapshot INIT state.	2020-05-04 08:13:14 +02:00
David Roberts	31e32aa420	[TEST] Allow more warnings about multiple template matches (#56085 ) Adds some extra allowed warnings about multiple index templates matching on index creation of the same type that were added in #56038.	2020-05-03 21:07:51 +01:00
Armin Braun	3a64ecb6bf	Allow Deleting Multiple Snapshots at Once (#55474 ) (#56083 ) * Allow Deleting Multiple Snapshots at Once (#55474) Adds deleting multiple snapshots in one go without significantly changing the mechanics of snapshot deletes otherwise. This change does not yet allow mixing snapshot delete and abort. Abort is still only allowed for a single snapshot delete by exact name.	2020-05-03 20:30:58 +02:00
Dan Hermann	2061652988	Ensure auto close of HTMLStripCharFilter in HtmlStripProcessor The HtmlStripProcessor did not use a try-with resources block to ensure that the used HTMLStripCharFilter is closed.	2020-05-01 17:31:53 -05:00
William Brafford	d53c941c41	Make xpack.monitoring.enabled setting a no-op (#55617 ) (#56061 ) * Make xpack.monitoring.enabled setting a no-op This commit turns xpack.monitoring.enabled into a no-op. Mostly, this involved removing the setting from the setup for integration tests. Monitoring may introduce some complexity for test setup and teardown, so we should keep an eye out for turbulence and failures * Docs for making deprecated setting a no-op	2020-05-01 16:42:11 -04:00
David Turner	69f50fe79f	Improve same-shard allocation explanations (#56010 ) I see occasional confusion about the explanations emitted by the same-shard allocation decider, particularly amongst new users setting up a single-node cluster and trying to determine why their cluster has `yellow` health. For example: the shard cannot be allocated to the same node on which a copy of the shard already exists This is technically correct but it's quite a complicated sentence. Also, by starting with "the shard cannot be allocated" it makes it sound like this is the problem, whereas in fact this message is a good thing and users should typically focus their attention elsewhere. This commit simplifies the wording of these messages and makes them sound more positive, for example: a copy of this shard is already allocated to this node	2020-05-01 10:07:14 +01:00
Andrei Stefan	fbba65d8b3	SQL: SubSelect unresolved bugfix (#55956 ) (#56055 ) * Resolve the missing refs only after the aggregate tree is resolved (cherry picked from commit 10167b1cf2df6b074a1ba0c8e73c261ff9e9d1db)	2020-05-01 07:48:11 +03:00
Ryan Ernst	52b9d8d15e	Convert remaining license methods to isAllowed (#55908 ) (#55991 ) This commit converts the remaining isXXXAllowed methods to instead of use isAllowed with a Feature value. There are a couple other methods that are static, as well as some licensed features that check the license directly, but those will be dealt with in other followups.	2020-04-30 15:52:22 -07:00
Jason Tedor	6679c7ed95	Remove old unused release-related scripts (#56054 ) This commit removes some old and unused scripts that were related to release activities.	2020-04-30 18:16:12 -04:00
Mark Tozzi	d8eb51ed63	Wire up GeoDistanceAggregation (#55975 ) (#56042 )	2020-04-30 15:43:27 -04:00
Tim Brooks	54dbea6c65	Improve RemoteConnectionManager consistency (#55759 ) In order to iterate through remote connections, the remote connection manager maintains a local cache of connected nodes. Unfortunately this is difficult in relationship with testing as it is inherently racy in comparison to the parent connection manager map of connections. This commit improves the relationship by only returning a cached connection if it is still registered with the parent. If the connection is not open, we will go to the slow path of allocating a iterator directly from the parent.	2020-04-30 12:13:06 -06:00

... 2 3 4 5 6 ...

51607 Commits