OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-01 16:39:11 +00:00

Author	SHA1	Message	Date
Przemysław Witek	b8a0379057	Refactor auditor-related classes (#45893 ) (#46120 )	2019-08-29 14:21:03 +02:00
David Turner	d340530a47	Avoid overshooting watermarks during relocation (#46079 ) Today the `DiskThresholdDecider` attempts to account for already-relocating shards when deciding how to allocate or relocate a shard. Its goal is to stop relocating shards onto a node before that node exceeds the low watermark, and to stop relocating shards away from a node as soon as the node drops below the high watermark. The decider handles multiple data paths by only accounting for relocating shards that affect the appropriate data path. However, this mechanism does not correctly account for _new_ relocating shards, which are unwittingly ignored. This means that we may evict far too many shards from a node above the high watermark, and may relocate far too many shards onto a node causing it to blow right past the low watermark and potentially other watermarks too. There are in fact two distinct issues that this PR fixes. New incoming shards have an unknown data path until the `ClusterInfoService` refreshes its statistics. New outgoing shards have a known data path, but we fail to account for the change of the corresponding `ShardRouting` from `STARTED` to `RELOCATING`, meaning that we fail to find the correct data path and treat the path as unknown here too. This PR also reworks the `MockDiskUsagesIT` test to avoid using fake data paths for all shards. With the changes here, the data paths are handled in tests as they are in production, except that their sizes are fake. Fixes #45177	2019-08-29 12:40:55 +01:00
Tanguy Leroux	b526309fbd	Replace MockAmazonS3 usage in S3BlobStoreRepositoryTests by a HTTP server (#46081 ) This commit removes the usage of MockAmazonS3 in S3BlobStoreRepositoryTests and replaces it with an HttpServer that emulates the S3 service. This allows the repository tests to use the real Amazon S3 client under the hood and makes it possible to test the behavior of the snapshot/restore feature for S3 repositories by simulating random server-side internal errors. The HTTP server used to emulate the S3 service is intentionally simple and minimal to keep things understandable and maintainable. Testing full client options on the server side (like authentication, chunked encoding, etc.) remains the responsibility of the AmazonS3Fixture.	2019-08-29 13:16:59 +02:00
István Zoltán Szabó	93ede78b66	Revert "[DOCS] Adds search-related query parameters to the common parameters. (#46057 )" This reverts commit 95a50ae809cf62de45d6a69c622c21b01eef42aa.	2019-08-29 11:59:21 +02:00
István Zoltán Szabó	4b086fbef2	Revert "[DOCS] Reformats URI search request (#45844 )" This reverts commit 7f11c3240018b4005f918c4850cc38ef23d7aeb8.	2019-08-29 11:58:28 +02:00
István Zoltán Szabó	95a50ae809	[DOCS] Adds search-related query parameters to the common parameters. (#46057 ) @szabosteve Merging so I can make some additions. Will incorporate the comments from @jrodewig.	2019-08-29 11:42:08 +02:00
Przemysław Witek	fbe9e8a530	Do not throw an exception if the process finished quickly but without any error. (#46073 ) (#46113 )	2019-08-29 10:47:17 +02:00
Rory Hunter	3666bcfbd8	Handle multiple loopback addresses (#46061 ) AbstractSimpleTransportTestCase.testTransportProfilesWithPortAndHost expects a host to only have a single IPv4 loopback address, which isn't necessarily the case. Allow for >= 1 address. Backport of #45901.	2019-08-29 09:45:51 +01:00
Costin Leau	867cfe0223	DOC: Update SQL docs for DbVis and Workbench/J (#45981 ) Refresh the setup for the new versions of DbVisualizer and SQL Workbench/J which have Elasticsearch JDBC support out of the box. (cherry picked from commit 6d257194c1055d060505e0faaaa37b41e21699f5)	2019-08-29 11:14:03 +03:00
István Zoltán Szabó	7f11c32400	[DOCS] Reformats URI search request (#45844 ) * [DOCS] Reformats URI search request. Co-Authored-By: James Rodewig <james.rodewig@elastic.co> Co-Authored-By: debadair <debadair@elastic.co>	2019-08-29 10:06:16 +02:00
Henning Andersen	0425f6a327	Docs _cat/health verification fix (#46064 ) The _cat/health call in getting-started assumes that the master task max wait time is always 0 (-), however, the test could sometimes run into a short wait time (like some ms). Fixed to allow this.	2019-08-29 07:52:27 +02:00
Jason Tedor	9bc4a24118	Handle delete document level failures (#46100 ) Today we assume that document failures can not occur for deletes. This assumption is bogus, as they can fail for a variety of reasons such as the Lucene index having reached the document limit. Because of this assumption, we were asserting that such a document-level failure would never happen. When this bogus assertion is violated, we fail the node, a catastrophe. Instead, we need to treat this as a fatal engine exception.	2019-08-28 22:17:16 -04:00
Tim Brooks	70507e1041	Move netty numDirectArenas to jvm.options (#46104 ) We currently configure io.netty.allocator.numDirectArenas to be 0 in the JVM ergonomics class. This is a config that we always want to set, so it makes sense to move it to jvm.options.	2019-08-28 19:30:55 -06:00
Igor Motov	28006fe19f	Fix GeoIpProcessorFactoryTests on windows (#45668 ) Switches windows build to use geoip database loaded on heap instead of memory mapping it. Closes #44552	2019-08-28 18:02:25 -04:00
Gordon Brown	47bbd9d9a9	[7.x] Fix rollover alias in SLM history index template (#46001 ) This commit adds the `rollover_alias` setting required for ILM to work correctly to the SLM history index template and adds assertions to the SLM integration tests to ensure that it works correctly.	2019-08-28 14:50:22 -07:00
Tal Levy	a356bcff41	Add Circle Processor (#43851 ) (#46097 ) add circle-processor that translates circles to polygons	2019-08-28 14:44:08 -07:00
Julie Tibshirani	d94c4dcffb	Use float instead of double for query vectors. (#46004 ) Currently, when using script_score functions like cosineSimilarity, the query vector is treated as an array of doubles. Since the stored document vectors use floats, it seems like the least surprising behavior for the query vectors to also be float arrays. In addition to improving consistency, this change may help with some optimizations we have been considering around vector dot product.	2019-08-28 11:03:14 -07:00
Jason Tedor	1249e6ba5d	Handle no-op document level failures (#46083 ) Today we assume that document failures can not occur for no-ops. This assumption is bogus, as they can fail for a variety of reasons such as the Lucene index having reached the document limit. Because of this assumption, we were asserting that such a document-level failure would never happen. When this bogus assertion is violated, we fail the node, a catastrophe. Instead, we need to treat this as a fatal engine exception.	2019-08-28 13:57:24 -04:00
Ryan Ernst	564b80303d	Fix rest-api-spec dep for external plugins (#45949 ) This commit fixes the maven coordinates for the rest-api-spec jar. It was accidentally broken by #45107. closes #45891	2019-08-28 10:54:03 -07:00
Ryan Ernst	f20969959f	Remove plugins dir reference from docs (#46047 ) While the plugin installation directory used to be settable, it has not been so for several major versions. This commit removes a lingering reference to the plugins directory in upgrade docs. closes #45889	2019-08-28 10:50:35 -07:00
Tanguy Leroux	9e14ffa8be	Few clean ups in ESBlobStoreRepositoryIntegTestCase (#46068 )	2019-08-28 16:29:46 +02:00
Martijn van Groningen	f50c7cf88b	Add XContentType as parameter to HLRC ART#createServerTestInstance (#46036 ) Add XContentType as parameter to the AbstractResponseTestCase#createServerTestInstance method. If a server-side response class serializes xcontent as bytes, then the test needs to know which xcontent type was randomly selected. This change is needed in #45970	2019-08-28 16:16:47 +02:00
Mark Tozzi	9ac85a4a2b	Fix compilation in CumulativeCardinalityAggregatorTests	2019-08-28 09:31:48 -04:00
James Rodewig	54a882ada9	[DOCS] Add index alias exists API docs (#46042 )	2019-08-28 09:13:51 -04:00
Mark Tozzi	aec125faff	Support Range Fields in Histogram and Date Histogram (#46012 ) Backport of 1a0dddf4ad24b3f2c751a1fe0e024fdbf8754f94 (AKA #445395) * Add support for a Range field ValuesSource, including decode logic for range doc values and exposing RangeType as a first class enum * Provide hooks in ValuesSourceConfig for aggregations to control ValuesSource class selection on missing & script values * Branch aggregator creation in Histogram and DateHistogram based on ValuesSource class, to enable specialization based on type. This is similar to how Terms aggregator works. * Prioritize field type when available for selecting the ValuesSource class type to use for an aggregation	2019-08-28 09:06:09 -04:00
James Rodewig	f28644c498	[DOCS] Add upgrade support matrix (#45790 )	2019-08-28 08:33:57 -04:00
Dimitris Athanasiou	25d64508f6	[7.x][ML] Support boolean fields for DF analytics (#46037 ) (#46054 ) This commit adds support for `boolean` fields in data frame analytics (and currently both outlier detection and regression). The analytics process expects `boolean` fields to be encoded as integers with 0 or 1 value.	2019-08-28 12:02:29 +03:00
Dimitris Athanasiou	bb8fcb3cac	[7.x][ML][HLRC] Add data frame analytics regression analysis (#46024 ) (#46053 )	2019-08-28 12:02:14 +03:00
Jake Landis	154d1dd962	Watcher max_iterations with foreach action execution (#45715 ) (#46039 ) Prior to this commit the foreach action execution had a hard coded limit to 100 iterations. This commit allows the max number of iterations to be a configuration ('max_iterations') on the foreach action. The default remains 100.	2019-08-27 16:57:20 -05:00
Tim Brooks	956df7be92	Reindex task state initialized before reindex (#46043 ) Currently the process of executing a reindex is tightly coupled to the step of initializing the task state. This creates problems when this process is asynchronous. It is possible that the task state has not been initialized, which prevents follow-up actions such as rethrottle. This commit separates the task initialization so that it can be executed as a first step in the persistent reindex process.	2019-08-27 15:28:04 -05:00
Tim Brooks	07f3ddb549	Extract reindexing logic from transport action (#46033 ) This commit extracts the reindexing logic from the transport action so that it can be incorporated into the persistent reindex work without requiring the usage of the client.	2019-08-27 12:28:37 -05:00
James Rodewig	ff1acf3489	[DOCS] Reformat update index settings API docs (#45931 )	2019-08-27 12:49:41 -04:00
James Rodewig	a3d7547e10	[DOCS] Separate and reformat close index API docs (#45922 )	2019-08-27 12:32:23 -04:00
Tim Brooks	ad233e3e38	Add test for CopyBytesSocketChannel (#46031 ) Currently we use a custom CopyBytesSocketChannel for interfacing with netty. We have integration tests that use this channel, however we never verify the read and write behavior in the face of potential partial writes. This commit adds a test for this behavior.	2019-08-27 11:25:22 -05:00
James Rodewig	8228a218b4	[DOCS] Reformat open index API docs (#45921 )	2019-08-27 11:44:20 -04:00
Armin Braun	fdef293c81	Fix RegressionTests#fromXContent (#46029 ) * The `trainingPercent` must be between `1` and `100`, not `0` and `100` which is causing test failures	2019-08-27 18:24:26 +03:00
Lisa Cawley	4b879848f0	[DOCS] Add 7.3.1 ml-cpp PRs to release notes (#46003 )	2019-08-27 08:10:13 -07:00
Henning Andersen	300e717e42	Disallow partial results when shard unavailable (#45739 ) Searching with `allowPartialSearchResults=false` could still return partial search results during recovery. If a shard copy fails with a "shard not available" exception, the failure would be ignored and a partial result returned. The one case where this is known to happen is when a shard copy is recovering when searching, since `IllegalIndexShardStateException` is considered a "shard not available" exception. Relates to #42612	2019-08-27 17:01:23 +02:00
Dimitris Athanasiou	873ad3f942	[7.x][ML] Add option to regression to randomize training set (#45969 ) (#46017 ) Adds a parameter `training_percent` to regression. The default value is `100`. When the parameter is set to a value less than `100`, from the rows that can be used for training (ie. those that have a value for the dependent variable) we randomly choose whether to actually use for training. This enables splitting the data into a training set and the rest, usually called testing, validation or holdout set, which allows for validating the model on data that have not been used for training. Technically, the analytics process considers as training the data that have a value for the dependent variable. Thus, when we decide a training row is not going to be used for training, we simply clear the row's dependent variable.	2019-08-27 17:53:11 +03:00
Yogesh Gaikwad	7b6246ec67	Add `manage_own_api_key` cluster privilege (#45897 ) (#46023 ) The existing privilege model for API keys, with privileges like `manage_api_key`, `manage_security` etc., is too permissive, and we want finer-grained control over the cluster privileges for API keys. Previously, created API keys would also need these privileges to get their own information. This commit adds support for the `manage_own_api_key` cluster privilege, which only allows API key cluster actions on API keys owned by the currently authenticated user. It also adds support for retrieving an API key's own information when authenticating via API key, without the need for the additional API key privileges. To support this privilege, we are introducing additional authentication context along with the request context such that it can be used to authorize cluster actions based on the current user authentication. The API key get and invalidate APIs introduce an `owner` flag that can be set to true if the API key request (Get or Invalidate) is for the API keys owned by the currently authenticated user only. In that case, `realm` and `username` cannot be set as they are assumed to be the currently authenticated ones. The changes cover HLRC changes and documentation for the API changes. Closes #40031	2019-08-28 00:44:23 +10:00
Dimitris Athanasiou	dd6c13fdf9	[ML] Add description to DF analytics (#45774 ) (#46019 )	2019-08-27 15:48:59 +03:00
Luca Cavanna	267183998e	[TEST] wait for http channels to be closed in ESIntegTestCase (#45977 ) We recently added a check to `ESIntegTestCase` in order to verify that no http channels are being tracked when we close clusters and the REST client. Close listeners though are invoked asynchronously, hence this check may fail if we assert before the close listener that removes the channel from the map is invoked. With this commit we add an `assertBusy` so we try and wait for the map to be empty. Closes #45914 Closes #45955	2019-08-27 14:00:24 +02:00
Albert Zaharovits	1ebee5bf9b	PKI realm authentication delegation (#45906 ) This commit introduces PKI realm delegation. This feature supports the PKI authentication feature in Kibana. In essence, this creates a new API endpoint which Kibana must call to authenticate clients that use certificates in their TLS connection to Kibana. The API call passes to Elasticsearch the client's certificate chain. The response contains an access token to be further used to authenticate as the client. The client's certificates are validated by the PKI realms that have been explicitly configured to permit certificates from the proxy (Kibana). The user calling the delegation API must have the delegate_pki privilege. Closes #34396	2019-08-27 14:42:46 +03:00
Ioannis Kakavas	b249e25bb4	Partly revert globalInfo.ready check (#45960 ) This check was introduced in #41392 but had the unwanted side-effect that the keystore settings in such blocks would not be added in the node's keystore. Given that we have a mid-term plan for FIPS testing that would make such checks unnecessary, and that the conditional in these two cases is not really that important, this change removes this conditional logic so that full-cluster-restart and rolling upgrade tests will run with PEM files for key/certificate material no matter if we're in a FIPS JVM or not. Resolves: #45475	2019-08-27 13:01:56 +03:00
debadair	cf34ff62ad	[DOCS] Streamline GS search topic. (#45941 ) * Streamline GS search topic. * Added missing comma. * Update docs/reference/getting-started.asciidoc Co-Authored-By: István Zoltán Szabó <istvan.szabo@elastic.co>	2019-08-26 18:29:52 -07:00
debadair	948b03856b	[DOCS] Backporting GS search & aggs updates. (#46008 ) * [DOCS] Streamlined GS aggs section. (#45951) * [DOCS] Streamlined GS aggs section. * Update docs/reference/getting-started.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * [DOCS] Fix typo. (#46006)	2019-08-26 18:24:05 -07:00
Ryan Ernst	d50d700f14	Don't use assemble task on root project (#45999 ) The root project uses the base plugin to get a clean task, but does not actually need the assemble task. This commit changes the root project to use the lifecycle-base plugin, which while still creating the assemble task, won't add any dependencies to it.	2019-08-26 16:35:11 -07:00
Nhat Nguyen	146e23a8a9	Relax translog assertion in testRestoreLocalHistoryFromTranslog (#45943 ) Since #45473, we trim translog below the local checkpoint of the safe commit immediately if soft-deletes are enabled. In testRestoreLocalHistoryFromTranslog, we should have a safe commit after recoverFromTranslog is called; then we will trim translog files which contain only operations that are at most the global checkpoint. With this change, we relax the assertion to ensure that we don't put operations to translog while recovering history from the local translog.	2019-08-26 17:19:19 -04:00
Nhat Nguyen	c66bae39c3	Update translog checkpoint after marking ops as persisted (#45634 ) If two translog syncs happen concurrently, then one can return before its operations are marked as persisted. In general, this should not be an issue; however, peer recoveries currently rely on this assumption. Closes #29161	2019-08-26 17:18:52 -04:00
Nhat Nguyen	f2e8b17696	Do not create engine under IndexShard#mutex (#45263 ) Today we create new engines under IndexShard#mutex. This is not ideal because it can block the cluster state updates which also execute under the same mutex. We can avoid this problem by creating new engines under a separate mutex. Closes #43699	2019-08-26 17:18:29 -04:00

47478 Commits