OpenSearch

Commit Graph

Author	SHA1	Message	Date
Gordon Brown	827ece73c8	Mute MlConfigMigratorIT.testMigrateConfigs (#37374 )	2019-01-11 11:11:58 -07:00
Gordon Brown	955d3aea19	Mute testRoundRobinWithFailures (#32190 )	2019-01-11 09:38:40 -07:00
David Roberts	953fb9352f	[ML] Update error message for process update (#37363 ) When this message was first added the model debug config was the only thing that could be updated, but now more aspects of the config can be updated so the message needs to be more general.	2019-01-11 16:31:55 +00:00
Martijn van Groningen	e4391afd98	Test fix, wait for auto follower to have stopped in the background Relates to #36761	2019-01-11 17:26:17 +01:00
Benjamin Trent	19a7e0f4eb	ML: update .ml-state actions to support > 1 index (#37307 ) * ML: Updating .ml-state calls to be able to support > 1 index * Matching bulk delete behavior with dbq * Adjusting state name * refreshing indices before search * fixing line length * adjusting index expansion options	2019-01-11 08:03:41 -06:00
David Roberts	1da59db3fb	[ML] Wait for autodetect to be ready in the datafeed (#37349 ) This is a reinforcement of #37227. It turns out that persistent tasks are not made stale if the node they were running on is restarted and the master node does not notice this. The main scenario where this happens is when minimum master nodes is the same as the number of nodes in the cluster, so the cluster cannot elect a master node when any node is restarted. When an ML node restarts we need the datafeeds for any jobs that were running on that node to not just wait until the jobs are allocated, but to wait for the autodetect process of the job to start up. In the case of reassignment of the job persistent task this was dealt with by the stale status test. But in the case where a node restarts but its persistent tasks are not reassigned we need a deeper test. Fixes #36810	2019-01-11 13:22:35 +00:00
Alexander Reelsen	bbd093059f	Add whitelist to watcher HttpClient (#36817 ) This adds a configurable whitelist to the HTTP client in watcher. By default every URL is allowed to retain BWC. A dynamically configurable setting named "xpack.http.whitelist" was added that allows to configure an array of URLs, which can also contain simple regexes. Closes #29937	2019-01-11 09:22:47 +01:00
Martijn van Groningen	37493c204d	Unmuted test now that #37239 has been merged and backported. Relates to #37231	2019-01-11 09:02:46 +01:00
Ioannis Kakavas	80084138dd	[DOCS] Fix link to role mapping doc	2019-01-11 09:22:40 +02:00
markharwood	434430506b	Type removal - added deprecation warnings to _bulk apis (#36549 ) Added warnings checks to existing tests Added “defaultTypeIfNull” to DocWriteRequest interface so that Bulk requests can override a null choice of document type with any global custom choice. Related to #35190	2019-01-10 21:35:19 +00:00
Jay Modi	e6d3d85db4	Ensure latch is counted down in ssl reload test (#37313 ) This change ensures we always countdown the latch in the SSLConfigurationReloaderTests to prevent the suite from timing out in case of an exception. Additionally, we also increase the logging of the resource watcher in case an IOException occurs. See #36053	2019-01-10 13:27:25 -07:00
Costin Leau	83f7423cd6	SQL: Fix bug regarding alias fields with dots (#37279 ) Field of types aliases that have dots in name are returned without a hierarchy by field_caps, as oppose to the mapping api or field with concrete types, which in turn breaks IndexResolver. This commit fixes this by creating the backing hierarchy similar to the mapping api. Close #37224	2019-01-10 22:18:53 +02:00
David Roberts	b65006e8cd	[ML] Fix ML memory tracker for old jobs (#37311 ) Jobs created in version 6.1 or earlier can have a null model_memory_limit. If these are parsed from cluster state following a full cluster restart then we replace the null with 4096mb to make the meaning explicit. But if such jobs are streamed from an old node in a mixed version cluster this does not happen. Therefore we need to account for the possibility of a null model_memory_limit in the ML memory tracker.	2019-01-10 17:28:00 +00:00
Jay Modi	71633775fd	Security: reorder realms based on last success (#36878 ) This commit reorders the realm list for iteration based on the last successful authentication for the given principal. This is an optimization to prevent unnecessary iteration over realms if we can make a smart guess on which realm to try first.	2019-01-10 09:06:16 -07:00
Martijn van Groningen	6d81e7c3e7	[CCR] FollowingEngine should fail with 403 if operation has no seqno assigned (#37213 ) Fail with a 403 when indexing a document directly into a follower index. In order to test this change, I had to move specific assertions into a dedicated class and disable assertions for that class in the rest qa module. I think that is the right trade off.	2019-01-10 15:54:34 +01:00
Martijn van Groningen	df488720e0	[CCR] Make shard follow tasks more resilient for restarts (#37239 ) If a running shard follow task needs to be restarted and the remote connection seeds have changed then a shard follow task currently fails with a fatal error. The change creates the remote client lazily and adjusts the errors a shard follow task should retry. This issue was found in test failures in the recently added ccr rolling upgrade tests. The reason why this issue occurs more frequently in the rolling upgrade test is because ccr is setup in local mode (so remote connection seed will become stale) and all nodes are restarted, which forces the shard follow tasks to get restarted at some point during the test. Note that these tests cannot be enabled yet, because this change will need to be backported to 6.x first. (otherwise the issue still occurs on non upgraded nodes) I also changed the RestartIndexFollowingIT to setup remote cluster via persistent settings and to also restart the leader cluster. This way what happens during the ccr rolling upgrade qa tests, also happens in this test. Relates to #37231	2019-01-10 15:02:30 +01:00
Alpar Torok	3d66764660	Mute watcher SingleNodeTests Tracking: #36782	2019-01-10 12:23:29 +02:00
Martijn van Groningen	1a41d84536	[CCR] Resume follow Api should not require a request body (#37217 ) Closes #37022	2019-01-10 09:48:26 +01:00
Alexander Reelsen	b2e8437424	Tests: Add ElasticsearchAssertions.awaitLatch method (#36777 ) * Tests: Add ElasticsearchAssertions.awaitLatch method Some tests are using assertTrue(latch.await(...)) in their code. This leads to an assertion error without any error message. This adds a method which has a nicer error message and can be used in tests. * fix forbidden apis * fix spaces	2019-01-10 09:25:36 +01:00
Andrei Stefan	4a92de214a	SQL: Proper handling of COUNT(field_name) and COUNT(DISTINCT field_name) (#37254 ) * provide overriden `hashCode` and toString methods to account for `DISTINCT` * change the analyzer for scenarios where `COUNT <field_name>` and `COUNT DISTINCT` have different paths * defined a new `filter` aggregation encapsulating an `exists` query to filter out null or missing values	2019-01-10 09:51:51 +02:00
Benjamin Trent	df3b58cb04	ML: add migrate anomalies assistant (#36643 ) * ML: add migrate anomalies assistant * adjusting failure handling for reindex * Fixing request and tests * Adding tests to blacklist * adjusting test * test fix: posting data directly to the job instead of relying on datafeed * adjusting API usage * adding Todos and adjusting endpoint * Adding types to reindexRequest * removing unreliable "live" data test * adding index refresh to test * adding index refresh to test * adding index refresh to yaml test * fixing bad exists call * removing todo * Addressing remove comments * Adjusting rest endpoint name * making service have its own logger * adjusting validity check for newindex names * fixing typos * fixing renaming	2019-01-09 14:25:35 -06:00
jaymode	c71060fa01	Test: fix race in auth result propagation test This commit fixes a race condition in a test introduced by #36900 that verifies concurrent authentications get a result propagated from the first thread that attempts to authenticate. Previously, a thread may be in a state where it had not attempted to authenticate when the first thread that authenticates finishes the authentication, which would cause the test to fail as there would be an additional authentication attempt. This change adds additional latches to ensure all threads have attempted to authenticate before a result gets returned in the thread that is performing authentication.	2019-01-09 12:17:43 -07:00
Tim Brooks	cfa58a51af	Add TLS/SSL channel close timeouts (#37246 ) Closing a channel using TLS/SSL requires reading and writing a CLOSE_NOTIFY message (for pre-1.3 TLS versions). Many implementations do not actually send the CLOSE_NOTIFY message, which means we are depending on the TCP close from the other side to ensure channels are closed. In case there is an issue with this, we need a timeout. This commit adds a timeout to the channel close process for TLS secured channels. As part of this change, we need a timer service. We could use the generic Elasticsearch timeout threadpool. However, it would be nice to have a local to the nio event loop timer service dedicated to network needs. In the future this service could support read timeouts, connect timeouts, request timeouts, etc. This commit adds a basic priority queue backed service. Since our timeout volume (channel closes) is very low, this should be fine. However, this can be updated to something more efficient in the future if needed (timer wheel). Everything being local to the event loop thread makes the logic simple as no locking or synchronization is necessary.	2019-01-09 11:46:24 -07:00
Alpar Torok	6a5f3f05f4	Fix build on Fips testing convetions need to be disabled if the test task is for fips.	2019-01-09 19:27:01 +02:00
Martijn van Groningen	9122585359	[CCR] Added more logging.	2019-01-09 12:17:47 +01:00
Tanguy Leroux	f1f5d834c3	Merge branch 'close-index-api-refactoring'	2019-01-09 11:48:57 +01:00
David Roberts	e0ce73713f	[ML] Stop datafeeds running when their jobs are stale (#37227 ) We already had logic to stop datafeeds running against jobs that were OPENING, but a job that relocates from one node to another while OPENED stays OPENED, and this could cause the datafeed to fail when it sent data to the OPENED job on its new node before it had a corresponding autodetect process. This change extends the check to stop datafeeds running when their job is OPENING _or_ stale (i.e. has not had its status reset since relocating to a different node). Relates #36810	2019-01-09 10:42:47 +00:00
Tanguy Leroux	096a83183e	Merge branch 'master' into close-index-api-refactoring	2019-01-09 10:52:46 +01:00
David Roberts	f14cff2102	[TEST] Ensure interrupted flag reset after test that sets it (#37230 ) Test fix to stop a problem in one test leaking into a different test and causing that other test to spuriously fail.	2019-01-09 08:51:00 +00:00
Tanguy Leroux	7f6fe14b66	Merge branch 'master' into close-index-api-refactoring	2019-01-09 09:26:05 +01:00
Ioannis Kakavas	9049263c2c	Handle malformed license signatures (#37137 ) This commit adds a more user friendly error message when a license signature is malformed/truncated in a way that it cannot be meaningfully parsed.	2019-01-09 07:29:22 +02:00
Ioannis Kakavas	2a79c468f8	Ensure that ActionListener is called exactly once This bug was introduced in #36893 and had the effect that execution would continue after calling onFailure on the the listener in checkIfTokenIsValid in the case that the token is expired. In a case of many consecutive requests this could lead to the unwelcome side effect of an expired access token producing a successful authentication response.	2019-01-09 07:23:35 +02:00
Marios Trivyzas	5f2fbedd8c	SQL: Replace String.format() with LoggerMessageFormat.format() (#37216 ) Fixes: #36532	2019-01-08 23:56:00 +02:00
Martijn van Groningen	d6608caf55	Muted rolling upgrade tests. Relates to #37231	2019-01-08 16:52:22 +01:00
Jay Modi	1514bbcdde	Security: propagate auth result to listeners (#36900 ) After #30794, our caching realms limit each principal to a single auth attempt at a time. This prevents hammering of external servers but can cause a significant performance hit when requests need to go through a realm that takes a long time to attempt to authenticate in order to get to the realm that actually authenticates. In order to address this, this change will propagate failed results to listeners if they use the same set of credentials that the authentication attempt used. This does prevent these stalled requests from retrying the authentication attempt but the implementation does allow for new requests to retry the attempt.	2019-01-08 08:52:12 -07:00
Alpar Torok	6344e9a3ce	Testing conventions: add support for checking base classes (#36650 )	2019-01-08 13:39:03 +02:00
Tanguy Leroux	6e852dfa7c	Merge branch 'master' into close-index-api-refactoring	2019-01-08 11:28:51 +01:00
Martijn van Groningen	c980cc12df	Added CCR rolling upgrade tests (#36648 ) Added CCR rolling upgrade tests.	2019-01-08 11:05:18 +01:00
Tanguy Leroux	d70ebfd1d6	Merge branch 'master' into close-index-api-refactoring	2019-01-08 09:17:48 +01:00
Andrei Stefan	3fad9d25f6	SQL: fix COUNT DISTINCT filtering (#37176 ) * Use `_count` aggregation value only for not-DISTINCT COUNT function calls * COUNT DISTINCT will use the _exact_ version of a field (the `keyword` sub-field for example), if there is one	2019-01-08 08:47:35 +02:00
Jason Tedor	c8c596cead	Introduce retention lease expiration (#37195 ) This commit implements a straightforward approach to retention lease expiration. Namely, we inspect which leases are expired when obtaining the current leases through the replication tracker. At that moment, we clean the map that persists the retention leases in memory.	2019-01-07 22:03:52 -08:00
Benjamin Trent	6b376a1ff4	ML: fix delayed data annotations on secured cluster (#37193 ) * changing executing context for writing annotation * adjusting user * removing unused import	2019-01-07 15:18:38 -06:00
Tanguy Leroux	97bf4d7176	Merge branch 'master' into close-index-api-refactoring	2019-01-07 18:38:27 +01:00
Benjamin Trent	1780ced82d	ML: changing JobResultsProvider.getForecastRequestStats to support > 1 index (#37157 ) * ML: changing JobResultsProvider.getForecastRequestStats to support more than one index * moving to use idsQuery()	2019-01-07 10:58:55 -06:00
Christophe Bismuth	9602d794c6	Separate out validation of groups of settings (#34184 ) Today, a setting can declare that its validity depends on the values of other related settings. However, the validity of a setting is not always checked against the correct values of its dependent settings because those settings' correct values may not be available when the validator runs. This commit separates the validation of a settings updates into two phases, with separate methods on the `Setting.Validator` interface. In the first phase the setting's validity is checked in isolation, and in the second phase it is checked again against the values of its related settings. Most settings only use the first phase, and only the few settings with dependencies make use of the second phase.	2019-01-07 16:12:58 +00:00
Jason Tedor	c0f8c89172	Introduce shard history retention leases (#37167 ) This commit is the first in a series which will culminate with fully-functional shard history retention leases. Shard history retention leases are aimed at preventing shard history consumers from having to fallback to expensive file copy operations if shard history is not available from a certain point. These consumers include following indices in cross-cluster replication, and local shard recoveries. A future consumer will be the changes API. Further, index lifecycle management requires coordinating with some of these consumers otherwise it could remove the source before all consumers have finished reading all operations. The notion of shard history retention leases that we are introducing here will also be used to address this problem. Shard history retention leases are a property of the replication group managed under the authority of the primary. A shard history retention lease is a combination of an identifier, a retaining sequence number, a timestamp indicating when the lease was acquired or renewed, and a string indicating the source of the lease. Being leases they have a limited lifespan that will expire if not renewed. The idea of these leases is that all operations above the minimum of all retaining sequence numbers will be retained during merges (which would otherwise clear away operations that are soft deleted). These leases will be periodically persisted to Lucene and restored during recovery, and broadcast to replicas under certain circumstances. This commit is merely putting the basics in place. This first commit only introduces the concept and integrates their use with the soft delete retention policy. We add some tests to demonstrate the basic management is correct, and that the soft delete policy is correctly influenced by the existence of any retention leases. We make no effort in this commit to implement any of the following: - timestamps - expiration - persistence to and recovery from Lucene - handoff during primary relocation - sharing retention leases with replicas - exposing leases in shard-level statistics - integration with cross-cluster replication These will occur individually in follow-up commits.	2019-01-07 07:43:57 -08:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Tanguy Leroux	f5af79b9cd	Merge branch 'master' into close-index-api-refactoring	2019-01-07 12:43:03 +01:00
Christoph Büscher	12a105e5ef	Remove deprecated PutIndexTemplateRequestBuilder#setTemplate (#37151 ) The method has been removed since 6.0, there is a direct replacement and it is only used in tests still.	2019-01-07 10:41:04 +01:00

1 2 3 4 5 ...

2332 Commits