OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	a5ce1e0bec	Integrate retention leases to recovery from remote (#38829 ) This commit is the first step in integrating shard history retention leases with CCR. In this commit we integrate shard history retention leases with recovery from remote. Before we start transferring files, we take out a retention lease on the primary. Then during the file copy phase, we repeatedly renew the retention lease. Finally, when recovery from remote is complete, we disable the background renewing of the retention lease.	2019-02-16 15:37:52 -05:00
Tim Brooks	b1c1daa63f	Add get file chunk timeouts with listener timeouts (#38758 ) This commit adds a `ListenerTimeouts` class that will wrap a `ActionListener` in a listener with a timeout scheduled on the generic thread pool. If the timeout expires before the listener is completed, `onFailure` will be called with an `ElasticsearchTimeoutException`. Timeouts for the get ccr file chunk action are implemented using this functionality. Additionally, this commit attempts to fix #38027 by also blocking proxied get ccr file chunk actions. This test being un-muted is useful to verify the timeout functionality.	2019-02-16 10:56:03 -07:00
Jason Tedor	d80325f288	Mark fail over on follower test as awaits fix This test is failing since the introduction of recovery from remote. This commit marks this test as awaits fix.	2019-02-16 12:28:16 -05:00
Nhat Nguyen	7e20a92888	Advance max_seq_no before add operation to Lucene (#38879 ) Today when processing an operation on a replica engine (or the following engine), we first add it to Lucene, then add it to translog, then finally marks its seq_no as completed. If a flush occurs after step1, but before step-3, the max_seq_no in the commit's user_data will be smaller than the seq_no of some documents in the Lucene commit.	2019-02-15 21:04:28 -05:00
Nhat Nguyen	20755e666c	Reduce global checkpoint sync interval in disruption tests (#38931 ) We verify seq_no_stats is aligned between copies at the end of some disruption tests. Sometimes, the assertion `assertSeqNos` is tripped due to a lagged global checkpoint on replicas. The global checkpoint on replicas is lagged because we sync the global checkpoint 30 seconds (by default) after the last replication operation. This change reduces the global checkpoint sync-internal to 1s in the disruption tests. Closes #38318 Closes #36789	2019-02-15 21:04:20 -05:00
Jason Tedor	58551198d5	Address some CCR REST test case flakiness (#38975 ) The CCR REST tests that rely on these assertions are flaky. They are flaky since the introduction of recovery from the remote. The underlying problem is this: these tests are making assertions about the number of operations read by the shard following task. However, with recovery from remote, we no longer have guarantees that the assumptions these tests were relying on hold. Namely, these tests were assuming that the only way that a document could land in the follower index is via the shard following task. With recovery from remote, there is another way, which is via the files that are copied over during the recovery phase. Most of the time this will not be a problem because with the small number of documents that we are indexing in these tests, it is usally not the case that a flush would occur and so there would not be any documents in the files copied over. However, a flush can occur any time at which point all of the indexed documents could end up in a safe commit and copied over during recovery from remote. This commit modifies these assertions to ones that are not prone to this issue, yet still validate the health of the follower shard.	2019-02-15 16:01:02 -05:00
Martijn van Groningen	03b67b3ee1	Introduced class reuses follow parameter code between ShardFollowTasks (#38910 ) and AutoFollowPattern classes. The ImmutableFollowParameters is like the already existing FollowParameters, but all of its fields are final.	2019-02-15 18:26:15 +01:00
iverase	b19b778cbb	[CI] Muting method testFollowIndex in IndexFollowingIT Relates to #38949	2019-02-15 16:07:45 +01:00
Yogesh Gaikwad	36c274867e	Fix intermittent failure in ApiKeyIntegTests (#38627 ) (#38935 ) Few tests failed intermittently and most of the times due to invalidated or expired keys that were deleted were still reported in search results. This commit removes the test and adds enhancements to other tests testing different scenario's. When ExpiredApiKeysRemover is triggered, the tests did not await its termination thereby sometimes the results would be wrong for a search operation. DELETE_INTERVAL setting has been further reduced to 100ms so we can trigger ExpiredApiKeysRemover faster. Closes #38408	2019-02-15 23:01:35 +11:00
Yannick Welsch	d55e52223f	Smarter CCR concurrent file chunk fetching (#38841 ) The previous logic for concurrent file chunk fetching did not allow for multiple chunks from the same file to be fetched in parallel. The parallelism only allowed to fetch chunks from different files in parallel. This required complex logic on the follower to be aware from which file it was already fetching information, in order to ensure that chunks for the same file would be fetched in sequential order. During benchmarking, this exhibited throughput issues when recovery came towards the end, where it would only be sequentially fetching chunks for the same largest segment file, with throughput considerably going down in a high-latency network as there was no parallelism anymore. The new logic here follows the peer recovery model more closely, and sends multiple requests for the same file in parallel, and then reorders the results as necessary. Benchmarks show that this leads to better overall throughput and the implementation is also simpler.	2019-02-15 07:51:58 +01:00
Shaunak Kashyap	1f74ba2d33	[Monitoring] Remove `include_type_name` parameter from GET _template request (#38925 ) Backport of #38818 to `7.x`. Original description: The HTTP exporter code in the Monitoring plugin makes `GET _template` requests to check for existence of templates. These requests don't need to pass the `include_type_name` query parameter so this PR removes it from the request. This should remove the following deprecation log entries on the Monitoring cluster in 7.0.0 onwards: ``` [types removal] Specifying include_type_name in get index template requests is deprecated. ```	2019-02-14 16:09:52 -08:00
Jay Modi	5d06226507	Fix writing of SecurityFeatureSetUsage to pre-7.1 (#38922 ) This change makes the writing of new usage data conditional based on the version that is being written to. A test has also been added to ensure serialization works as expected to an older version. Relates #38687, #38917	2019-02-14 16:28:52 -07:00
Jay Modi	e59b7b696a	Use consistent view of realms for authentication (#38815 ) This change updates the authentication service to use a consistent view of the realms based on the license state at the start of authentication. Without this, the license can change during authentication of a request and it will result in a failure if the realm that extracted the token is no longer in the realm list. This manifests in some tests as an authentication failure that should never really happen; one example would be the test framework's transport client user should always have a succesful authentication but in the LicensingTests this can fail and will show up as a NoNodeAvailableException. Additionally, the licensing tests have been updated to ensure that there is consistency when changing the license. The license is changed by modifying the internal xpack license state on each node, which has no protection against be changed by some pending cluster action. The methods to disable and enable now ensure we have a green cluster and that the cluster is consistent before returning. Closes #30301	2019-02-14 07:49:14 -07:00
Albert Zaharovits	6243a9797f	_cat/indices with Security, hide names when wildcard (#38824 ) This changes the output of the `_cat/indices` API with `Security` enabled. It is possible to only display the index name (and possibly the index health, depending on the request options) but not its stats (doc count, merges, size, etc). This is the case for closed indices which have index metadata in the cluster state but no associated shards, hence no shard stats. However, when `Security` is enabled, and the request contains wildcards, open indices without stats are a common occurrence. This is because the index names in the response table are picked up directly from the cluster state which is not filtered by `Security`'s _indexNameExpressionResolver_, unlike the stats data which is populated by the indices stats API which does go through the index name resolver. This is a bug, because it is circumventing `Security`'s function to hide unauthorized indices. This has been fixed by displaying the index names as they are resolved by the indices stats API. The outputs of these two APIs is now very similar: same index names, similar data but different format. Closes #37190	2019-02-14 15:09:17 +02:00
Andrei Stefan	7d78f4641b	SQL: fall back to using the field name for column label (#38842 ) (cherry picked from commit 0567bf24957be477e7649cff94872b0e7dc4d284)	2019-02-14 14:10:59 +02:00
Yogesh Gaikwad	335cf91bb9	Add enabled status for token and api key service (#38687 ) (#38882 ) Right now there is no way to determine whether the token service or API key service is enabled or not. This commit adds support for the enabled status of token and API key service to the security feature set usage API `/_xpack/usage`. Closes #38535	2019-02-14 23:08:52 +11:00
Martijn van Groningen	96e7d71948	Handle the fact that `ShardStats` instance may have no commit or seqno stats (#38782 ) The should fix the following NPE: ``` [2019-02-11T23:27:48,452][WARN ][o.e.p.PersistentTasksNodeService] [node_s_0] task kD8YzUhHTK6uKNBNQI-1ZQ-0 failed with an exception 1> java.lang.NullPointerException: null 1> at org.elasticsearch.xpack.ccr.action.ShardFollowTasksExecutor.lambda$fetchFollowerShardInfo$7(ShardFollowTasksExecutor.java:305) ~[main/:?] 1> at org.elasticsearch.action.ActionListener$1.onResponse(ActionListener.java:61) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.TransportAction$1.onResponse(TransportAction.java:68) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.TransportAction$1.onResponse(TransportAction.java:64) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$AsyncAction.onCompletion(TransportBroadcastByNodeAction.java:383) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$AsyncAction.onNodeResponse(TransportBroadcastByNodeAction.java:352) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$AsyncAction$1.handleResponse(TransportBroadcastByNodeAction.java:324) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$AsyncAction$1.handleResponse(TransportBroadcastByNodeAction.java:314) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.handleResponse(TransportService.java:1108) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.TransportService$DirectResponseChannel.processResponse(TransportService.java:1189) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.TransportService$DirectResponseChannel.sendResponse(TransportService.java:1169) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.TaskTransportChannel.sendResponse(TaskTransportChannel.java:54) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.messageReceived(TransportBroadcastByNodeAction.java:417) [elasticsearch-8.0.0-SNAP SHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.action.support.broadcast.node.TransportBroadcastByNodeAction$BroadcastByNodeTransportRequestHandler.messageReceived(TransportBroadcastByNodeAction.java:391) [elasticsearch-8.0.0-SNAP SHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:63) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.transport.TransportService$7.doRun(TransportService.java:687) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:751) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:37) [elasticsearch-8.0.0-SNAPSHOT.jar:8.0.0-SNAPSHOT] 1> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_202] 1> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_202] 1> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_202] ``` Relates to #38779	2019-02-14 13:05:21 +01:00
Dimitris Athanasiou	21f76aba28	[ML] Extract base class for integ tests with native processes (#38850 ) (#38860 )	2019-02-14 12:15:00 +02:00
Martijn van Groningen	88489a3f3a	Backport rolling upgrade multi cluster module (#38859 ) * Add rolling upgrade multi cluster test module (#38277) This test starts 2 clusters, each with 3 nodes. First the leader cluster is started and tests are run against it and then the follower cluster is started and tests execute against this two cluster. Then the follower cluster is upgraded, one node at a time. After that the leader cluster is upgraded, one node at a time. Every time a node is upgraded tests are ran while both clusters are online. (and either leader cluster has mixed node versions or the follower cluster) This commit only tests CCR index following, but could be used for CCS tests as well. In particular for CCR, unidirectional index following is tested during a rolling upgrade. During the test several indices are created and followed in the leader cluster before or while the follower cluster is being upgraded. This tests also verifies that attempting to follow an index in the upgraded cluster from the not upgraded cluster fails. After both clusters are upgraded following the index that previously failed should succeed. Relates to #37231 and #38037 * Filter out upgraded version index settings when starting index following (#38838) The `index.version.upgraded` and `index.version.upgraded_string` are likely to be different between leader and follower index. In the event that a follower index gets restored on a upgraded node while the leader index is still on non-upgraded nodes. Closes #38835	2019-02-14 08:12:14 +01:00
Lee Hinman	60c1dcde88	Only flush Watcher's bulk processor if Watcher is enabled (#38803 ) When shutting down Watcher, the `bulkProcessor` is null if watcher has been disabled in the configuration. This protects the flush and close calls with a check for watcher enabled to avoid a NullPointerException Resolves #38798	2019-02-13 16:13:53 -07:00
Tim Brooks	ec08581319	Improve CcrRepositoryIT mappings tests (#38817 ) Currently we index documents concurrently to attempt to ensure that we update mappings during the restore process. However, this does not actually test that the mapping will be correct and is dangerous as it can lead to a misalignment between the max sequence number and the local checkpoint. If these are not aligned, peer recovery cannot be completed without initiating following which this test does not do. That causes teardown assertions to fail. This commit removes the concurrent indexing and flushes after the documents are indexed. Additionally it modifies the mapping specific test to ensure that there is a mapping update when the restore session is initiated. This mapping update is picked up at the end of the restore by the follower.	2019-02-13 13:47:10 -07:00
Julie Tibshirani	e769cb4efd	Perform precise check for types warnings in cluster restart tests. (#37944 ) Instead of using `WarningsHandler.PERMISSIVE`, we only match warnings that are due to types removal. This PR also renames `allowTypeRemovalWarnings` to `allowTypesRemovalWarnings`. Relates to #37920.	2019-02-13 11:28:58 -08:00
Benjamin Trent	d2ac05e249	ML allow aliased .ml-anomalies* index on PUT Job (#38821 ) (#38847 )	2019-02-13 10:58:55 -06:00
Jake Landis	46bb663a09	Make 7.x like 6.7 user agent ecs, but default to true (#38828 ) Forward port of https://github.com/elastic/elasticsearch/pull/38757 This change reverts the initial 7.0 commits and replaces them with the 6.7 variant that still allows for the ecs flag. This commit differs from the 6.7 variants in that ecs flag will now default to true. 6.7: `ecs` : default `false` 7.x: `ecs` : default `true` 8.0: no option, but behaves as `true` * Revert "Ingest node - user agent, move device to an object (#38115)" This reverts commit `5b008a34aa`. * Revert "Add ECS schema for user-agent ingest processor (#37727) (#37984)" This reverts commit `cac6b8e06f`. * cherry-pick 5dfe1935345da3799931fd4a3ebe0b6aa9c17f57 Add ECS schema for user-agent ingest processor (#37727) * cherry-pick ec8ddc890a34853ee8db6af66f608b0ad0cd1099 Ingest node - user agent, move device to an object (#38115) (#38121) * cherry-pick f63cbdb9b426ba24ee4d987ca767ca05a22f2fbb (with manual merge fixes) Dep. check for ECS changes to User Agent processor (#38362) * make true the default for the ecs option, and update 7.0 references and tests	2019-02-13 10:28:01 -06:00
Przemyslaw Gomulka	542ee5f46a	Format Watcher.status.lastChecked and lastMetCondition (#38788 ) backport#38626 Change the formatting for Watcher.status.lastCheck and lastMetCondition to be the same as Watcher.status.state.timestamp. These should all have only millisecond precision closes #38619 backport #38626	2019-02-13 08:33:53 +01:00
Shaunak Kashyap	a9178b3239	Remove _type term filters from cluster alert watches (#38819 ) (#38826 ) Backport of https://github.com/elastic/elasticsearch/pull/38819. Original message: This PR removes usages of the `_type` field in `_search` requests issued from Monitoring code.	2019-02-12 19:54:36 -08:00
Nhat Nguyen	a3f39741be	Adjust log and unmute testFailOverOnFollower (#38762 ) There were two documents (seq=2 and seq=103) missing on the follower in one of the failures of `testFailOverOnFollower`. I spent several hours on that failure but could not figure out the reason. I adjust log and unmute this test so we can collect more information. Relates #38633	2019-02-12 11:42:25 -05:00
Jay Modi	f04bd4a07e	Remove TLSv1.2 pinning in ssl reload tests (#38651 ) This change removes the pinning of TLSv1.2 in the SSLConfigurationReloaderTests that had been added to workaround an issue with the MockWebServer and Apache HttpClient when using TLSv1.3. The way HttpClient closes the socket causes issues with the TLSv1.3 SSLEngine implementation that causes the MockWebServer to loop endlessly trying to send the close message back to the client. This change wraps the created http connection in a way that allows us to override the closing behavior of HttpClient. An upstream request with HttpClient has been opened at https://issues.apache.org/jira/browse/HTTPCORE-571 to see if the method of closing can be special cased for SSLSocket instances. This is caused by a JDK bug, JDK-8214418 which is fixed by https://hg.openjdk.java.net/jdk/jdk12/rev/5022a4915fe9. Relates #38646	2019-02-12 09:18:04 -07:00
Martijn van Groningen	40d5beaf41	muted test Relates to #38779	2019-02-12 16:54:54 +01:00
Marios Trivyzas	032bcf99d6	SQL: Implement `::` cast operator (#38774 ) `<expression>::<dataType>` is a simplified altenative syntax to `CAST(<expression> AS <dataType> which exists in PostgreSQL and provides an improved user experience and possibly more compact SQL queries. Fixes: #38717	2019-02-12 16:54:14 +02:00
Przemyslaw Gomulka	7e178aa4a7	Enable IndexActionTests and WatcherIndexingListenerTests Backport #38738 fix tests to use clock in milliseconds precision in watcher code make sure the date comparison in string format is using same formatters some of the code was modified in #38514 possibly because of merge conflicts closes #38581 Backport #38738	2019-02-12 13:05:44 +01:00
Alexander Reelsen	6ae7915b9d	Fix exporter tests to have reasonable dates (#38436 ) The java time formatter used in the exporter adds a plus sign to the year, if a year with more than five digits is used. This changes the creation of those timestamp to only have a date up to 9999. Closes #38378	2019-02-12 10:39:44 +01:00
Martijn van Groningen	6290d59ffa	Use clear cluster names in order to make debugging easier. Relates to #37681	2019-02-12 10:19:39 +01:00
Yannick Welsch	bafc709326	Fix CCR concurrent file chunk fetching bug (#38736 ) Fixes a bug with concurrent file chunk fetching during recovery from remote where the wrong offset was used.	2019-02-11 19:15:57 +01:00
Tanguy Leroux	dc212de822	Specialize pre-closing checks for engine implementations (#38702 ) (#38722 ) The Close Index API has been refactored in 6.7.0 and it now performs pre-closing sanity checks on shards before an index is closed: the maximum sequence number must be equals to the global checkpoint. While this is a strong requirement for regular shards, we identified the need to relax this check in the case of CCR following shards. The following shards are not in charge of managing the max sequence number or global checkpoint, which are pulled from a leader shard. They also fetch and process batches of operations from the leader in an unordered way, potentially leaving gaps in the history of ops. If the following shard lags a lot it's possible that the global checkpoint and max seq number never get in sync, preventing the following shard to be closed and a new PUT Follow action to be issued on this shard (which is our recommended way to resume/restart a CCR following). This commit allows each Engine implementation to define the specific verification it must perform before closing the index. In order to allow following/frozen/closed shards to be closed whatever the max seq number or global checkpoint are, the FollowingEngine and ReadOnlyEngine do not perform any check before the index is closed. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2019-02-11 17:34:17 +01:00
Luca Cavanna	6443b46184	Clean up ShardSearchLocalRequest (#38574 ) Added a constructor accepting `StreamInput` as argument, which allowed to make most of the instance members final as well as remove the default constructor. Removed a test only constructor in favour of invoking the existing constructor that takes a `SearchRequest` as first argument. Also removed profile members and related methods as they were all unused.	2019-02-11 15:55:46 +01:00
Martijn van Groningen	92201ef563	Catch AlreadyClosedException and use other IndexShard instance (#38630 ) Closes #38617	2019-02-11 15:36:48 +01:00
Andrei Stefan	b3695750bc	Randomize the time zone properly for the current date test. (#38670 ) (cherry picked from commit 29abbb8a590cdf4f9e0c0b447d6694bb7223648e)	2019-02-11 14:25:02 +02:00
Przemyslaw Gomulka	ba9a4d13e1	mute Failing tests related to logging and joda-java migration backport(#38704 )(#38710 ) the tests awaits fix from #38693 and #38705 and #38581	2019-02-11 13:15:12 +01:00
Przemyslaw Gomulka	ab9e2f2e69	Move testToUtc test to DateFormattersTests #38698 Backport #38610 The test was relying on toString in ZonedDateTime which is different to what is formatted by strict_date_time when milliseconds are 0 The method is just delegating to dateFormatter, so that scenario should be covered there. closes #38359 Backport #38610	2019-02-11 11:34:25 +01:00
Ioannis Kakavas	8c624e5a20	Enhance parsing of StatusCode in SAML Responses (#38628 ) * Enhance parsing of StatusCode in SAML Responses <Status> elements in a failed response might contain two nested <StatusCode> elements. We currently only parse the first one in order to create a message that we attach to the Exception we return and log. However this is generic and only gives out informarion about whether the SAML IDP believes it's an error with the request or if it couldn't handle the request for other reasons. The encapsulated StatusCode has a more interesting error message that potentially gives out the actual error as in Invalid nameid policy, authentication failure etc. This change ensures that we print that information also, and removes Message and Details fields from the message when these are not part of the Status element (which quite often is the case)	2019-02-11 11:55:26 +02:00
Martijn van Groningen	a29bf2585e	Added unit test for FollowParameters class (#38500 ) (#38690 ) A unit test that tests FollowParameters directly was missing.	2019-02-11 10:53:04 +01:00
Przemyslaw Gomulka	0e5a734e7e	Fix HistoryIntegrationTests timestamp comparison #38565 Backport#38505 When the millisecond part of a timestamp is 0 the toString representation in java-time is omitting the millisecond part (joda was not). The Search response is returning timestamps formatted with WatcherDateTimeUtils, therefore comparisons of strings should be done with the same formatter relates #27330 BackPort #38505	2019-02-11 08:50:21 +01:00
Martijn van Groningen	4625807505	Reuse FollowParameters' parse fields. (#38508 )	2019-02-11 08:46:36 +01:00
Martijn van Groningen	e213ad3e88	Mute test. Relates to #38695	2019-02-11 08:32:42 +01:00
Tim Vernum	273edea712	Mute testExpiredApiKeysDeletedAfter1Week (#38683 ) Tracked: #38408	2019-02-11 16:50:10 +11:00
Tim Brooks	023e3c207a	Concurrent file chunk fetching for CCR restore (#38656 ) Adds the ability to fetch chunks from different files in parallel, configurable using the new `ccr.indices.recovery.max_concurrent_file_chunks` setting, which defaults to 5 in this PR. The implementation uses the parallel file writer functionality that is also used by peer recoveries.	2019-02-09 21:19:57 -07:00
Nhat Nguyen	c202900915	Retry on wait_for_metada_version timeout (#38521 ) Closes #37807 Backport of #38521	2019-02-09 19:51:58 -05:00
Costin Leau	794ee4fb10	SQL: Prevent grouping over grouping functions (#38649 ) Improve verifier to disallow grouping over grouping functions (e.g. HISTOGRAM over HISTOGRAM). Close #38308 (cherry picked from commit 4e9b1cfd4df38c652bba36b4b4b538ce7c714b6e)	2019-02-09 09:30:06 +02:00
Marios Trivyzas	871036bd21	SQL: Relax StackOverflow circuit breaker for constants (#38572 ) Constant numbers (of any form: integers, decimals, negatives, scientific) and strings shouldn't increase the depth counters as they don't contribute to the increment of the stack depth. Fixes: #38571	2019-02-09 09:18:21 +02:00
Marios Trivyzas	af8a444caa	SQL: Replace joda with java time (#38437 ) Replace remaining usages of joda classes with java time. Fixes: #37703	2019-02-08 22:58:07 +02:00
Benjamin Trent	24a8ea06f5	ML: update set_upgrade_mode, add logging (#38372 ) (#38538 ) * ML: update set_upgrade_mode, add logging * Attempt to fix datafeed isolation Also renamed a few methods/variables for clarity and added some comments	2019-02-08 12:56:04 -06:00
Christoph Büscher	d03b386f6a	Mute FollowerFailOverIT testFailOverOnFollower (#38634 ) Relates to #38633	2019-02-08 17:20:30 +01:00
Andrei Stefan	6359d988f0	Account for a possible rolled over file while reading the audit log file (#34909 ) (cherry picked from commit 75cb6b38ed67dc9d32c9291b0c174ffa94e473bc)	2019-02-08 17:49:00 +02:00
Christoph Büscher	779673c792	Mute failing WatchStatusIntegrationTests (#38621 ) Relates to #38619	2019-02-08 13:56:47 +01:00
Christoph Büscher	5180b36547	Mute failing ApiKeyIntegTests (#38614 )	2019-02-08 13:04:17 +01:00
Jason Tedor	fdf6b3f23f	Add 7.1 version constant to 7.x branch (#38513 ) This commit adds the 7.1 version constant to the 7.x branch. Co-authored-by: Andy Bristol <andy.bristol@elastic.co> Co-authored-by: Tim Brooks <tim@uncontended.net> Co-authored-by: Christoph Büscher <cbuescher@posteo.de> Co-authored-by: Luca Cavanna <javanna@users.noreply.github.com> Co-authored-by: markharwood <markharwood@gmail.com> Co-authored-by: Ioannis Kakavas <ioannis@elastic.co> Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co> Co-authored-by: David Roberts <dave.roberts@elastic.co> Co-authored-by: Jason Tedor <jason@tedor.me> Co-authored-by: Alpar Torok <torokalpar@gmail.com> Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Tim Vernum <tim@adjective.org> Co-authored-by: Albert Zaharovits <albert.zaharovits@gmail.com>	2019-02-07 16:32:27 -05:00
Marios Trivyzas	f96bd2ad71	SQL: Fix issue with IN not resolving to underlying keyword field (#38440 ) - Add resolution to the exact keyword field (if exists) for text fields. - Add proper verification and error message if underlying keyword doesn'texist. - Move check for field attribute in the comparison list to the `resolveType()` method of `IN`. Fixes: #38424	2019-02-06 16:25:06 +02:00
David Turner	5a3c452480	Align docs etc with new discovery setting names (#38492 ) In #38333 and #38350 we moved away from the `discovery.zen` settings namespace since these settings have an effect even though Zen Discovery itself is being phased out. This change aligns the documentation and the names of related classes and methods with the newly-introduced naming conventions.	2019-02-06 11:34:38 +00:00
Costin Leau	1a02445ae1	SQL: Allow look-ahead resolution of aliases for WHERE clause (#38450 ) Aliases defined in SELECT (Project or Aggregate) are now resolved in the following WHERE clause. The Analyzer has been enhanced to identify this rule and replace the field accordingly. Close #29983	2019-02-06 12:08:32 +02:00
Yogesh Gaikwad	6ff4a8cfd5	Add API key settings documentation (#38490 ) This commit adds missing API key service settings documentation.	2019-02-06 20:58:22 +11:00
Luca Cavanna	a7046e001c	Remove support for maxRetryTimeout from low-level REST client (#38085 ) We have had various reports of problems caused by the maxRetryTimeout setting in the low-level REST client. Such setting was initially added in the attempts to not have requests go through retries if the request already took longer than the provided timeout. The implementation was problematic though as such timeout would also expire in the first request attempt (see #31834), would leave the request executing after expiration causing memory leaks (see #33342), and would not take into account the http client internal queuing (see #25951). Given all these issues, it seems that this custom timeout mechanism gives little benefits while causing a lot of harm. We should rather rely on connect and socket timeout exposed by the underlying http client and accept that a request can overall take longer than the configured timeout, which is the case even with a single retry anyways. This commit removes the `maxRetryTimeout` setting and all of its usages.	2019-02-06 08:43:47 +01:00
Yogesh Gaikwad	5261673349	Change the min supported version to 6.7.0 for API keys (#38481 ) This commit changes the minimum supported version to 6.7.0 for API keys, the change for the API keys has been backported to 6.7.0 version #38399	2019-02-06 16:03:49 +11:00
Jay Modi	e73c9c90ee	Add an authentication cache for API keys (#38469 ) This commit adds an authentication cache for API keys that caches the hash of an API key with a faster hash. This will enable better performance when API keys are used for bulk or heavy searching.	2019-02-05 18:16:26 -07:00
Yogesh Gaikwad	57600c5acb	Enable logs for intermittent test failure (#38426 ) I have not been able to reproduce the failing test scenario locally for #38408 and there are other similar tests which are running fine in the same test class. I am re-enabling the test with additional logs so that we can debug further on what's happening. I will keep the issue open for now and look out for the builds to see if there are any related failures.	2019-02-06 11:21:54 +11:00
Martijn van Groningen	8972ebabdd	Enable bwc tests now that #38443 is backported. (#38462 )	2019-02-06 00:04:43 +01:00
Tim Brooks	fb0ec26fd4	Set update mappings mater node timeout to 30 min (#38439 ) This is related to #35975. We do not want a slow master to fail a recovery from remote process due to a slow put mappings call. This commit increases the master node timeout on this call to 30 mins.	2019-02-05 16:22:11 -06:00
Marios Trivyzas	2c30501c74	SQL: Fix esType for DATETIME/DATE and INTERVALS (#38179 ) Since introduction of data types that don't have a corresponding type in ES the `esType` is error-prone when used for `unmappedType()` calls. Moreover since the renaming of `DATE` to `DATETIME` and the introduction of an actual date-only `DATE` the `esType` would return `datetime` which is not a valid type for ES mapping. Fixes: #38051	2019-02-05 23:12:52 +02:00
Przemyslaw Gomulka	afcdbd2bc0	XPack: core/ccr/Security-cli migration to java-time (#38415 ) part of the migrating joda time work. refactoring x-pack plugins usages of joda to java-time refers #27330	2019-02-05 22:09:32 +01:00
Jay Modi	7ca5495d86	Allow custom authorization with an authorization engine (#38358 ) For some users, the built in authorization mechanism does not fit their needs and no feature that we offer would allow them to control the authorization process to meet their needs. In order to support this, a concept of an AuthorizationEngine is being introduced, which can be provided using the security extension mechanism. An AuthorizationEngine is responsible for making the authorization decisions about a request. The engine is responsible for knowing how to authorize and can be backed by whatever mechanism a user wants. The default mechanism is one backed by roles to provide the authorization decisions. The AuthorizationEngine will be called by the AuthorizationService, which handles more of the internal workings that apply in general to authorization within Elasticsearch. In order to support external authorization services that would back an authorization engine, the entire authorization process has become asynchronous, which also includes all calls to the AuthorizationEngine. The use of roles also leaked out of the AuthorizationService in our existing code that is not specifically related to roles so this also needed to be addressed. RequestInterceptor instances sometimes used a role to ensure a user was not attempting to escalate their privileges. Addressing this leakage of roles meant that the RequestInterceptor execution needed to move within the AuthorizationService and that AuthorizationEngines needed to support detection of whether a user has more privileges on a name than another. The second area where roles leaked to the user is in the handling of a few privilege APIs that could be used to retrieve the user's privileges or ask if a user has privileges to perform an action. To remove the leakage of roles from these actions, the AuthorizationService and AuthorizationEngine gained methods that enabled an AuthorizationEngine to return the response for these APIs. Ultimately this feature is the work included in: #37785 #37495 #37328 #36245 #38137 #38219 Closes #32435	2019-02-05 13:39:29 -07:00
Boaz Leskes	033ba725af	Remove support for internal versioning for concurrency control (#38254 ) Elasticsearch has long [supported](https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-index_.html#index-versioning) compare and set (a.k.a optimistic concurrency control) operations using internal document versioning. Sadly that approach is flawed and can sometime do the wrong thing. Here's the relevant excerpt from the resiliency status page: > When a primary has been partitioned away from the cluster there is a short period of time until it detects this. During that time it will continue indexing writes locally, thereby updating document versions. When it tries to replicate the operation, however, it will discover that it is partitioned away. It won’t acknowledge the write and will wait until the partition is resolved to negotiate with the master on how to proceed. The master will decide to either fail any replicas which failed to index the operations on the primary or tell the primary that it has to step down because a new primary has been chosen in the meantime. Since the old primary has already written documents, clients may already have read from the old primary before it shuts itself down. The version numbers of these reads may not be unique if the new primary has already accepted writes for the same document We recently [introduced](https://www.elastic.co/guide/en/elasticsearch/reference/6.x/optimistic-concurrency-control.html) a new sequence number based approach that doesn't suffer from this dirty reads problem. This commit removes support for internal versioning as a concurrency control mechanism in favor of the sequence number approach. Relates to #1078	2019-02-05 20:53:35 +01:00
Tim Brooks	4a15e2b29e	Make Ccr recovery file chunk size configurable (#38370 ) This commit adds a byte setting `ccr.indices.recovery.chunk_size`. This setting configs the size of file chunk requested while recovering from remote.	2019-02-05 13:34:00 -06:00
Tim Brooks	c2a8fe1f91	Prevent CCR recovery from missing documents (#38237 ) Currently the snapshot/restore process manually sets the global checkpoint to the max sequence number from the restored segements. This does not work for Ccr as this will lead to documents that would be recovered in the normal followering operation from being recovered. This commit fixes this issue by setting the initial global checkpoint to the existing local checkpoint.	2019-02-05 13:32:41 -06:00
Julie Tibshirani	3ce7d2c9b6	Make sure to reject mappings with type _doc when include_type_name is false. (#38270 ) `CreateIndexRequest#source(Map<String, Object>, ... )`, which is used when deserializing index creation requests, accidentally accepts mappings that are nested twice under the type key (as described in the bug report #38266). This in turn causes us to be too lenient in parsing typeless mappings. In particular, we accept the following index creation request, even though it should not contain the type key `_doc`: ``` PUT index?include_type_name=false { "mappings": { "_doc": { "properties": { ... } } } } ``` There is a similar issue for both 'put templates' and 'put mappings' requests as well. This PR makes the minimal changes to detect and reject these typed mappings in requests. It does not address #38266 generally, or attempt a larger refactor around types in these server-side requests, as I think this should be done at a later time.	2019-02-05 10:52:32 -08:00
Christoph Büscher	ca47f68091	Ignore type-removal warnings in XPackRestTestHelper (#38431 ) The backport of #38022 introduced types-deprecation warning for get/put template requests that cause problems on tests master in mixed cluster scenarios. While these warnings are caught and ignored in regular Rest tests, the get template requests in XPackRestTestHelper were missed. Closes #38412	2019-02-05 19:07:53 +01:00
Zachary Tong	54e684bedd	testHlrcFromXContent() should respect assertToXContentEquivalence() (#38232 ) Tests can override assertToXContentEquivalence() in case their xcontent cannot be directly compared (e.g. due to insertion order in maps affecting the xcontent ordering). But the `testHlrcFromXContent` test hardcoded the equivalence test to `true` instead of consulting `assertToXContentEquivalence()` Fixes #36034	2019-02-05 12:59:05 -05:00
David Turner	f2dd5dd6eb	Remove DiscoveryPlugin#getDiscoveryTypes (#38414 ) With this change we no longer support pluggable discovery implementations. No known implementations of `DiscoveryPlugin` actually override this method, so in practice this should have no effect on the wider world. However, we were using this rather extensively in tests to provide the `test-zen` discovery type. We no longer need a separate discovery type for tests as we no longer need to customise its behaviour. Relates #38410	2019-02-05 17:42:24 +00:00
Przemyslaw Gomulka	963b474f2f	Fix the clock resolution to millis in GetWatchResponseTests (#38405 ) the clock resolution changed from jdk8->jdk10, hence the test is passing in jdk8 but failing in jdk10. The Watcher's objects are serialised and deserialised with milliseconds precision, making test to fail in jdk 10 and higher closes #38400	2019-02-05 18:27:24 +01:00
Przemyslaw Gomulka	df4eb0485d	Enable CronEvalToolTest.testEnsureDateIsShownInRootLocale (#38394 ) The test is now expected to be always passing no matter what the random locale is. This is fixed with using jdk ZoneId.systemDefault() in both the test and CronEvalTool closes #35687	2019-02-05 17:48:47 +01:00
Marios Trivyzas	c9701be1e8	SQL: Implement CURRENT_DATE (#38175 ) Since DATE data type is now available, this implements the `CURRENT_DATE/CURRENT_DATE()/TODAY()` similar to `CURRENT_TIMESTAMP`. Closes: #38160	2019-02-05 18:15:26 +02:00
Armin Braun	887fa2c97a	Mute testReadRequestsReturnLatestMappingVersion (#38438 ) * Relates #37807	2019-02-05 17:10:12 +01:00
David Roberts	92bc681705	[ML] Report index unavailable instead of waiting for lazy node (#38423 ) If a job cannot be assigned to a node because an index it requires is unavailable and there are lazy ML nodes then index unavailable should be reported as the assignment explanation rather than waiting for a lazy ML node.	2019-02-05 16:10:00 +00:00
Martijn van Groningen	0beb3c93d1	Clean up duplicate follow config parameter code (#37688 ) Introduced FollowParameters class that put follow, resume follow, put auto follow pattern requests and follow info response classes reuse. The FollowParameters class had the fields, getters etc. for the common parameters that all these APIs have. Also binary and xcontent serialization / parsing is handled by this class. The follow, resume follow, put auto follow pattern request classes originally used optional non primitive fields, so FollowParameters has that too and the follow info api can handle that now too. Also the followerIndex field can in production only be specified via the url path. If it is also specified via the request body then it must have the same value as is specified in the url path. This option only existed to xcontent testing. However the AbstractSerializingTestCase base class now also supports createXContextTestInstance() to provide a different test instance when testing xcontent, so allowing followerIndex to be specified via the request body is no longer needed. By moving the followerIndex field from Body to ResumeFollowAction.Request class and not allowing the followerIndex field to be specified via the request body the Body class is redundant and can be removed. The ResumeFollowAction.Request class can then directly use the FollowParameters class. For consistency I also removed the ability to specified followerIndex in the put follow api and the name in put auto follow pattern api via the request body.	2019-02-05 17:05:19 +01:00
Jason Tedor	638ba4a59a	Mute failing API key integration test (#38409 ) This commit mutes the test testGetAndInvalidateApiKeysWithExpiredAndInvalidatedApiKey as it failed during a PR build.	2019-02-05 06:08:03 -05:00
Andrei Stefan	cea81b199d	Change the milliseconds precision to 3 digits for intervals. (#38297 )	2019-02-05 12:00:49 +02:00
Albert Zaharovits	8e2eb39cef	SecuritySettingsSource license.self_generated: trial (#38233 ) Authn is enabled only if `license_type` is non `basic`, but `basic` is what the `LicenseService` generates implicitly. This commit explicitly sets license type to `trial`, which allows for authn, in the `SecuritySettingsSource` which is the settings configuration parameter for `InternalTestCluster`s. The real problem, that had created tests failures like #31028 and #32685, is that the check `licenseState.isAuthAllowed()` can change sporadically. If it were to return `true` or `false` during the whole test there would be no problem. The problem manifests when it turns from `true` to `false` right before `Realms.asList()`. There are other license checks before this one (request filter, token service, etc) that would not cause a problem if they would suddenly see the check as `false`. But switching to `false` before `Realms.asList()` makes it appear that no installed realms could have handled the authn token which is an authentication error, as can be seen in the failing tests. Closes #31028 #32685	2019-02-05 10:49:08 +02:00
David Turner	3b2a0d7959	Rename no-master-block setting (#38350 ) Replaces `discovery.zen.no_master_block` with `cluster.no_master_block`. Any value set for the old setting is now ignored.	2019-02-05 08:47:56 +00:00
David Turner	2d114a02ff	Rename static Zen1 settings (#38333 ) Renames the following settings to remove the mention of `zen` in their names: - `discovery.zen.hosts_provider` -> `discovery.seed_providers` - `discovery.zen.ping.unicast.concurrent_connects` -> `discovery.seed_resolver.max_concurrent_resolvers` - `discovery.zen.ping.unicast.hosts.resolve_timeout` -> `discovery.seed_resolver.timeout` - `discovery.zen.ping.unicast.hosts` -> `discovery.seed_addresses`	2019-02-05 08:46:52 +00:00
Brandon Kobel	64ff75f04e	Add apm_user reserved role (#38206 ) * Adding apm_user * Fixing SecurityDocumentationIT testGetRoles test * Adding access to .ml-anomalies-* * Fixing APM test, we don't have access to the ML state index	2019-02-04 21:45:28 -08:00
Yogesh Gaikwad	fe36861ada	Add support for API keys to access Elasticsearch (#38291 ) X-Pack security supports built-in authentication service `token-service` that allows access tokens to be used to access Elasticsearch without using Basic authentication. The tokens are generated by `token-service` based on OAuth2 spec. The access token is a short-lived token (defaults to 20m) and refresh token with a lifetime of 24 hours, making them unsuitable for long-lived or recurring tasks where the system might go offline thereby failing refresh of tokens. This commit introduces a built-in authentication service `api-key-service` that adds support for long-lived tokens aka API keys to access Elasticsearch. The `api-key-service` is consulted after `token-service` in the authentication chain. By default, if TLS is enabled then `api-key-service` is also enabled. The service can be disabled using the configuration setting. The API keys:- - by default do not have an expiration but expiration can be configured where the API keys need to be expired after a certain amount of time. - when generated will keep authentication information of the user that generated them. - can be defined with a role describing the privileges for accessing Elasticsearch and will be limited by the role of the user that generated them - can be invalidated via invalidation API - information can be retrieved via a get API - that have been expired or invalidated will be retained for 1 week before being deleted. The expired API keys remover task handles this. Following are the API key management APIs:- 1. Create API Key - `PUT/POST /_security/api_key` 2. Get API key(s) - `GET /_security/api_key` 3. Invalidate API Key(s) `DELETE /_security/api_key` The API keys can be used to access Elasticsearch using `Authorization` header, where the auth scheme is `ApiKey` and the credentials, is the base64 encoding of API key Id and API key separated by a colon. Example:- ``` curl -H "Authorization: ApiKey YXBpLWtleS1pZDphcGkta2V5" http://localhost:9200/_cluster/health ``` Closes #34383	2019-02-05 14:21:57 +11:00
Yogesh Gaikwad	9d3f057894	Limit token expiry to 1 hour maximum (#38244 ) We mention in our documentation for the token expiration configuration maximum value is 1 hour but do not enforce it. This commit adds max limit to the TOKEN_EXPIRATION setting.	2019-02-05 12:02:36 +11:00
Gordon Brown	b866417650	Mute testCannotShrinkLeaderIndex (#38374 ) This test should not pass until CCR finishes integrating shard history retention leases. It currently sometimes passes (which is a bug in the test), but cannot pass reliably until the linked issue is resolved.	2019-02-04 16:06:19 -07:00
Nhat Nguyen	cecfa5bd6d	Tighten mapping syncing in ccr remote restore (#38071 ) There are two issues regarding the way that we sync mapping from leader to follower when a ccr restore is completed: 1. The returned mapping from a cluster service might not be up to date as the mapping of the restored index commit. 2. We should not compare the mapping version of the follower and the leader. They are not related to one another. Moreover, I think we should only ensure that once the restore is done, the mapping on the follower should be at least the mapping of the copied index commit. We don't have to sync the mapping which is updated after we have opened a session. Relates #36879 Closes #37887	2019-02-04 17:53:41 -05:00
Tim Brooks	5a33816c86	Add test for `PutFollowAction` on a closed index (#38236 ) This is related to #35975. Currently when an index falls behind a leader it encounters a fatal exception. This commit adds a test for that scenario. Additionally, it tests that the user can stop following, close the follower index, and put follow again. After the indexing is re-bootstrapped, it will recover the documents it lost in normal following operations.	2019-02-04 16:37:42 -06:00
Jay Modi	c3cdf84c04	Fix SSLContext pinning to TLSV1.2 in reload tests (#38341 ) This commit fixes the pinning of SSLContexts to TLSv1.2 in the SSLConfigurationReloaderTests. The pinning was added for the initial creation of clients and webservers but the updated contexts would default to TLSv1.3, which is known to cause hangs with the MockWebServer that we use. Relates #38103 Closes #38247	2019-02-04 14:34:37 -07:00
Nhat Nguyen	fb1e350c81	Mute testFollowIndexAndCloseNode (#38360 ) Tracked at #33337	2019-02-04 15:04:46 -05:00
Shaunak Kashyap	be1bb0ec7d	Remove types from Monitoring plugin "backend" code (#37745 ) This PR removes the use of document types from the monitoring exporters and template + watches setup code. It does not remove the notion of types from the monitoring bulk API endpoint "front end" code as that code will eventually just go away in 8.0 and be replaced with Beats as collectors/shippers directly to the monitoring cluster.	2019-02-04 10:58:03 -08:00
Gordon Brown	f872c721ac	Run Node deprecation checks locally (#38065 ) (#38250 ) At times, we need to check for usage of deprecated settings in settings which should not be returned by the NodeInfo API. This commit changes the deprecation info API to run all node checks locally so that these settings can be checked without exposing them via any externally accessible API.	2019-02-04 09:43:28 -07:00
Jason Tedor	625d37a26a	Introduce retention lease background sync (#38262 ) This commit introduces a background sync for retention leases. The idea here is that we do a heavyweight sync when adding a new retention lease, and then periodically we want to background sync any retention lease renewals to the replicas. As long as the background sync interval is significantly lower than the extended lifetime of a retention lease, it is okay if from time to time a replica misses a sync (it will still have an older version of the lease that is retaining more data as we assume that renewals do not decrease the retaining sequence number). There are two follow-ups that will come after this commit. The first is to address the fact that we have not adapted the should periodically flush logic to possibly flush the retention leases. We want to do something like flush if we have not flushed in the last five minutes and there are renewed retention leases since the last time that we flushed. An additional follow-up will remove the syncing of retention leases when a retention lease expires. Today this sync could be invoked in the background by a merge operation. Rather, we will move the syncing of retention lease expiration to be done under the background sync. The background sync will use the heavyweight sync (write action) if a lease has expired, and will use the lightweight background sync (replication action) otherwise.	2019-02-04 10:35:29 -05:00
David Roberts	fb6a176caf	[ML] Add explanation so far to file structure finder exceptions (#38191 ) The explanation so far can be invaluable for troubleshooting as incorrect decisions made early on in the structure analysis can result in seemingly crazy decisions or timeouts later on. Relates elastic/kibana#29821	2019-02-04 14:32:35 +00:00
Boaz Leskes	e49b593c81	Move TokenService to seqno powered cas (#38311 ) Relates #37872 Relates #10708	2019-02-04 15:25:41 +01:00
Przemyslaw Gomulka	9b64558efb	Migrating from joda to java.time. Watcher plugin (#35809 ) part of the migrating joda time work. Migrating watcher plugin to use JDK's java-time refers #27330	2019-02-04 15:08:31 +01:00
Przemyslaw Gomulka	85b4bfe3ff	Core: Migrating from joda to java.time. Monitoring plugin (#36297 ) monitoring plugin migration from joda to java.time refers #27330	2019-02-04 14:47:08 +01:00
Boaz Leskes	ff13a43144	Move ML Optimistic Concurrency Control to Seq No (#38278 ) This commit moves the usage of internal versioning for CAS operations to use sequence numbers and primary terms Relates to #36148 Relates to #10708	2019-02-04 10:41:08 +01:00
David Turner	1d82a6d9f9	Deprecate unused Zen1 settings (#38289 ) Today the following settings in the `discovery.zen` namespace are still used: - `discovery.zen.no_master_block` - `discovery.zen.hosts_provider` - `discovery.zen.ping.unicast.concurrent_connects` - `discovery.zen.ping.unicast.hosts.resolve_timeout` - `discovery.zen.ping.unicast.hosts` This commit deprecates all other settings in this namespace so that they can be removed in the next major version.	2019-02-04 08:52:08 +00:00
Tim Vernum	0164acb0a7	Cleanup construction of interceptors (#38294 ) It would be beneficial to apply some of the request interceptors even when features are disabled. This change reworks the way we build that list so that the interceptors we always want to use are constructed outside of the settings check.	2019-02-04 17:27:41 +11:00
Costin Leau	75f0750ff7	SQL: Remove exceptions from Analyzer (#38260 ) Instead of throwing an exception, use an unresolved attribute to pass the message to the Verifier. Additionally improve the parser to save the extended source for the Aggregate and OrderBy. Close #38208	2019-02-03 22:32:16 +02:00
Costin Leau	a088155f4d	SQL: Move metrics tracking inside PlanExecutor (#38259 ) Move metrics in one place, from the transport layer inside the PlanExecutor Remove unused class Close #38258	2019-02-03 22:31:35 +02:00
Albert Zaharovits	3c1544d259	Fix NPE in Logfile Audit Filter (#38120 ) The culprit in #38097 is an `IndicesRequest` that has no indices, but instead of `request.indices()` returning `null` or `String[0]` it returned `String[] {null}` . This tripped the audit filter. I have addressed this in two ways: 1. `request.indices()` returning `String[] {null}` is treated as `null` or `String[0]`, i.e. no indices 2. `null` values among the roles and indices lists, which are unexpected, will never again stumble the audit filter; `null` values are treated as special values that will not match any policy, i.e. their events will always be printed. Closes #38097	2019-02-03 10:34:17 +02:00
Andrei Stefan	6968f0925b	SQL: Generate relevant error message when grouping functions are not used in GROUP BY (#38017 ) * Add checks for Grouping functions restriction to be placed inside GROUP BY * Fixed bug where GROUP BY HISTOGRAM (not using alias) wasn't recognized properly in the Verifier due to functions equality not working correctly.	2019-02-02 22:05:47 +02:00
Gordon Brown	475a045192	Mute tests in SSLConfigurationReloaderTests (#38248 ) Specifically `testReloadingTrustStore` and `testReloadingPEMTrustConfig`	2019-02-01 21:00:58 -07:00
Gordon Brown	7a1e89c7ed	Ensure ILM policies run safely on leader indices (#38140 ) Adds a Step to the Shrink and Delete actions which prevents those actions from running on a leader index - all follower indices must first unfollow the leader index before these actions can run. This prevents the loss of history before follower indices are ready, which might otherwise result in the loss of data.	2019-02-01 20:46:12 -07:00
Boaz Leskes	f6e06a2b19	Adapt minimum versions for seq# powered operations in Watch related requests and UpdateRequest (#38231 ) After backporting #37977, #37857 and #37872	2019-02-01 20:37:16 -05:00
Costin Leau	783c9ed372	SQL: Allow sorting of groups by aggregates (#38042 ) Introduce client-side sorting of groups based on aggregate functions. To allow this, the Analyzer has been extended to push down to underlying Aggregate, aggregate function and the Querier has been extended to identify the case and consume the results in order and sort them based on the given columns. The underlying QueryContainer has been slightly modified to allow a view of the underlying values being extracted as the columns used for sorting might not be requested by the user. The PR also adds minor tweaks, mainly related to tree output. Close #35118	2019-02-02 01:38:25 +02:00
Jason Tedor	f181e17038	Introduce retention leases versioning (#37951 ) Because concurrent sync requests from a primary to its replicas could be in flight, it can be the case that an older retention leases collection arrives and is processed on the replica after a newer retention leases collection has arrived and been processed. Without a defense, in this case the replica would overwrite the newer retention leases with the older retention leases. This commit addresses this issue by introducing a versioning scheme to retention leases. This versioning scheme is used to resolve out-of-order processing on the replica. We persist this version into Lucene and restore it on recovery. The encoding of retention leases is starting to get a little ugly. We can consider addressing this in a follow-up.	2019-02-01 17:19:19 -05:00
Tal Levy	bae656dcea	Preserve ILM operation mode when creating new lifecycles (#38134 ) There was a bug where creating a new policy would start the ILM service, even if it was stopped. This change ensures that there is no change to the existing operation mode	2019-02-01 13:16:34 -08:00
Nhat Nguyen	3ecdfe1060	Enable trace log in FollowerFailOverIT (#38148 ) This suite still fails one per week sometimes with a worrying assertion. Sadly we are still unable to find the actual source. Expected: <SeqNoStats{maxSeqNo=229, localCheckpoint=86, globalCheckpoint=86}> but: was <SeqNoStats{maxSeqNo=229, localCheckpoint=-1, globalCheckpoint=86}> This change enables trace log in the suite so we will have a better picture if this fails again. Relates #3333	2019-02-01 15:44:39 -05:00
Julie Tibshirani	c2e9d13ebd	Default include_type_name to false in the yml test harness. (#38058 ) This PR removes the temporary change we made to the yml test harness in #37285 to automatically set `include_type_name` to `true` in index creation requests if it's not already specified. This is possible now that the vast majority of index creation requests were updated to be typeless in #37611. A few additional tests also needed updating here. Additionally, this PR updates the test harness to set `include_type_name` to `false` in index creation requests when communicating with 6.x nodes. This mirrors the logic added in #37611 to allow for typeless document write requests in test set-up code. With this update in place, we can remove many references to `include_type_name: false` from the yml tests.	2019-02-01 11:44:13 -08:00
Nhat Nguyen	f64b20383e	Replace awaitBusy with assertBusy in atLeastDocsIndexed (#38190 ) Unlike assertBusy, awaitBusy does not retry if the code-block throws an AssertionError. A refresh in atLeastDocsIndexed can fail because we call this method while we are closing some node in FollowerFailOverIT.	2019-02-01 13:31:17 -05:00
Benjamin Trent	5db305023d	ML: Fix error race condition on stop _all datafeeds and close _all jobs (#38113 ) * ML: Ignore when task is not found for _all * Addressing PR comments * Update TransportStopDatafeedAction.java	2019-02-01 11:16:35 -06:00
Shaunak Kashyap	cc7c42d7e2	Allow built-in monitoring_user role to call GET _xpack API (#38060 ) This PR adds the `monitor/xpack/info` cluster-level privilege to the built-in `monitoring_user` role. This privilege is required for the Monitoring UI to call the `GET _xpack API` on the Monitoring Cluster. It needs to do this in order to determine the license of the Monitoring Cluster, which further determines whether Cluster Alerts are shown to the user or not. Resolves #37970.	2019-02-01 08:56:34 -08:00
David Roberts	1fa413a16d	[ML] Remove "8" prefixes from file structure finder timestamp formats (#38016 ) In 7.x Java timestamp formats are the default timestamp format and there is no need to prefix them with "8". (The "8" prefix was used in 6.7 to distinguish Java timestamp formats from Joda timestamp formats.) This change removes the "8" prefixes from timestamp formats in the output of the ML file structure finder.	2019-02-01 15:36:04 +00:00
Jay Modi	2ca22209cd	Enable TLSv1.3 by default for JDKs with support (#38103 ) This commit enables the use of TLSv1.3 with security by enabling us to properly map `TLSv1.3` in the supported protocols setting to the algorithm for a SSLContext. Additionally, we also enable TLSv1.3 by default on JDKs that support it. An issue was uncovered with the MockWebServer when TLSv1.3 is used that ultimately winds up in an endless loop when the client does not trust the server's certificate. Due to this, SSLConfigurationReloaderTests has been pinned to TLSv1.2. Closes #32276	2019-02-01 08:34:11 -07:00
Tim Vernum	6fcbd07420	Remove heuristics that enable security on trial licenses (#38075 ) In 6.3 trial licenses were changed to default to security disabled, and ee added some heuristics to detect when security should be automatically be enabled if `xpack.security.enabled` was not set. This change removes those heuristics, and requires that security be explicitly enabled (via the `xpack.security.enabled` setting) for trial licenses. Relates: #38009	2019-02-01 17:59:13 +11:00
Tim Brooks	291c4e7a0c	Fix file reading in ccr restore service (#38117 ) Currently we use the raw byte array length when calling the IndexInput read call to determine how many bytes we want to read. However, due to how BigArrays works, the array length might be longer than the reference length. This commit fixes the issue and uses the BytesRef length when calling read. Additionally, it expands the index follow test to index many more documents. These documents should potentially lead to large enough segment files to trigger scenarios where this fix matters.	2019-01-31 18:02:24 -07:00
Benjamin Trent	be381b4525	ML: better handle task state race condition (#38040 )	2019-01-31 11:07:54 -06:00
Henning Andersen	68ed72b923	Handle scheduler exceptions (#38014 ) Scheduler.schedule(...) would previously assume that caller handles exception by calling get() on the returned ScheduledFuture. schedule() now returns a ScheduledCancellable that no longer gives access to the exception. Instead, any exception thrown out of a scheduled Runnable is logged as a warning. This is a continuation of #28667, #36137 and also fixes #37708.	2019-01-31 17:51:45 +01:00
Tim Brooks	b8575c6aa3	Update PutFollowAction serialization post-backport (#37989 ) This commit modifies the `PutFollowRequest` to reflect the fact that active shard functionality has been backported to 6.7.	2019-01-31 09:31:22 -07:00
Alpar Torok	b7de8e1d1e	Mute failing test Tracking #38100	2019-01-31 17:01:16 +02:00
Alpar Torok	f15d7b9b91	Mute failing test Tracking #38027	2019-01-31 16:55:52 +02:00
Marios Trivyzas	4710a7472f	SQL: Implement FIRST/LAST aggregate functions (#37936 ) FIRST and LAST can be used with one argument and work similarly to MIN and MAX but they are implemented using a Top Hits aggregation and therefore can also operate on keyword fields. When a second argument is provided then they return the first/last value of the first arg when its values are ordered ascending/descending (respectively) by the values of the second argument. Currently because of the usage of a Top Hits aggregation FIRST and LAST cannot be used in the HAVING clause of a GROUP BY query to filter on the results of the aggregation. Closes: #35639	2019-01-31 16:33:05 +02:00
Josh Soref	0154052335	spelling: java script -- not JavaScript (#37057 )	2019-01-31 14:09:36 +02:00
Andrei Stefan	22d3290078	SQL: Added SSL configuration options tests (#37875 ) * Added SSL configuration options tests Removed the allow.self.signed option from the documentation since we allow by default self signed certificates as well. * Added more tests	2019-01-31 10:52:49 +02:00
Alexander Reelsen	b94acb608b	Speed up converting of temporal accessor to zoned date time (#37915 ) The existing implementation was slow due to exceptions being thrown if an accessor did not have a time zone. This implementation queries for having a timezone, local time and local date and also checks for an instant preventing to throw an exception and thus speeding up the conversion. This removes the existing method and create a new one named DateFormatters.from(TemporalAccessor accessor) to resemble the naming of the java time ones. Before this change an epoch millis parser using the toZonedDateTime method took approximately 50x longer. Relates #37826	2019-01-31 08:55:40 +01:00
Nhat Nguyen	1a93976ff7	Correct arg names when update mapping/settings from leader (#38063 ) These two arguments are not named incorrectly and caused confusion.	2019-01-31 02:45:42 -05:00
Boaz Leskes	b11732104f	Move watcher to use seq# and primary term for concurrency control (#37977 ) * move watcher to seq# occ * top level set * fix parsing and missing setters * share toXContent for PutResponse and rest end point * fix redacted password * fix username reference * fix deactivate-watch.asciidoc have seq no references * add seq# + term to activate-watch.asciidoc * more doc fixes	2019-01-30 20:14:59 -05:00
Jake Landis	dad41c2b7f	ILM setPriority corrections for a 0 value (#38001 ) This commit fixes the test case that ensures only a priority less then 0 is used with testNonPositivePriority. This also allows the HLRC to support a value of 0. Closes #37652	2019-01-30 17:38:47 -06:00
Tal Levy	7c738fd241	Skip Shrink when numberOfShards not changed (#37953 ) Previously, ShrinkAction would fail if it was executed on an index that had the same number of shards as the target shrunken number. This PR introduced a new BranchingStep that is used inside of ShrinkAction to branch which step to move to next, depending on the shard values. So no shrink will occur if the shard count is unchanged.	2019-01-30 15:09:17 -08:00
Tim Brooks	b88bdfe958	Add dispatching to `HandledTransportAction` (#38050 ) This commit allows implementors of the `HandledTransportAction` to specify what thread the action should be executed on. The motivation for this commit is that certain CCR requests should be performed on the generic threadpool.	2019-01-30 15:40:49 -07:00
Jay Modi	54dbf9469c	Update httpclient for JDK 11 TLS engine (#37994 ) The apache commons http client implementations recently released versions that solve TLS compatibility issues with the new TLS engine that supports TLSv1.3 with JDK 11. This change updates our code to use these versions since JDK 11 is a supported JDK and we should allow the use of TLSv1.3.	2019-01-30 14:24:29 -07:00
Tim Brooks	aeab55e8d1	Reduce flaxiness of ccr recovery timeouts test (#38035 ) This fixes #38027. Currently we assert that all shards have failed. However, it is possible that some shards do not have segement files created yet. The action that we block is fetching these segement files so it is possible that some shards successfully recover. This commit changes the assertion to ensure that at least some of the shards have failed.	2019-01-30 14:13:23 -07:00
David Roberts	be788160ef	[ML] Datafeed deprecation checks (#38026 ) Deprecation checks for the ML datafeed query and aggregations.	2019-01-30 20:12:20 +00:00
David Turner	81c443c9de	Deprecate minimum_master_nodes (#37868 ) Today we pass `discovery.zen.minimum_master_nodes` to nodes started up in tests, but for 7.x nodes this setting is not required as it has no effect. This commit removes this setting so that nodes are started with more realistic configurations, and deprecates it.	2019-01-30 20:09:15 +00:00
Martijn van Groningen	5433af28e3	Fixed test bug, lastFollowTime is null if there are no follower indices.	2019-01-30 19:33:16 +01:00
Jason Tedor	6500b0cbd7	Expose retention leases in shard stats (#37991 ) This commit exposes retention leases via shard-level stats.	2019-01-30 13:20:40 -05:00
Benjamin Trent	9782aaa1b8	ML: Add reason field in JobTaskState (#38029 ) * ML: adding reason to job failure status * marking reason as nullable * Update AutodetectProcessManager.java	2019-01-30 11:56:24 -06:00
Albert Zaharovits	53e80e9814	Fix failure in test code ClusterPrivilegeTests Closes #38030	2019-01-30 16:11:44 +02:00
Benjamin Trent	8280a20664	ML: Add upgrade mode docs, hlrc, and fix bug (#37942 ) * ML: Add upgrade mode docs, hlrc, and fix bug * [DOCS] Fixes build error and edits text * adjusting docs * Update docs/reference/ml/apis/set-upgrade-mode.asciidoc Co-Authored-By: benwtrent <ben.w.trent@gmail.com> * Update set-upgrade-mode.asciidoc * Update set-upgrade-mode.asciidoc	2019-01-30 06:51:11 -06:00
Andrei Stefan	908c8def06	SQL: Skip the nested and object field types in case of an ODBC request (#37948 )	2019-01-30 11:34:47 +02:00
Adrien Grand	c8af0f4bfa	Use mappings to format doc-value fields by default. (#30831 ) Doc-value fields now return a value that is based on the mappings rather than the script implementation by default. This deprecates the special `use_field_mapping` docvalue format which was added in #29639 only to ease the transition to 7.x and it is not necessary anymore in 7.0.	2019-01-30 10:31:51 +01:00
Martijn van Groningen	f51bc00fcf	Added ccr to xpack usage infrastructure (#37256 ) * Added ccr to xpack usage infrastructure Closes #37221	2019-01-30 07:58:26 +01:00
Tim Vernum	99129d7786	Fix exit code for Security CLI tools (#37956 ) The certgen, certutil and saml-metadata tools did not correctly return their exit code to the calling shell. These commands now explicitly exit with the code that was returned from the main(args, terminal) method.	2019-01-30 17:51:11 +11:00
Tim Brooks	55b916afc0	Ensure task metadata not null in follow test (#37993 ) This commit fixes a potential race in the IndexFollowingIT. Currently it is possible that we fetch the task metadata, it is null, and that throws a null pointer exception. Assertbusy does not catch null pointer exceptions. This commit assertions that the metadata is not null.	2019-01-29 15:58:31 -07:00
Tim Brooks	f3f9cabd67	Add timeout for ccr recovery action (#37840 ) This is related to #35975. It adds a action timeout setting that allows timeouts to be applied to the individual transport actions that are used during a ccr recovery.	2019-01-29 12:29:06 -07:00
Marios Trivyzas	e9332331a3	SQL: Make error msg for validation of 2nd arg of PERCENTILE[_RANK] consistent (#37937 ) Use `first` and `second` instead of `1st` and `2nd`.	2019-01-29 21:20:09 +02:00
Albert Zaharovits	697b2fbe52	Remove implicit index monitor privilege (#37774 ) Restricted indices (currently only .security-6 and .security) are special internal indices that require setting the `allow_restricted_indices` flag on every index permission that covers them. If this flag is `false` (default) the permission will not cover these and actions against them will not be authorized. However, the monitoring APIs were the only exception to this rule. This exception is herein forfeited and index monitoring privileges have to be granted explicitly, using the `allow_restricted_indices` flag on the permission, as is the case for any other index privilege.	2019-01-29 21:10:03 +02:00
Benjamin Trent	34d61d3231	ML: ignore unknown fields for JobTaskState (#37982 )	2019-01-29 12:51:34 -06:00
Tim Brooks	00ace369af	Use `CcrRepository` to init follower index (#35719 ) This commit modifies the put follow index action to use a CcrRepository when creating a follower index. It routes the logic through the snapshot/restore process. A wait_for_active_shards parameter can be used to configure how long to wait before returning the response.	2019-01-29 11:47:29 -07:00
David Roberts	5f106a27ea	[ML] Add meta information to all ML indices (#37964 ) This change adds a _meta field storing the version in which the index mappings were last updated to the 3 ML indices that didn't previously have one: - .ml-annotations - .ml-meta - .ml-notifications All other ML indices already had such a _meta field. This field will be useful if we ever need to automatically update the index mappings during a future upgrade.	2019-01-29 15:41:35 +00:00
David Kyle	6d1693ff49	[ML] Prevent submit after autodetect worker is stopped (#37700 ) Runnables can be submitted to AutodetectProcessManager.AutodetectWorkerExecutorService without error after it has been shutdown which can lead to requests timing out as their handlers are never called by the terminated executor. This change throws an EsRejectedExecutionException if a runnable is submitted after after the shutdown and calls AbstractRunnable.onRejection on any tasks not run. Closes #37108	2019-01-29 15:09:40 +00:00
Luca Cavanna	2325fb9cb3	Remove test only SearchShardTarget constructor (#37912 ) Remove SearchShardTarget test only constructor and replace all the usages with calls to the other constructor that accepts a ShardId.	2019-01-29 14:58:11 +01:00
Przemyslaw Gomulka	4f4113e964	Rename security audit.log to _audit.json (#37916 ) in order to keep json logs consistent the security audit logs are renamed from .log to .json relates #32850	2019-01-29 14:53:55 +01:00
Tanguy Leroux	460f10ce60	Close Index API should force a flush if a sync is needed (#37961 ) This commit changes the TransportVerifyShardBeforeCloseAction so that it issues a forced flush, forcing the translog and the Lucene commit to contain the same max seq number and global checkpoint in the case the Translog contains operations that were not written in the IndexWriter (like a Delete that touches a non existing doc). This way the assertion added in #37426 won't trip. Related to #33888	2019-01-29 13:15:58 +01:00
Henrique Gonçalves	eceb3185c7	[ML] Make GetJobStats work with arbitrary wildcards and groups (#36683 ) The /_ml/anomaly_detectors/{job}/_stats endpoint now works correctly when {job} is a wildcard or job group. Closes #34745	2019-01-29 09:06:50 +00:00
Dimitris Athanasiou	ebe9c95230	[ML] Audit all errors during job deletion (#37933 ) This commit moves the auditing of job deletion related errors to the final listener in the job delete action. This ensures any error that occurs during job deletion is audited.	2019-01-29 10:23:50 +02:00
Przemyslaw Gomulka	891320f5ac	Elasticsearch support to JSON logging (#36833 ) In order to support JSON log format, a custom pattern layout was used and its configuration is enclosed in ESJsonLayout. Users are free to use their own patterns, but if smooth Beats integration is needed, they should use ESJsonLayout. EvilLoggerTests are left intact to make sure user's custom log patterns work fine. To populate additional fields node.id and cluster.uuid which are not available at start time, a cluster state update will have to be received and the values passed to log4j pattern converter. A ClusterStateObserver.Listener is used to receive only one ClusteStateUpdate. Once update is received the nodeId and clusterUUid are set in a static field in a NodeAndClusterIdConverter. Following fields are expected in JSON log lines: type, tiemstamp, level, component, cluster.name, node.name, node.id, cluster.uuid, message, stacktrace see ESJsonLayout.java for more details and field descriptions Docker log4j2 configuration is now almost the same as the one use for ES binary. The only difference is that docker is using console appenders, whereas ES is using file appenders. relates: #32850	2019-01-29 07:20:09 +01:00
Like	6ed35fbb94	Support merge nested Map in list for JIRA configurations (#37634 ) This commit allows JIRA API fields that require a list of key/value pairs (maps), such as JIRA "components" to use use template snippets (e.g. {{ctx.payload.foo}}). Prior to this change the templated value (not the de-referenced value) would be sent via the API and error. Closes #30068	2019-01-28 18:01:09 -06:00
Gordon Brown	49bd8715ff	Inject Unfollow before Rollover and Shrink (#37625 ) We inject an Unfollow action before Shrink because the Shrink action cannot be safely used on a following index, as it may not be fully caught up with the leader index before the "original" following index is deleted and replaced with a non-following Shrunken index. The Unfollow action will verify that 1) the index is marked as "complete", and 2) all operations up to this point have been replicated from the leader to the follower before explicitly disconnecting the follower from the leader. Injecting an Unfollow action before the Rollover action is done mainly as a convenience: This allow users to use the same lifecycle policy on both the leader and follower cluster without having to explictly modify the policy to unfollow the index, while doing what we expect users to want in most cases.	2019-01-28 14:09:12 -07:00
Nhat Nguyen	557fcf915e	Wait for mapping in testReadRequestsReturnLatestMappingVersion (#37886 ) If the index request is executed before the mapping update is applied on the IndexShard, the index request will perform a dynamic mapping update. This mapping update will be timeout (i.e, ProcessClusterEventTimeoutException) because the latch is not open. This leads to the failure of the index request and the test. This commit makes sure the mapping is ready before we execute the index request. Closes #37807	2019-01-28 15:25:56 -05:00
Jake Landis	99b75a9bdf	deprecate types for watcher (#37594 ) This commit adds deprecation warnings for index actions and search actions when executed via watcher. Unit and integration tests updated accordingly. relates #35190	2019-01-28 13:46:43 -06:00
Benjamin Trent	7e4c0e6991	ML: Adds set_upgrade_mode API endpoint (#37837 ) * ML: Add MlMetadata.upgrade_mode and API * Adding tests * Adding wait conditionals for the upgrade_mode call to return * Adding tests * adjusting format and tests * Adjusting wait conditions for api return and msgs * adjusting doc tests * adding upgrade mode tests to black list	2019-01-28 09:07:30 -06:00
David Kyle	c0409fb9f0	[ML] Marginal gains in slow multi node QA tests (#37825 ) Move 2 tests that are simple rest tests and out of the QA suite and cut the number of post data calls in ForecastIT	2019-01-28 10:00:59 +00:00
David Roberts	57d321ed5f	[ML] Tighten up use of aliases rather than concrete indices (#37874 ) We have read and write aliases for the ML results indices. However, the job still had methods that purported to reliably return the name of the concrete results index being used by the job. After reindexing prior to upgrade to 7.x this will be wrong, so the method has been renamed and the comments made more explicit to say the returned index name may not be the actual concrete index name for the lifetime of the job. Additionally, the selection of indices when deleting the job has been changed so that it works regardless of concrete index names. All these changes are nice-to-have for 6.7 and 7.0, but will become critical if we add rolling results indices in the 7.x release stream as 6.7 and 7.0 nodes may have to operate in a mixed version cluster that includes a version that can roll results indices.	2019-01-28 09:38:46 +00:00
Martijn van Groningen	4e1a779773	Prepare ShardFollowNodeTask to bootstrap when it fall behind leader shard (#37562 ) * Changed `LuceneSnapshot` to throw an `OperationsMissingException` if the requested ops are missing. * Changed the shard changes api to handle the `OperationsMissingException` and wrap the exception into `ResourceNotFound` exception and include metadata to indicate the requested range can no longer be retrieved. * Changed `ShardFollowNodeTask` to handle this `ResourceNotFound` exception with the included metdata header. Relates to #35975	2019-01-28 09:30:04 +01:00
Dimitrios Liappis	290c6637c2	Refactor into appropriate uses of scheduleUnlessShuttingDown (#37709 ) Replace `threadPool().schedule()` / catch `EsRejectedExecutionException` pattern with direct calls to `ThreadPool#scheduleUnlessShuttingDown()`. Closes #36318	2019-01-28 10:01:26 +02:00
Albert Zaharovits	66ddd8d2f7	Create snapshot role (#35820 ) This commit introduces the `create_snapshot` cluster privilege and the `snapshot_user` role. This role is to be used by "cronable" tools that call the snapshot API periodically without recurring to the `manage` cluster privilege. The `create_snapshot` cluster privilege is much more limited compared to the `manage` privilege. The `snapshot_user` role grants the privileges to view the metadata of all indices (including restricted ones, i.e. .security). It obviously grants the create snapshot privilege but the repository has to be created using another role. In addition, it grants the privileges to (only) GET repositories and snapshots, but not create and delete them. The role does not allow to create repositories. This distinction is important because snapshotting equates to the `read` index privilege if the user has control of the snapshot destination, but this is not the case in this instance, because the role does not grant control over repository configuration.	2019-01-27 23:07:32 +02:00
Jason Tedor	5fddb631a2	Introduce retention lease syncing (#37398 ) This commit introduces retention lease syncing from the primary to its replicas when a new retention lease is added. A follow-up commit will add a background sync of the retention leases as well so that renewed retention leases are synced to replicas.	2019-01-27 07:49:56 -05:00
David Roberts	f2c0c26d15	[ML] Adjust structure finder for Joda to Java time migration (#37306 ) The ML file structure finder has always reported both Joda and Java time format strings. This change makes the Java time format strings the ones that are incorporated into mappings and ingest pipeline definitions. The BWC syntax of prepending "8" to these formats is used. This will need to be removed once Java time format strings become the default in Elasticsearch. This commit also removes direct imports of Joda classes in the structure finder unit tests. Instead the core Joda BWC class is used.	2019-01-26 20:19:57 +00:00
Julie Tibshirani	7c130d235a	Mute CcrRepositoryIT#testFollowerMappingIsUpdated Tracked in #37887.	2019-01-25 14:55:47 -08:00
Marios Trivyzas	d1ff450edc	SQL: Fix casting from date to numeric type to use millis (#37869 ) Previously casting from a DATE[TIME] type to a numeric (DOUBLE, LONG, INT, etc. used seconds instead of the epoch millis. Fixes: #37655	2019-01-25 23:29:10 +02:00
Benjamin Trent	9e932f4869	ML: removing unnecessary upgrade code (#37879 )	2019-01-25 13:57:41 -06:00
Julie Tibshirani	455f223c3a	Mute TransformIntegrationTests#testSearchTransform Tracked in #37882.	2019-01-25 11:12:45 -08:00
Martijn Laarman	dfecb256cb	Exit batch files explictly using ERRORLEVEL (#29583 ) * Exit batch files explictly using ERRORLEVEL This makes sure the exit code is preserved when calling the batch files from different contexts other than DOS Fixes #29582 This also fixes specific error codes being masked by an explict exit /b 1 causing the useful exitcodes from ExitCodes to be lost. * fix line breaks for calling cli to match the bash scripts * indent size of bash files is 2, make sure editorconfig does the same for bat files * update indenting to match bash files * update elasticsearch-keystore.bat indenting * Update elasticsearch-node.bat to exit outside of endlocal	2019-01-25 16:44:33 +01:00
Tanguy Leroux	f1f54e0f61	TransportUnfollowAction should increase settings version (#37859 ) The TransportUnfollowAction updates the index settings but does not increase the settings version to reflect that change. This issue has been caught while working on the replication of closed indices (#33888). The IndexFollowingIT.testUnfollowIndex() started to fail and this specific assertion tripped. It does not happen on master branch today because index metadata for closed indices are never updated in IndexService instances, but this is something that is going to change with the replication of closed indices.	2019-01-25 16:31:26 +01:00
Przemyslaw Gomulka	85acc11ef7	AsyncTwoPhaseIndexerTests race condition fixed (#37830 ) The unlucky timing can cause this test to fail when the indexing is triggered from `maybeTriggerAsyncJob`. As this is asynchronous, in can finish quicker then the test stepping over to next assertion The introduced barrier solves the problem closes #37695	2019-01-25 16:26:16 +01:00
Christoph Büscher	b4b4cd6ebd	Clean codebase from empty statements (#37822 ) * Remove empty statements There are a couple of instances of undocumented empty statements all across the code base. While they are mostly harmless, they make the code hard to read and are potentially error-prone. Removing most of these instances and marking blocks that look empty by intention as such. * Change test, slightly more verbose but less confusing	2019-01-25 14:23:02 +01:00
David Roberts	deafce1acd	[ML] No need to add state doc mapping on job open in 7.x (#37759 ) When upgrading from 5.4 to 5.5 to 6.7 (inclusive) it was necessary to ensure there was a mapping for type "doc" on the ML state index before opening a job. This was because 5.4 created a multi-type ML state index. In version 7.x we can be sure that any such 5.4 index is no longer in use. It would have had to be reindexed into the 6.x index format prior to the upgrade to version 7.x.	2019-01-25 13:15:35 +00:00
Jim Ferenczi	787acb14b9	Track total hits up to 10,000 by default (#37466 ) This commit changes the default for the `track_total_hits` option of the search request to `10,000`. This means that by default search requests will accurately track the total hit count up to `10,000` documents, requests that match more than this value will set the `"total.relation"` to `"gte"` (e.g. greater than or equals) and the `"total.value"` to `10,000` in the search response. Scroll queries are not impacted, they will continue to count the total hits accurately. The default is set back to `true` (accurate hit count) if `rest_total_hits_as_int` is set in the search request. I choose `10,000` as the default because that's also the number we use to limit pagination. This means that users will be able to know how far they can jump (up to 10,000) even if the total number of hits is not accurate. Closes #33028	2019-01-25 13:45:39 +01:00
Tanguy Leroux	a3baa8f5ef	Freezing an index should increase its index settings version (#37813 ) When an index is frozen, two index settings are updated (index.frozen and index.search.throttled) but the settings version is left unchanged and does not reflect the settings update. This commit change the TransportFreezeIndexAction so that it also increases the settings version when an index is frozen/unfrozen. This issue has been caught while working on the replication of closed indices (#3388) in which index metadata for a closed index are updated to frozen metadata and this specific assertion tripped.	2019-01-25 11:27:27 +01:00
David Roberts	170d7413d0	[ML] Fix gaps in reserved roles tests (#37772 ) Some of our newer endpoints and indices were missing from the tests.	2019-01-25 09:29:53 +00:00
Martijn van Groningen	1151f3b3ff	Fail with a dedicated exception if remote connection is missing or (#37767 ) or connectivity to the remote connection is failing. Relates to #37681	2019-01-25 08:53:18 +01:00
Tim Vernum	03690d12b2	Remove TLS 1.0 as a default SSL protocol (#37512 ) The default value for ssl.supported_protocols no longer includes TLSv1 as this is an old protocol with known security issues. Administrators can enable TLSv1.0 support by configuring the appropriate `ssl.supported_protocols` setting, for example: xpack.security.http.ssl.supported_protocols: ["TLSv1.2","TLSv1.1","TLSv1"] Relates: #36021	2019-01-25 15:46:39 +11:00
Lee Hinman	0f3c542850	Deprecate xpack.watcher.history.cleaner_service.enabled (#37782 ) This deprecates the `xpack.watcher.history.cleaner_service.enabled` setting, since all newly created `.watch-history` indices in 7.0 will use ILM to manage their retention. In 8.0 the setting itself and cleanup actions will be removed. Resolves #32041	2019-01-24 15:31:31 -07:00
Nhat Nguyen	76fb573569	Do not allow put mapping on follower (#37675 ) Today, the mapping on the follower is managed and replicated from its leader index by the ShardFollowTask. Thus, we should prevent users from modifying the mapping on the follower indices. Relates #30086	2019-01-24 12:13:00 -05:00
Marios Trivyzas	74b6f308e9	SQL: Fix issue with complex expression as args of PERCENTILE/_RANK (#37102 ) When the arguements of PERCENTILE and PERCENTILE_RANK can be folded, the `ConstantFolding` rule kicks in and calls the `replaceChildren()` method on `InnerAggregate` which is created from the aggregation rules of the `Optimizerz. `InnerAggregate` in turn, cannot implement the method as the logic of creating a new `InnerAggregate` instance from a list of `Expression`s resides in the Optimizer. So, instead, `ConstantFolding` should be applied before any of the aggregations related rules. Fixes: #37099	2019-01-24 18:40:20 +02:00
Ioannis Kakavas	265710e658	Better msg on unmapped principal attribute (#37805 ) When we can't map the principal attribute from the configured SAML attribute in the realm settings, we can't complete the authentication. We return an error to the user indicating this and we present them with a list of attributes we did get from the SAML response to point out that the expected one was not part of that list. This list will never contain the NameIDs though as they are not part of the SAMLAttribute list. So we might have a NameID but just with a different format.	2019-01-24 17:05:01 +02:00
Andrei Stefan	163a27b93c	SQL: Fix BasicFormatter NPE (#37804 )	2019-01-24 15:40:51 +02:00
Marios Trivyzas	9357929309	SQL: Improve handling of invalid args for PERCENTILE/PERCENTILE_RANK (#37803 ) Improve the Exception and the error message returned when 2nd argument of PERCENTILE and PERCENTILE_RANK is not a constant.	2019-01-24 15:03:49 +02:00
Yulong	20533c5990	Add built-in user and role for code plugin (#37030 ) * Add built-in roles for code plugin * Fix rest-client get-roles test count * Fix broken test	2019-01-24 20:12:32 +08:00
Marios Trivyzas	f707fa9e0a	SQL: Introduce SQL DATE data type (#37693 ) * SQL: Introduce SQL DATE data type Support ANSI SQL's DATE type by introducing a runtime-only ES SQL date type. Closes: #37340	2019-01-24 13:41:58 +02:00
Albert Zaharovits	b6936e3c1e	Remove index audit output type (#37707 ) This commit removes the Index Audit Output type, following its deprecation in 6.7 by 8765a31d4e6770. It also adds the migration notice (settings notice). In general, the problem with the index audit output is that event indexing can be slower than the rate with which audit events are generated, especially during the daily rollovers or the rolling cluster upgrades. In this situation audit events will be lost which is a terrible failure situation for an audit system. Besides of the settings under the `xpack.security.audit.index` namespace, the `xpack.security.audit.outputs` setting has also been deprecated and will be removed in 7. Although explicitly configuring the logfile output does not touch any deprecation bits, this setting is made redundant in 7 so this PR deprecates it as well. Relates #29881	2019-01-24 12:36:10 +02:00
David Roberts	f12bfb4684	Mute FollowerFailOverIT testReadRequestsReturnsLatestMappingVersion Due to https://github.com/elastic/elasticsearch/issues/37807	2019-01-24 09:58:50 +00:00
David Kyle	e1226f69b7	[ML] Increase close job timeout and lower the max number (#37770 )	2019-01-24 09:18:48 +00:00
Martijn van Groningen	2908ca1b35	Fix index filtering in follow info api. (#37752 ) The filtering by follower index was completely broken. Also the wrong persistent tasks were selected, causing the wrong status to be reported. Closes #37738	2019-01-24 08:50:23 +01:00
Nhat Nguyen	0096f1b2e4	Ensure changes requests return the latest mapping version (#37633 ) Today we keep the mapping on the follower in sync with the leader's using the mapping version from changes requests. There are two rare cases where the mapping on the follower is not synced properly: 1. The returned mapping version (from ClusterService) is outdated than the actual mapping. This happens because we expose the latest cluster state in ClusterService after applying it to IndexService. 2. It's possible for the FollowTask to receive an outdated mapping than the min_required_mapping. In that case, it should fetch the mapping again; otherwise, the follower won't have the right mapping. Relates to #31140	2019-01-23 13:41:13 -05:00
Jason Tedor	169cb38778	Liberalize StreamOutput#writeStringList (#37768 ) In some cases we only have a string collection instead of a string list that we want to serialize out. We have a convenience method for writing a list of strings, but no such method for writing a collection of strings. Yet, a list of strings is a collection of strings, so we can simply liberalize StreamOutput#writeStringList to be more generous in the collections that it accepts and write out collections of strings too. On the other side, we do not have a convenience method for reading a list of strings. This commit addresses both of these issues.	2019-01-23 12:52:17 -05:00
Lee Hinman	427bc7f940	Use ILM for Watcher history deletion (#37443 ) * Use ILM for Watcher history deletion This commit adds an index lifecycle policy for the `.watch-history-*` indices. This policy is automatically used for all new watch history indices. This does not yet remove the automatic cleanup that the monitoring plugin does for the .watch-history indices, and it does not touch the `xpack.watcher.history.cleaner_service.enabled` setting. Relates to #32041	2019-01-23 10:18:08 -07:00
Lee Hinman	647e225698	Retry ILM steps that fail due to SnapshotInProgressException (#37624 ) Some steps, such as steps that delete, close, or freeze an index, may fail due to a currently running snapshot of the index. In those cases, rather than move to the ERROR step, we should retry the step when the snapshot has completed. This change adds an abstract step (`AsyncRetryDuringSnapshotActionStep`) that certain steps (like the ones I mentioned above) can extend that will automatically handle a situation where a snapshot is taking place. When a `SnapshotInProgressException` is received by the listener wrapper, a `ClusterStateObserver` listener is registered to wait until the snapshot has completed, re-running the ILM action when no snapshot is occurring. This also adds integration tests for these scenarios (thanks to @talevy in #37552). Resolves #37541	2019-01-23 09:46:31 -07:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
David Roberts	7b3dd3022d	[ML] Update ML results mappings on process start (#37706 ) This change moves the update to the results index mappings from the open job action to the code that starts the autodetect process. When a rolling upgrade is performed we need to update the mappings for already-open jobs that are reassigned from an old version node to a new version node, but the open job action is not called in this case. Closes #37607	2019-01-23 09:37:37 +00:00
Andrey Ershov	534ba1dd34	Remove LicenseServiceClusterNotRecoveredTests (#37528 ) While tests migration from Zen1 to Zen2, we've encountered this test. This test is organized as follows: Starts the first cluster node. Starts the second cluster node. Checks that license is active. Interesting fact that adding assertLicenseActive(true) between 1 and 2 also makes the test pass. assertLicenseActive retrieves XPackLicenseState from the nodes and checks that active flag is set. It's set to true even before the cluster is initialized. So this test does not make sense.	2019-01-23 07:23:06 +01:00
Brandon Kobel	940f6ba4c1	Remove kibana_user and kibana_dashboard_only_user index privileges (#37441 ) * Remove kibana_user and kibana_dashboard_only_user .kibana* index privileges * Removing unused imports	2019-01-22 12:09:08 -08:00
Tim Brooks	eb43ab6d60	Implement leader rate limiting for file restore (#37677 ) This is related to #35975. This commit implements rate limiting on the leader side using the CombinedRateLimiter.	2019-01-22 10:57:37 -07:00
Zachary Tong	2ba9e361ab	Add helper classes to determine if aggs have a value (#36020 ) This adds a set of helper classes to determine if an agg "has a value". This is needed because InternalAggs represent "empty" in different manners according to convention. Some use `NaN`, `+/- Inf`, `0.0`, etc. A user can pass the Internal agg type to one of these helper methods and it will report if the agg contains a value or not, which allows the user to differentiate "empty" from a real `NaN`. These helpers are best-effort in some cases. For example, several pipeline aggs share a single return class but use different conventions to mark "empty", so the helper uses the loosest definition that applies to all the aggs that use the class. Sums in particular are unreliable. The InternalSum simply returns 0.0 if the agg is empty (which is correct, no values == sum of zero). But this also means the helper cannot differentiate from "empty" and `+1 + -1`.	2019-01-22 12:38:55 -05:00
Christoph Büscher	256e01ca92	Fix potential NPE in UsersTool (#37660 ) It looks like the output of FileUserPasswdStore.parseFile shouldn't be wrapped into another map since its output can be null. Doing this wrapping after the null check (which potentially raises an exception) instead.	2019-01-22 17:34:13 +01:00
Ioannis Kakavas	5c1a1f7ac1	Use PEM files for PkiOptionalClientAuthTests (#37683 ) Use PEM files for the key/cert for TLS on the http layer of the node instead of a JKS keystore so that the tests can also run in a FIPS 140 JVM . Resolves: #37682	2019-01-22 17:26:36 +02:00
Andrei Stefan	7507af29fa	SQL: Return Intervals in SQL format for CLI (#37602 ) * Add separate CLI Mode * Use the correct Mode for cursor close requests * Renamed CliFormatter and have different formatting behavior for CLI and "text" format.	2019-01-22 14:55:28 +02:00
Martijn van Groningen	ef2f5e4a13	Follow stats api should return a 404 when requesting stats for a non existing index (#37220 ) Currently it returns an empty response with a 200 response code. Closes #37021	2019-01-22 12:48:05 +01:00
Adrien Grand	e9fcb25a28	Upgrade to lucene-8.0.0-snapshot-83f9835. (#37668 ) This snapshot uses a new file format for doc-values which is expected to make advance/advanceExact perform faster on sparse fields: https://issues.apache.org/jira/browse/LUCENE-8585	2019-01-22 11:44:29 +01:00
Yogesh Gaikwad	3e1e1b0b37	Removes awaits fix as the fix is in. (#37676 ) The PR for the fix has been merged. https://github.com/elastic/elasticsearch/pull/37661 but the awaits fix annotation was not removed.	2019-01-22 19:35:17 +11:00
Andrei Stefan	90ae556d97	Define constants for REST requests endpoints in tests (#37610 )	2019-01-22 10:01:51 +02:00
Yogesh Gaikwad	ca4b5861c8	Fix a test failure in CompositeRolesStoreTests (#37661 ) Due to missing stubbing for `NativePrivilegeStore#getPrivileges` the test `testNegativeLookupsAreCached` failed when the superuser role name was present in the role names. This commit adds missing stubbing. Closes: #37657	2019-01-22 09:34:40 +11:00
Tim Brooks	f516d68fb2	Share `NioGroup` between http and transport impls (#37396 ) Currently we create dedicated network threads for both the http and transport implementations. Since these these threads should never perform blocking operations, these threads could be shared. This commit modifies the nio-transport to have 0 http workers be default. If the default configs are used, this will cause the http transport to be run on the transport worker threads. The http worker setting will still exist in case the user would like to configure dedicated workers. Additionally, this commmit deletes dedicated acceptor threads. We have never had these for the netty transport and they can be added back if a need is determined in the future.	2019-01-21 13:50:56 -07:00
Ryan Ernst	9a34b20233	Simplify integ test distribution types (#37618 ) The integ tests currently use the raw zip project name as the distribution type. This commit simplifies this specification to be "default" or "oss". Whether zip or tar is used should be an internal implementation detail of the integ test setup, which can (in the future) be platform specific.	2019-01-21 12:37:17 -08:00
Albert Zaharovits	0d7831ca6a	Checkstyle PutRoleRequest	2019-01-21 19:02:42 +02:00
Albert Zaharovits	f349372fba	Mute test. Relates #37657	2019-01-21 18:39:53 +02:00
Albert Zaharovits	5843aba8bd	Checkstyle PutRoleRequestTests	2019-01-21 18:36:39 +02:00
Albert Zaharovits	2c02b298d3	Fix PutRoleRequestTests Closes #37662	2019-01-21 18:16:10 +02:00
Alexander Reelsen	24c5dd498f	Mute PutRoleRequestTests.testSerializationBetweenV63AndV70 Relates #37662	2019-01-21 16:11:42 +01:00
Albert Zaharovits	f70ec3badb	Fix PutRoleRequestTests Related `ff0f5402`	2019-01-21 14:07:58 +02:00
Albert Zaharovits	0631322dda	Stream version nit after `ff0f540` and ce60585	2019-01-21 14:01:22 +02:00
markharwood	468bae29f7	Mute test Tracking #37652	2019-01-21 11:52:21 +00:00
Martijn van Groningen	88f4b0a326	Do not set fatal exception when shard follow task is stopped. (#37603 ) When shard follow task is cancelled while fetching operations then the fatal exception field should not be set.	2019-01-21 07:54:51 +01:00
Albert Zaharovits	ff0f540255	Permission for restricted indices (#37577 ) This grants the capability to grant privileges over certain restricted indices (.security and .security-6 at the moment). It also removes the special status of the superuser role. IndicesPermission.Group is extended by adding the `allow_restricted_indices` boolean flag. By default the flag is false. When it is toggled, you acknowledge that the indices under the scope of the permission group can cover the restricted indices as well. Otherwise, by default, restricted indices are ignored when granting privileges, thus rendering them hidden for authorization purposes. This effectively adds a confirmation "check-box" for roles that might grant privileges to restricted indices. The "special status" of the superuser role has been removed and coded as any other role: ``` new RoleDescriptor("superuser", new String[] { "all" }, new RoleDescriptor.IndicesPrivileges[] { RoleDescriptor.IndicesPrivileges.builder() .indices("") .privileges("all") .allowRestrictedIndices(true) // this ----^ .build() }, new RoleDescriptor.ApplicationResourcePrivileges[] { RoleDescriptor.ApplicationResourcePrivileges.builder() .application("") .privileges("") .resources("") .build() }, null, new String[] { "*" }, MetadataUtils.DEFAULT_RESERVED_METADATA, Collections.emptyMap()); ``` In the context of the Backup .security work, this allows the creation of a "curator role" that would permit listing (get settings) for all indices (including the restricted ones). That way the curator role would be able to ist and snapshot all indices, but not read or restore any of them. Supersedes #36765 Relates #34454	2019-01-20 23:19:40 +02:00
Albert Zaharovits	5308746270	Remove Watcher Account "unsecure" settings (#36736 ) Removes all sensitive settings (passwords, auth tokens, urls, etc...) for watcher notifications accounts. These settings were deprecated (and herein removed) in favor of their secure sibling that is set inside the elasticsearch keystore. For example: `xpack.notification.email.account.<id>.smtp.password` is no longer a valid setting, and it is replaced by `xpack.notification.email.account.<id>.smtp.secure_password`	2019-01-20 12:51:24 +02:00
Ryan Ernst	fc99eb3e65	Add cache cleaning task for ML snapshot (#37505 ) The ML subproject of xpack has a cache for the cpp artifact snapshots which is checked on each build. The cache is outside of the build dir so that it is not wiped on a typical clean, as the artifacts can be large and do not change often. This commit adds a cleanCache task which will wipe the cache dir, as over time the size of the directory can become bloated.	2019-01-19 16:16:58 -08:00
Tim Brooks	fe753ee1d2	Do not add index event listener if CCR disabled (#37432 ) Currently we add the CcrRestoreSourceService as a index event listener. However, if ccr is disabled, this service is null and we attempt to add a null listener throwing an exception. This commit only adds the listener if ccr is enabled.	2019-01-18 16:31:21 -07:00
Tim Brooks	cd41289396	Add local session timeouts to leader node (#37438 ) This is related to #35975. This commit adds timeout functionality to the local session on a leader node. When a session is started, a timeout is scheduled using a repeatable runnable. If the session is not accessed in between two runs the session is closed. When the sssion is closed, the repeating task is cancelled. Additionally, this commit moves session uuid generation to the leader cluster. And renames the PutCcrRestoreSessionRequest to StartCcrRestoreSessionRequest to reflect that change.	2019-01-18 14:48:20 -07:00
Gordon Brown	88b9810567	Remove obsolete deprecation checks (#37510 ) * Remove obsolete deprecation checks This also updates the old-indices check to be appropriate for the 7.x series of releases, and leaves it as the only deprecation check in place. * Add toString to DeprecationIssue * Bring filterChecks across from 6.x * License headers	2019-01-18 14:24:34 -07:00
Benjamin Trent	12cdf1cba4	ML: Add support for single bucket aggs in Datafeeds (#37544 ) Single bucket aggs are now supported in datafeed aggregation configurations.	2019-01-18 15:08:53 -06:00
Benjamin Trent	5384162a42	ML: creating ML State write alias and pointing writes there (#37483 ) * ML: creating ML State write alias and pointing writes there * Moving alias check to openJob method * adjusting concrete index lookup for ml-state	2019-01-18 14:32:34 -06:00
Martijn van Groningen	a3030c51e2	[ILM] Add unfollow action (#36970 ) This change adds the unfollow action for CCR follower indices. This is needed for the shrink action in case an index is a follower index. This will give the follower index the opportunity to fully catch up with the leader index, pause index following and unfollow the leader index. After this the shrink action can safely perform the ilm shrink. The unfollow action needs to be added to the hot phase and acts as barrier for going to the next phase (warm or delete phases), so that follower indices are being unfollowed properly before indices are expected to go in read-only mode. This allows the force merge action to execute its steps safely. The unfollow action has three steps: * `wait-for-indexing-complete` step: waits for the index in question to get the `index.lifecycle.indexing_complete` setting be set to `true` * `wait-for-follow-shard-tasks` step: waits for all the shard follow tasks for the index being handled to report that the leader shard global checkpoint is equal to the follower shard global checkpoint. * `pause-follower-index` step: Pauses index following, necessary to unfollow * `close-follower-index` step: Closes the index, necessary to unfollow * `unfollow-follower-index` step: Actually unfollows the index using the CCR Unfollow API * `open-follower-index` step: Reopens the index now that it is a normal index * `wait-for-yellow` step: Waits for primary shards to be allocated after reopening the index to ensure the index is ready for the next step In the case of the last two steps, if the index in being handled is a regular index then the steps acts as a no-op. Relates to #34648 Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Gordon Brown <gordon.brown@elastic.co>	2019-01-18 13:05:03 -07:00
Igor Motov	54af8a4e7a	SQL: fix object extraction from sources (#37502 ) Throws an exception if hit extractor tries to retrieve unsupported object. For example, selecting "a" from `{"a": {"b": "c"}}` now throws an exception instead of returning null. Relates to #37364	2019-01-18 14:03:48 -05:00
Martijn van Groningen	6846666b6b	Add ccr follow info api (#37408 ) * Add ccr follow info api This api returns all follower indices and per follower index the provided parameters at put follow / resume follow time and whether index following is paused or active. Closes #37127 * iter * [DOCS] Edits the get follower info API * [DOCS] Fixes link to remote cluster * [DOCS] Clarifies descriptions for configured parameters	2019-01-18 16:37:21 +01:00
Ioannis Kakavas	7597b7ce2b	Add validation for empty PutPrivilegeRequest (#37569 ) Return an error to the user if the put privilege api is called with an empty body (no privileges) Resolves: #37561	2019-01-18 17:06:40 +02:00
Tim Brooks	978c818d0f	Use RestoreSnapshotRequest in CcrRepositoryIT Commit #37535 removed an internal restore request in favor of the RestoreSnapshotRequest. Commit #37449 added a new test that used the internal restore request. This commit modifies the new test to use the RestoreSnapshotRequest.	2019-01-17 15:31:27 -07:00
Tim Brooks	b6f06a48c0	Implement follower rate limiting for file restore (#37449 ) This is related to #35975. This commit implements rate limiting on the follower side using a new class `CombinedRateLimiter`.	2019-01-17 14:58:46 -07:00
Armin Braun	381d035cd6	Remove Redundant RestoreRequest Class (#37535 ) * Same as #37464 but for the restore side	2019-01-17 22:23:23 +01:00
Yannick Welsch	6d64a2a901	Propagate Errors in executors to uncaught exception handler (#36137 ) This is a continuation of #28667 and has as goal to convert all executors to propagate errors to the uncaught exception handler. Notable missing ones were the direct executor and the scheduler. This commit also makes it the property of the executor, not the runnable, to ensure this property. A big part of this commit also consists of vastly improving the test coverage in this area.	2019-01-17 17:46:35 +01:00
Jake Landis	587034dfa7	Add set_priority action to ILM (#37397 ) This commit adds a set_priority action to the hot, warm, and cold phases for an ILM policy. This action sets the `index.priority` on the managed index to allow different priorities between the hot, warm, and cold recoveries. This commit also includes the HLRC and documentation changes. closes #36905	2019-01-17 09:55:36 -06:00
Martijn van Groningen	b85bfd3e17	Added fatal_exception field for ccr stats in monitoring mapping. (#37563 )	2019-01-17 14:04:41 +01:00
Martijn van Groningen	99b09845da	Moved ccr integration to the package with other ccr integration tests.	2019-01-17 13:57:56 +01:00
Marios Trivyzas	1686c32ba9	SQL: Rename SQL type DATE to DATETIME (#37395 ) * SQL: Rename SQL data type DATE to DATETIME SQL data type DATE has only the date part (e.g.: 2019-01-14) without any time information. Previously the SQL type DATE was referring to the ES DATE which contains also the time part along with TZ information. To conform with SQL data types the data type `DATE` is renamed to `DATETIME`, since it includes also the time, as a new runtime SQL `DATE` data type will be introduced down the road, which only contains the date part and meets the SQL standard. Closes: #36440 * Address comments	2019-01-17 10:17:58 +02:00
Marios Trivyzas	ecf0de30ba	SQL: Lowercase the datatypes in validation error msgs (#37524 ) To follow the ES convention display the datatypes in lowercase in error messages thrown during validation if `IN` and conditional functions.	2019-01-16 18:41:10 +02:00
Marios Trivyzas	2cf4b1863f	SQL: Lowercase es data type (mapping) returned from SQL Commands (#37531 ) To follow the ES convention, convert the es data type, returned as column `mapping` from SQL Commands, to lowercase. Fixes: #37521	2019-01-16 18:08:33 +02:00
Costin Leau	19d2c29ca6	Remove SYS CATALOGS leftover Relates #37506	2019-01-16 17:28:06 +02:00
Costin Leau	1f76b5fc31	SQL: Describe aliases as views (#37496 ) When reporting metadata, several clients have issues with the 'ALIAS' type. To improve compatibility and be consistent with the ANSI SQL expectations and because they are similar, aliases targets are now reported as views. Close #37422	2019-01-16 17:26:00 +02:00
Tim Brooks	0b5af276a8	Allow system privilege to execute proxied actions (#37508 ) Currently all proxied actions are denied for the `SystemPrivilege`. Unfortunately, there are use cases (CCR) where we would like to proxy actions to a remote node that are normally performed by the system context. This commit allows the system context to perform proxy actions if they are actions that the system context is normally allowed to execute.	2019-01-16 07:52:38 -07:00
Andrei Stefan	659326fdd6	SQL: Add protocol tests and remove jdbc_type from drivers response (#37516 )	2019-01-16 16:28:46 +02:00
David Kyle	75410dc632	[Ml] Prevent config snapshot failure blocking migration (#37493 )	2019-01-16 11:51:15 +00:00
Costin Leau	023bb2f1e4	SQL: Remove slightly used meta commands (#37506 ) Remove SYS CATALOGS and SYS TABLE TYPES as they are a subset of SYS TABLES (and thus somewhat redundant) and used only by JDBC. Close #37409	2019-01-16 12:36:35 +02:00
Przemyslaw Gomulka	5e94f384c4	Remove the use of AbstracLifecycleComponent constructor #37488 (#37488 ) The AbstracLifecycleComponent used to extend AbstractComponent, so it had to pass settings to the constractor of its supper class. It no longer extends the AbstractComponent so there is no need for this constructor There is also no need for AbstracLifecycleComponent subclasses to have Settings in their constructors if they were only passing it over to super constructor. This is part 1. which will be backported to 6.x with a migration guide/deprecation log. part 2 will have this constructor removed in 7 relates #35560 relates #34488	2019-01-16 09:05:30 +01:00
Hendrik Muhs	15d1b904a1	[ML] log minimum diskspace setting if forecast fails due to insufficient d… (#37486 ) log minimum disk space setting if forecast fails due to insufficient disk space	2019-01-16 08:10:13 +01:00
Jay Modi	987576b013	Consistently use loopback address for ssl profile (#37487 ) This change fixes failures in the SslMultiPortTests where we attempt to connect to a profile on a port it is listening on but the connection fails. The failure is due to the profile being bound to multiple addresses and randomization will pick one of these addresses to determine the listening port. However, the address we get the port for may not be the address we are actually connecting to. In order to resolve this, the test now sets the bind host for profiles to the loopback address and uses the same address for connecting. Closes #37481	2019-01-15 14:03:21 -07:00
David Kyle	bea46f7b52	[ML] Migrate unallocated jobs and datafeeds (#37430 ) Migrate ml job and datafeed config of open jobs and update the parameters of the persistent tasks as they become unallocated during a rolling upgrade. Block allocation of ml persistent tasks until the configs are migrated.	2019-01-15 18:21:39 +00:00
David Kyle	7c11b05c28	[ML] Remove unused code from the JIndex project (#37477 )	2019-01-15 17:19:58 +00:00
Martijn van Groningen	9554b2fecb	When removing an AutoFollower also mark it as removed. (#37402 ) Currently when there are no more auto follow patterns for a remote cluster then the AutoFollower instance for this remote cluster will be removed. If a new auto follow pattern for this remote cluster gets added quickly enough after the last delete then there may be two AutoFollower instance running for this remote cluster instead of one. Each AutoFollower instance stops automatically after it sees in the start() method that there are no more auto follow patterns for the remote cluster it is tracking. However when an auto follow pattern gets removed and then added back quickly enough then old AutoFollower may never detect that at some point there were no auto follow patterns for the remote cluster it is monitoring. The creation and removal of an AutoFollower instance happens independently in the `updateAutoFollowers()` as part of a cluster state update. By adding the `removed` field, an AutoFollower instance will not miss the fact there were no auto follow patterns at some point in time. The `updateAutoFollowers()` method now marks an AutoFollower instance as removed when it sees that there are no more patterns for a remote cluster. The updateAutoFollowers() method can then safely start a new AutoFollower instance. Relates to #36761	2019-01-15 16:24:19 +01:00
Jay Modi	a56aa4f076	Remove SslNullCipherTests from codebase (#37431 ) This change deletes the SslNullCipherTests from our codebase since it will have issues with newer JDK versions and it is essentially testing JDK functionality rather than our own. The upstream JDK issue for disabling these ciphers by default is https://bugs.openjdk.java.net/browse/JDK-8212823. Closes #37403	2019-01-15 07:52:58 -07:00
David Roberts	7cdf7f882b	[ML] Fix ML datafeed CCS with wildcarded cluster name (#37470 ) The test that remote clusters used by ML datafeeds have a license that allows ML was not accounting for the possibility that the remote cluster name could be wildcarded. This change fixes that omission. Fixes #36228	2019-01-15 14:19:05 +00:00
Tanguy Leroux	e848388865	Fix SourceOnlySnapshotIT (#37461 ) The SourceOnlySnapshotIT class tests a source only repository using the following scenario: starts a master node starts a data node creates a source only repository creates an index with documents snapshots the index to the source only repository deletes the index stops the data node starts a new data node restores the index Thanks to ESIntegTestCase the index is sometimes created using a custom data path. With such a setting, when a shard is assigned to one of the data node of the cluster the shard path is resolved using the index custom data path and the node's lock id by the NodeEnvironment#resolveCustomLocation(). It should work nicely but in SourceOnlySnapshotIT.snashotAndRestore(), b efore the change in this PR, the last data node was restarted using a different path.home. At startup time this node was assigned a node lock based on other locks in the data directory of this temporary path.home which is empty. So it always got the 0 lock id. And when this new data node is assigned a shard for the index and resolves it against the index custom data path, it also uses the node lock id 0 which conflicts with another node of the cluster, resulting in various errors with the most obvious one being LockObtainFailedException. This commit removes the temporary home path for the last data node so that it uses the same path home as other nodes of the cluster and then got assigned a correct node lock id at startup. Closes #36330 Closes #36276	2019-01-15 15:03:09 +01:00
Marios Trivyzas	b594e81c86	SQL: Fix issue with field names containing "." (#37364 ) Adjust FieldExtractor to handle fields which contain `.` in their name, regardless where they fall in, in the document hierarchy. E.g.: ``` { "a.b": "Elastic Search" } { "a": { "b.c": "Elastic Search" } } { "a.b": { "c": { "d.e" : "Elastic Search" } } } ``` Fixes: #37128	2019-01-15 09:41:41 +02:00
Jason Tedor	3bc0711b90	Add simple method to write collection of writeables (#37448 ) This commit adds a simple convenience method for writing a collection of writeables, and replaces existing call sites with the new method.	2019-01-14 21:28:28 -05:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Jay Modi	f3edbe2911	Security: remove SSL settings fallback (#36846 ) This commit removes the fallback for SSL settings. While this may be seen as a non user friendly change, the intention behind this change is to simplify the reasoning needed to understand what is actually being used for a given SSL configuration. Each configuration now needs to be explicitly specified as there is no global configuration or fallback to some other configuration. Closes #29797	2019-01-14 14:06:22 -07:00
Shaunak Kashyap	b86621c157	Adding mapping for hostname field (#37288 ) This new `hostname` field is meant to be a replacement for its sibling `name` field. See https://github.com/elastic/beats/pull/9943, particularly https://github.com/elastic/beats/pull/9943#discussion_r245932581. This PR simply adds the new field (`hostname`) to the mapping without removing the old one (`name`), because a user might be running an older-version Beat (without this field rename in it) with a newer-version Monitoring ES cluster (with this PR's change in it). AFAICT the Monitoring UI isn't currently using the `name` field so no changes are necessary there yet. If it decides to start using the `name` field, it will also want to look at the value of the `hostname` field.	2019-01-14 12:41:10 -08:00
Tim Brooks	5c68338a1c	Implement ccr file restore (#37130 ) This is related to #35975. It implements a file based restore in the CcrRepository. The restore transfers files from the leader cluster to the follower cluster. It does not implement any advanced resiliency features at the moment. Any request failure will end the restore.	2019-01-14 13:07:55 -07:00
David Kyle	2ee55a50bf	[ML] Use String rep of Version in map for serialisation (#37416 )	2019-01-14 16:39:47 +00:00
Martijn van Groningen	de852765d6	unmuted test Relates to #37014	2019-01-14 14:27:42 +01:00
Ioannis Kakavas	374e24c7fd	Mute SslNullCipherTests on JDK12 JDK12 doesn't support NULL cipher for TLS by default. This commit mutes these tests on JDK12 until we decide whether we need to keep or remove them	2019-01-14 10:50:24 +02:00
Albert Zaharovits	6fd57d90da	Security Audit includes HTTP method for requests (#37322 ) Adds another field, named "request.method", to the structured logfile audit. This field is present for all events associated with a REST request (not a transport request) and the value is one of GET, POST, PUT, DELETE, OPTIONS, HEAD, PATCH, TRACE and CONNECT.	2019-01-13 15:26:23 +02:00
Costin Leau	a4339ec7e9	SQL: Use declared source for error messages (#37161 ) Improve error messages by returning the original SQL statement declaration instead of trying to reproduce it as the casing and whitespaces are not preserved accurately leading to small differences. Close #37161	2019-01-13 01:40:22 +02:00
Marios Trivyzas	359222c55c	SQL: Make `FULL` non-reserved keyword in the grammar (#37377 ) Since `full` can be common as a field name or part of a field name (e.g.: `full.name` or `name.full`), it's nice if it's not a reserved keyword of the grammar so a user can use it without resorting to quotes. Fixes: #37376	2019-01-11 23:08:00 +02:00
Marios Trivyzas	85531f0285	SQL: [Tests] Fix and enable internalClusterTests (#37300 ) SqlPlugin cannot have more than one public constructor, so for the testing purposes the `getLicenseState()` should be overriden. Fixes: #37191 Co-authored-by: Michael Basnight <mbasnight@gmail.com>	2019-01-11 22:43:17 +02:00
Benjamin Trent	5101e51891	ML: Fix testMigrateConfigs (#37373 ) * ML: :s/execute/get * Fixing other broken tests * unmuting test	2019-01-11 13:29:30 -06:00
Zachary Tong	de52ba1f78	Fix RollupDocumentation test to wait for job to stop Also adds some extra state debug information to various log messages	2019-01-11 14:14:58 -05:00
Gordon Brown	827ece73c8	Mute MlConfigMigratorIT.testMigrateConfigs (#37374 )	2019-01-11 11:11:58 -07:00
Gordon Brown	955d3aea19	Mute testRoundRobinWithFailures (#32190 )	2019-01-11 09:38:40 -07:00
David Roberts	953fb9352f	[ML] Update error message for process update (#37363 ) When this message was first added the model debug config was the only thing that could be updated, but now more aspects of the config can be updated so the message needs to be more general.	2019-01-11 16:31:55 +00:00
Martijn van Groningen	e4391afd98	Test fix, wait for auto follower to have stopped in the background Relates to #36761	2019-01-11 17:26:17 +01:00
Benjamin Trent	19a7e0f4eb	ML: update .ml-state actions to support > 1 index (#37307 ) * ML: Updating .ml-state calls to be able to support > 1 index * Matching bulk delete behavior with dbq * Adjusting state name * refreshing indices before search * fixing line length * adjusting index expansion options	2019-01-11 08:03:41 -06:00
David Roberts	1da59db3fb	[ML] Wait for autodetect to be ready in the datafeed (#37349 ) This is a reinforcement of #37227. It turns out that persistent tasks are not made stale if the node they were running on is restarted and the master node does not notice this. The main scenario where this happens is when minimum master nodes is the same as the number of nodes in the cluster, so the cluster cannot elect a master node when any node is restarted. When an ML node restarts we need the datafeeds for any jobs that were running on that node to not just wait until the jobs are allocated, but to wait for the autodetect process of the job to start up. In the case of reassignment of the job persistent task this was dealt with by the stale status test. But in the case where a node restarts but its persistent tasks are not reassigned we need a deeper test. Fixes #36810	2019-01-11 13:22:35 +00:00
Alexander Reelsen	bbd093059f	Add whitelist to watcher HttpClient (#36817 ) This adds a configurable whitelist to the HTTP client in watcher. By default every URL is allowed to retain BWC. A dynamically configurable setting named "xpack.http.whitelist" was added that allows to configure an array of URLs, which can also contain simple regexes. Closes #29937	2019-01-11 09:22:47 +01:00
markharwood	434430506b	Type removal - added deprecation warnings to _bulk apis (#36549 ) Added warnings checks to existing tests Added “defaultTypeIfNull” to DocWriteRequest interface so that Bulk requests can override a null choice of document type with any global custom choice. Related to #35190	2019-01-10 21:35:19 +00:00
Jay Modi	e6d3d85db4	Ensure latch is counted down in ssl reload test (#37313 ) This change ensures we always countdown the latch in the SSLConfigurationReloaderTests to prevent the suite from timing out in case of an exception. Additionally, we also increase the logging of the resource watcher in case an IOException occurs. See #36053	2019-01-10 13:27:25 -07:00
Costin Leau	83f7423cd6	SQL: Fix bug regarding alias fields with dots (#37279 ) Field of types aliases that have dots in name are returned without a hierarchy by field_caps, as oppose to the mapping api or field with concrete types, which in turn breaks IndexResolver. This commit fixes this by creating the backing hierarchy similar to the mapping api. Close #37224	2019-01-10 22:18:53 +02:00
David Roberts	b65006e8cd	[ML] Fix ML memory tracker for old jobs (#37311 ) Jobs created in version 6.1 or earlier can have a null model_memory_limit. If these are parsed from cluster state following a full cluster restart then we replace the null with 4096mb to make the meaning explicit. But if such jobs are streamed from an old node in a mixed version cluster this does not happen. Therefore we need to account for the possibility of a null model_memory_limit in the ML memory tracker.	2019-01-10 17:28:00 +00:00
Jay Modi	71633775fd	Security: reorder realms based on last success (#36878 ) This commit reorders the realm list for iteration based on the last successful authentication for the given principal. This is an optimization to prevent unnecessary iteration over realms if we can make a smart guess on which realm to try first.	2019-01-10 09:06:16 -07:00
Martijn van Groningen	6d81e7c3e7	[CCR] FollowingEngine should fail with 403 if operation has no seqno assigned (#37213 ) Fail with a 403 when indexing a document directly into a follower index. In order to test this change, I had to move specific assertions into a dedicated class and disable assertions for that class in the rest qa module. I think that is the right trade off.	2019-01-10 15:54:34 +01:00
Martijn van Groningen	df488720e0	[CCR] Make shard follow tasks more resilient for restarts (#37239 ) If a running shard follow task needs to be restarted and the remote connection seeds have changed then a shard follow task currently fails with a fatal error. The change creates the remote client lazily and adjusts the errors a shard follow task should retry. This issue was found in test failures in the recently added ccr rolling upgrade tests. The reason why this issue occurs more frequently in the rolling upgrade test is because ccr is setup in local mode (so remote connection seed will become stale) and all nodes are restarted, which forces the shard follow tasks to get restarted at some point during the test. Note that these tests cannot be enabled yet, because this change will need to be backported to 6.x first. (otherwise the issue still occurs on non upgraded nodes) I also changed the RestartIndexFollowingIT to setup remote cluster via persistent settings and to also restart the leader cluster. This way what happens during the ccr rolling upgrade qa tests, also happens in this test. Relates to #37231	2019-01-10 15:02:30 +01:00
Alpar Torok	3d66764660	Mute watcher SingleNodeTests Tracking: #36782	2019-01-10 12:23:29 +02:00

... 4 5 6 7 8 ...

2510 Commits