OpenSearch

Commit Graph

Author	SHA1	Message	Date
Mark Vieira	e44b8b1e2e	[Backport] Remove dependency substitutions 7.x (#42866 ) * Remove unnecessary usage of Gradle dependency substitution rules (#42773) (cherry picked from commit 12d583dbf6f7d44f00aa365e34fc7e937c3c61f7)	2019-06-04 13:50:23 -07:00
Tim Vernum	928f49992f	Don't require TLS for single node clusters (#42830 ) This commit removes the TLS cluster join validator. This validator existed to prevent v6.x nodes (which mandated TLS) from joining an existing cluster of v5.x nodes (which did not mandate TLS) unless the 6.x node (and by implication the 5.x nodes) was configured to use TLS. Since 7.x nodes cannot talk to 5.x nodes, this validator is no longer needed. Removing the validator solves a problem where single node clusters that were bound to local interfaces were incorrectly requiring TLS when they recovered cluster state and joined their own cluster. Backport of: #42826	2019-06-04 19:48:37 +10:00
Tim Vernum	8de3a88205	Log the status of security on license change (#42741 ) Whether security is enabled/disabled is dependent on the combination of the node settings and the cluster license. This commit adds a license state listener that logs when the license change causes security to switch state (or to be initialised). This is primarily useful for diagnosing cluster formation issues. Backport of: #42488	2019-06-04 14:25:43 +10:00
Tim Vernum	9035e61825	Detect when security index is closed (#42740 ) If the security index is closed, it should be treated as unavailable for security purposes. Prior to 8.0 (or in a mixed cluster) a closed security index has no routing data, which would cause a NPE in the cluster change handler, and the index state would not be updated correctly. This commit fixes that problem. Backport of: #42191	2019-06-04 14:25:20 +10:00
Alan Woodward	2129d06643	Create client-only AnalyzeRequest/AnalyzeResponse classes (#42197 ) This commit clones the existing AnalyzeRequest/AnalyzeResponse classes to the high-level rest client, and adjusts request converters to use these new classes. This is a prerequisite to removing the Streamable interface from the internal server version of these classes.	2019-06-03 09:46:36 +01:00
Mark Vieira	c1816354ed	[Backport] Improve build configuration time (#42674 )	2019-05-30 10:29:42 -07:00
Jay Modi	711de2f59a	Make hashed token ids url safe (#42651 ) This commit changes the way token ids are hashed so that the output is url safe without requiring encoding. This follows the pattern that we use for document ids that are autogenerated, see UUIDs and the associated classes for additional details.	2019-05-30 10:44:41 -06:00
Ioannis Kakavas	7cabe8acc9	Fix refresh remote JWKS logic (#42662 ) This change ensures that: - We only attempt to refresh the remote JWKS when there is a signature related error ( BadJWSException instead of the generic BadJOSEException ) - We do call OpenIDConnectAuthenticator#getUserClaims upon successful refresh. - We test this in OpenIdConnectAuthenticatorTests. Without this fix, when using the OpenID Connect realm with a remote JWKSet configured in `op.jwks_path`, the refresh would be triggered for most configuration errors ( i.e. wrong value for `op.issuer` ) and Kibana wouldn't get a response and would time out since `getUserClaims` wouldn't be called because `ReloadableJWKSource#reloadAsync` wouldn't call `onResponse` on the future.	2019-05-30 18:08:30 +03:00
Ioannis Kakavas	24a794fd6b	Fix testTokenExpiry flaky test (#42585 ) Test was using ClockMock#rewind passing the amount of nanoseconds in order to "strip" nanos from the time value. This was intentional as the expiration time of the UserToken doesn't have nanosecond precision. However, ClockMock#rewind doesn't support nanos either, so when it's called with a TimeValue, it rewinds the clock by the TimeValue's millis instead. This was causing the clock to go enough millis before token expiration time and the test was passing. Once every few hundred times though, the TimeValue by which we attempted to rewind the clock only had nanos and no millis, so rewind moved the clock back just a few millis, but still after expiration time. This change moves the clock explicitly to the same instant as expiration, using clock.setTime and disregarding nanos.	2019-05-30 07:53:56 +03:00
Armin Braun	a96606d962	Safer Wait for Snapshot Success in ClusterPrivilegeTests (#40943 ) (#42575 ) * Safer Wait for Snapshot Success in ClusterPrivilegeTests * The snapshot state returned by the API might become SUCCESS before it's fully removed from the cluster state. * We should fix this race in the transport API but it's not trivial and will be part of the incoming big round of refactoring the repository interaction, this added check fixes the test for now * closes #38030	2019-05-27 12:08:20 +02:00
Ryan Ernst	a49bafc194	Split document and metadata fields in GetResult (#38373 ) (#42456 ) This commit makes creators of GetField split the fields into document fields and metadata fields. It is part of larger refactoring that aims to remove the calls to static methods of MapperService related to metadata fields, as discussed in #24422.	2019-05-23 14:01:07 -07:00
Ioannis Kakavas	aab97f1311	Fail early when rp.client_secret is missing in OIDC realm (#42256 ) rp.client_secret is a required secure setting. Make sure we fail with a SettingsException and a clear, actionable message when building the realm, if the setting is missing.	2019-05-22 13:20:41 +03:00
Ioannis Kakavas	ccdc0e6b3e	Merge claims from userinfo and ID Token correctly (#42277 ) Enhance the handling of merging the claims sets of the ID Token and the UserInfo response. JsonObject#merge would throw a runtime exception when attempting to merge two objects with the same key and different values. This could happen for an OP that returns different values for the same claim in the ID Token and the UserInfo response ( Google does that for profile claim ). If a claim is contained in both sets, we attempt to merge the values if they are objects or arrays, otherwise the ID Token claim value takes precedence and overwrites the userinfo response.	2019-05-22 13:15:41 +03:00
Ioannis Kakavas	7af30345b4	Revert "mute failing filerealm hash caching tests (#42304 )" This reverts commit `39fbed1577`.	2019-05-22 13:15:00 +03:00
Ioannis Kakavas	34dda75cdf	Ensure SHA256 is not used in tests (#42289 ) SHA256 was recently added to the Hasher class in order to be used in the TokenService. A few tests were still using values() to get the available algorithms from the Enum and it could happen that SHA256 would be picked up by these. This change adds an extra convenience method (Hasher#getAvailableAlgoCacheHash) and ensures that only this and Hasher#getAvailableAlgoStoredHash are used for getting the list of available password hashing algorithms in our tests.	2019-05-22 09:54:24 +03:00
Tim Vernum	c5f191f6af	Add cluster restart for security on basic (#42217 ) This performs a simple restart test to move a basic licensed cluster from no security (the default) to security & transport TLS enabled. Backport of: #41933	2019-05-22 14:27:45 +10:00
Tal Levy	39fbed1577	mute failing filerealm hash caching tests (#42304 ) some tests are failing after the introduction of #41792. relates #42267 and #42289.	2019-05-21 10:40:14 -07:00
Tim Vernum	7b3a9c7033	Do not refresh realm cache unless required (#42212 ) If there are no realms that depend on the native role mapping store, then changes should not trigger any cache refresh. A refresh with an empty realm array will refresh all realms. This also fixes a spurious log warning that could occur if the role mapping store was notified that the security index was recovered before any realms were attached. Backport of: #42169	2019-05-21 18:14:22 +10:00
Ioannis Kakavas	b4a413c4d0	Hash token values for storage (#41792 ) (#42220 ) This commit changes how access tokens and refresh tokens are stored in the tokens index. Access token values are now hashed before being stored in the id field of the `user_token` and before becoming part of the token document id. Refresh token values are hashed before being stored in the token field of the `refresh_token`. The tokens are hashed without a salt value since these are v4 UUID values that have enough entropy themselves. Both rainbow table attacks and offline brute force attacks are impractical. As a side effect of this change and in order to support multiple concurrent refreshes as introduced in #39631, upon refreshing an <access token, refresh token> pair, the superseding access token and refresh tokens values are stored in the superseded token doc, encrypted with a key that is derived from the superseded refresh token. As such, subsequent requests to refresh the same token in the predefined time window will return the same superseding access token and refresh token values, without hitting the tokens index (as this only stores hashes of the token values). AES in GCM mode is used for encrypting the token values and the key derivation from the superseded refresh token uses a small number of iterations as it needs to be quick. For backwards compatibility reasons, the new behavior is only enabled when all nodes in a cluster are in the required version so that old nodes can cope with the token values in a mixed cluster during a rolling upgrade.	2019-05-20 17:55:29 +03:00
Jay Modi	dbbdcea128	Update ciphers for TLSv1.3 and JDK11 if available (#42082 ) This commit updates the default ciphers and TLS protocols that are used when the runtime JDK supports them. New cipher support has been introduced in JDK 11 and 12 along with performance fixes for AES GCM. The ciphers are ordered with PFS ciphers being most preferred, then AEAD ciphers, and finally those with mainstream hardware support. When available stronger encryption is preferred for a given cipher. This is a backport of #41385 and #41808. There are known JDK bugs with TLSv1.3 that have been fixed in various versions. These are: 1. The JDK's bundled HttpsServer will endless loop under JDK11 and JDK 12.0 (Fixed in 12.0.1) based on the way the Apache HttpClient performs a close (half close). 2. In all versions of JDK 11 and 12, the HttpsServer will endless loop when certificates are not trusted or another handshake error occurs. An email has been sent to the openjdk security-dev list and #38646 is open to track this. 3. In JDK 11.0.2 and prior there is a race condition with session resumption that leads to handshake errors when multiple concurrent handshakes are going on between the same client and server. This bug does not appear when client authentication is in use. This is JDK-8213202, which was fixed in 11.0.3 and 12.0. 4. In JDK 11.0.2 and prior there is a bug where resumed TLS sessions do not retain peer certificate information. This is JDK-8212885. The way these issues are addressed is that the current java version is checked and used to determine the supported protocols for tests that provoke these issues.	2019-05-20 09:45:36 -04:00
Ryan Ernst	fa1d1d1f57	Deprecate the native realm migration tool (#42142 ) The migrate tool was added when the native realm was created, to aid users in converting from file realms that were per node, into the cluster managed native realm. While this tool was useful at the time, users should now be using the native realm directly. This commit deprecates the tool, to be removed in a followup for 8.0.	2019-05-16 09:52:31 -04:00
Tim Vernum	9191b02213	Enforce transport TLS on Basic with Security (#42150 ) If a basic license enables security, then we should also enforce TLS on the transport interface. This was already the case for Standard/Gold/Platinum licenses. For Basic, security defaults to disabled, so some of the process around checking whether security is actually enabled is more complex now that we need to account for basic licenses.	2019-05-15 13:59:27 -04:00
David Kyle	c0d67919c8	Mute ApiKeyIntegTests See https://github.com/elastic/elasticsearch/issues/41747	2019-05-09 13:24:52 +01:00
Ioannis Kakavas	58041f3fdb	Remove op.name configuration setting (#41445 ) This setting was not eventually used in the realm and thus can be removed	2019-05-07 19:01:55 +03:00
Tim Vernum	3508b6c641	Log warning when unlicensed realms are skipped (#41828 ) Because realms are configured at node startup, but license levels can change dynamically, it is possible to have a running node that has a particular realm type configured, but that realm is not permitted under the current license. In this case the realm is silently ignored during authentication. This commit adds a warning in the elasticsearch logs if authentication fails, and there are realms that have been skipped due to licensing. This message is not intended to imply that the realms could (or would) have successfully authenticated the user, but they may help reduce confusion about why authentication failed if the caller was expecting the authentication to be handled by a particular realm that is in fact unlicensed. Backport of: #41778	2019-05-07 09:55:48 +10:00
Ryan Ernst	6fd8924c5a	Switch run task to use real distro (#41590 ) The run task is supposed to run elasticsearch with the given plugin or module. However, for modules, this is most realistic if using the full distribution. This commit changes the run setup to use the default or oss as appropriate.	2019-05-06 12:34:07 -07:00
Tim Brooks	927013426a	Read multiple TLS packets in one read call (#41820 ) This is related to #27260. Currently we have a single read buffer that is no larger than a single TLS packet. This prevents us from reading multiple TLS packets in a single socket read call. This commit modifies our TLS work to support reading similar to the plaintext case. The data will be copied to a (potentially) recycled TLS packet-sized buffer for interaction with the SSLEngine.	2019-05-06 09:51:32 -06:00
Jason Tedor	d0f071236a	Simplify filtering addresses on interfaces (#41758 ) This commit is a refactoring of how we filter addresses on interfaces. In particular, we refactor all of these methods into a common private method. We also change the order of logic to first check if an address matches our filter and then check if the interface is up. This is to possibly avoid problems we are seeing where devices are flapping up and down while we are checking for loopback addresses. We do not expect the loopback device to flap up and down so by reversing the logic here we avoid that problem on CI machines. Finally, we expand the error message when this does occur so that we know which device is flapping.	2019-05-02 16:36:27 -04:00
Tim Brooks	b4bcbf9f64	Support http read timeouts for transport-nio (#41466 ) This is related to #27260. Currently there is a setting http.read_timeout that allows users to define a read timeout for the http transport. This commit implements support for this functionality with the transport-nio plugin. The behavior here is that a repeating task will be scheduled for the interval defined. If there have been no requests received since the last run and there are no inflight requests, the channel will be closed.	2019-05-02 09:48:52 -06:00
Jason Tedor	0870523489	Fix compilation in SecurityMocks This commit fixes compilation in SecurityMocks from what appears to be some merge conflicts that were not resolved adequately.	2019-05-01 14:29:33 -04:00
Jason Tedor	f500d727cf	Resolve conflicts in AuthenticationServiceTests This commit resolves some merge conflicts that arose in AuthenticationServiceTests after a rebase.	2019-05-01 14:20:58 -04:00
Jason Tedor	942a1445f3	Fix reference to 7.1 in security token tests This version should be referencing 7.2 rather than 7.1, due to some changes in timing of the token service changes.	2019-05-01 14:00:35 -04:00
Ioannis Kakavas	8426130553	Add negative tests for security features in basic Assert that API Keys, Tokens, DLS/FLS do not work in basic	2019-05-01 14:00:32 -04:00
Tim Vernum	3589ca8493	Add test for security on basic license. This is modelled on the qa test for TLS on basic. It starts a cluster on basic with security & performs a number of security related checks. It also performs those same checks on a trial license.	2019-05-01 14:00:29 -04:00
Tim Vernum	0ee16d0115	Security on Basic License This adds support for using security on a basic license. It includes: - AllowedRealmType.NATIVE realms (reserved, native, file) - Roles / RBAC - TLS (already supported) It does not support: - Audit - IP filters - Token Service & API Keys - Advanced realms (AD, LDAP, SAML, etc) - Advanced roles (DLS, FLS) - Pluggable security As with trial licences, security is disabled by default. This commit does not include any new automated tests, but existing tests have been updated.	2019-05-01 14:00:25 -04:00
Jason Tedor	7f3ab4524f	Bump 7.x branch to version 7.2.0 This commit adds the 7.2.0 version constant to the 7.x branch, and bumps BWC logic accordingly.	2019-05-01 13:38:57 -04:00
Albert Zaharovits	990be1f806	Security Tokens moved to a new separate index (#40742 ) This commit introduces the `.security-tokens` and `.security-tokens-7` alias-index pair. Because index snapshotting is at the index level granularity (ie you cannot snapshot a subset of an index) snapshotting `.security` had the undesirable effect of storing ephemeral security tokens. The changes herein address this issue by moving tokens "seamlessly" (without user intervention) to another index, so that a "Security Backup" (ie snapshot of `.security`) would not be bloated by ephemeral data.	2019-05-01 14:53:56 +03:00
Jason Tedor	0b46a62f6b	Drop distinction in entries for keystore (#41701 ) Today we allow adding entries from a file or from a string, yet we internally maintain this distinction such that if you try to add a value from a file for a setting that expects a string or add a value from a string for a setting that expects a file, you will have a bad time. This causes a pain for operators such that for each setting they need to know this difference. Yet, we do not need to maintain this distinction internally as they are bytes after all. This commit removes that distinction and includes logic to upgrade legacy keystores.	2019-05-01 07:02:04 -04:00
Tim Brooks	df3ef66294	Remove dedicated SSL network write buffer (#41654 ) This is related to #27260. Currently for the SSLDriver we allocate a dedicated network write buffer and encrypt the data into that buffer one buffer at a time. This requires constantly switching between encrypting and flushing. This commit adds a dedicated outbound buffer for SSL operations that will internally allocate new packet sized buffers as they are needed (for writing encrypted data). This allows us to totally encrypt an operation before writing it to the network. Eventually it can be hooked up to buffer recycling. This commit also backports the following commit: Handle WRAP ops during SSL read It is possible that a WRAP operation can occur while decrypting handshake data in TLS 1.3. The SSLDriver does not currently handle this well as it does not have access to the outbound buffer during read call. This commit moves the buffer into the Driver to fix this issue. Data wrapped during a read call will be queued for writing after the read call is complete.	2019-04-29 17:59:13 -06:00
David Kyle	1a6ffb2644	Mute ClusterPrivilegeTests.testThatSnapshotAndRestore Tracked in #38030	2019-04-29 16:45:01 +10:00
Yogesh Gaikwad	c0d40ae4ca	Remove deprecated stashWithOrigin calls and use the alternative (#40847 ) (#41562 ) This commit removes the deprecated `stashWithOrigin` and modifies its usage to use the alternative.	2019-04-28 21:25:42 +10:00
Tim Brooks	1f8ff052a1	Revert "Remove dedicated SSL network write buffer (#41283 )" This reverts commit `f65a86c258`.	2019-04-25 18:39:25 -06:00
Tim Brooks	f65a86c258	Remove dedicated SSL network write buffer (#41283 ) This is related to #27260. Currently for the SSLDriver we allocate a dedicated network write buffer and encrypt the data into that buffer one buffer at a time. This requires constantly switching between encrypting and flushing. This commit adds a dedicated outbound buffer for SSL operations that will internally allocate new packet sized buffers as they are needed (for writing encrypted data). This allows us to totally encrypt an operation before writing it to the network. Eventually it can be hooked up to buffer recycling.	2019-04-25 14:30:54 -06:00
Christoph Büscher	52495843cc	[Docs] Fix common word repetitions (#39703 )	2019-04-25 20:47:47 +02:00
Tim Brooks	6d7110edf5	SSLDriver can transition to CLOSED in handshake (#41458 ) TLS 1.3 changes to the SSLEngine introduced a scenario where an UNWRAP call during a handshake can consume a close notify alert without throwing an exception. This means that we continue down a codepath where we assert that we are still in handshaking mode. Transitioning to closed from handshaking is a valid scenario. This commit removes this assertion.	2019-04-25 12:02:17 -06:00
Jim Ferenczi	6184efaff6	Handle unmapped fields in _field_caps API (#34071 ) (#41426 ) Today the `_field_caps` API returns the list of indices where a field is present only if this field has different types within the requested indices. However if the request is an index pattern (or an alias, or both...) there is no way to infer the indices if the response contains only fields that have the same type in all indices. This commit changes the response to always return the list of indices in the response. It also adds a way to retrieve unmapped field in a specific section per field called `unmapped`. This section is created for each field that is present in some indices but not all if the parameter `include_unmapped` is set to true in the request (defaults to false).	2019-04-25 18:13:48 +02:00
Albert Zaharovits	fe5789ada1	Fix Has Privilege API check on restricted indices (#41226 ) The Has Privileges API allows to tap into the authorization process, to validate privileges without actually running the operations to be authorized. This commit fixes a bug, in which the Has Privilege API returned spurious results when checking for index privileges over restricted indices (currently .security, .security-6, .security-7). The actual authorization process is not affected by the bug.	2019-04-25 12:03:27 +03:00
Ryan Ernst	7e3875d781	Upgrade hamcrest to 2.1 (#41464 ) hamcrest has some improvements in newer versions, like FileMatchers that make assertions regarding file exists cleaner. This commit upgrades to the latest version of hamcrest so we can start using new and improved matchers.	2019-04-24 23:40:03 -07:00
Albert Zaharovits	c3e0ae24d3	Fix role mapping DN field wildcards for users with NULL DNs (#41343 ) The `DistinguishedNamePredicate`, used for matching users to role mapping expressions, should handle users with null DNs. But it fails to do so (and this is a NPE bug), if the role mapping expression contains a lucene regexp or a wildcard. The fix simplifies `DistinguishedNamePredicate` to not handle null DNs at all, and instead use the `ExpressionModel#NULL_PREDICATE` for the DN field, just like any other missing user field.	2019-04-22 10:25:24 +03:00
Yogesh Gaikwad	0d1178fca6	put mapping authorization for alias with write-index and multiple read indices (#40834 ) (#41287 ) When the same alias points to multiple indices we can write to only one index with `is_write_index` value `true`. The special handling in case of the put mapping request(to resolve authorized indices) has a check on indices size for a concrete index. If multiple indices existed then it marked the request as unauthorized. The check has been modified to consider write index flag and only when the requested index matches with the one with write index alias, the alias is considered for authorization. Closes #40831	2019-04-17 14:25:33 +10:00
Ioannis Kakavas	fe9442b05b	Add an OpenID Connect authentication realm (#40674 ) (#41178 ) This commit adds an OpenID Connect authentication realm to elasticsearch. Elasticsearch (with the assistance of kibana or another web component) acts as an OpenID Connect Relying Party and supports the Authorization Code Grant and Implicit flows as described in http://ela.st/oidc-spec. It adds support for consuming and verifying signed ID Tokens, both RP initiated and 3rd party initiated Single Sign on and RP initiated single logout. It also adds an OpenID Connect Provider in the idp-fixture to be used for the associated integration tests. This is a backport of #40674	2019-04-15 12:41:16 +03:00
Yogesh Gaikwad	47ba45732d	Find and use non local IPv4 address while testing IP filtering (#40234 ) (#41141 ) For pattern "n:localhost" PatternRule#isLocalhost() matches any local address, loopback address. [Note: I think for "localhost" this should not consider IP address as a match when they are bound to network interfaces. It should just be loopback address check unless the intent is to match all local addresses. This class is adopted from Netty3 and I am not sure if this is intended behavior or maybe I am missing something] For now I have fixed this assuming the PatternRule#isLocalhost check is correct by avoiding use of local address to check address denied. Closes #40194	2019-04-13 04:37:25 +10:00
Martijn van Groningen	1eff8976a8	Deprecate AbstractHlrc* and AbstractHlrcStreamable* base test classes (#41014 ) * moved hlrc parsing tests from xpack to hlrc module and removed dependency on hlrc from xpack core * deprecated old base test class * added deprecated jdoc tag * split test between xpack-core part and hlrc part * added lang-mustache test dependency, this previously came in via hlrc dependency. * added hlrc dependency on a qa module * duplicated ClusterPrivilegeName class in xpack-core, since x-pack core no longer has a dependency on hlrc. * replace ClusterPrivilegeName usages with string literals * moved tests to dedicated to hlrc packages in order to remove Hlrc part from the name and make sure to use imports instead of full qualified class where possible * remove ESTestCase. from method invocation and use method directly, because these tests indirectly extend from ESTestCase	2019-04-10 16:29:17 +02:00
Albert Zaharovits	adf3393a4e	Deprecate permission over aliases (#38059 ) (#41060 ) This PR generates deprecation log entries for each Role Descriptor, used for building a Role, when the Role Descriptor grants more privileges for an alias compared to an index that the alias points to. This is done in preparation for the removal of the ability to define privileges over aliases. There is one log entry for each "role descriptor name"-"alias name" pair. On such a notice, the administrator is expected to modify the Role Descriptor definition so that the name pattern for index names does not cover aliases. Caveats: * Role Descriptors that are not used in any authorization process, either because they are not mapped to any user or the user they are mapped to is not used by clients, are not checked. * Role Descriptors are merged when building the effective Role that is used in the authorization process. Therefore some Role Descriptors can overlap others, so even if one matches aliases in a deprecated way, and it is reported as such, it is not at risk from the breaking behavior in the current role mapping configuration and index-alias configuration. It is still reported because it is a best practice to change its definition, or remove offending aliases.	2019-04-10 15:02:33 +03:00
Mark Vieira	1287c7d91f	[Backport] Replace usages RandomizedTestingTask with built-in Gradle Test (#40978 ) (#40993 ) * Replace usages RandomizedTestingTask with built-in Gradle Test (#40978) This commit replaces the existing RandomizedTestingTask and supporting code with Gradle's built-in JUnit support via the Test task type. Additionally, the previous workaround to disable all tasks named "test" and create new unit testing tasks named "unitTest" has been removed such that the "test" task now runs unit tests as per the normal Gradle Java plugin conventions. (cherry picked from commit 323f312bbc829a63056a79ebe45adced5099f6e6) * Fix forking JVM runner * Don't bump shadow plugin version	2019-04-09 11:52:50 -07:00
Jason Tedor	26d8ecfe07	Fix unsafe publication in opt-out query cache (#40957 ) This opt-out query cache has an unsafe publication issue, where the cache is exposed to another thread (namely the cluster state update thread) before the constructor has finished execution. This exposes the opt-out query cache to concurrency bugs. This commit addresses this by ensuring that the opt-out query cache is not registered as a listener for license state changes until after the constructor has returned.	2019-04-08 16:11:20 -04:00
Mark Vieira	2569fb60de	Avoid sharing source directories as it breaks intellij (#40877 ) * Avoid sharing source directories as it breaks intellij * Subprojects share main project output classes directory * Fix jar hell * Fix sql security with ssl integ tests * Relax dependency ordering rule so we don't explode on cycles	2019-04-08 17:26:46 +03:00
Tim Vernum	26c63e0115	Add test for HTTP and Transport TLS on basic license (#40932 ) This adds a new security/qa test for TLS on a basic license. It starts a 2 node cluster with a basic license, and TLS enabled on both HTTP and Transport, and verifies the license type, x-pack SSL usage and SSL certificates API. It also upgrades the cluster to a trial license and performs that same set of checks (to ensure that clusters with basic license and TLS enabled can be upgraded to a higher feature license) Backport of: #40714	2019-04-08 13:23:12 +10:00
Jay Modi	f34663282c	Update apache httpclient to version 4.5.8 (#40875 ) This change updates our version of httpclient to version 4.5.8, which contains the fix for HTTPCLIENT-1968, which is a bug where the client started re-writing paths that contained encoded reserved characters with their unreserved form.	2019-04-05 13:48:10 -06:00
Martijn van Groningen	809a5f13a4	Make -try xlint warning disabled by default. (#40833 ) Many gradle projects specifically use the -try exclude flag, because there are many cases where auto-closeable resource ignore is never referenced in body of corresponding try statement. Suppressing this warning specifically in each case that it happens using `@SuppressWarnings("try")` would be very verbose. This change removes `-try` from any gradle project and adds it to the build plugin. Also this change removes exclude flags from gradle projects that are already specified in the build plugin (for example -deprecation). Relates to #40366	2019-04-05 08:02:26 +02:00
Tim Vernum	1a30ab22fb	Show SSL usage when security is not disabled (#40761 ) It is possible to have SSL enabled but security disabled if security was dynamically disabled by the license type (e.g. trial license). e.g. In the following configuration: xpack.license.self_generated.type: trial # xpack.security not set, default to disabled on trial xpack.security.transport.ssl.enabled: true The security feature will be reported as available: true enabled: false And in this case, SSL will be active even though security is not enabled. This commit causes the X-Pack feature usage to report the state of the "ssl" features unless security was explicitly disabled in the settings. Backport of: #40672	2019-04-04 14:40:15 +11:00
Tim Vernum	2c770ba3cb	Support mustache templates in role mappings (#40571 ) This adds a new `role_templates` field to role mappings that is an alternative to the existing roles field. These templates are evaluated at runtime to determine which roles should be granted to a user. For example, it is possible to specify: "role_templates": [ { "template":{ "source": "_user_{{username}}" } } ] which would mean that every user is assigned to their own role based on their username. You may not specify both roles and role_templates in the same role mapping. This commit adds support for templates to the role mapping API, the role mapping engine, the Java high level rest client, and Elasticsearch documentation. Due to the lack of caching in our role mapping store, it is currently inefficient to use a large number of templated role mappings. This will be addressed in a future change. Backport of: #39984, #40504	2019-04-02 20:55:10 +11:00
Tim Vernum	7bdd41399d	Support roles with application privileges against wildcard applications (#40675 ) This commit introduces 2 changes to application privileges: - The validation rules now accept a wildcard in the "suffix" of an application name. Wildcards were always accepted in the application name, but the "valid filename" check for the suffix incorrectly prevented the use of wildcards there. - A role may now be defined against a wildcard application (e.g. kibana-*) and this will be correctly treated as granting the named privileges against all named applications. This does not allow wildcard application names in the body of a "has-privileges" check, but the "has-privileges" check can test concrete application names against roles with wildcards. Backport of: #40398	2019-04-02 14:48:39 +11:00
Yannick Welsch	64b31f44af	No mapper service and index caches for replicated closed indices (#40423 ) Replicated closed indices can't be indexed into or searched, and therefore don't need a shard with full indexing and search capabilities allocated. We can save on a lot of heap memory for those indices by not allocating a mapper service and caching infrastructure (which preallocates a constant amount per instance). Before this change, a 1GB ES instance could host 250 replicated closed metricbeat indices (each index with one shard). After this change, the same instance can host 7300 replicated closed metricbeat instances (not that this would be a recommended configuration). Most of the remaining memory is in the cluster state and the IndexSettings object.	2019-03-27 19:04:24 +01:00
Albert Zaharovits	2f80b7304f	Refactor Token Service (#39808 ) This refactoring is in the context of the work related to moving security tokens to a new index. In that regard, the Token Service has to work with token documents stored in either of the two indices, albeit only as a transient situation. I reckoned the added complexity as unmanageable, hence this refactoring. This is incomplete, as it fails to address the goal of minimizing .security accesses, but I have stopped because otherwise it would've become a full blown rewrite (if not already). I will follow up with more targeted PRs. In addition to being a true refactoring, some 400 errors moved to 500. Furthermore, more stringent validation of various return results has been implemented, notably that of the token document creation.	2019-03-21 15:55:56 +02:00
Yogesh Gaikwad	5d30df5a60	Fix so non super users can also create API keys (#40028 ) (#40286 ) When creating API keys we check whether an API key with the same key name already exists and fail the request if it does. The check should have been performed with XPackSecurityUser instead of the authenticated user. This caused the request to fail when a non-super user tried to create an API key. This commit fixes this by executing the search action with SECURITY_ORIGIN so it can be executed with XPackSecurityUser. Also fixed the Rest test to avoid using a user with `super_user` role. Closes #40029	2019-03-21 15:53:25 +11:00
Yannick Welsch	1d8b5fc658	Fail command-line client's auto-URL detection with helpful message (#40151 ) The setup-passwords tool gives cryptic messages in case where custom discovery providers are used (see #33580). As the URL auto-detection logic should be seen as best effort, this commit improves the exception message to make it clearer what needs to be done to fix the issue. Relates #33580	2019-03-19 09:04:14 +01:00
Albert Zaharovits	124de8d938	Un-hardcode SecurityIndexManager to handle generic indices (#40064 ) `SecurityIndexManager` is hardcoded to handle only the `.security`-`.security-7` alias-index pair. This commit removes the hardcoded bits, so that the `SecurityIndexManager` can be reused for other indices, such as the planned security tokens index (`.security-tokens-7`).	2019-03-17 14:46:16 +02:00
Albert Zaharovits	1b75ee0bd7	AuditTrail correctly handle ReplicatedWriteRequest (#39925 ) This fix deduplicates index names in `BulkShardRequests` and only audits the specific resolved index for every comprising `BulkItemRequest`.	2019-03-17 13:05:26 +02:00
Jason Tedor	d02bca1314	Upgrade the bouncycastle dependency to 1.61 (#40017 ) This commit upgrades the bouncycastle dependency from 1.59 to 1.61.	2019-03-14 08:54:47 -04:00
Michael Basnight	8c78fc096d	More lenient socket binding in LDAP tests (#39864 ) The LDAP tests attempt to bind all interfaces, but if for some reason an interface can't be bound the tests will stall until the suite times out. This modifies the tests to be a bit more lenient and allow some binding to fail so long as at least one succeeds. This allows the test to continue even in more antagonistic environments.	2019-03-12 12:00:49 -04:00
Albert Zaharovits	3c7fafd0cc	Fix token invalidation when retries exhausted (#39799 ) Fixes a bug where the index invalidation listener was not called when the retry count is exhausted but there are still tokens to be retried.	2019-03-08 20:18:59 +02:00
Tim Brooks	8043fefcf6	Log close_notify during handshake at debug level (#39715 ) A TLS handshake requires exchanging multiple messages to initiate a session. If one side decides to close during the handshake, it is supposed to send a close_notify alert (similar to closing during application data exchange). The java SSLEngine engine throws an exception when this happens. We currently log this at the warn level if trace logging is not enabled. This level is too high for a valid scenario. Additionally it happens all the time in tests (quickly closing and opened transports). This commit changes this to be logged at the debug level if trace is not enabled. Additionally, it extracts the transport security exception handling to a common class.	2019-03-07 09:52:18 -07:00
Ioannis Kakavas	6c19d872a0	Fix testRefreshingMultipleTimesWithinWindowSucceeds (#39701 ) Previously all the threads were writing the received tokens to a HashSet. In cases with many threads, sometimes (1 every ~25 tests) calling size() on the HashSet returned 2 even though it seemed to contain only one String and there was no evidence from logging that threadSecurityClient.refreshToken() ever returned a different access or refresh token. This commit changes the test to use a ConcurrentHashMap instead, checking that we only received one pair of access token/refresh token eventually. It also adds a check so that we won't take into consideration tokens that are returned after 30s, hence not in the concurrent refresh time window.	2019-03-07 13:13:50 +02:00
Albert Zaharovits	fb1005fffc	Fix Token Service retry mechanism (#39639 ) Fixes several errors of the token retry logic: * not checking for backoff.hasNext() before calling backoff.next() * checking for backoff.hasNext() without calling backoff.next() * not preserving the context on the retry * calling scheduleWithFixedDelay instead of schedule	2019-03-06 15:32:23 +02:00
David Turner	77dd711847	Tidy up GroupedActionListener (#39633 ) Today the `GroupedActionListener` accepts a `defaults` parameter but all callers pass an empty list. Also it is permitted to pass an empty group but this is trappy because the delegated listener is never called in that case. This commit removes the `defaults` parameter and forbids an empty group.	2019-03-06 09:25:10 +00:00
Yogesh Gaikwad	c91dcbd5ee	Types removal security index template (#39705 ) (#39728 ) As we are moving to single type indices, we need to address this change in security-related indexes. To address this, we are - updating index templates to use preferred type name `_doc` - updating the API calls to use preferred type name `_doc` Upgrade impact:- In case of an upgrade from 6.x, the security index has type `doc` and this will keep working as there is a single type and `_doc` works as an alias to an existing type. The change is handled in the `SecurityIndexManager` when we load mappings and settings from the template. Previously, we used to do a `PutIndexTemplateRequest` with the mapping source JSON with the type name. This has been modified to remove the type name from the source. So in the case of an upgrade, the `doc` type is updated whereas for fresh installs `_doc` is updated. This happens as backend handles `_doc` as an alias to the existing type name. An optional step is to `reindex` security index and update the type to `_doc`. Since we do not support the security audit log index, that template has been deleted. Relates: #38637	2019-03-06 18:53:59 +11:00
Ioannis Kakavas	7ed9d52824	Support concurrent refresh of refresh tokens (#39647 ) This is a backport of #39631 Co-authored-by: Jay Modi jaymode@users.noreply.github.com This change adds support for the concurrent refresh of access tokens as described in #36872 In short it allows subsequent client requests to refresh the same token that come within a predefined window of 60 seconds to be handled as duplicates of the original one and thus receive the same response with the same newly issued access token and refresh token. In order to support that, two new fields are added in the token document. One contains the instant (in epoqueMillis) when a given refresh token is refreshed and the other contains a pointer to the token document that stores the new refresh token and access token that was created by the original refresh. A side effect of this change, that was however also an intended enhancement for the token service, is that we needed to stop encrypting the string representation of the UserToken while serializing. (It was necessary as we correctly used a new IV every time we encrypted a token in serialization, so subsequent serializations of the same exact UserToken would produce different access token strings) This change also handles the serialization/deserialization BWC logic: In mixed clusters we keep creating tokens in the old format and consume only old format tokens In upgraded clusters, we start creating tokens in the new format but still remain able to consume old format tokens (that could have been created during the rolling upgrade and are still valid) When reading/writing TokensInvalidationResult objects, we take into consideration that pre 7.1.0 these contained an integer field that carried the attempt count Resolves #36872	2019-03-05 14:55:59 +02:00
Albert Zaharovits	e7dbfda5d3	Fix security index auto-create and state recovery race (#39582 ) Previously, the security index could be wrongfully recreated. This might happen if the index was interpreted as missing, as in the case of a fresh install, but the index existed and the state did not yet recover. This fix will return HTTP SERVICE_UNAVAILABLE (503) for requests that try to write to the security index before the state has been recovered.	2019-03-05 12:47:59 +02:00
Tanguy Leroux	0c6b7cfb77	Revert "Support concurrent refresh of refresh tokens (#39559 )" This reverts commit `e2599214e0`.	2019-03-01 17:59:45 +01:00
Ioannis Kakavas	e2599214e0	Support concurrent refresh of refresh tokens (#39559 ) This is a backport of #38382 This change adds support for the concurrent refresh of access tokens as described in #36872 In short it allows subsequent client requests to refresh the same token that come within a predefined window of 60 seconds to be handled as duplicates of the original one and thus receive the same response with the same newly issued access token and refresh token. In order to support that, two new fields are added in the token document. One contains the instant (in epoqueMillis) when a given refresh token is refreshed and the other contains a pointer to the token document that stores the new refresh token and access token that was created by the original refresh. A side effect of this change, that was however also an intended enhancement for the token service, is that we needed to stop encrypting the string representation of the UserToken while serializing. (It was necessary as we correctly used a new IV every time we encrypted a token in serialization, so subsequent serializations of the same exact UserToken would produce different access token strings) This change also handles the serialization/deserialization BWC logic: - In mixed clusters we keep creating tokens in the old format and consume only old format tokens - In upgraded clusters, we start creating tokens in the new format but still remain able to consume old format tokens (that could have been created during the rolling upgrade and are still valid) Resolves #36872 Co-authored-by: Jay Modi jaymode@users.noreply.github.com	2019-03-01 16:00:07 +02:00
Albert Zaharovits	8a19d981db	Integ test snapshot and restore for native realm (#39123 ) This commit adds a simple integ test that exercises the flow: * snapshot .security * delete .security * restore .security , checking that the Native Realm works as expected. Relates #34454	2019-02-28 14:41:47 +02:00
Tim Brooks	f24dae302d	Make security tests transport agnostic (#39411 ) Currently there are two security tests that specifically target the netty security transport. This PR moves the client authentication tests into `AbstractSimpleSecurityTransportTestCase` so that the nio transport will also be tested. Additionally the work to build transport configurations is moved out of the netty transport and tested independently.	2019-02-26 18:55:19 -07:00
Tim Vernum	30687cbe7f	Switch internal security index to ".security-7" (#39422 ) This changes the name of the internal security index to ".security-7", but supports indices that were upgraded from earlier versions and use the ".security-6" name. In all cases, both ".security-6" and ".security-7" are considered to be restricted index names regardless of which name is actually in use on the cluster. Backport of: #39337	2019-02-27 12:49:44 +11:00
Ioannis Kakavas	7f999c43b3	[BACKPORT-7.x] Fix TokenBackwardsCompatibility tests (#39294 ) This change is a backport of #39252 - Fixes TokenBackwardsCompatibilityIT: Existing tests seemed to make the assumption that in the oneThirdUpgraded stage the master node will be on the old version and in the twoThirdsUpgraded stage, the master node will be one of the upgraded ones. However, there is no guarantee that the master node in any of the states will or will not be one of the upgraded ones. This class now tests: - That we can generate and consume tokens before we start the rolling upgrade. - That we can consume tokens generated in the old cluster during all the stages of the rolling upgrade. - That while on a mixed cluster, when/if the master node is upgraded, we can generate, consume and refresh a token - That after the rolling upgrade, we can consume a token generated in an old cluster and can invalidate it so that it can't be used any more. - Ensures that during the rolling upgrade, the upgraded nodes have the same configuration as the old nodes. Specifically that the file realm we use is explicitly named `file1`. This is needed because while attempting to refresh a token in a mixed cluster we might create a token hitting an old node and attempt to refresh it hitting a new node. If the file realm name is not the same, the refresh will be seen as being made by a "different" client, and will, thus, fail. - Renames the Authentication variable we check while refreshing a token to be clientAuth in order to make the code more readable. Some of the above were possibly causing the flakiness of #37379	2019-02-26 10:42:36 +02:00
Tim Brooks	44df76251f	Rebuild remote connections on profile changes (#39146 ) Currently remote compression and ping schedule settings are dynamic. However, we do not listen for changes. This commit adds listeners for changes to those two settings. Additionally, when those settings change we now close existing connections and open new ones with the settings applied. Fixes #37201.	2019-02-21 14:00:39 -07:00
Jay Modi	af451459a5	Fix failures in SessionFactoryLoadBalancingTests (#39154 ) This change aims to fix failures in the session factory load balancing tests that mock failure scenarios. For these tests, we randomly shut down ldap servers and bind a client socket to the port they were listening on. Unfortunately, we would occasionally encounter failures in these tests where a socket was already in use and/or the port we expected to connect to was wrong and in fact was to one of the ldap instances that should have been shut down. The failures are caused by the behavior of certain operating systems when it comes to binding ports and wildcard addresses. It is possible for a separate application to be bound to a wildcard address and still allow our code to bind to that port on a specific address. So when we close the server socket and open the client socket, we are still able to establish a connection since the other application is already listening on that port on a wildcard address. Another variant is that the os will allow a wildcard bind of a server socket when there is already an application listening on that port for a specific address. In order to do our best to prevent failures in these scenarios, this change does the following: 1. Binds a client socket to all addresses in an awaitBusy 2. Adds assumption that we could bind all valid addresses 3. In the case that we still establish a connection to an address that we should not be able to, try to bind and expect a failure of not being connected Closes #32190	2019-02-20 11:38:26 -07:00
Albert Zaharovits	af8ef1bb98	Do not create the missing index when invoking getRole (#39039 ) In most of the places we avoid creating the `.security` index (or updating the mapping) for read/search operations. This is more of a nit for the case of the getRole call, that fixes a possible mapping update during a get role, and removes a dead if branch about creating the `.security` index.	2019-02-20 17:33:10 +02:00
Jason Tedor	09ea3ccd16	Remove retention leases when unfollowing (#39088 ) This commit attempts to remove the retention leases on the leader shards when unfollowing an index. This is best effort, since the leader might not be available.	2019-02-20 07:06:49 -05:00
Ioannis Kakavas	210f34f8e9	Remove BCryptTests (#39098 ) This test was added to verify that we fixed a specific behavior in Bcrypt and hasn't been running for almost 4 years now.	2019-02-19 18:12:18 +02:00
Ioannis Kakavas	59e9a0f4f4	Disable specific locales for tests in fips mode (#38938 ) * Disable specific locales for tests in fips mode The Bouncy Castle FIPS provider that we use for running our tests in fips mode has an issue with locale sensitive handling of Dates as described in https://github.com/bcgit/bc-java/issues/405 This causes certificate validation to fail if any given test that includes some form of certificate validation happens to run in one of the locales. This manifested earlier in #33081 which was handled insufficiently in #33299 This change ensures that the problematic 3 locales * th-TH * ja-JP-u-ca-japanese-x-lvariant-JP * th-TH-u-nu-thai-x-lvariant-TH will not be used when running our tests in a FIPS 140 JVM. It also reverts #33299	2019-02-19 08:46:08 +02:00
Hendrik Muhs	4f662bd289	Add data frame feature (#38934 ) (#39029 ) The data frame plugin allows users to create feature indexes by pivoting a source index. In a nutshell this can be understood as reindex supporting aggregations or similar to the so called entity centric indexing. Full history is provided in: feature/data-frame-transforms	2019-02-18 11:07:29 +01:00
Jason Tedor	a5ce1e0bec	Integrate retention leases to recovery from remote (#38829 ) This commit is the first step in integrating shard history retention leases with CCR. In this commit we integrate shard history retention leases with recovery from remote. Before we start transferring files, we take out a retention lease on the primary. Then during the file copy phase, we repeatedly renew the retention lease. Finally, when recovery from remote is complete, we disable the background renewing of the retention lease.	2019-02-16 15:37:52 -05:00
Yogesh Gaikwad	36c274867e	Fix intermittent failure in ApiKeyIntegTests (#38627 ) (#38935 ) A few tests failed intermittently, most of the time because invalidated or expired keys that had been deleted were still reported in search results. This commit removes the test and adds enhancements to other tests testing different scenarios. When ExpiredApiKeysRemover was triggered, the tests did not await its termination, so the results of a search operation would sometimes be wrong. DELETE_INTERVAL setting has been further reduced to 100ms so we can trigger ExpiredApiKeysRemover faster. Closes #38408	2019-02-15 23:01:35 +11:00
Jay Modi	5d06226507	Fix writing of SecurityFeatureSetUsage to pre-7.1 (#38922 ) This change makes the writing of new usage data conditional based on the version that is being written to. A test has also been added to ensure serialization works as expected to an older version. Relates #38687, #38917	2019-02-14 16:28:52 -07:00
Jay Modi	e59b7b696a	Use consistent view of realms for authentication (#38815 ) This change updates the authentication service to use a consistent view of the realms based on the license state at the start of authentication. Without this, the license can change during authentication of a request and it will result in a failure if the realm that extracted the token is no longer in the realm list. This manifests in some tests as an authentication failure that should never really happen; one example would be the test framework's transport client user should always have a successful authentication but in the LicensingTests this can fail and will show up as a NoNodeAvailableException. Additionally, the licensing tests have been updated to ensure that there is consistency when changing the license. The license is changed by modifying the internal xpack license state on each node, which has no protection against being changed by some pending cluster action. The methods to disable and enable now ensure we have a green cluster and that the cluster is consistent before returning. Closes #30301	2019-02-14 07:49:14 -07:00
Yogesh Gaikwad	335cf91bb9	Add enabled status for token and api key service (#38687 ) (#38882 ) Right now there is no way to determine whether the token service or API key service is enabled or not. This commit adds support for the enabled status of token and API key service to the security feature set usage API `/_xpack/usage`. Closes #38535	2019-02-14 23:08:52 +11:00
Ioannis Kakavas	8c624e5a20	Enhance parsing of StatusCode in SAML Responses (#38628 ) * Enhance parsing of StatusCode in SAML Responses <Status> elements in a failed response might contain two nested <StatusCode> elements. We currently only parse the first one in order to create a message that we attach to the Exception we return and log. However this is generic and only gives out information about whether the SAML IDP believes it's an error with the request or if it couldn't handle the request for other reasons. The encapsulated StatusCode has a more interesting error message that potentially gives out the actual error as in Invalid nameid policy, authentication failure etc. This change ensures that we print that information also, and removes Message and Details fields from the message when these are not part of the Status element (which quite often is the case)	2019-02-11 11:55:26 +02:00
Tim Vernum	273edea712	Mute testExpiredApiKeysDeletedAfter1Week (#38683 ) Tracked: #38408	2019-02-11 16:50:10 +11:00
Christoph Büscher	5180b36547	Mute failing ApiKeyIntegTests (#38614 )	2019-02-08 13:04:17 +01:00

557 Commits