OpenSearch

Commit Graph

Author	SHA1	Message	Date
Armin Braun	aad33121d8	Async Snapshot Repository Deletes (#40144 ) (#41571 ) Motivated by slow snapshot deletes reported in e.g. #39656 and the fact that these likely are a contributing factor to repositories accumulating stale files over time when deletes fail to finish in time and are interrupted before they can complete. * Makes snapshot deletion async and parallelizes some steps of the delete process that can be safely run concurrently via the snapshot thread poll * I did not take the biggest potential speedup step here and parallelize the shard file deletion because that's probably better handled by moving to bulk deletes where possible (and can still be parallelized via the snapshot pool where it isn't). Also, I wanted to keep the size of the PR manageable. * See https://github.com/elastic/elasticsearch/pull/39656#issuecomment-470492106 * Also, as a side effect this gives the `SnapshotResiliencyTests` a little more coverage for master failover scenarios (since parallel access to a blob store repository during deletes is now possible since a delete isn't a single task anymore). * By adding a `ThreadPool` reference to the repository this also lays the groundwork to parallelizing shard snapshot uploads to improve the situation reported in #39657	2019-04-26 15:36:09 +02:00
Armin Braun	23b3741618	Remove Exists Check from S3 Repository Deletes (#40931 ) (#41534 ) * The check doesn't add much if anything practically, since the S3 repository is eventually consistent and we only log the non-existence of a blob anyway * We don't do the check on writes for this very reason and documented it as such * Removing the check saves one API call per single delete speeding up the deletion process and lowering costs	2019-04-25 18:25:03 +02:00
Armin Braun	c4e84e2b34	Add Bulk Delete Api to BlobStore (#40322 ) (#41253 ) * Adds Bulk delete API to blob container * Implement bulk delete API for S3 * Adjust S3Fixture to accept both path styles for bulk deletes since the S3 SDK uses both during our ITs * Closes #40250	2019-04-16 17:19:05 +02:00
Armin Braun	65732d707f	Add Support for S3 Intelligent Tiering (#39376 ) (#39620 ) * Add support for S3 intelligent tiering * Closes #38836	2019-03-04 10:32:37 +01:00
Henning Andersen	00a26b9dd2	Blob store compression fix (#39073 ) Blob store compression was not enabled for some of the files in snapshots due to constructor accessing sub-class fields. Fixed to instead accept compress field as constructor param. Also fixed chunk size validation to work. Deprecated repositories.fs.compress setting as well to be able to unify in a future commit.	2019-02-20 09:24:41 +01:00
Colin Goodheart-Smithe	21e392e95e	Removes typed calls from YAML REST tests (#37611 ) This PR attempts to remove all typed calls from our YAML REST tests. The PR adds include_type_name: false to create index requests that use a mapping and also to put mapping requests. It also removes _type from index requests where they haven't already been removed. The PR ignores tests named *_with_types.yml since this are specifically testing typed API behaviour. The change also includes changing the test harness to add the type _doc to index, update, get and bulk requests that do not specify the document type when the test is running against a mixed 7.x/6.x cluster.	2019-01-30 16:32:58 +00:00
Armin Braun	57823c484f	Streamline S3 Repository- and Client-Settings (#37393 ) * Make repository settings override static settings * Cache clients according to settings * Introduce custom implementations for the AWS credentials here to be able to use them as part of a hash key	2019-01-30 06:22:31 +01:00
Armin Braun	617e294133	SNAPSHOT: Make Atomic Blob Writes Mandatory (#37168 ) * With #37066 introducing atomic writes to HDFS repository we can enforce atomic write capabilities on this interface * The overrides on the other three cloud implementations are ok because: * https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html states that "Amazon S3 never adds partial objects; if you receive a success response, Amazon S3 added the entire object to the bucket." * https://cloud.google.com/storage/docs/consistency states that GCS has strong read-after-write consistency * https://docs.microsoft.com/en-us/rest/api/storageservices/put-block#remarks Azure has the concept of committing blobs, so there's no partial content here either * Relates #37011	2019-01-07 12:11:19 +01:00
Tanguy Leroux	6186ccf83e	[Tests] Fix third party tests with Gradle 5.0 (#36302 ) * [Tests] Fix third party tests with Gradle 5.0 * apply feedback	2018-12-06 16:05:05 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Jernej Klancic	baf33b3162	Removes AbstractComponent from several classes (#35566 ) Removes inhertiting from AbstractComponent for some classes (mostly in the plugins module). Relates to #34488	2018-11-16 20:50:18 +01:00
Armin Braun	02b4e28534	#31608 Add S3 Setting to Force Path Type Access (#34721 ) * SNAPSHOTS: Use Path Style Access in S3 * Use path style access pattern to fix #31608 * closes #31608	2018-11-09 05:07:26 +01:00
Andy Bristol	eec357ebde	[test] quote base_path in repository tests	2018-11-01 13:01:53 -07:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00
Nik Everett	9f87fdc7ab	Drop deprecationLogger from AbstractComponent (#34859 ) Drops the `deprecationLogger` from `AbstractComponent`, moving it to places where we need it. This saves us from building a bunch of `DeprecationLogger`s that we don't need. Relates to #34488	2018-10-26 15:40:16 -04:00
Alpar Torok	59536966c2	Add a new "contains" feature (#34738 ) The contains syntax was added in #30874 but the skips were not properly put in place. The java runner has the feature so the tests will run as part of the build, but language clients will be able to support it at their own pace.	2018-10-25 08:50:50 +03:00
Nik Everett	6c07d105f3	Amazon: Wrap at 140 columns (#34495 ) Applies our standard column wrapping to the `discovery-ec2` and `repository-s3` plugins.	2018-10-18 09:09:09 -04:00
Jason Tedor	99681f91f8	Use more precise does S3 bucket exist method (#34123 ) We are using a deprecated method for checking if an S3 bucket exists. This deprecated method has a limitation that it can not distinguish between invalid credentials and a lack of permissions. This commit switches to using a method that correctly surfaces if invalid credentials are supplied when checking for the existence of a bucket.	2018-09-28 10:05:04 -04:00
Jason Tedor	839a677557	Do not override named S3 client credentials (#33793 ) In cases when mixed secure S3 client credentials and insecure S3 client credentials were used (that is, those defined on the repository), we were overriding the credentials from the repository using insecure settings to all the repositories. This commit fixes this by not mixing up repositories that use insecure settings with those that use secure settings.	2018-09-19 16:18:54 -04:00
Tanguy Leroux	7e195c2912	Update AWS SDK to 1.11.406 in repository-s3 (#30723 )	2018-09-12 15:27:57 +02:00
Vladimir Dolzhenko	7c0fc209bf	ECS Task IAM profile credentials ignored in repository-s3 plugin (#31864 ) ECS Task IAM profile credentials ignored in repository-s3 plugin (#31864) Closes #26913	2018-07-19 12:54:38 +02:00
Vladimir Dolzhenko	8235b254ab	Add EC2 credential test for repository-s3 (#31918 ) Add EC2 credential test for repository-s3 Relates to #26913	2018-07-18 12:18:00 +02:00
Vladimir Dolzhenko	b1bf643e41	lazy snapshot repository initialization (#31606 ) lazy snapshot repository initialization	2018-07-13 20:05:49 +02:00
Christoph Büscher	043e40adde	Fix unreachable error condition in AmazonS3Fixture (#32005 ) The `else` branch where currently the error response should be thrown is not reachable because `handler` is always non-null inside the previous outer check. Moving error creation into an else branch on the other condition check, removing the other superflous check for non-null handler inside the first branch.	2018-07-12 17:38:30 +02:00
Alpar Torok	d7785a7c43	Fix doclint jdk 11	2018-07-05 08:18:51 +03:00
David Turner	18c17dfb35	S3 fixture should report 404 on unknown bucket (#31782 ) Today, `AmazonS3Fixture` returns 403 on attempts to access any inappropriate bucket, whether known or otherwise. In fact, S3 reports 404 on nonexistent buckets and 403 on inaccessible ones. This change enhances `AmazonS3Fixture` to distinguish these cases.	2018-07-04 10:48:00 +01:00
Yannick Welsch	32d67ef504	Fixture for Minio testing (#31688 ) Adds a Minio fixture to run the S3 repository tests against Minio. Also collapses the single qa subproject into the s3-repository project, which simplifies the code structure (having it all in one place) and helps to avoid having too many Gradle subprojects.	2018-07-04 08:04:07 +02:00
David Turner	4108722052	Add support for AWS session tokens (#30414 ) AWS supports the creation and use of credentials that are only valid for a fixed period of time. These credentials comprise three parts: the usual access key and secret key, together with a session token. This commit adds support for these three-part credentials to the EC2 discovery plugin and the S3 repository plugin. Note that session tokens are only valid for a limited period of time and yet there is no mechanism for refreshing or rotating them when they expire without restarting Elasticsearch. Nonetheless, this feature is already useful for nodes that need only run for a few days, such as for training, testing or evaluation. #29135 tracks the work towards allowing these credentials to be refreshed at runtime. Resolves #16428	2018-07-03 14:12:07 +01:00
Yannick Welsch	2bb4f38371	Add writeBlob option to replace existing blob (#31729 ) Adds a new parameter to the BlobContainer#writeBlob methods to specify whether the existing file should be overridden or not. For some metadata files in the repository, we actually want to replace the current file. This is currently implemented through an explicit blob delete and then a fresh write. In case of using a cloud provider (S3, GCS, Azure), this results in 2 API requests instead of just 1. This change will therefore allow us to achieve the same functionality using less API requests.	2018-07-03 09:13:50 +02:00
Tanguy Leroux	8fa06294ab	Do not check for object existence when deleting repository index files (#31680 ) Before deleting a repository index generation file, BlobStoreRepository checks for the existence of the file and then deletes it. We can save a request here by using BlobContainer.deleteBlobIgnoringIfNotExists() which ignores error when deleting a file that does not exist. Since there is no way with S3 to know if a non versioned file existed before being deleted, this pull request also changes S3BlobContainer so that it now implements deleteBlobIgnoringIfNotExists(). It will now save one more request (blobExist?) when appropriate. The tests and fixture have been modified to conform the S3 API that always returns a 204/NO CONTENT HTTP response on deletions.	2018-06-29 14:15:34 +02:00
Tanguy Leroux	0ef22db844	[Test] Clean up some repository-s3 tests (#31601 ) This commit removes some tests in the repository-s3 plugin that have not been executed for 2+ years but have been maintained for nothing. Most of the tests in AbstractAwsTestCase were obsolete or superseded by fixture based integration tests.	2018-06-29 13:21:29 +02:00
Tanguy Leroux	9d523d05ed	Merge AwsS3Service and InternalAwsS3Service in a S3Service class (#31580 ) The interface and its implementation can be merged into a single class, which is renamed to S3Service like the other S3BlobStore, S3Repository classes.	2018-06-28 10:40:53 +02:00
Alpar Torok	08b8d11e30	Add support for switching distribution for all integration tests (#30874 ) * remove left-over comment * make sure of the property for plugins * skip installing modules if these exist in the distribution * Log the distrbution being ran * Don't allow running with integ-tests-zip passed externally * top level x-pack/qa can't run with oss distro * Add support for matching objects in lists Makes it possible to have a key that points to a list and assert that a certain object is present in the list. All keys have to be present and values have to match. The objects in the source list may have additional fields. example: ``` match: { 'nodes.$master.plugins': { name: ingest-attachment } } ``` * Update plugin and module tests to work with other distributions Some of the tests expected that the integration tests will always be ran with the `integ-test-zip` distribution so that there will be no other plugins loaded. With this change, we check for the presence of the plugin without assuming exclusivity. * Allow modules to run on other distros as well To match the behavior of tets.distributions * Add and use a new `contains` assertion Replaces the previus changes that caused `match` to do a partial match. * Implement PR review comments	2018-06-26 06:49:03 -07:00
Albert Zaharovits	3378240b29	Reload secure settings for plugins (#31383 ) Adds the ability to reread and decrypt the local node keystore. Commonly, the contents of the keystore, backing the `SecureSettings`, are not retrievable except during node initialization. This changes that by adding a new API which broadcasts a password to every node. The password is used to decrypt the local keystore and use it to populate a `Settings` object that is passes to all the plugins implementing the `ReloadablePlugin` interface. The plugin is then responsible to do whatever "reload" means in his case. When the `reload`handler returns, the keystore is closed and its contents are no longer retrievable. Password is never stored persistently on any node. Plugins that have been moded in this commit are: `repository-azure`, `repository-s3`, `repository-gcs` and `discovery-ec2`.	2018-06-18 09:42:11 +03:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Tanguy Leroux	b5f05f676c	Remove BlobContainer.move() method (#31100 ) closes #30680	2018-06-07 10:48:31 +02:00
Yannick Welsch	515a23360d	Do not check for S3 blob to exist before writing (#31128 ) In #19749 an extra check was added before writing each blob to ensure that we would not be overriding an existing blob. Due to S3's weak consistency model, this check was best effort. To make matters worse, however, this resulted in a HEAD request to be done before every PUT, in particular also when PUTTING a new object. The approach taken in #19749 worsened our consistency guarantees for follow-up snapshot actions, as it made it less likely for new files that had been written to be available for reads. This commit therefore removes this extra check. Due to the weak consistency model, this check was a best effort thing anyway, and there's currently no way to prevent accidental overrides on S3.	2018-06-06 16:38:06 +02:00
Albert Zaharovits	8e9d2b1e28	S3 repo plugin populate SettingsFilter (#30652 ) The accessKey and secretKey repo settings (in the cluster state) of the s3 client are registered and will populate the SettingsFilter.	2018-05-16 16:12:24 +03:00
Tanguy Leroux	7ae3b3b155	Move repository-s3 fixture tests to QA test project (#29372 ) This commit moves the repository-s3 fixture test added in #29296 in a new `repository-s3/qa/amazon-s3` project. This new project allows the REST integration tests to be executed using the real S3 service when all the required environment variables are provided. When no env var is provided, then the tests are executed using the fixture added in #29296. The REST tests located at the `repository-s3`plugin project now only verify that the plugin is correctly loaded. The REST tests have been adapted to allow a bucket name and a base path to be specified as env vars. This way it is possible to run the tests with different base paths (could be anything, like a CI job name or a branch name) without multiplicating buckets. Related to #29349	2018-04-27 16:49:06 +02:00
Tanguy Leroux	d813a05b9f	Use ESBlobStoreRepositoryIntegTestCase to test the repository-s3 plugin (#29315 ) This commit adds the S3BlobStoreRepositoryTests class that extends the base testing class for S3. It also removes some usage of socket servers that emulate socket connections in unit tests. It was added to trigger security exceptions, but this won't be needed anymore since #29296 is merged.	2018-04-05 13:34:02 +02:00
Jason Tedor	c95e7539e7	Enhance error for out of bounds byte size settings (#29338 ) Today when you input a byte size setting that is out of bounds for the setting, you get an error message that indicates the maximum value of the setting. The problem is that because we use ByteSize#toString, we end up with a representation of the value that does not really tell you what the bound is. For example, if the bound is 2^31 - 1 bytes, the output would be 1.9gb which does not really tell you want the limit as there are many byte size values that we format to the same 1.9gb with ByteSize#toString. We have a method ByteSize#getStringRep that uses the input units to the value as the output units for the string representation, so we end up with no loss if we use this to report the bound. This commit does this.	2018-04-04 07:22:13 -04:00
Tanguy Leroux	989e465964	Use fixture to test repository-s3 plugin (#29296 ) This commit adds a new fixture that emulates a S3 service in order to improve the existing integration tests. This is very similar to what has been made for Google Cloud Storage in #28788, and such tests would have helped a lot to catch bugs like #22534. The AmazonS3Fixture is brittle and only implements the very necessary stuff for the S3 repository to work, but at least it works and can be adapted for specific tests needs.	2018-04-03 11:30:43 +02:00
kel	c675407a70	Remove redundant argument for buildConfiguration of s3 plugin (#28281 )	2018-01-23 22:32:46 -08:00
Nhat Nguyen	3c865d6d04	TEST: reduce blob size #testExecuteMultipartUpload If a large blob size and small buffer size are picked, this test causes out of memory. https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+intake/1061/	2017-12-20 12:43:05 -05:00
Colin Goodheart-Smithe	579d1fea57	Fixes ByteSizeValue to serialise correctly (#27702 ) * Fixes ByteSizeValue to serialise correctly This fix makes a few fixes to ByteSizeValue to make it possible to perform round-trip serialisation: * Changes wire serialisation to use Zlong methods instead of VLong methods. This is needed because the value `-1` is accepted but previously if `-1` is supplied it cannot be serialised using the wire protocol. * Limits the supplied size to be no more than Long.MAX_VALUE when converted to bytes. Previously values greater than Long.MAX_VALUE bytes were accepted but would be silently interpreted as Long.MAX_VALUE bytes rather than erroring so the user had no idea the value was not being used the way they had intended. I consider this a bug and so fine to include this bug fix in a minor version but I am open to other points of view. * Adds a `getStringRep()` method that can be used when serialising the value to JSON. This will print the bytes value if the size is positive, `”0”` if the size is `0` and `”-1”` if the size is `-1`. * Adds logic to detect fractional values when parsing from a String and emits a deprecation warning in this case. * Modifies hashCode and equals methods to work with long values rather than doubles so they don’t run into precision problems when dealing with large values. Previous to this change the equals method would not detect small differences in the values (e.g. 1-1000 bytes ranges) if the actual values where very large (e.g. PBs). This was due to the values being in the order of 10^18 but doubles only maintaining a precision of ~10^15. Closes #27568 * Fix bytes settings default value to not use fractional values * Fixes test * Addresses review comments * Modifies parsing to preserve unit This should be bwc since in the case that the input is fractional it reverts back to the old method of parsing it to the bytes value. * Addresses more review comments * Fixes tests * Temporarily changes version check to 7.0.0 This will be changed to 6.2 when the fix has been backported	2017-12-14 12:17:17 +00:00
Tanguy Leroux	b69923f112	Remove some unused code (#27792 ) This commit removes some unused code.	2017-12-13 16:45:55 +01:00
Tanguy Leroux	f27cb96a64	Use AmazonS3.doesObjectExist() method in S3BlobContainer (#27723 ) This pull request changes the S3BlobContainer.blobExists() method implementation to make it use the AmazonS3.doesObjectExist() method instead of AmazonS3.getObjectMetadata(). The AmazonS3 implementation takes care of catching any thrown AmazonS3Exception and compares its response code with 404, returning false (object does not exist) or lets the exception be propagated.	2017-12-12 09:30:36 +01:00
Tanguy Leroux	f6c2ea0f7d	[Test] Fix S3BlobStoreContainerTests.testNumberOfMultiparts()	2017-11-10 15:45:20 +01:00
Tanguy Leroux	9c4d6c629a	Remove S3 output stream (#27280 ) Now the blob size information is available before writing anything, the repository implementation can know upfront what will be the more suitable API to upload the blob to S3. This commit removes the DefaultS3OutputStream and S3OutputStream classes and moves the implementation of the upload logic directly in the S3BlobContainer. related #26993 closes #26969	2017-11-10 12:22:33 +01:00

1 2 3 4 5

209 Commits