OpenSearch

Commit Graph

Author	SHA1	Message	Date
Andrey Pleskach	b9ff91d591	Add proxy settings for GCS repository (#2096 ) Added proxy settings for GCS repository. Security settings: - gcs.client..proxy.username - Proxy user name - gcs.client..proxy.password - Proxy user password Common settings: - gcs.client..proxy.type - java Proxy.Type names: HTTP, SOCKS. default is DIRECT - gcs.client..proxy.host - Proxy host name - gcs.client.*.proxy.port - Proxy port Signed-off-by: Andrey Pleskach <ples@aiven.io>	2022-02-17 11:49:33 -08:00
Tianli Feng	8b8d04173c	Update protobuf-java to 3.19.3 (#1945 ) * Update protobuf-java to 3.19.3 Signed-off-by: Tianli Feng <ftl94@live.com> * Exclude some API usage violations in the package com.google.protobuf for thirdPartyAudit task to pass Signed-off-by: Tianli Feng <ftl94@live.com>	2022-01-20 11:05:28 -08:00
Andriy Redko	385b268bc0	Update Mockito to 4.2.x (#1830 ) Signed-off-by: Andriy Redko <andriy.redko@aiven.io>	2022-01-03 12:00:45 -05:00
Andriy Redko	65804d25a6	Update to log4j 2.17.1 (#1820 ) Signed-off-by: Andriy Redko <andriy.redko@aiven.io>	2021-12-28 17:06:42 -05:00
Andriy Redko	ca27c8fd4f	Update to log4j 2.17.0 (#1771 )	2021-12-18 09:36:59 -08:00
Andriy Redko	6db435412b	Upgrade to log4j 2.16.0 (#1721 ) Signed-off-by: Andriy Redko <andriy.redko@aiven.io>	2021-12-14 07:34:45 -05:00
Andrew Ross	309649ce8a	Upgrade to logj4 2.15.0 (#1698 ) Signed-off-by: Andrew Ross <andrross@amazon.com>	2021-12-10 13:03:41 -08:00
Sarat Vemulapalli	e0e6995c4a	Updating Log4j to 2.11.2 (#1696 ) Signed-off-by: Sarat Vemulapalli <vemulapallisarat@gmail.com>	2021-12-10 08:03:45 -08:00
Vacha	bcfb57c06a	Upgrade dependency (#1571 ) * Upgrading guava, commons-io and apache-ant dependencies Signed-off-by: Vacha <vachshah@amazon.com> * Adding failureaccess since guava needs it Signed-off-by: Vacha <vachshah@amazon.com>	2021-11-18 13:38:49 -05:00
Vacha	c6dd484ce3	Upgrading gson to 2.8.9 (#1541 ) Signed-off-by: Vacha <vachshah@amazon.com>	2021-11-15 14:10:29 -05:00
Himanshu Setia	681e5548c1	Enabling spotless, disabling checkstyle check on plugins (#1488 ) * Enabling spotless, disabling checkstyle on below modules :plugins:mapper-annotated-text :plugins:mapper-murmur3 :plugins:mapper-size :plugins:repository-azure :plugins:repository-gcs :plugins:repository-hdfs :plugins:repository-s3 :plugins:store-smb :plugins:transport-nio :qa:die-with-dignity Signed-off-by: Himanshu Setia <setiah@amazon.com> * Enabling spotless for more plugins Signed-off-by: Himanshu Setia <setiah@amazon.com> * Fixing error in merge conflict Signed-off-by: Himanshu Setia <setiah@amazon.com>	2021-11-01 17:40:06 -07:00
Rabi Panda	50abf6d066	[CVE] Upgrade dependencies to mitigate CVEs (#657 ) This PR upgrade the following dependencies to fix CVEs. - commons-codec:1.12 (->1.13) apache/commons-codec@48b6157 - ant:1.10.8 (->1.10.9) https://ant.apache.org/security.html - jackson-databind:2.10.4 (->2.11.0) FasterXML/jackson-databind#2589 - jackson-dataformat-cbor:2.10.4 (->2.11.0) https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2020-28491 - apache-httpclient:4.5.10 (->4.5.13) https://bugzilla.redhat.com/show_bug.cgi?id=CVE-2020-13956 - checkstyle:8.20 (->8.29) https://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2019-10782 - junit:4.12 (->4.13.1) https://github.com/junit-team/junit4/security/advisories/GHSA-269g-pwp5-87pp - netty:4.1.49.Final (->4.1.59) https://github.com/netty/netty/security/advisories/GHSA-5mcr-gq6c-3hq2 Signed-off-by: Rabi Panda <adnapibar@gmail.com>	2021-05-18 11:37:24 -07:00
Rabi Panda	6550e099b3	[CVE-2020-7692] Upgrade google-oauth clients for goolge cloud plugins (#662 ) For discovery-gce and repository-gcs plugins update the google-oauth-client library to version 1.31.0. See CVE details at https://nvd.nist.gov/vuln/detail/CVE-2020-7692 Signed-off-by: Rabi Panda <adnapibar@gmail.com>	2021-05-13 12:19:57 -07:00
Nick Knize	0ba0e7cc26	[Versioning] Rebase to OpenSearch version 1.0.0 (#555 ) This commit rebases the versioning to OpenSearch 1.0.0 Co-authored-by: Rabi Panda <adnapibar@gmail.com> Signed-off-by: Nicholas Walter Knize <nknize@apache.org>	2021-04-15 17:06:47 -05:00
Nick Knize	ee6d15e26a	[License] Add SPDX License Header to security policies (#531 ) This commit adds the SPDX license header and modifications copyright to security policy files. Signed-off-by: Nicholas Walter Knize <nknize@apache.org>	2021-04-12 22:59:36 -05:00
Rabi Panda	8727afbcd3	Use the correct domain to fix failing integration tests. (#519 ) This commit fixes a renaming issue (opensearch.co -> opensearch.org) which was causing few integration test failures. Signed-off-by: Rabi Panda <adnapibar@gmail.com>	2021-04-10 09:42:39 -07:00
Nick Knize	9168f1fb43	[License] Add SPDX and OpenSearch Modification license header (#509 ) This commit adds the SPDX Apache-2.0 license header along with an additional copyright header for all modifications. Signed-off-by: Nicholas Walter Knize <nknize@apache.org>	2021-04-09 14:28:18 -05:00
Rabi Panda	13f6d23e40	[Rename] Property and metadata keys with prefix es. (#389 ) Rename all property and metadata keys with prefix 'es.' to 'opensearch.'. Signed-off-by: Rabi Panda <adnapibar@gmail.com>	2021-03-21 20:56:34 -05:00
Nick Knize	5b46a05702	[Rename] remaining packages and resources in test/fixture (#364 ) This commit refactors the remaining o.e.index and o.e.test packages in the test/fixtures module. References throughout the codebase are also refactored. Signed-off-by: Nicholas Walter Knize <nknize@apache.org>	2021-03-21 20:56:34 -05:00
Harold Wang	82f9ff93cb	[Rename] plugins (#193 ) * [Rename] plugins (#193) This PR refactors files under "plugins" folders part of the Elasticsearch to OpenSearch renaming effort. Signed-off-by: Harold Wang <harowang@amazon.com>	2021-03-21 20:56:34 -05:00
Nick Knize	923ea001f5	[Rename] o.e.action.support classes (#253 ) This commit refactors the classes in o.e.action.support to o.opensearch.action.support. The remaining directories will be refactored in a separate commit. Signed-off-by: Nicholas Knize <nknize@amazon.com>	2021-03-21 20:56:34 -05:00
Rabi Panda	3eee5183d1	[Rename] server/rest (#229 ) This commit refactors the `server/rest` package as part of the Elasticsearch to OpenSearch renaming. Signed-off-by: Rabi Panda <adnapibar@gmail.com>	2021-03-21 20:56:34 -05:00
Nick Knize	1203aa7302	[Rename] refactor o.e.action classes (#203 ) This commit refactors top level classes in o.e.action to o.opensearch.action. References throughout the rest of the codebase have been updated. Signed-off-by: Nicholas Knize <nknize@amazon.com>	2021-03-21 20:56:34 -05:00
Armin Braun	83ec8dd4e2	Upgrade GCS SDK to 1.113.1 (#62848 ) (#62864 ) Just staying on top of upgrades to the SDK and its dependencies.	2020-09-24 15:43:21 +02:00
Francisco Fernández Castaño	2bb5716b3d	Add repositories metering API (#62088 ) This pull request adds a new set of APIs that allows tracking the number of requests performed by the different registered repositories. In order to avoid losing data, the repository statistics are archived after the repository is closed for a configurable retention period `repositories.stats.archive.retention_period`. The API exposes the statistics for the active repositories as well as the modified/closed repositories. Backport of #60371	2020-09-08 14:01:04 +02:00
Ryan Ernst	d6e17170c3	Simplify adding plugins and modules to testclusters (#61886 ) There are currently half a dozen ways to add plugins and modules for test clusters to use. All of them require the calling project to peek into the plugin or module they want to use to grab its bundlePlugin task, and then both depend on that task, as well as extract the archive path the task will produce. This creates cross project dependencies that are difficult to detect, and if the dependent plugin/module has not yet been configured, the build will fail because the task does not yet exist. This commit makes the plugin and module methods for testclusters symmetetric, and simply adding a file provider directly, or a project path that will produce the plugin/module zip. Internally this new variant uses normal configuration/dependencies across projects to get the zip artifact. It also has the added benefit of no longer needing the caller to add to the test task a dependsOn for bundlePlugin task.	2020-09-03 19:37:46 -07:00
Armin Braun	3e2dfc6eac	Remove GCS Bucket Exists Check (#60899 ) (#60914 ) Same as https://github.com/elastic/elasticsearch/pull/43288 for GCS. We don't need to do the bucket exists check before using the repo, that just needlessly increases the necessary permissions for using the GCS repository.	2020-08-11 09:54:27 +02:00
Rene Groeschke	bdd7347bbf	Merge test runner task into RestIntegTest (7.x backport) (#60600 ) * Merge test runner task into RestIntegTest (#60261) * Merge test runner task into RestIntegTest * Reorganizing Standalone runner and RestIntegTest task * Rework general test task configuration and extension * Fix merge issues * use former 7.x common test configuration	2020-08-04 14:46:32 +02:00
Armin Braun	7ae9dc2092	Unify Stream Copy Buffer Usage (#56078 ) (#60608 ) We have various ways of copying between two streams and handling thread-local buffers throughout the codebase. This commit unifies a number of them and removes buffer allocations in many spots.	2020-08-04 09:54:52 +02:00
Jake Landis	96b7122917	[7.x] Convert repository-* from integTest to [yaml \| java]RestTest or internalClusterTest (#60085 ) (#60404 ) For OSS plugins that being with repository-*, integTest task is now a no-op and all of the tests are now executed via a test, yamlRestTest, javaRestTest, or internalClusterTest. related: #56841 related: #59444	2020-07-29 11:19:44 -05:00
Armin Braun	ebb6677815	Formalize and Streamline Buffer Sizes used by Repositories (#59771 ) (#60051 ) Due to complicated access checks (reads and writes execute in their own access context) on some repositories (GCS, Azure, HDFS), using a hard coded buffer size of 4k for restores was needlessly inefficient. By the same token, the use of stream copying with the default 8k buffer size for blob writes was inefficient as well. We also had dedicated, undocumented buffer size settings for HDFS and FS repositories. For these two we would use a 100k buffer by default. We did not have such a setting for e.g. GCS though, which would only use an 8k read buffer which is needlessly small for reading from a raw `URLConnection`. This commit adds an undocumented setting that sets the default buffer size to `128k` for all repositories. It removes wasteful allocation of such a large buffer for small writes and reads in case of HDFS and FS repositories (i.e. still using the smaller buffer to write metadata) but uses a large buffer for doing restores and uploading segment blobs. This should speed up Azure and GCS restores and snapshots in a non-trivial way as well as save some memory when reading small blobs on FS and HFDS repositories.	2020-07-22 21:06:31 +02:00
Armin Braun	d18b434e62	Remove Artificially Low Chunk Size Limits from GCS + Azure Blob Stores (#59279 ) (#59564 ) Removing these limits as they cause unnecessarily many object in the blob stores. We do not have to worry about BwC of this change since we do not support any 3rd party implementations of Azure or GCS. Also, since there is no valid reason to set a different than the default maximum chunk size at this point, removing the documentation (which was incorrect in the case of Azure to begin with) for the setting from the docs. Closes #56018	2020-07-14 22:31:07 +02:00
Armin Braun	9268b25789	Add Check for Metadata Existence in BlobStoreRepository (#59141 ) (#59216 ) In order to ensure that we do not write a broken piece of `RepositoryData` because the phyiscal repository generation was moved ahead more than one step by erroneous concurrent writing to a repository we must check whether or not the current assumed repository generation exists in the repository physically. Without this check we run the risk of writing on top of stale cached repository data. Relates #56911	2020-07-08 14:25:01 +02:00
Jake Landis	604c6dd528	7.x - Create plugin for yamlTest task (#56841 ) (#59090 ) This commit creates a new Gradle plugin to provide a separate task name and source set for running YAML based REST tests. The only project converted to use the new plugin in this PR is distribution/archives/integ-test-zip. For which the testing has been moved to :rest-api-spec since it makes the most sense and it avoids a small but awkward change to the distribution plugin. The remaining cases in modules, plugins, and x-pack will be handled in followups. This plugin is distinctly different from the plugin introduced in #55896 since the YAML REST tests are intended to be black box tests over HTTP. As such they should not (by default) have access to the classpath for that which they are testing. The YAML based REST tests will be moved to separate source sets (yamlRestTest). The which source is the target for the test resources is dependent on if this new plugin is applied. If it is not applied, it will default to the test source set. Further, this introduces a breaking change for plugin developers that use the YAML testing framework. They will now need to either use the new source set and matching task, or configure the rest resources to use the old "test" source set that matches the old integTest task. (The former should be preferred). As part of this change (which is also breaking for plugin developers) the rest resources plugin has been removed from the build plugin and now requires either explicit application or application via the new YAML REST test plugin. Plugin developers should be able to fix the breaking changes to the YAML tests by adding apply plugin: 'elasticsearch.yaml-rest-test' and moving the YAML tests under a yamlRestTest folder (instead of test)	2020-07-06 14:16:26 -05:00
Yannick Welsch	15c85b29fd	Account for recovery throttling when restoring snapshot (#58658 ) (#58811 ) Restoring from a snapshot (which is a particular form of recovery) does not currently take recovery throttling into account (i.e. the `indices.recovery.max_bytes_per_sec` setting). While restores are subject to their own throttling (repository setting `max_restore_bytes_per_sec`), this repository setting does not allow for values to be configured differently on a per-node basis. As restores are very similar in nature to peer recoveries (streaming bytes to the node), it makes sense to configure throttling in a single place. The `max_restore_bytes_per_sec` setting is also changed to default to unlimited now, whereas previously it was set to `40mb`, which is the current default of `indices.recovery.max_bytes_per_sec`). This means that no behavioral change will be observed by clusters where the recovery and restore settings were not adapted. Relates https://github.com/elastic/elasticsearch/issues/57023 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-07-01 12:19:29 +02:00
Rene Groeschke	d952b101e6	Replace compile configuration usage with api (7.x backport) (#58721 ) * Replace compile configuration usage with api (#58451) - Use java-library instead of plugin to allow api configuration usage - Remove explicit references to runtime configurations in dependency declarations - Make test runtime classpath input for testing convention - required as java library will by default not have build jar file - jar file is now explicit input of the task and gradle will ensure its properly build * Fix compile usages in 7.x branch	2020-06-30 15:57:41 +02:00
Rene Groeschke	abc72c1a27	Unify dependency licenses task configuration (#58116 ) (#58274 ) - Remove duplicate dependency configuration - Use task avoidance api accross the build - Remove redundant licensesCheck config	2020-06-18 08:15:50 +02:00
Rene Groeschke	01e9126588	Remove deprecated usage of testCompile configuration (#57921 ) (#58083 ) * Remove usage of deprecated testCompile configuration * Replace testCompile usage by testImplementation * Make testImplementation non transitive by default (as we did for testCompile) * Update CONTRIBUTING about using testImplementation for test dependencies * Fail on testCompile configuration usage	2020-06-14 22:30:44 +02:00
markharwood	e2c0c4197f	Mute GoogleCloudStorageRepositoryClientYamlTestSuiteIT For #57115	2020-06-03 13:25:31 +01:00
Tanguy Leroux	b4a2cd810a	Use 3rd party task to run integration tests on external service (#56588 ) Backport of #56587 for 7.x	2020-06-02 11:26:58 +02:00
Armin Braun	be6fa72432	Fix GCS Mock Behavior for Missing Bucket (#57283 ) (#57310 ) * Fix GCS Mock Behavior for Missing Bucket We were throwing a 500 instead of a 404 for a missing bucket. This would make yaml tests needlessly wait for multiple seconds, retrying the 500 response with backoff, in the test checking behavior for missing buckets.	2020-05-29 10:01:20 +02:00
Armin Braun	a4eb3edf46	Fix GCS Repository YAML Test Build (#57073 ) (#57101 ) A few relatively obvious issues here: * We cannot run the different IT runs (large blob setting one and normal integ run) concurrently * We need to set the dependency tasks up correctly for the large blob run so that it works in isolation * We can't use the `localAddress` for the location header of the resumable upload (this breaks in YAML tests because GCS is using a loopback port forward for the initial request and the local address will be chosen as the actual Docker container host) Closes #57026	2020-05-25 11:10:39 +02:00
Francisco Fernández Castaño	60c7832141	Track upload requests on S3 repositories (#56904 ) Add tracking for regular and multipart uploads. Regular uploads are categorized as PUT. Multi part uploads are categorized as POST. The number of documents created for the test #testRequestStats have been increased so all upload methods are exercised. Backport of #56826	2020-05-18 19:05:17 +02:00
Francisco Fernández Castaño	8ab9fc10c1	Track multipart/resumable uploads GCS API calls (#56892 ) Add tracking for multipart and resumable uploads for GoogleCloudStorage. For resumable uploads only the last request is taken into account for billing, so that's the only request that's tracked. Backport of #56821	2020-05-18 13:39:26 +02:00
Francisco Fernández Castaño	97bf47f5b9	Track GET/LIST GoogleCloudStorage API calls (#56758 ) Backporting #56585 to 7.x branch. Adds tracking for the API calls performed by the GoogleCloudStorage underlying SDK. It hooks an HttpResponseInterceptor to the SDK transport layer and does http request filtering based on the URI paths that we are interested to track. Unfortunately we cannot hook a wrapper into the ServiceRPC interface since we're using different levels of abstraction to implement retries during reads (GoogleCloudStorageRetryingInputStream).	2020-05-14 14:03:21 +02:00
Yannick Welsch	ba39c261e8	Use streaming reads for GCS (#55506 ) To read from GCS repositories we're currently using Google SDK's official BlobReadChannel, which issues a new request every 2MB (default chunk size for BlobReadChannel) using range requests, and fully downloads the chunk before exposing it to the returned InputStream. This means that the SDK issues an awfully high number of requests to download large blobs. Increasing the chunk size is not an option, as that will mean that an awfully high amount of heap memory will be consumed by the download process. The Google SDK does not provide the right abstractions for a streaming download. This PR uses the lower-level primitives of the SDK to implement a streaming download, similar to what S3's SDK does. Also closes #55505	2020-04-21 13:22:26 +02:00
Ignacio Vera	4783f1894c	mute test testReadRangeBlobWithRetries (#55507 ) (#55508 )	2020-04-21 10:59:35 +02:00
Yannick Welsch	b9da307cd1	Add GCS support for searchable snapshots (#55403 ) Adds ranged read support for GCS repositories in order to enable searchable snapshot support for GCS. As part of this PR, I've extracted some of the test infrastructure to make sure that GoogleCloudStorageBlobContainerRetriesTests and S3BlobContainerRetriesTests are covering similar test (as I saw those diverging in what they cover)	2020-04-20 13:02:59 +02:00
Jason Tedor	0a1b566c65	Fix security manager bug writing large blobs to GCS (#55421 ) * Fix security manager bug writing large blobs to GCS This commit addresses a security manager permissions issue writing large blobs (on the resumable upload path) to GCS. The underlying issue here is that we need to wrap the close and write calls on the channel. It is not enough to do this: SocketAccess.doPrivilegedVoidIOException( () -> Streams.copy( inputStream, Channels.newOutputStream(client().writer(blobInfo, writeOptions)))); This reason that this is not enough is because Streams#copy will be in the stacktrace and it is not granted the security manager permissions needed to close or write this channel. We only grant those permissions to classes loaded in the plugin classloader, and Streams#copy is from the parent classloader. This is why we must wrap the close and write calls as privileged, to truncate the Streams#copy call out of the stacktrace. The reason that this issue is not caught in testing is because the size of data that we use in testing is too small to trigger the large blob resumable upload path. Therefore, we address this by adding a system property to control the threshold, which we can then set in tests to exercise this code path. Prior to rewriting the writeBlobResumable method to wrap the close and write calls as privileged, with this additional test, we are able to reproduce the security manager permissions issue. After adding the wrapping, this test now passes. * Fix forbidden APIs issue * Remove leftover debugging	2020-04-17 18:49:10 -04:00
Armin Braun	73ab3719e8	Mute GCS Retry Tests on JDK8 (#55372 ) Same as #53119 but for the retries tests. Closes #55317	2020-04-17 12:19:35 +02:00

1 2 3 4

188 Commits