OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	32dd852210	Update jackson-databind to 2.8.11.6 (#53522 ) This commit upgrades the jackson-databind depdendency to 2.8.11.6. Additionally, we revert a previous change that put ingest-geoip on the version of jackson-databind from the version properties file. This is because upgrading ingest-geoip to a later version of jackson-databind also requires an upgrade to the geoip2 dependency which is currently blocked. Therefore, if we can get to a point where we otherwise upgrade our Jackson dependencies, we do not want ingest-geoip to automatically come along with it.	2020-03-12 20:15:13 -04:00
Alan Woodward	5c861cfe6e	Upgrade to final lucene 8.5.0 snapshot (#53293 ) Lucene 8.5.0 release candidates are imminent. This commit upgrades master to use the latest snapshot to check that there are no last-minute bugs or regressions.	2020-03-10 09:32:59 +00:00
Nhat Nguyen	5476a49833	Revert "upgrade to lucene-snapshot-fa75139efea (#53150 ) (#53151 )" This reverts commit `058113aa42`.	2020-03-05 17:33:00 -05:00
Armin Braun	204c366a4e	Upgrade GCS SDK to 1.104.0 (#52839 ) (#53152 ) Upgrading the GCS SDK to the most recent version. Adjusting (i.e. improving) the REST mock accordingly. This should significantly boost performance by pulling in https://github.com/googleapis/java-core/issues/86 in some cases.	2020-03-05 11:18:18 +01:00
Ignacio Vera	058113aa42	upgrade to lucene-snapshot-fa75139efea (#53150 ) (#53151 )	2020-03-05 10:04:05 +01:00
Tanguy Leroux	52d4807f8d	Mute GoogleCloudStorageBlobStoreRepositoryTests on jdk8 (#53119 ) Tests in GoogleCloudStorageBlobStoreRepositoryTests are known to be flaky on JDK 8 (#51446, #52430 ) and we suspect a JDK bug (https://bugs.openjdk.java.net/browse/JDK-8180754) that triggers some assertion on the server side logic that emulates the Google Cloud Storage service. Sadly we were not able to reproduce the failures, even when using the same OS (Debian 9, Ubuntu 16.04) and JDK (Oracle Corporation 1.8.0_241 [Java HotSpot(TM) 64-Bit Server VM 25.241-b07]) of almost all the test failures on CI. While we spent some time fixing code (#51933, #52431) to circumvent the JDK bug they are still flaky on JDK-8. This commit mute these tests for JDK-8 only. Close ##52906	2020-03-05 09:18:05 +01:00
Nhat Nguyen	e6755afeeb	Upgrade to Lucene 8.5.0-snapshot-c4475920b08 (#52950 ) (#52977 ) To give LUCENE-9228 more CI cycles	2020-02-29 09:29:16 -05:00
Lee Hinman	a47e404732	Mute GoogleCloudStorageBlobStoreRepositoryTests (#52926 ) These intermittently fail due to an assertion triggered by a JDK bug. Relates to #52906	2020-02-27 15:16:48 -07:00
Mark Vieira	f46b370e7a	Fix cacheability of repository-hdfs integ tests (#52858 )	2020-02-27 09:53:51 -08:00
Mark Vieira	bc9c3f0135	Ignore test seed in third party test system property inputs (#52849 )	2020-02-26 14:29:34 -08:00
Mark Vieira	f06d692706	[Backport] Consolidate docker availability logic (#52656 )	2020-02-21 15:24:05 -08:00
markharwood	96d603979b	Upgrade Lucene to 8.5.0-snapshot-b01d7cb (#52584 ) Upgrading 7x to same Lucene 8.5 version used in master	2020-02-21 10:25:03 +00:00
Armin Braun	5a7db0c520	Fix GCS Test testReadLargeBlobWithRetries (#52619 ) (#52624 ) The countdown didn't work well here because it only returns `true` once the countdown reaches `0` but can on subsequent executions return `false` again if a countdown at `0` is counted down again, leading to more than the expected number of simulated failures. Closes #52607	2020-02-21 10:34:53 +01:00
Armin Braun	1662cd45a4	Add Region and Signer Algorithm Overrides to S3 Repos (#52112 ) (#52562 ) Exposes S3 SDK signing region and algorithm override settings as requested in #51861. Closes #51861	2020-02-21 10:21:20 +01:00
Armin Braun	0a09e15959	Add Caching for RepositoryData in BlobStoreRepository (#52341 ) (#52566 ) Cache latest `RepositoryData` on heap when it's absolutely safe to do so (i.e. when the repository is in strictly consistent mode). `RepositoryData` can safely be assumed to not grow to a size that would cause trouble because we often have at least two copies of it loaded at the same time when doing repository operations. Also, concurrent snapshot API status requests currently load it independently of each other and so on, making it safe to cache on heap and assume as "small" IMO. The benefits of this move are: * Much faster repository status API calls * listing all snapshot names becomes instant * Other operations are sped up massively too because they mostly operate in two steps: load repository data then load multiple other blobs to get the additional data * Additional cloud cost savings * Better resiliency, saving another spot where an IO issue could break the snapshot * We can simplify a number of spots in the current code that currently pass around the repository data in tricky ways to avoid loading it multiple times in follow ups.	2020-02-21 10:20:07 +01:00
Armin Braun	4bb780bc37	Refactor Inflexible Snapshot Repository BwC (#52365 ) (#52557 ) * Refactor Inflexible Snapshot Repository BwC (#52365) Transport the version to use for a snapshot instead of whether to use shard generations in the snapshots in progress entry. This allows making upcoming repository metadata changes in a flexible manner in an analogous way to how we handle serialization BwC elsewhere. Also, exposing the version at the repository API level will make it easier to do BwC relevant changes in derived repositories like source only or encrypted.	2020-02-21 09:14:34 +01:00
Mark Vieira	4bce9984e6	Mute GoogleCloudStorageBlobContainerRetriesTests.testReadLargeBlobWithRetries Signed-off-by: Mark Vieira <portugee@gmail.com>	2020-02-20 15:13:34 -08:00
Armin Braun	aeb7b777e6	Add Blob Download Retries to GCS Repository (#52479 ) (#52521 ) * Add Blob Download Retries to GCS Repository Exactly as #46589 (and kept as close to it as possible code wise so we can dry things up in a follow-up potentially) but for GCS. Closes #52319	2020-02-19 18:29:13 +01:00
Tim Brooks	e752221fc6	Upgrade netty to 4.1.45.Final (#51689 ) Upgrade netty.	2020-02-18 09:11:29 -07:00
Ioannis Kakavas	d9ce0e6733	Update BouncyCastle to 1.64 (#52185 ) (#52464 ) This commit upgrades the bouncycastle dependency from 1.61 to 1.64.	2020-02-18 14:11:34 +02:00
Armin Braun	a9c7557ac4	Fix Failure to Drain Stream in GCS Repo Tests (#52431 ) (#52454 ) Same as #51933 but for the custom handler just used in this test. Closes #52430	2020-02-18 11:37:34 +01:00
Marios Trivyzas	dac720d7a1	Add a cluster setting to disallow expensive queries (#51385 ) (#52279 ) Add a new cluster setting `search.allow_expensive_queries` which by default is `true`. If set to `false`, certain queries that have usually slow performance cannot be executed and an error message is returned. - Queries that need to do linear scans to identify matches: - Script queries - Queries that have a high up-front cost: - Fuzzy queries - Regexp queries - Prefix queries (without index_prefixes enabled - Wildcard queries - Range queries on text and keyword fields - Joining queries - HasParent queries - HasChild queries - ParentId queries - Nested queries - Queries on deprecated 6.x geo shapes (using PrefixTree implementation) - Queries that may have a high per-document cost: - Script score queries - Percolate queries Closes: #29050 (cherry picked from commit a8b39ed842c7770bd9275958c9f747502fd9a3ea)	2020-02-12 22:56:14 +01:00
Armin Braun	6ea3f5ada1	Move EC2 Discovery Tests to Mock Rest API (#50605 ) (#52270 ) Move EC2 discovery tests to using the mock REST API introduced in https://github.com/elastic/elasticsearch/pull/50550 instead of mocking the AWS SDK classes manually. Move the trivial remaining AWS SDK mocks to the single test suit that was using them.	2020-02-12 18:35:50 +01:00
Ignacio Vera	80e3c97210	Upgrade to lucene-8.5.0-snapshot-d62f6307658 (#52039 ) (#52130 )	2020-02-10 10:13:22 +01:00
Ioannis Kakavas	343fb36c7f	Test modifications for FIPS 140 mode (#51832 ) (#52128 ) - Enable SunJGSS provider for Kerberos tests - Handle the fact that in the decrypt method in KeyStoreWrapper might not throw immediately when the GCM cipher is from BouncyCastle FIPS and we end up with a DataInputStream that has reached it's end. - Disable tests, jarHell, testingConventions for ingest attachment plugin. We don't support this plugin (and document this) in FIPS mode. - Don't attempt to install ingest-attachment in smoke-test-plugins	2020-02-10 10:57:03 +02:00
Jay Modi	3edadfefd0	RestHandlers declare handled routes (#52123 ) This commit changes how RestHandlers are registered with the RestController so that a RestHandler no longer needs to register itself with the RestController. Instead the RestHandler interface has new methods which when called provide information about the routes (method and path combinations) that are handled by the handler including any deprecated and/or replaced combinations. This change also makes the publication of RestHandlers safe since they no longer publish a reference to themselves within their constructors. Closes #51622 Co-authored-by: Jason Tedor <jason@tedor.me> Backport of #51950	2020-02-09 22:48:32 -07:00
Ioannis Kakavas	8c0b49cd32	Adjust jarHell and 3rd party audit exclusions (#51733 ) (#51766 ) Now that the FIPS 140 security provider is simply a test dependency we don't need the thirdPartyAudit exceptions, but plugin-cli and transport-netty4 do need jarHell disabled as they use the non fips BouncyCastle security provider as a test dependency too.	2020-02-10 07:38:59 +02:00
Julie Tibshirani	337d73a7c6	Rename MapperService#fullName to fieldType. The new name more accurately describes what the method returns.	2020-02-07 10:35:53 -08:00
Armin Braun	91e938ead8	Add Trace Logging of REST Requests (#51684 ) (#52015 ) Being able to trace log all REST requests to a node would make debugging a number of issues a lot easier.	2020-02-07 09:03:20 +01:00
Maria Ralli	8d3e73b3a0	Add host address to BindTransportException message (#51269 ) When bind fails, show the host address in addition to the port. This helps debugging cases with wrong "network.host" values. Closes #48001	2020-02-04 17:13:19 +00:00
Mayya Sharipova	42b885f050	Upgrade to lucene-8.5.0-snapshot-3333ce7da6d (#51749 ) Backport for #51327	2020-01-31 11:20:15 -05:00
Ioannis Kakavas	1dc965f03f	Mute ec2 test in FIPS 140 mode (#51686 ) (#51726 ) as it needs an extra permission, until we can figure out how to grant the permission in FIPS 140 mode too. See: https://github.com/elastic/elasticsearch/issues/51685	2020-01-31 09:35:20 +02:00
Armin Braun	74e3694234	Optimize GCS Repo Uploads (#51596 ) (#51618 ) For small uploads (that can still be up to 5MB!) we needlessly reading the `InputStream` into a BAOS which entailed allocating the `byte[]` for the stream contents twice (because to `toByteArray` on the BAOS copies). Also, for resumeable uploads we were needlessly wrapping the output channel and running each individual write in its own privileged context when we could just wrap the whole upload in a single privileged context. Relates #51593	2020-01-29 16:07:30 +01:00
Armin Braun	7914c1a734	Optimize GCS Mock (#51593 ) (#51594 ) This test was still very GC heavy in Java 8 runs in particular which seems to slow down request processing to the point of timeouts in some runs. This PR completely removes the large number of O(MB) `byte[]` allocations that were happening in the mock http handler which cuts the allocation rate by about a factor of 5 in my local testing for the GC heavy `testSnapshotWithLargeSegmentFiles` run. Closes #51446 Closes #50754	2020-01-29 11:06:05 +01:00
Ioannis Kakavas	ee202a642f	Enable tests in FIPS 140 in JDK 11 (#49485 ) This change changes the way to run our test suites in JVMs configured in FIPS 140 approved mode. It does so by: - Configuring any given runtime Java in FIPS mode with the bundled policy and security properties files, setting the system properties java.security.properties and java.security.policy with the == operator that overrides the default JVM properties and policy. - When runtime java is 11 and higher, using BouncyCastle FIPS Cryptographic provider and BCJSSE in FIPS mode. These are used as testRuntime dependencies for unit tests and internal clusters, and copied (relevant jars) explicitly to the lib directory for testclusters used in REST tests - When runtime java is 8, using BouncyCastle FIPS Cryptographic provider and SunJSSE in FIPS mode. Running the tests in FIPS 140 approved mode doesn't require an additional configuration either in CI workers or locally and is controlled by specifying -Dtests.fips.enabled=true	2020-01-27 11:14:52 +02:00
Armin Braun	3e3673b518	Fix ByteBuf Leak in Nio HTTP Tests (#51444 ) (#51457 ) It is the job of the http server transport to release the request in the handler but the mock fails to do so since we never override `incomingRequest`.	2020-01-25 16:19:49 +01:00
Armin Braun	c29b235a5a	Stop Copying Bulk HTTP Requests in NIO Networking (#49819 ) (#51393 ) Same as #44564 but for NIO.	2020-01-24 11:23:16 +01:00
Mark Vieira	f86de2a9cb	Always test against default distribution when in a FIPS JVM (#51273 ) (#51333 )	2020-01-23 14:54:57 -08:00
Mark Vieira	c08c282c0e	Revert "Always test against default distribution when in a FIPS JVM (#51273 )" This reverts commit `0169498711`. This reverts commit `c5a032b594`.	2020-01-22 12:15:57 -08:00
Mark Vieira	c5a032b594	Always test against default distribution when in a FIPS JVM (#51273 ) (cherry picked from commit e34d7fdaf7b511627c64a9e16805fd82f980b8c6)	2020-01-22 11:30:25 -08:00
Armin Braun	c5f1a90159	Add CoolDown Period to S3 Repository (#51074 ) (#51213 ) Add cool down period after snapshot finalization and delete to prevent eventually consistent AWS S3 from corrupting shard level metadata as long as the repository is using the old format metadata on the shard level.	2020-01-20 12:18:16 +01:00
Nik Everett	f6c89b4599	Move test of custom sig heuristic to plugin (#50891 ) (#51067 ) This moves the testing of custom significance heuristic plugins from an `ESIntegTestCase` to an example plugin. This is much more "real" and can be used as an example for anyone that needs to actually build such a plugin. The old test had testing concerns and the example all jumbled together.	2020-01-16 14:49:12 -05:00
Armin Braun	4a7e09f624	Enforce Logging of Errors in GCS Rest RetriesTests (#50761 ) (#50783 ) It's impossible to tell why #50754 fails without this change. We're failing to close the `exchange` somewhere and there is no write timeout in the GCS SDK (something to look into separately) only a read timeout on the socket so if we're failing on an assertion without reading the full request body (at least into the read-buffer) we're locking up waiting forever on `write0`. This change ensure the `exchange` is closed in the tests where we could lock up on a write and logs the failure so we can find out what broke #50754.	2020-01-09 10:46:07 +01:00
Adrien Grand	4f2299c714	Upgrade to Lucene 8.4.0. (#50518 ) (#50750 )	2020-01-08 18:53:59 +01:00
Armin Braun	a725896c92	Fix and Reenable SnapshotTool Minio Tests (#50736 ) (#50745 ) This solves half of the problem in #46813 by moving the S3 tests to using the shared minio fixture so we at least have some non-3rd-party, constantly running coverage on these tests.	2020-01-08 16:33:36 +01:00
Armin Braun	8819fa4ebe	Make EC2 Discovery Cache Empty Seed Hosts List (#50607 ) (#50626 ) Follow up to #50550. Cache empty nodes lists (`fetchDynamicNodes` will return an empty list in case of failure) now that the plugin properly retries requests to AWS EC2 APIs.	2020-01-03 21:32:36 +01:00
Armin Braun	8092a4991e	Make EC2 Discovery Plugin Retry Requests (#50550 ) (#50558 ) Use the default retry condition instead of never retrying in the discovery plugin causing hot retries upstream and add a test that verifies retrying works. Closes #50462	2020-01-02 17:39:59 +01:00
Alexander Reelsen	541dc262bb	Remove accidentally added license files (#50370 ) As license infos and sha files belong to the licenses/ folder, these files seem to have been added accidentally some time ago.	2019-12-20 13:53:55 +01:00
Stuart Tettemer	689df1f28f	Scripting: ScriptFactory not required by compile (#50344 ) (#50392 ) Avoid backwards incompatible changes for 8.x and 7.6 by removing type restriction on compile and Factory. Factories may optionally implement ScriptFactory. If so, then they can indicate determinism and thus cacheability. Backport Relates: #49466	2019-12-19 12:50:25 -07:00
Tanguy Leroux	903305284d	Remove snapshots left by previous tests failures (#50380 ) When a third party test failed, it potentially left some snapshots in the repository. In case of tests running against an external service like Azure, the remaining snapshots can fail the future test executions are they are not supposed to exist. Similarly to what has been done for S3 and GCS, this commit cleans up remaining snapshots before the test execution. Closes #50304	2019-12-19 17:51:51 +01:00
Armin Braun	ce294e1564	Better Logging S3 Bulk Delete Failures (#50203 ) (#50262 ) Unfortunately bulk delete exceptions don't show the individual delete errors when a bulk delete fails when you log them outright so I added this work-around to get the individual details to get useful logging.	2019-12-17 09:42:39 +01:00
Armin Braun	761d6e8e4b	Remove BlobContainer Tests against Mocks (#50194 ) (#50220 ) * Remove BlobContainer Tests against Mocks Removing all these weird mocks as asked for by #30424. All these tests are now part of real repository ITs and otherwise left unchanged if they had independent tests that didn't call the `createBlobStore` method previously. The HDFS tests also get added coverage as a side-effect because they did not have an implementation of the abstract repository ITs. Closes #30424	2019-12-16 11:37:09 +01:00
Ignacio Vera	b5ec227de8	upgrade to lucene 8.4.0-snapshot-08b8d116f8f (#50129 ) (#50132 )	2019-12-12 13:13:37 +01:00
Armin Braun	6eee41e253	Remove Unused Single Delete in BlobStoreRepository (#50024 ) (#50123 ) * Remove Unused Single Delete in BlobStoreRepository There are no more production uses of the non-bulk delete or the delete that throws on missing so this commit removes both these methods. Only the bulk delete logic remains. Where the bulk delete was derived from single deletes, the single delete code was inlined into the bulk delete method. Where single delete was used in tests it was replaced by bulk deleting.	2019-12-12 11:17:46 +01:00
Armin Braun	d19c8db4e4	Fix GCS Mock Batch Delete Behavior (#50034 ) (#50084 ) Batch deletes get a response for every delete request, not just those that actually hit an existing blob. The fact that we only responded for existing blobs leads to a degenerate response that throws a parse exception if a batch delete only contains non-existant blobs.	2019-12-11 17:40:25 +01:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
Jason Tedor	bfb2dc1353	Enable dependent settings values to be validated (#49942 ) Today settings can declare dependencies on another setting. This declaration is implemented so that if the declared setting is not set when the declaring setting is, settings validation fails. Yet, in some cases we want not only that the setting is set, but that it also has a specific value. For example, with the monitoring exporter settings, if xpack.monitoring.exporters.my_exporter.host is set, we not only want that xpack.monitoring.exporters.my_exporter.type is set, but that it is also set to local. This commit extends the settings infrastructure so that this declaration is possible. The use of this in the monitoring exporter settings will be implemented in a follow-up.	2019-12-09 12:45:50 -05:00
Stuart Tettemer	17cda5b2c0	Scripting: Groundwork for caching script results (#49895 ) (#49944 ) In order to cache script results in the query shard cache, we need to check if scripts are deterministic. This change adds a default method to the script factories, `isResultDeterministic() -> false` which is used by the `QueryShardContext`. Script results were never cached and that does not change here. Future changes will implement this method based on whether the results of the scripts are deterministic or not and therefore cacheable. Refs: #49466 Backport	2019-12-06 15:08:05 -07:00
Jake Landis	1c5a139968	Update jackson-databind to 2.8.11.4 (#49347 ) (#49937 )	2019-12-06 13:39:33 -06:00
Alexander Reelsen	d299bf5760	Add tests for ingesting CBOR data attachments (#49715 ) Our docs specifically mention that CBOR is supported when ingesting attachments. However this is not tested anywhere. This adds a test, that uses specifically CBOR format in its IndexRequest and another one that behaves like CBOR in the ingest attachment unit tests.	2019-12-06 14:33:39 +01:00
Stuart Tettemer	426c7a5e8f	Scripting: add available languages & contexts API (#49652 ) (#49815 ) Adds `GET /_script_language` to support Kibana dynamic scripting language selection. Response contains whether `inline` and/or `stored` scripts are enabled as determined by the `script.allowed_types` settings. For each scripting language registered, such as `painless`, `expression`, `mustache` or custom, available contexts for the language are included as determined by the `script.allowed_contexts` setting. Response format: ``` { "types_allowed": [ "inline", "stored" ], "language_contexts": [ { "language": "expression", "contexts": [ "aggregation_selector", "aggs" ... ] }, { "language": "painless", "contexts": [ "aggregation_selector", "aggs", "aggs_combine", ... ] } ... ] } ``` Fixes: #49463 Backport	2019-12-04 16:18:22 -07:00
Armin Braun	996cddd98b	Stop Copying Every Http Request in Message Handler (#44564 ) (#49809 ) * Copying the request is not necessary here. We can simply release it once the response has been generated and a lot of `Unpooled` allocations that way * Relates #32228 * I think the issue that preventet that PR that PR from being merged was solved by #39634 that moved the bulk index marker search to ByteBuf bulk access so the composite buffer shouldn't require many additional bounds checks (I'd argue the bounds checks we add, we save when copying the composite buffer) * I couldn't neccessarily reproduce much of a speedup from this change, but I could reproduce a very measureable reduction in GC time with e.g. Rally's PMC (4g heap node and bulk requests of size 5k saw a reduction in young GC time by ~10% for me)	2019-12-04 08:41:42 +01:00
Armin Braun	813b49adb4	Make BlobStoreRepository Aware of ClusterState (#49639 ) (#49711 ) * Make BlobStoreRepository Aware of ClusterState (#49639) This is a preliminary to #49060. It does not introduce any substantial behavior change to how the blob store repository operates. What it does is to add all the infrastructure changes around passing the cluster service to the blob store, associated test changes and a best effort approach to tracking the latest repository generation on all nodes from cluster state updates. This brings a slight improvement to the consistency by which non-master nodes (or master directly after a failover) will be able to determine the latest repository generation. It does not however do any tricky checks for the situation after a repository operation (create, delete or cleanup) that could theoretically be used to get even greater accuracy to keep this change simple. This change does not in any way alter the behavior of the blobstore repository other than adding a better "guess" for the value of the latest repo generation and is mainly intended to isolate the actual logical change to how the repository operates in #49060	2019-11-29 14:57:47 +01:00
Mayya Sharipova	2dafecc398	Upgrade lucene to 8.4.0-snapshot-e648d601efb (#49641 )	2019-11-28 11:59:58 -05:00
Jim Ferenczi	d6445fae4b	Add a cluster setting to disallow loading fielddata on _id field (#49166 ) This change adds a dynamic cluster setting named `indices.id_field_data.enabled`. When set to `false` any attempt to load the fielddata for the `_id` field will fail with an exception. The default value in this change is set to `false` in order to prevent fielddata usage on this field for future versions but it will be set to `true` when backporting to 7x. When the setting is set to true (manually or by default in 7x) the loading will also issue a deprecation warning since we want to disallow fielddata entirely when https://github.com/elastic/elasticsearch/issues/26472 is implemented. Closes #43599	2019-11-28 09:35:28 +01:00
Armin Braun	3862400270	Remove Redundant EsBlobStoreTestCase (#49603 ) (#49605 ) All the implementations of `EsBlobStoreTestCase` use the exact same bootstrap code that is also used by their implementation of `EsBlobStoreContainerTestCase`. This means all tests might as well live under `EsBlobStoreContainerTestCase` saving a lot of code duplication. Also, there was no HDFS implementation for `EsBlobStoreTestCase` which is now automatically resolved by moving the tests over since there is a HDFS implementation for the container tests.	2019-11-26 20:57:19 +01:00
Alan Woodward	fe2c65185e	Annotated text type should extend TextFieldType (#49555 ) The annotated text mapper has a field type that currently extends StringFieldType, which means that all the positional-related query factory methods need to be copied over from TextFieldType. In addition, MappedFieldType.intervals() hasn't been overridden, so you can't use intervals queries with annotated text - a major drawback, since one of the purposes of annotated text is to be able to run positional queries against annotations. This commit changes the annotated text field type to extend TextFieldType instead, adding tests to ensure that position queries work correctly. Closes #49289	2019-11-26 16:52:21 +00:00
Armin Braun	495b543e63	Improve Stability of GCS Mock API (#49592 ) (#49597 ) Same as #49518 pretty much but for GCS. Fixing a few more spots where input stream can get closed without being fully drained and adding assertions to make sure it's always drained. Moved the no-close stream wrapper to production code utilities since there's a number of spots in production code where it's also useful (will reuse it there in a follow-up).	2019-11-26 16:53:51 +01:00
Armin Braun	231d079bf8	Fix Azure Mock Issues (#49377 ) (#49381 ) Fixing a few small issues found in this code: 1. We weren't reading the request headers but the response headers when checking for blob existence in the mocked single upload path 2. Error code can never be `null` removed the dead code that resulted 3. In the logging wrapper we weren't checking for `Throwable` so any failing assertions in the http mock would not show up since they run on a thread managed by the mock http server	2019-11-21 19:57:50 +01:00
Tanguy Leroux	6bad28a835	Mute AzureBlobStoreRepositoryTests (#49364 ) Relates #48978	2019-11-20 11:16:16 +01:00
Tanguy Leroux	f753fa2265	HttpHandlers should return correct list of objects (#49283 ) This commit fixes the server side logic of "List Objects" operations of Azure and S3 fixtures. Until today, the fixtures were returning a " flat" view of stored objects and were not correctly handling the delimiter parameter. This causes some objects listing to be wrongly interpreted by the snapshot deletion logic in Elasticsearch which relies on the ability to list child containers of BlobContainer (#42653) to correctly delete stale indices. As a consequence, the blobs were not correctly deleted from the emulated storage service and stayed in heap until they got garbage collected, causing CI failures like #48978. This commit fixes the server side logic of Azure and S3 fixture when listing objects so that it now return correct common blob prefixes as expected by the snapshot deletion process. It also adds an after-test check to ensure that tests leave the repository empty (besides the root index files). Closes #48978	2019-11-20 09:26:42 +01:00
Tanguy Leroux	ca4f55f2e4	Add docker-compose fixtures for S3 integration tests (#49107 ) (#49229 ) Similarly to what has been done for Azure (#48636) and GCS (#48762), this committ removes the existing Ant fixture that emulates a S3 storage service in favor of multiple docker-compose based fixtures. The goals here are multiple: be able to reuse a s3-fixture outside of the repository-s3 plugin; allow parallel execution of integration tests; removes the existing AmazonS3Fixture that has evolved in a weird beast in dedicated, more maintainable fixtures. The server side logic that emulates S3 mostly comes from the latest HttpHandler made for S3 blob store repository tests, with additional features extracted from the (now removed) AmazonS3Fixture: authentication checks, session token checks and improved response errors. Chunked upload request support for S3 object has been added too. The server side logic of all tests now reside in a single S3HttpHandler class. Whereas AmazonS3Fixture contained logic for basic tests, session token tests, EC2 tests or ECS tests, the S3 fixtures are now dedicated to each kind of test. Fixtures are inheriting from each other, making things easier to maintain.	2019-11-18 05:56:59 -05:00
Rory Hunter	c46a0e8708	Apply 2-space indent to all gradle scripts (#49071 ) Backport of #48849. Update `.editorconfig` to make the Java settings the default for all files, and then apply a 2-space indent to all `*.gradle` files. Then reformat all the files.	2019-11-14 11:01:23 +00:00
Tanguy Leroux	20fc1dbe18	Move MinIO fixture in its own project (#49036 ) This commit moves the MinIO docker-compose fixture from the :plugins:repository-s3 to its own :test:minio-fixture Gradle project.	2019-11-13 10:03:59 -05:00
Tanguy Leroux	8a14ea5567	Add docker-composed based test fixture for GCS (#48902 ) Similarly to what has be done for Azure in #48636, this commit adds a new :test:fixtures:gcs-fixture project which provides two docker-compose based fixtures that emulate a Google Cloud Storage service. Some code has been extracted from existing tests and placed into this new project so that it can be easily reused in other projects.	2019-11-07 13:27:22 -05:00
Mark Vieira	6ab4645f4e	[7.x] Introduce type-safe and consistent pattern for handling build globals (#48818 ) This commit introduces a consistent, and type-safe manner for handling global build parameters through out our build logic. Primarily this replaces the existing usages of extra properties with static accessors. It also introduces and explicit API for initialization and mutation of any such parameters, as well as better error handling for uninitialized or eager access of parameter values. Closes #42042	2019-11-01 11:33:11 -07:00
Andrey Ershov	088988bb37	GCS snapshot cleanup tool backport to 7.x (#48750 ) This is the backport of #45076 with dependent changes.	2019-10-31 18:21:36 +03:00
Tanguy Leroux	989467ca1e	Add docker-compose based test fixture for Azure (#48736 ) This commit adds a new :test:fixtures:azure-fixture project which provides a docker-compose based container that runs a AzureHttpFixture Java class that emulates an Azure Storage service. The logic to emulate the service is extracted from existing tests and placed in AzureHttpHandler into the new project so that it can be easily reused. The :plugins:repository-azure project is an example of such utilization. The AzureHttpFixture fixture is just a wrapper around AzureHttpHandler and is now executed within the docker container. The :plugins:repository-azure:qa:microsoft-azure project uses the new test fixture and the existing AzureStorageFixture has been removed.	2019-10-31 10:43:43 +01:00
Tanguy Leroux	24f6985235	Reduce allocations when draining HTTP requests bodies in repository tests (#48541 ) In repository integration tests, we drain the HTTP request body before returning a response. Before this change this operation was done using Streams.readFully() which uses a 8kb buffer to read the input stream, it now uses a 1kb for the same operation. This should reduce the allocations made during the tests and speed them up a bit on CI. Co-authored-by: Armin Braun <me@obrown.io>	2019-10-29 09:15:06 +01:00
Tim Brooks	45e42f4e18	Upgrade to Netty 4.1.43 (#48484 ) With this update we can remove the mitigation in our custom allocator which forces heap buffer allocations.	2019-10-25 10:17:25 -06:00
Tanguy Leroux	06d2cc5cef	Add missing azure error code (#48520 ) In #47176 we changed the internal HTTP server that emulates the Azure Storage service so that it includes a response body for injected errors. This fixed most of the issues reported in #47120 but sadly I missed to map one error to its Azure equivalent, and it triggered some CI failures today. Closes #47120	2019-10-25 16:50:51 +02:00
Tim Brooks	c0b545f325	Make BytesReference an interface (#48486 ) BytesReference is currently an abstract class which is extended by various implementations. This makes it very difficult to use the delegation pattern. The implication of this is that our releasable BytesReference is a PagedBytesReference type and cannot be used as a generic releasable bytes reference that delegates to any reference type. This commit makes BytesReference an interface and introduces an AbstractBytesReference for common functionality.	2019-10-24 15:39:30 -06:00
Tanguy Leroux	e1dd0e753d	Differentiate service account tokens in GCS tests (#48382 ) This commit changes the test so that each node use a specific service account and private key. It also changes how unique request ids are generated for refresh token request using the token itself, so that error count will be specific per node (each node should execute a single refresh token request as tokens are valid for 1 hour).	2019-10-23 16:57:35 +02:00
Tanguy Leroux	4790ee4c32	Reenable azure repository tests and remove some randomization in http servers (#48283 ) Relates #47948 Relates #47380	2019-10-23 09:06:50 +02:00
Ignacio Vera	b1224fca8c	upgrade to Lucene-8.3.0-snapshot-25968e3b75e (#48227 )	2019-10-21 08:21:09 +02:00
Armin Braun	5caa101345	Fix Bug in Azure Repo Exception Handling (#47968 ) (#48030 ) We were incorrectly handling `IOExceptions` thrown by the `InputStream` side of the upload operation, resulting in a `ClassCastException` as we expected to never get `IOException` from the Azure SDK code but we do in practice. This PR also sets an assertion on `markSupported` for the streams used by the SDK as adding the test for this scenario revealed that the SDK client would retry uploads for non-mark-supporting streams on `IOException` in the `InputStream`.	2019-10-15 12:10:19 +02:00
Tim Brooks	8814bf07f1	Upgrade to Netty 4.1.42 (#48015 ) Upgrades the netty version.	2019-10-14 13:54:02 -06:00
Nick Knize	7f01b0a670	Mute AzureBlobStoreRepositoryTests.testIndicesDeletedFromRepository (#47949 )	2019-10-11 14:07:24 -05:00
Jim Ferenczi	bd6e2592a7	Remove the SearchContext from the highlighter context (#47733 ) Today built-in highlighter and plugins have access to the SearchContext through the highlighter context. However most of the information exposed in the SearchContext are not needed and a QueryShardContext would be enough to perform highlighting. This change replaces the SearchContext by the informations that are absolutely required by highlighter: a QueryShardContext and the SearchContextHighlight. This change allows to reduce the exposure of the complex SearchContext and remove the needs to clone it in the percolator sub phase. Relates #47198 Relates #46523	2019-10-10 10:34:10 +02:00
Armin Braun	302e09decf	Simplify some Common ActionRunnable Uses (#47799 ) (#47828 ) Especially in the snapshot code there's a lot of logic chaining `ActionRunnables` in tricky ways now and the code is getting hard to follow. This change introduces two convinience methods that make it clear that a wrapped listener is invoked with certainty in some trickier spots and shortens the code a bit.	2019-10-09 23:29:50 +02:00
Ryan Ernst	f32692208e	Add explanations to script score queries (#46693 ) (#47548 ) While function scores using scripts do allow explanations, they are only creatable with an expert plugin. This commit improves the situation for the newer script score query by adding the ability to set the explanation from the script itself. To set the explanation, a user would check for `explanation != null` to indicate an explanation is needed, and then call `explanation.set("some description")`.	2019-10-03 21:05:05 -07:00
Alpar Torok	0a14bb174f	Remove eclipse conditionals (#44075 ) * Remove eclipse conditionals We used to have some meta projects with a `-test` prefix because historically eclipse could not distinguish between test and main source-sets and could only use a single classpath. This is no longer the case for the past few Eclipse versions. This PR adds the necessary configuration to correctly categorize source folders and libraries. With this change eclipse can import projects, and the visibility rules are correct e.x. auto compete doesn't offer classes from test code or `testCompile` dependencies when editing classes in `main`. Unfortunately the cyclic dependency detection in Eclipse doesn't seem to take the difference between test and non test source sets into account, but since we are checking this in Gradle anyhow, it's safe to set to `warning` in the settings. Unfortunately there is no setting to ignore it. This might cause problems when building since Eclipse will probably not know the right order to build things in so more wirk might be necesarry.	2019-10-03 11:55:00 +03:00
Tanguy Leroux	f5c5411fe8	Differentiate base paths in repository integration tests (#47284 ) (#47300 ) This commit change the repositories base paths used in Azure/S3/GCS integration tests so that they don't conflict with each other when tests run in parallel on real storage services. Closes #47202	2019-10-01 08:39:55 +02:00
James Rodewig	e01465eb88	[DOCS] Correct typo in ICU Analysis plugin description (#47175 ) (#47219 )	2019-09-27 13:04:14 -04:00
Henning Andersen	a1e2e208ce	Mute Snapshot/Restore with repository-azure (#47204 ) Relates #47201	2019-09-27 12:13:01 +02:00
Tanguy Leroux	42ae76ab7c	Injected response errors in Azure repository tests should have a body (#47176 ) The Azure SDK client expects server errors to have a body, something that looks like: <?xml version="1.0" encoding="utf-8"?> <Error> <Code>string-value</Code> <Message>string-value</Message> </Error> I've forgot to add such errors in Azure tests and that triggers some NPE in the client like the one reported in #47120. Closes #47120	2019-09-27 09:43:29 +02:00
Tanguy Leroux	b1bf05bb89	Add blob container retries tests for Azure SDK client (#47032 ) Similarly to what has been done for S3 and GCS, this commit adds unit tests that verify the retry logic of the Azure SDK client implementation when the remote service returns errors. It only tests the retry logic in case of errors and not in case of timeouts because Azure client timeout options are not exposed as settings.	2019-09-25 09:19:48 +02:00
Armin Braun	00f2e7f627	Update AWS SDK for repository-s3 plugin to support IAM Roles for Service Accounts (#46969 ) (#47004 ) * Update AWS SDK for repository-s3 and discovery-ec2 plugins	2019-09-24 17:15:11 +02:00
Tanguy Leroux	6986d7f968	Add blob container retries tests for Google Cloud Storage (#46968 ) Similarly to what has been done for S3 in #45383, this commit adds unit tests that verify the behavior of the SDK client and blob container implementation for Google Storage when the remote service returns errors. The main purpose was to add an extra test to the specific retry logic for 410-Gone errors added in #45963. Relates #45963	2019-09-24 08:58:24 +02:00
Alpar Torok	5fd7505efc	Testfixtures allow a single service only (#46780 ) This PR adds some restrictions around testfixtures to make sure the same service ( as defiend in docker-compose.yml ) is not shared between multiple projects. Sharing would break running with --parallel. Projects can still share fixtures as long as each has it;s own service within. This is still useful to share some of the setup and configuration code of the fixture. Project now also have to specify a service name when calling useCluster to refer to a specific service. If this is not the case all services will be claimed and the fixture can't be shared. For this reason fixtures have to explicitly specify if they are using themselves ( fixture and tests in the same project ).	2019-09-23 14:13:49 +03:00
Tanguy Leroux	add7148f3b	GCS deleteBlobsIgnoringIfNotExists should catch StorageException (#46832 ) GoogleCloudStorageBlobStore.deleteBlobsIgnoringIfNotExists() does not correctly catch StorageException thrown by batch.submit(). In the case a snapshot is deleted through BlobStoreRepository.deleteSnapshot() a storage exception is not caught (only IOException are) so the deletion is interrupted and indices cannot be cleaned up. The storage exception bubbles up to SnapshotService.deleteSnapshotFromRepository() but the listener that removes the deletion from the cluster state is not executed, leaving the deletion in the cluster state. This bug has been reported in #46772 where batch.submit() threw an exception in the test testIndicesDeletedFromRepository and following tests failed because a snapshot deletion was running. Relates #46772	2019-09-20 10:02:23 +02:00
Tanguy Leroux	3ae51f25dd	Move testSnapshotWithLargeSegmentFiles to ESMockAPIBasedRepositoryIntegTestCase (#46802 ) This commit moves the common test testSnapshotWithLargeSegmentFiles to the ESMockAPIBasedRepositoryIntegTestCase base class.	2019-09-18 15:41:30 +02:00
Tanguy Leroux	799f7def9f	Add block support to AzureBlobStoreRepositoryTests (#46664 ) This commit adds support for Put Block API to the internal HTTP server used in Azure repository integration tests. This allows to test the behavior of the Azure SDK client when the Azure Storage service returns errors when uploading Blob in multiple blocks or when downloading a blob using ranged downloads.	2019-09-18 09:43:08 +02:00
Tanguy Leroux	fd42358a6d	Add support for Multipart upload to S3 repository integration tests (#46704 ) This commit adds support for Multipart upload to the internal HTTP server used in S3 repository integration tests.	2019-09-18 09:40:25 +02:00
Tanguy Leroux	4db37801d0	Add resumable uploads support to GCS repository integration tests (#46562 ) This commit adds support for resumable uploads to the internal HTTP server used in GoogleCloudStorageBlobStoreRepositoryTests. This way we can also test the behavior of the Google's client when the service returns server errors in response to resumable upload requests. The BlobStore implementation for GCS has the choice between 2 methods to upload a blob: resumable and multipart. In the current implementation, the client executes a resumable upload if the blob size is larger than LARGE_BLOB_THRESHOLD_BYTE_SIZE, otherwise it executes a multipart upload. This commit makes this logic overridable in tests, allowing to randomize the decision of using one method or the other. The commit add support for single request resumable uploads and chunked resumable uploads (the blob is uploaded into multiple 2Mb chunks; each chunk being a resumable upload). For this last case, this PR also adds a test testSnapshotWithLargeSegmentFiles which makes it more probable that a chunked resumable upload is executed.	2019-09-18 09:33:05 +02:00
Armin Braun	371c355bca	Retry GCS Resumable Upload on Error 410 (#45963 ) (#46783 ) A resumable upload session can fail on with a 410 error and should be retried in that case. I added retrying twice using resetting of the given `InputStream` as the retry mechanism since the same approach is used by the AWS S3 SDK already as well and relied upon by the S3 repository implementation. Related GCS documentation: https://cloud.google.com/storage/docs/json_api/v1/status-codes#410_Gone	2019-09-17 19:06:43 +02:00
Armin Braun	b00de8edf3	Ensure SAS Tokens in Test Use Minimal Permissions (#46112 ) (#46628 ) There were some issues with the Azure implementation requiring permissions to list all containers ue to a container exists check. This was caught in CI this time, but going forward we should ensure that CI is executed using a token that does not allow listing containers. Relates #43288	2019-09-17 15:40:11 +02:00
David Turner	65dc888623	Resume partial download from S3 on connection drop (#46589 ) Today if the connection to S3 times out or drops after starting to download an object then the SDK does not attempt to recover or resume the download, causing the restore of the whole shard to fail and retry. This commit allows Elasticsearch to detect such a mid-stream failure and to resume the download from where it failed.	2019-09-17 13:11:36 +01:00
Luca Cavanna	e57756492a	Update http-core and http-client dependencies (#46549 ) Relates to #45808 Closes #45577	2019-09-12 09:45:29 +02:00
Mark Vieira	ccf656a9d0	Repository plugin test cacheability fixes (#46572 )	2019-09-11 08:24:55 -07:00
Tanguy Leroux	88bed09119	Mutualize code in cloud-based repository integration tests (#46483 ) This commit factors out some common code between the cloud-based repository integration tests that were recently improved. Relates #46376	2019-09-09 16:02:14 +02:00
Tanguy Leroux	023cf44025	Inject random server errors in AzureBlobStoreRepositoryTests (#46371 ) This commit modifies the HTTP server used in AzureBlobStoreRepositoryTests so that it randomly returns server errors for any type of request executed by the Azure client.	2019-09-09 10:00:09 +02:00
Tanguy Leroux	8e3dc68454	Inject random server errors in GoogleCloudStorageBlobStoreRepositoryTests (#46376 ) This commit modifies the HTTP server used in GoogleCloudStorageBlobStoreRepositoryTests so that it randomly returns server errors. The test does not inject server errors for the following types of request: batch request, resumable upload request.	2019-09-09 09:59:59 +02:00
David Turner	cc092b1be1	Add support for OneZoneInfrequentAccess storage (#46436 ) The `repository-s3` plugin has supported a storage class of `onezone_ia` since the SDK upgrade in #30723, but we do not test or document this fact. This commit adds this storage class to the docs and adds a test to ensure that the documented storage classes are all accepted by S3 too. Fixes #30474	2019-09-09 07:54:44 +01:00
Tanguy Leroux	2290865559	Fix usage of randomIntBetween() in testWriteBlobWithRetries (#46380 ) This commit fixes the usage of randomIntBetween() in the test testWriteBlobWithRetries, when the test generates a random array of a single byte.	2019-09-06 09:10:38 +02:00
Tanguy Leroux	28974b5723	Replace mocked client in GCSBlobStoreRepositoryTests by HTTP server (#46255 ) This commit removes the usage of MockGoogleCloudStoragePlugin in GoogleCloudStorageBlobStoreRepositoryTests and replaces it by a HttpServer that emulates the Storage service. This allows the repository tests to use the real Google's client under the hood in tests and will allow us to test the behavior of the snapshot/restore feature for GCS repositories by simulating random server-side internal errors. The HTTP server used to emulate the Storage service is intentionally simple and minimal to keep things understandable and maintainable. Testing full client options on the server side (like authentication, chunked encoding etc) remains the responsibility of the GoogleCloudStorageFixture.	2019-09-05 10:37:37 +02:00
Tanguy Leroux	6d1a82134c	Add repository integration tests for Azure (#46263 ) Similarly to what had been done for S3 (#46081) and GCS (#46255) this commit adds repository integration tests for Azure, based on an internal HTTP server instead of mocks.	2019-09-05 09:26:42 +02:00
Tanguy Leroux	bd7a04cd55	Disable request throttling in S3BlobStoreRepositoryTests (#46226 ) When some high values are randomly picked up - for example the number of indices to snapshot or the number of snapshots to create - the tests in S3BlobStoreRepositoryTests can generate a high number of requests to the internal S3 server. In order to test the retry logic of the S3 client, the internal server is designed to randomly generate random server errors. When many requests are made, it is possible that the S3 client reaches its maximum number of successive retries capacity. Then the S3 client will stop retrying requests until enough retry attempts succeed, but it means that any request could fail before reaching the max retries count and make the test fail too. Closes #46217 Closes #46218 Closes #46219	2019-09-02 16:44:43 +02:00
Henning Andersen	d68e05aade	Mute 2 tests in S3BlobStoreRepositoryTests (#46221 ) Muted testSnapshotAndRestore and testMultipleSnapshotAndRollback Relates #46218 and #46219	2019-09-02 10:38:03 +02:00
Tanguy Leroux	0c1b263e8d	Inject random errors in S3BlobStoreRepositoryTests (#46125 ) This commit modifies the HTTP server used in S3BlobStoreRepositoryTests so that it randomly returns server errors for any type of request executed by the SDK client. It is now possible to verify that the repository tests are s uccessfully completed even if one or more errors were returned by the S3 service in response of a blob upload, a blob deletion or a object listing request etc. Because injecting errors forces the SDK client to retry requests, the test limits the maximum errors to send in response for each request at 3 retries.	2019-08-30 11:58:09 +02:00
Tanguy Leroux	b526309fbd	Replace MockAmazonS3 usage in S3BlobStoreRepositoryTests by a HTTP server (#46081 ) This commit removes the usage of MockAmazonS3 in S3BlobStoreRepositoryTests and replaces it by a HttpServer that emulates the S3 service. This allows the repository tests to use the real Amazon's S3 client under the hood in tests and will allow to test the behavior of the snapshot/restore feature for S3 repositories by simulating random server-side internal errors. The HTTP server used to emulate the S3 service is intentionally simple and minimal to keep things understandable and maintainable. Testing full client options on the server side (like authentication, chunked encoding etc) remains the responsibility of the AmazonS3Fixture.	2019-08-29 13:16:59 +02:00
Tanguy Leroux	9e14ffa8be	Few clean ups in ESBlobStoreRepositoryIntegTestCase (#46068 )	2019-08-28 16:29:46 +02:00
Jason Tedor	3d64605075	Remove node settings from blob store repositories (#45991 ) This commit starts from the simple premise that the use of node settings in blob store repositories is a mistake. Here we see that the node settings are used to get default settings for store and restore throttle rates. Yet, since there are not any node settings registered to this effect, there can never be a default setting to fall back to there, and so we always end up falling back to the default rate. Since this was the only use of node settings in blob store repository, we move them. From this, several places fall out where we were chaining settings through only to get them to the blob store repository, so we clean these up as well. That leaves us with the changeset in this commit.	2019-08-26 16:26:13 -04:00
Tanguy Leroux	a3d918bddb	Refactor RepositoryCredentialsTests (#45919 ) This commit refactors the S3 credentials tests in RepositoryCredentialsTests so that it now uses a single node (ESSingleNodeTestCase) to test how secure/insecure credentials are overriding each other. Using a single node makes it much easier to understand what each test is actually testing and IMO better reflect how things are initialized. It also allows to fold into this class the test testInsecureRepositoryCredentials which was wrongly located in S3BlobStoreRepositoryTests. By moving this test away, the S3BlobStoreRepositoryTests class does not need the allow_insecure_settings option anymore and thus can be executed as part of the usual gradle test task.	2019-08-26 15:14:43 +02:00
Tanguy Leroux	aee92d573c	Allow partial request body reads in AWS S3 retries tests (#45847 ) This commit changes the tests added in #45383 so that the fixture that emulates the S3 service now sometimes consumes all the request body before sending an error, sometimes consumes only a part of the request body and sometimes consumes nothing. The idea here is to beef up a bit the tests that writes blob because the client's retry logic relies on marking and resetting the blob's input stream. This pull request also changes the testWriteBlobWithRetries() so that it (rarely) tests with a large blob (up to 1mb), which is more than the client's default read limit on input streams (131Kb). Finally, it optimizes the ZeroInputStream so that it is a bit more effective (now works using an internal buffer and System.arraycopy() primitives).	2019-08-23 13:43:31 +02:00
Tanguy Leroux	57a36eb373	Add tests to check that requests are retried when writing/reading blobs on S3 (#45383 ) This commit adds tests to verify the behavior of the S3BlobContainer and its underlying AWS SDK client when the remote S3 service is responding errors or not responding at all. The expected behavior is that requests are retried multiple times before the client gives up and the S3BlobContainer bubbles up an exception. The test verifies the behavior of BlobContainer.writeBlob() and BlobContainer.readBlob(). In the case of S3 writing a blob can be executed as a single upload or using multipart requests; the test checks both scenario by writing a small then a large blob.	2019-08-22 11:41:40 +02:00
Armin Braun	6aaee8aa0a	Repository Cleanup Endpoint (#43900 ) (#45780 ) * Repository Cleanup Endpoint (#43900) * Snapshot cleanup functionality via transport/REST endpoint. * Added all the infrastructure for this with the HLRC and node client * Made use of it in tests and resolved relevant TODO * Added new `Custom` CS element that tracks the cleanup logic. Kept it similar to the delete and in progress classes and gave it some (for now) redundant way of handling multiple cleanups but only allow one * Use the exact same mechanism used by deletes to have the combination of CS entry and increment in repository state ID provide some concurrency safety (the initial approach of just an entry in the CS was not enough, we must increment the repository state ID to be safe against concurrent modifications, otherwise we run the risk of "cleaning up" blobs that just got created without noticing) * Isolated the logic to the transport action class as much as I could. It's not ideal, but we don't need to keep any state and do the same for other repository operations (like getting the detailed snapshot shard status)	2019-08-21 17:59:49 +02:00
Jim Ferenczi	fe2a7523ec	Add support for inlined user dictionary in the Kuromoji plugin (#45489 ) This change adds a new option called user_dictionary_rules to Kuromoji's tokenizer. It can be used to set additional tokenization rules to the Japanese tokenizer directly in the settings (instead of using a file). This commit also adds a check that no rules are duplicated since this is not allowed in the UserDictionary. Closes #25343	2019-08-21 16:28:30 +02:00
Igor Motov	1818c5fa44	Ingest Attachment: Upgrade tika to v1.22 (#45575 ) Upgrades: Apache Tika: 1.19.1 -> 1.22. pdfbox : 2.0.12 -> 2.0.16 poi : 4.0.0 -> 4.0.1	2019-08-19 18:17:16 -04:00
Luca Cavanna	c31cddf27e	Update the schema for the REST API specification (#42346 ) * Update the REST API specification This patch updates the REST API spefication in JSON files to better encode deprecated entities, to improve specification of URL paths, and to open up the schema for future extensions. Notably, it changes the `paths` from a list of strings to a list of objects, where each particular object encodes all the information for this particular path: the `parts` and the `methods`. Among the benefits of this approach is eg. encoding the difference between using the `PUT` and `POST` methods in the Index API, to either use a specific document ID, or let Elasticsearch generate one. Also `documentation` becomes an object that supports an `url` and also a `description` which is a new field. * Adapt YAML runner to new REST API specification format The logic for choosing the path to use when running tests has been simplified, as a consequence of the path parts being listed under each path in the spec. The special case for create and index has been removed. Also the parsing code has been hardened so that errors are thrown earlier when the structure of the spec differs from what expected, and their error messages should be more helpful.	2019-08-16 14:40:00 +02:00
Yogesh Gaikwad	471d940c44	Refactor cluster privileges and cluster permission (#45265 ) (#45442 ) The current implementations make it difficult for adding new privileges (example: a cluster privilege which is more than cluster action-based and not exposed to the security administrator). On the high level, we would like our cluster privilege either: - a named cluster privilege This corresponds to `cluster` field from the role descriptor - or a configurable cluster privilege This corresponds to the `global` field from the role-descriptor and allows a security administrator to configure them. Some of the responsibilities like the merging of action based cluster privileges are now pushed at cluster permission level. How to implement the predicate (using Automaton) is being now enforced by cluster permission. `ClusterPermission` helps in enforcing the cluster level access either by performing checks against cluster action and optionally against a request. It is a collection of one or more permission checks where if any of the checks allow access then the permission allows access to a cluster action. Implementations of cluster privilege must be able to provide information regarding the predicates to the cluster permission so that can be enforced. This is enforced by making implementations of cluster privilege aware of cluster permission builder and provide a way to specify how the permission is to be built for a given privilege. This commit renames `ConditionalClusterPrivilege` to `ConfigurableClusterPrivilege`. `ConfigurableClusterPrivilege` is a renderable cluster privilege exposed as a `global` field in role descriptor. Other than this there is a requirement where we would want to know if a cluster permission is implied by another cluster-permission (`has-privileges`). This is helpful in addressing queries related to privileges for a user. This is not just simply checking of cluster permissions since we do not have access to runtime information (like request object). This refactoring does not try to address those scenarios. Relates #44048	2019-08-13 09:06:18 +10:00
Armin Braun	a9e1402189	Remove Settings from BaseRestRequest Constructor (#45418 ) (#45429 ) * Resolving the todo, cleaning up the unused `settings` parameter * Cleaning up some other minor dead code in affected classes	2019-08-12 05:14:45 +02:00
Armin Braun	a501d68f23	Upgrade to Netty 4.1.38 (#45132 ) (#45364 ) * A number of fixes to buffer handling in the .37 and .38 -> we should stay up to date	2019-08-09 03:38:14 +02:00
Tim Brooks	af908efa41	Disable netty direct buffer pooling by default (#44837 ) Elasticsearch does not grant Netty reflection access to get Unsafe. The only mechanism that currently exists to free direct buffers in a timely manner is to use Unsafe. This leads to the occasional scenario, under heavy network load, that direct byte buffers can slowly build up without being freed. This commit disables Netty direct buffer pooling and moves to a strategy of using a single thread-local direct buffer for interfacing with sockets. This will reduce the memory usage from networking. Elasticsearch currently derives very little value from direct buffer usage (TLS, compression, Lucene, Elasticsearch handling, etc all use heap bytes). So this seems like the correct trade-off until that changes.	2019-08-08 15:10:31 -06:00
Armin Braun	5d7fafec14	Add Assertion to Ensure Retries in S3BlobContainer (#45224 ) (#45230 ) * We need a `markSupported` input stream to retry uploads * Relates #45153	2019-08-06 16:11:19 +02:00
Yannick Welsch	7aeb2fe73c	Add per-socket keepalive options (#44055 ) Uses JDK 11's per-socket configuration of TCP keepalive (supported on Linux and Mac), see https://bugs.openjdk.java.net/browse/JDK-8194298, and exposes these as transport settings. By default, these options are disabled for now (i.e. fall-back to OS behavior), but we would like to explore whether we can enable them by default, in particular to force keepalive configurations that are better tuned for running ES.	2019-08-06 10:45:44 +02:00
Tim Brooks	984ba82251	Move nio channel initialization to event loop (#45155 ) Currently in the transport-nio work we connect and bind channels on the a thread before the channel is registered with a selector. Additionally, it is at this point that we set all the socket options. This commit moves these operations onto the event-loop after the channel has been registered with a selector. It attempts to set the socket options for a non-server channel at registration time. If that fails, it will attempt to set the options after the channel is connected. This should fix #41071.	2019-08-02 17:31:31 -04:00
Armin Braun	9450505d5b	Stop Passing Around REST Request in Multiple Spots (#44949 ) (#45109 ) * Stop Passing Around REST Request in Multiple Spots * Motivated by #44564 * We are currently passing the REST request object around to a large number of places. This works fine since we simply copy the full request content before we handle the rest itself which is needlessly hard on GC and heap. * This PR removes a number of spots where the request is passed around needlessly. There are many more spots to optimize in follow-ups to this, but this one would already enable bypassing the request copying for some error paths in a follow up.	2019-08-02 07:31:38 +02:00
Tim Brooks	aff66e3ac5	Add Cors integration tests (#44361 ) This commit adds integration tests to ensure that the basic cors functionality works for the netty and nio transports.	2019-07-31 14:24:23 -06:00
Armin Braun	548c767b6b	S3 3rd Party Test Goal (#44799 ) (#45004 ) * Create S3 Third Party Test Task that Covers the S3 CLI Tool * Adjust snapshot cli test tool tests to work with real S3 * Build adjustment * Clean up repo path before testing * Dedup the logic for asserting path contents by using the correct utility method here that somehow became unused	2019-07-30 17:16:41 +02:00
Armin Braun	4495140d1f	Release Pooled Buffers Earlier for HTTP Requests (#44952 ) (#44991 ) * We should release the buffers right after copying and not only do so after we did all the request handling on the copy * Relates #44564	2019-07-30 10:30:01 +02:00
Ignacio Vera	821f6f893b	Upgrade to Lucene 8.2.0 release (#44859 ) (#44892 )	2019-07-26 08:14:59 +02:00
Ioannis Kakavas	3714cb63da	Allow parsing the value of java.version sysprop (#44017 ) We often start testing with early access versions of new Java versions and this have caused minor issues in our tests (i.e. #43141) because the version string that the JVM reports cannot be parsed as it ends with the string -ea. This commit changes how we parse and compare Java versions to allow correct parsing and comparison of the output of java.version system property that might include an additional alphanumeric part after the version numbers (see [JEP 223[(https://openjdk.java.net/jeps/223)). In short it handles a version number part, like before, but additionally a PRE part that matches ([a-zA-Z0-9]+). It also changes a number of tests that would attempt to parse java.specification.version in order to get the full version of Java. java.specification.version only contains the major version and is thus inappropriate when trying to compare against a version that might contain a minor, patch or an early access part. We know parse java.version that can be consistently parsed. Resolves #43141	2019-07-22 20:14:56 +03:00
Jason Tedor	d82a570a2a	Reomve debugging loging statements from Azure tests This commit removes some unneeded debugging logging statements from the Azure storage tests. Relates #44672	2019-07-22 16:55:55 +09:00
Jason Tedor	a493a34143	Use debug logging instead for Azure tests (#44672 ) These Azure tests have hard println statements which means we always see these messages during configuration. Yet, there are unnecessary most of the time. This commit changes them to use debug logging.	2019-07-22 16:46:13 +09:00
Armin Braun	07cf2cb665	Add disable_chunked_encoding Setting to S3 Repo (#44052 ) (#44562 ) * Add disable_chunked_encoding setting to S3 repo plugin to support S3 implementations that don't support chunked encoding	2019-07-18 16:57:56 +02:00
Yogesh Gaikwad	4c95cc3223	skip repository-hdfs integTest in case of fips jvm (#44319 ) The repository-hdfs runners need to be disabled it in fips mode. Testing done for all the tasks, dynamic created and static (integTest, integTestHa, integSecureTest, integSecureHaTest)	2019-07-18 21:10:53 +10:00
maarab7	1375cc93a8	Fix parameter value for calling data.advanceExact (#44205 ) While the code works perfectly well for a single segment, it returns the wrong values for multiple segments. E.g. If we have 500 docs in one segment and if we want to get the doc id = 280 then data.advanceExact(topDocs.scoreDocs[i].doc) works fine. If we have two segments, say, with first segment having docs 1-200 and the second segment having docs 201-500, then 280 is fetched from the second segment but is actually 480. Subtracting the docBase (280-200) takes us to the correct document which is 80 in the second segment and actually 280.	2019-07-18 10:55:10 +02:00
Jason Tedor	39c5f98de7	Introduce test issue logging (#44477 ) Today we have an annotation for controlling logging levels in tests. This annotation serves two purposes, one is to control the logging level used in tests, when such control is needed to impact and assert the behavior of loggers in tests. The other use is when a test is failing and additional logging is needed. This commit separates these two concerns into separate annotations. The primary motivation for this is that we have a history of leaving behind the annotation for the purpose of investigating test failures long after the test failure is resolved. The accumulation of these stale logging annotations has led to excessive disk consumption. Having recently cleaned this up, we would like to avoid falling into this state again. To do this, we are adding a link to the test failure under investigation to the annotation when used for the purpose of investigating test failures. We will add tooling to inspect these annotations, in the same way that we have tooling on awaits fix annotations. This will enable us to report on the use of these annotations, and report when stale uses of the annotation exist.	2019-07-18 05:33:33 +09:00
Armin Braun	65fcaecce1	Remove Minio Host Hack in S3 Repository Build (#44491 ) (#44497 ) * Resolving the todo to clean this hackyness up	2019-07-17 19:59:00 +02:00
Ignacio Vera	eb348d2593	Upgrade to lucene-8.2.0-snapshot-6413aae226 (#44480 )	2019-07-17 13:28:28 +02:00
Armin Braun	c8db0e9b7e	Remove blobExists Method from BlobContainer (#44472 ) (#44475 ) * We only use this method in one place in production code and can replace that with a read -> remove it to simplify the interface * Keep it as an implementation detail in the Azure repository	2019-07-17 11:56:02 +02:00
Tim Brooks	6b1a769638	Move CORS Config into :server package (#43779 ) This commit moves the config that stores Cors options into the server package. Currently both nio and netty modules must have a copy of this config. Moving it into server allows one copy and the tests to be in a common location.	2019-07-16 17:50:42 -06:00
Tim Brooks	0a352486e8	Isolate nio channel registered from channel active (#44388 ) Registering a channel with a selector is a required operation for the channel to be handled properly. Currently, we mix the registeration with other setup operations (ip filtering, SSL initiation, etc). However, a fail to register is fatal. This PR modifies how registeration occurs to immediately close the channel if it fails. There are still two clear loopholes for how a user can interact with a channel even if registration fails. 1. through the exception handler. 2. through the channel accepted callback. These can perhaps be improved in the future. For now, this PR prevents writes from proceeding if the channel is not registered.	2019-07-16 17:18:57 -06:00
Armin Braun	940aa71930	Cleanup S3 BlobContainer Listing Logic (#43088 ) (#44406 ) * Cleanup duplication in creating and looping over IO Requests	2019-07-16 12:19:20 +02:00
Ryan Ernst	7e06888bae	Convert testclusters to use distro download plugin (#44253 ) (#44362 ) Test clusters currently has its own set of logic for dealing with finding different versions of Elasticsearch, downloading them, and extracting them. This commit converts testclusters to use the DistributionDownloadPlugin.	2019-07-15 17:53:05 -07:00
Yogesh Gaikwad	b40b6dd542	Disable repository-hdfs tests in FIPS jvm (#44283 ) Due to https://github.com/elastic/elasticsearch/issues/40079, we need to disable repository-hdfs tests in FIPS jvm.	2019-07-13 20:11:32 +10:00
Yogesh Gaikwad	91c342a888	fix and enable repository-hdfs secure tests (#44044 ) (#44199 ) Due to recent changes are done for converting `repository-hdfs` to test clusters (#41252), the `integTestSecure*` tasks did not depend on `secureHdfsFixture` which when running would fail as the fixture would not be available. This commit adds the dependency of the fixture to the task. The `secureHdfsFixture` is a `AntFixture` which is spawned a process. Internally it waits for 30 seconds for the resources to be made available. For my local machine, it took almost 45 seconds to be available so I have added the wait time as an input to the `AntFixture` defaults to 30 seconds and set it to 60 seconds in case of secure hdfs fixture. The integ test for secure hdfs was disabled for a long time and so the changes done in #42090 to fix the tests are also done in this commit.	2019-07-12 12:44:01 +10:00
Nick Knize	374030a53f	Upgrade to lucene-8.2.0-snapshot-860e0be5378 (#44171 ) (#44184 ) Upgrades lucene library to lucene-8.2.0-snapshot-860e0be5378	2019-07-11 09:17:22 -05:00
Yannick Welsch	2ee07f1ff4	Simplify port usage in transport tests (#44157 ) Simplifies AbstractSimpleTransportTestCase to use JVM-local ports and also adds an assertion so that cases like #44134 can be more easily debugged. The likely reason for that one is that a test, which was repeated again and again while always spawning a fresh Gradle worker (due to Gradle daemon) kept increasing Gradle worker IDs, causing an overflow at some point.	2019-07-11 13:35:37 +02:00
Mark Vieira	7c2e4b2857	[Backport] Enable caching of rest tests which use integ-test distribution (#44181 )	2019-07-10 15:42:28 -07:00
Alpar Torok	bde5802ad6	Test fixtures improovements (#43956 ) * Test fixtures improovements Don't disable some of the precommit tasks on fixtures. This no longer makes sense now that a project can both produce and use a fixture. In order for this to be possible, had to add an additional configuration to make JarHell class accessible to the task even if it's not a dependency of the project and fix some of the third party audit fallout from #43671 which wasn't detected at the time due to the issue being fixed here. Closes #43918	2019-07-10 21:21:06 +03:00
Alpar Torok	0c8294e633	Make sure the clean task doesn't break test fixtures (#43641 ) Use a dedicated fixture dir.	2019-07-08 17:58:27 +03:00
Armin Braun	af9b98e81c	Recursively Delete Unreferenced Index Directories (#42189 ) (#44051 ) * Use ability to list child "folders" in the blob store to implement recursive delete on all stale index folders when cleaning up instead of using the diff between two `RepositoryData` instances to cover aborted deletes * Runs after ever delete operation * Relates #13159 (fixing most of this issues caused by unreferenced indices, leaving some meta files to be cleaned up only)	2019-07-08 10:55:39 +02:00
Armin Braun	2176d09c37	Provide an Option to Use Path-Style-Access with S3 Repo (#41966 ) (#44046 ) * Provide an Option to Use Path-Style-Access with S3 Repo * As discussed, added the option to use path style access back again and deprecated it. * Defaulted to `false` * Added warning to docs * Closes #41816	2019-07-08 08:10:01 +02:00
Alan Woodward	4b99255fed	Add name() method to TokenizerFactory (#43909 ) This brings TokenizerFactory into line with CharFilterFactory and TokenFilterFactory, and removes the need to pass around tokenizer names when building custom analyzers. As this means that TokenizerFactory is no longer a functional interface, the commit also adds a factory method to TokenizerFactory to make construction simpler.	2019-07-04 11:28:55 +01:00
Armin Braun	be20fb80e4	Recursive Delete on BlobContainer (#43281 ) (#43920 ) This is a prerequisite of #42189: * Add directory delete method to blob container specific to each implementation: * Some notes on the implementations: * AWS + GCS: We can simply exploit the fact that both AWS and GCS return blobs lexicographically ordered which allows us to simply delete in the same order that we receive the blobs from the listing request. For AWS this simply required listing without the delimiter setting (so we get a deep listing) and for GCS the same behavior is achieved by not using the directory mode on the listing invocation. The nice thing about this is, that even for very large numbers of blobs the memory requirements are now capped nicely since we go page by page when deleting. * For Azure I extended the parallelization to the listing calls as well and made it work recursively. I verified that this works with thread count `1` since we only block once in the initial thread and then fan out to a "graph" of child listeners that never block. * HDFS and FS are trivial since we have directory delete methods available for them * Enhances third party tests to ensure the new functionality works (I manually ran them for all cloud providers)	2019-07-03 17:14:57 +02:00
Armin Braun	3317169c4f	Fix GCS Blob Repository 3rd Party Tests (#43030 ) (#43913 ) * We have to strip the trailing slash from child names here like we do for AWS * closes #43029	2019-07-03 15:09:28 +02:00
Armin Braun	455b12a4fb	Add Ability to List Child Containers to BlobContainer (#42653 ) (#43903 ) * Add Ability to List Child Containers to BlobContainer (#42653) * Add Ability to List Child Containers to BlobContainer * This is a prerequisite of #42189	2019-07-03 11:30:49 +02:00
Armin Braun	826f38cd70	Enable Parallel Deletes in Azure Repository (#42783 ) (#43886 ) * Parallel deletes via private thread pool	2019-07-03 09:28:39 +02:00
Armin Braun	cd4f81e15e	Remove Unused AWS KMS Dependency (#43671 ) (#43679 ) * We don't make use of KMS at the moment, no need to have this dependency here	2019-06-27 16:51:11 +02:00
Armin Braun	83067968ca	Add SAS Token Authentication Support to Azure Repo Plugin (#42982 ) (#43618 ) * Added setting for SAS token * Added support for the token in tests * Relates #42117	2019-06-26 13:43:32 +02:00
Jack Conradson	5eb044e635	Add annotations to Painless whitelist (#43239 ) This change adds the ability to attach annotative information for classes, methods, fields, static methods, class bindings, and instance bindings during Painless whitelisting. Annotations are specified as @annotation or optionally as @annotation[parameter="argument",...]. Annotations open up the ability to specify whitelist objects as having a short name (no_import -> @no_import) or deprecated.	2019-06-24 09:27:24 -07:00
Tim Brooks	38516a4dd5	Move nio ip filter rule to be a channel handler (#43507 ) Currently nio implements ip filtering at the channel context level. This is kind of a hack as the application logic should be implemented at the handler level. This commit moves the ip filtering into a channel handler. This requires adding an indicator to the channel handler to show when a channel should be closed.	2019-06-24 10:03:24 -06:00
Armin Braun	6e7a0e1b2a	Remove Azure Container Exists Check (#43288 ) (#43433 ) * This check is redundant, if the container doesn't exist subsequent operations will fail anyway. Since we are not running this exists check during verification I don't think there's much point to having it in snapshot initialization. * This PR is mainly motivated by the fact that this forces more permissions to be available in shared environments	2019-06-20 20:23:51 +02:00
Armin Braun	b7322812e0	Upgrade AWS SDK to Latest Version (#42708 ) (#43422 ) * Just staying up to data on the SDK version * Use `AbstractAmazonEC2` to shorten code	2019-06-20 16:43:42 +02:00
Armin Braun	21e74dd7d2	Upgrade GCS Repository Dependencies (#43142 ) (#43418 ) * Upgrade to latest GCS SDK and transitive dependencies (I chose the later version here on conflict) * Remove now unnecessary hack for custom endpoints (the linked bugs were both resolved in the SDK)	2019-06-20 16:35:54 +02:00
Armin Braun	667bdcd3ce	Ensure Test Cluster in Azure Discovery Tests Closes (#43057 ) (#43372 ) * Use `internalCluster().close()` to force all nodes (and not just the datanodes) to shut down even if one fails to shut down in time * Force closing httpServer to get cleaner logs if nodes still hang on shut down * Relates #43048	2019-06-19 18:03:39 +02:00
Martijn Laarman	8b1b9f8ab9	Introduce stability description to the REST API specification (#38413 ) (#43278 ) * introduce state to the REST API specification * change state over to stability * CCR is no GA updated to stable * SQL is now GA so marked as stable * Introduce `internal` as state for API's, marks stable in terms of lifetime but unstable in terms of guarantees on its output format since it exposes internal representations * make setting a wrong stability value, or not setting it at all an error that causes the YAML test suite to fail * update spec files to be explicit about their stability state * Document the fact that stability needs to be defined Otherwise the YAML test runner will fail (with a nice exception message) * address check style violations * update rest spec unit tests to include stability * found one more test spec file not declaring stability, made sure stability appears after documentation everywhere * cluster.state is stable, mark response in some way to denote its a key value format that can be changed during minors * mark data frame API's as beta * remove internal and private as states for an API * removed the wrong enum values in the Stability Enum in the previous commit (cherry picked from commit 61c34bbd92f8f7e5f22fa411c6b682b0ebd8a99d)	2019-06-17 16:57:13 +02:00
Yogesh Gaikwad	4ae1e30a98	Enable krb5kdc-fixture, kerberos tests mount urandom for kdc container (#41710 ) (#43178 ) Infra has fixed #10462 by installing `haveged` on CI workers. This commit enables the disabled fixture and tests, and mounts `/dev/urandom` for the container so there is enough entropy required for kdc. Note: hdfs-repository tests have been disabled, will raise a separate issue for it. Closes #40624 Closes #40678	2019-06-13 13:02:16 +10:00
Simon Willnauer	9d2adfb41e	Remove usage of FileSwitchDirectory (#42937 ) We are still using `FileSwitchDirectory` in the case a user configures file based pre-load of mmaps. This is trappy for multiple reasons if the both directories used by `FileSwitchDirectory` point to the same filesystem directory. One issue is LUCENE-8835 that cause issues like #37111 - unless LUCENE-8835 isn't fixed we should not use it in elasticsearch. Instead we use a similar trick as we use for HybridFS and subclass mmap directory directly.	2019-06-12 19:35:27 +02:00
Christoph Büscher	7f690e8606	Fix suggestions for empty indices (#42927 ) Currently suggesters return null values on empty shards. Usually this gets replaced by results from other non-epmty shards, but if the index is completely epmty (e.g. after creation) the search responses "suggest" is also "null" and we don't render a corresponding output in the REST response. This is an irritating edge case that requires special handling on the user side (see #42473) and should be fixed. This change makes sure every suggester type (completion, terms, phrase) returns at least an empty skeleton suggestion output, even for empty shards. This way, even if we don't find any suggestions anywhere, we still return and output the empty suggestion. Closes #42473	2019-06-12 15:42:23 +02:00
Yannick Welsch	e5a4a2272b	Wipe repositories more often (#42511 ) Fixes an issue where repositories are unintentionally shared among tests (given that the repo contents is captured in a static variable on the test class, to allow "sharing" among nodes) and two tests randomly chose the same snapshot name, leading to a conflict. Closes #42519	2019-06-12 11:58:38 +02:00
Martijn Laarman	cb7ce865b7	remove path from rest-api-spec (#41452 ) (#43084 ) (cherry picked from commit f5fde1d0843d2f0f53d3b9a15b9cfc8b94471ab7)	2019-06-11 12:52:36 +02:00
Henning Andersen	014bad1f61	Mute AzureDiscoveryClusterFormationTests (#43049 ) Relates #43048	2019-06-10 17:39:42 +02:00
Mayya Sharipova	81a3b6e2fe	Improve documentation for smart_cn analyzer (#42822 )	2019-06-10 08:59:30 -04:00
Alpar Torok	9def454ea9	Clean up configuration when docker isn't available (#42745 ) We initially added `requireDocker` for a way for tasks to say that they absolutely must have it, like the build docker image tasks. Projects using the test fixtures plugin are not in this both, as the intent with these is that they will be skipped if docker and docker-compose is not available. Before this change we were lenient, the docker image build would succeed but produce nothing. The implementation was also confusing as it was not immediately obvious this was the case due to all the indirection in the code. The reason we have this leniency is that when we added the docker image build, docker was a fairly new requirement for us, and we didn't have it deployed in CI widely enough nor had CI configured to prefer workers with docker when possible. We are in a much better position now. The other reason was other stack teams running `./gradlew assemble` in their respective CI and the possibility of breaking them if docker is not installed. We have been advocating for building specific distros for some time now and I will also send out an additional notice The PR also removes the use of `requireDocker` from tests that actually use test fixtures and are ok without it, and fixes a bug in test fixtures that would cause incorrect configuration and allow some tasks to run when docker was not available and they shouldn't have. Closes #42680 and #42829 see also #42719	2019-06-10 13:44:15 +03:00
Jason Tedor	aad1b3a2a0	Fix version parsing in various tests (#42871 ) This commit fixes the version parsing in various tests. The issue here is that the parsing was relying on java.version. However, java.version can contain additional characters such as -ea for early access builds. See JEP 233: Name Syntax ------------------------------ -------------- java.version $VNUM(\-$PRE)? java.runtime.version $VSTR java.vm.version $VSTR java.specification.version $VNUM java.vm.specification.version $VNUM Instead, we want java.specification.version.	2019-06-04 18:22:20 -04:00
Mark Vieira	e44b8b1e2e	[Backport] Remove dependency substitutions 7.x (#42866 ) * Remove unnecessary usage of Gradle dependency substitution rules (#42773) (cherry picked from commit 12d583dbf6f7d44f00aa365e34fc7e937c3c61f7)	2019-06-04 13:50:23 -07:00
Christoph Büscher	bde0137e07	Fix compile issue of earlier commit on 7.x	2019-06-03 13:36:20 +02:00
Christoph Büscher	9a9ee9abed	[Docs] Add example to reimplement stempel analyzer (#42676 ) Adding an example of how to re-implement the polish stempel analyzer in case a user want to modify or extend it. In order for the analyzer to be able to use polish stopwords, also registering a polish_stop filter for the stempel plugin. Closes #13150	2019-06-03 13:22:44 +02:00
Jason Tedor	371cb9a8ce	Remove Log4j 1.2 API as a dependency (#42702 ) We had this as a dependency for legacy dependencies that still needed the Log4j 1.2 API. This appears to no longer be necessary, so this commit removes this artifact as a dependency. To remove this dependency, we had to fix a few places where we were accidentally relying on Log4j 1.2 instead of Log4j 2 (easy to do, since both APIs were on the compile-time classpath). Finally, we can remove our custom Netty logger factory. This was needed when we were on Log4j 1.2 and handled logging in our own unique way. When we migrated to Log4j 2 we could have dropped this dependency. However, even then Netty would still pick up Log4j 1.2 since it was on the classpath, thus the advantage to removing this as a dependency now.	2019-05-30 16:08:07 -04:00
Mark Vieira	c1816354ed	[Backport] Improve build configuration time (#42674 )	2019-05-30 10:29:42 -07:00
Armin Braun	116b050cc6	Cleanup Bulk Delete Exception Logging (#41693 ) (#42606 ) * Cleanup Bulk Delete Exception Logging * Follow up to #41368 * Collect all failed blob deletes and add them to the exception message * Remove logging of blob name list from caller exception logging	2019-05-28 11:00:28 +02:00
Armin Braun	44bf784fe1	Add Infrastructure to Run 3rd Party Repository Tests (#42586 ) (#42604 ) * Add Infrastructure to Run 3rd Party Repository Tests * Add infrastructure to run third party repository tests using our standard JUnit infrastructure * This is a prerequisite of #42189	2019-05-28 10:46:22 +02:00
Armin Braun	c079fb61bf	Remove Dead Code from Azure Repo Plugin (#42178 ) (#42569 ) * None of this stuff is used	2019-05-28 08:00:02 +02:00
Armin Braun	c4f44024af	Remove Delete Method from BlobStore (#41619 ) (#42574 ) * Remove Delete Method from BlobStore (#41619) * The delete method on the blob store was used almost nowhere and just duplicates the delete method on the blob containers * The fact that it provided for some recursive delete logic (that did not behave the same way on all implementations) was not used and not properly tested either	2019-05-27 12:24:20 +02:00
Armin Braun	d2cd36bd9f	Upgrade to Netty 4.1.36 (#42543 ) (#42564 )	2019-05-27 10:38:03 +02:00
Mark Vieira	24cf86a013	Ignore JAR manifests when snapshotting runtime classpaths (#42548 ) (cherry picked from commit d5281fc96f6fb2f022c87699bdad64d88614e04c)	2019-05-24 18:28:47 -07:00
Simon Willnauer	46ccfba808	Remove IndexStore and DirectoryService (#42446 ) Both of these classes are basically a bloated wrapper around a simple construct that can simply be a DirectoryFactory interface. This change removes both classes and replaces them with a simple stateless interface that creates a new `Directory` per shard. The concept of `index.store` is preserved since it makes sense from a configuration perspective.	2019-05-24 12:14:56 +02:00
Jim Ferenczi	b88e80ab89	Upgrade to Lucene 8.1.0 (#42214 ) This commit upgrades to the GA release of Lucene 8.1.0	2019-05-23 11:46:45 +02:00
Jim Ferenczi	4ca5649a0d	Upgrade to lucene 8.1.0-snapshot-e460356abe (#40952 )	2019-05-23 11:45:33 +02:00
Jay Modi	dbbdcea128	Update ciphers for TLSv1.3 and JDK11 if available (#42082 ) This commit updates the default ciphers and TLS protocols that are used when the runtime JDK supports them. New cipher support has been introduced in JDK 11 and 12 along with performance fixes for AES GCM. The ciphers are ordered with PFS ciphers being most preferred, then AEAD ciphers, and finally those with mainstream hardware support. When available stronger encryption is preferred for a given cipher. This is a backport of #41385 and #41808. There are known JDK bugs with TLSv1.3 that have been fixed in various versions. These are: 1. The JDK's bundled HttpsServer will endless loop under JDK11 and JDK 12.0 (Fixed in 12.0.1) based on the way the Apache HttpClient performs a close (half close). 2. In all versions of JDK 11 and 12, the HttpsServer will endless loop when certificates are not trusted or another handshake error occurs. An email has been sent to the openjdk security-dev list and #38646 is open to track this. 3. In JDK 11.0.2 and prior there is a race condition with session resumption that leads to handshake errors when multiple concurrent handshakes are going on between the same client and server. This bug does not appear when client authentication is in use. This is JDK-8213202, which was fixed in 11.0.3 and 12.0. 4. In JDK 11.0.2 and prior there is a bug where resumed TLS sessions do not retain peer certificate information. This is JDK-8212885. The way these issues are addressed is that the current java version is checked and used to determine the supported protocols for tests that provoke these issues.	2019-05-20 09:45:36 -04:00
Alan Woodward	44c3418531	Simplify handling of keyword field normalizers (#42002 ) We have a number of places in analysis-handling code where we check if a field type is a keyword field, and if so then extract the normalizer rather than pulling the index-time analyzer. However, a keyword normalizer is really just a special case of an analyzer, so we should be able to simplify this by setting the normalizer as the index-time analyzer at construction time.	2019-05-10 14:38:46 +01:00
Alpar Torok	711ace0533	Testclusters: support for security and convert example plugins (#41864 ) testclusters detect from settings that security is enabled if a user is not specified using the DSL introduced in this PR, a default one is created the appropriate wait conditions are used authenticating with the first user defined in the DSL ( or the default user ). an example DSL to create a user is user username:"test_user" password:"x-pack-test-password" role: "superuser" all keys are optional and default to the values shown in this example	2019-05-08 14:04:00 +03:00
Yannick Welsch	5b71baa100	Upgrade SDK and test discovery-ec2 credential providers (#41732 ) Upgrades the AWS SDK to the same version that we're using for the repository-s3 plugin, providing testing capabilities to override certain SDK endpoints in order to point them to localhost for testing. Adds tests for the various credential providers.	2019-05-08 09:38:36 +02:00
David Turner	4c909e93bb	Reject port ranges in `discovery.seed_hosts` (#41905 ) Today Elasticsearch accepts, but silently ignores, port ranges in the `discovery.seed_hosts` setting: ``` discovery.seed_hosts: 10.1.2.3:9300-9400 ``` Silently ignoring part of a setting like this is trappy. With this change we reject seed host addresses of this form. Closes #40786 Backport of #41404	2019-05-08 08:34:32 +01:00
Alan Woodward	4cca1e8fff	Correct spelling of MockLogAppender.PatternSeenEventExpectation (#41893 ) The class was called PatternSeenEventExcpectation. This commit is a straight class rename to correct the spelling.	2019-05-07 17:28:51 +01:00
Tim Brooks	927013426a	Read multiple TLS packets in one read call (#41820 ) This is related to #27260. Currently we have a single read buffer that is no larger than a single TLS packet. This prevents us from reading multiple TLS packets in a single socket read call. This commit modifies our TLS work to support reading similar to the plaintext case. The data will be copied to a (potentially) recycled TLS packet-sized buffer for interaction with the SSLEngine.	2019-05-06 09:51:32 -06:00
Tim Brooks	24484ae227	Fix http read timeout test by releasing response (#41801 ) This fixes #41794. Currently the read timeout test queues up responses in the netty pipeline. These responses are immediately returned in the write call, but they are not released. This commit releases the responses. This will cause the leak detector to quit throwing exceptions.	2019-05-03 16:18:26 -06:00
Tim Brooks	b4bcbf9f64	Support http read timeouts for transport-nio (#41466 ) This is related to #27260. Currently there is a setting http.read_timeout that allows users to define a read timeout for the http transport. This commit implements support for this functionality with the transport-nio plugin. The behavior here is that a repeating task will be scheduled for the interval defined. If there have been no requests received since the last run and there are no inflight requests, the channel will be closed.	2019-05-02 09:48:52 -06:00
Armin Braun	7cc4b9a8b3	Implement Bulk Deletes for GCS Repository (#41368 ) (#41681 ) * Implement Bulk Deletes for GCS Repository (#41368) * Just like #40322 for AWS * We already had a bulk delete API but weren't using it from the blob container implementation, now we are using it * Made the bulk delete API also compliant with our interface that only suppresses errors about non existent blobs by stating failed deletes (I didn't use any bulk stat action here since having to stat here should be the exception anyway and it would make error handling a lot more complex) * Fixed bulk delete API to limit its batch size to 100 in line with GCS recommendations back port of #41368	2019-04-30 17:03:57 +02:00
Armin Braun	08c0ecb90e	Upgrade to Netty 4.1.35 (#41499 ) (#41651 ) * Some fixes and possible performance fixes in the last 3 versions -> upgrading	2019-04-30 09:27:51 +02:00
Tim Brooks	df3ef66294	Remove dedicated SSL network write buffer (#41654 ) This is related to #27260. Currently for the SSLDriver we allocate a dedicated network write buffer and encrypt the data into that buffer one buffer at a time. This requires constantly switching between encrypting and flushing. This commit adds a dedicated outbound buffer for SSL operations that will internally allocate new packet sized buffers as they are need (for writing encrypted data). This allows us to totally encrypt an operation before writing it to the network. Eventually it can be hooked up to buffer recycling. This commit also backports the following commit: Handle WRAP ops during SSL read It is possible that a WRAP operation can occur while decrypting handshake data in TLS 1.3. The SSLDriver does not currently handle this well as it does not have access to the outbound buffer during read call. This commit moves the buffer into the Driver to fix this issue. Data wrapped during a read call will be queued for writing after the read call is complete.	2019-04-29 17:59:13 -06:00
Alpar Torok	335f2bf102	Testclsuters: convert plugins qa projects (#41496 ) Add testclusters support for files in keystore and convert qa subprojects within plugins.	2019-04-26 08:57:52 -07:00
Armin Braun	aad33121d8	Async Snapshot Repository Deletes (#40144 ) (#41571 ) Motivated by slow snapshot deletes reported in e.g. #39656 and the fact that these likely are a contributing factor to repositories accumulating stale files over time when deletes fail to finish in time and are interrupted before they can complete. * Makes snapshot deletion async and parallelizes some steps of the delete process that can be safely run concurrently via the snapshot thread poll * I did not take the biggest potential speedup step here and parallelize the shard file deletion because that's probably better handled by moving to bulk deletes where possible (and can still be parallelized via the snapshot pool where it isn't). Also, I wanted to keep the size of the PR manageable. * See https://github.com/elastic/elasticsearch/pull/39656#issuecomment-470492106 * Also, as a side effect this gives the `SnapshotResiliencyTests` a little more coverage for master failover scenarios (since parallel access to a blob store repository during deletes is now possible since a delete isn't a single task anymore). * By adding a `ThreadPool` reference to the repository this also lays the groundwork to parallelizing shard snapshot uploads to improve the situation reported in #39657	2019-04-26 15:36:09 +02:00
Tim Brooks	1f8ff052a1	Revert "Remove dedicated SSL network write buffer (#41283 )" This reverts commit `f65a86c258`.	2019-04-25 18:39:25 -06:00
Tim Brooks	f65a86c258	Remove dedicated SSL network write buffer (#41283 ) This is related to #27260. Currently for the SSLDriver we allocate a dedicated network write buffer and encrypt the data into that buffer one buffer at a time. This requires constantly switching between encrypting and flushing. This commit adds a dedicated outbound buffer for SSL operations that will internally allocate new packet sized buffers as they are need (for writing encrypted data). This allows us to totally encrypt an operation before writing it to the network. Eventually it can be hooked up to buffer recycling.	2019-04-25 14:30:54 -06:00
Armin Braun	23b3741618	Remove Exists Check from S3 Repository Deletes (#40931 ) (#41534 ) * The check doesn't add much if anything practically, since the S3 repository is eventually consistent and we only log the non-existence of a blob anyway * We don't do the check on writes for this very reason and documented it as such * Removing the check saves one API call per single delete speeding up the deletion process and lowering costs	2019-04-25 18:25:03 +02:00
Armin Braun	2e4ac178e2	Fix Repository Base Path Matching in Azure ITs (#41457 ) (#41469 ) * Added quotes so that "regexy" base paths like `7.0` that we use on CI don't break matching * closes #41405	2019-04-24 12:24:05 +02:00
clement-tourriere	c80f86e3e4	Add ignore_above in ICUCollationKeywordFieldMapper (#40414 ) Add the possibility to use ignore_above parameter in ICUCollationKeywordFieldMapper. Close #40413	2019-04-19 14:19:35 -07:00
Alpar Torok	4ef4ed66b9	Convert repository-hdfs to testclusters (#41252 ) * Convert repository-hdfs to testclusters Relates #40862	2019-04-19 09:47:14 +03:00
Armin Braun	c4e84e2b34	Add Bulk Delete Api to BlobStore (#40322 ) (#41253 ) * Adds Bulk delete API to blob container * Implement bulk delete API for S3 * Adjust S3Fixture to accept both path styles for bulk deletes since the S3 SDK uses both during our ITs * Closes #40250	2019-04-16 17:19:05 +02:00
Mark Vieira	1287c7d91f	[Backport] Replace usages RandomizedTestingTask with built-in Gradle Test (#40978 ) (#40993 ) * Replace usages RandomizedTestingTask with built-in Gradle Test (#40978) This commit replaces the existing RandomizedTestingTask and supporting code with Gradle's built-in JUnit support via the Test task type. Additionally, the previous workaround to disable all tasks named "test" and create new unit testing tasks named "unitTest" has been removed such that the "test" task now runs unit tests as per the normal Gradle Java plugin conventions. (cherry picked from commit 323f312bbc829a63056a79ebe45adced5099f6e6) * Fix forking JVM runner * Don't bump shadow plugin version	2019-04-09 11:52:50 -07:00
Jay Modi	f34663282c	Update apache httpclient to version 4.5.8 (#40875 ) This change updates our version of httpclient to version 4.5.8, which contains the fix for HTTPCLIENT-1968, which is a bug where the client started re-writing paths that contained encoded reserved characters with their unreserved form.	2019-04-05 13:48:10 -06:00
Martijn van Groningen	e5cec87697	Remove -Xlint exclusions in all plugins. (#40721 ) The xlint exclusions of the following plugins were removed: * ingest-attachment. * mapper-size. * transport-nio. Removing the -try exclusion required some work, because the NettyAdaptor implements AutoCloseable and NettyAdaptor#close() method could throw an InterruptedException (ChannelFuture#await() and a generic Exception is re-thrown, which maybe an ChannelFuture). The easiest way around this to me seemed that NettyAdaptor should not implement AutoCloseable, because it is not directly used in a try-with-resources statement. Relates to #40366	2019-04-04 08:30:34 +02:00
Christoph Büscher	09ba3ec677	Small refactorings to analysis components (#40745 ) This change adds the following internal refactorings: * wraps input analyzers into an unmodifiable map in IndexAnalyzers ctor * removes duplicated indexSetting in IndexAnalyzers * removes references to IndexAnalyzers from DocumentMapperParser and TypeParser.ParserContext. It can always be retrieve it from MapperService directly in those cases	2019-04-03 14:22:16 +02:00
Alpar Torok	293297ae3d	Fix repository-hdfs when no docker and unnecesary fixture The hdfs-fixture is actually executed in plugin/repository-hdfs as a dependency. The fixture is not needed and actually causes a failure because we have two copies now and both use the same ports.	2019-03-29 16:55:12 +02:00
Alpar Torok	35d96c22c0	Fix 3rd pary S3 tests (#40588 ) * Fix 3rd pary S3 tests This is allready excluded on line 186, by doing this again here, the other exclusion from arround that line are removed causing the tests to fail. * Fix blacklisting with the fixture	2019-03-29 08:04:16 +02:00
Alpar Torok	d791e08932	Test fixtures krb5 (#40297 ) Replaces the vagrant based kerberos fixtures with docker based test fixtures plugin. The configuration is now entirely static on the docker side and no longer driven by Gradle, also two different services are being configured since there are two different consumers of the fixture that can run in parallel and require different configurations.	2019-03-28 17:26:58 +02:00
Andy Bristol	23395a9b9f	search as you type fieldmapper (#35600 ) Adds the search_as_you_type field type that acts like a text field optimized for as-you-type search completion. It creates a couple subfields that analyze the indexed terms as shingles, against which full terms are queried, and a prefix subfield that analyze terms as the largest shingle size used and edge-ngrams, against which partial terms are queried Adds a match_bool_prefix query type that creates a boolean clause of a term query for each term except the last, for which a boolean clause with a prefix query is created. The match_bool_prefix query is the recommended way of querying a search as you type field, which will boil down to term queries for each shingle of the input text on the appropriate shingle field, and the final (possibly partial) term as a term query on the prefix field. This field type also supports phrase and phrase prefix queries however	2019-03-27 13:29:13 -07:00
Tim Brooks	ab44f5fd5d	Add InboundHandler for inbound message handling (#40430 ) This commit adds an InboundHandler to handle inbound message processing. With this commit, this code is moved out of the TcpTransport. Additionally, finer grained unit tests are added to ensure that the inbound processing works as expected	2019-03-27 12:33:26 -06:00
Tim Brooks	3860ddd1a4	Move outbound message handling to OutboundHandler (#40336 ) Currently there are some components of message serializer and sending that still occur in TcpTransport. This commit makes it possible to send a message without the TcpTransport by moving all of the remaining application logic to the OutboundHandler. Additionally, it adds unit tests to ensure that this logic works as expected.	2019-03-27 11:47:36 -06:00
Alpar Torok	524e0273ae	Testclusters: convert plugin repository-s3 (#40399 ) * Add support for setting and keystore settings * system properties and env var config * use testclusters for repository-s3 * Some cleanup of the build.gradle file for plugin-s3 * add runner {} to rest integ test task	2019-03-27 08:40:16 +02:00
Henning Andersen	bf444b9f02	Store Pending Deletions Fix (#40345 ) FilterDirectory.getPendingDeletions does not delegate, fixed temporarily by overriding in StoreDirectory. This in turn caused duplicate file name use after a trimUnsafeCommits had been done, since a new IndexWriter would not consider the pending deletes in IndexFileDeleter. This should only happen on windows (AFAIK). Reenabled doing index updates for all tests using IndexShardTests.indexOnReplicaWithGaps (which could fail due to above when using mocked WindowsFS). Added getPendingDeletions delegation to all elasticsearch FilterDirectory subclasses that were not trivial test-only overrides to minimize the risk of hitting this issue in another case.	2019-03-26 15:30:44 +01:00
Armin Braun	13d76239a0	Use Netty ByteBuf Bulk Operations for Faster Deserialization (#40158 ) (#40339 ) * Use bulk methods to read numbers faster from byte buffers	2019-03-24 19:08:51 +01:00
Yannick Welsch	8f84f455c3	Remove note about Azure ARM plugin (#40219 ) Relates to #22679	2019-03-20 16:31:37 +01:00
Tim Brooks	0b50a670a4	Remove transport name from tcp channel (#40074 ) Currently, we maintain a transport name ("mock-nio", "nio", "netty") that is passed to a `TcpTransportChannel` when a request is received. The value of this name is to associate with the task when we register a task with the task manager. However, it is only possible to run ES with one transport, so having an implementation specific name is unnecessary. This commit removes the name and replaces it with the generic "transport".	2019-03-15 12:04:13 -06:00
Ryan Ernst	8f09c77777	Add no-jdk distributions (#39882 ) This commit adds a variant for every official distribution that omits the bundled jdk. The "no-jdk" naming is conveyed through the package classifier, alongside the platform. Package tests are also added for each new distribution.	2019-03-15 00:55:57 -07:00
Jason Tedor	d02bca1314	Upgrade the bouncycastle dependency to 1.61 (#40017 ) This commit upgrades the bouncycastle dependency from 1.59 to 1.61.	2019-03-14 08:54:47 -04:00
Jim Ferenczi	7a7658707a	Upgrade to Lucene release 8.0.0 (#39998 ) This commit upgrades to the GA release of Lucene 8 Closes #39640	2019-03-13 18:11:50 +01:00
Jay Modi	ba40230c7f	Add client jar for transport-nio (#39860 ) This change marks the transport-nio plugin as having a client jar. The nio transport can be used from a transport client and the x-pack-transport artifact depends on the transport-nio jar but this jar is not published. This change marks the transport-nio project as having a client jar so that the jar may be published in the same way that we publish the netty4 transport artifact. The change to actually publish the jar will be handled separately as an update to the release manager.	2019-03-12 09:27:08 -06:00
David Emanuel Buchmann	b5ed039160	plugins/repository-gcs: Update google-cloud-storage/core to 1.59.0 (#39748 ) * plugins/repository-gcs: Update google-cloud-storage / google-cloud-core to 1.59.0 * plugins: Update sha1 for google-cloud-core & google-cloud-storage	2019-03-10 11:04:52 -04:00
Alpar Torok	6c75a2f2b0	Testclusters: start using it for testing some plugins (#39693 ) * enable testclusters for some plugins	2019-03-07 17:52:50 +02:00
markharwood	1873de5240	Bug fix for AnnotatedTextHighlighter - port of 39525 (#39749 ) Bug fix for AnnotatedTextHighlighter - port of 39525 Relates to #39395	2019-03-06 19:02:04 +00:00
Armin Braun	aaecaf59a4	Optimize Bulk Message Parsing and Message Length Parsing (#39634 ) (#39730 ) * Optimize Bulk Message Parsing and Message Length Parsing * findNextMarker took almost 1ms per invocation during the PMC rally track * Fixed to be about an order of magnitude faster by using Netty's bulk `ByteBuf` search * It is unnecessary to instantiate an object (the input stream wrapper) and throw it away, just to read the `int` length from the message bytes * Fixed by adding bulk `int` read to BytesReference	2019-03-06 08:13:15 +01:00
Armin Braun	65732d707f	Add Support for S3 Intelligent Tiering (#39376 ) (#39620 ) * Add support for S3 intelligent tiering * Closes #38836	2019-03-04 10:32:37 +01:00
Alpar Torok	813351fe26	Un-mute and fix BuildExamplePluginsIT (#38899 ) * Un-mute and fix BuildExamplePluginsIT There doesn't seem to be anything wrong with the test iteself. I think the failure were CI performance related, but while it was muted, some failures managed to sneak in. Closes #38784 * PR review	2019-03-04 08:50:55 +02:00
Alan Woodward	71b8494181	Upgrade to lucene 8.0.0-snapshot-ff9509a8df (#39444 ) Backport of #39350 Contains the following: * LUCENE-8635: Move terms dictionary off-heap for non-primary-key fields in `MMapDirectory` * LUCENE-8292: `TermsEnum` is fully abstract * LUCENE-8679: Return WITHIN in `EdgeTree#relateTriangle` only when polygon and triangle share one edge * LUCENE-8676: Nori tokenizer deals correctly with large buffers * LUCENE-8697: `GraphTokenStreamFiniteStrings` better handles side paths with gaps * LUCENE-8664: Add `equals` and `hashCode` to `TotalHits` * LUCENE-8660: `TopDocsCollector` returns accurate hit counts if the total equals the threshold * LUCENE-8654: `Polygon2D#relateTriangle` fix for when the polygon is inside the triangle * LUCENE-8645: `Intervals#fixField` can merge intervals from different fields * LUCENE-8585: Create jump-tables for DocValues at index time	2019-02-27 14:36:08 +00:00
Jason Tedor	224600f370	Bump jackson-databind version for AWS SDK (#39183 ) This commit bumps the jackson-databind version for discovery-ec2 and repository-s3 to 2.8.11.3.	2019-02-20 13:04:50 -05:00
Henning Andersen	00a26b9dd2	Blob store compression fix (#39073 ) Blob store compression was not enabled for some of the files in snapshots due to constructor accessing sub-class fields. Fixed to instead accept compress field as constructor param. Also fixed chunk size validation to work. Deprecated repositories.fs.compress setting as well to be able to unify in a future commit.	2019-02-20 09:24:41 +01:00
Alan Woodward	176013e23c	Avoid double term construction in DfsPhase (#38716 ) DfsPhase captures terms used for scoring a query in order to build global term statistics across multiple shards for more accurate scoring. It currently does this by building the query's `Weight` and calling `extractTerms` on it to collect terms, and then calling `IndexSearcher.termStatistics()` for each collected term. This duplicates work, however, as the various `Weight` implementations will already have collected these statistics at construction time. This commit replaces this round-about way of collecting stats, instead using a delegating IndexSearcher that collects the term contexts and statistics when `IndexSearcher.termStatistics()` is called from the Weight. It also fixes a bug when using rescorers, where a `QueryRescorer` would calculate distributed term statistics, but ignore field statistics. `Rescorer.extractTerms` has been removed, and replaced with a new method on `RescoreContext` that returns any queries used by the rescore implementation. The delegating IndexSearcher then collects term contexts and statistics in the same way described above for each Query.	2019-02-15 16:00:38 +00:00
Tanguy Leroux	510829f9f7	TransportVerifyShardBeforeCloseAction should force a flush (#38401 ) This commit changes the `TransportVerifyShardBeforeCloseAction` so that it always forces the flush of the shard. It seems that #37961 is not sufficient to ensure that the translog and the Lucene commit share the exact same max seq no and global checkpoint information in case of one or more noop operations have been made. The `BulkWithUpdatesIT.testThatMissingIndexDoesNotAbortFullBulkRequest` and `FrozenIndexTests.testFreezeEmptyIndexWithTranslogOps` test this trivial situation and they both fail 1 on 10 executions. Relates to #33888	2019-02-06 13:22:54 +01:00
David Turner	5a3c452480	Align docs etc with new discovery setting names (#38492 ) In #38333 and #38350 we moved away from the `discovery.zen` settings namespace since these settings have an effect even though Zen Discovery itself is being phased out. This change aligns the documentation and the names of related classes and methods with the newly-introduced naming conventions.	2019-02-06 11:34:38 +00:00
Luca Cavanna	a7046e001c	Remove support for maxRetryTimeout from low-level REST client (#38085 ) We have had various reports of problems caused by the maxRetryTimeout setting in the low-level REST client. Such setting was initially added in the attempts to not have requests go through retries if the request already took longer than the provided timeout. The implementation was problematic though as such timeout would also expire in the first request attempt (see #31834), would leave the request executing after expiration causing memory leaks (see #33342), and would not take into account the http client internal queuing (see #25951). Given all these issues, it seems that this custom timeout mechanism gives little benefits while causing a lot of harm. We should rather rely on connect and socket timeout exposed by the underlying http client and accept that a request can overall take longer than the configured timeout, which is the case even with a single retry anyways. This commit removes the `maxRetryTimeout` setting and all of its usages.	2019-02-06 08:43:47 +01:00
Jay Modi	7ca5495d86	Allow custom authorization with an authorization engine (#38358 ) For some users, the built in authorization mechanism does not fit their needs and no feature that we offer would allow them to control the authorization process to meet their needs. In order to support this, a concept of an AuthorizationEngine is being introduced, which can be provided using the security extension mechanism. An AuthorizationEngine is responsible for making the authorization decisions about a request. The engine is responsible for knowing how to authorize and can be backed by whatever mechanism a user wants. The default mechanism is one backed by roles to provide the authorization decisions. The AuthorizationEngine will be called by the AuthorizationService, which handles more of the internal workings that apply in general to authorization within Elasticsearch. In order to support external authorization services that would back an authorization engine, the entire authorization process has become asynchronous, which also includes all calls to the AuthorizationEngine. The use of roles also leaked out of the AuthorizationService in our existing code that is not specifically related to roles so this also needed to be addressed. RequestInterceptor instances sometimes used a role to ensure a user was not attempting to escalate their privileges. Addressing this leakage of roles meant that the RequestInterceptor execution needed to move within the AuthorizationService and that AuthorizationEngines needed to support detection of whether a user has more privileges on a name than another. The second area where roles leaked to the user is in the handling of a few privilege APIs that could be used to retrieve the user's privileges or ask if a user has privileges to perform an action. To remove the leakage of roles from these actions, the AuthorizationService and AuthorizationEngine gained methods that enabled an AuthorizationEngine to return the response for these APIs. Ultimately this feature is the work included in: #37785 #37495 #37328 #36245 #38137 #38219 Closes #32435	2019-02-05 13:39:29 -07:00
David Turner	2d114a02ff	Rename static Zen1 settings (#38333 ) Renames the following settings to remove the mention of `zen` in their names: - `discovery.zen.hosts_provider` -> `discovery.seed_providers` - `discovery.zen.ping.unicast.concurrent_connects` -> `discovery.seed_resolver.max_concurrent_resolvers` - `discovery.zen.ping.unicast.hosts.resolve_timeout` -> `discovery.seed_resolver.timeout` - `discovery.zen.ping.unicast.hosts` -> `discovery.seed_addresses`	2019-02-05 08:46:52 +00:00
David Turner	1d82a6d9f9	Deprecate unused Zen1 settings (#38289 ) Today the following settings in the `discovery.zen` namespace are still used: - `discovery.zen.no_master_block` - `discovery.zen.hosts_provider` - `discovery.zen.ping.unicast.concurrent_connects` - `discovery.zen.ping.unicast.hosts.resolve_timeout` - `discovery.zen.ping.unicast.hosts` This commit deprecates all other settings in this namespace so that they can be removed in the next major version.	2019-02-04 08:52:08 +00:00
Julie Tibshirani	c2e9d13ebd	Default include_type_name to false in the yml test harness. (#38058 ) This PR removes the temporary change we made to the yml test harness in #37285 to automatically set `include_type_name` to `true` in index creation requests if it's not already specified. This is possible now that the vast majority of index creation requests were updated to be typeless in #37611. A few additional tests also needed updating here. Additionally, this PR updates the test harness to set `include_type_name` to `false` in index creation requests when communicating with 6.x nodes. This mirrors the logic added in #37611 to allow for typeless document write requests in test set-up code. With this update in place, we can remove many references to `include_type_name: false` from the yml tests.	2019-02-01 11:44:13 -08:00
Jay Modi	54dbf9469c	Update httpclient for JDK 11 TLS engine (#37994 ) The apache commons http client implementations recently released versions that solve TLS compatibility issues with the new TLS engine that supports TLSv1.3 with JDK 11. This change updates our code to use these versions since JDK 11 is a supported JDK and we should allow the use of TLSv1.3.	2019-01-30 14:24:29 -07:00
David Turner	81c443c9de	Deprecate minimum_master_nodes (#37868 ) Today we pass `discovery.zen.minimum_master_nodes` to nodes started up in tests, but for 7.x nodes this setting is not required as it has no effect. This commit removes this setting so that nodes are started with more realistic configurations, and deprecates it.	2019-01-30 20:09:15 +00:00
Colin Goodheart-Smithe	21e392e95e	Removes typed calls from YAML REST tests (#37611 ) This PR attempts to remove all typed calls from our YAML REST tests. The PR adds include_type_name: false to create index requests that use a mapping and also to put mapping requests. It also removes _type from index requests where they haven't already been removed. The PR ignores tests named *_with_types.yml since this are specifically testing typed API behaviour. The change also includes changing the test harness to add the type _doc to index, update, get and bulk requests that do not specify the document type when the test is running against a mixed 7.x/6.x cluster.	2019-01-30 16:32:58 +00:00
Armin Braun	57823c484f	Streamline S3 Repository- and Client-Settings (#37393 ) * Make repository settings override static settings * Cache clients according to settings * Introduce custom implementations for the AWS credentials here to be able to use them as part of a hash key	2019-01-30 06:22:31 +01:00
Armin Braun	899dfc38bc	Fix S3 Repository ITs When Docker is not Available (#37878 ) * Disable Minio fixture and tests that require it when fixtures are disabled or Docker is not available * Relates #37852	2019-01-25 22:55:29 +01:00
Armin Braun	be6bdab346	Use TestFixturesPlugin to Run Minio in Tests (#37852 ) * Use TestFixturesPlugin to Run Minio in Tests * Closes #37680 * Closes #37783	2019-01-25 12:56:40 +01:00
Armin Braun	d7fe4e57fe	Make Minio Setup more Reliable (#37747 ) * Retry starting Minio five times in case we run into a race between finding the free port and starting it up * Closes #37680	2019-01-23 19:05:25 +01:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Adrien Grand	e9fcb25a28	Upgrade to lucene-8.0.0-snapshot-83f9835. (#37668 ) This snapshot uses a new file format for doc-values which is expected to make advance/advanceExact perform faster on sparse fields: https://issues.apache.org/jira/browse/LUCENE-8585	2019-01-22 11:44:29 +01:00
Tim Brooks	21838d73b5	Extract message serialization from `TcpTransport` (#37034 ) This commit introduces a NetworkMessage class. This class has two subclasses - InboundMessage and OutboundMessage. These messages can be serialized and deserialized independent of the transport. This allows more granular testing. Additionally, the serialization mechanism is now a simple Supplier. This builds the framework to eventually move the serialization of transport messages to the network thread. This is the one serialization component that is not currently performed on the network thread (transport deserialization and http serialization and deserialization are all on the network thread).	2019-01-21 14:14:18 -07:00
Tim Brooks	f516d68fb2	Share `NioGroup` between http and transport impls (#37396 ) Currently we create dedicated network threads for both the http and transport implementations. Since these these threads should never perform blocking operations, these threads could be shared. This commit modifies the nio-transport to have 0 http workers be default. If the default configs are used, this will cause the http transport to be run on the transport worker threads. The http worker setting will still exist in case the user would like to configure dedicated workers. Additionally, this commmit deletes dedicated acceptor threads. We have never had these for the netty transport and they can be added back if a need is determined in the future.	2019-01-21 13:50:56 -07:00
Ryan Ernst	9a34b20233	Simplify integ test distribution types (#37618 ) The integ tests currently use the raw zip project name as the distribution type. This commit simplifies this specification to be "default" or "oss". Whether zip or tar is used should be an internal implementation detail of the integ test setup, which can (in the future) be platform specific.	2019-01-21 12:37:17 -08:00
Jim Ferenczi	4351a5e537	Allow field types to optimize phrase prefix queries (#37436 ) This change adds a way to customize how phrase prefix queries should be created on field types. The match phrase prefix query is exposed in field types in order to allow optimizations based on the options set on the field. For instance the text field uses the configured prefix field (if available) to build a span near that mixes the original field and the prefix field on the last position. This change also contains a small refactoring of the match/multi_match query that simplifies the interactions between the builders. Closes #31921	2019-01-17 15:10:28 +01:00
Przemyslaw Gomulka	5e94f384c4	Remove the use of AbstracLifecycleComponent constructor #37488 (#37488 ) The AbstracLifecycleComponent used to extend AbstractComponent, so it had to pass settings to the constractor of its supper class. It no longer extends the AbstractComponent so there is no need for this constructor There is also no need for AbstracLifecycleComponent subclasses to have Settings in their constructors if they were only passing it over to super constructor. This is part 1. which will be backported to 6.x with a migration guide/deprecation log. part 2 will have this constructor removed in 7 relates #35560 relates #34488	2019-01-16 09:05:30 +01:00
Daniel Mitterdorfer	abe35fb99b	Remove unused index store in directory service With this commit we remove the unused field `indexStore` from all implementations of `FsDirectoryService`. Relates #37097	2019-01-14 13:44:32 +01:00
Luca Cavanna	d54f88f62c	Remove unused empty constructors from suggestions classes (#37295 ) We recently migrated suggestions to `Writeable`. That allows us to also clean up empty constructors and methods that called them as they are no longer needed. They are replaced by constructors that accept a `StreamInput` instance.	2019-01-14 08:32:45 +01:00
Alpar Torok	6344e9a3ce	Testing conventions: add support for checking base classes (#36650 )	2019-01-08 13:39:03 +02:00
Christophe Bismuth	9602d794c6	Separate out validation of groups of settings (#34184 ) Today, a setting can declare that its validity depends on the values of other related settings. However, the validity of a setting is not always checked against the correct values of its dependent settings because those settings' correct values may not be available when the validator runs. This commit separates the validation of a settings updates into two phases, with separate methods on the `Setting.Validator` interface. In the first phase the setting's validity is checked in isolation, and in the second phase it is checked again against the values of its related settings. Most settings only use the first phase, and only the few settings with dependencies make use of the second phase.	2019-01-07 16:12:58 +00:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Armin Braun	617e294133	SNAPSHOT: Make Atomic Blob Writes Mandatory (#37168 ) * With #37066 introducing atomic writes to HDFS repository we can enforce atomic write capabilities on this interface * The overrides on the other three cloud implementations are ok because: * https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html states that "Amazon S3 never adds partial objects; if you receive a success response, Amazon S3 added the entire object to the bucket." * https://cloud.google.com/storage/docs/consistency states that GCS has strong read-after-write consistency * https://docs.microsoft.com/en-us/rest/api/storageservices/put-block#remarks Azure has the concept of committing blobs, so there's no partial content here either * Relates #37011	2019-01-07 12:11:19 +01:00
Armin Braun	31c33fdb9b	MINOR: Remove some Deadcode in Gradle (#37160 )	2019-01-07 09:21:25 +01:00
Armin Braun	99f13b90d3	SNAPSHOT: Speed up HDFS Repository Writes (#37069 ) * There is no point in hsyncing after every individual write since there is only value in completely written blobs for restores, this is ensures by the `SYNC` flag already and there is no need for separately invoking `hsync` after individual writes	2019-01-03 16:16:05 +01:00
Armin Braun	10d9819f99	Implement Atomic Blob Writes for HDFS Repository (#37066 ) * Implement atomic writes the same way we do for the FsBlobContainer via rename which is atomic * Relates #37011	2019-01-03 15:51:47 +01:00
Nick Knize	b2aa655f46	Upgrade master to lucene-8.0.0-snapshot-a1c6e642aa (#37091 ) Updates the master branch to the latest snapshot of Lucene 8.0.	2019-01-02 20:18:19 -06:00
Nhat Nguyen	7580d9d925	Make SourceToParse immutable (#36971 ) Today the routing of a SourceToParse is assigned in a separate step after the object is created. We can easily forget to set the routing. With this commit, the routing must be provided in the constructor of SourceToParse. Relates #36921	2018-12-24 14:06:50 -05:00
Jason Tedor	1f574bd17a	Package ingest-user-agent as a module (#36956 ) This commit moves ingest-user-agent from being a plugin to being a module that is packaged with Elasticsearch distributions.	2018-12-22 20:20:53 -05:00
Jason Tedor	e1717df0ac	Package ingest-geoip as a module (#36898 ) This commit moves ingest-geoip from being a plugin to being a module that is packaged with Elasticsearch distributions.	2018-12-22 07:21:49 -05:00
Jason Tedor	6781a29f9b	Move ingest-geoip default databases out of config (#36949 ) Today the default databases bundled with ingest-geoip are treated as configuration files that we unbundle into the Elasticsearch configuration directory. This can cause problems for users using our Docker images if they bind mount over the configuration directory. Additionally, it creates complexity when trying to convert ingest-geoip to a module. This commit moves these databases out of the configuration directory and instead loads from the plugin directory. Further, custom databases can still be put into the configuration directory.	2018-12-21 18:05:50 -05:00
Alpar Torok	e9ef5bdce8	Converting randomized testing to create a separate unitTest task instead of replacing the builtin test task (#36311 ) - Create a separate unitTest task instead of Gradle's built in - convert all configuration to use the new task - the built in task is now disabled	2018-12-19 08:25:20 +02:00
Jason Tedor	c65d2339db	Simplify formatting in geo-IP test case This commit cleans up the formatting in a geo-IP processor factory test case so that the code does not consume a silly number of lines.	2018-12-18 22:20:53 -05:00
Jason Tedor	273b37aaa4	Make the ingest-geoip databases even lazier to load (#36679 ) Today we try to load the ingest-geoip databases lazily. Currently they are loaded as soon as any pipeline that uses an ingest-geoip processor is created. This is not lazy enough. For example, we could only load the databases the first time that they are actually used. This would ensure that we load the minimal set of data to support an in-use pipeline (instead of all of the data). This can come up in a couple of ways. One is when a subset of the database is used (e.g., the city database versus the country database versus the ASN database). Another is when the plugins are installed on non-ingest nodes (e.g., master-only nodes); we would never use the database in this case yet they are currently being loaded occupying ~60 MB of the heap. This commit makes the ingest-geoip databases as lazy as possible. Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com>	2018-12-18 22:15:33 -05:00
Boaz Leskes	733a6d34c1	Add seq no powered optimistic locking support to the index and delete transport actions (#36619 ) This commit add support for using sequence numbers to power [optimistic concurrency control](http://en.wikipedia.org/wiki/Optimistic_concurrency_control) in the delete and index transport actions and requests. A follow up will come with adding sequence numbers to the update and get results. Relates #36148 Relates #10708	2018-12-15 17:59:57 +01:00
Tim Brooks	3065300434	Unify transport settings naming (#36623 ) This commit updates our transport settings for 7.0. It generally takes a few approaches. First, for normal transport settings, it usestransport. instead of transport.tcp. Second, it uses transport.tcp, http.tcp, or network.tcp for all settings that are proxies for OS level socket settings. Third, it marks the network.tcp.connect_timeout setting for removal. Network service level settings are only settings that apply to both the http and transport modules. There is no connect timeout in http. Fourth, it moves all the transport settings to a single class TransportSettings similar to the HttpTransportSettings class. This commit does not actually remove any settings. It just adds the new renamed settings and adds todos for settings that will be deprecated.	2018-12-14 14:41:04 -07:00
Alan Woodward	c7ac9ef826	Upgrade to lucene snapshot 774e9aefbc (#36637 ) Includes LUCENE-8607: improvement to MatchAllDocsQuery	2018-12-14 20:30:07 +00:00
Tim Brooks	fbf88b2ab7	Remove the `MockTcpTransport` (#36628 ) This commit removes all remaining usages of the `MockTcpTransport`. Additionally it removes the `MockTcpTransport` and its test case.	2018-12-14 10:59:07 -07:00
Armin Braun	5df93218d5	SNAPSHOTS: Upgrade GCS Dependencies to 1.55.0 (#36634 ) * Closes #35459 * Closes #35229	2018-12-14 13:24:29 +01:00
Alan Woodward	9ac7359643	Update lucene to snapshot-7e4555a2fd (#36563 ) Includes the following: * Reversion of doc-values changes in LUCENE-8374; we are interested in seeing if this has an effect on benchmarks for node-stats and index-stats * More improvements to docvalues updates	2018-12-12 20:18:32 +00:00
Nhat Nguyen	3fb5a12b30	Upgrade to Lucene-8.0.0-snapshot-61e448666d (#36518 ) Includes: - LUCENE-8602: Share TermsEnum if possible while applying DV updates	2018-12-12 06:47:40 +01:00
Tim Brooks	797f985067	Add version to handshake requests (#36171 ) Currently our handshake requests do not include a version. This is unfortunate as we cannot rely on the stream version since it is not the sending node's version. Instead it is the minimum compatibility version. The handshake request is currently empty and we do nothing with it. This should allow us to add data to the request without breaking backwards compatibility. This commit adds the version to the handshake request. Additionally, it allows "future data" to be added to the request. This allows nodes to craft a version compatible response. And will properly handle additional data in future handshake requests. The proper handling of "future data" is useful as this is the only request where we do not know the other node's version. Finally, it renames the TcpTransportHandshaker to TransportHandshaker.	2018-12-11 16:09:28 -07:00
Luca Cavanna	fb18b35347	Add default methods to DocValueFormat (#36480 ) The different `DocValueFormat` implementors throw `UnsupportedOperationException` for methods that they don't support. That is perfectly fine, and quite common as not all implementors support all of the possible formats. This makes it hard though to trace back which implementors support which formats as they all implement the same methods. This commit introduces default methods in the `DocValueFormat` interface so that all methods throw `UnsupportedOperationException` by default. This way implementors can override only the methods that they specifically support.	2018-12-11 20:19:01 +01:00
Tim Brooks	790f8102e9	Modify `BigArrays` to take name of circuit breaker (#36461 ) This commit modifies BigArrays to take a circuit breaker name and the circuit breaking service. The default instance of BigArrays that is passed around everywhere always uses the request breaker. At the network level, we want to be using the inflight request breaker. So this change will allow that. Additionally, as this change moves away from a single instance of BigArrays, the class is modified to not be a Releasable anymore. Releasing big arrays was always dispatching to the PageCacheRecycler, so this change makes the PageCacheRecycler the class that needs to be managed and torn-down. Finally, this commit closes #31435 be making the serialization of transport messages use the inflight request breaker. With this change, we no longer push the global BigArrays instnace to the network level.	2018-12-11 11:55:41 -07:00
Nhat Nguyen	2a7edca59f	Upgrade to Lucene-8.0.0-snapshot-ef61b547b1 (#36450 ) Includes: - LUCENE-8598: Improve field updates packed values - LUCENE-8599: Use sparse bitset to store docs in SingleValueDocValuesFieldUpdates	2018-12-10 16:33:49 -05:00
Yannick Welsch	6e6e63d01d	Zen2: Move all mixed-version REST tests to Zen2 (#36398 ) Moves all remaining (rolling-upgrade and mixed-version) REST tests to use Zen2. To avoid adding extra configuration, it relies on Zen2 being set as the default discovery type. This required a few smaller changes in other tests. I've removed AzureMinimumMasterNodesTests which tests Zen1 functionality and dates from a time where host providers were not configurable and each cloud plugin had its own discovery.type, subclassing the ZenDiscovery class. I've also adapted a few tests which were unnecessarily adding addTestZenDiscovery = false for the same legacy reasons. Finally, this also moves the unconfigured-node-name REST test to Zen2, testing the auto-bootstrapping functionality in development mode when no discovery configuration is provided.	2018-12-10 11:00:57 +01:00
Jim Ferenczi	a53e8653f2	Add support for inlined user dictionary in Nori (#36123 ) Add support for inlined user dictionary in Nori This change adds a new option called `user_dictionary_rules` to the Nori a tokenizer`. It can be used to set additional tokenization rules to the Korean tokenizer directly in the settings (instead of using a file). Closes #35842	2018-12-07 15:26:08 +01:00
Nhat Nguyen	10feb75eb7	Upgrade to Lucene-8.0.0-snapshot-aaa64d70159 (#36335 ) Includes: LUCENE-8594: DV update are broken for updates on new field LUCENE-8590: Optimize DocValues update datastructures LUCENE-8593: Specialize single value numeric DV updates Relates #36286	2018-12-06 20:33:25 -05:00
Ryan Ernst	37b3fc383f	Build: Use explicit deps on test tasks for check (#36325 ) This commit moves back to use explicit dependsOn for test tasks on check. Not all tasks extending RandomizedTestingTask should be run by check directly.	2018-12-06 14:13:49 -08:00
Tanguy Leroux	6186ccf83e	[Tests] Fix third party tests with Gradle 5.0 (#36302 ) * [Tests] Fix third party tests with Gradle 5.0 * apply feedback	2018-12-06 16:05:05 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
Alpar Torok	60e45cd81d	Testing conventions task part 2 (#36107 ) Closes #35435 - make it easier to add additional testing tasks with the proper configuration and add some where they were missing. - mute or fix failing tests - add a check as part of testing conventions to find classes not included in any testing task.	2018-12-05 14:20:01 +02:00
Alpar Torok	59b0900174	Upgrade to Gradle 5.0 (#34263 )	2018-12-05 14:06:11 +02:00
Tim Brooks	370472b6d1	Upgrade Netty 4.3.32.Final (#36102 ) This commit upgrades netty. This will close #35360. Netty started throwing an IllegalArgumentException if a CompositeByteBuf is created with < 2 components. Netty4Utils was updated to reflect this change.	2018-11-30 09:02:10 -07:00
Tim Brooks	c305f9dc03	Make keepalive pings bidirectional and optimizable (#35441 ) This is related to #34405 and a follow-up to #34753. It makes a number of changes to our current keepalive pings. The ping interval configuration is moved to the ConnectionProfile. The server channel now responds to pings. This makes the keepalive pings bidirectional. On the client-side, the pings can now be optimized away. What this means is that if the channel has received a message or sent a message since the last pinging round, the ping is not sent for this round.	2018-11-29 08:55:53 -07:00
Alan Woodward	a646f85a99	Ensure TokenFilters only produce single tokens when parsing synonyms (#34331 ) A number of tokenfilters can produce multiple tokens at the same position. This is a problem when using token chains to parse synonym files, as the SynonymMap requires that there are no stacked tokens in its input. This commit ensures that when used to parse synonyms, these tokenfilters either produce a single version of their input token, or that they throw an error when mappings are generated. In indexes created in elasticsearch 6.x deprecation warnings are emitted in place of the error. * asciifolding and cjk_bigram produce only the folded or bigrammed token * decompounders, synonyms and keyword_repeat are skipped * n-grams, word-delimiter-filter, multiplexer, fingerprint and phonetic throw errors Fixes #34298	2018-11-29 10:35:38 +00:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Tim Brooks	b6ed6ef189	Add sni name to SSLEngine in nio transport (#35920 ) This commit is related to #32517. It allows an "sni_server_name" attribute on a DiscoveryNode to be propagated to the server using the TLS SNI extentsion. Prior to this commit, this functionality was only support for the netty transport. This commit adds this functionality to the security nio transport.	2018-11-27 09:06:52 -07:00
Armin Braun	04a48430aa	NETWORKING: Upgrade Netty 4.3.31.Final (#35877 )	2018-11-25 10:17:54 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Alan Woodward	f6a43b5939	Add a prebuilt ICU Analyzer (#34958 ) The ICU plugin provides the building blocks of an analysis chain, but doesn't actually have a prebuilt analyzer. It would be a better for users if there was a simple analyzer that they could use out of the box, and also something we can point to from the CJK Analyzer docs as a superior alternative. Relates to #34285	2018-11-21 09:00:48 +00:00
Gordon Brown	b2057138a7	Remove AbstractComponent from AbstractLifecycleComponent (#35560 ) AbstractLifecycleComponent now no longer extends AbstractComponent. In order to accomplish this, many, many classes now instantiate their own logger.	2018-11-19 09:51:32 -07:00
Jernej Klancic	baf33b3162	Removes AbstractComponent from several classes (#35566 ) Removes inhertiting from AbstractComponent for some classes (mostly in the plugins module). Relates to #34488	2018-11-16 20:50:18 +01:00
Nick Knize	2591f66a33	upgrade to lucene-8.0.0-snapshot-6d9c714052 (#35428 )	2018-11-12 10:48:27 -06:00
Christoph Büscher	09cac321e7	Upgrade to Joda 2.10.1 (#35410 ) This version contains a bugfix that allows us to reenable one of our muted tests in DateTimeUnitTests. Closes #33749	2018-11-12 10:02:41 +01:00
Armin Braun	02b4e28534	#31608 Add S3 Setting to Force Path Type Access (#34721 ) * SNAPSHOTS: Use Path Style Access in S3 * Use path style access pattern to fix #31608 * closes #31608	2018-11-09 05:07:26 +01:00
Tim Brooks	93c2c604e5	Move compression config to ConnectionProfile (#35357 ) This is related to #34483. It introduces a namespaced setting for compression that allows users to configure compression on a per remote cluster basis. The transport.tcp.compress remains as a fallback setting. If transport.tcp.compress is set to true, then all requests and responses are compressed. If it is set to false, only requests to clusters based on the cluster.remote.cluster_name.transport.compress setting are compressed. However, after this change regardless of any local settings, responses will be compressed if the request that is received was compressed.	2018-11-08 10:37:59 -07:00
Alpar Torok	8a85b2eada	Remove build qualifier from server's Version (#35172 ) With this change, `Version` no longer carries information about the qualifier, we still need a way to show the "display version" that does have both qualifier and snapshot. This is now stored by the build and red from `META-INF`.	2018-11-07 14:01:05 +02:00
Tim Brooks	f395b1eace	Open node connections asynchronously (#35144 ) This is related to #29023. Additionally at other points we have discussed a preference for removing the need to unnecessarily block threads for opening new node connections. This commit lays the groudwork for this by opening connections asynchronously at the transport level. We still block, however, this work will make it possible to eventually remove all blocking on new connections out of the TransportService and Transport.	2018-11-06 17:58:20 -07:00
Yannick Welsch	a4b26febcb	Register Azure max_retries setting (#35286 ) This commit properly registers the Azure max_retries setting in the settings infrastructure, allowing this setting to be actually used.	2018-11-06 19:33:48 +01:00
Nick Knize	a5e1f4d3a2	Upgrade to lucene-8.0.0-snapshot-31d7dfe6b1 (#35224 )	2018-11-06 11:55:23 +01:00
Alan Woodward	9f4b93fd5e	Fix failing ICU tests (#35207 ) Fixes #35173	2018-11-06 09:02:40 +00:00
Vladimir Dolzhenko	9d28a104e1	Discovery EC2: Utilize Amazon SDK to resolve EC2 metadata server (#35246 ) * Discovery EC2: Utilize Amazon SDK to resolve EC2 metadata server Closes #35141	2018-11-05 15:02:09 +01:00
Tim Brooks	0166388d74	Use single netty event loop group for transports (#35181 ) Currently we create a new netty event loop group for client connections and all server profiles. Each new group creates new threads for io processing. This means 2 * num of processors new threads for each group. A single group should be able to handle all io processing (for the transports). This also brings the netty module inline with what we do for nio. Additionally, this PR renames the worker threads to be the same for netty and nio.	2018-11-02 16:31:19 -06:00
Andy Bristol	eec357ebde	[test] quote base_path in repository tests	2018-11-01 13:01:53 -07:00
Alpar Torok	928c9a12d2	Mute failing tests #35173	2018-11-01 19:31:52 +02:00
Alpar Torok	7744f6f590	Fix checkstyle	2018-11-01 14:46:36 +02:00
Ke Li	14f540e8e6	Deprecate unicodeSetFilter in favour of unicode_set_filter (#29215 )	2018-11-01 10:06:51 +00:00
Nik Everett	e28509fbfe	Core: Less settings to AbstractComponent (#35140 ) Stop passing `Settings` to `AbstractComponent`'s ctor. This allows us to stop passing around `Settings` in a ton of places. While this change touches many files, it touches them all in fairly small, mechanical ways, doing a few things per file: 1. Drop the `super(settings);` line on everything that extends `AbstractComponent`. 2. Drop the `settings` argument to the ctor if it is no longer used. 3. If the file doesn't use `logger` then drop `extends AbstractComponent` from it. 4. Clean up all compilation failure caused by the `settings` removal and drop any now unused `settings` isntances and method arguments. I've intentionally not removed the `settings` argument from a few files: 1. TransportAction 2. AbstractLifecycleComponent 3. BaseRestHandler These files don't need `settings` either, but this change is large enough as is. Relates to #34488	2018-10-31 21:23:20 -04:00
Nik Everett	086ada4c08	Core: Drop settings member from AbstractComponent (#35083 ) Drops the `Settings` member from `AbstractComponent`, moving it from the base class on to the classes that use it. For the most part this is a mechanical change that doesn't drop `Settings` accesses. The one exception to this is naming threads where it switches from an invocation that passes `Settings` and extracts the node name to one that explicitly passes the node name. This change doesn't drop the `Settings` argument from `AbstractComponent`'s ctor because this change is big enough as is. We'll do that in a follow up change.	2018-10-30 16:10:38 -04:00
Pratik Sanglikar	f1135ef0ce	Core: Replace deprecated Loggers calls with LogManager. (#34691 ) Replace deprecated Loggers calls with LogManager. Relates to #32174	2018-10-29 15:52:30 -04:00
Armin Braun	71dddfd8f8	MINOR: Remove Deadcode in NioTransport CORS (#34324 ) * Same as #34134 but for nio transport	2018-10-27 14:54:46 +02:00
Nik Everett	9f87fdc7ab	Drop deprecationLogger from AbstractComponent (#34859 ) Drops the `deprecationLogger` from `AbstractComponent`, moving it to places where we need it. This saves us from building a bunch of `DeprecationLogger`s that we don't need. Relates to #34488	2018-10-26 15:40:16 -04:00
Armin Braun	65edec0d42	TEST: Stablize Minio Free Port Search (#34894 ) * Binding to `0` gives us free ports that are assigned sequentially by Linux making collisions much less likely compared to manually finding a free port in a range * Closes #32208	2018-10-26 12:55:21 +02:00
Tal Levy	e1fdd00420	Lowercase static final DeprecationLogger instance names (#34887 ) After discussing on the team's FixItFriday, we concluded that static final instance variables that are mutable should be lowercased. Historically, DeprecationLogger was uppercased more frequently than lowercased.	2018-10-25 21:12:19 -07:00
Tim Brooks	cf9aff954e	Reduce channels in AbstractSimpleTransportTestCase (#34863 ) This is related to #30876. The AbstractSimpleTransportTestCase initiates many tcp connections. There are normally over 1,000 connections in TIME_WAIT at the end of the test. This is because every test opens at least two different transports that connect to each other with 13 channel connection profiles. This commit modifies the default connection profile used by this test to 6. One connection for each type, except for REG which gets 2 connections.	2018-10-25 13:37:49 -06:00
Martijn van Groningen	bb807b147b	fixed test	2018-10-25 08:09:45 +02:00
Alpar Torok	59536966c2	Add a new "contains" feature (#34738 ) The contains syntax was added in #30874 but the skips were not properly put in place. The java runner has the feature so the tests will run as part of the build, but language clients will be able to support it at their own pace.	2018-10-25 08:50:50 +03:00
Jack Conradson	1b085252c3	[Painless] Add instance bindings (#34410 ) This change adds instance bindings to Painless. This binding allows a whitelisted method to be called on an instance instantiated prior to script compilation. Whitelisting must be done in code as there is no practical way to instantiate a useful instance from a text file (see the tests for an example). Since an instance can be shared by multiple scripts, each method called must be thread-safe.	2018-10-24 12:57:28 -07:00
Luca Cavanna	d51bc05dce	[TEST] Improve validation of do sections (#34734 ) We throw parsing exception when an unknown array is found, but we don't when an unknown top-level field is found. This commit makes sure that unsupported top-level fields are not ignored in a do section. Closes #34651	2018-10-24 21:27:07 +02:00
Alpar Torok	795d57b4f9	Auto configure all test tasks (#34666 ) With this change, we apply the common test config automatically to all newly created tasks instead of opting in specifically. For plugin authors using the plugin externally this means that the configuration will be applied to their RandomizedTestingTasks as well. The purpose of the task is to simplify setup and make it easier to change projects that use the `test` task but actually run integration tests to use a task called `integTest` for clarity, but also because we may want to configure and run them differently. E.x. using different levels of concurrency.	2018-10-24 16:05:50 +03:00
Alpar Torok	0536635c44	Upgrade forbiddenapis to 2.6 (#33809 ) * Upgrade forbiddenapis to 2.6 Closes #33759 * Switch forbiddenApis back to official plugin * Remove CLI based task * Fix forbiddenApisJava9	2018-10-23 12:06:46 +03:00

... 5 6 7 8 9 ...

2797 Commits