OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tanguy Leroux	989e465964	Use fixture to test repository-s3 plugin (#29296 ) This commit adds a new fixture that emulates a S3 service in order to improve the existing integration tests. This is very similar to what has been made for Google Cloud Storage in #28788, and such tests would have helped a lot to catch bugs like #22534. The AmazonS3Fixture is brittle and only implements the very necessary stuff for the S3 repository to work, but at least it works and can be adapted for specific tests needs.	2018-04-03 11:30:43 +02:00
Adrien Grand	3bdfc8f3fb	Upgrade to lucene-7.3.0-snapshot-98a6b3d. (#29298 ) Most notable changes include: - this release doesn't have the 7.2.1 version constant so I had to create one - spatial4j and jts were upgraded	2018-04-03 09:27:14 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Lee Hinman	8e8fdc4f0e	Decouple XContentBuilder from BytesReference (#28972 ) * Decouple XContentBuilder from BytesReference This commit removes all mentions of `BytesReference` from `XContentBuilder`. This is needed so that we can completely decouple the XContent code and move it into its own dependency. While this change appears large, it is due to two main changes, moving `.bytes()` and `.string()` out of XContentBuilder itself into static methods `BytesReference.bytes` and `Strings.toString` respectively. The rest of the change is code reacting to these changes (the majority of it in tests). Relates to #28504	2018-03-14 13:47:57 -06:00
David Pilato	87553bba16	Add ingest-attachment support for per document `indexed_chars` limit (#28977 ) We today support a global `indexed_chars` processor parameter. But in some cases, users would like to set this limit depending on the document itself. It used to be supported in mapper-attachments plugin by extracting the limit value from a meta field in the document sent to indexation process. We add an option which reads this limit value from the document itself by adding a setting named `indexed_chars_field`. Which allows running: ``` PUT _ingest/pipeline/attachment { "description" : "Extract attachment information. Used to parse pdf and office files", "processors" : [ { "attachment" : { "field" : "data", "indexed_chars_field" : "size" } } ] } ``` Then index either: ``` PUT index/doc/1?pipeline=attachment { "data": "BASE64" } ``` Which will use the default value (or the one defined by `indexed_chars`) Or ``` PUT index/doc/2?pipeline=attachment { "data": "BASE64", "size": 1000 } ``` Closes #28942	2018-03-14 19:07:20 +01:00
Jason Tedor	5904d936fa	Copy Lucene IOUtils (#29012 ) As we have factored Elasticsearch into smaller libraries, we have ended up in a situation that some of the dependencies of Elasticsearch are not available to code that depends on these smaller libraries but not server Elasticsearch. This is a good thing, this was one of the goals of separating Elasticsearch into smaller libraries, to shed some of the dependencies from other components of the system. However, this now means that simple utility methods from Lucene that we rely on are no longer available everywhere. This commit copies IOUtils (with some small formatting changes for our codebase) into the fold so that other components of the system can rely on these methods where they no longer depend on Lucene.	2018-03-13 12:49:33 -04:00
Daniel Mitterdorfer	b2557b9c11	Skip GeoIpProcessorFactoryTests on Windows (#29005 ) With this commit we skip all GeoIpProcessorFactoryTests on Windows. These tests use a MappedByteBuffer which will keep its file mappings until it is garbage-collected. As a consequence, the corresponding file appears to be still in use, Windows cannot delete it and the test will fail in teardown. Closes #29001	2018-03-13 09:10:40 +01:00
Tanguy Leroux	5a65db153e	[Test] GoogleCloudStorageFixture command line is too long on Windows (#28991 ) Windows has some strong limitations on command line arguments, specially when it's too long. In the googlecloudstoragefixture anttask the classpath argument is very long and the command fails. This commit removes the classpath as an argument and uses the CLASSPATH environment variable instead.	2018-03-12 18:02:30 +01:00
Daniel Mitterdorfer	0d78a5890e	Reduce heap-memory usage of ingest-geoip plugin (#28963 ) With this commit we reduce heap usage of the ingest-geoip plugin by memory-mapping the database files. Previously, we have stored these files gzip-compressed but this has resulted that data are loaded on the heap. Closes #28782	2018-03-12 08:07:33 +01:00
Tanguy Leroux	d9cc6b9270	Remove temporary file 10_basic.yml~	2018-03-09 17:44:10 +01:00
Tanguy Leroux	4756790d6e	Use fixture to test the repository-gcs plugin (#28788 ) This commit adds a GoogleCloudStorageFixture that uses the logic of a GoogleCloudStorageTestServer (added in #28576) to emulate a remote Google Cloud Storage service. By adding this fixture and a more complete integration test, we should be able to catch more bugs when upgrading the client library. The fixture is started by the googleCloudStorageFixture task and a custom Service Account file is created and added to the Elasticsearch keystore for each test.	2018-03-09 13:57:27 +01:00
Tim Brooks	7d434c16f9	Remove NioNotEnabledBootstrapCheck bootstrap check (#28901 ) This is related to #27260. This commit removes the bootstrap check that prevents nio from being enabled.	2018-03-08 11:06:36 -07:00
Tim Brooks	d8d1f0d4f0	Give transport-nio plugin socket permissions (#28900 ) This is related to #27260. The transport-nio plugin needs socket permissions to operate as a transport. This commit gives it these permissions in the policy file.	2018-03-08 09:33:39 -07:00
Tim Brooks	5a8ec9b762	Selectors operate on channel contexts (#28468 ) This commit is related to #27260. Currently there is a weird relationship between channel contexts and nio channels. The selectors use the context for read and writing. But the selector operates directly on the nio channel for registering, closing, and connecting. This commit works on improving this relationship. The selector operates directly on the context which wraps the low level java.nio.channels. The NioChannel class is simply an API that is used to interact with the channel (sending messages from outside the selector event loop, scheduling a close, adding listeners, etc). The context is only used internally by the channel to implement these apis and by the selector to perform these operations.	2018-02-22 09:44:52 -07:00
Tanguy Leroux	a6a138905d	Use client settings in repository-gcs (#28575 ) Similarly to what has been done for s3 and azure, this commit removes the repository settings `application_name` and `connect/read_timeout` in favor of client settings. It introduce a GoogleCloudStorageClientSettings class (similar to S3ClientSettings) and a bunch of unit tests for that, it aligns the documentation to be more coherent with the S3 one, it documents the connect/read timeouts that were not documented at all and also adds a new client setting that allows to define a custom endpoint.	2018-02-22 15:40:20 +01:00
Tim Brooks	de2a0dfa6e	Ensure that azure stream has socket privileges (#28751 ) This is related to #28662. It wraps the azure repository inputstream in an inputstream that ensures `read` calls have socket permissions. This is because the azure inputstream internally makes service calls.	2018-02-21 11:20:06 -07:00
Tanguy Leroux	9a95be35cf	[Tests] Extract the testing logic for Google Cloud Storage (#28576 ) This pull request extracts in a dedicated class the request/response logic that "emulates" a Google Cloud Storage service in our repository-gcs tests. The idea behind this is to make the logic more reusable. The class MockHttpTransport has been renamed to MockStorage which now only takes care of instantiating a Storage client and does the low-level request/response plumbing needed by this client. The "Google Cloud Storage" logic has been extracted from MockHttpTransport and put in a new GoogleCloudStorageTestServer that is now independent from the google client testing framework.	2018-02-21 13:20:35 +01:00
Tanguy Leroux	9485b43167	[Tests] Fix RetryHttpInitializerWrapperTests.testIOExceptionRetry This commit gives more time to the IO exception handler to retry the request.	2018-02-20 14:54:53 +01:00
Tanguy Leroux	207ca1cc38	[Tests] Simplify GceDiscoverTests (#28726 ) GceDiscoverTests can be simplified in a similar manner than #27945. It now uses a mocked GceInstancesService that exposes internal test cluster nodes as if they were real GCE nodes. It should also make the test more robust by not using a HTTP server anymore. closes #24313	2018-02-20 09:38:22 +01:00
Christoph Büscher	231fd3c9be	[Docs] Remove misleading comment The TikaImpl#parse method comment sounds like this method is only used in the same package for testing, but AttachmentProcessor uses it outside of testing, so we should remove this comment.	2018-02-09 15:47:38 +01:00
Jason Tedor	641a6c9e62	Guard accessDeclaredMembers for Tika on JDK 10 Tika parsers need accessDeclaredMembers because ZipFile needs accessDeclaredMembers on JDK 10. This commit guards adding this permission to parsers so that the permission is only granted on JDK 10. Additionally, we add an assertion that forces us to check if the permission is still needed in JDK 11. Relates #28603	2018-02-09 09:08:07 -05:00
Christoph Büscher	cc9cb5356a	Add missing runtime permission to TikaImpl (#28602 ) Tests on jdk10 were failing because of a change in its ZipFile implementation that now needs `accessDeclaredMembers` permissions. This change adds the missing permission to the plugins security policy and TikaImpl. Closes #28568	2018-02-09 14:41:24 +01:00
Lee Hinman	eebff4d2b3	Use non deprecated xcontenthelper (#28503 ) * Move to non-deprecated XContentHelper.createParser(...) This moves away from one of the now-deprecated XContentHelper.createParser methods in favor of specifying the deprecation logger at parser creation time. Relates to #28449 Note that this doesn't move all the `createParser` calls because some of them use the already-deprecated method that doesn't specify the XContentType. * Remove the deprecated (and now non-needed) createParser method	2018-02-05 16:18:18 -07:00
Tanguy Leroux	be74f11517	Replace jvm-example by two plugin examples (#28339 ) This pull request replaces the jvm-example plugin (from the jvm/site plugins era) by two new plugins: a custom-settings that shows how to register and use custom settings (including secured settings) in a plugin, and rest-handler plugin that shows how to register a rest handler. The two plugins now reside in the plugins/examples project. They can serve as sample plugins for users, a special attention has been put on documentation. The packaging tests have been adapted to use the custom-settings plugin.	2018-01-26 17:34:24 +01:00
kel	c675407a70	Remove redundant argument for buildConfiguration of s3 plugin (#28281 )	2018-01-23 22:32:46 -08:00
Adrien Grand	700d9ecc95	Remove the `update_all_types` option. (#28288 ) This option is not useful in 7.x since no indices may have more than one type anymore.	2018-01-22 12:03:07 +01:00
Tim Brooks	a6a57a71d3	Implement socket and server ChannelContexts (#28275 ) This commit is related to #27260. Currently have a channel context that implements reading and writing logic for socket channels. Additionally, we have exception contexts to handle exceptions. And accepting contexts to handle accepted channels. This PR introduces a ChannelContext that handles close and exception handling for all channel types. Additionally, it has implementers that provide specific functionality for socket channels (read and writing). And specific functionality for server channels (accepting).	2018-01-18 13:06:40 -07:00
Ryan Ernst	cefea1a7c9	Build: Add gradle plugin for configuring meta plugin (#28276 ) This commit adds a gradle plugin to ease development of meta plugins. Applying the plugin will generated the meta plugin properties based on the es_meta_plugin configuration object, which includes name and description. The plugins to include within the meta plugin are configured through the `plugins` list. An integ test task is also automatically added.	2018-01-17 19:47:37 -08:00
Tim Brooks	4ea9ddb7d3	Unify nio read / write channel contexts (#28160 ) This commit is related to #27260. Right now we have separate read and write contexts for implementing specific protocol logic. However, some protocols require a closer relationship between read and write operations than is allowed by our current model. An example is HTTP which might require a write if some problem with request parsing was encountered. Additionally, some protocols require close messages to be sent when a channel is shutdown. This is also problematic in our current model, where we assume that channels should simply be queued for close and forgotten. This commit transitions to a single ChannelContext which implements all read, write, and close logic for protocols. It is the job of the context to tell the selector when to close the channel. A channel can still be manually queued for close with a selector. This is how server channels are closed for now. And this route allows timeout mechanisms on normal channel closes to be implemented.	2018-01-17 09:44:21 -07:00
Jason Tedor	aded32f48f	Fix third-party audit tasks on JDK 8 This one is interesting. The third party audit task runs inside the Gradle JVM. This means that if Gradle is started on JDK 8, the third party audit tasks will fail as a result of the changes to support building Elasticsearch with the JDK 9 compiler. This commit reverts the third party audit changes to support running this task when Gradle is started with JDK 8. Relates #28256	2018-01-16 22:59:29 -05:00
Jason Tedor	0a79555a12	Require JDK 9 for compilation (#28071 ) This commit modifies the build to require JDK 9 for compilation. Henceforth, we will compile with a JDK 9 compiler targeting JDK 8 as the class file format. Optionally, RUNTIME_JAVA_HOME can be set as the runtime JDK used for running tests. To enable this change, we separate the meaning of the compiler Java home versus the runtime Java home. If the runtime Java home is not set (via RUNTIME_JAVA_HOME) then we fallback to using JAVA_HOME as the runtime Java home. This enables: - developers only have to set one Java home (JAVA_HOME) - developers can set an optional Java home (RUNTIME_JAVA_HOME) to test on the minimum supported runtime - we can test compiling with JDK 9 running on JDK 8 and compiling with JDK 9 running on JDK 9 in CI	2018-01-16 13:45:13 -05:00
Ryan Ernst	18463e7e9f	Painless: Add whitelist extensions (#28161 ) This commit adds a PainlessExtension which may be plugged in via SPI to add additional classes, methods and members to the painless whitelist on a per context basis. An example plugin adding and using a whitelist is also added.	2018-01-15 11:28:31 -08:00
Jim Ferenczi	b82017cbfe	Fix daitch_mokotoff phonetic filter to use the dedicated Lucene filter (#28225 ) This commit changes the phonetic filter factory to use a DaitchMokotoffSoundexFilter instead of a PhoneticFilter with a daitch_mokotoff encoder when daitch_mokotoff is selected. The latter does not hanlde branching when computing the soundex and fails to encode multiple variations when possible. Closes #28211	2018-01-15 19:35:54 +01:00
Tim Brooks	ee7eac8dc1	`MockTcpTransport` to connect asynchronously (#28203 ) The method `initiateChannel` on `TcpTransport` is explicit in that channels can be connect asynchronously. All production implementations do connect asynchronously. Only the blocking `MockTcpTransport` connects in a synchronous manner. This avoids testing some of the blocking code in `TcpTransport` that waits on connections to complete. Additionally, it requires a more extensive method signature than required for other transports. This commit modifies the `MockTcpTransport` to make these connections asynchronously on a different thread. Additionally, it simplifies that `initiateChannel` method signature.	2018-01-15 10:20:30 -07:00
Jim Ferenczi	be012b1326	upgrade to lucene 7.2.1 (#28218 )	2018-01-15 16:47:46 +01:00
Igor Motov	c75ac319a6	Add ability to associate an ID with tasks (#27764 ) Adds support for capturing the X-Opaque-Id header from a REST request and storing it's value in the tasks that this request started. It works for all user-initiated tasks (not only search). Closes #23250 Usage: ``` $ curl -H "X-Opaque-Id: imotov" -H "foo:bar" "localhost:9200/_tasks?pretty&group_by=parents" { "tasks" : { "7qrTVbiDQKiZfubUP7DPkg:6998" : { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6998, "type" : "transport", "action" : "cluster:monitor/tasks/lists", "start_time_in_millis" : 1513029940042, "running_time_in_nanos" : 266794, "cancellable" : false, "headers" : { "X-Opaque-Id" : "imotov" }, "children" : [ { "node" : "V-PuCjPhRp2ryuEsNw6V1g", "id" : 6088, "type" : "netty", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 67785, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } }, { "node" : "7qrTVbiDQKiZfubUP7DPkg", "id" : 6999, "type" : "direct", "action" : "cluster:monitor/tasks/lists[n]", "start_time_in_millis" : 1513029940043, "running_time_in_nanos" : 98754, "cancellable" : false, "parent_task_id" : "7qrTVbiDQKiZfubUP7DPkg:6998", "headers" : { "X-Opaque-Id" : "imotov" } } ] } } } ```	2018-01-12 15:34:17 -05:00
Jim Ferenczi	fcf4114adc	Make sure that we don't detect files as maven coordinate when installing a plugin (#28163 ) * This change makes sure that we don't detect a file path containing a ':' as a maven coordinate (e.g.: `file:C:\path\to\zip`) * restore test muted on master	2018-01-10 14:59:37 +01:00
Jim Ferenczi	ca6b15bf7c	[Tests] temporary disable meta plugin rest tests #28163	2018-01-10 14:31:07 +01:00
Jim Ferenczi	36729d1c46	Add the ability to bundle multiple plugins into a meta plugin (#28022 ) This commit adds the ability to package multiple plugins in a single zip. The zip file for a meta plugin must contains the following structure: \|____elasticsearch/ \| \|____ <plugin1> <-- The plugin files for plugin1 (the content of the elastisearch directory) \| \|____ <plugin2> <-- The plugin files for plugin2 \| \|____ meta-plugin-descriptor.properties <-- example contents below The meta plugin properties descriptor is mandatory and must contain the following properties: description: simple summary of the meta plugin. name: the meta plugin name The installation process installs each plugin in a sub-folder inside the meta plugin directory. The example above would create the following structure in the plugins directory: \|_____ plugins \| \|____ <name_of_the_meta_plugin> \| \| \|____ meta-plugin-descriptor.properties \| \| \|____ <plugin1> \| \| \|____ <plugin2> If the sub plugins contain a config or a bin directory, they are copied in a sub folder inside the meta plugin config/bin directory. \|_____ config \| \|____ <name_of_the_meta_plugin> \| \| \|____ <plugin1> \| \| \|____ <plugin2> \|_____ bin \| \|____ <name_of_the_meta_plugin> \| \| \|____ <plugin1> \| \| \|____ <plugin2> The sub-plugins are loaded at startup like normal plugins with the same restrictions; they have a separate class loader and a sub-plugin cannot have the same name than another plugin (or a sub-plugin inside another meta plugin). It is also not possible to remove a sub-plugin inside a meta plugin, only full removal of the meta plugin is allowed. Closes #27316	2018-01-09 18:28:43 +01:00
Tim Brooks	ff3db0b50e	Cleanup TcpChannelFactory and remove classes (#28102 ) This commit is related to #27260. It moves the TcpChannelFactory into NioTransport so that consumers do not have to be passed around. Additionally it deletes an unused read handler.	2018-01-08 10:18:19 -07:00
Martijn van Groningen	b46bb2efae	test: do not use asn fields Closes #28124	2018-01-07 23:21:01 +01:00
Tim Brooks	38701fb6ee	Create nio-transport plugin for NioTransport (#27949 ) This is related to #27260. This commit moves the NioTransport from :test:framework to a new nio-transport plugin. Additionally, supporting tcp decoding classes are moved to this plugin. Generic byte reading and writing contexts are moved to the nio library. Additionally, this commit adds a basic MockNioTransport to :test:framework that is a TcpTransport implementation for testing that is driven by nio.	2018-01-05 09:41:29 -07:00
Martijn van Groningen	fdb9b50747	test: replaced try-catch statements with expectThrows(...)	2018-01-05 14:29:53 +01:00
Sian Lerk Lau	a4a7150b56	Added ASN support for Ingest GeoIP plugin. Closes #27849	2018-01-05 14:07:04 +01:00
Ryan Ernst	d36ec18029	Plugins: Add plugin extension capabilities (#27881 ) This commit adds the infrastructure to plugin building and loading to allow one plugin to extend another. That is, one plugin may extend another by the "parent" plugin allowing itself to be extended through java SPI. When all plugins extending a plugin are finished loading, the "parent" plugin has a callback (through the ExtensiblePlugin interface) allowing it to reload SPI. This commit also adds an example plugin which uses as-yet implemented extensibility (adding to the painless whitelist).	2018-01-03 11:12:43 -08:00
Tanguy Leroux	098f82f086	[Test] Do not rely on MockZenPing for Azure tests (#27945 ) This commit changes some Azure tests so that they do not rely on MockZenPing and TestZenDiscovery anymore, but instead use a mocked AzureComputeService that exposes internal test cluster nodes as if they were real Azure nodes. Related to #27859 Closes #27917, #11533	2017-12-22 09:58:02 +01:00
Martijn van Groningen	90ee35930a	ingest: upgraded ingest geoip's geoip2's dependencies.	2017-12-21 08:43:02 +01:00
Nhat Nguyen	3c865d6d04	TEST: reduce blob size #testExecuteMultipartUpload If a large blob size and small buffer size are picked, this test causes out of memory. https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+intake/1061/	2017-12-20 12:43:05 -05:00
Adrien Grand	77711508b0	Upgrade to Lucene 7.2.0. (#27910 )	2017-12-20 14:17:40 +01:00
Boaz Leskes	0ca141880f	Disable TestZenDiscovery in cloud providers integrations test TestZenDiscovery is used to allow discovery based on in memory structures. This isn't a relevant for the cloud providers tests (but isn't a problem at the moment either)	2017-12-20 14:02:55 +01:00
Martijn van Groningen	4585cc8312	ingest: Upgraded the geolite2 databases.	2017-12-20 10:42:46 +01:00
Tal Levy	43ff38c5da	update ingest-attachment to use Tika 1.17 and newer deps (#27824 ) - this pr updates tika and its dependencies - updates the SHAs - updates the class excludes	2017-12-15 13:47:26 -08:00
Colin Goodheart-Smithe	579d1fea57	Fixes ByteSizeValue to serialise correctly (#27702 ) * Fixes ByteSizeValue to serialise correctly This fix makes a few fixes to ByteSizeValue to make it possible to perform round-trip serialisation: * Changes wire serialisation to use Zlong methods instead of VLong methods. This is needed because the value `-1` is accepted but previously if `-1` is supplied it cannot be serialised using the wire protocol. * Limits the supplied size to be no more than Long.MAX_VALUE when converted to bytes. Previously values greater than Long.MAX_VALUE bytes were accepted but would be silently interpreted as Long.MAX_VALUE bytes rather than erroring so the user had no idea the value was not being used the way they had intended. I consider this a bug and so fine to include this bug fix in a minor version but I am open to other points of view. * Adds a `getStringRep()` method that can be used when serialising the value to JSON. This will print the bytes value if the size is positive, `”0”` if the size is `0` and `”-1”` if the size is `-1`. * Adds logic to detect fractional values when parsing from a String and emits a deprecation warning in this case. * Modifies hashCode and equals methods to work with long values rather than doubles so they don’t run into precision problems when dealing with large values. Previous to this change the equals method would not detect small differences in the values (e.g. 1-1000 bytes ranges) if the actual values where very large (e.g. PBs). This was due to the values being in the order of 10^18 but doubles only maintaining a precision of ~10^15. Closes #27568 * Fix bytes settings default value to not use fractional values * Fixes test * Addresses review comments * Modifies parsing to preserve unit This should be bwc since in the case that the input is fractional it reverts back to the old method of parsing it to the bytes value. * Addresses more review comments * Fixes tests * Temporarily changes version check to 7.0.0 This will be changed to 6.2 when the fix has been backported	2017-12-14 12:17:17 +00:00
Tanguy Leroux	b69923f112	Remove some unused code (#27792 ) This commit removes some unused code.	2017-12-13 16:45:55 +01:00
Tanguy Leroux	f27cb96a64	Use AmazonS3.doesObjectExist() method in S3BlobContainer (#27723 ) This pull request changes the S3BlobContainer.blobExists() method implementation to make it use the AmazonS3.doesObjectExist() method instead of AmazonS3.getObjectMetadata(). The AmazonS3 implementation takes care of catching any thrown AmazonS3Exception and compares its response code with 404, returning false (object does not exist) or lets the exception be propagated.	2017-12-12 09:30:36 +01:00
Luca Cavanna	f4fb4d3bf5	Add support for filtering mappings fields (#27603 ) Add support for filtering fields returned as part of mappings in get index, get mappings, get field mappings and field capabilities API. Plugins can plug in their own function, which receives the index as argument, and return a predicate which controls whether each field is included or not in the returned output.	2017-12-05 20:31:29 +01:00
Jason Tedor	42a4ad35da	Add node name to thread pool executor name This commit adds the node name to the names of thread pool executors so that the node name is visible in rejected execution exception messages. Relates #27663	2017-12-05 07:45:40 -05:00
Henrik Lindström	7a44596446	Catch InvalidPathException in IcuCollationTokenFilterFactory (#27202 ) Using custom rules in the icu_collation filter can fail on Windows. If the rules are interpreted as a file location, this leads to an InvalidPathException when trying to read the rules from a file.	2017-12-04 10:29:08 +01:00
Adrien Grand	6323bb0d97	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27619 ) This new snapshot mostly brings a change to TopFieldCollector which can now early terminate collection when trackTotalHits is `false`. As a follow-up, we should replace our usage of `EarlyTerminatingSortingCollector` with this new option.	2017-12-04 09:40:08 +01:00
James Baiera	e16f1271b6	Fix SecurityException when HDFS Repository used against HA Namenodes (#27196 ) * Sense HA HDFS settings and remove permission restrictions during regular execution. This PR adds integration tests for HA-Enabled HDFS deployments, both regular and secured. The Mini HDFS fixture has been updated to optionally run in HA-Mode. A new test suite has been added for reproducing the effects of a Namenode failing over during regular repository usage. Going forward, the HDFS Repository will still be subject to its self imposed permission restrictions during normal use, but will no longer restrict them when running against an HA enabled HDFS cluster. Instead, the plugin will rely on the provided security policy and not further restrict the permissions so that the transparent operation to failover to a different Namenode in the client does not raise security exceptions. Additionally, we are now testing the secure mode with SASL based wire encryption of data between Elasticsearch and HDFS. This includes a missing library (commons codec) in order to support this change.	2017-12-01 14:26:05 -05:00
Jason Tedor	d0cd18169e	Remove stale awaits fix on azure master nodes test This awaits fix has been there forever and no one seems to know what to do with this test. I say let CI churn on it because it passed for me three out of three times. If there is something wrong with it, we will know quickly and can then address with the new information that we have.	2017-11-28 22:43:34 -05:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
David Turner	7ac361d86e	Update @AwaitsFix URL to point at an issue in the current repo	2017-11-28 09:35:46 +00:00
Tanguy Leroux	50a2459adf	Update Google SDK to version 1.23 (#27381 ) This commit updates the google-api-client library to version 1.23.0. Related to #26636	2017-11-15 15:30:27 +01:00
Tanguy Leroux	dd51c231ac	Create new handlers for every new request in GoogleCloudStorageService (#27339 ) This commit changes the DefaultHttpRequestInitializer in order to make it create new HttpIOExceptionHandler and HttpUnsuccessfulResponseHandler for every new HTTP request instead of reusing the same two handlers for all requests. Closes #27092	2017-11-14 11:43:32 +01:00
Jason Tedor	d375cef73c	Upgrade AWS SDK Jackson Databind to 2.6.7.1 The AWS SDK has a transitive dependency on Jackson Databind. While the AWS SDK was recently upgraded, the Jackson Databind dependency was not pulled along with it to the version that the AWS SDK depends on. This commit upgrades the dependencies for discovery-ec2 and repository-s3 plugins to match versions on the AWS SDK transitive dependencies. Relates #27361	2017-11-13 12:05:14 -05:00
Simon Willnauer	2299c70371	Allow affix settings to specify dependencies (#27161 ) We use affix settings to group settings / values under a certain namespace. In some cases like login information for instance a setting is only valid if one or more other settings are present. For instance `x.test.user` is only valid if there is an `x.test.passwd` present and vice versa. This change allows to specify such a dependency to prevent settings updates that leave settings in an inconsistent state.	2017-11-13 12:06:36 +01:00
Tanguy Leroux	f6c2ea0f7d	[Test] Fix S3BlobStoreContainerTests.testNumberOfMultiparts()	2017-11-10 15:45:20 +01:00
Tanguy Leroux	9c4d6c629a	Remove S3 output stream (#27280 ) Now the blob size information is available before writing anything, the repository implementation can know upfront what will be the more suitable API to upload the blob to S3. This commit removes the DefaultS3OutputStream and S3OutputStream classes and moves the implementation of the upload logic directly in the S3BlobContainer. related #26993 closes #26969	2017-11-10 12:22:33 +01:00
Guillaume Le Floch	ac5fd6a7d9	Update Tika version to 1.15 This commit upgrades the Tika dependency to version 1.15. Relates #25003	2017-11-09 13:16:44 -05:00
Tanguy Leroux	184dda9eb0	Update to AWS SDK 1.11.223 (#27278 )	2017-11-09 13:25:51 +01:00
Jason Tedor	58a28dacbd	Remove colons from task and configuration names Gradle 5.0 will remove support for colons in configuration and task names. This commit fixes this for our build by removing all current uses of colons in configuration and task names. Relates #27305	2017-11-08 15:22:31 -05:00
David Roberts	749c3ec716	Remove the single argument Environment constructor (#27235 ) Only tests should use the single argument Environment constructor. To enforce this the single arg Environment constructor has been replaced with a test framework factory method. Production code (beyond initial Bootstrap) should always use the same Environment object that Node.getEnvironment() returns. This Environment is also available via dependency injection.	2017-11-04 13:25:09 +00:00
kel	0f21262b36	Do not create directories if repository is readonly (#26909 ) For FsBlobStore and HdfsBlobStore, if the repository is read only, the blob store should be aware of the readonly setting and do not create directories if they don't exist. Closes #21495	2017-11-03 13:10:50 +01:00
Colin Goodheart-Smithe	c1b8140c83	Upgrade to Lucene 7.1 (#27225 )	2017-11-02 13:25:33 +00:00
Colin Goodheart-Smithe	99aca9cdfc	Enhances exists queries to reduce need for `_field_names` (#26930 ) * Enhances exists queries to reduce need for `_field_names` Before this change we wrote the name all the fields in a document to a `_field_names` field and then implemented exists queries as a term query on this field. The problem with this approach is that it bloats the index and also affects indexing performance. This change adds a new method `existsQuery()` to `MappedFieldType` which is implemented by each sub-class. For most field types if doc values are available a `DocValuesFieldExistsQuery` is used, falling back to using `_field_names` if doc values are disabled. Note that only fields where no doc values are available are written to `_field_names`. Closes #26770 * Addresses review comments * Addresses more review comments * implements existsQuery explicitly on every mapper * Reinstates ability to perform term query on `_field_names` * Added bwc depending on index created version * Review Comments * Skips tests that are not supported in 6.1.0 These values will need to be changed after backporting this PR to 6.x	2017-11-01 10:46:59 +00:00
Christoph Büscher	9253ea8aec	Fix beidermorse phonetic token filter for unspecified `languageset` (#27112 ) Currently, when we create a BeiderMorseFilter with an unspecified `languageset`, the filter will not guess the language, which should be the default behaviour. This change fixes this and adds a simple test for the cases with and without provided `languageset` settings. Closes #26771	2017-10-27 10:07:36 +02:00
Tanguy Leroux	463e7e6fa3	Revert "Upgrade to Jackson 2.9.2 (#27032 )" This reverts commit `0b9acc5ace`.	2017-10-20 08:25:41 +02:00
Tanguy Leroux	f78b2e5bc9	Fix ingest-attachment yaml REST test	2017-10-19 22:20:50 +02:00
Simon Willnauer	cdd7c1e6c2	Return List instead of an array from settings (#26903 ) Today we return a `String[]` that requires copying values for every access. Yet, we already store the setting as a list so we can also directly return the unmodifiable list directly. This makes list / array access in settings a much cheaper operation especially if lists are large.	2017-10-09 09:52:08 +02:00
Md. Abdulla-Al-Sun	a40c474e10	Added Bengali Analyzer to Elasticsearch with respect to the lucene update(PR#238)	2017-10-05 13:25:05 +02:00
Simon Willnauer	00dfdf50cf	Represent lists as actual lists inside Settings (#26878 ) Today we represent each value of a list setting with it's own dedicated key that ends with the index of the value in the list. Aside of the obvious weirdness this has several issues especially if lists are massive since it causes massive runtime penalties when validating settings. Like a list of 100k words will literally cause a create index call to timeout and in-turn massive slowdown on all subsequent validations runs. With this change we use a simple string list to represent the list. This change also forbids to add a settings that ends with a .0 which was internally used to detect a list setting. Once this has been rolled out for an entire major version all the internal .0 handling can be removed since all settings will be converted. Relates to #26723	2017-10-05 09:27:08 +02:00
Martijn van Groningen	dca787ed8a	upgrade to Lucene 7.1.0 snapshot version	2017-10-05 09:06:56 +02:00
David Pilato	84a3899550	Simplify Azure blobStore method signatures (#26752 ) While working on #26751, I found that we are passing the container name on every single method although we don't need it as it is stored within the blobstore object already. This commit simplifies a bit that part of the code. It also removes `repositoryName` from AzureBlobStore which was not used anymore. Also we move some properties in AzureBlobContainer to `private` members.	2017-10-04 20:17:50 +02:00
Simon Willnauer	d1533e2397	Remove Settings#getAsMap() (#26845 ) Since `#getAsMap` exposes internal representation we are trying to remove it step by step. This commit is cleaning up some xcontent writing as well as usage in tests	2017-10-04 01:21:38 -06:00
Simon Willnauer	7b8d036ab5	Replace group map settings with affix setting (#26819 ) We use group settings historically instead of using a prefix setting which is more restrictive and type safe. The majority of the usecases needs to access a key, value map based on the _leave node_ of the setting ie. the setting `index.tag.*` might be used to tag an index with `index.tag.test=42` and `index.tag.staging=12` which then would be turned into a `{"test": 42, "staging": 12}` map. The group settings would always use `Settings#getAsMap` which is loosing type information and uses internal representation of the settings. Using prefix settings allows now to access such a method type-safe and natively.	2017-09-30 14:27:21 +02:00
David Pilato	9ba5e168e4	Don't use create static storage service Even though you annotate the Test class with `@ThirdParty` the static code is initialized. In that case it fails with: ``` ==> Test Info: seed=529C3C6977F695FC; jvms=3; suites=6 Suite: org.elasticsearch.repositories.azure.AzureSnapshotRestoreTests ERROR 0.00s J2 \| AzureSnapshotRestoreTests (suite) <<< FAILURES! > Throwable #1: java.lang.IllegalStateException: to run integration tests, you need to set -Dtests.thirdparty=true and -Dtests.azure.account=azure-account -Dtests.azure.key=azure-key > at org.elasticsearch.cloud.azure.AzureTestUtils.generateMockSecureSettings(AzureTestUtils.java:37) > at org.elasticsearch.repositories.azure.AzureSnapshotRestoreTests.generateMockSettings(AzureSnapshotRestoreTests.java:81) > at org.elasticsearch.repositories.azure.AzureSnapshotRestoreTests.<clinit>(AzureSnapshotRestoreTests.java:84) > at java.lang.Class.forName0(Native Method) > at java.lang.Class.forName(Class.java:348) Completed [1/6] on J2 in 2.21s, 0 tests, 1 error <<< FAILURES! ``` Closes #26812. (cherry picked from commit eb6d714 for master branch)	2017-09-28 16:40:16 +01:00
Hendrik Muhs	0358bb5f34	fix of disabling #26812	2017-09-28 16:14:34 +02:00
Hendrik Muhs	bf4d3123bd	disable AzureSnapshotRestoreTests, see #26812	2017-09-28 15:42:53 +02:00
David Pilato	1ccb497c0d	Use Azure upload method instead of our own implementation (#26751 ) * Use Azure upload method instead of our own implementation We are not following the Azure documentation about uploading blobs to Azure storage. https://docs.microsoft.com/en-us/azure/storage/blobs/storage-java-how-to-use-blob-storage#upload-a-blob-into-a-container Instead we are using our own implementation which might cause some troubles and rarely some blobs can be not immediately commited just after we close the stream. Using the standard implementation provided by Azure team should allow us to benefit from all the magic Azure SDK team already wrote. And well... Let's just read the doc! * Adapt integration tests to secure settings That was a missing part in #23405. * Simplify all the integration tests and extends ESBlobStoreRepositoryIntegTestCase tests removes IT `testForbiddenContainerName()` as it is useless. The plugin does not create anymore the container but expects that the user has created it before registering the repository * merges 2 IT classes so all IT tests are ran from one single class * We don't remove/create anymore the container between each single test but only for the test suite	2017-09-28 13:15:37 +02:00
olcbean	6952f7b560	Validate top-level keys for create index request (#23755 ) (#23869 ) This commit ensures create index requests do not ignore unknown keys passed to the request. closes #23755	2017-09-26 09:49:20 -07:00
David Pilato	3f71772cd2	Azure snapshots can not be restored anymore (#26778 ) While working on #26751 and doing some manual integration testing I found that this #22858 removed an important line of our code: `AzureRepository` overrides default `initializeSnapshot` method which creates metadata files and do other stuff. But with PR #22858, I wrote: ```java @Override public void initializeSnapshot(SnapshotId snapshotId, List<IndexId> indices, MetaData clusterMetadata) { if (blobStore.doesContainerExist(blobStore.container()) == false) { throw new IllegalArgumentException("The bucket [" + blobStore.container() + "] does not exist. Please create it before " + " creating an azure snapshot repository backed by it."); } } ``` instead of ```java @Override public void initializeSnapshot(SnapshotId snapshotId, List<IndexId> indices, MetaData clusterMetadata) { if (blobStore.doesContainerExist(blobStore.container()) == false) { throw new IllegalArgumentException("The bucket [" + blobStore.container() + "] does not exist. Please create it before " + " creating an azure snapshot repository backed by it."); } super.initializeSnapshot(snapshotId, indices, clusterMetadata); } ``` As we never call `super.initializeSnapshot(...)` files are not created and we can't restore what we saved. Closes #26777.	2017-09-25 14:35:27 -04:00
Simon Willnauer	aab4655e63	Unify Settings xcontent reading and writing (#26739 ) This change adds a fromXContent method to Settings that allows to read the xcontent that is produced by toXContent. It also replaces the entire settings loader infrastructure and removes the structured map representation. Future PRs will also tackle the `getAsMap` that exposes the internal represenation of settings for better encapsulation.	2017-09-25 13:23:01 +02:00
Jason Tedor	2e63a13c0a	Upgrade to Log4j 2.9.1 This commit upgrades the Log4j dependency, picking up a fix for an issue with handling stack traces on JDK 9. Relates #26750	2017-09-22 11:57:06 -04:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Luence 7! Relates #26744	2017-09-21 19:19:33 -04:00
James Baiera	c760eec054	Add permission checks before reading from HDFS stream (#26716 ) Add checks for special permissions before reading hdfs stream data. Also adds test from readonly repository fix. MiniHDFS will now start with an existing repository with a single snapshot contained within. Readonly Repository is created in tests and attempts to list the snapshots within this repo.	2017-09-21 11:55:07 -04:00
kel	601be4f83e	Add azure storage endpoint suffix #26432 (#26568 ) Allow specifying azure storage endpoint suffix for an azure client.	2017-09-20 22:26:19 -07:00
Ryan Ernst	bebff47b5b	File Discovery: Remove fallback with zen discovery (#26667 ) When adding file based discovery, we added a fallback when the discovery type was set to zen (the default, so everyone got this warning). This commit removes the fallback for 6.0. Setting file discovery should now happen explicitly through the hosts_provider setting. closes #26661	2017-09-19 16:32:34 -07:00
Jason Tedor	bdd9953aa4	Fix discovery-file plugin to use custom config path The discovery-file plugin was not config path aware, so it always picked up the default config path (from Elasticsearch home) rather than a custom config path. This commit fixes the discovery-file plugin to respect a custom config path. Relates #26662	2017-09-16 11:00:33 -04:00
Claudio Bley	7184cf8b5b	Fix kuromoji default stoptags (#26600 ) Initialize the default stop-tags in `KuromojiPartOfSpeechFilterFactory` if the `stoptags` are not given in the config. Also adding a test which checks that part-of-speech tokens are removed when using the kuromoji_part_of_speech filter.	2017-09-15 12:25:09 +02:00

1 2 3 4 5 ...

1963 Commits