OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Pilato	f97ce3c728	Deprecate mapper-attachments plugin See #16910	2016-03-04 11:49:12 +01:00
Simon Willnauer	5008694ba1	Remove support for legacy checksums Elasticsearch 5.0 doesn't support indices wiht legacy checksums anymore. The last time we write legacy checksums was in 1.3.0 which was based on lucene 4.9 already which means that all files have CRC32 checksums. All indices that Elasticsearch can read today must be written with lucene version >= 4.8 anyway so we can drop this layer of backwards compatibility entirely. Since we are close to upgrading to Lucene 6.0 we should get rid of this in a more contiained change than the lucene upgrade.	2016-03-03 22:58:18 +01:00
Lee Hinman	6adbbff97c	Fix organization rename in all files in project Basically a query-replace of "https://github.com/elasticsearch/" with "https://github.com/elastic/"	2016-03-03 12:04:13 -07:00
Adrien Grand	eef19be072	Deprecate string in favor of text/keyword. #16877 This commit removes the ability to use string fields on indices created on or after 5.0. Dynamic mappings now generate text fields by default for strings but there are plans to also add a sub keyword field (in a future PR). Most of the changes in this commit are just about replacing string with keyword or text. Some tests have been removed because they existed because of corner cases of string mappings like setting ignore-above on a text field or enabling term vectors on a keyword field which are now impossible. The plan is to remove strings entirely in 6.0.	2016-03-03 10:20:56 +01:00
Daniel Mitterdorfer	f70e5aca50	Merge remote-tracking branch 'danielmitterdorfer/simplify-azure-settings'	2016-03-03 10:02:35 +01:00
Daniel Mitterdorfer	52acf0e6e1	Use new settings infra to parse AzureStorageSettings With this commit we simplify the parsing logic in AzureStorageSettings by leveraging the new settings infrastructure. Closes #16363	2016-03-03 10:01:14 +01:00
David Pilato	3f71c1d6a5	Replace `s -> s` by `Function.identity()`	2016-03-02 10:12:40 +01:00
David Pilato	e4d9e46508	Fix merge with master	2016-03-02 09:55:09 +01:00
David Pilato	5fbf1b95dc	Merge branch 'master' into pr/16598-register-filter-settings # Conflicts: # core/src/main/java/org/elasticsearch/common/logging/ESLoggerFactory.java # core/src/main/java/org/elasticsearch/discovery/DiscoveryService.java # core/src/main/java/org/elasticsearch/discovery/DiscoverySettings.java # core/src/main/java/org/elasticsearch/http/HttpTransportSettings.java # plugins/repository-azure/src/main/java/org/elasticsearch/cloud/azure/storage/AzureStorageService.java	2016-03-02 09:43:53 +01:00
Jason Tedor	aa8ee74c6c	Bump Elasticsearch version to 5.0.0-SNAPSHOT This commit bumps the Elasticsearch version to 5.0.0-SNAPSHOT in line with the alignment of versions across the stack. Closes #16862	2016-03-01 17:03:47 -05:00
Simon Willnauer	80e5c0acf8	Merge pull request #16860 from s1monw/issues/16485 Add setFactory permission to GceDiscoveryPlugin This commit adds a missing permission and a simple test that ensures we discover other nodes via a mock http endpoint. Closes #16485	2016-03-01 08:55:43 +01:00
Nik Everett	95cc3e38fc	Check test naming conventions on all modules The big win here is catching tests that are incorrectly named and will be skipped by gradle, providing a false sense of security. The whole thing takes about 10 seconds on my Macbook Air, not counting compiling the test classes, which seems worth it. Because this runs as a gradle task with propery UP-TO-DATE handling it can be skipped if the tests haven't been changed which should save some time. I chose to keep this in test:framework rather than a new subproject of buildSrc because ESIntegTestCase and doesn't inroduce any additional dependencies.	2016-02-29 16:31:49 -05:00
Simon Willnauer	948ee3ee3f	Move keystore creation to gradle - this prevents committing a keystore to the source repo	2016-02-29 22:01:39 +01:00
Boaz Leskes	195b43d66e	Remove DiscoveryService and reduce guice to just Discovery #16821 DiscoveryService was a bridge into the discovery universe. This is unneeded and we can just access discovery directly or do things in a different way. One of those different ways, is not having a dedicated discovery implementation for each our dicovery plugins but rather reuse ZenDiscovery. UnicastHostProviders are now classified by discovery type, removing unneeded checks on plugins. Closes #16821	2016-02-29 20:23:38 +01:00
Simon Willnauer	ecca717339	Add setFactory permission to GceDiscoveryPlugin This commit adds a missing permission and a simple test that ensures we discover other nodes via a mock http endpoint. Closes #16485	2016-02-29 15:48:16 +01:00
David Pilato	7a42014909	Upgrade Azure Storage client to 4.0.0 We are using `2.0.0` today but Azure team now recommends: ```xml <dependency> <groupId>com.microsoft.azure</groupId> <artifactId>azure-storage</artifactId> <version>4.0.0</version> </dependency> ``` This new version fix the timeout issues we have seen with azure storage although #15080 adds a timeout support. Azure storage client 2.0.0 was not passing correctly this value when it was calling Azure services. Note that the timeout is a server side timeout and not client side timeout. It means that it will raise only a timeout when: * upload of blob is complete * if azure service is not able to process the blob (and store it) within a given time range. In which case it will raise an exception which elasticsearch can deal with: ``` java.io.IOException at __randomizedtesting.SeedInfo.seed([91BC11AEF16E073F:6886FA5308FCE4D8]:0) at com.microsoft.azure.storage.core.Utility.initIOException(Utility.java:643) at com.microsoft.azure.storage.blob.BlobOutputStream.writeBlock(BlobOutputStream.java:444) at com.microsoft.azure.storage.blob.BlobOutputStream.access$000(BlobOutputStream.java:53) at com.microsoft.azure.storage.blob.BlobOutputStream$1.call(BlobOutputStream.java:388) at com.microsoft.azure.storage.blob.BlobOutputStream$1.call(BlobOutputStream.java:385) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) Caused by: com.microsoft.azure.storage.StorageException: Operation could not be completed within the specified time. at com.microsoft.azure.storage.StorageException.translateException(StorageException.java:89) at com.microsoft.azure.storage.core.StorageRequest.materializeException(StorageRequest.java:305) at com.microsoft.azure.storage.core.ExecutionEngine.executeWithRetry(ExecutionEngine.java:175) at com.microsoft.azure.storage.blob.CloudBlockBlob.uploadBlockInternal(CloudBlockBlob.java:1006) at com.microsoft.azure.storage.blob.CloudBlockBlob.uploadBlock(CloudBlockBlob.java:978) at com.microsoft.azure.storage.blob.BlobOutputStream.writeBlock(BlobOutputStream.java:438) ... 9 more ``` The following code was used to test this against Azure platform: ```java public void testDumb() throws URISyntaxException, StorageException, IOException, InvalidKeyException { String connectionString = "MY-AZURE-STRING"; CloudStorageAccount storageAccount = CloudStorageAccount.parse(connectionString); CloudBlobClient client = storageAccount.createCloudBlobClient(); client.getDefaultRequestOptions().setTimeoutIntervalInMs(1000); CloudBlobContainer container = client.getContainerReference("dumb"); container.createIfNotExists(); CloudBlockBlob blob = container.getBlockBlobReference("blob"); File sourceFile = File.createTempFile("sourceFile", ".tmp"); try { int fileSize = 10000000; byte[] buffer = new byte[fileSize]; Random random = new Random(); random.nextBytes(buffer); logger.info("Generate local file"); FileOutputStream fos = new FileOutputStream(sourceFile); fos.write(buffer); fos.close(); logger.info("End generate local file"); FileInputStream fis = new FileInputStream(sourceFile); logger.info("Start uploading"); blob.upload(fis, fileSize); logger.info("End uploading"); } finally { if (sourceFile.exists()) { sourceFile.delete(); } } } ``` With 2.0.0, the above code was not raising any exception. With 4.0.0, the exception is now thrown correctly. The default timeout is 5 minutes. See https://github.com/Azure/azure-storage-java/blob/master/microsoft-azure-storage/src/com/microsoft/azure/storage/core/Utility.java#L352-L375 Closes #12567. Release notes from 2.0.0: * Removed deprecated table AtomPub support. * Removed deprecated constructors which take service clients in favor of constructors which take credentials. * Added support for "Add" permissions on Blob SAS. * Added support for "Create" permissions on Blob and File SAS. * Added support for IP Restricted SAS and Protocol SAS. * Added support for Account SAS to all services. * Added support for Minute and Hour Metrics to FileServiceProperties and added support for File Metrics to CloudAnalyticsClient. * Removed deprecated startCopyFromBlob() on CloudBlob. Use startCopy() instead. * Removed deprecated Credentials and StorageKey classes. Please use the appropriate methods on StorageCredentialsAccountAndKey instead. * Fixed a bug in table where a select on a non-existent field resulted in a null reference exception if the corresponding field in the TableEntity was not nullable. * Fixed a bug in table where JsonParser was automatically closing the response stream before it was completely drained causing socket exhaustion. * Fixed a bug in StorageCredentialsAccountAndKey.updateKey(String) which prevented valid keys from being set. * Added CloudBlobContainer.listBlobs(final String, final boolean) method. * Fixed a bug in blob where using AccessConditions on block blob uploads larger than 64MB done with the upload* methods or block blob uploads done openOutputStream with would fail if the blob did not already exist. * Added support for setting a proxy per request. Proxy can be set on an OperationContext instance and will be used when that instance is passed to the request method. * Added support for SAS to the Azure File service. * Added support for Append Blob. * Added support for Access Control Lists (ACL) to File Shares. * Added support for getting and setting of CORS rules to File service. * Added support for ShareStats to File Shares. * Added support for copying an Azure File to another Azure File or a Block Blob asynchronously, and aborting Azure File copy operations asynchronously. * Added support for copying a Blob to an Azure File asynchronously. * Added support for setting a maximum quota property on a File Share. * Removed deprecated AuthenticationScheme and its getter and setter. In the future only SharedKey will be used. * Removed deprecated getter/setters for all request option properties on the service clients. Please use the default request options getter/setters instead. * Removed getSubDirectoryReference() for blob directories and file directories. Use getDirectoryReference() instead. * Removed getEntityClass() in TableQuery. Please use getClazzType() instead. * Added client-side verification for lease duration and break periods. * Deprecated the setters in table for timestamp as this property is only modifiable by the service. * Deprecated startCopyFromBlob() on CloudBlob. Use startCopy() instead. * Deprecated the Credentials and StorageKey classes. Please use the appropriate methods on StorageCredentialsAccountAndKey instead. * Deprecated constructors which take service clients in favor of constructors which take credentials. * Fixed a bug where the DateBackwardCompatibility flag was not applied if set on the CloudTableClient default request options. * Changed library behavior to retry all exceptions thrown when parsing a response object. * Changed behavior to stop removing query parameters passed in with the resource URI if that URI contains a SAS token. Some query parameters such as comp, restype, snapshot and api-version will still be removed. * Added support for logging StringToSign to SharedKey and SAS. * Added a connect timeout to prevent hangs when establishing the network connection. * Made performance enhancements to the BlobOutputStream class. * Fixed a bug where maximum execution time was ignored for file, queue, and table services. * Changed the socket timeout to be set to the service side timeout plus 5 minutes when maximum execution time is not set. * Changed the socket timeout to default to 5 minutes rather than infinite when neither service side timeout or maximum execution time are set. * Fixed a bug where MD5 was calculated for commitBlockList even though UseTransactionalMD5 was set to false. * Fixed a bug where selecting fields that did not exist returned an error rather than an EntityProperty with a null value. * Fixed a bug where table entities with a single quote in their partition or row key could be inserted but not operated on in any other way. * Fixed a bug for all listing API's where next() would sometimes throw an exception if hasNext() had not been called even if there were more elements to iterate on. * Added sequence number to the blob properties. This is populated for page blobs. * Creating a page blob sets its length property. * Added support for page blob sequence numbers and sequence number access conditions. * Fixed a bug in abort copy where the lease access condition was not sent to the service. * Fixed an issue in startCopyFromBlob where if the URI of the source blob contained certain non-ASCII characters they would not be encoded appropriately. This would result in Authorization failures. * Fixed a small performance issue in XML serialization. * Fixed a bug in BlobOutputStream and FileOutputStream where flush added data to a request pool rather than immediately committing it to the Azure service. * Refactored to remove the blob, queue, and file package dependency on table in the error handling code. * Added additional client-side logging for REST requests, responses, and errors. Closes #15976.	2016-02-29 15:00:34 +01:00
Martijn van Groningen	c7b626c615	updated SHAs	2016-02-28 13:20:39 +01:00
David Pilato	d77daf3861	Use an SettingsProperty.Dynamic for dynamic properties	2016-02-28 11:06:45 +01:00
David Pilato	31b5e0888f	Use an SettingsProperty enumSet Instead of modifying methods each time we need to add a new behavior for settings, we can simply pass `SettingsProperty... properties` instead. `SettingsProperty` could be defined then: ``` public enum SettingsProperty { Filtered, Dynamic, ClusterScope, NodeScope, IndexScope // HereGoesYours; } ``` Then in setting code, it become much more flexible. TODO: Note that we need to validate SettingsProperty which are added to a Setting as some of them might be mutually exclusive.	2016-02-28 00:48:04 +01:00
Sylwester Lachiewicz	3af735c69e	Update MaxMind geoip2 version to 2.6 Update to align with #16801 jackson 2.7.1	2016-02-27 12:57:24 +01:00
Nik Everett	ba5be0332d	Remove optional logger wrappers Removes all our logger wrappers except the wrapper for log4j1.2. If you depend on Elasticsearch's jar in your application you'll need to declare log4j 1.2 and/or some bridge to your favorite logger. We did this to simplify our builds and code. No more commons-logging like log implementation sniffing. No more optional dependency hacks in gradle. We might one day want to use j.u.l instead of log4j. If we do want that we can recover its wrapper by studying this commit. We didn't go directly to j.u.l in this commit because that is a bigger change. Our logging configuration is based on log4j1.2 and people are used to it. So it'd be a much more fraught breaking change to do that conversion.	2016-02-26 16:41:07 -05:00
Jason Tedor	d94e391e71	Use System#lineSeparator and not system property This commit replaces a use of the system property "line.separator" and replaces it with a dedicated method that provides the same value. Closes #16776	2016-02-25 12:22:57 -05:00
Jack Conradson	7986770e5f	Moved Painless from a plugin to a module. Closes #16755	2016-02-21 16:50:54 -08:00
Mike McCandless	5fffede2b0	Upgrade to Lucene 5.5.0 official release	2016-02-20 17:34:16 -05:00
David Pilato	90fba97a30	Moves GCE settings to the new infra Closes #16720.	2016-02-19 17:00:39 -08:00
Adrien Grand	4f8895eae3	Add a text field. This new field is intended to replace analyzed string fields.	2016-02-15 10:43:44 +01:00
David Pilato	aabb124209	Add filtering support within Setting class Now we have a nice Setting infra, we can define in Setting class if a setting should be filtered or not. So when we register a setting, setting filtering would be automatically done. Instead of writing: ```java Setting<String> KEY_SETTING = Setting.simpleString("cloud.aws.access_key", false, Setting.Scope.CLUSTER); settingsModule.registerSetting(AwsEc2Service.KEY_SETTING, false); settingsModule.registerSettingsFilterIfMissing(AwsEc2Service.KEY_SETTING.getKey()); ``` We could simply write: ```java Setting<String> KEY_SETTING = Setting.simpleString("cloud.aws.access_key", false, Setting.Scope.CLUSTER, true); settingsModule.registerSettingsFilterIfMissing(AwsEc2Service.KEY_SETTING.getKey()); ``` It also removes `settingsModule.registerSettingsFilterIfMissing` method. The plan would be to remove as well `settingsModule.registerSettingsFilter` method but it still used with wildcards. For example in Azure Repository plugin: ```java module.registerSettingsFilter(AzureStorageService.Storage.PREFIX + ".account"); module.registerSettingsFilter(AzureStorageService.Storage.PREFIX + ".key"); ``` Closes #16598.	2016-02-12 10:35:54 +01:00
Nicholas Knize	52ee4c7027	upgrade to lucene 5.5.0-snapshot-850c6c2	2016-02-11 14:28:50 -06:00
David Pilato	df50371c34	Merge branch 'pr/16477-aws-settings'	2016-02-11 19:47:43 +01:00
Adrien Grand	a1e251af20	Remove the MapperBuilders utility class. We can just call constructors directly.	2016-02-11 17:32:58 +01:00
David Pilato	37b0fc4f10	Migrate AWS settings to new settings infrastructure Reintroducing commit `fb7723c` but now deals with setting names conflicts Also adds java documentation for each setting Closes #16293. Related to https://github.com/elastic/elasticsearch/pull/16477#discussion_r52469084	2016-02-11 12:03:09 +01:00
David Pilato	7625595364	Revert Migrate AWS settings to new settings infrastructure It breaks when you load at the same time `repository-s3` and `discovery-ec2`. See https://github.com/elastic/elasticsearch/pull/16477#discussion_r52469084 Reopen #16293.	2016-02-10 16:14:53 +01:00
Alexander Reelsen	e8d24d10dc	Tests: Fix AttachmentProcessorFactoryTests to only check for existing fields	2016-02-10 15:29:16 +01:00
David Pilato	fb7723c186	Migrate AWS settings to new settings infrastructure Closes #16293.	2016-02-10 14:44:51 +01:00
javanna	d5969bb33a	Attachment Processor: setFieldValue only once as a map	2016-02-10 12:38:39 +01:00
javanna	4e3fb69861	[TEST] rewrite testEnglishTextDocumentWithRandomFields	2016-02-10 12:34:51 +01:00
javanna	fe7469dffb	Attachment processor: remove unused NAME enum	2016-02-10 12:34:21 +01:00
Yannick Welsch	848316ad0d	Merge pull request #16540 from ywelsch/fix/groovy-inflation Add permission to access sun.reflect.MethodAccessorImpl from Groovy scripts	2016-02-09 17:39:57 +01:00
Alexander Reelsen	0d4711c2fc	Ingest: Add attachment processor This is a simple port of the mapper attachment plugin to the ingest functionality, no new features. The only option is to limit the number of chars to prevent indexing of huge documents. Fields can be selected in the processor as well. Close #16303	2016-02-09 17:03:30 +01:00
Yannick Welsch	380393a5b7	Add permission to access sun.reflect.MethodAccessorImpl from Groovy scripts Groovy uses reflection to invoke closures. These reflective calls are optimized by the JVM after "sun.reflect.inflationThreshold" number of invocations. After inflation, access to sun.reflect.MethodAccessorImpl is required from the security manager. Closes #16536	2016-02-09 16:47:38 +01:00
Simon Willnauer	96bcb47fc9	Detach QueryShardContext from IndexShard and remove obsolete threadlocals IndexShard currently holds an arbitraritly used `getQueryShardContext` that comes out of a ThreadLocal. It's usage is undefined and arbitraty since there is also such a method with different semantics on `IndexService` This commit removes the threadLocal on IndexShard as well as on the context itself. It's types are now a member and the QueryShardContext lifecycle is managed byt SearchContext which passes the types on from the SearchRequest.	2016-02-08 15:05:03 +01:00
Jack Conradson	f6f2d40fd5	Minor clean up. * Minor clean up of Writer constants. * Removed synthetic attribute from the generated constructor and method. * Added a safeguard for maximum script length. Closes #16457	2016-02-05 12:00:18 -08:00
Robert Muir	dcec4791c4	fix eclipse to compile at all. why does the build not fail?	2016-02-04 18:57:53 -05:00
Martijn van Groningen	7a6adfd93a	ingest: Added foreach processor. This processor is useful when all elements of a json array need to be processed in the same way. This avoids that a processor needs to be defined for each element in an array. Also it is very likely that it is unknown how many elements are inside an json array.	2016-02-04 23:44:01 +01:00
Simon Willnauer	cc520b20e1	Merge branch 'master' into fix_settings_filter	2016-02-04 09:28:41 +01:00
Nik Everett	a2f07679fd	Add task status Implements a simple task status for superclasses of ReplicationRequest to show how you can do use the status.	2016-02-03 18:21:42 -05:00
Simon Willnauer	e1cf5e745d	some plugins share settings - make it easy to filter them	2016-02-03 21:50:06 +01:00
Simon Willnauer	baacabf9fa	convert more filters	2016-02-03 20:31:16 +01:00
Simon Willnauer	f770164d9a	fix imports	2016-02-03 20:14:29 +01:00
Simon Willnauer	e02d2e004e	Rewrite SettingsFilter to be immutable This change rewrites the entire settings filtering mechanism to be immutable. All filters must be registered up-front in the SettingsModule. Filters that are comma-sparated are not allowed anymore and check on registration. This commit also adds settings filtering to the default settings recently added to ensure we don't render filtered settings.	2016-02-03 20:05:55 +01:00
Lee Hinman	3c7f578010	Merge remote-tracking branch 'dakrone/fix-smb-plugin-master'	2016-02-03 07:50:14 -07:00
Simon Willnauer	a77344d742	Merge branch 'master' into make_settings_strict	2016-02-03 13:21:49 +01:00
Simon Willnauer	4a4e523357	Merge branch 'master' into make_settings_strict	2016-02-03 11:34:12 +01:00
Robert Muir	d5dc05f69e	Upgrade to lucene 5.5.0-snapshot-1725675	2016-02-02 22:53:39 -05:00
Lee Hinman	3918072362	Fix calling ensureOpen() on the wrong directory Also removes ensureCanWrite since this already passes the TRUNCATE_EXISTING flag when opening. Adds a REST test that fails without this fix due to the classloader isolation. Additionally, move SmbDirectoryWrapper into an elasticsearch package instead of a lucene one, so this would be found at compile time instead of runtime. Forward-port of #16383	2016-02-02 16:31:16 -07:00
Tal Levy	3191fc7347	Merge pull request #16355 from talevy/fix_ingest_exception revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:11:24 -08:00
Tal Levy	0a1580eefa	revert PipelineFactoryError handling with throwing ElasticsearchParseException in ingest pipeline creation	2016-02-02 14:08:22 -08:00
Martijn van Groningen	64ac037a82	Added an ingest qa that tests processor real world like configurations. Renamed `ingest-with-mustache` to `smoke-test-ingest-with-all-dependencies` Also renamed `ingest-disabled` to `smoke-test-ingest-disabled` so that the name is more inline with other qa smoke test modules.	2016-02-02 22:37:24 +01:00
Jack Conradson	54011da950	Fix imports.	2016-02-02 12:26:21 -08:00
Jack Conradson	58ad172492	Merge branch 'master' into tests	2016-02-02 11:48:27 -08:00
Jack Conradson	791417ca89	Fix test file name.	2016-02-02 11:48:19 -08:00
Tanguy Leroux	865bbc2096	Remove string formatting from Terminal print methods This can be trappy and wrong formating strings can throws format exceptions and hide the real message.	2016-02-02 19:57:16 +01:00
Simon Willnauer	818a9eefb2	Make settings validation strict This commit enableds strict settings validation on node startup. All settings passed to elasticsearch either through system properties, yaml files or any other way to pass settings must be registered and valid. Settings that are unknown ie. due to typos or due to deprecation or removal will cause the node to NOT start up. Plugins have to declare all their settings on the `SettingsModule#registerSetting` and settings for plugins that are not installed must be removed. This commit also removes the ability to specify the nodes name via `-Des.name` or just `name` in the configuration files. The node name must be prefixed with the node prexif like `node.name: Boom`. Left over usage of `name` will also cause startup to fail.	2016-02-02 11:32:44 +01:00
Ryan Ernst	a2c37c0989	CliTool: Allow unexpected exceptions to propagate Cli tools currently catch all exceptions, and only print the exception message, except when a special system property is set. Even with this flag set, certain exceptions, like IOException, are captured and their stack trace is always lost. This change adds a UserError class, which can be used a cli tools to specify a message to the user, as well as an exit status. All other exceptions are propagated out of main, so java will exit with non-zero and print the stack trace.	2016-02-01 16:35:22 -08:00
Jack Conradson	e3fd6c2a00	Removed ..= token from the Lexer. Fixed related tests.	2016-02-01 16:16:04 -08:00
Daniel Mitterdorfer	3bee2d3195	Migrate Azure settings to new settings infrastructure With this commit we migrate all Azure related settings to the new settings infrastructure.	2016-02-01 16:34:28 +01:00
Ryan Ernst	3787f437ec	Merge branch 'master' into remove_multicast	2016-02-01 07:25:45 -08:00
Simon Willnauer	af0e40ec7d	Merge pull request #16316 from s1monw/isseus/13685_2nd_try Ensure all resoruces are closed on Node#close()	2016-02-01 10:17:57 +01:00
Jason Tedor	105411060c	Uppercase ells ('L') in long literals This commit removes and forbids the use of lowercase ells ('l') in long literals because they are often hard to distinguish from the digit representing one ('1'). Closes #16329	2016-01-30 22:16:02 -05:00
Ryan Ernst	b8f08c35ec	Plugin: Remove multicast plugin closes #16310.	2016-01-29 18:41:31 -08:00
Tal Levy	fca442f4d1	Introduce Pipeline Factory Error Responses in Node Ingest When there is an exception thrown during pipeline creation within Rest calls (in put pipeline, and simulate) We now return a structured error response to the user with details around which processor's configuration is the cause of the issue, or which configuration property is misconfigured, etc.	2016-01-29 13:37:27 -08:00
Simon Willnauer	51745d7272	Call latch in a finally block	2016-01-29 17:38:33 +01:00
Simon Willnauer	e24fac644a	Fix AzureRepositoryF to handle exceptions on close Fix TribeUnitTests to handle exceptions on close	2016-01-29 17:34:02 +01:00
Martijn van Groningen	f5e89f7242	mappings: remove fly weight	2016-01-29 10:12:39 +01:00
Simon Willnauer	fea8676a6c	remove dead code	2016-01-28 17:14:52 +01:00
Simon Willnauer	a149ebdb7b	remove blanks	2016-01-28 17:01:01 +01:00
Simon Willnauer	687d1d83fa	Convert multcast plugin settings to the new infra	2016-01-28 17:01:01 +01:00
Boaz Leskes	2a137b5548	Make index uuid available in Index, ShardRouting & ShardId In the early days Elasticsearch used to use the index name as the index identity. Around 1.0.0 we introduced a unique index uuid which is stored in the index setting. Since then we used that uuid in a few places but it is by far not the main identifier when working with indices, partially because it's not always readily available in all places. This PR start to make a move in the direction of using uuids instead of name by making sure that the uuid is available on the Index class (currently just a wrapper around the name) and as such also available via ShardRouting and ShardId. Note that this is by no means an attempt to do the right thing with the uuid in all places. In almost all places it falls back to the name based comparison that was done before. It is meant as a first step towards slowly improving the situation. Closes #16217	2016-01-28 08:40:10 +01:00
Jack Conradson	5b836dbb11	Renamed the scripting language Plan A to Painless. Closes #16245	2016-01-27 10:37:34 -08:00
Simon Willnauer	7ff99eb89d	Merge branch 'master' into trash_context_and_headers if (name == 'expamle-fixtures') return	2016-01-27 16:33:50 +01:00
Jason Tedor	284cc3a048	Script mode settings as booleans This commit modifies the accept values for script mode settings from "on", "off", and "sandbox" to "true", "false", and "sandbox".	2016-01-27 06:26:58 -05:00
Jason Tedor	9944573449	Rename methods on ScriptEngineService This commit method renames the ScriptEngineService interface methods types, extensions, and sandboxed to getTypes, getExtensions, and isSandboxed, respectively.	2016-01-27 06:26:04 -05:00
Jason Tedor	087e55cc51	Script mode settings This commit converts the script mode settings to the new settings infrastructure. This is a major refactoring of the handling of script mode settings. This refactoring is necessary because these settings are determined at runtime based on the registered script engines and the registered script contexts.	2016-01-27 06:26:04 -05:00
Simon Willnauer	71c3e57aee	Merge branch 'master' into trash_context_and_headers	2016-01-27 11:42:42 +01:00
Adrien Grand	35709f62b6	Be stricter about parsing boolean values in mappings. Parsing is currently very lenient, which has the bad side-effect that if you have a typo and pass eg. `store: fasle` this will actually be interpreted as `store: true`. Since mappings can't be changed after the fact, it is quite bad if it happens on an index that already contains data. Note that this does not cover all settings that accept a boolean, but since the PR was quite hard to build and already covers some main settirgs like `store` or `doc_values` this would already be a good incremental improvement.	2016-01-27 09:06:00 +01:00
Martijn van Groningen	2b2552c301	test: cleanup static test resources	2016-01-26 21:17:05 +01:00
javanna	61630c2b27	migrate node.local and node.mode to new Setting infra	2016-01-26 14:40:46 +01:00
javanna	36d98478bf	Merge branch 'master' into feature/ingest	2016-01-25 18:01:09 +01:00
David Pilato	b24dde88de	Merge branch 'malpani-aws-discovery-seoul' # Please enter a commit message to explain why this merge is necessary, # especially if it merges an updated upstream into a topic branch. # # Lines starting with '#' will be ignored, and an empty message aborts # the commit.	2016-01-25 08:25:31 +01:00
javanna	720c531364	[TEST] remove check for jvm true, site plugins have been removed	2016-01-22 16:51:51 +01:00
Yannick Welsch	296b48b9d1	Move discovery.* settings to new setting infrastructure Closes #16182	2016-01-22 15:35:00 +01:00
Daniel Mitterdorfer	e9bb3d31a3	Convert "path.*" and "pidfile" to new settings infra	2016-01-22 15:14:13 +01:00
javanna	4ad5e67433	Geoip processor: remove redundant latitude and longitude fields and make location an object with lat and lon subfields	2016-01-22 12:26:39 +01:00
Ryan Ernst	df24019261	Merge pull request #16038 from rjernst/remove_site_plugin Plugins: Remove site plugins	2016-01-21 12:32:22 -08:00
Tal Levy	3a6c2d008e	rename processor_tag to tag	2016-01-21 09:05:42 -08:00
Tal Levy	c55f9bbd9b	Merge pull request #16132 from talevy/holdmytag [Ingest] add an AbstractProcessor to help hold the re-used processorTag variable	2016-01-21 07:30:29 -08:00
Simon Willnauer	142547271e	Merge branch 'master' into trash_context_and_headers	2016-01-21 14:42:15 +01:00
Martijn van Groningen	44465c94f8	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-21 11:46:27 +01:00
Tal Levy	f1204cb1ab	add an AbstractProcessor to help hold the re-used processorTag variable	2016-01-20 14:34:26 -08:00
Robert Muir	6e7e3a2274	Update lucene to r1725675 Adds DFI (divergence from independence) provider. Fixes test bugs passing invalid values for BM25 parameters.	2016-01-20 03:32:51 -05:00
Martijn van Groningen	e2e207687d	Fixes due to changes in master branch.	2016-01-19 22:03:26 +01:00
Martijn van Groningen	602a0f183e	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-19 22:01:38 +01:00
Simon Willnauer	3d0cedbabb	Merge branch 'master' into trash_context_and_headers	2016-01-19 20:47:43 +01:00
Nik Everett	e1e73d9914	Create default for ExecutableScript#unwrap Tons of scripts just return the variable they are passed and that is the most intuitive behavior so that may as well be the default implementation.	2016-01-19 13:17:11 -05:00
Jack Conradson	2249a640bb	Plan A Update... New Features: Exceptions (throw, try/catch) Infinite Loop Checking Iterators added to the API Fixes: Improved loop path analysis Fixed set/add issue with shortcuts in lists Fixed minor promotion issue with incorrect type Fixed score issue related to score extending Number Documentation: Added JavaDocs for some files	2016-01-19 09:49:03 -08:00
David Pilato	725d7f968b	Merge branch 'fix/12567-azure-timeout'	2016-01-19 18:02:33 +01:00
David Pilato	6db5b5033c	Change exception message for wrong timeout in azure repository settings Backport in master this change: https://github.com/elastic/elasticsearch/pull/15950#discussion-diff-50128378 Related to #15080 Related to #15950	2016-01-19 18:01:54 +01:00
David Pilato	d6e6ef4ec4	Merge branch 'pr/tests-azure-repo'	2016-01-19 17:07:59 +01:00
Nik Everett	6b6d01c4fe	Merge pull request #16072 from nik9000/xlint_plugins Remove remaining xlints from plugins	2016-01-19 10:54:15 -05:00
Nik Everett	4efa8c4ff5	Remove remaining xlints from plugins	2016-01-19 10:53:48 -05:00
David Pilato	8d35bca4db	Add more tests for Azure Repository client selection One test we forgot in #14843 and #13779 is the default client selection. Most of the time, users won't define explicitly which client they want to use because they are providing only one connection to Azure storage: ```yml cloud: azure: storage: my_account: account: your_azure_storage_account key: your_azure_storage_key ``` Then using the default client like this: ```sh # This one will use the default account (my_account1) curl -XPUT localhost:9200/_snapshot/my_backup1?pretty -d '{ "type": "azure" }' ``` This commit adds tests to check that the right client is still selected when no client is explicitly set when creating the snapshot.	2016-01-19 14:23:08 +01:00
David Pilato	11e7a746ae	Replace server side timeout by client side timeout This commit replaces server side timeout (which is BTW not correctly implemented in azure client 2.0.0 but fixed later #16084) with a client side timeout. As a consequence, for each request sent to azure, azure client will raise an exception after a given amount of time (timeout). Closes #12567	2016-01-19 13:56:08 +01:00
Simon Willnauer	fbfa9f4925	Merge branch 'master' into new_index_settings	2016-01-19 10:13:48 +01:00
Simon Willnauer	8e0390b09e	Register index.version_created for several analysis plugin tests	2016-01-19 09:35:32 +01:00
Adrien Grand	d6cbd6f2f0	Merge pull request #16059 from jpountz/enhancement/mapping_merge_reason Expose the reason why a mapping merge is issued.	2016-01-19 09:27:32 +01:00
Simon Willnauer	20fe8ac854	Register index.version.created in Kuromoji unittest since it's a node level setting here	2016-01-19 09:14:41 +01:00
Nik Everett	53cce3c399	Merge pull request #16071 from nik9000/xlint_lang_groovy Remove Xlint from lang-groovy and discovery-azure	2016-01-18 21:06:11 -05:00
Ryan Ernst	ef4f0a8699	Test: Make rest test framework accept http directly for the test cluster The rest test framework, because it used to be tightly integrated with ESIntegTestCase, currently expects the addresses for the test cluster to be passed using the transport protocol port. However, it only uses this to then find the http address. This change makes ESRestTestCase extend from ESTestCase instead of ESIntegTestCase, and changes the sysprop used to tests.rest.cluster, which now takes the http address. closes #15459	2016-01-18 16:44:14 -08:00
Nik Everett	bd1324ccc9	Remove Xlint from lang-groovy and discovery-azure discovery-azure didn't actually need it. This removes all non-default Xlints from modules.	2016-01-18 18:24:44 -05:00
Nik Everett	08ca9d7645	Fix warnings in lang-plan-a Removes all Xlint skips in lang-plan-a. Warnings were just some extra casts which were removed, some raw types which we just needed to <?>, and a couple of unchecked casts in data that we know to be Map<String, Object> because that structure is super-ultra-common in scripts.	2016-01-18 15:27:51 -05:00
Adrien Grand	055953d6b3	Expose the reason why a mapping merge is issued. This would be useful in order to only perform some validations in the case of a mapping update and in cases when a mapping is restored eg. after a restart, such as discussed in #15989. This replaces the current `applyDefault` parameter which can be derived from the mapping merge reason: the default mapping should be applied only in case of a mapping update, if the mapping does not exist yet and if this is not the default mapping.	2016-01-18 17:41:23 +01:00
Simon Willnauer	a61723b538	convert index.store.fs.fs_lock	2016-01-18 10:17:11 +01:00
Simon Willnauer	7925e2ef84	convert IndexModule settings	2016-01-18 09:23:35 +01:00
Simon Willnauer	79f4697f3e	Register MockFSDirectoryService settings	2016-01-18 09:23:34 +01:00
Simon Willnauer	a8eedd0457	fix all kinds of crazy test failures	2016-01-18 09:23:34 +01:00
Simon Willnauer	ceef6d7a42	fix Murmur3FieldMapperTests	2016-01-18 09:23:34 +01:00
Ryan Ernst	3b78267c71	Plugins: Remove site plugins Site plugins used to be used for things like kibana and marvel, but there is no longer a need since kibana (and marvel as a kibana plugin) uses node.js. This change removes site plugins, as well as the flag for jvm plugins. Now all plugins are jvm plugins.	2016-01-16 22:45:37 -08:00
Tal Levy	9f48df9736	Add on_failure support for verbose _simulate execution and introduce optional processor_tag to Processors	2016-01-15 14:56:20 -08:00
javanna	af36e5f01e	updated ingest-geoip plugin description	2016-01-15 15:16:09 +01:00
javanna	050585e89f	remove BiFunction<Environment, TemplateService, Processor.Factory> in favour of Function<TemplateService, Processor.Factory> the environment is now available through NodeModule#getNode#getEnvironment and can be retrieved during onModule(NodeModule), no need for this indirection anymore using the BiFunction	2016-01-15 15:12:53 +01:00
javanna	dd7cae7c19	move DatabaseReaders initialization to IngestGeoIpPlugin#onModule	2016-01-15 15:02:35 +01:00
Martijn van Groningen	21cc0b2316	Cleanup ingest initialization code. * Folded IngestModule into NodeModule * Renamed IngestBootstrapper to IngestService * Let NodeService construct IngestService and removed the Guice annotations * Let IngestService implement Closable	2016-01-15 13:35:06 +01:00
javanna	9c06736dbd	Merge branch 'master' into feature/ingest	2016-01-15 10:11:56 +01:00
David Pilato	ed45ad6327	Fix Azure repository with only one primary account Using a single azure account is now rejected. This commit fixes this issue and adds a test for it. This regression was introduced with #13779. Hopefully no elasticsearch version has been released since then. Needs to be merged in 2.2, 2.x and master branches.	2016-01-14 13:50:02 +01:00
Martijn van Groningen	f3883343cb	Move the pipeline configuration from the dedicated index to the cluster state. Closes #15842	2016-01-13 22:59:36 +01:00
Simon Willnauer	ce42ae4cf3	Merge pull request #15922 from s1monw/feature/ingest Review feedback and several cleanups	2016-01-13 12:04:04 +01:00
Simon Willnauer	574d1b35b3	Replace ContextAndHeaders with a ThreadPool based ThreadLocal implementation ContextAndHeaders has a massive impact on the core infrastructure since it has to be manually passed on to all relevant places across threads/network calls etc. For the same reason it's also very error prone and easily forgotten on potentially relevant APIs. The new ThreadContext is associated with a ThreadPool (node or transport client) and ensures that headers and context registered on a current thread are inherited to new threads spawned, send across the network to be deserialized on the receiver end as well as restored on the response handling thread once the response is received.	2016-01-13 11:53:32 +01:00
javanna	ea8065aa3d	Merge branch 'master' into feature/ingest	2016-01-12 18:28:42 +01:00
Simon Willnauer	4d38a47eb5	Review feedback and several cleanups	2016-01-12 12:06:14 +01:00
Martijn van Groningen	7bdd2583aa	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-12 01:01:30 +01:00
Nik Everett	01ce49e94e	Ban Serializable 1. Uses forbidden patterns to prevent things from referencing java.io.Serializable or from mentioning serialVersionUID. 2. Uses -Xlint:-serial so we don't have to hear from javac that we aren't declaring serialVersionUID on any classes that we make that happen to extend Serializable. 3. Remove Serializable and serialVersionUID declarations. I didn't use forbidden apis because it doesn't look like it has a way to ban explicitly implementing Serializable. If you try to ban Serializable with forbidden apis you end up banning all Exceptions and all Strings. Closes #15847	2016-01-11 16:57:31 -05:00
Nik Everett	b29c416b7c	Merge pull request #15848 from nik9000/xlint3 Remove -Xlint:-deprecation from all but core	2016-01-11 16:55:13 -05:00
Martijn van Groningen	cd2155311f	renamed ingest plugin to ingest-geoip plugin, since it only contains the geoip processor	2016-01-08 22:50:54 +01:00
Martijn van Groningen	1637fe9e0b	Moved the grok processor to its own module, so that it will available out-of-the-box, while its dependencies are isolated	2016-01-08 21:59:23 +01:00
Robert Muir	46a8a48d23	Merge pull request #15851 from rmuir/geoip-nodns [ingest] Don't do DNS lookups from GeoIpProcessor	2016-01-08 13:17:20 -05:00
Nik Everett	6250f4dbaa	Remove deprecated azure settings	2016-01-08 13:13:14 -05:00
Jason Tedor	871d1b4885	Remove and forbid use of j.u.c.ThreadLocalRandom This commit removes and now forbids all uses of java.util.concurrent.ThreadLocalRandom across the codebase. The underlying issue with ThreadLocalRandom is that it can not be seeded. This means that if ThreadLocalRandom is used in production code, then tests that cover any code path containing ThreadLocalRandom will be prevented from being reproducible by use of ThreadLocalRandom. Instead, using org.elasticsearch.common.random.Randomness#get will give reproducible sources of random when running under tests and otherwise still give an instance of ThreadLocalRandom when running as production code.	2016-01-08 12:23:48 -05:00
Nik Everett	98fdb39d3d	Remove deprecated settings	2016-01-08 11:17:56 -05:00
javanna	de2eac4c49	removed leftover jodatime dependency	2016-01-08 14:31:35 +01:00
javanna	ae69d46f92	move processors that have no deps to core, also move to core rest spec and tests and set node.inget to true by default	2016-01-08 10:39:39 +01:00
Robert Muir	9c3ebb83a7	Don't do DNS lookups from GeoIpProcessor There is no need to involve DNS in this!	2016-01-07 23:12:44 -05:00
Nik Everett	81a7607256	Remove -Xlint:-deprecation from plugins Instead we suppress warnings about using deprecated stuff near the usage site with a comment about why its ok.	2016-01-07 20:44:46 -05:00
javanna	adac314328	revert move of IngestPlugin class This was moved accidentally as part of a previous refactoring.	2016-01-07 18:32:46 +01:00
Nik Everett	42cb2a5f02	Merge pull request #15811 from nik9000/def_cleanup Cleanups for Def	2016-01-07 11:03:01 -05:00
javanna	03fe38681e	renamed qa package o.e.plugin.ingest to o.e.ingest This way InternalTemplateService constructor can be set back to package private visibility	2016-01-07 15:51:52 +01:00
javanna	1ea690e814	Merge branch 'feature/ingest' into enhancement/move_to_core	2016-01-07 15:13:31 +01:00
javanna	7d7e0db91d	Merge branch 'master' into feature/ingest	2016-01-07 15:09:13 +01:00
Nik Everett	52f28888d5	Merge pull request #15813 from nik9000/xlint1 Remove Xlint:-override,-fallthrough,-static	2016-01-07 08:34:40 -05:00
javanna	eca1594969	start ingest thread pool only when node.ingest is set to true	2016-01-07 14:33:56 +01:00
javanna	2803ae09dc	addProcessor -> registerProcessor	2016-01-07 13:25:25 +01:00
javanna	18aabd67c8	adapt qa tests for when ingest.node is set to false CRUD and simulate apis work now fine, every node has the pipelines in memory, but node.ingest disables ingestion, meaning that any index or bulk request with a pipeline id is going to fail	2016-01-07 13:21:06 +01:00
Martijn van Groningen	9ec2e140b8	Merge branch 'master' into feature/ingest	2016-01-07 10:44:21 +01:00
Nik Everett	244120a065	Remove more Xlint skips	2016-01-06 23:53:05 -05:00
Nik Everett	0786c506dc	Remove a few more Xlint skips	2016-01-06 23:28:13 -05:00
Nik Everett	20e7fa97db	Remove Xlint:-override,-fallthrough,-static Adds `@SuppressWarnings("fallthrough")` in two places where the fallthrough is used to implement well known hashing algorithms.	2016-01-06 22:27:14 -05:00
Nik Everett	74c132afc6	Standardize some methods on varargs Right now we define the same sort of methods as taking String arrays and string varargs. We should standardize on one and varargs is easier to call so lets use varargs!	2016-01-06 21:01:58 -05:00
Nik Everett	32605ecb4f	Cleanups for Def Manually I: 1. Added some missing raw types warnings suppressions. 2. Removed some unused unchecked cast warning suppressions. 3. Added <?> to Class. I let my IDE: 1. Remove unneeded casts. 2. Reorder imports (just ignore these, everyone does).	2016-01-06 20:28:49 -05:00
javanna	9079a7e891	wip: move all the ingest infra to core	2016-01-06 19:10:44 +01:00
javanna	da3f460bd1	remove ProcessorFactoryProvider	2016-01-06 17:31:40 +01:00
javanna	f651f5a531	remove MapBinder guice binding for processors, use ProcessorsRegistry instead	2016-01-06 17:31:40 +01:00
javanna	94469d75f9	revert rename InternalTemplateService -> MustacheTemplateService	2016-01-06 17:31:39 +01:00
javanna	8251a50667	make ProcessorFactoryProvider extend BiFunction	2016-01-06 17:31:39 +01:00
javanna	635b9b5a46	clarified TemplateService comments We will keep this abstractions as it's convenient, otherwise IngestDocument would depend on ScriptService directly, and would explicitly rely on mustache which is not even part of core. better to have the interface in core, and the impl as part of the ingest plugin, which relies on mustache, shipped with core by default.	2016-01-06 17:31:39 +01:00
javanna	fa4dbdaea1	create ProcessorsModule during Node creation rather than as part of IngestPlugin initialization If we createProcessorsModule as part of the plugin, other plugins will not be able to register their own processors.	2016-01-06 17:31:39 +01:00
javanna	2478aafa46	move ingest api to core	2016-01-06 17:31:39 +01:00
Martijn van Groningen	702f712204	replaced thirdPartyAudit.missingClasses with specific excludes	2016-01-06 17:28:46 +01:00
Martijn van Groningen	1eb5ae1dce	fix compile errors due to changes in master	2016-01-06 17:28:35 +01:00
Martijn van Groningen	e275af8a58	Merge remote-tracking branch 'es/master' into feature/ingest	2016-01-06 17:28:08 +01:00
Martijn van Groningen	2d6adf6428	Percolator refactoring: * Added percolator field mapper that extracts the query terms and indexes these terms with the percolator query. * At percolate time these extracted terms are used to query percolator queries that are like to be evaluated. This can significantly cut down the time it takes to percolate. Whereas before all percolator queries were evaluated if they matches with the document being percolated. * Changes made to percolator queries are no longer immediately visible, a refresh needs to happen before the changes are visible. * By default the percolate api only returns upto 10 matches instead of returning all matching percolator queries. * Made percolate more modular, so that it is easier to add unit tests. * Added unit tests for the percolator. Closes #12664 Closes #13646	2016-01-06 16:08:10 +01:00
Luca Cavanna	eabd5e2714	Merge pull request #15787 from javanna/enhancement/processor_package move all the processors under the same package org.elasticsearch.ingest.processor	2016-01-06 13:28:35 +01:00
javanna	1e8995d984	move all the processors under the same package org.elasticsearch.ingest.processor	2016-01-06 12:36:28 +01:00
Igor Motov	a89dba27c2	Task Management: Add framework for registering and communicating with tasks Adds task manager class and enables all activities to register with the task manager. Currently, the immutable Transport*Activity class represents activity itself shared across all requests. This PR adds and an additional structure Task that keeps track of currently running requests and can be used to communicate with these requests using TransportTaskAction. Related to #15117	2016-01-05 12:24:43 -05:00
Tal Levy	183386173c	cleanup simulate test and add docs	2016-01-04 16:10:42 -08:00
Tal Levy	82d87e7149	Merge pull request #15647 from talevy/ingest_fail_processor [Ingest] Fail Processor	2016-01-04 11:35:51 -08:00
Tal Levy	9dd54f2b4f	Introduce a fail processor	2016-01-04 11:33:50 -08:00
Tal Levy	f34ce9ddf4	add on_failure context to ingest metadata during executeOnFailure	2016-01-04 11:24:27 -08:00
javanna	8de31b3c64	make index type and id optional in simulate api Default values are _index, _type and _id. Closes #15711	2016-01-04 13:11:44 +01:00
Adrien Grand	1a47226d9a	Merge pull request #15663 from jpountz/remove/mapping_backcompat Remove mapping backward compatibilit with pre-2.0.	2016-01-04 10:05:39 +01:00
Robert Muir	25914ae879	Merge pull request #15688 from rmuir/thirdPartyAudit3 Improve thirdPartyAudit check, round 3	2015-12-29 09:24:51 -05:00
Martijn van Groningen	f2fce0edca	Removed ingest runner main class, `gradle run --debug-jvm` should be used instead.	2015-12-29 13:07:23 +01:00
Martijn van Groningen	9fa3f469e9	fix test after merging in master branch	2015-12-29 13:00:27 +01:00
Martijn van Groningen	79161d77df	Merge remote-tracking branch 'es/master' into feature/ingest	2015-12-29 12:56:09 +01:00
David Pilato	96b3166c6d	Add timeout settings (default to 5 minutes) By default, azure does not timeout. This commit adds support for a timeout settings which defaults to 5 minutes. It's a timeout per request not a global timeout for a snapshot request. It can be defined globally, per account or both. Defaults to `5m`. ```yml cloud: azure: storage: timeout: 10s my_account1: account: your_azure_storage_account1 key: your_azure_storage_key1 default: true my_account2: account: your_azure_storage_account2 key: your_azure_storage_key2 timeout: 30s ``` In this example, timeout will be 10s for `my_account1` and 30s for `my_account2`. Closes #14277.	2015-12-29 11:40:48 +01:00
David Pilato	a49fe189b0	Support global `repositories.azure.` settings All those repository settings can also be defined globally in `elasticsearch.yml` file using prefix `repositories.azure.`. For example: ```yml repositories.azure: container: backup-container base_path: backups chunk_size: 32m compress": true ``` Closes #13776.	2015-12-29 10:43:01 +01:00
Robert Muir	180ab2493e	Improve thirdPartyAudit check, round 3	2015-12-28 22:38:55 -05:00
Adrien Grand	5eb7555ffb	Make text parsing less lenient. It now requires that the parser is on a value.	2015-12-28 16:06:42 +01:00
Martijn van Groningen	4a0ec0da26	Merge remote-tracking branch 'es/master' into feature/ingest	2015-12-24 15:34:20 +01:00
Martijn van Groningen	1936d6118d	Added index template for the '.ingest' index and added logic that ensure the index template is installed. An index template for the '.ingest' index is required because: * We don't want arbitrary fields in pipeline documents, because that can turn into upgrade problems if we add more properties to the pipeline dsl. * We know what are the usages are of the '.ingest' index, so we can optimize for that and prevent that this index is used for different purposes. Closes #15001	2015-12-24 15:18:07 +01:00
Adrien Grand	af122f4151	Remove mapping backward compatibilit with pre-2.0. This removes the backward compatibility layer with pre-2.0 indices, notably the extraction of _id, _routing or _timestamp from the source document when a path is defined.	2015-12-24 13:47:37 +01:00
Robert Muir	d144ba24a5	Merge pull request #15588 from rmuir/hdfs2-only merge current hdfs improvements to master	2015-12-23 18:17:22 -05:00
Robert Muir	f14a21639c	add cleanups from simon	2015-12-23 18:15:33 -05:00
Martijn van Groningen	767114adec	Merge remote-tracking branch 'es/master' into feature/ingest	2015-12-23 15:16:20 +01:00
Adrien Grand	d8d8666877	Remove `index_name` back compat. Since 2.0 we enforce that fields have the same full and index names. So in 3.x we can remove the ability to have different names on the same field.	2015-12-23 14:55:26 +01:00
Adrien Grand	56d2dd701e	Fix SizeMappingTests failure.	2015-12-23 10:48:00 +01:00
Adrien Grand	a2072fe927	Merge pull request #15539 from jpountz/fix/immutable_document_mapper Make mapping updates more robust.	2015-12-23 09:55:42 +01:00
Adrien Grand	f535c27024	Make mapping updates more robust. This changes a couple of things: Mappings are truly immutable. Before, each field mapper stored a MappedFieldTypeReference that was shared across fields that have the same name across types. This means that a mapping update could have the side-effect of changing the field type in other types when updateAllTypes is true. This works differently now: after a mapping update, a new copy of the mappings is created in such a way that fields across different types have the same MappedFieldType. See the new Mapper.updateFieldType API which replaces MappedFieldTypeReference. DocumentMapper is now immutable and MapperService.merge has been refactored in such a way that if an exception is thrown while eg. lookup structures are being updated, then the whole mapping update will be aborted. As a consequence, FieldTypeLookup's checkCompatibility has been folded into copyAndAddAll. Synchronization was simplified: given that mappings are truly immutable, we don't need the read/write lock so that no documents can be parsed while a mapping update is being processed. Document parsing is not performed under a lock anymore, and mapping merging uses a simple synchronized block.	2015-12-23 09:55:07 +01:00
Martijn van Groningen	8b671d7d93	added note	2015-12-22 22:52:23 +01:00
Martijn van Groningen	dbbb296322	added a `node.ingest` setting that controls whether ingest is active or not. Defaults to `false`. If `node.ingest` isn't active then ingest related API calls fail and if the `pipeline_id` parameter is set then index and bulk requests fail.	2015-12-22 22:38:49 +01:00
Tal Levy	44d64c8a45	rename pipeline_id param to pipeline	2015-12-22 12:30:04 -08:00
Lee Hinman	482843e27b	Fix build to run correctly on FreeBSD This adds the required changes/checks so that the build can run on FreeBSD. There are a few things that differ between FreeBSD and Linux: - CPU probes return -1 for CPU usage - `hot_threads` cannot be supported on FreeBSD From OpenJDK's `os_bsd.cpp`: ```c++ bool os::is_thread_cpu_time_supported() { #ifdef __APPLE__ return true; #else return false; #endif } ``` So this API now returns (for each FreeBSD node): ``` curl -s localhost:9200/_nodes/hot_threads ::: {Devil Hunter Gabriel}{q8OJnKCcQS6EB9fygU4R4g}{127.0.0.1}{127.0.0.1:9300} hot_threads is not supported on FreeBSD ``` - multicast fails in native `join` method - known bug: https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=193246 Which causes: ``` 1> Caused by: java.net.SocketException: Invalid argument 1> at java.net.PlainDatagramSocketImpl.join(Native Method) 1> at java.net.AbstractPlainDatagramSocketImpl.join(AbstractPlainDatagramSocketImpl.java:179) 1> at java.net.MulticastSocket.joinGroup(MulticastSocket.java:323) 1> at org.elasticsearch.plugin.discovery.multicast.MulticastChannel$Plain.buildMulticastSocket(MulticastChannel.java:309) ``` So these tests are skipped on FreeBSD. Resolves #15562	2015-12-22 12:36:04 -07:00
Tal Levy	0bf4c8fb82	Add on_failure field to processors and pipelines. both processors and pipelines now have the ability to define a separate list of processors to be executed if the original line of execution throws an Exception. processors without an on_failure parameter defined will throw an exception and exit the pipeline immediately. processors with on_failure defined will catch the exception and allow for further processors to run. Exceptions within the on_failure block will be treated the same as the top-level.	2015-12-22 10:30:42 -08:00
Robert Muir	7abd051734	better containing of hadoop for actual blobstore operations	2015-12-22 12:07:37 -05:00
javanna	46f99a11a0	Add append processor The append processor allows to append one or more values to an existing list; add a new list with the provided values if the field doesn't exist yet, or convert an existing scalar into a list and add the provided values to the newly created list. This required adapting of IngestDocument#appendFieldValue behaviour, also added support for templating to it. Closes #14324	2015-12-22 16:11:43 +01:00
Martijn van Groningen	72fc34731e	applied feedback	2015-12-22 12:20:16 +01:00
Martijn van Groningen	7e99ee4edf	ExecutionService should be able to process multiple index requests at a time.	2015-12-22 12:02:36 +01:00
Martijn van Groningen	3fcb8ecd54	Upgraded geoip2 and its dependencies	2015-12-22 11:39:43 +01:00
javanna	188953922a	adapt to upstream changes Removed wildcard imports and fixed two broken license headers	2015-12-22 11:22:47 +01:00
javanna	f214271d89	Merge branch 'master' into feature/ingest	2015-12-22 11:14:55 +01:00
Robert Muir	010d1a89c5	Merge branch 'master' into hdfs2-only	2015-12-22 00:40:54 -05:00
Robert Muir	91dcc9e073	tidy up	2015-12-22 00:28:53 -05:00
Robert Muir	9573bb9f15	make sure BlobStore.close always triggers ACE on any access afterwards	2015-12-22 00:21:03 -05:00
Robert Muir	a04268e42e	reorder checks	2015-12-21 23:52:16 -05:00
Robert Muir	a587ba110c	add some safety around repository	2015-12-21 23:48:22 -05:00
Ryan Ernst	d104d6d652	Refactor hdfs unit tests to be simple and check every configuration error condition	2015-12-21 19:38:54 -08:00
Ryan Ernst	af7d6b629c	Change hdfs unit tests to be a single node test instead of integ test	2015-12-21 18:32:28 -08:00
Robert Muir	c54d53c8d5	streamline these classes a bit	2015-12-21 21:11:06 -05:00
Robert Muir	795869c345	remove filecontextfactory	2015-12-21 20:55:17 -05:00
Robert Muir	956281f039	remove shitton of permissions	2015-12-21 20:26:13 -05:00
Ryan Ernst	d0e9306413	Remove reading node settings as defaults for hdfs repository settings	2015-12-21 16:25:55 -08:00
Robert Muir	3c07a427dc	fix exc handling	2015-12-21 19:06:56 -05:00
Ryan Ernst	26eaa16a89	Remove "additional config" from hdfs repositories	2015-12-21 15:55:13 -08:00
Robert Muir	2cbfc54a81	avoid too-long classpath so it works on windows	2015-12-21 18:25:08 -05:00
Robert Muir	7065639a26	add test for listing	2015-12-21 17:25:15 -05:00
Robert Muir	5ebcf183e5	tests	2015-12-21 17:02:50 -05:00
Robert Muir	b8524bdb11	add tests	2015-12-21 16:16:24 -05:00
Robert Muir	3a2464b80e	improve build logic on windows without native libraries	2015-12-21 15:37:34 -05:00
Robert Muir	0ed45c5bfb	remove filesystem leniency	2015-12-21 14:16:53 -05:00
Robert Muir	deaf8884e9	Fix exc handling	2015-12-21 13:04:22 -05:00
Robert Muir	3ffd1a5219	final	2015-12-21 12:54:33 -05:00
Robert Muir	f81b12e327	minimize accessiblity, remove unused threadpool	2015-12-21 12:39:40 -05:00
Adrien Grand	cf52e96c42	Upgrade to lucene-5.5.0-snapshot-1721183. Some files that implement or use the Scorer API had to be changed because of https://issues.apache.org/jira/browse/LUCENE-6919.	2015-12-21 17:02:08 +01:00
Adrien Grand	ac393b7a31	Make mappings tests more realistic. DocumentMapperParser has both parse and parseCompressed methods. Except that the parse methods are ONLY used from the unit tests. This commit removes the parse method and moves all tests to parseCompressed so that they test more realistically how mappings are managed. Then I renamed parseCompressed to parse given that this is the only alternative anyway.	2015-12-21 10:44:00 +01:00
Robert Muir	f67390e0c8	in the plugin: guard against HADOOP_HOME in environment on any platform. hdfs fixture: minihdfs works on windows now, if things are properly set but our test fixture still cannot launch this on windows.	2015-12-21 02:21:53 -05:00
Robert Muir	53530f1243	remove hacks, test fixtures are clean before each execution	2015-12-20 22:23:30 -05:00
Robert Muir	935c2c75f6	Remove slf4j hack	2015-12-20 22:08:18 -05:00
Robert Muir	04966bcc3e	contain and improve hack	2015-12-20 21:02:03 -05:00
Robert Muir	03a2b6b01b	Disable HDFS fixture on windows, it requires native libraries.	2015-12-20 16:30:19 -08:00
Robert Muir	a37417085d	blind stab at unit test issues on windows	2015-12-20 18:31:55 -05:00
Robert Muir	ee546ff655	try to get windows working	2015-12-20 17:10:01 -05:00
Robert Muir	2347e3c373	Get forbidden apis passing again, this needs to be investigated	2015-12-20 16:17:17 -05:00
Robert Muir	7ac49bb278	Merge branch 'hdfs2-only' of github.com:costin/elasticsearch into hdfs2-only	2015-12-20 16:12:23 -05:00
Robert Muir	12a8428dfb	Add MiniHDFS test fixture, started before integTest and shut down after. Currently uses a hardcoded port (9999), need to apply MavenFilteringHack after it starts.	2015-12-20 16:00:37 -05:00
Martijn van Groningen	66d8b5342f	s/PipelineStoreBootstrapper/IngestBootstrapper	2015-12-20 20:18:46 +01:00
Costin Leau	3204e87220	Restrict usage to HDFS only	2015-12-20 15:53:18 +02:00
Robert Muir	ae89c6e51c	Merge branch 'master' into hdfs2-only	2015-12-19 21:52:52 -05:00
Ryan Ernst	9cb4c82c58	Build: Add fixture capabilities to integ tests This change adds a Fixture class for use by gradle. A Fixture is an external process that integration tests will use. It can be added as a dependsOn for integTest, and will automatically be shutdown upon success or failure, as well as relevant information dumped on failure. There is also an example fixture in this change.	2015-12-19 15:46:21 -08:00
Robert Muir	8c6f5a0c60	add failing test	2015-12-19 15:05:38 -08:00
Robert Muir	5dcccca848	add example fixture	2015-12-19 15:05:37 -08:00
Robert Muir	d171773bdb	remove leniency in tests	2015-12-19 04:39:01 -05:00
Robert Muir	e2b2ee24fa	Add licensing for dependencies	2015-12-19 03:06:40 -05:00
Robert Muir	9df447295c	Fix unit tests (also works from IDE).	2015-12-19 02:43:27 -05:00
Robert Muir	3269beeb4d	don't throw exceptions from ctor, guice is hell	2015-12-19 02:09:14 -05:00
Robert Muir	f174e96a14	explicitly initialize some hadoop classes elevated, so we don't rely on classloading order. maybe this allows us to do less stuff in doPriv later, we will see. at least it makes things like unit testing easier.	2015-12-19 00:21:01 -05:00
Robert Muir	2e8c68d09b	Remove no-longer needed domaincombiner stuff	2015-12-18 23:51:41 -05:00
Robert Muir	02fbd55118	enable thirdPartyAudit so you can see the crazy shit hadoop does	2015-12-18 23:45:05 -05:00
Robert Muir	bc11962438	get full snapshot restore tests passing	2015-12-18 23:16:41 -05:00
Robert Muir	fbe3d64ea4	add passing test that takes snapshot	2015-12-18 22:55:15 -05:00
Robert Muir	75ef9da53f	get up to connectexception	2015-12-18 22:11:58 -05:00
Ryan Ernst	c2c5081830	Remove uneeded class loading stuff from hdfs plugin	2015-12-18 17:01:38 -08:00
Ryan Ernst	91fe99a7f6	Make hdfs plugin not use transitive deps	2015-12-18 16:52:22 -08:00
Costin Leau	7584810ff4	* Make plugin hadoop2-only Polish MiniDFS cluster to be Hadoop2 (instead of Hadoop1) based	2015-12-19 01:35:53 +02:00
Ryan Ernst	690fb2cd3f	Rename InternalFilters.Bucket to InternalFilters.InternalBucket to avoid name collision	2015-12-18 13:22:20 -08:00
Ryan Ernst	4ea19995cf	Remove wildcard imports	2015-12-18 12:43:47 -08:00
Robert Muir	447729f0e1	add missing license headers	2015-12-18 13:08:17 -05:00
Robert Muir	2e2e328879	add missing license header	2015-12-18 13:02:39 -05:00
Martijn van Groningen	6dfcee6937	Added an internal reload pipeline api that makes sure pipeline changes are visible on all ingest nodes after modifcations have been made. * Pipeline store can now only start when there is no .ingest index or all primary shards of .ingest have been started * IngestPlugin adds`node.ingest` setting to `true`. This is used to figure out to what nodes to send the refresh request too. This setting isn't yet configurable. This will be done in a follow up issue. * Removed the background pipeline updater and added added logic to deal with specific scenarious to reload all pipelines. * Ingest services are no longer be managed by Guice. Only the bootstrapper gets managed by guice and that contructs all the services/components ingest will need.	2015-12-18 18:24:27 +01:00
Martijn van Groningen	e8a8e22e09	Add template infrastructure, removed meta processor and added template support to set and remove processor. Added ingest wide template infrastructure to IngestDocument Added a TemplateService interface that the ingest framework uses Added a TemplateService implementation that the ingest plugin provides that delegates to the ES' script service Cut SetProcessor over to use the template infrastructure for the `field` and `value` settings. Removed the MetaDataProcessor Removed dependency on mustache library Added qa ingest mustache rest test so that the ingest and mustache integration can be tested.	2015-12-18 17:35:53 +01:00
Martijn van Groningen	a56902567e	don't register rest actions on transport clients	2015-12-18 15:49:00 +01:00
Martijn van Groningen	2c5bb84851	fix copyDefaultGeoIp2DatabaseFiles task to work again	2015-12-18 15:45:56 +01:00
javanna	3e155f7b54	adapt to upstream changes: thirdPartyAudit.missingClasses set to true geoip depends on asm and google http client which we don't need	2015-12-18 11:14:39 +01:00
javanna	f349669071	adapt to upstream changes: enableMockModules => getMockPlugins	2015-12-18 11:13:34 +01:00
javanna	8bae93eee1	adapt to upstream changes: StringText => Text	2015-12-18 11:13:18 +01:00
javanna	885b01fb49	adapt to upstream changes: RestModule -> NetworkModule	2015-12-18 10:36:29 +01:00
javanna	5f2df6b95a	Merge branch 'master' into feature/ingest	2015-12-18 10:34:07 +01:00
Simon Willnauer	eca2435838	Merge branch 'master' into settings_prototype	2015-12-18 09:15:58 +01:00
Adrien Grand	6ea16671f4	Simplify the Text API. We have the Text API, which is essentially a wrapper around a String and a BytesReference and then we have 3 implementations depending on whether the String view should be cached, the BytesReference view should be cached, or both should be cached. This commit merges everything into a single Text that is essentially the old StringAndBytesText impl. Long term we should look into whether this API has any performance benefit or if we could just use plain strings. This would greatly simplify all our other APIs that currently use Text.	2015-12-17 17:22:38 +01:00
Simon Willnauer	eae3da3b54	Merge branch 'master' into settings_prototype	2015-12-17 15:13:41 +01:00
Adrien Grand	6ccc759691	Merge pull request #15480 from jpountz/fix/mapping_explicit_defaults Make mapping serialization more robust.	2015-12-17 11:23:27 +01:00
Robert Muir	a7cc91e868	Merge pull request #15501 from rmuir/sheisty_classes thirdPartyAudit round 2	2015-12-17 03:44:27 -05:00
Robert Muir	6692e42d9a	thirdPartyAudit round 2 This fixes the `lenient` parameter to be `missingClasses`. I will remove this boolean and we can handle them via the normal whitelist. It also adds a check for sheisty classes (jar hell with the jdk). This is inspired by the lucene "sheisty" classes check, but it has false positives. This check is more evil, it validates every class file against the extension classloader as a resource, to see if it exists there. If so: jar hell. This jar hell is a problem for several reasons: 1. causes insanely-hard-to-debug problems (like bugs in forbidden-apis) 2. hides problems (like internal api access) 3. the code you think is executing, is not really executing 4. security permissions are not what you think they are 5. brings in unnecessary dependencies 6. its jar hell The more difficult problems are stuff like jython, where these classes are simply 'uberjared' directly in, so you cant just fix them by removing a bogus dependency. And there is a legit reason for them to do that, they want to support java 1.4.	2015-12-17 02:35:00 -05:00
Jack Conradson	4523eaec88	Added plumbing for compile time script parameters. Closes #15464	2015-12-16 18:29:21 -08:00
Robert Muir	4f9d4103f2	Merge pull request #15491 from rmuir/forbidden_third_party Add gradle thirdPartyAudit to precommit tasks	2015-12-16 18:56:50 -05:00
Robert Muir	42138007db	add some more comments about internal api usage	2015-12-16 18:56:02 -05:00
Robert Muir	ee79d46583	Add gradle thirdPartyAudit to precommit tasks	2015-12-16 16:38:16 -05:00
Ryan Ernst	a2b8f4b90a	Merge pull request #15434 from rjernst/http_type Expose http.type setting, and collapse al(most all) modules relating to transport/http	2015-12-16 11:54:30 -08:00
Simon Willnauer	71b204ea49	Merge branch 'master' into settings_prototype	2015-12-16 20:29:21 +01:00
Adrien Grand	8ac8c1f547	Make mapping serialization more robust. When creating a metadata mapper for a new type, we reuse an existing configuration from an existing type (if any) in order to avoid introducing conflicts. However this field type that is provided is considered as both an initial configuration and the default configuration. So at serialization time, we might only serialize the difference between the current configuration and this default configuration, which might be different to what is actually considered the default configuration. This does not cause bugs today because metadata mappers usually override the toXContent method and compare the current field type with Defaults.FIELD_TYPE instead of defaultFieldType() but I would still like to do this change to avoid future bugs.	2015-12-16 16:08:45 +01:00
Martijn van Groningen	07951fc731	added comment why 'accessDeclaredMembers' permission is needed	2015-12-16 10:11:46 +01:00
Simon Willnauer	6ea266a89c	Merge branch 'master' into settings_prototype	2015-12-15 16:33:01 +01:00
Adrien Grand	d94bba2d9c	Remove back compat for the `path` option. The `path` option allowed to index/store a field `a.b.c` under just `c` when set to `just_name`. This "feature" has been removed in 2.0 in favor of `copy_to` so we can remove the back compat in 3.x.	2015-12-15 14:55:23 +01:00
Adrien Grand	50eeafa75c	Make mappings immutable. Today mappings are mutable because of two APIs: - Mapper.merge, which expects changes to be performed in-place - IncludeInAll, which allows to change whether values should be put in the `_all` field in place. This commit changes both APIs to return a modified copy instead of modifying in place so that mappings can be immutable. For now, only the type-level object is immutable, but in the future we can imagine making them immutable at the index-level so that mapping updates could be completely atomic at the index level. Close #9365	2015-12-15 10:20:28 +01:00
Martijn van Groningen	e87709f593	fix ingest runner	2015-12-15 10:18:19 +01:00
Ryan Ernst	60d35c81af	Plugins: Expose http.type setting, and collapse al(most all) modules relating to transport/http This change adds back the http.type setting. It also cleans up all the transport related guice code to be consolidated within the NetworkModule (as transport and http related stuff is what and how ES exposes over the network). The setter methods previously used by some plugins to override eg the TransportService or HttpServerTransport are removed, and those plugins should now register a custom implementation of the class with a name and set that using the appropriate config setting. Note that I think ActionModule should also be moved into here, to sit along side the rest actions, but I left that for a followup. closes #14148	2015-12-14 22:01:04 -08:00
Costin Leau	7bca97bba6	HDFS Snapshot/Restore plugin Migrated from ES-Hadoop. Contains several improvements regarding: * Security Takes advantage of the pluggable security in ES 2.2 and uses that in order to grant the necessary permissions to the Hadoop libs. It relies on a dedicated DomainCombiner to grant permissions only when needed only to the libraries installed in the plugin folder Add security checks for SpecialPermission/scripting and provides out of the box permissions for the latest Hadoop 1.x (1.2.1) and 2.x (2.7.1) * Testing Uses a customized Local FS to perform actual integration testing of the Hadoop stack (and thus to make sure the proper permissions and ACC blocks are in place) however without requiring extra permissions for testing. If needed, a MiniDFS cluster is provided (though it requires extra permissions to bind ports) Provides a RestIT test * Build system Picks the build system used in ES (still Gradle)	2015-12-14 21:50:09 +02:00
Martijn van Groningen	aaacf096d2	Merge remote-tracking branch 'es/master' into feature/ingest	2015-12-14 11:57:10 +01:00
Jason Tedor	3383c24be0	Remove and forbid use of Collections#shuffle(List) and Random#<init>() This commit removes and now forbids all uses of Collections#shuffle(List) and Random#<init>() across the codebase. The rationale for removing and forbidding these methods is to increase test reproducibility. As these methods use non-reproducible seeds, production code and tests that rely on these methods contribute to non-reproducbility of tests. Instead of Collections#shuffle(List) the method Collections#shuffle(List, Random) can be used. All that is required then is a reproducible source of randomness. Consequently, the utility class Randomness has been added to assist in creating reproducible sources of randomness. Instead of Random#<init>(), Random#<init>(long) with a reproducible seed or the aforementioned Randomess class can be used. Closes #15287	2015-12-11 11:16:38 -05:00
Martijn van Groningen	d38cccb8a1	Fix issues after merging in master	2015-12-11 14:51:45 +01:00
Martijn van Groningen	503a166b71	Merge remote-tracking branch 'es/master' into feature/ingest	2015-12-11 14:32:16 +01:00
Robert Muir	2741888498	Remove RuntimePermission("accessDeclaredMembers") Upgrades lucene to 5.5.0-1719088, randomizedtesting to 2.3.2, and securemock to 1.2	2015-12-10 14:26:55 -05:00
Boaz Leskes	fafeb3abdd	Introduce a common base response class to all single doc write ops IndexResponse, DeleteResponse and UpdateResponse share some logic. This can be unified to a single DocWriteResponse base class. On top, some replication actions are now not about write operations anymore. This commit renames ActionWriteResponse to ReplicationResponse Last some toXContent is moved from the Rest layer to the actual response classes, for more code re-sharing. Closes #15334	2015-12-10 15:14:02 +01:00
David Pilato	c1f7171e61	Add support for path_style_access From https://github.com/elastic/elasticsearch-cloud-aws/pull/159 Add a new option `path_style_access` for S3 buckets. It adds support for path style access for [virtual hosting of buckets](http://docs.aws.amazon.com/AmazonS3/latest/dev/VirtualHosting.html). Defaults to `false`. Closes https://github.com/elastic/elasticsearch-cloud-aws/issues/124.	2015-12-10 09:00:31 +01:00
Jack Conradson	da5b07ae13	Added a new scripting language (PlanA). Closes #15136	2015-12-09 16:32:37 -08:00
David Pilato	414fccb7d1	Merge branch 'fix/15268-proxy-auth'	2015-12-09 23:21:00 +01:00
javanna	a8382de09d	add comment to clarifiy why metadata fields can be set all the time to IndexRequest in PipelineExecutionService	2015-12-09 18:52:43 +01:00
javanna	a95f81c015	avoid stripping out _source prefix when in _ingest context	2015-12-09 18:42:20 +01:00
javanna	57d6971252	Streamline support for get/set/remove of metadata fields and ingest metadata fields Unify metadata map and source, add also support for _ingest prefix. Depending on the prefix, either _source, nothing or _ingest, we will figure out which map to use for values retrieval, but also modifications.	2015-12-09 18:36:44 +01:00
javanna	744d2908a8	avoid null values in simulate serialization prototypes, use empty maps instead	2015-12-09 18:36:44 +01:00
javanna	6b7446beb9	Remove sourceModified flag from IngestDocument If one is using the ingest plugin and providing a pipeline id with the request, the chance that the source is going to be modified is 99%. We shouldn't worry about keeping track of whether something changed. That seemed useful at first so we can save the resources for setting back the source (map to bytes) when not needed. Also, we are trying to unify metadata fields and source in the same map and that is going to complicate how we keep track of changes that happen in the source only. Best solution is to remove the flag.	2015-12-09 18:36:43 +01:00
javanna	b0d7d604ff	Add support for transient metadata to IngestDocument IngestDocument now holds an additional map of transient metadata. The only field that gets added automatically is `timestamp`, which contains the timestamp of ingestion in ISO8601 format. In the future it will be possible to eventually add or modify these fields, which will not get indexed, but they will be available via templates to all of the processors. Transient metadata will be visualized by the simulate api, although they will never get indexed. Moved WriteableIngestDocument to the simulate package as it's only used by simulate and it's now modelled for that specific usecase. Also taken the chance to remove one IngestDocument constructor used only for testing (accepting only a subset of es metadata fields). While doing that introduced some more randomizations to some existing processor tests. Closes #15036	2015-12-09 18:36:01 +01:00
javanna	5bc1e46113	setFieldValue for list to replace when an index is specified It used to do add instead, which is not consistent with the behaviour of set, which always replaces.	2015-12-09 18:28:07 +01:00
javanna	8240031216	Merge branch 'master' into feature/ingest	2015-12-09 18:14:32 +01:00
Simon Willnauer	a49120bfc1	fix compilation	2015-12-09 12:26:28 +01:00
Simon Willnauer	85a1b54867	fix compilation	2015-12-09 11:41:14 +01:00
Martijn van Groningen	233de434a0	Merge pull request #15310 from martijnvg/ingest/stream_put_and_delete_responses Streamline put & delete pipeline responses with index & delete responses	2015-12-09 11:30:57 +01:00
Simon Willnauer	c9d7c92243	fold ClusterSettingsService into ClusterSettings	2015-12-09 09:57:39 +01:00
Britta Weber	e0aa481bf5	Merge pull request #15213 from brwe/copy-to-in-multi-fields-exception throw exception if a copy_to is within a multi field Copy to within multi field is ignored from 2.0 on, see #10802. Instead of just ignoring it, we should throw an exception if this is found in the mapping when a mapping is added. For already existing indices we should at least log a warning. We remove the copy_to in any case. related to #14946	2015-12-08 14:41:07 +01:00
Simon Willnauer	fbbb04b87e	Add infrastructure to transactionally apply and reset dynamic settings This commit adds the infrastructure to make settings that are updateable resetable and changes the application of updates to be transactional. This means setting updates are either applied or not. If the application failes all values are rejected. This initial commit converts all dynamic cluster settings to make use of the new infrastructure. All cluster level dynamic settings are not resettable to their defaults or to the node level settings. The infrastructure also allows to list default values and descriptions which is not fully implemented yet. Values can be reset using a list of key or simple regular expressions. This has only been implemented on the java layer yet. For instance to reset all recovery settings to their defaults a user can just specify `indices.recovery.*`. This commit also adds strict settings validation, if a setting is unknown or if a setting can not be applied the entire settings update request will fail.	2015-12-08 14:39:15 +01:00
Martijn van Groningen	a2cda4e3f2	Streamline the put and delete pipelines responses with the index and delete response in core.	2015-12-08 14:01:28 +01:00
David Pilato	7dcb40bcac	Add support for proxy authentication for s3 and ec2 When using S3 or EC2, it was possible to use a proxy to access EC2 or S3 API but username and password were not possible to be set. This commit adds support for this. Also, to make all that consistent, proxy settings for both plugins have been renamed: * from `cloud.aws.proxy_host` to `cloud.aws.proxy.host` * from `cloud.aws.ec2.proxy_host` to `cloud.aws.ec2.proxy.host` * from `cloud.aws.s3.proxy_host` to `cloud.aws.s3.proxy.host` * from `cloud.aws.proxy_port` to `cloud.aws.proxy.port` * from `cloud.aws.ec2.proxy_port` to `cloud.aws.ec2.proxy.port` * from `cloud.aws.s3.proxy_port` to `cloud.aws.s3.proxy.port` New settings are `proxy.username` and `proxy.password`. ```yml cloud: aws: protocol: https proxy: host: proxy1.company.com port: 8083 username: myself password: theBestPasswordEver! ``` You can also set different proxies for `ec2` and `s3`: ```yml cloud: aws: s3: proxy: host: proxy1.company.com port: 8083 username: myself1 password: theBestPasswordEver1! ec2: proxy: host: proxy2.company.com port: 8083 username: myself2 password: theBestPasswordEver2! ``` Note that `password` is filtered with `SettingsFilter`. We also fix a potential issue in S3 repository. We were supposed to accept key/secret either set under `cloud.aws` or `cloud.aws.s3` but the actual code never implemented that. It was: ```java account = settings.get("cloud.aws.access_key"); key = settings.get("cloud.aws.secret_key"); ``` We replaced that by: ```java String account = settings.get(CLOUD_S3.KEY, settings.get(CLOUD_AWS.KEY)); String key = settings.get(CLOUD_S3.SECRET, settings.get(CLOUD_AWS.SECRET)); ``` Also, we extract all settings for S3 in `AwsS3Service` as it's already the case for `AwsEc2Service` class. Closes #15268.	2015-12-07 23:10:54 +01:00
Tal Levy	e6b287c083	Merge pull request #15133 from talevy/the_one_true_field [Ingest] Have processors operate one field at time.	2015-12-07 08:31:08 -08:00
Tal Levy	45f48ac126	update all processors to only operate on one field at a time when possible	2015-12-07 08:30:00 -08:00
Martijn van Groningen	6062c4eac9	Merge branch 'master' into feature/ingest	2015-12-07 15:53:39 +01:00
Robert Muir	2169a123a5	Filter classes loaded by scripts Since 2.2 we run all scripts with minimal privileges, similar to applets in your browser. The problem is, they have unrestricted access to other things they can muck with (ES, JDK, whatever). So they can still easily do tons of bad things This PR restricts what classes scripts can load via the classloader mechanism, to make life more difficult. The "standard" list was populated from the old list used for the groovy sandbox: though a few more were needed for tests to pass (java.lang.String, java.util.Iterator, nothing scary there). Additionally, each scripting engine typically needs permissions to some runtime stuff. That is the downside of this "good old classloader" approach, but I like the transparency and simplicity, and I don't want to waste my time with any feature provided by the engine itself for this, I don't trust them. This is not perfect and the engines are not perfect but you gotta start somewhere. For expert users that need to tweak the permissions, we already support that via the standard java security configuration files, the specification is simple, supports wildcards, etc (though we do not use them ourselves).	2015-12-05 21:46:52 -05:00
Robert Muir	46377778a9	Merge branch 'master' into getClassLoader	2015-12-04 15:58:36 -05:00
Robert Muir	b0c64910b0	ban RuntimePermission("getClassLoader") this gives more isolation between modules and plugins.	2015-12-04 15:58:02 -05:00
Ryan Ernst	01d48e2062	Merge branch 'master' into jigsaw	2015-12-04 11:29:49 -08:00
David Pilato	619fb998e8	Update Azure Service Management API to 0.9.0 Azure team released new versions of their Java SDK. According to https://github.com/Azure/azure-sdk-for-java/wiki/Azure-SDK-for-Java-Features, it comes with 2 versions. We should at least update to `0.9.0` of V1 but also consider moving to the new APIs (V2). This commit first updates to latest API V1. ```xml <dependency> <groupId>com.microsoft.azure</groupId> <artifactId>azure-svc-mgmt-compute</artifactId> <version>0.9.0</version> </dependency> ``` Closes #15209	2015-12-04 17:32:11 +01:00
javanna	d7c3b51b9c	[TEST] adapt to upstream changes	2015-12-04 16:35:53 +01:00
javanna	73986cc54f	adapt to upstream changes	2015-12-04 14:17:07 +01:00
javanna	6c43137413	Merge branch 'master' into feature/ingest	2015-12-04 14:10:01 +01:00
Adrien Grand	3f86adddbf	Remove MergeMappingException. Failures to merge a mapping can either come as a MergeMappingException if they come from Mapper.merge or as an IllegalArgumentException if they come from FieldTypeLookup.checkCompatibility. I think we should settle on one: this pull request replaces all usage of MergeMappingException with IllegalArgumentException.	2015-12-04 12:56:26 +01:00
Ryan Ernst	0a4a81afaf	Added modules, distributions now include them (just plugins installed in a diff dir)	2015-12-03 14:18:26 -08:00
Tal Levy	ffa8998f36	Merge pull request #15181 from martijnvg/ingest_geoip_only_read_mmdb_files [Ingest] The geoip processor should only try to read *.mmdb files from the geoip config directory	2015-12-03 12:17:09 -08:00
Tal Levy	56da7b32ed	add ability to define custom grok patterns within processor config	2015-12-03 08:24:07 -08:00
Tal Levy	cf1c393d70	Merge pull request #15166 from talevy/remove_pattern_utils move PatternUtils#loadBankFromStream into GrokProcessor.Factory	2015-12-03 08:08:00 -08:00
Martijn van Groningen	6acf8ec263	Removed pipeline tests with a simpler tests The PipelineTests tried to test if the configured map/list in set processor wasn't modified while documents were ingested. Creating a pipeline programmatically created more noise than the test needed. The new tests in IngestDocumentTests have the same goal, but is much smaller and clearer by directly testing against IngestDocument.	2015-12-03 15:19:04 +01:00
Jason Tedor	fbe736c9bb	Cleaner type-inference assistance	2015-12-02 10:49:35 -05:00
Jason Tedor	05430a788a	Remove and forbid use of the type-unsafe empty Collections fields This commit removes and now forbids all uses of the type-unsafe empty Collections fields Collections#EMPTY_LIST, Collections#EMPTY_MAP, and Collections#EMPTY_SET. The type-safe methods Collections#emptyList, Collections#emptyMap, and Collections#emptySet should be used instead.	2015-12-02 10:41:59 -05:00
Martijn van Groningen	9ab765b851	The geoip processor should only try to read *.mmdb files from the geoip config directory	2015-12-02 14:38:49 +01:00
Martijn van Groningen	a9ecde041b	Merge branch 'master' into feature/ingest	2015-12-02 11:21:15 +01:00
Martijn van Groningen	270a3977bc	Removed the lazy cache in DatabaseReaderService and eagerly build all available databases.	2015-12-02 11:16:02 +01:00
David Pilato	d23d8a891f	Remove "empty" licenses dir Follow up #15168 We don't need to have "fake" licenses dir anymore.	2015-12-02 10:22:52 +01:00
Ryan Ernst	d68c6673a2	Build: Cleanup precommit task gradle code This change attempts to simplify the gradle tasks for precommit. One major part of that is using a "less groovy style", as well as being more consistent about how tasks are created and where they are configured. It also allows the things creating the tasks to set up inter task dependencies, instead of assuming them (ie decoupling from tasks eleswhere in the build).	2015-12-01 22:36:54 -08:00
Tal Levy	767bd1d4d5	move PatternUtils#loadBankFromStream into GrokProcessor.Factory	2015-12-01 15:46:02 -08:00
javanna	6c0510b01d	Make rename processor less error prone Rename processor now checks whether the field to rename exists and throws exception if it doesn't. It also checks that the new field to rename to doesn't exist yet, and throws exception otherwise. Also we make sure that the rename operation is atomic, otherwise things may break between the remove and the set and we'd leave the document in an inconsistent state. Note that the requirement for the new field name to not exist simplifies the usecase for e.g. { "rename" : { "list.1": "list.2"} } as such a rename wouldn't be accepted if list is actually a list given that either list.2 already exists or the index is out of bounds for the existing list. If one really wants to replace an existing field, that field needs to be removed first through remove processor and then rename can be used.	2015-12-01 19:58:24 +01:00
Martijn van Groningen	15b6708a5d	and now make use of the lifecycle infrastructure	2015-12-01 18:20:25 +01:00
Tal Levy	8e4c288b5c	Merge pull request #15132 from talevy/no_match_for_grok [Ingest] No match for grok	2015-12-01 09:13:11 -08:00
Tal Levy	2c1effdd41	throw exception when grok processor does not match	2015-12-01 08:58:58 -08:00
Martijn van Groningen	9dd52ad7d3	Removed pollution from the Processor.Factory interface. 1) It no longer extends from Closeable. 2) Removed the config directory setter. Implementation that relied on it, now get the location to the config dir via their constructors.	2015-12-01 17:32:37 +01:00
Martijn van Groningen	fa9fcb3b11	geo processor should add a list of doubles instead of an array to the ingest document	2015-12-01 17:12:34 +01:00
Martijn van Groningen	99a4295330	If a list or map value gets set on ingest document a deep copy needs to be made. If this is not done this can lead to processor configuration being changed by an bulk or index request.	2015-12-01 16:02:04 +01:00
javanna	c67a332486	Query DSL: Enforce distance is greater than 0 in geo distance query Validation is not done as part of the distance setter method and tested in GeoDistanceQueryBuilderTests. Fixed GeoDistanceTests to adapt to the new validation. Closes #15135	2015-12-01 14:07:32 +01:00
Martijn van Groningen	4402da1af0	also change the tests to deal with Exception instead of IOException	2015-11-30 15:45:40 +01:00
Martijn van Groningen	dde274d944	Replaced IOException with Exception on factory implementations' `Processor.Factory#create(Map)` method.	2015-11-30 15:37:16 +01:00
Britta Weber	d8a1a4bd43	fix toXContent() for mapper attachments field We must use simpleName() instead of name() because otherwise when the mapping is generated as a string the field name will be the full path with dots and that is illegal from es 2.0 on. closes https://github.com/elastic/elasticsearch-mapper-attachments/issues/169	2015-11-30 15:28:12 +01:00
Martijn van Groningen	fdf4543b8e	Renamed `add` processor to `set` processor. This name makes more sense, because if a field already exists it overwrites it.	2015-11-30 15:03:20 +01:00
javanna	43b861b076	IngestDocument to support accessing and modifying list items When reading, through #getFieldValue and #hasField, and a list is encountered, the next element in the path is treated as the index of the item that the path points to (e.g. `list.0.key`). If the index is not a number or out of bounds, an exception gets thrown. Added #appendFieldValue method that has the same behaviour as setFieldValue, but when a list is the last element in the path, instead of replacing the whole list it will simply add a new element to the existing list. This method is currently unused, we have to decide whether the set processor or a new processor should use it. A few other changes made: - Renamed hasFieldValue to hasField, as this method is not really about values but only keys. It will return true if a key is there but its value is null, while it returns false only when a field is not there at all. - Changed null semantic in getFieldValue. null gets returned only when it was an actual value in the source, an exception is thrown otherwise when trying to access a non existing field, so that null != field not present. - Made remove stricter about non existing fields. Throws error when trying to remove a non existing field. This is more consistent with the other methods in IngestDocument which are strict about fields that are not present. Relates to #14324	2015-11-30 13:58:03 +01:00
Jim Ferenczi	e182072b6f	Merge pull request #15017 from jimferenczi/fields_option Refuse to load fields from _source when using the `fields` option and support wildcards.	2015-11-30 11:01:21 +01:00
Jim Ferenczi	731833cfc6	Fixes #14489 Do not to load fields from _source when using the `fields` option. Non stored (non existing) fields are ignored by the fields visitor when using the `fields` option. Fixes #10783 Support * wildcard to retrieve stored fields when using the `fields` option. Supported pattern styles are "xxx", "xxx", "xxx" and "xxx*yyy".	2015-11-30 11:00:32 +01:00
Martijn van Groningen	467a47670c	Merge remote-tracking branch 'es/master' into feature/ingest	2015-11-30 10:24:27 +01:00
Robert Muir	a2816ec574	don't assert exact expected lengths for documents. This will change depending on newline of the operating system	2015-11-29 11:48:54 -05:00
Robert Muir	415c37340a	do not assert charset for mapper-attachments tests. Its enough to test the content type for what we are testing. Currently tests are flaky if charset is detected as e.g. windows-1252 vs iso-8859-1 and so on. In fact, they fail on windows 100% of the time. We are not trying to test charset detection heuristics (which might be different even due to newlines in tests or other things). If we want to do test that, we should test it separately.	2015-11-29 11:40:27 -05:00
javanna	b4b698f653	Merge branch 'master' into feature/ingest	2015-11-28 11:15:25 +01:00
Simon Willnauer	65b661b1f4	[TEST] Fix MapperUpgrade tests to use a dedicated master to ensure dangeling index import works predictably When importing dangling indices on a single node that is data and master eligable the async dangling index call can still be in-flight when the cluster is checked for green / yellow. Adding a dedicated master node and a data only node that does the importing fixes this issus just like we do in OldIndexBackwardsCompatibilityIT	2015-11-27 10:32:21 +01:00
Martijn van Groningen	0fe1b4eab1	PipelineStore no longer is a lifecycle component Client in PipelineStore gets provided via a guice provider Processor and Factory throw Exception instead of IOException Removed PipelineExecutionService.Listener with ActionListener	2015-11-26 18:12:15 +01:00
Martijn van Groningen	9aff8c6352	fix compile errors	2015-11-26 17:00:16 +01:00
Martijn van Groningen	9d1fa0d6da	ingest: Add `meta` processor that allows to modify the metadata attributes of document being processed	2015-11-26 15:46:32 +01:00
Martijn van Groningen	afc9069c99	* Inlined PipelineStoreClient class into the PipelineStore class * Moved PipelineReference to a top level class and named it PipelineDefinition * Pulled some logic from the crud transport classes to the PipelineStore * Use IOUtils#close(...) where appropriate	2015-11-26 14:42:39 +01:00
Simon Willnauer	9f6598b18d	Fix compile errors	2015-11-26 13:41:00 +01:00
javanna	1a7391070f	Simulate api improvements Move ParsedSimulateRequest to SimulatePipelineRequest and remove Parser class in favor of static parse methods. Simplified execute methods in SimulateExecutionService.	2015-11-26 13:24:45 +01:00
Martijn van Groningen	2890432421	made updatePipelines() to not make it prone to race conditions	2015-11-25 18:32:56 +01:00
javanna	5d510b59c8	use MetaData enum for metadata field names Also rename getName to getFieldName in MetaData to prevent confusion with name() enum method.	2015-11-25 18:08:53 +01:00
javanna	ec162c458e	Replace property with field in IngestDocument getPropertyValue => getFieldValue hasPropertyValue => hasFieldValue setPropertyValue => setFieldValue removeProperty => removeField	2015-11-25 18:08:53 +01:00
Luca Cavanna	e15fa99ee3	Merge pull request #15019 from javanna/enhancement/java8_date_parser date formats: use a function instead of our own interface	2015-11-25 16:17:39 +01:00
javanna	c4cf55c196	[TEST] generate random timezone out of the available ones in joda	2015-11-25 15:58:01 +01:00
javanna	5daa73b350	date formats: use a function instead of our own interface Also turn the different date formats into an enum.	2015-11-25 15:37:35 +01:00
javanna	4759a6e50f	Merge branch 'master' into feature/ingest	2015-11-25 14:59:10 +01:00
Adrien Grand	e8520bf519	Tests: For single data path for *FieldMapperUpgradeTests.	2015-11-25 11:46:19 +01:00
javanna	e0fcee642e	[TEST] fix locale comparison	2015-11-25 10:26:46 +01:00
javanna	388e637fa9	add a few more asserts to IngestActionFilterTests	2015-11-24 19:43:54 +01:00
Adrien Grand	aad84395c9	Add a test that upgrades succeed even if a mapping contains fields that come from a plugin.	2015-11-24 19:14:19 +01:00
javanna	49bfe6410e	Rename Data leftovers	2015-11-24 15:49:25 +01:00
javanna	8f1f5d4da0	Split mutate processor into one processor per function	2015-11-24 14:31:53 +01:00
Martijn van Groningen	1e9d5c7b22	test: also test what happens if all index requests fail to be processed by the pipeline	2015-11-24 13:41:40 +01:00
Martijn van Groningen	8b1f117e51	Instead of failing the entire bulk request if the pipeline fails, only fail a bulk item.	2015-11-24 12:40:30 +01:00
javanna	eeb51ce8d0	Merge branch 'master' into feature/ingest	2015-11-24 10:23:53 +01:00
Adrien Grand	5f33fbdb75	Register field mappers at the node level. This moves the registration of field mappers from the index level to the node level and also ensures that mappers coming from plugins are treated no differently from core mappers.	2015-11-24 08:59:37 +01:00
Michael McCandless	e13b0d4bde	upgrade lucene to 5.4.0-snapshot-1715952	2015-11-23 17:13:49 -05:00
Martijn van Groningen	ecc8158b89	renamed Data to IngestDocument moved all metadata related fields to a single metadata map removed specific metadata getters with a generic getMetadata()	2015-11-23 18:24:42 +01:00
David Pilato	5b0e2823b1	Merge branch 'docs/mapper-attachments'	2015-11-23 12:14:31 +01:00

... 6 7 8 9 10 ...

1575 Commits