OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ali Beyad	05998224d8	Adding repository index generational files Before, a repository would maintain an index file (named 'index') per repository, that contained the current snapshots in the repository. This file was not atomically written, so repositories had to depend on listing the blobs in the repository to determine what the current snapshots are, and only rely on the index file if the repository does not support the listBlobs operation. This could cause an incorrect view of the current snapshots in the repository if any prior snapshot delete operations failed to delete snapshot metadata files. This commit introduces the atomic writing of the index file, and because atomic writes are not guaranteed if the file already exists, we write to a generational index file (index-N, where N is the current generation). We also maintain an index-latest file that contains the current generation, for those repositories that cannot list blobs. Closes #19002 Relates #18156	2016-07-01 17:52:57 -04:00
Ryan Ernst	e707f0ea6e	Simplify ingest useragent construction	2016-07-01 14:21:41 -07:00
Ryan Ernst	10261a615b	Update ingest useragent plugin to use new ingest plugin	2016-07-01 14:16:09 -07:00
Ryan Ernst	e5caadc4f3	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:35:26 -07:00
Ryan Ernst	65c9b0b588	Merge branch 'master' into ingest_plugin_api	2016-07-01 09:26:17 -07:00
Tanguy Leroux	0a293fad29	Remove some unused code	2016-07-01 17:01:39 +02:00
Tanguy Leroux	8c40b2b54e	Fix order of modifiers	2016-07-01 16:57:14 +02:00
Christoph Wurm	368c3ccc54	Fix Factory test	2016-07-01 16:53:17 +02:00
Christoph Wurm	4c32b025d4	Fix Processor: Now implements Processor.Factory	2016-07-01 16:13:51 +02:00
Simon Willnauer	5c8164a561	Clean up BytesReference (#19196) BytesReference should be a really simple interface, yet it has a gazillion ways to achieve the same thing. Methods like `#hasArray`, `#toBytesArray`, `#copyBytesArray`, `#toBytesRef`, `#bytes` are all really duplicates. This change simplifies the interface dramatically and makes implementations of it much simpler. All array access has been removed and is streamlined through a single `#toBytesRef` method. Utility methods to materialize a compact byte array have been added too for convenience.	2016-07-01 16:09:31 +02:00
Christoph Wurm	42addb5692	Add ingest-useragent plugin (#19074)	2016-07-01 15:49:43 +02:00
javanna	598c36128e	Revert "Raised IOException on deleteBlob (#18815)" This reverts commit `d24cc65cad` as it seems to be causing test failures.	2016-07-01 11:00:32 +02:00
Tanguy Leroux	9bfc23e958	Add missing permission to repository-s3 Repository-S3 needs a special permission because of problems in AmazonS3Client: when no region is set on an AmazonS3Client instance, the AWS SDK loads all known partitions from a JSON file and uses a Jackson ObjectMapper for that: this one, in version 2.5.3 with the default binding options, tries to suppress access checks of ctor/field/method and thus requires this special permission. AWS must be fixed to use Jackson correctly and have the correct modifiers on bound classes. This must be fixed in the AWS SDK (see https://github.com/aws/aws-sdk-java/issues/766) but in the meantime we have no choice. closes #18539	2016-07-01 10:32:32 +02:00
gfyoung	d24cc65cad	Raised IOException on deleteBlob (#18815) Raise IOException on deleteBlob if the blob doesn't exist This commit raises an IOException on BlobContainer#deleteBlob if the blob does not exist, in conformance with the BlobContainer interface contract. Each implementation of BlobContainer now conforms to this contract (file system, S3, Azure, HDFS). This commit also contains blob container tests for each of the repository implementations. Closes #18530	2016-06-30 23:00:10 -04:00
David Pilato	0c3ce1fac2	Merge branch 'master' into pr/15724-gce-network-host-master	2016-06-30 18:11:57 +02:00
Ryan Ernst	c762e7aa15	Merge branch 'master' into rest_handler_client	2016-06-30 08:16:25 -07:00
Ryan Ernst	0732004ae8	Merge pull request #19177 from rjernst/ingest_factory_generic Remove generics from ingest Processor.Factory	2016-06-30 08:08:26 -07:00
David Pilato	648b7b82b4	Fix method name typo	2016-06-30 15:32:52 +02:00
David Pilato	d78afc26ea	Fix classname Package was removed by mistake	2016-06-30 15:30:49 +02:00
David Pilato	7c7abc349c	Fix checkstyle issues	2016-06-30 15:21:30 +02:00
David Pilato	a029c147a3	Update plugin description	2016-06-30 15:16:51 +02:00
David Pilato	8a2b27076e	Merge branch 'master' into pr/19144-discovery-azure-classic # Conflicts: # plugins/discovery-azure-classic/LICENSE.txt	2016-06-30 14:46:21 +02:00
David Pilato	527a9c7f48	Deprecate discovery-azure and rename it to discovery-azure-classic As discussed at https://github.com/elastic/elasticsearch-cloud-azure/issues/91#issuecomment-229113595, we know that the current `discovery-azure` plugin only works with Azure Classic VMs / Services (which are somewhat legacy now). The proposal here is to rename `discovery-azure` to `discovery-azure-classic` in case some users are using it, and to deprecate it for 5.0. Closes #19144.	2016-06-30 14:42:40 +02:00
David Pilato	cd6535ea9b	LICENSE.txt is not needed in plugin root dir We have licenses in licenses dir and the global license for the whole project is in the root dir so this file is not needed here.	2016-06-30 14:07:01 +02:00
David Pilato	74d5fb3197	LICENSE.txt is not needed in plugin root dir We have licenses in licenses dir and the global license for the whole project is in the root dir so this file is not needed here.	2016-06-30 14:06:31 +02:00
David Pilato	2dee980a1a	LICENSE.txt is not needed in plugin root dir We have licenses in licenses dir and the global license for the whole project is in the root dir so this file is not needed here.	2016-06-30 14:05:29 +02:00
David Pilato	daf08ace1e	Fix after merge with master	2016-06-30 12:04:19 +02:00
David Pilato	a9e93a0da4	Merge branch 'master' into pr/15724-gce-network-host-master # Conflicts: # plugins/discovery-gce/src/main/java/org/elasticsearch/plugin/discovery/gce/GceDiscoveryPlugin.java # plugins/discovery-gce/src/test/java/org/elasticsearch/discovery/gce/GceDiscoverTests.java # plugins/discovery-gce/src/test/java/org/elasticsearch/discovery/gce/GceDiscoveryTests.java	2016-06-30 11:53:09 +02:00
David Pilato	1ad3d2251f	Fix line width	2016-06-30 11:43:07 +02:00
Ryan Ernst	e4f265eb3a	Ingest: Remove generics from Processor.Factory The factory for ingest processor is generic, but that is only for the return type of the create method. However, the actual consumer of the factories only cares about Processor, so generics are not needed. This change removes the generic type from the factory. It also removes AbstractProcessorFactory which only existed in order to pull the optional tag from config. This functionality is moved to the caller of the factories in ConfigurationUtil, and the create method now takes the tag. This allows the covariant return of the implementation to work with tests not needing casts.	2016-06-30 02:33:54 -07:00
David Pilato	f9d22b3598	Add more javadoc and rename test	2016-06-30 11:32:39 +02:00
David Pilato	66e3b15d21	Fix NPE when GCE region is empty When GCE region is empty we get back from the API something like: ``` { "id": "dummy" } ``` instead of: ``` { "id": "dummy", "items":[ ] } ``` This generates an NPE when we aggregate all the lists into a single one. Closes #16967.	2016-06-30 11:12:20 +02:00
Ryan Ernst	08b3b6264e	Tests pass, started removing generics from processor factory	2016-06-30 01:49:22 -07:00
Ryan Ernst	865b951b7d	Internal: Changed rest handler interface to take NodeClient Previously all rest handlers would take Client in their injected ctor. However, it was only to hold the client around for runtime. Instead, this can be done just once in the HttpService which handles rest requests, and passed along through the handleRequest method. It also should always be a NodeClient, and other types of Clients (eg a TransportClient) would not work anyways (and some handlers can be simplified in follow ups like reindex by taking NodeClient).	2016-06-29 18:02:18 -07:00
Ryan Ernst	f1376262fe	Merge branch 'master' into ingest_plugin_api	2016-06-29 14:16:16 -07:00
Ryan Ernst	6590e77c1a	Plugins: Make plugins closeable This change allows Plugin implementations to implement Closeable when they have resources that should be released. As a first example of how this can be used, I switched over ingest plugins, which just had the geoip processor. The ingest framework had chains of closeable to support this, which are now removed.	2016-06-28 16:16:26 -07:00
Ryan Ernst	258c3e86ab	Added IngestPlugin api, cutover common and geoip, changed ingest factory api to take ProcessorsRegistry	2016-06-28 10:52:07 -07:00
Tanguy Leroux	c557663b90	Make discovery-azure work again The discovery-plugin has been broken since 2.x because the code was not compliant with the security manager and because plugins have been refactored. closes #18637, #15630	2016-06-28 10:05:44 +02:00
David Pilato	af989c0780	Support new Asia Pacific (Mumbai) ap-south-1 AWS region AWS [announced](http://www.allthingsdistributed.com/2016/06/introducing-aws-asia-pacific-mumbai-region.html) a new region: Asia Pacific (Mumbai) `ap-south-1`. We need to support it for: * repository-s3: s3.ap-south-1.amazonaws.com or s3-ap-south-1.amazonaws.com * discovery-ec2: ec2.ap-south-1.amazonaws.com For reference: http://docs.aws.amazon.com/general/latest/gr/rande.html Closes #19110.	2016-06-28 08:50:50 +02:00
Ryan Ernst	33ccc5aead	Merge branch 'master' into mapper_plugin_api	2016-06-27 11:19:59 -07:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Tanguy Leroux	59762fe487	Register group setting for repository-azure accounts Since the Settings infrastructure has been improved, a group setting must be registered by the repository-azure plugin to allow settings like "cloud.azure.storage.my_account.account" to be coherent with Azure plugin documentation.	2016-06-27 12:04:59 +02:00
Nik Everett	71b95fb63c	Switch analysis from push to pull Instead of plugins calling `registerTokenizer` to extend the analyzer they now instead have to implement `AnalysisPlugin` and override `getTokenizer`. This lines up extending plugins with extending scripts. This allows `AnalysisModule` to construct the `AnalysisRegistry` immediately as part of its constructor which makes testing analysis much simpler. This also moves the default analysis configuration into `AnalysisModule` which is how search is set up. Like `ScriptModule`, `AnalysisModule` no longer extends `AbstractModule`. Instead it is only responsible for building `AnalysisRegistry`. We still bind `AnalysisRegistry` but we only do so in `Node`. This means it is available at module construction time so we can slowly remove the need to bind it in guice.	2016-06-26 07:15:42 -04:00
Ryan Ernst	6995bde710	Merge branch 'master' into mapper_plugin_api	2016-06-24 11:15:06 -07:00
David Pilato	d73194b9b7	Remove settings filtering for service_account in GCS repository Related to #18945 and to this `35d3bdab84 (commitcomment-17914150)` In GCS Repository plugin we defined a `service_account` setting which is defined as `Property.Filtered`. It's not needed as it's only a path to a file. Closes #18946	2016-06-24 10:58:29 +02:00
Jason Tedor	112669daed	Merge branch 'master' into feature/seq_no * master: (416 commits) docs: removed obsolete information, percolator queries are not longer loaded into jvm heap memory. Upgrade JNA to 4.2.2 and remove optionality [TEST] Increase timeouts for Rest test client (#19042) Update migrate_5_0.asciidoc Add ThreadLeakLingering option to Rest client tests Add a MultiTermAwareComponent marker interface to analysis factories. #19028 Attempt at fixing IndexStatsIT.testFilterCacheStats. Fix docs build. Move templates out of the Search API, into lang-mustache module revert - Inline reroute with process of node join/master election (#18938) Build valid slices in SearchSourceBuilderTests Docs: Convert aggs/misc to CONSOLE Docs: migration notes for _timestamp and _ttl Group client projects under :client [TEST] Add client-test module and make client tests use randomized runner directly Move upgrade test to upgrade from version 2.3.3 Tasks: Add completed to the mapping Fail to start if plugin tries broken onModule Remove duplicated read byte array methods Rename `fields` to `stored_fields` and add `docvalue_fields` ...	2016-06-23 11:52:11 -04:00
Adrien Grand	7ba5bceebe	Add a MultiTermAwareComponent marker interface to analysis factories. #19028 This is the same as what Lucene does for its analysis factories, and we have tests that make sure that the elasticsearch factories are in sync with Lucene's. This is a first step to move forward on #9978 and #18064.	2016-06-23 10:19:24 +02:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Ryan Ernst	e817b5daa3	Plugins: Remove guice from Mapper plugins This change adds a MapperPlugin interface which allows pull style retrieval of mappers and metadata mappers added by plugins. For now, I have kept the MapperRegistry, but this should be removed in the future as it is just a silly container for 2 maps which could themselves be passed around.	2016-06-21 22:50:39 -07:00
Martijn van Groningen	82f7bfad98	ingest: merged o.e.ingest.core with o.e.ingest and in ingest-common module added o.e.ingest.common package and moved all code to that package.	2016-06-21 09:24:00 +02:00
Simon Willnauer	7fea5bd8e7	Remove obsolete Modules that can simply be inlined in node creation	2016-06-20 11:28:14 +02:00
Simon Willnauer	260f38fd76	Remove VersionModule and use Version#current consistently. We pretended to be able to act like a different version node for so long it's time to be honest and remove this ability. It's just confusing and where needed and tested we should build dedicated extension points.	2016-06-20 10:55:52 +02:00
Tanguy Leroux	98951b1203	Compile each Groovy script in its own classloader closes #18572	2016-06-20 08:17:09 +02:00
Areek Zillur	9356a6090f	Merge branch 'master' into enhancement/rollover_api	2016-06-17 11:35:57 -04:00
David Pilato	bf7a6f5509	Merge branch 'pr/update-aws-sdk' # Conflicts: # plugins/repository-s3/src/main/java/org/elasticsearch/plugin/repository/s3/S3RepositoryPlugin.java	2016-06-17 17:23:12 +02:00
Simon Willnauer	bdb6dcea3a	Cleanup ClusterService dependencies and detached from Guice (#18941) This change removes some unnecessary dependencies from ClusterService and cleans up ClusterName creation. ClusterService is now not created by guice anymore.	2016-06-17 17:07:19 +02:00
David Pilato	82fee9f7a7	Revert change about registering Repository settings Will create another issue to change that. Related to this discussion: `f4cd3bd348 (r67291936)`	2016-06-17 17:02:00 +02:00
Areek Zillur	545ffa7801	Merge branch 'master' into enhancement/rollover_api	2016-06-17 10:33:11 -04:00
Adrien Grand	600cbb6ab0	Upgrade to Lucene 6.1.0. #18926	2016-06-17 09:03:00 +02:00
Areek Zillur	6adffa6b7b	Merge branch 'master' into enhancement/rollover_api	2016-06-16 17:27:32 -04:00
Ryan Ernst	8196cf01e3	Merge branch 'master' into plugin_name_api	2016-06-16 13:49:28 -07:00
Simon Willnauer	b22c526b34	Cut over settings registration to a pull model (#18890) Today we have a push model for registering basically anything. All our extension points are defined on modules which we pass in to plugins. This is harder to maintain and adds unnecessary dependencies on the modules themselves. This change moves towards a pull model where the plugin offers a getter kind of method to get the extensions. This will also help in the future if we need to pass dependencies to the extension points which can easily be defined on the method as arguments if a pull model is used.	2016-06-16 15:52:58 +02:00
Simon Willnauer	18ff051ad5	Simplify ScriptModule and script registration (#18903) Registering a script engine or native scripts still uses Guice today and is much more complicated than needed. This change moves to a pull based model where script plugins have to implement a dedicated interface `ScriptPlugin` and define simple getters returning instances rather than classes.	2016-06-16 09:35:13 +02:00
David Pilato	b036d238f5	Fix typo	2016-06-16 05:51:39 +02:00
Ryan Ernst	a4503c2aed	Plugins: Remove name() and description() from api In 2.0 we added plugin descriptors which require defining a name and description for the plugin. However, we still have name() and description() which must be overridden from the Plugin class. This still exists for classpath plugins. But classpath plugins are mainly for tests, and even then, referring to classpath plugins with their class is a better idea. This change removes name() and description(), replacing the name for classpath plugins with the full class name.	2016-06-15 17:12:22 -07:00
Ryan Ernst	7f6e0c6c02	fix compile for ingest plugin lambda	2016-06-15 16:57:01 -07:00
Tal Levy	a26260fb72	new ScriptProcessor for Ingest (#18193) add new ScriptProcessor for executing ES Scripts within pipelines	2016-06-15 14:57:18 -07:00
David Pilato	63223928dc	Merge branch 'master' into pr/update-aws-sdk	2016-06-15 14:48:41 +02:00
Adrien Grand	44c653f5a8	Upgrade to lucene-6.1.0-snapshot-3a57bea.	2016-06-10 16:18:12 +02:00
Martijn van Groningen	3dd3ed4905	ingest: Upgrade geoip processor's dependencies and database files The database files have been doubled in size compared to the previous files being used. For this reason the database files are now gzip compressed, which required using `GZIPInputStream` when loading database files.	2016-06-08 18:41:48 +02:00
Jason Tedor	d896886973	Merge branch 'master' into feature/seq_no * master: (51 commits) Switch QueryBuilders to new MatchPhraseQueryBuilder Added method to allow creation of new methods on-the-fly. more cleanups Remove cluster name from data path Remove explicit parallel new GC flag rehash the docvalues in DocValuesSliceQuery using BitMixer.mix instead of the naive Long.hashCode. switch FunctionRef over to methodhandles ingest: Move processors from core to ingest-common module. Fix some typos (#18746) Fix ut convert FunctionRef/Def usage to methodhandles. Add the ability to partition a scroll in multiple slices. API: use painless types in FunctionRef Update ingest-node.asciidoc compute functional interface stuff in Definition Use method name in bootstrap check might fork test Make checkstyle happy (add Lookup import, line length) Don't hide LambdaConversionException and behave like real javac compiled code when a conversion fails. This works anyways, because fallback is allowed to throw any Throwable Pass through the lookup given by invokedynamic to the LambdaMetaFactory. Without it real lambdas won't work, as their implementations are private to script class checkstyle have your upper L ...	2016-06-07 17:57:53 -04:00
Martijn van Groningen	f611f1c99e	ingest: Move processors from core to ingest-common module. Folded grok processor into ingest-common module. The rest tests have been moved to ingest-common module as well, because these tests don't run in the rest-api-spec module but in the distribution:integ-test-zip module and adding a test plugin there felt just wrong to me. I think this is ok. I left a tiny ingest rest test behind that tests with an empty pipeline. Removed messy tests, these tests were already covered in the rest tests Added ingest test plugin in test infra so that each module testing integration with ingest doesn't need to write its own plugin Moved reindex ingest tests to qa module Closes #18490	2016-06-07 17:32:52 +02:00
Jason Tedor	da74323141	Register thread pool settings This commit refactors the handling of thread pool settings so that the individual settings can be registered rather than registering the top level group. With this refactoring, individual plugins must now register their own settings for custom thread pools that they need, but a dedicated API is provided for this in the thread pool module. This commit also renames the prefix on the thread pool settings from "threadpool" to "thread_pool". This enables a hard break on the settings so that: - some of the settings can be given more sensible names (e.g., the max number of threads in a scaling thread pool is now named "max" instead of "size") - change the soft limit on the number of threads in the bulk and indexing thread pools to a hard limit - the settings names for custom plugins for thread pools can be prefixed (e.g., "xpack.watcher.thread_pool.size") - remove dynamic thread pool settings Relates #18674	2016-06-06 22:09:12 -04:00
Areek Zillur	d96fe20e3a	add named writable registry glue	2016-06-06 16:11:46 -04:00
Jason Tedor	a60b8948ba	Merge branch 'master' into feature/seq_no * master: (184 commits) Add back pending deletes (#18698) refactor matrix agg documentation from modules to main agg section Implement ctx.op = "delete" on _update_by_query and _reindex Close SearchContext if query rewrite failed Wrap lines at 140 characters (:qa projects) Remove log file painless: Add support for the new Java 9 MethodHandles#arrayLength() factory (see https://bugs.openjdk.java.net/browse/JDK-8156915) More complete exception message in settings tests Use java from path if JAVA_HOME is not set Fix uncaught checked exception in AzureTestUtils [TEST] wait for yellow after setup doc tests (#18726) Fix recovery throttling to properly handle relocating non-primary shards (#18701) Fix merge stats rendering in RestIndicesAction (#18720) [TEST] mute RandomAllocationDeciderTests.testRandomDecisions Reworked docs for index-shrink API (#18705) Improve painless compile-time exceptions Adds UUIDs to snapshots Add test rethrottle test case for delete-by-query Do not start scheduled pings until transport start Adressing review comments ...	2016-06-06 11:16:22 -04:00
Jason Tedor	974c753bf6	Fix uncaught checked exception in AzureTestUtils This commit fixes an uncaught checked IOException now thrown in AzureTestUtils after `3adaf09675`.	2016-06-03 14:17:25 -04:00
Jason Tedor	bbd5f26d45	Merge branch 'master' into rjernst-placeholder * master: (911 commits) [TEST] wait for yellow after setup doc tests (#18726) Fix recovery throttling to properly handle relocating non-primary shards (#18701) Fix merge stats rendering in RestIndicesAction (#18720) [TEST] mute RandomAllocationDeciderTests.testRandomDecisions Reworked docs for index-shrink API (#18705) Improve painless compile-time exceptions Adds UUIDs to snapshots Add test rethrottle test case for delete-by-query Do not start scheduled pings until transport start Adressing review comments Only filter intial recovery (post API) when shrinking an index (#18661) Add tests to check that toQuery() doesn't return null Removing handling of null lucene query where we catch this at parse time Handle empty query bodies at parse time and remove EmptyQueryBuilder Mute failing assertions in IndexWithShadowReplicasIT until fix Remove allow running as root Add upgrade-not-supported warning to alpha release notes remove unrecognized javadoc tag from matrix aggregation module set ValuesSourceConfig fields as private Adding MultiValuesSource support classes and documentation to matrix stats agg module ...	2016-06-03 13:32:03 -04:00
David Pilato	ef6e43e18d	We don't need many URLs here but just one	2016-06-03 18:27:27 +02:00
David Pilato	a1496f8e21	Fix getResource to call it from the current class	2016-06-03 18:26:48 +02:00
David Pilato	e7ab0a9233	Remove GceMetadataService interface as not needed	2016-06-03 18:14:00 +02:00
David Pilato	b0bdd443bd	Don't use String concatenation but actual URI	2016-06-03 18:04:55 +02:00
David Pilato	1eb022de11	Use a unique GCE_HOST setting	2016-06-03 17:33:09 +02:00
David Pilato	711515ac80	Rename GceComputeService to GceInstancesService	2016-06-03 17:26:53 +02:00
David Pilato	b9989f88d0	Merge branch 'master' into pr/15724-gce-network-host-master	2016-06-03 17:22:24 +02:00
Ali Beyad	b720216395	Adds UUIDs to snapshots This commit adds a UUID for each snapshot, in addition to the already existing repository and snapshot name. The addition of UUIDs will enable more robust handling of the deletion of previous snapshots and lingering files from partially failed delete operations, on top of being able to uniquely track each snapshot. Closes #18228 Relates #18156	2016-06-02 17:01:48 -04:00
Alexander Kazakov	f32f35bec4	Register "cloud.node.auto_attributes" setting (#18678)	2016-06-01 13:27:11 +02:00
Adrien Grand	d182e171a4	Upgrade to Lucene 6.0.1.	2016-06-01 10:31:10 +02:00
Ali Beyad	0efac76f01	Clarify the semantics of the BlobContainer interface This commit clarifies the behavior that must be adhered to by any implementors of the BlobContainer interface. This is done through expanded Javadocs. Closes #18157 Closes #15580	2016-05-31 19:22:55 -04:00
Simon Willnauer	502a775a7c	Add primitive to shrink an index into a single shard (#18270) This adds a low-level primitive operation to shrink an existing index into a new index with a single shard. This primitive expects all shards of the source index to be allocated on a single node. Once the target index is initializing on the shrink node it takes a snapshot of the source index shards and copies all files into the target index's data folder. An [optimization](https://issues.apache.org/jira/browse/LUCENE-7300) coming in Lucene 6.1 will also allow for optional constant time copy if hard-links are supported by the filesystem. All mappings are merged into the new index's metadata once the snapshots have been taken on the merge node. To shrink an existing index all shards must be moved to a single node (one instance of each shard) and the index must be read-only: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_settings' -d '{ "settings" : { "index.routing.allocation.require._name" : "shrink_node_name", "index.blocks.write" : true } }' ``` Once all shards are started on the shrink node, the new index can be created via: ```BASH $ curl -XPUT 'http://localhost:9200/logs/_shrink/logs_single_shard' -d '{ "settings" : { "index.codec" : "best_compression", "index.number_of_replicas" : 1 } }' ``` This API will perform all needed checks before the new index is created and selects the shrink node based on the allocation of the source index. This call returns immediately; to monitor shrink progress, the recovery API should be used since all copy operations are reflected in the recovery API with byte copy progress etc. The shrink operation does not modify the source index. If a shrink operation should be canceled or if the shrink failed, the target index can simply be deleted and all resources are released.	2016-05-31 10:41:44 +02:00
David Pilato	63622aa6b6	Fix log use_throttle_retries	2016-05-27 12:48:05 +02:00
David Pilato	623a5b7a85	Merge branch 'master' into pr/update-aws-sdk	2016-05-27 10:13:35 +02:00
David Pilato	a445654123	Fix after review * changes `throttle_retries` to `use_throttle_retries` * removes registering of all individual repository settings when the plugin starts. Not needed * adds more comment about deprecated method in AWS SDK we need to implement though in a Delegate class within our tests	2016-05-27 10:13:16 +02:00
Boaz Leskes	318a4e3ef6	Introduce dedicated master nodes in testing infrastructure (#18514) This PR changes the InternalTestCluster to support dedicated master nodes. The creation of dedicated master nodes can be controlled using a new `supportsMasterNodes` parameter to the ClusterScope annotation. If set to true (the default), dedicated master nodes will randomly be used. If set to false, no master nodes will be created and data nodes will also be allowed to become masters. If active, test runs will have either 1 or 3 master nodes.	2016-05-27 08:44:20 +02:00
Jason Tedor	9d39b05845	Remove deprecation suppression Failing the build on deprecation warnings was removed in `19b3ec88af`. This commit removes the suppressed deprecation warnings so that their use is surfaced in the build now. Relates #18582	2016-05-25 17:15:36 -04:00
Tanguy Leroux	bdee8c2632	Disable XContent auto closing of object and arrays	2016-05-25 16:46:09 +02:00
David Pilato	c4d3bf472b	Fix comment and rename blob_container to blobContainer	2016-05-25 11:02:11 +02:00
David Pilato	fd602cc037	Merge branch 'master' into azure/fix-delete	2016-05-25 10:53:04 +02:00
Tanguy Leroux	1f011f9dea	Remove Delete-By-Query plugin closes #18469	2016-05-24 13:28:20 +02:00
Adrien Grand	459916f5dd	Remove custom Base64 implementation. #18413 This replaces o.e.common.Base64 with java.util.Base64.	2016-05-23 11:32:42 +02:00
Tanguy Leroux	e7eb664c78	Change BlobPath.buildAsString() method	2016-05-23 10:50:40 +02:00
Jason Tedor	ad7229fe72	Merge branch 'master' into feature/seq_no * master: (158 commits) Document the hack Refactor property placeholder use of env. vars Force java9 log4j hack in testing Fix log4j buggy java version detection Make java9 work again Don't mkdir directly in deb init script Fix env. var placeholder test so it's reproducible Remove ScriptMode class in favor of boolean true/false [rest api spec] fix doc urls Netty request/response tracer should wait for send Filter client/server VM options from jvm.options [rest api spec] fix url for reindex api docs Remove use of a Fields class in snapshot responses that contains x-content keys, in favor of declaring/using the keys directly. Limit retries of failed allocations per index (#18467) Proxy box method to use valueOf. Use the build-in valueOf method instead of the custom one. Fixed tests and added a comment to the box method. Fix boxing. Do not decode path when sending error Fix race condition in snapshot initialization ...	2016-05-21 21:04:43 -04:00
Ryan Ernst	37d36f2f4c	Merge branch 'master' into java9	2016-05-21 14:19:58 -07:00
Ryan Ernst	1d40c4bbc1	Make java9 work again This change makes ES compile with java9 again, build 118. * There are a handful of changes due to failure to determine types during compile. * The attachment plugins which use tika needed to have tika upgraded in order to pick up fixes there for java 9. * azure discovery and s3 repository indirectly depend on jaxb, which is no longer in the default modules. They now add a jaxb dependency externally, and make JarHell allow for this package.	2016-05-21 09:41:51 -07:00
David Pilato	1d75ee6fb9	Merge branch 'master' into azure/fix-delete	2016-05-20 16:12:30 +02:00
David Pilato	6772517f4d	Cleanup the PR and apply advice from the review * ESBlobStore tests must move to the test framework if we want to be able to reuse them in the context of plugins. * To be able to identify more easily what are Integration Tests vs Unit Tests, this commit renames `AzureTestCase` to `AzureIntegTestCase`. * Move some debug level logs to trace level * Collapse when possible identical catch blocks * `blobNameFromUri()` no longer needs to get the container name. We just split the URI after 3 `/` and simply get the remaining part. * Added a Unit test for that * As we renamed some existing classes, checkstyle is now complaining about the line widths. * While we are at it, let's replace all calls to `execute().actionGet()` with `get()` * Move `readSettingsFromFile()` to a Util class. Note that this class might be useful for other plugins (S3/EC2/Azure-discovery for instance) so maybe we should move it to the test framework? * Replace some parts of the code with lambdas	2016-05-20 16:04:39 +02:00
javanna	63c5b31449	update shas for httpclient and httpcore	2016-05-20 14:10:55 +02:00
Tanguy Leroux	8486488627	Disable DeleteByQueryRestIT delete_by_query/10_basic/Basic delete_by_query Because of a REST test namespace conflict introduced by 18329. Issue tracked in 18469	2016-05-19 18:44:53 +02:00
David Pilato	f4cd3bd348	Merge branch 'master' into pr/update-aws-sdk	2016-05-19 16:55:21 +02:00
David Pilato	e289de6e96	Move `throttle_retries` under `repositories.s3.` prefix or per repository I initially wrongly put this setting under `cloud.aws.s3.` prefix which does not make sense. It should be placed at the same place as `max_retries`. Also applied @tlrx comments. We should set this even if max_retries is not set (when using default values). Also added some documentation about this setting.	2016-05-19 16:50:37 +02:00
Tanguy Leroux	35d3bdab84	Add Google Cloud Storage repository plugin Closes #12880	2016-05-19 13:26:23 +02:00
Jason Tedor	6e3b49c522	Fix inequality symbol in test assertion This commit fixes the inequality symbol used in a test assertion in RepositoryS3SettingsTests#testInvalidChunkBufferSizeRepositorySettings. The inequality symbol was previously backwards but fixed in commit `cad0608cdb` but fixing the inequality symbol here was missed in that commit. Closes #18449	2016-05-18 12:14:37 -04:00
David Pilato	9b247f9828	Fix remove of azure files Probably when we updated Azure SDK, we introduced a regression. Actually, we are not able to remove files anymore. For example, if you register a new azure repository, the snapshot service tries to create a temp file and then remove it. Removing does not work and you can see in logs: ``` [2016-05-18 11:03:24,914][WARN ][org.elasticsearch.cloud.azure.blobstore] [azure] can not remove [tests-ilmRPJ8URU-sh18yj38O6g/] in container {elasticsearch-snapshots}: The specified blob does not exist. ``` This fix deals with that. It now lists all the files in flat mode and removes the server and the container name from the full URL. As an example, when you are removing a blob whose full name is `https://dpi24329.blob.core.windows.net/elasticsearch-snapshots/bar/test` you need to actually call Azure SDK with `bar/test` as the path, `elasticsearch-snapshots` is the container. To run the test, you need to pass some parameters: `-Dtests.thirdparty=true -Dtests.config=/path/to/elasticsearch.yml` Where `elasticsearch.yml` contains something like: ``` cloud.azure.storage.default.account: account cloud.azure.storage.default.key: key ``` Related to #16472 Closes #18436.	2016-05-18 17:23:33 +02:00
Jason Tedor	db4809d906	Remove last vestiges of /bin/sh shebangs This commit removes the remaining /bin/sh shebangs in favor of /bin/bash. Relates #18448	2016-05-18 11:03:00 -04:00
David Pilato	d85dac7a9a	Add more logs	2016-05-18 16:43:56 +02:00
David Pilato	cfedda5291	Default azure container should be `elasticsearch-snapshots` This bug has been introduced in 5.0 when we refactored settings	2016-05-18 16:43:28 +02:00
polyfractal	72094feb12	[TEST] Add missing sort processor to tests, continued	2016-05-17 16:39:53 -04:00
Robert Muir	8d4c1befe5	Merge pull request #18364 from rmuir/nukeRunAsFloat Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-16 17:08:25 -04:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Robert Muir	8edf213492	Remove LeafSearchScript.runAsFloat(): Nothing calls it.	2016-05-15 22:59:28 -04:00
Jason Tedor	15d3d74444	Merge branch 'master' into feature/seq_no * master: (904 commits) Removes unused methods in the o/e/common/Strings class. Add note regarding thread stack size on Windows painless: restore accidentally removed test Documented fuzzy_transpositions in match query Add not-null precondition check in BulkRequest Build: Make run task you full zip distribution Build: More pom generation improvements Add test for wrong array index Take return type from "after" field. painless: build descriptor of array and field load/store in code; fix array index to adapt type not DEF Build: Add developer info to generated pom files painless: improve exception stacktraces painless: Rename the dynamic call site factory to DefBootstrap and make the inner class very short (PIC = Polymorphic Inline Cache) Remove dead code. Avoid race while retiring executors Allow only a single extension for a scripting engine Adding REST tests to ensure key_as_string behavior stays consistent [test] Set logging to 11 on reindex test [TEST] increase logger level until we know what is going on Don't allow `fuzziness` for `multi_match` types cross_fields, phrase and phrase_prefix ...	2016-05-14 20:23:59 -04:00
Robert Muir	2028691e66	painless: improve exception stacktraces closes #18319	2016-05-13 15:40:45 -04:00
Lee Hinman	9bcdafedda	Allow only a single extension for a scripting engine Previously multiple extensions could be provided, however, this can lead to confusion with on-disk scripts (ie, "foo.js" and "foo.javascript") having different content. Only a single extension is now supported. The only language currently supporting multiple extensions was the Javascript engine ("js" and "javascript"). It now only supports the `.js` extension. Relates to #10598	2016-05-13 09:54:31 -06:00
Lee Hinman	efff3918d8	Remove support for multiple languages per scripting engine	2016-05-13 09:24:31 -06:00
Lee Hinman	a4060f7436	Remove vestiges of script engine sandboxing This removes all the mentions of the sandbox from the script engine services and permissions model. This means that the following settings are no longer supported: ```yaml script.inline: sandbox script.stored: sandbox ``` Instead, only a `true` or `false` value can be specified. Since this would otherwise break the default-allow parameter for languages like expressions, painless, and mustache, all script engines have been updated to have individual settings, for instance: ```yaml script.engine.groovy.inline: true ``` Would enable all inline scripts for groovy. (they can still be overridden on a per-operation basis). Expressions, Painless, and Mustache all default to `true` for inline, file, and stored scripts to preserve the old scripting behavior. Resolves #17114	2016-05-13 09:24:31 -06:00
John Barker	531dcbf20a	Add TAG_SETTING to list of allowed tags for the ec2 discovery plugin. I am unable to set ec2 discovery tags because this setting was accidentally omitted from the register settings list in Ec2DiscoveryPlugin.java. I get this: java.lang.IllegalArgumentException: unknown setting [discovery.ec2.tag.project]	2016-05-10 16:19:46 -04:00
David Pilato	e8ddf5de2f	Merge branch 'pr/hide-s3-repositories-credentials'	2016-05-10 20:22:39 +02:00
Adrien Grand	f481492af3	Remove FieldMapper.Builder.indexName. #18219 The ability to configure index names that are different from the full name was removed in 2.0.	2016-05-10 08:17:00 +02:00
Adrien Grand	5d8f684319	Mapping cleanups. #18180 This removes dead/duplicate code and makes the `_index` field not configurable. (Configuration used to just be ignored, now we would throw an exception if any is provided.)	2016-05-10 08:14:18 +02:00
Chris Earle	5be79ed02c	Add Failure Details to every NodesResponse Most of the current implementations of BaseNodesResponse (plural Nodes) ignore FailedNodeExceptions. - This adds a helper function to do the grouping to TransportNodesAction - Requires a non-null array of FailedNodeExceptions within the BaseNodesResponse constructor - Reads/writes the array to output - Also adds StreamInput and StreamOutput methods for generically reading and writing arrays	2016-05-06 14:59:43 -04:00
Adrien Grand	7d8708716e	QueryBuilder does not need generics. #18133 QueryBuilder has generics, but those are never used: all call sites use `QueryBuilder<?>`. Only `AbstractQueryBuilder` needs generics so that the base class can contain a default implementation for setters that returns `this`.	2016-05-06 08:38:20 +02:00
Jason Tedor	784c9e5fb9	Introduce node handshake This commit introduces a handshake when initiating a light connection. During this handshake, node information, cluster name, and version are received from the target node of the connection. This information can be used to immediately validate that the target node is a member of the same cluster, and used to set the version on the stream. This will allow us to extend APIs that are used during initial cluster recovery without a major version change. Relates #15971	2016-05-04 20:06:47 -04:00
Jason Tedor	2dea449949	Remove Strings#splitStringToArray This commit removes the method Strings#splitStringToArray and replaces the call sites with invocations to String#split. There are only two explanations for the existence of this method. The first is that String#split is slightly tricky in that it accepts a regular expression rather than a character to split on. This means that if s is a string, s.split(".") does not split on the character '.', but rather splits on the regular expression '.' which splits on every character (of course, this is easily fixed by invoking s.split("\\.") instead). The second possible explanation is that (again) String#split accepts a regular expression. This means that there could be a performance concern compared to just splitting on a single character. However, it turns out that String#split has a fast path for the case of splitting on a single character and microbenchmarks show that String#split has 1.5x--2x the throughput of Strings#splitStringToArray. There is a slight behavior difference between Strings#splitStringToArray and String#split: namely, the former would return an empty array in cases when the input string was null or empty but String#split will just NPE at the call site on null and return a one-element array containing the empty string when the input string is empty. There was only one place relying on this behavior and the call site has been modified accordingly.	2016-05-04 08:12:41 -04:00
Martijn van Groningen	7aca1389e2	ingest: Add `date_index_name` processor. Closes #17814	2016-04-29 17:20:48 +02:00
Tal Levy	07c2fbf83a	Validate properties values according to database type (#17940) Fixes #17683.	2016-04-29 07:58:27 -07:00
David Pilato	c16d309c8c	Allow `_gce_` network when not using discovery gce For now we support `_gce_` only if discovery is set to `gce` and all information about GCE is provided (project_id and zone). But in some cases, people would like to only bind to `_gce_` on a single node (without any elasticsearch cluster). They could access the machine then from other machines running inside the same project. This commit adds a new GceMetadataService which is started as soon as the plugin is started so GceNameResolver can use it to resolve `_gce`. Closes #15724.	2016-04-29 16:56:24 +02:00
Yannick Welsch	37382ecfb2	Add Azure discovery tests mocking Azure management endpoint (#18004)	2016-04-29 15:54:15 +02:00
David Pilato	7cc8a1419b	Update after rebase onto master	2016-04-29 15:39:51 +02:00
David Pilato	d7eb375d24	Merge branch 'master' into pr/s3-path-style-access # Conflicts: # plugins/repository-s3/src/main/java/org/elasticsearch/cloud/aws/AwsS3Service.java # plugins/repository-s3/src/main/java/org/elasticsearch/cloud/aws/InternalAwsS3Service.java # plugins/repository-s3/src/main/java/org/elasticsearch/repositories/s3/S3Repository.java # plugins/repository-s3/src/test/java/org/elasticsearch/cloud/aws/TestAwsS3Service.java	2016-04-29 15:21:16 +02:00
David Pilato	6c7a44ccd9	Fix test in mapper attachments plugin	2016-04-29 15:02:04 +02:00
David Pilato	2636703afa	Merge branch 'master' into pr/attachments-add-test-forced-values	2016-04-29 14:55:42 +02:00
David Pilato	faa3c6ef3c	Add new UnsupportedException for EC Mock	2016-04-29 14:41:57 +02:00
David Pilato	6ef81c5dcd	S3 repositories credentials should be filtered When working on #18008 I found while reading the code that we don't filter anymore `repositories.s3.access_key` and `repositories.s3.secret_key`. Also fixed a typo in REST test	2016-04-27 14:11:17 +02:00
Alexander Reelsen	f71eb0b888	Version: Set version to 5.0.0-alpha2	2016-04-26 09:30:26 +02:00
Xu Zhang	3e4b470f83	Fix icu IndexScope setting	2016-04-22 15:03:02 -07:00
Ryan Ernst	d12a4bb51d	Merge pull request #17933 from rjernst/camelcase4 Remove camelCase support	2016-04-22 13:46:43 -07:00
xuzha	cd527c5b92	Add support for customizing the rule file in ICU tokenizer Lucene allows creating an ICUTokenizer with a special config argument that enables customization of the rule-based iterator through custom rule files. This commit enables this feature. Users can provide a list of RBBI rule files to the ICU tokenizer. closes #13146	2016-04-22 12:39:20 -07:00
Ryan Ernst	55388590c1	Remove camelCase support Now that the current uses of magical camelCase support have been deprecated, we can remove these in master (sans remaining issues like BulkRequest). This change removes camel case support from ParseField, query types, analysis, and settings lookup. see #8988	2016-04-22 09:18:10 -07:00
Martijn van Groningen	c5ad2e2865	Changed indexed scripts to be stored in the cluster state instead of the `.scripts` index. Also added max script size soft limit for stored scripts. Closes #16651	2016-04-22 13:42:55 +02:00
Martijn van Groningen	dd2184ab25	ingest: Streamline option naming for several processors: * `rename` processor, renamed `to` to `target_field` * `date` processor, renamed `match_field` to `field` and renamed `match_formats` to `formats` * `geoip` processor, renamed `source_field` to `field` and renamed `fields` to `properties` * `attachment` processor, renamed `source_field` to `field` and renamed `fields` to `properties` Closes #17835	2016-04-21 13:40:43 +02:00
Jun Ohtani	9eb242a5fe	Analyze API : Rename filters/token_filters/char_filter to filter/token_filter/char_filter Closes #15189	2016-04-21 18:05:11 +09:00
Ryan Ernst	523b071836	Internal: Remove XContentBuilderString This was previously used by xcontentbuilder to support camelCase. However, it is no longer used, and can be replaced with just String.	2016-04-18 14:32:18 -07:00
Nik Everett	ff9b28d806	Deprecate remaining readXYZ\|writeXYZ methods	2016-04-18 16:19:45 -04:00
David Pilato	44080a007f	Add cloud.aws.s3.throttle_retries setting Defaults to `true`. If anyone is having trouble with this option, you could disable it with `cloud.aws.s3.throttle_retries: false` in `elasticsearch.yml` file.	2016-04-15 14:53:09 +02:00
David Pilato	f2ee759ad5	Upgrade AWS SDK to 1.10.69 * Moving from JSON.org to Jackson for request marshallers. * The Java SDK now supports retry throttling to limit the rate of retries during periods of reduced availability. This throttling behavior can be enabled via ClientConfiguration or via the system property "-Dcom.amazonaws.sdk.enableThrottledRetry". * Fixed String case conversion issues when running with non English locales. * AWS SDK for Java introduces a new dynamic endpoint system that can compute endpoints for services in new regions. * Introducing a new AWS region, ap-northeast-2. * Added a new metric, HttpSocketReadTime, that records socket read latency. You can enable this metric by adding enableHttpSocketReadMetric to the system property com.amazonaws.sdk.enableDefaultMetrics. For more information, see [Enabling Metrics with the AWS SDK for Java](https://java.awsblog.com/post/Tx3C0RV4NRRBKTG/Enabling-Metrics-with-the-AWS-SDK-for-Java). * New Client Execution timeout feature to set a limit spent across retries, backoffs, ummarshalling, etc. This new timeout can be specified at the client level or per request. Also included in this release is the ability to specify the existing HTTP Request timeout per request rather than just per client. * Added support for RequesterPays for all operations. * Ignore the 'Connection' header when generating S3 responses. * Allow users to generate an AmazonS3URI from a string without using URL encoding. * Fixed issue that prevented creating buckets when using a client configured for the s3-external-1 endpoint. * Amazon S3 bucket lifecycle configuration supports two new features: the removal of expired object delete markers and an action to abort incomplete multipart uploads. * Allow TransferManagerConfiguration to accept integer values for multipart upload threshold. * Copy the list of ETags before sorting https://github.com/aws/aws-sdk-java/pull/589. 
* Option to disable chunked encoding https://github.com/aws/aws-sdk-java/pull/586. * Adding retry on InternalErrors in CompleteMultipartUpload operation. https://github.com/aws/aws-sdk-java/issues/538 * Deprecated two APIs : AmazonS3#changeObjectStorageClass and AmazonS3#setObjectRedirectLocation. * Added support for the aws-exec-read canned ACL. Owner gets FULL_CONTROL. Amazon EC2 gets READ access to GET an Amazon Machine Image (AMI) bundle from Amazon S3. * Added support for referencing security groups in peered Virtual Private Clouds (VPCs). For more information see the service announcement at https://aws.amazon.com/about-aws/whats-new/2016/03/announcing-support-for-security-group-references-in-a-peered-vpc/ . * Fixed a bug in AWS SDK for Java - Amazon EC2 module that returns NPE for dry run requests. * Regenerated client with new implementation of code generator. * This feature enables support for DNS resolution of public hostnames to private IP addresses when queried over ClassicLink. Additionally, you can now access private hosted zones associated with your VPC from a linked EC2-Classic instance. ClassicLink DNS support makes it easier for EC2-Classic instances to communicate with VPC resources using public DNS hostnames. * You can now use Network Address Translation (NAT) Gateway, a highly available AWS managed service that makes it easy to connect to the Internet from instances within a private subnet in an AWS Virtual Private Cloud (VPC). Previously, you needed to launch a NAT instance to enable NAT for instances in a private subnet. Amazon VPC NAT Gateway is available in the US East (N. Virginia), US West (Oregon), US West (N. California), EU (Ireland), Asia Pacific (Tokyo), Asia Pacific (Singapore), and Asia Pacific (Sydney) regions. 
To learn more about Amazon VPC NAT, see [New - Managed NAT (Network Address Translation) Gateway for AWS](https://aws.amazon.com/blogs/aws/new-managed-nat-network-address-translation-gateway-for-aws/) * A default read timeout is now applied when querying data from EC2 metadata service.	2016-04-15 14:52:48 +02:00
Adrien Grand	d84c643f58	Use the new points API to index numeric fields. #17746 This makes all numeric fields including `date`, `ip` and `token_count` use points instead of the inverted index as a lookup structure. This is expected to perform worse for exact queries, but faster for range queries. It also requires less storage. Notes about how the change works: - Numeric mappers have been split into a legacy version that is essentially the current mapper, and a new version that uses points, eg. LegacyDateFieldMapper and DateFieldMapper. - Since new and old fields have the same names, the decision about which one to use is made based on the index creation version. - If you try to force using a legacy field on a new index or a field that uses points on an old index, you will get an exception. - IP addresses now support IPv6 via Lucene's InetAddressPoint and store them in SORTED_SET doc values using the same encoding (fixed length of 16 bytes and sortable). - The internal MappedFieldType that is stored by the new mappers does not have any of the points-related properties set. Instead, it keeps setting the index options when parsing the `index` property of mappings and does `if (fieldType.indexOptions() != IndexOptions.NONE) { // add point field }` when parsing documents. Known issues that won't fix: - You can't use numeric fields in significant terms aggregations anymore since this requires document frequencies, which points do not record. - Term queries on numeric fields will now return constant scores instead of giving better scores to the rare values. Known issues that we could work around (in follow-up PRs, this one is too large already): - Range queries on `ip` addresses only work if both the lower and upper bounds are inclusive (exclusive bounds are not exposed in Lucene). We could either decide to implement it, or drop range support entirely and tell users to query subnets using the CIDR notation instead. 
- Since IP addresses now use a different representation for doc values, aggregations will fail when running a terms aggregation on an ip field on a list of indices that contains both pre-5.0 and 5.0 indices. - The ip range aggregation does not work on the new ip field. We need to either implement range aggs for SORTED_SET doc values or drop support for ip ranges and tell users to use filters instead. #17700 Closes #16751 Closes #17007 Closes #11513	2016-04-14 17:56:23 +02:00
Yannick Welsch	80cf9fc761	Add EC2 discovery tests to check permissions of AWS Java SDK (#17677)	2016-04-13 10:01:49 +02:00
Adrien Grand	3bf6f4076c	Do not set analyzers on numeric fields. When it comes to query parsing, either a field is tokenized and it would go through analysis with its search_analyzer. Or it is not tokenized and the raw string should be passed to termQuery(). Since numeric fields are not tokenized and also declare a search analyzer, values would currently go through analysis twice...	2016-04-12 17:47:29 +02:00
Adrien Grand	013acf9179	Remove MappedFieldType.value. #17557 This commit removes `MappedFieldType.value` and simplifies `MappedFieldType.valueforSearch`. `valueforSearch` was used to post-process values that come for stored fields (eg. to convert a long back to a string representation of a date in the case of a date field) and also values that are extracted from the source but only in the case of GET calls: it would not be called when performing source filtering on search requests. `valueforSearch` is now only called for stored fields, since values that are extracted from the source should already be formatted as expected.	2016-04-12 09:12:56 +02:00
Adrien Grand	496c7fbd84	Upgrade Lucene 6 Release * upgrades numerics to new Point format * updates geo api changes * adds GeoPointDistanceRangeQuery as XGeoPointDistanceRangeQuery * cuts over to ES GeoHashUtils	2016-04-11 16:50:04 -05:00
Ryan Ernst	31ca8fa411	Merge branch 'master' into placeholder	2016-04-11 13:44:59 -07:00
Yannick Welsch	b08d453a0a	Fix EC2 Discovery settings (#17651) Fixes two bugs introduced by the settings refactoring in #16602	2016-04-11 16:17:55 +02:00
Alexander Reelsen	da19ddf3e6	Ingest Attachment: Allow to prevent base64 conversions by using raw bytes (#16601 ) CBOR is natively supported in Elasticsearch and allows for byte arrays. This means, that by using CBOR the user can prevent base64 conversions for the data being sent back and forth. This PR adds support to extract data from a byte array in addition to a string. This also required to add a ByteArrayValueSource class.	2016-04-11 14:14:56 +02:00
Adrien Grand	42526ac28e	Remove Settings.settingsBuilder. We have both `Settings.settingsBuilder` and `Settings.builder` that do exactly the same thing, so we should keep only one. I kept `Settings.builder` since it has my preference but also it is the one that we use in examples of the Java API.	2016-04-08 18:10:02 +02:00
David Pilato	c6b1beb083	Add a test for forced values in mapper-attachments plugin This PR just adds a new test where we check that forcing a value in the JSON document actually works as expected: ```json { "file": { "_content": "BASE64", "_name": "12-240.pdf", "_language": "en", "_content_type": "pdf" } } ``` Note that we don't support forcing all values. So sending: ```json { "file": { "_content": "BASE64", "_name": "12-240.pdf", "_title": "12-240.pdf", "_keywords": "Div42 Src580 LGE Mechtech", "_language": "en", "_content_type": "pdf" } } ``` Will have absolutely no effect on fields `title` and `keywords`. Note that when `_language` is set, it only works if `index.mapping.attachment.detect_language` is set to `true`. Related to https://discuss.elastic.co/t/mapper-attachments/46615/4	2016-04-08 10:07:21 +02:00
Chris Earle	d97d5ebb8b	Remove hostname from NetworkAddress.format This removes the inconsistent output of IP addresses. The format was parsing-unfriendly and it makes it hard to reason about API responses, such as to _nodes. With this change in place, it will never print the hostname as part of the default format, which has the added benefit that it can be used consistently for URIs, which was not the case when the hostname might appear at the front with "hostname/ip:port".	2016-04-07 17:27:59 -04:00
javanna	b9f9b2e3ee	Merge branch 'master' into enhancement/discovery_node_one_getter	2016-03-30 17:22:40 +02:00
javanna	f8b5d1f5b0	Remove DiscoveryNodes#masterNodeId in favour of existing DiscoveryNodes#getMasterNodeId	2016-03-30 15:28:06 +02:00
Adrien Grand	068c788ec8	Disable fielddata on text fields by default. #17386 `text` fields will have fielddata disabled by default. Fielddata can still be enabled on an existing index by setting `fielddata=true` in the mappings.	2016-03-30 14:35:32 +02:00
javanna	8fc9dbbb99	Merge branch 'master' into enhancement/remove_node_client_setting	2016-03-29 14:27:04 +02:00
Clinton Gormley	579d976e90	The source parameter should not be defined in the delete-by-query REST spec	2016-03-29 11:45:20 +02:00
javanna	93ce36a198	separated attributes from node roles in DiscoveryNode Node roles are now serialized as well, they are not part of the node attributes anymore. DiscoveryNodeService takes care of dividing settings into attributes and roles. DiscoveryNode always requires to pass in attributes and roles separately.	2016-03-25 20:14:27 +01:00
Boaz Leskes	91021e3019	merge from master	2016-03-25 15:50:48 +01:00
Jason Tedor	7f0134e725	Revert "Merge pull request #16843 from xuzha/s3-encryption" This reverts commit `37a183d9ed`, reversing changes made to `08903f1ed8`.	2016-03-24 17:11:02 -04:00
Xu Zhang	38923b89c2	Update Format, add new settings into the setting test	2016-03-24 12:16:57 -07:00
Ryan Ernst	3adaf09675	Settings: Cleanup placeholder replacement This change moves placeholder replacement to a pkg private class for settings. It also adds a null check when calling replacement, as settings objects can still contain null values, because we only prohibit nulls on file loading. Finally, this cleans up file and stream loading a bit to not have unnecessary exception wrapping.	2016-03-24 11:54:05 -07:00
Xu Zhang	7499e3aa4a	Update and rebase the init implementation. Also removes the MD5 checks from our side, AWS S3 SDK java is doing the check.	2016-03-24 11:21:40 -07:00
Nicolas Trésegnie	ea78fd6560	Add client-side encryption The Java Cryptography Extension (JCE) has to be installed to use this feature.	2016-03-24 11:13:37 -07:00
David Pilato	4b1ae331f0	Update after review	2016-03-23 17:32:51 +01:00
David Pilato	e907b7c11e	Check that S3 setting `buffer_size` is always lower than `chunk_size` We can be better at checking `buffer_size` and `chunk_size` for S3 repositories. For example, we know that: * `buffer_size` should be more than `5mb` * `chunk_size` should be no more than `5tb` * `buffer_size` should be lower than `chunk_size` Otherwise, setting `buffer_size` is useless. For the record: `chunk_size` is a Snapshot setting whatever the implementation is. `buffer_size` is an S3 implementation setting. Let's say that you are snapshotting a 500mb file. If you set `chunk_size` to `200mb`, then the Snapshot service will call the S3 repository to snapshot 3 files with the following sizes: * `200mb` * `200mb` * `100mb` If you set `buffer_size` to `100mb` (AWS maximum size recommendation), the first file of `200mb` will be uploaded to S3 using the multipart feature in 2 chunks and the workflow is basically the following: * create the multipart request and get back an `id` from AWS S3 platform * upload part1: `100mb` * upload part2: `100mb` * "commit" the full upload using the `id`. Closes #17244.	2016-03-23 10:39:54 +01:00
Boaz Leskes	7c8cdf4a71	merged from master	2016-03-22 19:21:28 +01:00
Simon Willnauer	1988b8b387	[TEST] Reuse EsTestCase#createAnalysisService in KuromojiAnalysisTests	2016-03-22 13:45:20 +01:00
Jun Ohtani	a9a0f262af	Analysis Kuromoji: Add nbest option and NumberFilter Add nbest_cost and nbest_examples parameter to KuromojiTokenizerFactory Add KuromojiNumberFilterFactory	2016-03-22 20:09:56 +09:00
Boaz Leskes	858610d0d1	merge from master	2016-03-19 13:57:40 +01:00
Ryan Ernst	f71f0d6010	Revert "Build: Switch to maven-publish plugin" This reverts commit `a90a2b34fc`.	2016-03-18 17:22:25 -07:00
Ryan Ernst	6af4c43c4f	Merge pull request #17128 from rjernst/maven_publish Build: Switch to maven-publish plugin	2016-03-17 11:53:50 -07:00
Simon Willnauer	e91a141233	Prevent index level setting from being configured on a node level Today we allow setting all kinds of index level settings on the node level, which is error prone and difficult to get right in a consistent manner. For instance, if some analyzers are set up in a yaml config file, some nodes might not have these analyzers and then index creation fails. Nevertheless, this change allows some selected settings to be specified on a node level, for instance: * `index.codec` which is used in a hot/cold node architecture and its value is really per node or per index * `index.store.fs.fs_lock` which is also dependent on the filesystem a node uses All other index level settings must be specified on the index level. For existing clusters the index must be closed and all settings must be updated via the API on each of the indices. Closes #16799	2016-03-17 14:42:18 +01:00
Ryan Ernst	a90a2b34fc	Build: Switch to maven-publish plugin The build currently uses the old maven support in gradle. This commit switches to use the newer maven-publish plugin. This will allow future changes, for example, easily publishing to artifactory. An additional part of this change makes publishing of build-tools part of the normal publishing, instead of requiring a separate upload step from within buildSrc. That also sets us up for a follow up to enable precommit checks on the buildSrc code itself.	2016-03-15 19:16:37 -07:00
Jason Tedor	618441aea3	Merge pull request #17088 from jasontedor/simplify-bootstrap-settings Bootstrap does not set system properties	2016-03-15 19:25:16 -04:00
Jason Tedor	66ba044ec5	Use setting in integration test cluster config	2016-03-15 17:45:17 -04:00
Yannick Welsch	f5e6db4090	Remove System.out.println and Throwable.printStackTrace from tests	2016-03-15 15:40:37 +01:00
Yannick Welsch	d14ae5f8b6	Remove Python and Javascript Benchmark classes	2016-03-15 15:02:50 +01:00
David Pilato	84c862b825	Merge remote-tracking branch 'origin/master'	2016-03-15 09:25:26 +01:00
David Pilato	a3bf57d116	Upgrade azure SDK to 0.9.3 We are ATM using azure SDK 0.9.0. Azure latest release is now 0.9.3 (released in February 2016). <img width="1024" alt="the central repository search engine google chrome aujourd hui at 08 41 12" src="https://cloud.githubusercontent.com/assets/274222/13662836/a806ba3a-e69d-11e5-8655-4a838db2ef47.png"> Artifacts are on [maven central](http://search.maven.org/#search%7Cga%7C1%7Cg%3A%22com.microsoft.azure%22%20AND%20(a%3Aazure-serviceruntime%20OR%20a%3Aazure-servicebus%20OR%20a%3Aazure-svc-)) Change log: ## 2016.2.18 Version 0.9.3 Fix enum bugs in azure-svc-mgmt-websites ## 2016.1.26 Version 0.9.2 * Fix HTTP Proxy for Apache HTTP Client in Service Clients * Key Vault: Fix KeyVaultKey to not attempt to load RSA Private Key ## 2016.1.8 Version 0.9.1 * Support HTTP Proxy * Fix token expiration issue #557 * Service Bus: Add missing attributes: partitionKey, viaPartitionKey * Traffic Manager: Update API version, add MinChildEndpoints for NestedEndpoints * Media: Add support for Widevine (DRM) dynamic encryption Closes #17042.	2016-03-15 09:18:34 +01:00
Simon Willnauer	345e988bbc	Merge pull request #17072 from s1monw/add_backwards_rest_tests Add infrastructure to run REST tests on a multi-version cluster This change adds the infrastructure to run the rest tests on a multi-node cluster that uses 2 different minor versions of elasticsearch. It doesn't implement any dedicated BWC tests but rather leverages the existing REST tests. Since we don't have a real version to test against, the tests use the current version until the first minor / RC is released to ensure the infrastructure works. Given the amount of problems this change already found I think it's worth having this run with our test suite by default. The structure of this infra will likely change over time but for now it's a step in the right direction. We will likely want to split it up into integTests and integBwcTests etc. so each plugin can have its own bwc tests but that's left for future refactoring.	2016-03-15 09:17:43 +01:00
Areek Zillur	c3078f4d65	adapt tests to use index uuid as folder name	2016-03-14 23:24:24 -04:00
Simon Willnauer	554bf2c282	[TEST] Test that all processors are available	2016-03-14 22:35:25 +01:00
Simon Willnauer	6f28c173e2	[TEST] Test that all processors are available	2016-03-14 21:42:37 +01:00
Adrien Grand	5596e31068	Upgrade to lucene-6.0.0-f0aa4fc. #17075	2016-03-14 07:58:52 +01:00
Jason Tedor	8a05c2a2be	Bootstrap does not set system properties Today, certain bootstrap properties are set and read via system properties. This action-at-distance way of managing these properties is rather confusing, and completely unnecessary. But another problem exists with setting these as system properties. Namely, these system properties are interpreted as Elasticsearch settings, not all of which are registered. This leads to Elasticsearch failing to start up if any of these special properties are set. Instead, these properties should be kept as local as possible, and passed around as method parameters where needed. This eliminates the action-at-distance way of handling these properties, and eliminates the need to register these non-setting properties. This commit does exactly that. Additionally, today we use the "-D" command line flag to set the properties, but this is confusing because "-D" is a special flag to the JVM for setting system properties. This creates confusion because some "-D" properties should be passed via arguments to the JVM (so via ES_JAVA_OPTS), and some should be passed as arguments to Elasticsearch. This commit changes the "-D" flag for Elasticsearch settings to "-E".	2016-03-13 20:09:15 -04:00
David Pilato	9acb0bb28c	Merge branch 'master' into pr/16598-register-filter-settings # Conflicts: # core/src/main/java/org/elasticsearch/cluster/service/InternalClusterService.java # core/src/main/java/org/elasticsearch/common/settings/IndexScopedSettings.java # core/src/main/java/org/elasticsearch/common/settings/Setting.java	2016-03-13 14:52:10 +01:00
Ryan Ernst	591fb8f028	Merge branch 'master' into cli-parsing	2016-03-11 10:45:05 -08:00
Yannick Welsch	04e55ecf6b	Make logging message String constant to allow static checks	2016-03-11 10:30:59 +01:00
Yannick Welsch	718876a941	Fix wrong placeholder usage in logging statements	2016-03-11 10:30:59 +01:00
Ryan Ernst	42a6869bb1	Merge pull request #17059 from elastic/fix/16864-attachment-doctypes Fix attachments plugins with docx	2016-03-10 17:27:02 -08:00
Ryan Ernst	2f3efc3fe1	Add doc and docx rest test to mapper attachment along with getClassLoader permission	2016-03-10 13:28:19 -08:00
Ryan Ernst	51d87d94dc	Add getClassLoader perm for tika in ingest	2016-03-10 11:17:25 -08:00
thefourtheye	304cbbbf31	fix redundant stack in comments	2016-03-11 00:31:38 +05:30
David Pilato	6deabac8e8	Can not extract text from Office documents (`.docx` extension) Add REST test for: * `.doc` * `.docx` The latter fails with: ``` ==> Test Info: seed=DB93397128B876D4; jvm=1; suite=1 Suite: org.elasticsearch.ingest.attachment.IngestAttachmentRestIT 2> REPRODUCE WITH: gradle :plugins:ingest-attachment:integTest -Dtests.seed=DB93397128B876D4 -Dtests.class=org.elasticsearch.ingest.attachment.IngestAttachmentRestIT -Dtests.method="test {yaml=ingest_attachment/30_files_supported/Test ingest attachment processor with .docx file}" -Des.logger.level=WARN -Dtests.security.manager=true -Dtests.locale=bg -Dtests.timezone=Europe/Athens FAILURE 4.53s | IngestAttachmentRestIT.test {yaml=ingest_attachment/30_files_supported/Test ingest attachment processor with .docx file} <<< FAILURES! > Throwable #1: java.lang.AssertionError: expected [2xx] status code but api [index] returned [400 Bad Request] [{"error":{"root_cause":[{"type":"parse_exception","reason":"Error parsing document in field [field1]"}],"type":"parse_exception","reason":"Error parsing document in field [field1]","caused_by":{"type":"tika_exception","reason":"Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser@7f85baa5","caused_by":{"type":"illegal_state_exception","reason":"access denied (\"java.lang.RuntimePermission\" \"getClassLoader\")","caused_by":{"type":"access_control_exception","reason":"access denied (\"java.lang.RuntimePermission\" \"getClassLoader\")"}}}},"status":400}] > at __randomizedtesting.SeedInfo.seed([DB93397128B876D4:53C706AB86441B2C]:0) > at org.elasticsearch.test.rest.section.DoSection.execute(DoSection.java:107) > at org.elasticsearch.test.rest.ESRestTestCase.test(ESRestTestCase.java:395) > at java.lang.Thread.run(Thread.java:745) ``` Related to #16864	2016-03-10 10:57:59 +01:00
Boaz Leskes	330f2919cb	merge from master	2016-03-10 09:37:42 +01:00
Simon Willnauer	7a53a396e4	Remove Unneeded @Inject annotations	2016-03-09 12:10:47 +01:00
Ryan Ernst	80ae2b0002	Fix more licenses	2016-03-09 00:10:59 -08:00
Ryan Ernst	1dafead2eb	Fix precommit	2016-03-08 22:55:24 -08:00
Ryan Ernst	fdce9d7c4d	Merge branch 'master' into cli-parsing	2016-03-08 14:18:20 -08:00
Ryan Ernst	e5c852f767	Convert bootstrapcli parser to jopt-simple	2016-03-08 13:39:37 -08:00
Simon Willnauer	a40587b377	Merge pull request #16982 from s1monw/remove_old_version Remove old and unsupported version constants All versions <= 2.0 are no longer supported. This commit removes all uses of these versions.	2016-03-07 16:03:14 +01:00
Martijn van Groningen	82d01e4315	Added ingest info to node info API, which contains a list of available processors. Internally the put pipeline API uses this information in node info API to validate if all specified processors in a pipeline exist on all nodes in the cluster.	2016-03-07 14:44:50 +01:00
Simon Willnauer	fdfb0e56f6	Remove bw compat from size mapper	2016-03-07 12:48:02 +01:00
Simon Willnauer	f96900013c	Remove bw compat from murmur3 mapper	2016-03-07 12:44:53 +01:00
Robert Muir	54018a5d37	upgrade to lucene 6.0.0-snapshot-bea235f Closes #16964 Squashed commit of the following: commit a23f9d2d29220991aa498214530753d7a5a148c6 Merge: eec9c4e `0b0a251` Author: Robert Muir <rmuir@apache.org> Date: Mon Mar 7 04:12:02 2016 -0500 Merge branch 'master' into lucene6 commit eec9c4e5cd11e9c3e0b426f04894bb2a6dae4f21 Merge: bc67205 `675d940` Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 13:45:00 2016 -0500 Merge branch 'master' into lucene6 commit bc67205bdfe1526eae277ab7856fc050ecbdb7b2 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 09:56:31 2016 -0500 fix test bug commit a60723b007ff12d97b1810cef473bd7b553a0327 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:35:35 2016 +0100 Fix SimpleValidateQueryIT to put braces around boosted terms commit ae3a49d7ba7ced448d2a5262e5d8ec98671a9090 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:27:25 2016 +0100 fix multimatchquery commit ae23fdb88a8f6d3fb7ba60fd1aaf3fd72d899aa5 Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 15:20:49 2016 +0100 Rewrite DecayFunctionScoreIT to be independent of the similarity used This test relied a lot on the term scoring and compared scores that are dependent on the similarity. This commit changes the base query to be a predictable constant score query. commit 366c2d518c35d31251033f1b6f6a93f6e2ae327d Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 14:06:14 2016 +0100 Fix scoring in tests due to changes to idf calculation. Lucene 6 uses a different default similarity as well as a different way to calculate IDF. In contrast to older version lucene 6 uses docCount per field to calculate the IDF not the # of docs in the index to overcome the sparse field cases. 
commit dac99fd64ac2fa71b8d8d106fe68825e574c49f8 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:21:57 2016 -0500 don't hardcoded expected termquery score commit 6e9f340ba49ab10eed512df86d52a121aa775b0f Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 08:04:45 2016 -0500 suppress deprecation warning until migrated to points commit 3ac8908424b3fdad44a90a4f7bdb3eff7efd077d Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:21:43 2016 -0500 Remove invalid test: all commits have IDs, and its illegal to do this. commit c12976288124ad1a26467e7e848fb810548e7eab Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:06:14 2016 -0500 don't test with unsupported back compat commit 18bbfe76128570bc70883bf91ff4c44c82d27817 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 07:02:18 2016 -0500 remove now invalid lucene 4 backcompat test commit 7e730e572886f0ef2d3faba712e4256216ff01ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:58:52 2016 -0500 remove now invalid lucene 4 backwards test commit 244d2ab6868ba5ac9e0bcde3c2833743751a25ec Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:47:23 2016 -0500 use 6.0 codec commit 5f64d4a431a6fdaa1234adca23f154c2a1de8284 Author: Robert Muir <rmuir@apache.org> Date: Fri Mar 4 06:43:08 2016 -0500 compile, javadocs, forbidden-apis, etc commit 1f273cd62a7fe9ca8f8944acbbfc5cbdd3d81ccb Merge: cd33921 `29e3443` Author: Simon Willnauer <simonw@apache.org> Date: Fri Mar 4 10:45:29 2016 +0100 Merge branch 'master' into lucene6 commit cd33921ac742ef9fb351012eff35f3c7dbda7264 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:58:37 2016 -0500 fix hunspell dictionary loading commit c7fdbd837b01f7defe9cb1c24e2ec65604b0dc96 Merge: 4d4190f `d8948ba` Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:41:53 2016 -0500 Merge branch 'master' into lucene6 commit 4d4190fd82601aaafac6b8254ccb3edf218faa34 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:39:14 2016 -0500 remove nocommit 
commit 77ca69e288b1a41aa9595c921ed166c272a00ea8 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:38:24 2016 -0500 clean up numericutils vs legacynumericutils commit a466d696fbaad04b647ffbc0857a9439b583d0bf Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:32:43 2016 -0500 upgrade spatial4j commit 5412c747a8cfe638bacedbc8233163cb75cc3dc5 Author: Robert Muir <rmuir@apache.org> Date: Thu Mar 3 23:19:28 2016 -0500 move to 6.0.0-snapshot-8eada27 commit b32bfe924626b87e540692375ece09e7c2edb189 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:30:09 2016 +0100 Fix some test compile errors. commit 6ccde35e9840b03c68d1a2cd47c7923a06edf64a Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:25:51 2016 +0100 Current Lucene version is 6.0.0. commit f62e1015d931b4cc04c778298a8fa1ba65e97ad9 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 11:20:48 2016 +0100 Fix compile errors in NGramTokenFilterFactory. commit 6837c6eabf96075f743649da9b9b52dd39611c58 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:50:59 2016 +0100 Fix the edge ngram tokenizer/filter. commit ccd7f070de5efcdfbeb34b9555c65c4990bf1ba6 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:42:44 2016 +0100 The missing value is now accessible through a getter. commit bd3b77f9b28e5b05daa3d49683a9922a6baf2963 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:41:51 2016 +0100 Remove IndexCacheableQuery. commit 05f3091c347aeae80eeb16349ac51d2b53cf86f7 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:39:43 2016 +0100 Fix compilation of function_score queries. commit 81cda79a2431ac78f56b0cc5a5765387f662d801 Author: Adrien Grand <jpountz@gmail.com> Date: Thu Mar 3 10:35:02 2016 +0100 Fix compile errors in BlendedTermQuery. 
commit 70994ce8dd1eca0b995870974a38e20f26f96a7b Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 23:33:03 2016 -0500 add bug ID commit 29d4f1a71f36f646b5a6060bed3db019564a279d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 21:02:32 2016 -0500 easy .store changes commit 5e1a1e6fd665fa455e88d3a8987362fad5f44bb1 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:47:24 2016 -0500 cleanups mostly around boosting commit 333a669ec6c305ada5645d13ed1da0e19ec1d053 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 20:27:56 2016 -0500 more simple fixes commit bd5cd98a1e089c866b6b4a5e159400b110140ce6 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:49:38 2016 -0500 more easy fixes and removal of ancient cruft commit a68f419ee47da5f9c9ce5b372f01d707e902474c Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 19:35:02 2016 -0500 cutover numerics commit 4ca5dc1fa47dd5892db00899032133318fff3116 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:34:18 2016 -0500 fix some constants commit 88710a17817086e477c6c021ec346d0534b7fb88 Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:14:25 2016 -0500 Add spatial-extras jar as a core dependency commit c8cd6726583e5ce3f546ed355d4eca037164a30d Author: Robert Muir <rmuir@apache.org> Date: Wed Mar 2 18:03:33 2016 -0500 update to lucene 6 jars	2016-03-07 04:12:23 -05:00
David Pilato	e35032950e	Merge branch 'master' into pr/16598-register-filter-settings	2016-03-05 11:37:03 +01:00
David Pilato	2bb3846d1f	Update after review: * remove `ClusterScope` * rename `ClusterSettings` to `NodeSettings` * rename `SettingsProperty` to `Property`	2016-03-04 16:53:24 +01:00
David Pilato	76719341dc	Fix after merge	2016-03-04 13:24:39 +01:00
David Pilato	c11cf3bf1f	Merge branch 'master' into pr/16598-register-filter-settings # Conflicts: # core/src/main/java/org/elasticsearch/common/logging/ESLoggerFactory.java # core/src/main/java/org/elasticsearch/common/settings/Setting.java # core/src/test/java/org/elasticsearch/common/settings/SettingTests.java	2016-03-04 12:23:10 +01:00
David Pilato	f97ce3c728	Deprecate mapper-attachments plugin See #16910	2016-03-04 11:49:12 +01:00
Simon Willnauer	5008694ba1	Remove support for legacy checksums Elasticsearch 5.0 doesn't support indices with legacy checksums anymore. The last time we wrote legacy checksums was in 1.3.0 which was based on lucene 4.9 already which means that all files have CRC32 checksums. All indices that Elasticsearch can read today must be written with lucene version >= 4.8 anyway so we can drop this layer of backwards compatibility entirely. Since we are close to upgrading to Lucene 6.0 we should get rid of this in a more contained change than the lucene upgrade.	2016-03-03 22:58:18 +01:00
Lee Hinman	6adbbff97c	Fix organization rename in all files in project Basically a query-replace of "https://github.com/elasticsearch/" with "https://github.com/elastic/"	2016-03-03 12:04:13 -07:00
Adrien Grand	eef19be072	Deprecate string in favor of text/keyword. #16877 This commit removes the ability to use string fields on indices created on or after 5.0. Dynamic mappings now generate text fields by default for strings but there are plans to also add a sub keyword field (in a future PR). Most of the changes in this commit are just about replacing string with keyword or text. Some tests have been removed because they existed because of corner cases of string mappings like setting ignore-above on a text field or enabling term vectors on a keyword field which are now impossible. The plan is to remove strings entirely in 6.0.	2016-03-03 10:20:56 +01:00
Daniel Mitterdorfer	f70e5aca50	Merge remote-tracking branch 'danielmitterdorfer/simplify-azure-settings'	2016-03-03 10:02:35 +01:00
Daniel Mitterdorfer	52acf0e6e1	Use new settings infra to parse AzureStorageSettings With this commit we simplify the parsing logic in AzureStorageSettings by leveraging the new settings infrastructure. Closes #16363	2016-03-03 10:01:14 +01:00
David Pilato	3f71c1d6a5	Replace `s -> s` by `Function.identity()`	2016-03-02 10:12:40 +01:00
David Pilato	e4d9e46508	Fix merge with master	2016-03-02 09:55:09 +01:00
David Pilato	5fbf1b95dc	Merge branch 'master' into pr/16598-register-filter-settings # Conflicts: # core/src/main/java/org/elasticsearch/common/logging/ESLoggerFactory.java # core/src/main/java/org/elasticsearch/discovery/DiscoveryService.java # core/src/main/java/org/elasticsearch/discovery/DiscoverySettings.java # core/src/main/java/org/elasticsearch/http/HttpTransportSettings.java # plugins/repository-azure/src/main/java/org/elasticsearch/cloud/azure/storage/AzureStorageService.java	2016-03-02 09:43:53 +01:00
Jason Tedor	aa8ee74c6c	Bump Elasticsearch version to 5.0.0-SNAPSHOT This commit bumps the Elasticsearch version to 5.0.0-SNAPSHOT in line with the alignment of versions across the stack. Closes #16862	2016-03-01 17:03:47 -05:00
Simon Willnauer	80e5c0acf8	Merge pull request #16860 from s1monw/issues/16485 Add setFactory permission to GceDiscoveryPlugin This commit adds a missing permission and a simple test that ensures we discover other nodes via a mock http endpoint. Closes #16485	2016-03-01 08:55:43 +01:00
Nik Everett	95cc3e38fc	Check test naming conventions on all modules The big win here is catching tests that are incorrectly named and will be skipped by gradle, providing a false sense of security. The whole thing takes about 10 seconds on my Macbook Air, not counting compiling the test classes, which seems worth it. Because this runs as a gradle task with proper UP-TO-DATE handling it can be skipped if the tests haven't been changed which should save some time. I chose to keep this in test:framework rather than a new subproject of buildSrc because ESIntegTestCase and doesn't introduce any additional dependencies.	2016-02-29 16:31:49 -05:00
Simon Willnauer	948ee3ee3f	Move keystore creation to gradle - this prevents committing a keystore to the source repo	2016-02-29 22:01:39 +01:00
Boaz Leskes	195b43d66e	Remove DiscoveryService and reduce guice to just Discovery #16821 DiscoveryService was a bridge into the discovery universe. This is unneeded and we can just access discovery directly or do things in a different way. One of those different ways is not having a dedicated discovery implementation for each of our discovery plugins but rather reusing ZenDiscovery. UnicastHostProviders are now classified by discovery type, removing unneeded checks on plugins. Closes #16821	2016-02-29 20:23:38 +01:00
Simon Willnauer	ecca717339	Add setFactory permission to GceDiscoveryPlugin This commit adds a missing permission and a simple test that ensures we discover other nodes via a mock http endpoint. Closes #16485	2016-02-29 15:48:16 +01:00
David Pilato	7a42014909	Upgrade Azure Storage client to 4.0.0 We are using `2.0.0` today but Azure team now recommends: ```xml <dependency> <groupId>com.microsoft.azure</groupId> <artifactId>azure-storage</artifactId> <version>4.0.0</version> </dependency> ``` This new version fixes the timeout issues we have seen with azure storage although #15080 adds timeout support. Azure storage client 2.0.0 was not correctly passing this value when it was calling Azure services. Note that the timeout is a server side timeout and not client side timeout. It means that it will raise a timeout only when: * upload of blob is complete * if azure service is not able to process the blob (and store it) within a given time range. In which case it will raise an exception which elasticsearch can deal with: ``` java.io.IOException at __randomizedtesting.SeedInfo.seed([91BC11AEF16E073F:6886FA5308FCE4D8]:0) at com.microsoft.azure.storage.core.Utility.initIOException(Utility.java:643) at com.microsoft.azure.storage.blob.BlobOutputStream.writeBlock(BlobOutputStream.java:444) at com.microsoft.azure.storage.blob.BlobOutputStream.access$000(BlobOutputStream.java:53) at com.microsoft.azure.storage.blob.BlobOutputStream$1.call(BlobOutputStream.java:388) at com.microsoft.azure.storage.blob.BlobOutputStream$1.call(BlobOutputStream.java:385) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) Caused by: com.microsoft.azure.storage.StorageException: Operation could not be completed within the specified time. 
at com.microsoft.azure.storage.StorageException.translateException(StorageException.java:89) at com.microsoft.azure.storage.core.StorageRequest.materializeException(StorageRequest.java:305) at com.microsoft.azure.storage.core.ExecutionEngine.executeWithRetry(ExecutionEngine.java:175) at com.microsoft.azure.storage.blob.CloudBlockBlob.uploadBlockInternal(CloudBlockBlob.java:1006) at com.microsoft.azure.storage.blob.CloudBlockBlob.uploadBlock(CloudBlockBlob.java:978) at com.microsoft.azure.storage.blob.BlobOutputStream.writeBlock(BlobOutputStream.java:438) ... 9 more ``` The following code was used to test this against Azure platform: ```java public void testDumb() throws URISyntaxException, StorageException, IOException, InvalidKeyException { String connectionString = "MY-AZURE-STRING"; CloudStorageAccount storageAccount = CloudStorageAccount.parse(connectionString); CloudBlobClient client = storageAccount.createCloudBlobClient(); client.getDefaultRequestOptions().setTimeoutIntervalInMs(1000); CloudBlobContainer container = client.getContainerReference("dumb"); container.createIfNotExists(); CloudBlockBlob blob = container.getBlockBlobReference("blob"); File sourceFile = File.createTempFile("sourceFile", ".tmp"); try { int fileSize = 10000000; byte[] buffer = new byte[fileSize]; Random random = new Random(); random.nextBytes(buffer); logger.info("Generate local file"); FileOutputStream fos = new FileOutputStream(sourceFile); fos.write(buffer); fos.close(); logger.info("End generate local file"); FileInputStream fis = new FileInputStream(sourceFile); logger.info("Start uploading"); blob.upload(fis, fileSize); logger.info("End uploading"); } finally { if (sourceFile.exists()) { sourceFile.delete(); } } } ``` With 2.0.0, the above code was not raising any exception. With 4.0.0, the exception is now thrown correctly. The default timeout is 5 minutes. 
See https://github.com/Azure/azure-storage-java/blob/master/microsoft-azure-storage/src/com/microsoft/azure/storage/core/Utility.java#L352-L375 Closes #12567. Release notes from 2.0.0: * Removed deprecated table AtomPub support. * Removed deprecated constructors which take service clients in favor of constructors which take credentials. * Added support for "Add" permissions on Blob SAS. * Added support for "Create" permissions on Blob and File SAS. * Added support for IP Restricted SAS and Protocol SAS. * Added support for Account SAS to all services. * Added support for Minute and Hour Metrics to FileServiceProperties and added support for File Metrics to CloudAnalyticsClient. * Removed deprecated startCopyFromBlob() on CloudBlob. Use startCopy() instead. * Removed deprecated Credentials and StorageKey classes. Please use the appropriate methods on StorageCredentialsAccountAndKey instead. * Fixed a bug in table where a select on a non-existent field resulted in a null reference exception if the corresponding field in the TableEntity was not nullable. * Fixed a bug in table where JsonParser was automatically closing the response stream before it was completely drained causing socket exhaustion. * Fixed a bug in StorageCredentialsAccountAndKey.updateKey(String) which prevented valid keys from being set. * Added CloudBlobContainer.listBlobs(final String, final boolean) method. * Fixed a bug in blob where using AccessConditions on block blob uploads larger than 64MB done with the upload* methods or block blob uploads done openOutputStream with would fail if the blob did not already exist. * Added support for setting a proxy per request. Proxy can be set on an OperationContext instance and will be used when that instance is passed to the request method. * Added support for SAS to the Azure File service. * Added support for Append Blob. * Added support for Access Control Lists (ACL) to File Shares. * Added support for getting and setting of CORS rules to File service. 
* Added support for ShareStats to File Shares. * Added support for copying an Azure File to another Azure File or a Block Blob asynchronously, and aborting Azure File copy operations asynchronously. * Added support for copying a Blob to an Azure File asynchronously. * Added support for setting a maximum quota property on a File Share. * Removed deprecated AuthenticationScheme and its getter and setter. In the future only SharedKey will be used. * Removed deprecated getter/setters for all request option properties on the service clients. Please use the default request options getter/setters instead. * Removed getSubDirectoryReference() for blob directories and file directories. Use getDirectoryReference() instead. * Removed getEntityClass() in TableQuery. Please use getClazzType() instead. * Added client-side verification for lease duration and break periods. * Deprecated the setters in table for timestamp as this property is only modifiable by the service. * Deprecated startCopyFromBlob() on CloudBlob. Use startCopy() instead. * Deprecated the Credentials and StorageKey classes. Please use the appropriate methods on StorageCredentialsAccountAndKey instead. * Deprecated constructors which take service clients in favor of constructors which take credentials. * Fixed a bug where the DateBackwardCompatibility flag was not applied if set on the CloudTableClient default request options. * Changed library behavior to retry all exceptions thrown when parsing a response object. * Changed behavior to stop removing query parameters passed in with the resource URI if that URI contains a SAS token. Some query parameters such as comp, restype, snapshot and api-version will still be removed. * Added support for logging StringToSign to SharedKey and SAS. * Added a connect timeout to prevent hangs when establishing the network connection. * Made performance enhancements to the BlobOutputStream class. 
* Fixed a bug where maximum execution time was ignored for file, queue, and table services. * Changed the socket timeout to be set to the service side timeout plus 5 minutes when maximum execution time is not set. * Changed the socket timeout to default to 5 minutes rather than infinite when neither service side timeout or maximum execution time are set. * Fixed a bug where MD5 was calculated for commitBlockList even though UseTransactionalMD5 was set to false. * Fixed a bug where selecting fields that did not exist returned an error rather than an EntityProperty with a null value. * Fixed a bug where table entities with a single quote in their partition or row key could be inserted but not operated on in any other way. * Fixed a bug for all listing API's where next() would sometimes throw an exception if hasNext() had not been called even if there were more elements to iterate on. * Added sequence number to the blob properties. This is populated for page blobs. * Creating a page blob sets its length property. * Added support for page blob sequence numbers and sequence number access conditions. * Fixed a bug in abort copy where the lease access condition was not sent to the service. * Fixed an issue in startCopyFromBlob where if the URI of the source blob contained certain non-ASCII characters they would not be encoded appropriately. This would result in Authorization failures. * Fixed a small performance issue in XML serialization. * Fixed a bug in BlobOutputStream and FileOutputStream where flush added data to a request pool rather than immediately committing it to the Azure service. * Refactored to remove the blob, queue, and file package dependency on table in the error handling code. * Added additional client-side logging for REST requests, responses, and errors. Closes #15976.	2016-02-29 15:00:34 +01:00
Martijn van Groningen	c7b626c615	updated SHAs	2016-02-28 13:20:39 +01:00
David Pilato	d77daf3861	Use a SettingsProperty.Dynamic for dynamic properties	2016-02-28 11:06:45 +01:00
David Pilato	31b5e0888f	Use a SettingsProperty enumSet Instead of modifying methods each time we need to add a new behavior for settings, we can simply pass `SettingsProperty... properties` instead. `SettingsProperty` could be defined then: ``` public enum SettingsProperty { Filtered, Dynamic, ClusterScope, NodeScope, IndexScope // HereGoesYours; } ``` Then in setting code, it becomes much more flexible. TODO: Note that we need to validate SettingsProperty values which are added to a Setting as some of them might be mutually exclusive.	2016-02-28 00:48:04 +01:00
Sylwester Lachiewicz	3af735c69e	Update MaxMind geoip2 version to 2.6 Update to align with #16801 jackson 2.7.1	2016-02-27 12:57:24 +01:00
Nik Everett	ba5be0332d	Remove optional logger wrappers Removes all our logger wrappers except the wrapper for log4j1.2. If you depend on Elasticsearch's jar in your application you'll need to declare log4j 1.2 and/or some bridge to your favorite logger. We did this to simplify our builds and code. No more commons-logging like log implementation sniffing. No more optional dependency hacks in gradle. We might one day want to use j.u.l instead of log4j. If we do want that we can recover its wrapper by studying this commit. We didn't go directly to j.u.l in this commit because that is a bigger change. Our logging configuration is based on log4j1.2 and people are used to it. So it'd be a much more fraught breaking change to do that conversion.	2016-02-26 16:41:07 -05:00
Jason Tedor	d94e391e71	Use System#lineSeparator and not system property This commit replaces a use of the system property "line.separator" and replaces it with a dedicated method that provides the same value. Closes #16776	2016-02-25 12:22:57 -05:00
Jack Conradson	7986770e5f	Moved Painless from a plugin to a module. Closes #16755	2016-02-21 16:50:54 -08:00
Mike McCandless	5fffede2b0	Upgrade to Lucene 5.5.0 official release	2016-02-20 17:34:16 -05:00
David Pilato	90fba97a30	Moves GCE settings to the new infra Closes #16720.	2016-02-19 17:00:39 -08:00
Adrien Grand	4f8895eae3	Add a text field. This new field is intended to replace analyzed string fields.	2016-02-15 10:43:44 +01:00
Boaz Leskes	4bb5b4100d	merge from master	2016-02-12 15:53:31 +01:00

1663 Commits