Aggregators now return a new collector instance per segment, like Lucene 5 does
with its oal.search.Collector API. This is important for us because information
such as whether a field is single- or multi-valued is only known at the segment
level.
In order to do that I had to change aggregators to notify their sub-aggregators
of new incoming segments (pretty much in the spirit of #6477) while everything
used to be centralized in the AggregationContext class. While this might slow
down deeply nested aggregation trees a bit, it also makes the `children`
aggregation and the `breadth_first` collection mode much better options, since
they can now replay only what they need where they used to have to replay the
whole aggregation tree.
I also took advantage of this big refactoring to remove some abstractions that
were not really required, like ValuesSource.MetaData or BucketAnalysisCollector.
I also split Aggregator into Aggregator and AggregatorBase in order to
separate the Aggregator API from implementation helpers.
Close#9544
Whenever we have an API that supports GET with a body, we always support the POST method too, as well as providing the body as a query_string parameter called `source`. Our REST spec should reflect this convention. Fixed them and introduced a hard check at parse time in our Java REST tests runner, which will cause the tests to fail if the spec is not compliant.
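For illustration (index name and query are made up), the convention means these three requests are equivalent:
```js
GET /test/_search
{ "query" : { "match_all" : {} } }

POST /test/_search
{ "query" : { "match_all" : {} } }

GET /test/_search?source={"query":{"match_all":{}}}
```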
Closes#9629
When using the CLI tool infrastructure, a command can potentially write
a new file. In case it overwrites an existing one, you may want to ensure
that the permissions, the owner and the group are kept the same and do not
accidentally change when overwriting those files.
This PR introduces a command that allows you to execute this check per path.
It also adds a new testing dependency, namely jimfs, which allows you to create
in-memory filesystems with certain properties (like supporting posix permissions
or not), so that you can test those features without executing
tests on a certain operating system.
The FileSystemUtils class has a helper method to create files with
a .new suffix, in case the file to be created already
exists. If you install plugins and those have configuration files,
even without changes, you will end up with tons of .new files.
This commit checks the file size and sha-256 sum, and only creates
a .new file if those differ.
On CI machines node recovery sometimes takes up to 2 seconds. When that happens, a cluster state update task gets stuck behind the recovery and tests fail with a 1 second timeout. This commit makes sure that we wait for recovery to complete before starting the clock.
This has been very trappy. Rather than continue to allow buggy behavior
of having upgrade/optimize requests sidestep the single shard per node
limits optimize is supposed to be subject to, this removes
the ability to run the upgrade/optimize async.
closes#9638
Today the logic related to deleting an index is spread across several
classes which makes changes to this rather delicate part of the code-base
very difficult. This commit consolidates this logic into the IndicesService
and moves the handling of ack-ing the delete to the master entirely into
`IndicesClusterStateService`.
_id and _routing now no longer support the 'path' setting on indexes
created with 2.0. Indexes created before 2.0 still support this
setting for backcompat.
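For reference, a pre-2.0 style mapping of this kind (field name is made up) is no longer accepted on indexes created with 2.0:
```js
PUT test/type/_mapping
{
  "type" : {
    "_routing" : {
      "path" : "route_field"
    }
  }
}
```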
closes#6730
Improve cleanup of updateTask timeout handlers. The timeout handlers should be removed as soon as a corresponding update task is processed. Otherwise, timeout handlers might keep old updateTasks and all objects that they are pointing to in memory for the duration of timeout (15 minutes by default).
Fixes#9621
The engine is already pretty complex, yet it's still convoluted with
code that doesn't necessarily belong there. Updating the settings from
the settings service can be done on the level above. This commit cleans up
the settings code in the engine and moves it to the IndexShard.
Until recently we couldn't close the engine in a tragic event due to
the lock order and all its complications. Now that the engine
is much more simplified in terms of having a single IndexWriter etc.,
we don't necessarily need the write-lock on close anymore and can
easily just close and continue.
InternalEngine contains a number of inner classes that it uses; however,
this makes the class overly large and hard to extend. In order to be
able to easily add other Engines (such as the ShadowEngine), these
helper methods have been extracted into an AbstractEngine class. The
classes that were previously in `InternalEngine` have been moved to
separate classes, which will allow for better unit testing as well.
None of the functionality of InternalEngine has been changed, this is
only refactoring.
Note that this is a change I originally made on my shadow-replica
branch, however it is easier to review piecemeal so I extracted it into
a separate PR.
Sometimes by the time update settings is called the second node is not in the cluster yet. As a result, the change of the minimum master nodes setting to 2 is ignored, causing this test to fail.
Add offset option to 'date_histogram' replacing and simplifying the previous 'pre_offset' and 'post_offset' options.
This change is part of a larger clean up task for `date_histogram` from issue #9062.
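A sketch of the new option (field and aggregation names are made up), shifting day buckets by six hours:
```js
{
  "aggs" : {
    "by_day" : {
      "date_histogram" : {
        "field" : "timestamp",
        "interval" : "day",
        "offset" : "+6h"
      }
    }
  }
}
```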
Due to the possibility of ports being already used when choosing a
random port, it makes sense to simply repeat a unit test upon a bind
exception.
This commit adds a junit rule, which does exactly this and does not
require you to change the test code and add loops.
Closes#9010
Closes#9587
Squashed commit of the following:
commit 23ac91dca4b949638ca1d3842fd6db2e00ee1d36
Author: Adrien Grand <jpountz@gmail.com>
Date: Thu Feb 5 18:42:28 2015 +0100
Do not compute scores if aggregations do not need it (like top_hits) or use a script (which might compute scores).
commit 51262fe2681c067337ca41ab88096ef80a2e8ebb
Author: Adrien Grand <jpountz@gmail.com>
Date: Thu Feb 5 15:58:38 2015 +0100
Fix more compile errors.
commit a074895d55b8b3c898d23f7f5334e564d5271a56
Author: Robert Muir <rmuir@apache.org>
Date: Thu Feb 5 09:31:22 2015 -0500
fix a few more obvious ones
commit 399c41186cb3c9be70107f6c25b51fc4844f8fde
Author: Robert Muir <rmuir@apache.org>
Date: Thu Feb 5 09:28:32 2015 -0500
fix some collectors and queries
commit 5f46c2f846c5020d5749233b71cbe66ae534ba51
Author: Robert Muir <rmuir@apache.org>
Date: Thu Feb 5 09:24:24 2015 -0500
upgrade to lucene r1657571
After phase1 of recovery is completed, we check that all pending mapping changes have been sent to the master and processed by the other nodes. This is needed in order to make sure that the target node has the latest mapping (we just copied over the corresponding lucene files). To make sure we do not miss updates, we do so under a local cluster state update task. At the moment we don't have a timeout when waiting on the task to be completed. If the local node update thread is very busy, this may stall the recovery for too long. This commit adds a timeout (equal to `indices.recovery.internal_action_timeout`) and upgrades the task urgency to `IMMEDIATE`. If we fail to perform the check, we fail the recovery.
Closes#9575
That method checks that files were released properly, but also clears a static map holding references to mock directories. Since we iterate on many indexes this created memory pressure.
This commit removes the FlushType entirely and replaces it in most places with
a simple `Engine#flush()` call. Flushing without committing the translog is now
entirely private to the engine and is only called in one place.
The `full` option and `FlushType.NEW_WRITER` only exist to allow
realtime changes to two settings (`index.codec` and `index.concurrency`).
Those settings are very expert and don't really need to be updateable
in realtime.
When the master publishes a new cluster state it waits (by default) for up to 30s for all nodes to respond. If not, it continues to process other pending tasks. At the moment, this timeout is logged under DEBUG but it typically represents a serious issue with one or more of the nodes. We should log it under WARN and name the nodes that failed to respond in a timely fashion.
Closes#9551
We currently have the IndicesRequest interface to mark indices related requests and be able to retrieve the indices they relate to in a generic way. This commit introduces a similar abstraction for requests that manage aliases, to be able to retrieve/replace the aliases they relate to.
Also, IndicesAliasesRequest becomes a CompositeIndicesRequest, as it allows performing multiple operations (e.g. add/remove multiple aliases). Each single operation (AliasActions) now implements the newly introduced AliasesRequest.
AliasesRequest is also implemented by GetAliasesRequest, which allows to retrieve aliases information.
Closes#9460
In big deployments the ClusterState can be large. To make sure we keep reusing objects that were promoted to the Old Gen, ZenDiscovery has an optimization where it tries to reuse existing IndexMetaData objects (containing among other things the mappings) from the current cluster state if they didn't change. The comparison currently uses the index name and the metadata version. This is however not enough and we should also check the index uuid. In extreme cases, where cluster state processing is slow and the index in question is deleted and recreated and these operations are batch processed together, we can use the wrong metadata if the version is also identical. This can happen if people create the index with all metadata predefined and no settings were changed.
Closes#9489 Closes#9541
When closing an instance of RestClient, the connection manager gets shutdown, which makes it not usable anymore. If that is static, like it is now, no RestClient will work anymore from that moment on. Each instance of RestClient should have its own instance of connection manager
This method is heavy as it builds a bitset out of a DocIdSet in order to be
able to provide random access. Now that Lucene has removed out-of-order scoring,
true random access is very rarely needed and we could instead return a Bits
instance that wraps the iterator. Ideally, we would use the DISI API directly
but I have to admit that the Bits API is more friendly.
Close#9546
We had a REST test that relied on matching a json response against a regex. It worked, but the match wasn't done against the actual json object; it was done against its java map representation converted into a string by calling `toString`. Since all other clients' test runners don't work in this case, as they try to match a json object against a regex, we should do the same and prevent it from working.
The current "checkindex" on startup is very very expensive. This is
like running one of the old school hard drive diagnostic checkers and
usually not a good idea.
But we can do a CRC32 verification of files. We don't even need to
open an indexreader to do this, its much more lightweight.
This option (as well as the existing true/false) are randomized in
tests to find problems.
Also fix bug where use of the current option would always leak
an indexwriter lock.
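A minimal sketch of enabling the new mode at index creation (index name is made up), assuming it hangs off the existing `index.shard.check_on_startup` setting:
```js
PUT test
{
  "settings" : {
    "index.shard.check_on_startup" : "checksum"
  }
}
```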
Closes#9183
Histogram aggregation supports an 'offset' option to move bucket boundaries.
In a histogram with buckets of size X these can be moved from 0, X, 2X, 3X,...
by an offset value of Y to Y, X+Y, 2X+Y, 3X+Y... by using the 'offset' option.
The previous 'pre_offset' and 'post_offset' options are removed in favour of
the simplified 'offset' option.
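For illustration (field name is made up), buckets of size 10 shifted to start at 5, 15, 25, ...:
```js
{
  "aggs" : {
    "prices" : {
      "histogram" : {
        "field" : "price",
        "interval" : 10,
        "offset" : 5
      }
    }
  }
}
```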
Closes#9417 Closes#9505
Until now, there was no way to expose information about configured
transport profiles. This commit adds the ability to expose that
information in the TransportInfo class.
The channel as well as the netty pipeline handler now also contain
the profile they were configured for, as this information cannot be
extracted elsewhere.
In addition, each profile can now set its own publish host and port,
which might be needed in case of port forwarding or when using Docker.
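A sketch of what such a profile could look like in elasticsearch.yml (profile name and addresses are made up):
```
transport.profiles.client.port: 9500-9600
transport.profiles.client.publish_host: 192.168.1.10
transport.profiles.client.publish_port: 9500
```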
Closes#9134
This callback is executed only once, on the master node during an
index's creation. An exception thrown during this listener will cancel
the index creation.
This also adds checks in `IndicesClusterStateService` for the
indexService being null as well as if the `indicesService.createIndex`
throws an exception on data nodes after an index has already been
created.
#8720 introduced a timeout mechanism for ongoing recoveries, based on a last access time variable. In the many iterations on that PR the update of the access time was lost. This adds it back, including a test that should have been there in the first place.
Closes#9506
The query-cache has an optimization to not deserialize the bytes at the shard
level. However this is a bit fragile since it assumes that serialized streams
can be concatenated (which is not the case with shared strings) and also does
not update the QueryResult object that is held by the SearchContext, so you
need to make sure to use the right one.
With this change, the query cache just deserializes bytes into the QueryResult
object from the context.
Close#9500
Due to some unreleased refactorings we lost the persistence of
previously set values in MergePolicyProvider. This commit adds this
back and adds a simple unit test.
Closes#8890
Currently, doing a field lookup within a terms agg will restrict the
fields available to those within the types passed into the search
request. However, when doing sub aggs within a children agg, the
fields available should not be restricted to those of the search.
This change makes the field lookup use the index level mapper service.
The optimization we do in the HandlesStreamInput / Output
adds a lot of complexity for a rather unknown benefit. It tries
to compress commonly used strings and write ids instead. This
should rather be done on a lower level, if at all necessary for
the small messages we send over the network.
Today we have a dirty flag indicating that a refresh must
be executed. We also allow users to bypass this by setting
a force=true boolean on the refresh request / command. All
these flags are unneeded since the SearcherManager has all
the information to do the right thing if it's dirty or not.
This PR removes the optimization for auto generated ids.
Previously, when ids were auto generated by elasticsearch there was no
check to see if a document with the same id already existed and instead the new
document was only appended. However, due to lucene improvements this
optimization does not add much value. In addition, under rare circumstances it might
cause duplicate documents:
When an indexing request is retried (due to connection loss, node closed etc.),
then a flag 'canHaveDuplicates' is set to true for the indexing request
that is sent a second time. This was to make sure that even
when an indexing request for a document with an autogenerated id comes in,
we do not have to update unless this flag is set, and instead only append.
However, it might happen that for a retry or for the replication the
indexing request that has canHaveDuplicates set to true (the retried request) arrives
at the destination before the original request that has it set to false.
In this case both requests add a document and we end up with a duplicate.
This commit adds a workaround: remove the optimization for auto
generated ids and always update the document.
The assumption is that this will not slow down indexing by more than 10 percent,
see: http://benchmarks.elasticsearch.org/
closes#8788 closes#9468
Additionally, this setting can be specified in elasticsearch.yml if
desired, to pre-populate the list of methods to be added to the default
blacklist.
When making a change to this setting dynamically, the entire blacklist
is logged as well.
The `analyzer` setting is now the base setting, and `search_analyzer`
is simply an override of the search time analyzer. When setting
`search_analyzer`, `analyzer` must be set.
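For example (index, type and field names are made up):
```js
PUT test
{
  "mappings" : {
    "type" : {
      "properties" : {
        "title" : {
          "type" : "string",
          "analyzer" : "standard",
          "search_analyzer" : "whitespace"
        }
      }
    }
  }
}
```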
closes#9371
Using the `script.groovy.sandbox.method_blacklist_patch` setting, the
blacklist can be dynamically *added* to by specifying a comma-separated
list of methods (for example, "toString,size" would add .toString and
.size to the blacklist).
When the `script.groovy.sandbox.method_blacklist_patch` setting is
changed, the script cache is cleared to force new scripts to be
recompiled. Additionally the on-disk cache is cleared so that scripts in
the `config/scripts` directory are re-compiled as well.
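A sketch of patching the blacklist at runtime, assuming the setting is updated through the cluster settings API like other dynamic settings:
```js
PUT /_cluster/settings
{
  "transient" : {
    "script.groovy.sandbox.method_blacklist_patch" : "toString,size"
  }
}
```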
This also fixes an issue where script engines were injected more than
once, which can cause multiple instances of the script engine per node.
Extended_stats now displays the upper and lower bounds on standard deviations (e.g. avg +/- std).
The default is to show 2 std above/below, but this can be changed using the `sigma` parameter,
which accepts non-negative doubles.
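For example (field name is made up), requesting bounds at three standard deviations:
```js
{
  "aggs" : {
    "grade_stats" : {
      "extended_stats" : {
        "field" : "grade",
        "sigma" : 3
      }
    }
  }
}
```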
Closes#9356
To properly replicate, we currently stop flushing during recovery so we can replay the translog once the file copying is done. Once recovery is done, the translog will be flushed by a background thread that, by default, kicks in every 5s. In case of a recovery failure and a quick re-assignment of a new shard copy, we may fail to flush before starting a new recovery, causing it to deal with a potentially even longer translog. This commit makes sure we flush immediately when the ongoing recovery count goes to 0.
I also added a simple recovery benchmark.
Closes#9439
If an index is deleted during the initial state of the snapshot operation, the entire snapshot can fail with an NPE. This commit improves the handling of this situation and allows the snapshot to continue if partial snapshots are allowed.
Closes#9024
PR #8672 addresses ambiguous polygons - those that either cross the dateline or span the map - by complying with the OGC standard right-hand rule. Since ```GeoPolygonFilter``` is self contained logic, the fix in #8672 did not address the issue for the ```GeoPolygonFilter```. This was identified in issue #5968
This fixes the ambiguous polygon issue in ```GeoPolygonFilter``` by moving the dateline crossing code from ```ShapeBuilder``` to ```GeoUtils``` and reusing the logic inside the ```pointInPolygon``` method. Unit tests are added to ensure support for coordinates specified in either standard lat/lon or great-circle coordinate systems.
closes#5968 closes#9304
The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes).
This also adds settings for the timeout used when waiting for nodes.
Closes#9449
This change makes the response API object for Histogram Aggregations the same for all types of Histogram, and does the same for all types of Ranges.
The change removes getBucketByKey() from all aggregations except filters and terms. It also reduces the methods on the Bucket class to just getKey() and getKeyAsString().
The getKey() method returns Object and the actual type it returns will be appropriate for the type of aggregation being run. e.g. date_histogram will return a DateTime for this method and Histogram will return a Number.
Apparently some filesystems such as ZFS and occasionally NTFS can report
filesystem usages that are negative, or above the maximum total size of
the filesystem. This relaxes the constraints on `DiskUsage` so that an
exception is not thrown.
If 0 is passed as the totalBytes, `.getFreeDiskAsPercentage()` will
always return 100.0% free (to ensure the disk threshold decider fails
open)
Fixes#9249
Relates to #9260
This bug was introduced by #8454, which allowed the childFilter to only be consumed once. By buffering child doc ids, multiple buckets can now be emitted for the same doc id. This buffering only happens in the scope of the current root document, so the number of child doc ids buffered is small.
Closes#9317 Closes#9346
The fix is to move the parent filter resolution from the nextReader(...) method to the collect(...) method, because only then is any parent nested filter's parent filter properly instantiated.
Closes#9280 Closes#9335
We want to check that at least the primaries succeeded if we do not
wait for green, and that all succeeded if we do wait for green.
That was a misconception in c617af37e8
Requests are sent to two shard copies in case a shard is relocating.
This will show up in the `_shards` header. Therefore we must check
with greaterThanOrEqualTo(..).
Adding missing support for the multi-index query parameters 'ignore_unavailable',
'allow_no_indices' and 'expand_wildcards' to '_cluster/state' API. These
parameters are supposed to be supported for APIs that work across multiple indices.
So far overwriting the default settings per REST call was not possible which is
fixed here.
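For illustration (index pattern is made up):
```js
GET /_cluster/state/metadata/logs-*?expand_wildcards=closed&ignore_unavailable=true
```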
Closes#5229 Closes#9295
These two tests are confusing because they have the same class name in
different packages. This results in accidentally looking at the wrong
file when trying to open the test by class name. They are also
not "simple"..
Related to #9049.
By default, the default value for `_timestamp` is `now`, which means the date the document was processed by the indexing chain.
You can now reject documents which do not provide a `_timestamp` value by setting `ignore_missing` to false (defaults to `true`):
```js
{
"tweet" : {
"_timestamp" : {
"enabled" : true,
"ignore_missing" : false
}
}
}
```
When you update the cluster to 1.5 or master, an index created with 1.4 is automatically migrated to the 1.5 syntax.
Let's say you have defined this in elasticsearch 1.4.x:
```js
DELETE test
PUT test
{
"settings": {
"number_of_shards": 1,
"number_of_replicas": 0
}
}
PUT test/type/_mapping
{
"type" : {
"_timestamp" : {
"enabled" : true,
"default" : null
}
}
}
```
After migration, the mapping becomes:
```js
{
"test": {
"mappings": {
"type": {
"_timestamp": {
"enabled": true,
"store": false,
"ignore_missing": false
},
"properties": {}
}
}
}
}
```
Closes#8882.
At startup the tribe node ignores closed indices, but if you closed an index that was part of the tribe node cluster state, its state change was not currently handled. A NullPointerException could be seen in the logs instead as the routing table for the closed index was null. As a result, the index stayed in the tribe node cluster state in open state, although that didn't reflect reality. Also, subsequent cluster state updates happening in the tribe node kept failing, affecting updates related to any other index. The only way to recover from this was to restart the tribe node every time an index is closed on any tribe.
This commit properly handles index state changes, making sure that when an index gets closed it gets removed from the tribe node cluster state. Note that it makes little sense to keep the closed index around in the tribe node, as from the tribe node you can't do anything with it. The tribe node simply doesn't see any closed index, it's the same as if they didn't exist.
Closes#6411 Closes#9334
This allows a plugin or user that registers a listener to be able to
stop actions like creating an index or starting a shard by throwing an
exception. Previously all exceptions were logged without being rethrown.
Before, if both a filter and a query were defined for function_score, the
filter was silently ignored. Now, if both are defined, the function score
query wraps them in a filtered_query.
closes#8638 closes#8675
ShapeBuilder threw an NPE when a polygon coordinate array consisted of a single LinearRing. This PR fixes the error handling to throw a more useful ElasticsearchParseException to provide the user with better insight into the problem.
This adds a new boolean (index.merge.scheduler.auto_throttle) dynamic
setting, default true (matching Lucene), to adaptively set the IO rate
limit for merges over time.
This is more flexible than the previous fixed rate throttling because
it responds to the incoming merge rate, so search-heavy
applications that are not doing much indexing will see merges heavily
throttled while indexing-heavy cases will lighten the throttle so
merges can keep up with incoming indexing.
The fixed rate throttling is still available as a fallback if things
go horribly wrong.
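A sketch of disabling the adaptive throttle on a single index (index name is made up), given the setting is dynamic:
```js
PUT /test/_settings
{
  "index.merge.scheduler.auto_throttle" : false
}
```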
Closes#9243 Closes#9133
The previous push was partial by mistake; we still need the wrapped dirs around after being closed for the test infra. For now, explicitly clear them in the leak test (which is still a bad apple).
I ran the bad apple test for index memory leaks and still saw leaks; it seems like we don't properly clean the dirs from the static mock test dir wrapper.
validation tests for constants
Currently the snapshot flag for Version constants is only set to true
for CURRENT. However, this means that the snapshot state changes from
branch to branch. Instead, snapshot should answer "has this version
been released yet?". This change also adds a validation test checking that
ID -> constant and vice versa are correct, and fixes one bug found there
(for an unreleased version).
- don't allow for soft references anymore in the recycler
- remove some abusive thread locals
- don't recycle float/double and int/long pages independently, they are the
same and just interpret bits differently.
Close#9272
The query cache has a mechanism that disables it automatically when
SearchContext.nowInMillis() is used. One issue with that is that the date math
parser always evaluates the current timestamp when parsing a date, even if it
is not needed. As a consequence, whenever you use a date expression in your
queries, the query cache would not be used.
Close#9225
This change fixes _timestamp's serialization method to write out
`doc_values` and `doc_values_format`, which could already be set,
but would not be written out.
closes#8893 closes#8967
This commit adds a test that simulates disconnecting nodes and dropping requests during the various stages of recovery and solves all the issues that were raised by it. In short:
1) Ongoing recoveries will be scheduled for retry upon network disconnect. The default retry period is 5s (cross node connections are checked every 10s by default).
2) Sometimes the disconnect happens after the target engine has started (but the shard is still in recovery). For simplicity, I opted to restart the recovery from scratch (where little to no files will be copied again, because they were just synced).
3) To protect against dropped requests, a Recovery Monitor was added that fails a recovery if no progress has been made in the last 30m (by default), which is equivalent to the long timeouts we use in recovery requests.
4) When a shard fails on a node, we try to assign it to another node. If no such node is available, the shard will remain unassigned, causing the target node to clean any in-memory state for it (files on disk remain). At the moment the shard will remain unassigned until another cluster state change happens, which will re-assign it to the node in question, but if no such change happens the shard will remain stuck at unassigned. The commit adds an extra delayed reroute in such cases to make sure the shard will be reassigned.
5) Moved all recovery related settings to the RecoverySettings.
Closes#8720
You can now specify `format` in the request definition for most numeric metric aggregations. The exceptions are Percentile_Ranks, Cardinality and Value_Count as the response type of these can be different from the field type so the formatter won't work.
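For example (field name is made up; the exact pattern syntax follows the usual numeric format strings):
```js
{
  "aggs" : {
    "avg_price" : {
      "avg" : {
        "field" : "price",
        "format" : "#.00"
      }
    }
  }
}
```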
Closes#6812
I have a field with a `null` [default `_timestamp` value](http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/mapping-timestamp-field.html#mapping-timestamp-field-default) and when I try to update the mapping I get a server error caused by a `NullPointerException`
```
[2015-01-08 17:28:56,040][DEBUG][action.admin.indices.mapping.put] [...] failed to put mappings on indices [[feed_170_v1, feed_204_v1, feed_229_v1, feed_232_v1, feed_239_v1, feed_248_v1, feed_268_v1, feed_256_v1, feed_272_v1, feed_159_v1, feed_255_v1, feed_164_v1, feed_259_v1, feed_266_v1, feed_188_v1, feed_240_v1, feed_233_v1, feed_13_v1, feed_184_v1, feed_261_v1, feed_267_v1, feed_271_v1, feed_257_v1, feed_172_v1, feed_238_v1, feed_254_v1, feed_223_v1, feed_274_v1, feed_203_v1, feed_269_v1, feed_262_v1, feed_205_v1, feed_168_v1, feed_219_v1, feed_253_v1, feed_251_v1, feed_173_v1, feed_252_v1, feed_210_v1, feed_216_v1, feed_218_v1, feed_118_v1, feed_273_v1, feed_227_v1, feed_166_v1, feed_213_v1, feed_226_v1]], type [history]
java.lang.NullPointerException
at org.elasticsearch.index.mapper.internal.TimestampFieldMapper.merge(TimestampFieldMapper.java:287)
at org.elasticsearch.index.mapper.object.ObjectMapper.merge(ObjectMapper.java:936)
at org.elasticsearch.index.mapper.DocumentMapper.merge(DocumentMapper.java:693)
at org.elasticsearch.cluster.metadata.MetaDataMappingService$4.execute(MetaDataMappingService.java:508)
at org.elasticsearch.cluster.service.InternalClusterService$UpdateTask.run(InternalClusterService.java:329)
at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:153)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:745)
```
https://github.com/elasticsearch/elasticsearch/blob/v1.4.2/src/main/java/org/elasticsearch/index/mapper/internal/TimestampFieldMapper.java#L286
Looks like the existence of the default timestamp is not checked before use. The next line also has the same issue -- it uses the default timestamp without checking whether it's null.
To reproduce:
```
$ curl -XPUT localhost:9200/twitter2
$ curl -XPUT localhost:9200/twitter2/tweet/_mapping -d '{
"tweet" : {
"_timestamp" : {
"enabled" : true,
"default" : null
}
}
}'
$ curl -XPUT localhost:9200/twitter2/tweet/_mapping -d '{
"tweet" : {
"_timestamp" : {
"enabled" : true,
"default" : null
},
"properties": {
"user": {"type": "string"}
}
}
}'
```
Closes#9204.
(cherry picked from commit 62c6d63)
Before Elasticsearch 1.0, the type was allowed to be passed as the root
element when uploading a document. However, this was ambiguous if the
mappings also contained a field with the same name as the type. The
behavior was changed in 1.0 to not allow this, but a setting was added
for backwards compatibility. This change removes the setting for 2.0.
The header indicates how many shard copies (primary and replica shards) a write was supposed to go to, on how many
shard copies the write succeeded, and potentially captures shard failures if writing into a replica shard fails.
For async writes it also includes the number of shards a write is still pending.
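An index response would then look roughly like this (values are made up):
```js
{
  "_index" : "test",
  "_type" : "type",
  "_id" : "1",
  "_version" : 1,
  "_shards" : {
    "total" : 2,
    "successful" : 2,
    "failed" : 0
  }
}
```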
Closes#7994
This commit removes most of the Engine abstractions and removes
Engine exposure via dependency injection. It also removes the Holder
abstraction and makes the engine itself start at construction time.
It removes the start method from the engine entirely, which means no engine
instance exists until it's started. There is also no way to stop the
engine to restart; it needs to be an entirely new Engine.
The multi percolate shard responses are collected in an atomic array which uses the shard id as index, but the number of shards the multi percolate request was meant to go to was used as the size of this array instead of the total number of shards the index has. This caused the exception when routing was used.
Closes#6214
Once we delete the index on a node we are closing all resources
and subsequently need to delete all shard contents from disk. Yet
this happens today under a lock (the shard lock) that needs to be
acquired in order to execute any operation on the shard's data
path. We try to delete all the index meta-data once we have acquired
all the shard locks, but this operation can run into a timeout, which causes
the index to remain on disk. Further, all shard data will be left on
disk if the timeout is reached.
This commit removes all the shard data just before the shard lock
is released, as the last operation on a shard that belongs to a deleted
index.
This commit adds a verbose flag to the _segments api. Currently the
only additional information returned when set to true is the full
ram tree from lucene for each segment.
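For example (index name is made up):
```js
GET /test/_segments?verbose=true
```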
The `cluster.routing.allocation.balance.primary` setting has caused
a lot of confusion in the past while it has very little benefit from a
shard allocation point of view. Users tend to modify this value to
evenly distribute primaries across the nodes, which is dangerous since
the primary flag on its own can trigger relocations. The primary flag for a shard
should not have any impact on cluster performance unless the high level feature
suffering from primary hotspots is buggy. Yet, this setting was intended to be a
tie-breaker, which is not necessary anymore since the algorithm is deterministic.
This commit removes this setting entirely.
In some situations the shard balancing weight delta becomes negative. Yet,
a negative delta is always treated as `well balanced` which is wrong. I wasn't
able to reproduce the issue in any way other than using the real world data
from issue #9023. This commit adds a fix for absolute deltas as well as a base
test class that allows building tests or simulations from the cat API output.
Closes#9023
A couple of changes that triggered a refactoring in Elasticsearch:
- LUCENE-6148: Accountable.getChildResources returns a collection instead of
a list.
- LUCENE-6121: CachingTokenFilter now propagates reset(); as a result
SimpleQueryParser.newPossiblyAnalyzedQuery has been fixed to not reset both
the underlying stream and the wrapper (otherwise lucene would barf because of
a double reset).
- LUCENE-6119: The auto-throttle issue changed a couple of method
names/parameters. It also made
`UpdateSettingsTests.testUpdateMergeMaxThreadCount` dead slow, so I muted this
test until we clean up merge throttling to use LUCENE-6119.
Close#9145
In the case where you try to merge two settings together, one being an array and one being
a field, the settings were merged instead of being overridden.
First config
my.value: 1
Second config
my.value: [ 2, 3 ]
If you execute
settingsBuilder().put(settings1).put(settings2).build()
now only values 2,3 will be in the final settings
Closes#8381
If a bulk index request fails due to a disconnect, unavailable shard etc., the request is
retried once before actually failing. However, even in case of failure the documents
might already be indexed. For autogenerated ids the request must not add the
documents again and therefore canHaveDuplicates must be set to true.
closes#8788
A recent situation occurred where a MultiPolygon coordinate array was accidentally defined as a single polygon with multiple holes. Since the intent was a MultiPolygon, the holes of the unintended Polygon fell outside the outer shell. This exposed a bug in the contains logic inside BasePolygonBuilder. An ArrayIndexOutOfBoundsException was being thrown instead of a more useful ElasticsearchParseException( "hole is not within polygon" ). This pull request fixes the bug and adds additional unit tests for verifying proper MultiPolygon type parsing.
closes#9071
If we reopen an index and the majority of the replicas were
not created, the reopen will fail since on master this runs with the
local gateway all the time.
ShapeBuilder expected coordinates for Envelope types in strict Top-Left, Bottom-Right order. Given that GeoJSON does not enforce coordinate order (as seen in #8672) clients could specify envelope bounds in any order and be compliant with the GeoJSON spec but not the ES ShapeBuilder logic. This change loosens the ShapeBuilder requirements on envelope coordinate order, reordering where necessary.
closes#2544 closes#9067 closes#9079 closes#9080
Cleans up the testReusePeerRecovery test as well
The actual fix is in TransportNodesListShardStoreMetaData.java, which
needs to use `nodeEnv.shardDataPaths` instead of `nodeEnv.shardPaths`.
Due to the difficulty in tracking this down, I've added a lot of
additional logging. This also fixes a logging issue in GatewayAllocator
This allows specifying the path an index will be at.
`index.data_path` is specified in the settings when creating an index,
and can not be dynamically changed.
An example request would look like:
POST /myindex
{
"settings": {
"number_of_shards": 2,
"data_path": "/tmp/myindex"
}
}
And would put data in /tmp/myindex/0/index/0 and /tmp/myindex/0/index/1
Since this can be used to write data to arbitrary locations on disk, it
requires enabling the `node.enable_custom_paths` setting in
elasticsearch.yml on all nodes.
Relates to #8976
Running a terms filter on a single term is equivalent to loading a postings
list into a bit set and then returning the bit set instead of reading the
postings list on the fly.
Close#9014
This feature adds an optional orientation parameter to the GeoJSON document and geo_shape mapping enabling users to explicitly define how they want Elasticsearch to interpret vertex ordering. The default uses the right-hand rule (counterclockwise for outer ring, clockwise for inner ring) complying with OGC Simple Feature Access standards. The parameter can be explicitly specified for an entire index using the geo_shape mapping by adding "orientation":{"left"|"right"|"cw"|"ccw"|"clockwise"|"counterclockwise"} and/or overridden on each insert by adding the same parameter to the GeoJSON document.
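A sketch of setting the default per mapping (index, type and field names are made up):
```js
PUT test
{
  "mappings" : {
    "type" : {
      "properties" : {
        "location" : {
          "type" : "geo_shape",
          "orientation" : "ccw"
        }
      }
    }
  }
}
```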
closes#8764
When I originally wrote the transform feature I didn't think that the
XContentType of the reencoded source mattered. It actually matters because
payloads for the completion suggester are stored and returned exactly
as encoded by this XContentType.
This revision changes the transform feature from always reencoding with smile
to always reencoding with the provided XContentType to support the completion
suggester.
Closes#8959
This commit adds the support for the Ctrl-Close event on Windows using native system calls. This way, it is possible to catch the Ctrl-Close event sent by a 'taskkill /pid' command (or when the user closes the console window where elasticsearch.bat was started) and gracefully close the node. Before this commit, the node was simply killed on taskkill/window closing.
There was a race condition in the test in the case where the nodes fault detection would manage to send an initial ping, followed by 2 attempts, before the target service was disconnected.
RecoveryTarget initiates the recovery by sending a start recovery request to the source node and then waits for the recovery to complete. During recovery cancellation, we interrupt the thread so it will wake up and clean the recovery. Depending on timing, this can leave an unneeded interrupted thread status, causing future IO commands to fail unnecessarily.
RecoverySource already had a handy utility called CancellableThreads. This extracts it to a top level class, and uses it in RecoveryTarget as well.
Closes#9000
Up to now, all filters could be cached using the `_cache` flag that could be
set to `true` or `false` and the default was set depending on the type of the
`filter`. For instance, `script` filters are not cached by default while
`terms` are. For some filters, the default is more complicated and eg. date
range filters are cached unless they use `now` in a non-rounded fashion.
This commit adds a 3rd option called `auto`, which becomes the default for
all filters. So for all filters a cache wrapper will be returned, and the
decision will be made at caching time, per-segment. Here is the default logic:
- if there is already a cache entry for this filter in the current segment,
then return the cache entry.
- else if the doc id set cannot iterate (eg. script filter) then do not cache.
- else if the doc id set is already cacheable and it has been used twice or
more in the last 1000 filters then cache it.
- else if the filter is costly (eg. multi-term) and has been used twice or more
in the last 1000 filters then cache it.
- else if the doc id set is not cacheable and it has been used 5 times or more
in the last 1000 filters, then load it into a cacheable set and cache it.
- else return the uncached set.
So for instance geo-distance filters and script filters are going to use this
new default and are not going to be cached because of their iterators.
Similarly, date range filters are going to use this default all the time, but
it is very unlikely that those that use `now` in a not rounded fashion will get
reused so in practice they won't be cached.
`terms`, `range`, ... filters produce cacheable doc id sets with good iterators
so they will be cached as soon as they have been used twice.
Filters that don't produce cacheable doc id sets such as the `term` filter will
need to be used 5 times before being cached. This ensures that we don't spend
CPU iterating over all documents matching such filters unless we have good
evidence of reuse.
One last interesting point about this change is that it also applies to compound
filters. So if you keep on repeating the same `bool` filter with the same
underlying clauses, it will be cached on its own while up to now it used to
never be cached by default.
`_cache: true` has been changed to only cache on large segments, in order to not
pollute the cache since small segments should not be the bottleneck anyway.
However `_cache: false` still has the same semantics.
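Since `auto` is now the default, spelling it out is only needed for clarity; a sketch (field and values are made up):
```js
{
  "query" : {
    "filtered" : {
      "query" : { "match_all" : {} },
      "filter" : {
        "terms" : {
          "tags" : ["search", "lucene"],
          "_cache" : "auto"
        }
      }
    }
  }
}
```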
Close#8449
Add a new ignore_idle_threads boolean option (default true) to
/_nodes/hot_threads, to filter out threads in known idle places like
waiting on a socket select or on pulling the next task from an empty
queue.
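For example, to see the idle threads anyway:
```js
GET /_nodes/hot_threads?ignore_idle_threads=false
```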
Closes#8985 Closes#8908
This commit adds support for version and version_type to the Term Vectors API.
This could be useful in the following case whereby the user gets a document
and later wants to generate its TVs. With version, this would ensure that only
the TVs of that particular document are generated, and error out if the
document has been updated in between.
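A sketch of the use case (index, type and id are made up; the endpoint and parameter names follow the usual version/version_type convention):
```js
GET /test/type/1/_termvectors?version=1&version_type=external
```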
Closes#7480
This commit adds the logic necessary for supporting polygon vertex ordering per OGC standards. Exterior rings will be treated in ccw (right-handed rule) and interior rings will be treated in cw (left-handed rule). This feature change supports polygons that cross the dateline, and those that span the globe/map. The unit tests have been updated and corrected to test various situations. Greater test coverage will be provided in future commits.
Addresses #8672
This feature branch implements OGC compliance for Polygon/Multi-polygon. That is, vertex order for the exterior ring follows the right-hand rule (ccw) and all holes follow the left-hand rule (cw). While GeoJSON imposes no restrictions, a user that wants to specify a complex poly across the dateline must do so in compliance with the OGC spec, otherwise a polygon that spans the globe will be assumed.
Reference issue #8672
Fix orientation of outer and inner ring for polygon with holes. Updated unit tests. Bug exists in boundary condition on negative side of dateline.
This provides a fix to issue #7644. A new Stats object must be created, and
not a reference to the retrieved stats, before we can add stats to it.
Otherwise, we would keep on adding to the same object on subsequent calls to
IndicesStatsResponse#getPrimaries() or IndicesStatsResponse#getTotal().
Closes#7644 and #8950
The "compressed" format was removed, so this caused warnings in the log
like:
```
[WARN ][index.fielddata ] [node_0] [test] failed to find format
[compressed] for field [test-num], will use default
```
Now that we do not automatically call .cleanUp() when clearing the field
data cache, we need to call it after the cache clear in
RandomExceptionCircuitBreakerTests
The setting `mapping.date.round_ceil` (and the undocumented setting
`index.mapping.date.parse_upper_inclusive`) affect how date ranges using
`lte` are parsed. In #8556 the semantics of date rounding were
solidified, eliminating the need to have different parsing functions
whether the date is inclusive or exclusive.
This change removes these legacy settings and improves the tests
for the date math parser (now at 100% coverage!). It also removes the
unnecessary function `DateMathParser.parseTimeZone` for which
the existing `DateTimeZone.forID` handles all use cases.
Any user previously using these settings can refer to the changed
semantics and change their query accordingly. This is a breaking change
because even dates without datemath previously used the different
parsing functions depending on context.
closes#8598 closes#8889
In cases of heavy contention, it's possible for more than 2 threads
to race to a circuit breaking exception.
Essentially this means that if we have 3 threads all trying to add 3 and
simultaneously cause a circuit breaking exception (due to retry), when
adjusting after circuit breaking we can "rewind" past what this test
expects the child breaker to be at.
This adds leeway into the check, where it's okay to be within
NUM_THREADS from the parentLimit, because each thread should only add 1
to the breaker at a time.
We have only had a single gateway since es 1.3. There is no need to keep all
these abstractions and nested packages. We can fold most of it into simpler
structures.
IndexEngine was an abstraction where we had index-level engines (instead
of shard-level) that could store meta information about the index. It
was never actually used by Elasticsearch, and only there for plugins.
This removes it, because it is a confusing abstraction and not needed,
no plugins should be implementing their own IndexEngines.
When a node fails (or closes), the master processes the network disconnect event and removes the node from the cluster state. If multiple nodes fail (or shut down) in rapid succession, we process the events and remove the nodes one by one. During this process, the intermediate cluster states may cause the node fault detection to signal the failure of nodes that are not yet removed from the cluster state. While this is fine, it currently causes unneeded reroutes and cluster state publishing, which can be cumbersome in big clusters.
Closes#8804 Closes#8933
When we close a node all pending / active search requests need to be
cleared, otherwise a node will wait up to 30 sec for shutdown since there
could be open scroll requests. This behavior was introduced in 1.5 such that
versions <= 1.4.x are not affected.
Closes#8940
Occasionally the join thread successfully connected to a just-closed node, which causes the subsequent join request to time out. Its default timeout of 60s throws the test off when it waits for a cluster to form.
Calling cache.cleanUp() is kind of like calling System.gc(), meaning
that we should never have (non-test) things that rely on this
functionality.
For the field data and filter cache, we already have a periodic process
that runs this .cleanUp(), so there is no need to block index
closing/clearing on it. Instead, we can clean the field data cache in
InternalTestCluster before we check the circuit breaker.
This can help tests that time out because cleaning the cache is taking
too long
We have lots of boilerplate code that is unnecessarily abstracting
services, i.e. InternalIndexShard and IndexShard or InternalIndexService and
IndexService. It's enough to have concrete classes for these core classes.
Closes#8904
This change adds a 'http.publish_port' setting to the HTTP module to configure
the port which HTTP clients should use when communicating with the node. This
is useful when running on a bridged network interface or when running behind
a proxy or firewall.
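For example, in elasticsearch.yml (port is made up):
```
http.publish_port: 8080
```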
Closes#8807 Closes#8137
Upgrades lucene to latest, and supports the BEST_COMPRESSION parameter
now supported (with backwards compatibility, etc) in Lucene.
This option uses deflate, tuned for highly compressible data.
index.codec::
The default value compresses stored data with LZ4 compression, but
this can be set to best_compression for a higher compression ratio,
at the expense of slower stored fields performance.
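For example, at index creation time (index name is made up):
```js
PUT test
{
  "settings" : {
    "index.codec" : "best_compression"
  }
}
```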
IMO it's safest to implement as a named codec here, because ES already
has logic to handle this correctly, and because it's unrealistic to have
a plethora of options to Lucene's default codec... we are practically
limited in Lucene to what we can support with back compat, so I don't
think we should overengineer this and add additional unnecessary plumbing.
See also:
https://issues.apache.org/jira/browse/LUCENE-5914
https://issues.apache.org/jira/browse/LUCENE-6089
https://issues.apache.org/jira/browse/LUCENE-6090
https://issues.apache.org/jira/browse/LUCENE-6100
Closes#8863
This fix adds better error handling for parsing multipoint, linestring, and polygon GeoJSONs. The current logic throws an NPE when parsing a multipoint, linestring, or polygon that does not comply with the GeoJSON specification. That is, if a user provides a single coordinate instead of an array of coordinates, or an array of linestrings, the ShapeParser throws an NPE wrapped in a SearchParseException instead of a more useful error message.
Closes#8432
After the refactoring in #8784 some settings didn't get passed to the
actual engine and there exists a race if the settings are updated while
the engine is started such that the actual starting engine doesn't see
the latest settings. This commit fixes the concurrency issue as well as
adds tests to ensure the settings are reflected.
After upgrading, a shard might start relocating again. If there are no
replicas, the cluster state of a node might not be up to date for
a few milliseconds and direct a search request to a node that does not
have the shard anymore. This results in the following test failures:
1> java.lang.AssertionError: Count is 99 but 101 was expected. Total shards: 13 Successful shards: 12 & 0 shard failures:
1> __randomizedtesting.SeedInfo.seed([1932F73B458703CA:6F4FAD3DAC55591C]:0)
1> [...org.junit.*]
1> org.elasticsearch.test.hamcrest.ElasticsearchAssertions.assertHitCount(ElasticsearchAssertions.java:184)
1> org.elasticsearch.bwcompat.BasicBackwardsCompatibilityTest.testIndexRollingUpgrade(BasicBackwardsCompatibilityTest.java:358)
Waiting for relocation finished should fix this.
Modifications to LoggingListener pushed with #8820 caused the original logger levels not to be reset after modifications, as the new state was saved for restore instead of the previous one.
Added unit tests for LoggingListener as well.
Closes#8845
Restrict use of java.io.File to 5 methods (excluded from the ban), but otherwise ban it.
This is a prerequisite to do any mocking here.
I don't try to do any heavy cleanup on these tests, I am not familiar with them.
So this is mostly a rote straightforward conversion.
Closes#8836
This is a start to exposing memory stats improvements from Lucene 5.0.
This adds the following categories of Lucene index pieces to index stats:
* Terms
* Stored fields
* Term Vectors
* Norms
* Doc values
This commit moves the engine's reference to the store out of the actual
implementation into the holder, since the holder manages the actual lifecycle.
Engine-internal references, like per-searcher or per-recovery ones, are kept inside
the actual implementation since they have a different lifecycle.
Once the current engine is started you can only close it once. Once closed, the engine cannot be started again. This commit adds a stop method which signals the engine to free its resources, but in a way that allows restarting.
This is done by introducing InternalEngineHolder, which is a wrapper around InternalEngine. This allows adding the stop() method without adding complexity to the engine implementation. InternalEngineHolder also serves as an entry point for listeners (incoming and outgoing) to other ES components, which removes the need to add/remove them if the engine is stopped.
Closes#8784
Today we try to fetch a shard Id for a given IndexReader / LeafReader
by walking its tree until the lucene internal SegmentReader and then
casting the directory into a StoreDirectory. This class is fully internal
to Elasticsearch and should not be exposed outside of the Store.
This commit makes StoreDirectory a private inner class and adds dedicated
ElasticsearchDirectoryReader / ElasticsearchLeafReader classes exposing a ShardId
getter to obtain information about the shard the index / segment belongs to.
These classes can be used to expose other segment specific information in
the future more easily.
When a scoring script returns something that is not a number, the current message is confusing (IllegalArgumentException[docID must be >= 0 and < maxDoc=3 (got docID=2147483647)]). This commit adds the error message ScriptException[script score function returns a wrong score: NaN].
Closes#2426
Previously it was possible for the field data clearing in this test to
take too long, causing the test to time out.
This also switches to using `scaledRandomIntBetween` for the number of
fields.
The recovery diff can return files in the `different` category
since it's conservative if it can't tell whether the files are the same.
Yet, the cleanup code only needs to ensure both ends of the recovery
are consistent. If we have a very old segments_N file no checksum is present,
but for the delete files it might be such that the segments file passes
the consistency check while the .del file doesn't, since it's in fact the same;
this check was missing in the last commit.
Original indices are optional in ShardDeleteByQueryRequest only for backwards compatibility, see #7406. We can remove this in master since 2.0 will require a full cluster restart.
Closes#8777
This commit factors out the PID file creation from bootstrap and adds
tests for error conditions etc. We also can't rely on DELETE_ON_CLOSE
since it might not even write the file depending on the OS and JVM implementation.
This impl uses a shutdown hook to best-effort remove the pid file if it was written.
Closes#8771
This commit adds a very lightweight action to the transport
service that allows fetching the cluster name and the discovery node
from a node. This is used by transport clients to test the liveness of
a node without using the nodes info API, which can be blocking if management
threadpools are busy.
Closes#8763
The bwc layer added with #7105 is not needed in master as a full cluster restart will be required, thus from 2.0 on the only supported action names are compliant with the defined conventions and don't need to be converted to the old format.
Closes#8758
REST tests are being shuffled before their execution. To guarantee their repeatability given the seed, their order needs to be always the same before the shuffling happens.
Closes#8745
The conversion to the Path API doesn't work if the path points
to a file inside a JAR, like a config. Such paths must be read
while the ZIP filesystem is open, which can't be guaranteed across
the board. This commit reverts the relevant changes back to java.net.URL
and adds a util method to read UTF-8 encoded files from URLs correctly.
This commit cuts over all of core (not quite all tests) to java.nio.Path.
It also adds the File class to the core forbidden APIs to prevent its usage.
This commit also resolves #8254 since we now consistently use the NIO Path
API. The changes in this commit allow for more information if IO operations fail,
since the NIO API throws exceptions instead of boolean return values. The built-in
methods used in this commit are also more resilient to encoding errors like
unmappable characters and throw exceptions if those chars are present in a file.
Closes#8254 Closes#8666
Today we don't check if the recovery target has all the
files that we expect there after the recovery. This commit
adds additional safety to ensure all files are present with the
correct checksums on recovery finalization.
Closes#8723
Today we wait 500ms before we retry a recovery if the target node is not ready.
This happens if the source starts the recovery before the target has
processed the clusterstate moving the target shard into the right state.
This can cause a 500ms delay each time it happens while the shard is ready
way earlier on the target node. This commit makes this delay configurable
to mainly speed up test processing and shard allocation in tests.
Inner hits allows to embed nested inner objects, children documents or the parent document that contributed to the matching of the returned search hit as inner hits, which would otherwise be hidden.
Closes#8153Closes#3022Closes#3152
Some QueryBuilders are missing or have different naming than the other ones.
This patch is applied to branch 1.x and master (elasticsearch 1.5 and 2.0); a short sketch of the new builder names follows the lists below:
Added
-----
* `templateQuery(...)`
* `commonTermsQuery(...)`
* `queryStringQuery(...)`
* `simpleQueryStringQuery(...)`
Deprecated
----------
* `commonTerms(...)`
* `queryString(...)`
* `simpleQueryString(...)`
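For illustration, the renamed builders side by side (a sketch; the surrounding class is just scaffolding):

```java
import org.elasticsearch.index.query.QueryBuilder;
import static org.elasticsearch.index.query.QueryBuilders.queryStringQuery;
import static org.elasticsearch.index.query.QueryBuilders.simpleQueryStringQuery;

public class QueryBuilderNaming {
    // the new names match the query names used in the REST API; the old
    // queryString(...) / simpleQueryString(...) variants are now deprecated
    QueryBuilder qs  = queryStringQuery("title:elasticsearch");
    QueryBuilder sqs = simpleQueryStringQuery("+elasticsearch -deprecated");
}
```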
Adds an `ignore_like` parameter to the MLT Query, which simply tells the
algorithm to skip all the terms from the given documents. This could be useful
in order to better guide nearest neighbor search by telling the algorithm to
never explore the space spanned by the given `ignore_like` docs. In essence, we
are interested in the characteristics of a given item, but not in those
provided by `ignore_like`, thereby forcing the algorithm to go deeper in its
selection of terms. Note that this is different than simply performing a must
not boolean query on the unliked items. The syntax is exactly the same as the
`like` parameter.
Closes#8674
function_score matched each document regardless of the computed score.
This commit adds a query parameter `min_score` (default: -Float.MAX_VALUE).
Documents that have a score lower than this threshold will not be matched.
closes#6952
Artificial documents get assigned a random id. When `include` is set to false
(the default), the ids of these documents were still being included, when they
should rather be ignored.
Closes#8679
We introduced the @Rest annotation a while ago for REST tests (see #7795); we then have to make sure that the relevant info to reproduce failures gets printed out for any test that is marked with this annotation, not only for ElasticsearchRestTests
Closes#8680
Make it possible to run multiple tests with unicast configuration, by assigning ports based on their test scope.
Every jvm still gets its own port range based on the jvm id, but we now make sure that the different jvms ranges never overlap. The global cluster gets a reserved port range, while SUITE and TEST scopes are treated equally, just assuming that they never run concurrently on the same jvm, thus ports can be safely reused.
Closes#8634
The default settings that are currently applied to the transport client are about discovery and gateway, modules that are not even loaded on the transport client. We can now remove the local gateway as it's not the default one anyway. Also, make sure that the discovery setting is only applied to the node, as it is not relevant for transport client.
Closes#8653
Today, you can turn on lucene.iw TRACE logging, but that produces tons
of output. This change breaks out separate lucene.iw.ifd and
index.store.deletes logger components (TRACE, disabled by default) to
see which part of Elasticsearch is deleting index files.
Closes#8662Closes#8603
We don't have to set XLimit and YLimit depending on the level (even or odd), since semantics of x and y are already swapped on each level.
XLimit is always 7 and YLimit is always 3.
Close#8526
- Update pom to 2.4.3 from 2.4.2
- Enable the CBOR data header (aka tag) from the CBOR Generator to provide binary identification like the Smile format
- Check for the CBOR header and ensure that the data sent in represents a "major type" that is an object
- Cleans up `JsonVsCborTests` unused imports
Scripts currently share the same list across invocations of getValues. This
caused a bug in script fields where all documents coming from the same segment
would get the same values (basically, those of the next document for which script
values have been requested). Scripts now return a fresh new list on every
invocation of `getValues`.
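A minimal sketch of the fixed behavior, with illustrative types rather than the actual script field classes:

```java
import java.util.ArrayList;
import java.util.List;

class ScriptValuesSketch {
    // Before the fix, one shared list was mutated and returned on every call, so
    // values handed out for one document changed when the next document was read.
    // Returning a fresh list per invocation keeps earlier results intact.
    List<Object> getValues(Iterable<?> docValues) {
        List<Object> values = new ArrayList<>();
        for (Object value : docValues) {
            values.add(value);
        }
        return values;
    }
}
```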
Close#8576
This commit moves all the Translog related code over to the
NIO2 Path API. It also makes transaction logs write-once, since a
translog file is never reused.
Closes#8611
While in a perfect world we should only ever have 2 circuit breaker
trips, it's possible to get a race condition between the child and the
parent breaker with many threads. Since multiple breaking exceptions are
not actually a bad thing, it's okay to relax the constraints in the
test.
The race conditions are due to no locking inside the breaker logic, to
ensure that it is as low overhead as possible. Even though no locking is
used, we use atomic counters internally to ensure that the "estimated"
numbers for the breakers are never out of sync (which this test still
checks with no leeway).
This allows arbitrary properties to be retrieved from an aggregation tree. The property is specified using the same syntax as the
order parameter in the terms aggregation. If a property path contains a multi-bucket aggregation, the property values from each bucket will be returned in an array.
Today, Elasticsearch has a separate merge thread pool checking once
per second (by default) if any merges are necessary, but this is no
longer necessary since we can and do now tell Lucene's
ConcurrentMergeScheduler never to "hard pause" threads when merges
fall behind, since we do our own index throttling.
This change goes back to letting Lucene launch merges as needed, and
removes these two expert settings:
index.merge.force_async_merge
index.merge.async_interval
Now merges kick off immediately instead of waiting up to 1 second
before running.
Closes#8643
This commit ensures that a restore operation with wait_for_completion=true doesn't return until all successfully restored shards are started. Before, it was returning as soon as the restore operation was over, which caused some shards to be unavailable immediately after restore completion.
Fixes#8340
Our iterator over global ordinals is currently incorrect since it does NOT
return -1 (NO_MORE_ORDS) when all ordinals have been consumed. This bug does
not strike immediately with elasticsearch since we always consume ordinals in
a random-access fashion. However it strikes when consuming ordinals through
Lucene helpers such as DocValues#docsWithField.
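For reference, Lucene-style consumers exhaust ordinals roughly like this (a sketch against the Lucene 4.x SortedSetDocValues API):

```java
import org.apache.lucene.index.SortedSetDocValues;

class OrdinalConsumerSketch {
    // the iterator contract requires nextOrd() to eventually return
    // NO_MORE_ORDS (-1); our global ordinal view failed to do so
    long countOrds(SortedSetDocValues ords, int docId) {
        ords.setDocument(docId);
        long count = 0;
        while (ords.nextOrd() != SortedSetDocValues.NO_MORE_ORDS) {
            count++;
        }
        return count;
    }
}
```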
Close#8580
This fix adds a simple consistency check that intersection edges appear pairwise. Polygonal boundary tests were passing (false positive) on the Eastern side of the dateline simply due to the initial order (edge direction) of the intersection edges. Polygons in the Eastern hemisphere (which were not being tested) were correctly failing inside of JTS due to an attempt to connect incorrect intersection edges (that is, edges that were not even intersections). While this patch fixes issue/8467 (and adds broader test coverage) it is not intended as a long term solution. The mid term fix (in work) will refactor all geospatial computational geometry to use ENU / ECF coordinate systems for higher accuracy and eliminate brute force mercator checks and conversions.
Closes#8467
Always use the LocalGateway* equivalents
We already check in the LocalGateway whether a node is a client node, or
is not master-eligible, and skip writing the state there. This allows us
to remove this code that was previously used only for tribe nodes (which
are not master eligible anyway and wouldn't write state) and in
tests (which can shake more bugs out)
Elasticsearch no longer unlocks the Lucene index on startup (this was
dangerous, and could possibly lead to corruption).
Added the new serbian_normalization TokenFilter from Lucene.
NoLockFactory is no longer supported (index.store.fs.fs_lock = none),
and if you have a typo in your fs_lock you'll now hit a StoreException
instead of silently using NoLockFactory.
Closes#8588
We started to use the lucene CRC32 checksums instead of the legacy Adler32
in `v1.3.0` which was the first version using lucene `4.9.0`. We can safely
assume that if the segment was written with this version that checksums
from lucene can be used even if the legacy checksum claims that it has an Adler32
for a given file / segment.
Closes#8587
Conflicts:
src/main/java/org/elasticsearch/index/store/Store.java
src/test/java/org/elasticsearch/index/store/StoreTest.java
Date math rounding currently works by rounding the date up or down based
on the scope of the rounding. For example, if you have the date
`2009-12-24||/d` it will round down to the inclusive lower end
`2009-12-24T00:00:00.000` and round up to the non-inclusive date
`2009-12-25T00:00:00.000`.
The range endpoint semantics work as follows:
* `gt` - round D down, and use > that value
* `gte` - round D down, and use >= that value
* `lt` - round D down, and use < that value
* `lte` - round D up, and use <= that value
There are 2 problems with these semantics:
* `lte` ends up including the upper value, which should be non-inclusive
* `gt` only excludes the beginning of the date, not the entire rounding scope
This change makes the range endpoint semantics symmetrical. First, it
changes the parser to round up and down using the first (same as before)
and last (1 ms less than before) values of the rounding scope. This
makes both rounded endpoints inclusive. The range endpoint semantics
are then as follows:
* `gt` - round D up, and use > that value
* `gte` - round D down, and use >= that value
* `lt` - round D down, and use < that value
* `lte` - round D up, and use <= that value
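For example, with `2009-12-24||/d` the rounding scope is the whole day, so the endpoints resolve to:
* `gt` - `> 2009-12-24T23:59:59.999`
* `gte` - `>= 2009-12-24T00:00:00.000`
* `lt` - `< 2009-12-24T00:00:00.000`
* `lte` - `<= 2009-12-24T23:59:59.999`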
closes#8424closes#8556
We speak of the term vectors of a document, where each field has an associated
stored term vector. Since by default we are requesting all the term vectors of
a document, the HTTP request endpoint should rather be called `_termvectors`
instead of `_termvector`. The usage of `_termvector` is now deprecated, as
well as the transport client call to termVector and prepareTermVector.
Closes#8484
make the "es090" postings format read-only, just to support old segments. There is a test version that subclasses it with write-capability for testing.
Closes#8571
In order to implement #8551 correctly without causing problems with relocating
shards, we need to be informed when an index is actually deleted. This commit adds
more callbacks to the listener and makes deleteIndex a dedicated method on IndicesService
Added a JUnit test that recreates the error and fixed FilterParser to default to using a MatchAllDocsFilter if the requested filter clause is left empty.
Also added a fix and test for the Filters (with an "s") aggregation.
Closes#8438
_all reports a conflict since #7377. However, it was not checked whether _all
was actually configured in the updated mapping. Therefore, whenever _all
was disabled, a mapping could not be updated unless _all was added back to the
updated mapping.
Also, always add the enabled setting to the mapping whenever enabled was set explicitly.
closes#8423closes#8426
This aggregation creates an anonymous fielddata instance that takes geo points
and turns them into a geo hash encoded as a long. A bug was introduced in 1.4
because of a fielddata refactoring: the fielddata instance tries to populate
an array with values without first making sure that it is large enough.
Close#8507
Since aggregators are only called on documents that match the query, they never
get called on deleted documents, so by specifying `null` as live docs, we very
likely remove a BitsFilteredDocIdSet layer.
Close#8540
Add the locale and timezone to the repro command line.
The carrot runner currently randomizes both locale and timezone, but
these are not set in the maven reproduce line. Since they aren't
even printed, we have no idea what locale/timezone the tests
actually ran with.
Percolator queries and index alias filters are parsed once and reused as long as they exist on a node. If they contain time based range filters with a `now` expression then the alias filters and percolator queries are going to be incorrect from the moment these are constructed (depending on the date rounding).
If a range filter or range query is constructed as part of adding a percolator query or an index alias filter, then these get wrapped in special query or filter wrappers that defer the resolution of `now` to the last possible moment, as opposed to during parse time. In the case of the range filter, a special resolvable Filter makes sure that `now` is resolved when the DocIdSet is pulled, and in the case of the range query `now` is resolved at query rewrite time. Both occur at the time the range filter or query is used, as opposed to when the query or filter is constructed during parse time.
Closes#8474Closes#8534
When auto_import_dangled was set to yes, dangling indices were deleted by mistake,
because a RemoveDanglingIndices runnable was added
for every dangling index, without considering the auto_import_dangled
setting.
Also make LogConfigurator#ALLOWED_SUFFIXES package private so that it can be used in LoggingConfigurationTests, now that it's in the same package as the class that it tests.
Add a few randomized aspects to LoggingConfigurationTests.
Make sure that files such as logging.yml.rpmnew or logging.yml.bak are not loaded as logging configuration.
Only files that start with the "logging." prefix and end with the ".yaml", ".yml", ".json" or ".properties" suffix get loaded.
Closes#7457
We need to register those data paths, otherwise we might miss paths that
need to get cleaned when using the local gateway etc., which can otherwise
cause imports of dangling indices.
Now each error is reported in the bulk response rather than causing the entire bulk to fail.
Added a JUnit test, but the use of TransportClient means the error is manifested differently than for a REST based request - instead of a NullPointerException, the whole bulk request failed with a RoutingMissingException. Changed TransportBulkAction to catch this exception and treat it the same as the existing logic for an ElasticsearchParseException - the individual bulk request items are flagged and reported individually rather than failing the whole bulk request.
Closes#8365
Test used `indices.recovery.concurrent_streams` when creating an index but this is a node setting. Moved it to the node settings and added similar settings to speed up concurrent recoveries.
Also fixed a misleading log message in ShardRecoveryHandler when logging a remote corruption
Today it's possible that the data directory for a single shard is used by more than one
IndexShard->Store instance. While one shard is already closed but has a concurrent recovery
running and a new shard is creating its engine, files can conflict and data can potentially
be lost. We also remove shard data without checking if there are still users of the files,
or if files are still open, which can cause pending writes / flushes or the delete operation
to fail. If the latter is the case the index might be treated as a dangling index and is brought
back to life at a later point in time.
This commit introduces a shard level lock that prevents modifications to the shard data
while it's still in use. Locks are created per shard and maintained in NodeEnvironment.java.
In contrast to most java concurrency primitives these locks are not reentrant.
This commit also adds infrastructure that checks if all shard locks are released after tests.
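A minimal sketch of the locking idea (the real code lives in NodeEnvironment and differs in detail; names here are illustrative):

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.Semaphore;

class ShardLockRegistrySketch {
    // one permit per shard; a Semaphore is deliberately non-reentrant, so a
    // second acquire from the same thread blocks just like any other thread
    private final ConcurrentMap<String, Semaphore> locks = new ConcurrentHashMap<>();

    Semaphore lock(String shardId) throws InterruptedException {
        Semaphore existing = locks.get(shardId);
        if (existing == null) {
            Semaphore created = new Semaphore(1);
            existing = locks.putIfAbsent(shardId, created);
            if (existing == null) {
                existing = created;
            }
        }
        existing.acquire(); // blocks while the shard data is still in use elsewhere
        return existing;    // callers release() once done with the shard data
    }
}
```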
Don't eagerly cache parent type filters in the bitset cache, or nested object fields that are leaves.
Also let parent/child queries not rely on FixedBitSetFilter, but rather on a regular Filter
Closes#8440
In addition to `_source`, the following variables are available through
the `ctx` map: `_index`, `_type`, `_id`, `_version`, `_routing`,
`_parent`, `_timestamp`, `_ttl`.
Some of these fields are more useful still within the context of an
Update By Query, see #1607, #2230, #2231.
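As an illustration (assuming the 1.x Java API; the index, type, id and field names are made up), a script that copies some of these variables into the source:

```java
import org.elasticsearch.action.update.UpdateRequest;

public class UpdateCtxSketch {
    UpdateRequest request = new UpdateRequest("myindex", "mytype", "1")
            // ctx._routing and ctx._version come from the ctx map described above
            .script("ctx._source.routing_copy = ctx._routing; ctx._source.seen_version = ctx._version");
}
```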
While this commit is primarily a fix for issue/8433, it adds more rigor to ShapeBuilder for parsing against the GeoJSON specification. Specifically, this adds LinearRing and LineString validity checks as defined in http://geojson.org/geojson-spec.html to ensure valid polygons are specified. The benefit of this fix is to provide a gate check at parse time to avoid any further processing if invalid GeoJSON is provided. More parse checks like these will be necessary going forward to ensure full compliance with the GeoJSON specification.
Closes#8433
Based on some test failures, this commit fixes two minor things
* Bind only so-called ephemeral ports to prevent trying to bind to ports that
elasticsearch is already running on
* Remove the @Network annotation as it was used in the wrong scope
Previous to this change, all features (_alias, _mapping, _settings, _warmer) were run regardless of which features were actually requested. This change fixes the request object to resolve this bug
The compression bug fixed in #7210 can still strike us since we are
running BWC tests against these versions. This commit disables compression
forcefully if the compatibility version is < 1.3.2 to prevent debugging
already known issues.
The query_string query has an option for analyzing wildcard/prefix (#787) by a best effort approach.
This adds `analyze_wildcard` option also to simple_query_string.
The default is set to `false` so the existing behavior of simple_query_string is unchanged.
This prevents too-difficult regular expressions from consuming
excessive RAM/CPU; the default max_determinized_states is 10,000 (same
as Lucene), but query_string and regexp query/filter can override it
per-request.
This also upgrades to a new Lucene 5.0.0 snapshot.
Closes#8386Closes#8357
DocIdSets.isFast(DocIdSet) has two issues:
- it works on the DocIdSet interface while some doc sets can generate either
slow or fast sets depending on their options (eg. whether an OrDocIdSet is
fast or not depends on the wrapped clauses).
- it only works because the result of this method is only taken into account
when a DocIdSet has non-null `bits()`.
This commit changes this method to work on top of a DocIdSetIterator and to use
a black-list rather than a white list: slow iterators should really be the
exception rather than the rule.
Close#8380
The rename(String, String) method doesn't allow this implementation to use a simple
concurrent map. There is a race during a rename operation where files are not fully
renamed but are already visible via #listAll(). This inconsistency can lead to problems
when opening commit points, since the pending_segments_N as well as segments_N files are visible
but not yet atomically renamed.
Yet, none of the methods that are synced are long running, such that adding synchronization
doesn't introduce bottlenecks here. The Directory#sync(...) method is not synchronized since
it doesn't change any mapping nor does it depend on the mapping.
Previously we didn't calculate these checksums even though we had a checksum
to compare against. Since we now also verify checksums for legacy files, #checkIntegrity
should also calculate the legacy checksums.
Closes#8407
When a lucene 4.8+ file is transferred, Store returns a VerifyingIndexOutput
that verifies both the CRC32 integrity and the length of the file.
However, for older files, problems can make it to the lucene level. This is not great
since older lucene files aren't especially strong as far as detecting issues here.
For example, if a network transfer is closed on the remote side, we might write a
truncated file... which old lucene formats may or may not detect.
The idea here is to verify old files with their legacy Adler32 checksum, plus the expected
length. If they don't have an Adler32 (segments_N, jurassic elasticsearch?, it's optional
as far as the protocol goes), then at least check the length.
We could improve it for segments_N, it's had an embedded CRC32 forever in lucene, but this
gets trickier. Long term, we should also try to improve tests around here, especially
backwards compat testing; we should test that detected corruptions are handled properly.
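For reference, the legacy checksum itself is just the JDK's Adler32 (a self-contained sketch, not the actual Store code):

```java
import java.util.zip.Adler32;

class LegacyChecksumSketch {
    // computes the Adler32 checksum over a file's bytes, as the pre-4.8
    // legacy verification would compare it
    static long adler32(byte[] data) {
        Adler32 checksum = new Adler32();
        checksum.update(data, 0, data.length);
        return checksum.getValue();
    }
}
```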
Closes#8399
Conflicts:
src/main/java/org/elasticsearch/index/store/Store.java
src/test/java/org/elasticsearch/index/store/StoreTest.java
Fixes an issue where only absolute bytes were taken into account when
kicking off an automatic reroute due to disk usage. Also randomized the
tests to use either an absolute value or a percentage so this is tested.
Also adds logging for each node over the high or low watermark every
time new cluster info usage is gathered (defaults to every 30
seconds).
Related to #8368Fixes#8367
Today we use busy waiting and sampling when we execute HealthRequests
on the master. This is tricky since we might sample a not yet fully applied
cluster state and make decisions based on the partial cluster state. This can
lead to ugly problems, since requests might be routed to nodes where shards are
already marked as relocated, but in the actual cluster state they are still started.
While this window is very small, it can lead to ugly test failures.
This commit moves the health request over to a listener pattern that gets the
actually applied cluster state.
Closes#8350
Today we use the File API for file deletion as well as recursive
directory deletions. This API returns a boolean indicating whether operations
were successful, while hiding the actual reason when they fail.
The Path API throws an actual exception that might provide better
insights and debug information.
Closes#8366
The tests were failing because there was a shard which didn't get any documents, while the tests assumed all shards had documents. This commit fixes that assumption
This has a lot of improvements in lucene, particularly around memory usage, merging, safety, compressed bitsets, etc.
On the elasticsearch side, a summary of the larger changes:
* API changes: the postings API became a "pull" rather than "push" API, the collector API became per-segment, etc.
* Packaging changes: add lucene-backwards-codecs.jar as a dependency.
* Improvements to boolean filtering: especially ensuring it will not be slow for SparseBitSet.
* Use the generic BitSet API in plumbing so that the concrete bitset type is an implementation detail.
* Use the generic BitDocIdSetFilter API for the dedicated bitset cache, so there is type safety.
* Changes to support atomic commits.
* Implement Accountable.getChildResources (detailed memory usage API) for fielddata, etc.
* Change the handling of IndexFormatTooOld/New, since they no longer extend CorruptIndexException.
Closes#8347.
Squashed commit of the following:
commit d90d53f5f21b876efc1e09cbd6d63c538a16cd89
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Nov 5 21:35:28 2014 +0100
Make default codec/postings/docvalues format constants
commit cb66c22c71cd304a36e7371b199a8c279908ae37
Merge: d4e2f6d ad4ff43
Author: Robert Muir <rmuir@apache.org>
Date: Wed Nov 5 11:41:13 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit d4e2f6dfe767a5128c9b9ae9e75036378de08f47
Merge: 4e5445c 4111d93
Author: Robert Muir <rmuir@apache.org>
Date: Wed Nov 5 06:26:32 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit 4e5445c775f580730eb01360244e9330c0dc3958
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 16:19:19 2014 -0500
FixedBitSet -> BitSet
commit 9887ea73e8b857eeda7f851ef3722ef580c92acf
Merge: 1bf8894 fc84666
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 15:26:25 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit 1bf8894430de3e566d0dc5623b0cc28b0d674ebb
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 15:22:51 2014 -0500
remove nocommit
commit a9c2a2259ff79c69bae7806b64e92d5f472c18c8
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 13:48:43 2014 -0500
turn jenkins red again
commit 067baaaa4d52fce772c81654dcdb5051ea79139f
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 13:18:21 2014 -0500
unzip from stream
commit 82b6fba33d362aca2313cc0ca495f28f5ebb9260
Merge: b2214bb 6523cd9
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 13:10:59 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit b2214bb093ec2f759003c488c3c403c8931db914
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 13:09:53 2014 -0500
go back to my URL until we can figure out what is up with jenkins
commit e7d614172240175a51f580aeaefb6460d21cede9
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 10:52:54 2014 -0500
try this jenkins
commit 337a3c7704efa7c9809bf373152d711ee55f876c
Author: Simon Willnauer <simonw@apache.org>
Date: Tue Nov 4 16:17:49 2014 +0100
Rename temp-files under lock to prevent metadata reads while renaming
commit 77d5ba80d0a76efa549dd753b9f114b2f2d2d29c
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 10:07:11 2014 -0500
continue to treat too-old/too-new as corruption for now
commit 98d0fd2f4851bc50e505a94ca592a694d502c51c
Author: Robert Muir <rmuir@apache.org>
Date: Tue Nov 4 09:24:21 2014 -0500
fix last nocommit
commit 643fceed66c8caf22b97fc489d67b4a2a90a1a1c
Author: Simon Willnauer <simonw@apache.org>
Date: Tue Nov 4 14:46:17 2014 +0100
remove NoSuchDirectoryException
commit 2e43c4feba05cfaf451df70f946c0930cbcc4557
Merge: 93826e4 8163107
Author: Simon Willnauer <simonw@apache.org>
Date: Tue Nov 4 14:38:00 2014 +0100
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit 93826e4d56a6a97c2074669014af77ff519bde63
Merge: 7f10129 44e24d3
Author: Simon Willnauer <simonw@apache.org>
Date: Tue Nov 4 12:54:27 2014 +0100
Merge branch 'master' into enhancement/lucene_5_0_upgrade
Conflicts:
src/main/java/org/elasticsearch/index/store/DistributorDirectory.java
src/main/java/org/elasticsearch/index/store/Store.java
src/main/java/org/elasticsearch/indices/recovery/RecoveryStatus.java
src/test/java/org/elasticsearch/index/store/DistributorDirectoryTest.java
src/test/java/org/elasticsearch/index/store/StoreTest.java
src/test/java/org/elasticsearch/indices/recovery/RecoveryStatusTests.java
commit 7f10129364623620575c109df725cf54488b3abb
Author: Adrien Grand <jpountz@gmail.com>
Date: Tue Nov 4 11:32:24 2014 +0100
Fix TopHitsAggregator to not ignore the top-level/leaf collector split.
commit 042fadc8603b997bdfdc45ca44fec70dc86774a6
Author: Adrien Grand <jpountz@gmail.com>
Date: Tue Nov 4 11:31:20 2014 +0100
Remove MatchDocIdSet in favor of DocValuesDocIdSet.
commit 7d877581ff5db585a674c95ac391ac78a0282826
Author: Adrien Grand <jpountz@gmail.com>
Date: Tue Nov 4 11:10:08 2014 +0100
Make the and filter use the cost API.
Lucene 5 ensured that cost() can safely be used, and this will have the benefit
that the order in which filters are specified is not important anymore (only
for slow random-access filters in practice).
commit 78f1718aa2cd82184db7c3a8393e6215f43eb4a8
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 23:55:17 2014 -0500
fix previous eclipse import braindamage
commit 186c40e9258ce32f22a9a714ab442a310b6376e0
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 22:32:34 2014 -0500
allow child queries to exhaust iterators again
commit b0b1271305e1b6d0c4c4da51a3c54df1aa5c0605
Author: Ryan Ernst <ryan@iernst.net>
Date: Mon Nov 3 14:50:44 2014 -0800
Fix nocommit for mapping output. index_options will not be printed if
the field is not indexed.
commit ba223eb85e399c9620a347a983e29bf703953e7a
Author: Ryan Ernst <ryan@iernst.net>
Date: Mon Nov 3 14:07:26 2014 -0800
Remove no commit for chinese analyzer provider. We should have a
separate issue to address not using this provider on new indexes.
commit ca554b03c4471797682b2fb724f25205cf040c4a
Author: Ryan Ernst <ryan@iernst.net>
Date: Mon Nov 3 13:41:59 2014 -0800
Fix stop tests
commit de67c4653ec47dee9c671390536110749d2bb05f
Author: Ryan Ernst <ryan@iernst.net>
Date: Mon Nov 3 12:51:17 2014 -0800
Remove analysis nocommits, switching over to Lucene43*Filters for
backcompat
commit 50cae9bec72c25c33a1ab8a8931bccb3355171e2
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 15:32:25 2014 -0500
add ram accounting and TODO lazy-loading (its no worse than master, can be a followup improvement) for suggesters
commit 7a7f0122f138684b312d0f0b03dc2a9c16c15f9c
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 15:11:26 2014 -0500
bump lucene version
commit cd0cae5c35e7a9e049f49ae45431f658fb86676b
Merge: 446bc09 3c72073
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 14:49:05 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit 446bc09b4e8bf4602d3c252b53ddaa0da65cce2f
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 14:46:30 2014 -0500
remove hack
commit a19d85a968d82e6d00292b49630ef6ff2dbf2f32
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 12:53:11 2014 -0500
dont create exceptions with circular references on corruption (will open a PR for this)
commit 0beefb9e821d97c37e90ec556d81ac7b00369b8a
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 11:47:14 2014 -0500
temporarily add craptastic detector for this horrible bug
commit e9f2d298bff75f3d1591f8622441e459c3ce7ac3
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 10:56:01 2014 -0500
add nocommit
commit e97f1d50a91a7129650b8effc7a9ecf74ca0569a
Merge: c57a3c8 f1f50ac
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 10:12:12 2014 -0500
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit c57a3c8341ed61dca62eaf77fad6b8b48aeb6940
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 10:11:46 2014 -0500
fix nocommit
commit dd0e77e4ec07c7011ab5f6b60b2ead33dc2333d2
Author: Robert Muir <rmuir@apache.org>
Date: Mon Nov 3 09:54:09 2014 -0500
nocommit -> TODO, this is in much more places in the codebase, bigger issue
commit 3cc3bf56d72d642059f8fe220d6f2fed608363e9
Author: Ryan Ernst <ryan@iernst.net>
Date: Sat Nov 1 23:59:17 2014 -0700
Remove nocommit and awaitsfix for edge ngram filter test.
commit 89f115245155511c0fbc0d5ee62e63141c3700c1
Author: Ryan Ernst <ryan@iernst.net>
Date: Sat Nov 1 23:57:44 2014 -0700
Fix EdgeNGramTokenFilter logic for version <= 4.3, and fixed instanceof
checks in corresponding tests to correctly check for reverse filter when
applicable.
commit 112df869cd199e36aab0e1a7a288bb1fdb2ebf1c
Author: Robert Muir <rmuir@apache.org>
Date: Sun Nov 2 00:08:30 2014 -0400
execute geo disjoint query/filter as intersects
commit e5061273cc685f1252e9a3a9ae4877ec9bce7752
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 22:58:59 2014 -0400
remove chinese analyzer from docs
commit ea1af11b8978fcc551f198e24fe21d52806993ef
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 22:29:00 2014 -0400
fix ram accounting bug
commit 53c0a42c6aa81aa6bf81d3aa77b95efd513e0f81
Merge: e3bcd3c 6011a18
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 22:16:29 2014 -0400
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit e3bcd3cc07a4957e12c7b3affc462c31290a9186
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 22:15:01 2014 -0400
fix url-email back compat (thanks ryan)
commit 91d6b096a96c357755abee167098607223be1aad
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 22:11:26 2014 -0400
bump lucene version
commit d2bb9568df72b37ec7050d25940160b8517394bc
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 20:33:07 2014 -0400
remove nocommit
commit 1d049c471e19e5c457262c7399c5bad9e023b2e3
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 20:28:58 2014 -0400
fix eclipse to group org/com imports together: without this, its madness
commit 09d8c1585ee99b6e63be032732c04ef6fed84ed2
Author: Robert Muir <rmuir@apache.org>
Date: Sat Nov 1 14:27:41 2014 -0400
remove nocommit, if you dont liek it, print assembly and tell me how it can be better
commit 8a6a294313fdf33b50c7126ec20c07867ecd637c
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 20:01:55 2014 +0100
Remove deprecated usage of DocIdSets.newDocIDSet.
commit 601bee60543610558403298124a84b1b3bbd1045
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 14:13:18 2014 -0400
maybe one of these zillions of annotations will stop thread leaks
commit 9d3f69abc7267c5e455aefa26db95cb554b02d62
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 14:05:39 2014 -0400
fix some analysis nocommits
commit 312e3a29c77214b8142d21c33a6b2c2b151acf9a
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 18:28:45 2014 +0100
Remove XConstantScoreQuery/XFilteredQuery/ApplyAcceptedDocsFilter.
commit 5a0cb9f8e167215df7f1b1fad11eec6e6c74940f
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 17:06:45 2014 +0100
Fix misleading documentation of DocIdSets.toCacheable.
commit 8b4ef2b5b476fff4c79c0c2a0e4769ead26cf82b
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 17:05:59 2014 +0100
Fix CustomRandomAccessFilterStrategy to override the right method.
commit d7a9a407a615987cfffc651f724fbd8795c9c671
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 16:21:35 2014 +0100
Better handle the special case when there is a single SHOULD clause.
commit 648ad389f07e92dfc451f345549c9841ba5e4c9a
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 15:53:38 2014 +0100
Cut over XBooleanFilter to BitDocIdSet.Builder.
The idea is similar to what happened to Lucene's BooleanFilter.
Yet XBooleanFilter is a bit more sophisticated and I had to slightly
change the way it is implemented in order to make it work. The main difference
with before is that slow filters are now applied lazily, so eg. if you have 3
MUST clauses, two with a fast iterator and the third with a slow iterator, the
previous implementation used to apply the fast iterators first and then only
check the slow filter for bits which were set in the bit set. Now we are
computing a bit set based on the fast must clauses and then basically returning
a BitsFilteredDocIdSet.wrap(bitset, slowClause).
Other than that, BooleanFilter still uses the bitset optimizations when or-ing
and and-ind filters.
Another improvement is that BooleanFilter is now aware of the cost API.
commit b2dad312b4bc9f931dc3a25415dd81c0d9deee08
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 10:18:53 2014 -0400
clear nocommit
commit 4851d2091e744294336dfade33906c75fbe695cd
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 15:15:16 2014 +0100
cut over to RoaringDocIdSet
commit ca6aec24a901073e65ce4dd6b70964fd3612409e
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:57:30 2014 +0100
make nocommit more explicit
commit d0742ee2cb7a6c48b0bbb31580b7fbcebdb6ec40
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 09:55:24 2014 -0400
fix standardtokenizer nocommit
commit 7d6faccafff22a86af62af0384838391d46695ca
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:54:08 2014 +0100
fix compilation
commit a038a405c1ff6458ad294e6b5bc469e622f699d0
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:53:43 2014 +0100
fix compilation
commit 30c9e307b1f5d80e2deca3392c0298682241207f
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:52:35 2014 +0100
fix compilation
commit e5139bc5a0a9abd2bdc6ba0dfbcb7e3c2e7b8481
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 09:52:16 2014 -0400
clear nocommit here
commit 85dd2cedf7a7994bed871ac421cfda06aaf5c0a5
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:46:17 2014 +0100
fix CompletionPostingsFormatTest
commit c0f3781f616c9b0ee3b5c4d0998810f595868649
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 09:38:00 2014 -0400
add tests for these analyzers
commit 51f9999b4ad079c283ae762c862fd0e22d00445f
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 14:10:26 2014 +0100
remove nocommit - this is not an issue
commit fd1388fa03e622b0738601c8aeb2dbf7949a6dd2
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Fri Oct 31 14:07:01 2014 +0100
Remove redundant null check
commit 3d6dd51b0927337ba941a235446b22e8cd500dc3
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Fri Oct 31 14:01:37 2014 +0100
Removed the work around to prevent p/c error when invoking #iterator() twice, because the custom query filter wrapper now doesn't transform the result to a cache doc id set any more.
I think the transforming to a cachable doc id set in CustomQueryWrappingFilter isn't needed at all, because we use the DocIdSet only once and because of that is just slowed things down.
commit 821832a537e00cd1216064b379df3e01d2911d3a
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 13:54:33 2014 +0100
one more nocommit
commit 77eb9ea4c4ea50afb2680c29682ddcb3851a9d4f
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Fri Oct 31 13:52:29 2014 +0100
Remove cast
commit a400573c034ed602221f801b20a58a9186a06eae
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 13:49:24 2014 +0100
fix stop filter
commit 51746087cf8ec34c4d20aa05ba8dbff7b3b43eec
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 13:21:36 2014 +0100
fix changed semantics of FBS.nextSetBit to check for NO_MORE_DOCS
commit 8d0a4e2511310f1293860823fe3ba80ac771bbe3
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 08:13:44 2014 -0400
do the bogus cast differently
commit 46a5cc5732dea096c0c80ae5ce42911c9c51e44e
Author: Simon Willnauer <simonw@apache.org>
Date: Fri Oct 31 13:00:16 2014 +0100
I hate it but P/C now passes
commit 580c0c2f82bbeacf217e594f22312b11d1bdb839
Merge: a9d3c00 1645434
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 31 06:54:31 2014 -0400
fix nocommit/classcast
commit a9d3c004d62fe04989f49a897e6ff84973c06eb9
Author: Adrien Grand <jpountz@gmail.com>
Date: Fri Oct 31 08:49:31 2014 +0100
Update TODO.
commit aa75af0b407792aeef32017f03a6f442ed970baa
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 19:18:25 2014 -0400
clear obselete nocommits from lucene bump
commit d438534cf41fcbe2d88070e2f27c994625e082c2
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 18:53:20 2014 -0400
throw classcastexception when ES abuses regular filtercache for nested docs
commit 2c751f3a8feda43ec127c34769b069de21f3d16f
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 18:31:34 2014 -0400
bump lucene revision, fix tests
commit d6ef7f6304ae262bf6228a7d661b2a452df332be
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 22:37:58 2014 +0100
fix merge problems
commit de9d361f88a9ce6bb3fba85285de41f223c95767
Merge: 41f6aab f6b37a3
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 22:28:59 2014 +0100
Merge branch 'master' into enhancement/lucene_5_0_upgrade
Conflicts:
pom.xml
src/main/java/org/elasticsearch/Version.java
src/main/java/org/elasticsearch/gateway/local/state/meta/MetaDataStateFormat.java
commit 41f6aab388aa80c40b08a2facab2617576203a0d
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 17:48:46 2014 +0100
fix potiential NPE
commit c4428b12e1ae838b91e847df8b4a8be7f49e10f4
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 17:38:46 2014 +0100
don't advance iterator in a match(doc) method
commit 28ab948e99e3ea4497c9b1e468384806ba7e1790
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 17:34:58 2014 +0100
don't advance iterator in a match(doc) method
commit eb0f33f6634fadfcf4b2bf7327400e568f0427bb
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 16:55:54 2014 +0100
fix GeoUtilsTest
commit 7f711fe3eaf73b6c2268cf42d5a41132a61ad831
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 16:43:16 2014 +0100
Use a dedicated default index option if field type is not indexed by default
commit 78e3f37ab779e3e1b25b45a742cc86ab5f975149
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 10:56:14 2014 -0400
disable this test with AwaitsFix to reduce noise
commit 9a590f563c8e03a99ecf0505c92d12d7ab20d11d
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 09:38:49 2014 +0100
fix lucene version
commit abe3ca1d8bb6b5101b545198f59aec44bacfa741
Author: Simon Willnauer <simonw@apache.org>
Date: Thu Oct 30 09:35:05 2014 +0100
fix AnalyzingCompletionLookupProvider to wrok with new codec API
commit 464293b245852d60bde050c6d3feb5907dcfbf5f
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 00:26:00 2014 -0400
don't try to write stuff to tests class directory
commit 031cc6c19f4fe4423a034b515f77e5a0e282a124
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 00:12:36 2014 -0400
AwaitsFix these known issues to reduce noise
commit 4600d51891e35847f2d344247d6f915a0605c0d1
Author: Robert Muir <rmuir@apache.org>
Date: Thu Oct 30 00:06:53 2014 -0400
openbitset lives on
commit 8492bae056249e2555d24acd55f1046b66a667c4
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 23:42:54 2014 -0400
fixes for filter tests
commit 31f24ce4efeda31f97eafdb122346c7047a53bf2
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 23:12:38 2014 -0400
don't use fieldcache
commit 8480789942fdff14a6d2b2cd8134502fe62f20c8
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 23:04:29 2014 -0400
ancient index no longer supported
commit 02e78dc7ebdd827533009f542582e8db44309c57
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 23:37:02 2014 +0100
fix more tests
commit ff746c6df23c50b3f3ec24922413b962c8983080
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 23:08:19 2014 +0100
fix all mapper
commit e4fb84b517107b25cb064c66f83c9aa814a311b2
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 22:55:54 2014 +0100
fix distributor tests and cut over to FileStore API
commit 20c850e2cfe3210cd1fb9e232afed8d4ac045857
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 22:42:18 2014 +0100
use DOCS_ONLY if index=true and current options == null
commit 44169c108418413cfe51f5ce23ab82047463e4c2
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 22:33:36 2014 +0100
Fix index=yes|no settings in mappers
commit a3c5f77987461a18121156ed345d42ded301c566
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 21:51:41 2014 +0100
fix several field mappers conversion from setIndexed to indexOptions
commit df84d736908e88a031d710f98e222be68ae96af1
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 21:33:35 2014 +0100
fix SourceFieldMapper to be not indexed
commit b2bf01d12a8271a31fb2df601162d0e89924c8f5
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 21:23:08 2014 +0100
Cut over to .liv files in store and corruption tests
commit 619004df436f9ef05d24bef1b6a7f084c6b0ad75
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 17:05:52 2014 +0100
fix more tests
commit b7ed653a8b464de446e00456bce0a89e47627c38
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 16:19:08 2014 +0100
[STORE] Add dedicated method to write temporary files
Recovery writes temporary files which might not end up in the
right distributor directories today. This commit adds a dedicated
API that allows specifying the target file name in order to create the
tempoary file in the correct directory.
commit 7d574659f6ae04adc2b857146ad0d8d56ca66f12
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 10:28:49 2014 -0400
add some leniency to temporary bogus method
commit f97022ea7c2259f7a5cf97d924c59ed75ab65b32
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 10:24:17 2014 -0400
fix MultiCollector bug
commit b760533128c2b4eb10ad76e9689ef714293dd819
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:56:08 2014 +0100
CheckIndex is now closeable we need to close it
commit 9dae9fb6d63546a6c2427be2a2d5c8358f5b1934
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:45:11 2014 +0100
s/Lucene51/Lucene50
commit 7aea9b86856a8c1b06a08e7c312ede1168af1287
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:42:30 2014 +0100
fix BloomFilterPostingsFormat
commit 16fea6fe842e88665d59cc091e8224e8dc6ce08c
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:41:16 2014 +0100
fix some codec format issues
commit 3d77aa97dd2c4012b63befef3f2ba2525965e8a6
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:30:43 2014 +0100
fix CodecTests
commit 6ef823b1fde25657438ace1aabd9d552d6ae215e
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:26:47 2014 +0100
make it compile
commit 9991eee1fe99435118d4dd42b297ffc83fce5ec5
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 09:12:43 2014 -0400
add an ugly hack for TopHitsAggregator for now
commit 03e768a01fcae6b1f4cb50bcceec7d42977ac3e6
Author: Simon Willnauer <simonw@apache.org>
Date: Wed Oct 29 14:01:02 2014 +0100
cut over ES090PostingsFormat
commit 463d281faadb794fdde3b469326bdaada25af048
Merge: 0f8740a 8eac79c
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 08:30:36 2014 -0400
Merge branch 'master' into enhancement/lucene_5_0_upgrade
commit 0f8740a782455a63524a5a82169f6bbbfc613518
Author: Robert Muir <rmuir@apache.org>
Date: Wed Oct 29 01:00:15 2014 -0400
fix/hack remaining filter and analysis issues
commit df534488569da13b31d66e581456dfd4b55156b9
Author: Robert Muir <rmuir@apache.org>
Date: Tue Oct 28 23:11:47 2014 -0400
fix ngrams / openbitset usage
commit 11f5dc3b9887f4da80a0fa1818e1350b30599329
Author: Robert Muir <rmuir@apache.org>
Date: Tue Oct 28 22:42:44 2014 -0400
hack over sort comparators
commit 4ebdc754350f512596f6a02770d223e9f5f7975a
Author: Robert Muir <rmuir@apache.org>
Date: Tue Oct 28 21:27:07 2014 -0400
compiler errors < 100
commit 2d60c9e29de48ccb0347dd87f7201f47b67b83a0
Author: Robert Muir <rmuir@apache.org>
Date: Tue Oct 28 03:13:08 2014 -0400
clear some nocommits around ram usage
commit aaf47fe6c0aabcfb2581dd456fc50edf871da758
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 12:27:34 2014 -0400
migrate fieldinfo handling
commit ef6ed6d15d8def71cd880d97249678136cd29fe3
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 12:07:13 2014 -0400
more simple fixes
commit f475e1048ae697dd9da5bd9da445102b0b7bc5b3
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 11:58:21 2014 -0400
more fielddata ram accounting fixes
commit 16b4239eaa9b4262df258257df4f31d39f28a3a2
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 16:47:32 2014 +0100
add missing file
commit 5b542fa2a6da81e36a0c35b8e891a1d8bc58f663
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 16:43:29 2014 +0100
cut over completion posting formats - still some nocommits
commit ecdea49404c4ec4e1b78fb54575825f21b4e096e
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 11:21:09 2014 -0400
fielddata accountable fixes
commit d43da265718917e20c8264abd43342069198fe9c
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 16:19:53 2014 +0100
cut over BloomFilterPostings to new API
commit 29b192ba621c14820175775d01242162b88bd364
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 10:22:51 2014 -0400
fix more analyzers
commit 74b4a0c5283e323a7d02490df469497c722780d2
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 09:54:25 2014 -0400
fix tests
commit 554084ccb4779dd6b1c65fa7212ad1f64f3a6968
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 14:51:48 2014 +0100
maintain supressed exceptions on CorruptIndexException
commit cf882d9112c5e8ef1e9f2b0f800f7aa59001a4f2
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 14:47:17 2014 +0100
commitOnClose=false
commit ebb2a9189ab2f459b7c6c9985be610fd90dfe410
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 14:46:06 2014 +0100
cut over indexwriter closeing in InternalEngine
commit cd21b3d4706f0b562bd37792d077d60832aff65f
Author: Simon Willnauer <simonw@apache.org>
Date: Mon Oct 27 14:38:10 2014 +0100
fix constant
commit f93f900c4a1c90af3a21a4af5735a7536423fe28
Author: Robert Muir <rmuir@apache.org>
Date: Mon Oct 27 09:50:49 2014 -0400
fix test
commit a9a752940b1ab4699a6a08ba8b34afca82b843fe
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Mon Oct 27 09:26:18 2014 +0100
Be explicit about the index options
commit d9ee815babd030fa2ceaec9f467c105ee755bf6b
Author: Simon Willnauer <simonw@apache.org>
Date: Sun Oct 26 20:03:44 2014 +0100
cut over store and directory
commit b3f5c8e39039dd8f5caac0c4dd1fc3b1116e64ca
Author: Robert Muir <rmuir@apache.org>
Date: Sun Oct 26 13:08:39 2014 -0400
more test fixes
commit 8842f2684e3606aae0860c27f7a4c53e273d47fb
Author: Robert Muir <rmuir@apache.org>
Date: Sun Oct 26 12:14:52 2014 -0400
tests manual labor
commit c43de5aec337919a3fdc3638406dff17fc80bc98
Author: Robert Muir <rmuir@apache.org>
Date: Sun Oct 26 11:04:13 2014 -0400
BytesRef -> BytesRefBuilder
commit 020c0d087a2f37566a1db390b0e044ebab030138
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Sun Oct 26 15:53:37 2014 +0100
Moved over to BitSetFilter
commit 48dd1b909e6c52cef733961c9ecebfe4f67109fe
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Sun Oct 26 15:53:11 2014 +0100
Left over Collector api change in ScanContext
commit 6ec248ef63f262bcda400181b838fd9244752625
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Sun Oct 26 15:47:40 2014 +0100
Moved indexed() over to indexOptions != null or indexOptions == null
commit 9937aebfd8546ae4bb652cd976b3b43ac5ab7a63
Author: Martijn van Groningen <martijn.v.groningen@gmail.com>
Date: Sun Oct 26 13:26:31 2014 +0100
Fixed many compile errors. Mainly around the breaking Collector api change in 5.0.
commit fec32c4abc0e3309cf34260c8816305a6f820c9e
Author: Robert Muir <rmuir@apache.org>
Date: Sat Oct 25 11:22:17 2014 -0400
more easy fixes
commit dab22531d801800d17a65dc7c9464148ce8ebffd
Author: Robert Muir <rmuir@apache.org>
Date: Sat Oct 25 09:33:41 2014 -0400
more progress
commit 414767e9a955010076b0497cc4f6d0c1850b48d3
Author: Robert Muir <rmuir@apache.org>
Date: Sat Oct 25 06:33:17 2014 -0400
more progress
commit ad9d969fddf139a8830254d3eb36a908ba87cc12
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 24 14:28:01 2014 -0400
current state of fun
commit 464475eecb0be15d7d084135ed16051f76a7e521
Author: Robert Muir <rmuir@apache.org>
Date: Fri Oct 24 11:42:41 2014 -0400
bump to 5.0 snapshot
Currently MetaDataStateFormat loads the first available state file that has
the latest version. In case several files are available and some of them use
the new format while other ones use the legacy format, it should also prefer
the new format. This is typically useful when we upgrade the metadata when
recovering from the gateway: we might write the upgraded state with the new
format while the previous state used the legacy format, so we end up with
two files having the same version but using different formats.
Close#8343
This will help the exists/missing filters behave as expected in presence of
empty strings, as well as when using a default analyzer that would generate
tokens for an empty string (uncommon).
Close#8198
The transport client created within ExternalTestCluster needs a name that follows our naming convention otherwise the thread leak filter barfs when running tests against an external cluster. Used "transport_client_external_{n}" where n gets incremented every time a new external cluster gets created. Updated thread leak filters rules to ignore threads created by such transport client.
We currently use the djb2 hash function in order to compute the shard a
document should go to. Unfortunately this hash function is not very
sophisticated and you can sometimes hit adversarial cases, such as numeric ids
on 33 shards.
Murmur3 generates hashes with a better distribution, which should avoid the
adversarial cases.
Here are some examples of how 100000 incremental ids are distributed to shards
using either djb2 or murmur3.
5 shards:
Murmur3: [19933, 19964, 19940, 20030, 20133]
DJB: [20000, 20000, 20000, 20000, 20000]
3 shards:
Murmur3: [33185, 33347, 33468]
DJB: [30100, 30000, 39900]
33 shards:
Murmur3: [2999, 3096, 2930, 2986, 3070, 3093, 3023, 3052, 3112, 2940, 3036, 2985, 3031, 3048, 3127, 2961, 2901, 3105, 3041, 3130, 3013, 3035, 3031, 3019, 3008, 3022, 3111, 3086, 3016, 2996, 3075, 2945, 2977]
DJB: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 900, 900, 900, 900, 1000, 1000, 10000, 10000, 10000, 10000, 9100, 9100, 9100, 9100, 9000, 9000, 0, 0, 0, 0, 0, 0]
Even if djb2 looks ideal in some cases (5 shards), the fact that the
distribution of its hashes has some patterns can raise issues with some shard
counts (eg. 3, or even worse 33).
Some tests have been modified because they relied on implementation details of
the routing hash function.
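A rough sketch of the routing idea, using Lucene's murmur3 helper as a stand-in for the hash function that gets wired in (seed and byte handling are illustrative, not the exact production values):

```java
import java.nio.charset.StandardCharsets;
import org.apache.lucene.util.StringHelper;

public class RoutingHashSketch {
    static int shard(String id, int numShards) {
        byte[] bytes = id.getBytes(StandardCharsets.UTF_8);
        int hash = StringHelper.murmurhash3_x86_32(bytes, 0, bytes.length, 0);
        // force a non-negative remainder so the shard index is always valid
        return Math.abs(hash % numShards);
    }
}
```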
Close#7954
ClusterDiscoveryConfiguration is part of the test infra and should get exported as part of the test jar. This is achieved by moving the class to org.elasticsearch.test.discovery
Closes#8337
NettyTransport*Tests were previously in org.elasticsearch.test.transport and ended up being exported with the test jar. org.elasticsearch.transport.netty should be a better place for them, together with existing tests.
The test verifies the correct behavior of a listener, but we only call the listener after publishing a new cluster state. Only checking on the publishing of the state introduces a race condition.
This commit removes all special file handling from DistributorDirectory
that assigned certain files to the primary directory. This special handling
was added to ensure that files that are written more than once are essentially
overwritten. Yet this implementation is consistent all the time and doesn't need
this special handling for files that are written through this directory. Writes
to the underlying directory not going through the distributor directory are not
and have never been supported.
Note: this commit also fixes the problem of adding directories to the distributor
during restart where the primary can suddenly change and file mappings are by-passed.
Closes#8276
This commit adds the ability to associate a bit of state with each
individual aggregation.
The aggregation response can be hard to stitch back together without
having a reference to the aggregation request. In many cases this is not
available; many json serializer frameworks cache types globally or have a
static deserialisation override mechanism. In these cases making the
original request available, if at all possible, would be a hack.
The old facets returned `_type`, which was just enough metadata to know
what the originating facet type in the request was.
This PR takes `_type` one step further by introducing any arbitrary meta
data. This could be further (ab)used, for instance, by
generic/automated aggregations that include UI state (color information,
thresholds, user input states, etc) per aggregation.
The discovery.zen.minimum_master_nodes setting can be updated dynamically. Setting it to a value higher than the current number of master nodes will cause the current master to step down. This is dangerous, because if done by mistake (a typo) there is no way to restore the setting (doing so requires an active master).
Closes#8321
An early version of #7966 had the ability to choose a bwc version
automatically, but this was removed before the change was committed.
However, the change was not removed from the ongoing work in #7922
and it made it in unknowingly.
This adds HTTP pipelining support to netty. Previously pipelining was not
supported due to the asynchronous nature of elasticsearch: the first response
that was ready was returned first, regardless of the order of the
corresponding requests.
The solution to this problem is to add a handler to the netty pipeline
that maintains an ordered list and thus orders the responses before
returning them to the client. This means we will always have some state
on the server side, and it also requires some memory in order to keep the
responses there.
Pipelining is enabled by default, but can be configured by setting the
http.pipelining property to true|false. In addition, the maximum size of
the event queue can be configured.
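The core ordering trick, as a standalone sketch (sequence numbering and types are illustrative; the actual netty handler works on channel events):

```java
import java.util.HashMap;
import java.util.Map;

class ResponseOrderingSketch {
    private final Map<Integer, String> pending = new HashMap<>();
    private int nextToWrite = 0;

    // responses may complete out of order; hold each one until all
    // responses for earlier requests have been written
    synchronized void onResponse(int sequence, String response) {
        pending.put(sequence, response);
        String next;
        while ((next = pending.remove(nextToWrite)) != null) {
            write(next);
            nextToWrite++;
        }
    }

    void write(String response) {
        System.out.println(response); // stand-in for writing to the channel
    }
}
```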
The initial netty handler is copied from this repo:
https://github.com/typesafehub/netty-http-pipelining
Closes#2665
This adds a Listener interface to the ClusterInfoService; this is used
by the DiskThresholdDecider, which adds a listener to check for nodes
passing the high watermark. If a node is past the high watermark an
empty reroute is issued so shards can be reallocated if desired.
A reroute will only be issued once every
`cluster.routing.allocation.disk.reroute_interval`, which is "60s" by
default.
Refactors InternalClusterInfoService to delegate the nodes stats and
indices stats gathering into separate methods so they can be overridden
by extending classes. Each stat gathering method returns a
CountDownLatch that can be used to wait until processing for that part
is successful before calling the listeners.
Fixes#8146
This change corrects the location information gathered by the loggers so that when printing class name, method name, and line numbers in the log pattern, the information from the class calling the logger is used rather than a location within the logger itself.
A reset method has also been added to the LogConfigurator class which allows the logging configuration to be reset. This is needed because if the LoggingConfigurationTests and Log4jESLoggerTests are run in the same JVM the second one to run needs to be able to override the log configuration set by the first
Closes#5130, #8052
The MLT field query is simply replaced by an MLT query set to a specific field.
To simplify code maintenance we should deprecate it in 1.4 and remove it in
2.0.
Closes#8238
Today any call to the current randomized context modifies the random
sequence, such that cluster initialization is context dependent. If, due
to an error for instance, a static util method like `randomLong` is used
inside the TestCluster instead of the provided Random instance, all
reproducibility guarantees are gone. This commit adds a safe mechanism
to initialize these clusters even if a static helper is used. All non
test scope clusters are now initialized in a private randomized context.
One can set the headers sent with requests by the clients via the `request.headers` setting. This commit enables overriding any such set headers directly on the requests.
Fixes a bug where alias creation would allow `null` for index name, which thereby
applied the alias to _all_ indices. This patch makes the validator throw an
exception if the index is null.
```bash
POST /_aliases
{
  "actions": [
    {
      "add": {
        "alias": "empty-alias",
        "index": null
      }
    }
  ]
}
```
```json
{
  "error": "ActionRequestValidationException[Validation Failed: 1: Alias action [add]: [index] may not be null;]",
  "status": 400
}
```
The reason this bug wasn't caught by the existing tests is that
the old test for nullness only validated against a cluster which had
zero indices. The null index is translated into "_all", and since
there are no indices, this fails because the index doesn't exist.
So the test passes.
However, as soon as you add an index, "_all" resolves and you get the
situation described in the original bug report: null index is
accepted by the alias, resolves to "_all" and gets applied to everything.
The REST tests, on the other hand, explicitly tested this bug as a real feature
and therefore passed. The REST tests were modified to change this behavior.
Fixes#7863
The test validates that minimum master node is honored after shutting down nodes. When nodes are restarted we may go through a couple of master elections: a master is elected and then discovers that some of the old nodes left before all new nodes have joined. Processing that node failure, the master de-elects itself, which is fine because we will start a new master election. The test, however, runs a clusterHealth call with a wait-for-events flag, which is implemented using a cluster state update task that fails when the master steps down. The longer-term fix requires a rewrite of the cluster health API code, but a quick fix is to not check for events (not needed for this test).
This commit adds a new field to the response of the terms aggregation called
`sum_other_doc_count` which is equal to the sum of the doc counts of the buckets
that did not make it into the list of top buckets. It is typically useful to have
a sector called e.g. `other` when using terms aggregations to build pie charts.
Example query and response:
```json
GET test/_search?search_type=count
{
  "aggs": {
    "colors": {
      "terms": {
        "field": "color",
        "size": 3
      }
    }
  }
}
```
```json
{
  [...],
  "aggregations": {
    "colors": {
      "doc_count_error_upper_bound": 0,
      "sum_other_doc_count": 4,
      "buckets": [
        {
          "key": "blue",
          "doc_count": 65
        },
        {
          "key": "red",
          "doc_count": 14
        },
        {
          "key": "brown",
          "doc_count": 3
        }
      ]
    }
  }
}
```
Close#8213
- Added parseBooleanExact in Booleans, which throws an exception in case of
a parse failure
- Used parseBooleanExact in statics to make it consistent
- Applied code review fixes
- Added a unit test
- Changed the exception type from a parse exception to ElasticsearchIllegalArgumentException (see the sketch below)
- Used isExplicit*
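A sketch of what the strict parsing might look like, given the points above (the accepted literals are an assumption):
```java
import org.elasticsearch.ElasticsearchIllegalArgumentException;

// hedged sketch: parse failures throw instead of falling back to a default
public static boolean parseBooleanExact(String value) {
    if ("true".equals(value)) {
        return true;
    }
    if ("false".equals(value)) {
        return false;
    }
    throw new ElasticsearchIllegalArgumentException(
            "Failed to parse value [" + value + "] as only [true] or [false] are allowed.");
}
```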
Closes#8097
Initialize ports only once.
JUnit uses an instance per test, which caused the port range to be
initialized twice, since suite-level tests are not configured in a
different context.
The cluster for `Scope.SUITE` tests must be initialized in a static manner
before the first test runs, otherwise the random context used to initialize
the cluster is taken from the test's randomness rather than the suite's randomness.
This means test clusters would have different setups if only a single test is
executed, or the test might even see an entirely different random sequence.
This is due to a bug in older versions causing refreshes to potentially be missed due to relocations #6545
Also:
- Changed test to keep track of ids and report missing ones.
- Removed the total count check from assertSearchHits in order to enable per-id checks in case of a mismatch
- Added a printable unique id part to the ids of dummy documents added by indexRandom. The current random unicode ids
sometimes print as ???? in the logs, making them hard to trace
If a bulk request contains a mix of indexing requests for an existing index and one for an index that needs to be auto-created, but the cluster configuration prevents auto-creation of the new index, the ingest process hangs. The exception for the failure to create an index was not caught or reported back properly. Added a JUnit test to recreate the issue; the associated fix is in TransportBulkAction.
Closes#8125
When reading metadata we catch FileNotFound and NoSuchFileExceptions
today, log the event and return an empty metadata object. Yet, in some cases
this might be the wrong thing to do, i.e. if a commit point is provided these
situations are actually an error and should be rethrown. This commit
pushes the responsibility to handle this exception to the caller.
Closes#8207
The concurrency level allows configuring the internal segments the cache
uses to store data. This can have a direct impact on eviction rates, since
memory-bound caches are equally divided into segments, which can cause
early evictions if cache entries are not well balanced.
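For illustration, here is how a concurrency level maps to internal segments in a Guava-style cache (a sketch, not the actual code of this change):
```java
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;
import com.google.common.cache.Weigher;

// hedged sketch: the weight budget is split across 16 segments, so poorly
// balanced entries can trigger early evictions inside a hot segment
Cache<String, byte[]> cache = CacheBuilder.newBuilder()
        .maximumWeight(100 * 1024 * 1024)
        .weigher(new Weigher<String, byte[]>() {
            @Override
            public int weigh(String key, byte[] value) {
                return value.length;
            }
        })
        .concurrencyLevel(16) // the concurrency level discussed above
        .build();
```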
Relates to #7836
We already have two places duplicating this rather hairy logic, so this
commit introduces a new RefCounted interface and an abstract implementation
that can be used for delegation. It factors out all the reference counting
and adds single- and multi-threaded tests for it.
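The interface itself can be as small as the following sketch (the method names are assumptions, not taken verbatim from this change):
```java
// hedged sketch of a minimal reference-counting contract
public interface RefCounted {
    void incRef();       // acquire a reference; callers must pair this with decRef()
    boolean tryIncRef(); // acquire a reference only if the resource is still open
    void decRef();       // release a reference; the last release closes the resource
}
```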
Closes#8210
Windows can throw NoSuchFileException when using Files.walkFileTree and deleting files concurrently. This commit turns such IO exceptions into assertion errors so that assertBusy will wait for them as well.
This commit rewrites the state controls in the RecoveryTarget family classes to make it easier to guarantee that:
- recovery resources are only cleared once there are no ongoing requests
- recovery is automatically canceled when the target shard is closed/removed
- canceled recoveries do not leave temp files behind
Highlights of the change:
1) All temporary files are cleared upon failure/cancel (see #7315 )
2) All newly created files are always temporary
3) Doesn't list local files on the cluster state update thread (which threw unwanted exceptions)
4) Recoveries are canceled by a listener to IndicesLifecycle.beforeIndexShardClosed, so we don't need to explicitly call it.
5) Simplifies RecoveryListener to only notify when a recovery is done or failed. Removed subtleties like ignore and retry (they are dealt with internally)
Closes#8092, Closes#7315
This commit adds the ability to enable / disable relocations
on an entire cluster or on individual indices for either:
* `primaries` - only primaries can rebalance
* `replica` - only replicas can rebalance
* `all` - everything can rebalance (default)
* `none` - all rebalances are disabled
similar to the allocation enable / disable functionality.
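As a hedged example, assuming a `cluster.routing.rebalance.enable` key analogous to the allocation one, a cluster-wide update might look like:
```json
PUT /_cluster/settings
{
  "transient": {
    "cluster.routing.rebalance.enable": "none"
  }
}
```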
Relates to #7288
If `fielddata_fields` are passed as a simple value instead of an array
we end up in an infinite loop creating parsed elements with null
values.
This commit validates the incoming token.
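To illustrate the valid form, `fielddata_fields` should be an array (field names here are made up):
```json
GET /test/_search
{
  "query": { "match_all": {} },
  "fielddata_fields": ["field1", "field2"]
}
```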
Closes#8203
Since we enabled the disk threshold decider by default, we need to
enable the cluster info service as well, so that disk usages and shard
sizes can be gathered.
Adds a test that checks that we are gathering information by default.
This commit adds throttle stats to the indexing stats and uses a callback from InternalEngine to manage the stats.
Also updates the IndexStatsTests to test for these new stats.
Stats added:
```
throttle_time_in_millis
is_throttled
```
Closes#7861
The issue with making it dynamic is that, in the event a cluster is
switched from a noop to a concrete implementation, there may be
in-flight requests; once these requests complete, we adjust the breaker
with a negative number and trip an assertion.
This also rarely uses noop breakers in InternalTestCluster.
Previously, the leniency was on a per-query basis, with each query being
parsed into multiple queries, one for each field. If any one of these
queries failed, the entire query was discarded in the name of being
lenient.
Now query parts will only be discarded if they fail for a particular
field; the entire query is not discarded. This helps when performing a
query over a numeric and string field, as only the sub-queries that are
invalid due to format exceptions will be discarded.
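For example, in a hedged sketch like the following (field names are made up), a term that fails to parse on the numeric field is now dropped for that field only, instead of discarding the whole query:
```json
GET /test/_search
{
  "query": {
    "simple_query_string": {
      "query": "foo 2014",
      "fields": ["title", "year"],
      "lenient": true
    }
  }
}
```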
Also moves the `simple_query_string` queries out of SimpleQueryTests and
into a dedicated SimpleQueryStringTests class.
Fixes#7967
The live docs that are passed down were ignored by the filter impl. Now the children filter gets wrapped with ApplyAcceptedDocsFilter, so live docs are actually applied.
Closes#8180
Also added a bwc test that runs a delete by query with a has_child query and verifies that only that operation is ignored when recovering from disk during an upgrade.
Closes#8031 Closes#8177
Query String query now supports a new `time_zone` option based on JODA time zones.
When using a range on a date field, the time zone is applied.
```json
{
  "query": {
    "query_string": {
      "query": "date:[2012 TO 2014]",
      "time_zone": "Europe/Paris"
    }
  }
}
```
Closes#7880.
Storing `_timestamp` by default means that under the default configuration, you
would have all the information you need in order to reindex into a different
index.
Close#8139
This adds a NoopCircuitBreaker, and then adds the settings
`indices.breaker.fielddata.type` and `indices.breaker.request.type`,
which can be set to "noop" in order to use a breaker that will never
break, and incurs no overhead during computation.
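For example, a node can opt into the noop breakers in `elasticsearch.yml`:
```
indices.breaker.fielddata.type: noop
indices.breaker.request.type: noop
```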
This also refactors the tests for the CircuitBreakerService to use
@Before and @After functions as well as adding settings in
ElasticsearchIntegrationTest to occasionally use NOOP breakers for all
tests.
This is functionally equivalent to before, so there should be no
user-visible impact, except I added a NOTE in the docs warning about
the interaction of pagination and rescoring.
Closes#6232 Closes#7707
This patch allows creating several netty bootstraps, each of which
listens on a different port. This will potentially allow features
to listen on different network interfaces for node-to-node or node-to-client
communication, and is also the basis for listening on several interfaces, so that those
can be used to speed up cluster communication in the future.
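Hypothetically, such bindings could be configured along these lines (this profile syntax is an illustration, not necessarily the committed format):
```
transport.profiles.default.port: 9300-9400
transport.profiles.client.port: 9500-9600
```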
Closes#8098
This change means that buckets can now be serialized to JSON, and serialized and deserialized over the transport API, outside of the aggregation that contains them. This is a required change for #8110 (Reducers framework) but should make sense on its own, since objects should really take care of their own serialization rather than relying on their parent object.
Dangling indices are indices found on disk which are not part of the cluster state. By default, we don't delete them but rather import them into the cluster state, in order to not accidentally delete data and also to allow for the ease of copying index data folders from one cluster to another. Currently, the import logic doesn't check for an existing alias with the same name as the imported dangling index, resulting in both an index and an alias with the same name.
This commit adds a protection against this. Note that the index is still kept as dangling and is only deleted from disk after `gateway.local.dangling_timeout` has passed (2 hours).
We also change the log message indicating deletion of dangling indices to the `WARN` level.
Closes#8059
This commit adds checksumming for cluster and index states. It moves
from a plain XContent based on-disk format to a more structured binary
format including a header and footer as well as a CRC32 checksum for
each of these files. Since previous versions didn't write any format
identifier etc. this commit adds a file extension `.st` for states
that have header/footer and checksum.
This commit also moves over to writing temporary files and moving them into
place with an atomic move operation. It also serializes and
deserializes the states without materializing them into opaque memory.
Closes#7586
Though we check that the internet connection is fine and that the service is available, it could happen that some services are not available.
```java
assumeTrue(isDownloadServiceWorking("search.maven.org", 80, "/"));
singlePluginInstallAndRemove("org.elasticsearch/elasticsearch-transport-thrift/2.4.0", null);
```
In that case, the first check was successful, but downloading the thrift plugin from the maven central download service was not.
This is not an issue with the plugin manager itself, but sometimes the test could fail.
Adding a check that actual download service in Maven central works:
```java
assumeTrue(isDownloadServiceWorking("repo1.maven.org", 443, "/maven2/org/elasticsearch/elasticsearch-transport-thrift/2.4.0/elasticsearch-transport-thrift-2.4.0.pom"));
```
Today we use the current version, which might enable features that are
not supported. We should throw an exception if this setting is not
present for any index.
Closes#8018
Since we don't use the cache, it's okay to clear it entirely if needed;
Elasticsearch maintains its own cache for compiled scripts.
Adds loader.clearCache() into a listener; the listener is called when a
script is removed from the Guava cache.
This also lowers the number of cached scripts to 100, since 500 is
around the limit some users have run into before hitting an out of
memory error in permgen.
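In Guava terms the wiring could look roughly like this sketch (`ScriptLoader` is a placeholder for the engine that owns its own cache):
```java
import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;
import com.google.common.cache.RemovalListener;
import com.google.common.cache.RemovalNotification;

interface ScriptLoader {
    void clearCache(); // placeholder for the script engine's own cleanup hook
}

// hedged sketch: evicting a compiled script also clears the engine's cache
Cache<String, Object> buildScriptCache(final ScriptLoader loader) {
    return CacheBuilder.newBuilder()
            .maximumSize(100) // the lowered script cache size mentioned above
            .removalListener(new RemovalListener<String, Object>() {
                @Override
                public void onRemoval(RemovalNotification<String, Object> notification) {
                    loader.clearCache();
                }
            })
            .build();
}
```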
Fixes#7658
Fixes a bug where alias creation would allow `null` for index name, which thereby
applied the alias to _all_ indices. This patch makes the validator throw an
exception if the index is null.
```bash
POST /_aliases
{
  "actions": [
    {
      "add": {
        "alias": "empty-alias",
        "index": null
      }
    }
  ]
}
```
```json
{
  "error": "ActionRequestValidationException[Validation Failed: 1: Alias action [add]: [index] may not be null;]",
  "status": 400
}
```
The reason this bug wasn't caught by the existing tests is that
the old test for nullness only validated against a cluster which had
zero indices. The null index is translated into "_all", and since
there are no indices, this fails because the index doesn't exist.
So the test passes.
However, as soon as you add an index, "_all" resolves and you get the
situation described in the original bug report: null index is
accepted by the alias, resolves to "_all" and gets applied to everything.
Fixes#7976
The scripted metric aggregation is now a PER_BUCKET aggregation, so that parent buckets are evaluated independently. Also, the params and reduceParams are copied for each instance of the aggregator (each parent bucket), so modifications to the values are kept only within the scope of their parent bucket.
Closes#8036
By letting the fetch phase understand the nested docs structure, we can serve nested docs as hits.
Because of this commit, the `top_hits` aggregation can be placed in a `nested` or `reverse_nested` aggregation.
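For example, a request along these lines becomes possible (index and field names are made up):
```json
GET /test/_search?search_type=count
{
  "aggs": {
    "comments": {
      "nested": { "path": "comments" },
      "aggs": {
        "top_comments": {
          "top_hits": { "size": 1 }
        }
      }
    }
  }
}
```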
Closes#7164
* Run flush in beforeIndexShardClosed to prevent an empty shard.
* Only run check index if the shard state before closing was: started, relocated or post_recovery
This commit does the following:
* Add the new API at the rest layer, being backed by the optimize API
with upgrade flag, and segments api to find upgrade status.
* Add `upgrade` flag to optimize API, and deprecate `force` flag (will
remove in master)
* Add test for both synchronous and async upgrade
closes#7884 closes#7922
When removing and then installing a plugin again, all configuration files in the `config/pluginname` dir will be removed.
This is bad, as users may have set and added specific configuration files.
During an install, if we detect already existing files in the `config/pluginname` directory, we simply copy the new file to the same dir but append a `.new` suffix to it.
Related to #5064.
(cherry picked from commit 5da028f)
(cherry picked from commit 4cb1f95)
The currently used method `testRunStarted` is only called before any tests have been run; we need to reset that state before each test, which is why we need to use `testStarted`.
The logging configuration was expected in the path.home folder, which is set to
target/JX
when running the bwc tests from the console.
Therefore the logger could not be initialized, failing with the error message below:
```
[INFO] Failed to configure logging...
org.elasticsearch.ElasticsearchException: Failed to load logging configuration
at org.elasticsearch.common.logging.log4j.LogConfigurator.resolveConfig(LogConfigurator.java:117)
at org.elasticsearch.common.logging.log4j.LogConfigurator.configure(LogConfigurator.java:81)
at org.elasticsearch.bootstrap.Bootstrap.setupLogging(Bootstrap.java:96)
at org.elasticsearch.bootstrap.Bootstrap.main(Bootstrap.java:180)
at org.elasticsearch.bootstrap.Elasticsearch.main(Elasticsearch.java:32)
Caused by: java.nio.file.NoSuchFileException: /home/britta/es/target/J0/config
at sun.nio.fs.UnixException.translateToIOException(UnixException.java:86)
at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102)
at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:107)
at sun.nio.fs.UnixFileAttributeViews$Basic.readAttributes(UnixFileAttributeViews.java:55)
at sun.nio.fs.UnixFileSystemProvider.readAttributes(UnixFileSystemProvider.java:144)
at sun.nio.fs.LinuxFileSystemProvider.readAttributes(LinuxFileSystemProvider.java:97)
at java.nio.file.Files.readAttributes(Files.java:1686)
at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:109)
at java.nio.file.FileTreeWalker.walk(FileTreeWalker.java:69)
at java.nio.file.Files.walkFileTree(Files.java:2602)
at org.elasticsearch.common.logging.log4j.LogConfigurator.resolveConfig(LogConfigurator.java:107)
... 4 more
log4j:WARN No appenders could be found for logger (node).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
```
Setting the config directory fixes this.
Logs from external nodes are still not printed properly. They are inserted into the log
whenever the stdout is printed ([WARNING] JVM J0: stdout was not empty...)
closes#7964
Before this change the write consistency check was performed both on the node that receives the write request and on the node that holds the primary shard. This change removes the check on the node that receives the request, since it is redundant.
This change also moves the write consistency check on the node that holds the primary shard to a later moment, after the thread has forked to perform the actual write on the primary shard.
Closes#7873
When the date format is defined in the mapping, you cannot use another format when querying using a range date query or filter.
For example, this won't work:
```
DELETE /test
PUT /test/t/1
{
  "date": "2014-01-01"
}
GET /test/_search
{
  "query": {
    "filtered": {
      "filter": {
        "range": {
          "date": {
            "from": "01/01/2014"
          }
        }
      }
    }
  }
}
```
It causes:
```
Caused by: org.elasticsearch.ElasticsearchParseException: failed to parse date field [01/01/2014], tried both date format [dateOptionalTime], and timestamp number
```
It would be nice if we could support another date format at query time, just like we support `analyzer` at search time on String fields.
Something like:
```
GET /test/_search
{
  "query": {
    "filtered": {
      "filter": {
        "range": {
          "date": {
            "from": "01/01/2014",
            "format": "dd/MM/yyyy"
          }
        }
      }
    }
  }
}
```
Same for queries:
```
GET /test/_search
{
  "query": {
    "range": {
      "date": {
        "from": "01/01/2014",
        "format": "dd/MM/yyyy"
      }
    }
  }
}
```
Closes#7189.
This change removes the script_type parameter from the Scripted Metric Aggregation, and adds support for _file and _id suffixes to the init_script, map_script, combine_script and reduce_script parameters, to make defining the source of the script consistent with the other APIs which use the ScriptService.
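Under that convention, referencing file-based scripts might look like the following sketch (script names are placeholders):
```json
{
  "aggs": {
    "profit": {
      "scripted_metric": {
        "init_script_file": "my_init",
        "map_script_file": "my_map",
        "combine_script_file": "my_combine",
        "reduce_script_file": "my_reduce"
      }
    }
  }
}
```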
This adds a `per_field_analyzer` parameter to the Term Vectors API, which
allows overriding the default analyzer at the field level. If the field already
stores term vectors, then they will be re-generated. Since the MLT Query uses
the Term Vectors API under the hood, this commit also adds the same ability
to the MLT Query, thereby allowing users to fine-tune how each field item
should be processed and analyzed.
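A hedged example of the new parameter (index, type and field names are illustrative):
```json
GET /test/doc/1/_termvector
{
  "fields": ["text"],
  "per_field_analyzer": {
    "text": "whitespace"
  }
}
```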
Closes#7801
The randomisation that deletes documents is also removed from tests, as with this doc-accounting change the specific scores expected in tests would be subject to random variability and so would fail.
Closes#7951
By default term vectors are now realtime, as opposed to previously near
realtime. If they are not found in the index, they will be generated on the
fly. The document is fetched from the transaction log and treated as an
artificial document. One can set `realtime` parameter to `false` in order to
disable this functionality. This consequently makes the MLT query realtime in
fetching documents, as it previously was before switching from the
multi get API to the multi term vectors API.
Closes#7846
This commit makes the lookup structures that are used for mappings immutable.
When changes are required, a new instance is created while the current instance
is left unmodified. This is done efficiently thanks to a hash table
implementation based on an array hash trie, see
org.elasticsearch.common.collect.CopyOnWriteHashMap.
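Usage is copy-and-modify rather than in-place mutation; roughly (the `copyAndPut` method name is an assumption):
```java
import org.elasticsearch.common.collect.CopyOnWriteHashMap;

// hedged sketch: updates return a new map while the original stays untouched
CopyOnWriteHashMap<String, String> mappers = new CopyOnWriteHashMap<>();
CopyOnWriteHashMap<String, String> updated = mappers.copyAndPut("title", "string");
assert mappers.get("title") == null;          // old instance unmodified
assert "string".equals(updated.get("title")); // new instance sees the change
```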
ManyMappingsBenchmark returns indexing times that are similar to the ones that
can be observed in current master.
Ultimately, I would like to see if we can make mappings completely immutable as
well and updated atomically. This is not trivial however, eg. because of dynamic
mappings. So here is a first baby step that should help move towards that
direction.
Close#7486
The Put Warmer API executes the search encapsulated in the warmer before accepting it. This requires that at least one shard has started. The tests used ensureGreen to check for that, but with a publish timeout of 0 (needed to check the ack mechanism) that doesn't guarantee the shard has really started - just that the master has changed the cluster state to say so. This commit changes the ensureGreen to the indexing of a single document.
Adds a check to make sure that all ids in the query are either strings
or numbers. This is to prevent the case where a user accidentally
specifies:
"ids": [["1", "2"]]
(note the double array)
With this change, an exception will be thrown since the second "[" is
not a string or number, it is a Token.START_ARRAY.
Fixes#7686
The PluginManager had a subtle bug in case the config directory was not in the
es home directory - which is always true in case of packaging.
This fixes the plugin manager, so that when specifying a path.home and a
path.conf variable on the commandline, the plugin manager acts
appropriately.
Before this change, all persistent custom metadata was stored as part of a snapshot, which required us to remove the repositories metadata later, during the recovery process. This change allows custom metadata to specify whether or not it should be stored as part of a snapshot.
Fixes#7900
Failing a parent breaker check is eventually consistent, so the test
could fail the parent limit, throw an exception, and before being
adjusted back down, increment more and throw a circuit breaking
exception on the child. This increases the child's limit, to ensure
we're only testing the parent limit.
It adds an additional assert to ensure that the breaker total is
correctly re-adjusted when the parent breaker has been tripped.
This does the following:
* Make 'force' flag only build a merge if the delegate MP returned no merges
* Add async handling for 'flush' when 'waitForMerges' is false
* Remove flush at the beginning of optimize. This is something the user can
do if they wish, before calling optimize.
closes#7886 closes#7904 closes#7920
Remove the per-test creation of a node client through the setup method; `numClientNodes` makes sure that the client node gets created during suite cluster initialization.
Previously, the only way to specify a document not present in the index was to
use `like_text`. This would usually lead to complex queries made of multiple
MLT queries per document field. This commit adds the ability to the MLT query
to directly specify documents not present in the index (artificial documents).
The syntax is similar to the Percolator API or to the Multi Term Vector API.
Closes#7725
Create client nodes using `node.client: true` instead of `node.data: false` and `node.master: false`.
We should create client nodes in our test infra using the `node.client: true` setting, as that is the one that users use, and the one that we use as well in `ClientNodePredicate`; otherwise we end up not finding client nodes, as they weren't created with the proper setting.
Also updated the `DataNodePredicate` so that `client: true` is enough, with no need for `data: false` as well.
Closes#7911