OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ali Beyad	9878aae144	Ensure every repository has an incompatible-snapshots blob (#24403 ) In #22267, we introduced the notion of incompatible snapshots in a repository, and they were stored in a root-level blob named `incompatible-snapshots`. If there were no incompatible snapshots in the repository, then there was no `incompatible-snapshots` blob. However, this causes a problem for some cloud-based repositories, because if the blob does not exist, the cloud-based repositories may attempt to keep retrying the read of a non-existent blob with expontential backoff until giving up. This causes performance issues (and potential timeouts) on snapshot operations because getting the `incompatible-snapshots` is part of getting the repository data (see RepositoryData#getRepositoryData()). This commit fixes the issue by creating an empty `incompatible-snapshots` blob in the repository if one does not exist.	2017-05-01 15:09:21 -04:00
Nik Everett	8f1fe51f34	Test: more logging for refresh listeners many threads test This test fails from time to time but doesn't give enough information for me to figure out what caused the assertion to fail. Relates to #24418	2017-05-01 14:50:13 -04:00
Luca Cavanna	91fbb0ba28	Move IndicesAliasesRequest#concreteAliases to TransportIndicesAliasesAction (#24400 ) This method has to do with how the transport action may or may not resolve wildcards expressions to aliases names. It is only needed in TransportIndicesAliasesAction and for this reason it should be a private method in it rather than part of a request class which is also part of the Java API and later in the high level REST client.	2017-05-01 19:59:06 +02:00
Nik Everett	62712bf653	Test: extra debugging for refresh listeners stats This test failed in CI but didn't give us enough information to debug it: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+nfs/223/consoleFull This turns on more debugging and make the test fail more quickly if something is obviously wrong.	2017-05-01 11:46:55 -04:00
Koen De Groote	0fef5acd01	Cleanup collections construction This commit cleans up some cases where a list or map was being constructed, and then an existing collection was copied into the new collection. The clean is to instead use an appropriate constructor to directly copy the existing collection in during collection construction. The advantage of this is that the new collection is sized appropriately. Relates #24409	2017-04-30 21:26:51 -04:00
Yannick Welsch	86aab98fde	[TEST] Fix race condition in ZenDiscoveryIT.testDiscoveryStats With #24236, the master now uses the pending queue when publishing to itself. This means that a cluster state update is put into the pending queue, then sent to the ClusterApplierService to be applied. After it has been applied, it is marked as processed and removed from the pending queue. ensureGreen is implemented as a cluster health action that waits on certain conditions, which will lead to a cluster state update task to be submitted on the master. When this task gets to run and the conditions are not satisfied yet, it register a cluster state observer. This observer is registered on the ClusterApplierService and waits on cluster state change events. ClusterApplierService first notifies the observer and then the discovery layer. This means that there is a small time frame where ensureGreen can complete and call the node stats to find the pending queue still containing the last cluster state update. Closes #24388	2017-04-28 20:42:56 +02:00
Yannick Welsch	9c55bca8fb	Fix node failure detection race when updating cluster state Failure detection should only be updated in ZenDiscovery after the current state has been updated to prevent a race condition with handleLeaveRequest and handleNodeFailure as those check the current state to determine whether the failure is to be handled by this node.	2017-04-28 19:52:27 +02:00
olcbean	55daf743d7	Open and close index to honour allow_no_indices option (#24222 ) Open/close index API when executed providing an index expressions that matched no indices, threw an error even when allow_no_indices was set to true. The APIs should rather honour the option and behave as a no-op in that case. Closes #24031	2017-04-28 17:41:05 +02:00
Christoph Büscher	eb002340d7	Don't render InternalGeoCentroid in static helper method This was introduced by a previous commit but is not necessary.	2017-04-28 16:55:07 +02:00
Christoph Büscher	16a7cbe463	Add `count` value to rest output of `geo_centroid` (#24387 ) Currently we don't write the count value to the geo_centroid aggregation rest response, but it is provided via the java api and the count() method in the GeoCentroid interface. We should add this parameter to the rest output and also provide it via the getProperty() method.	2017-04-28 16:25:22 +02:00
Nik Everett	e3b7b88756	Fix compilation in Ecipse (#24391 ) Eclipse doesn't allow extra semicolons after an import statement: ``` import foo.Bar;; // <-- syntax error! ``` Here is the Eclipse bug: https://bugs.eclipse.org/bugs/show_bug.cgi?id=425140 which the Eclipse folks closed as "the spec doesn't allow these semicolons so why should we?" Which is fair. Here is the bug against javac for allowing them: https://bugs.openjdk.java.net/browse/JDK-8027682 which hasn't been touched since 2013 without explanation. There is, however, a rather educations mailing list thread: http://mail.openjdk.java.net/pipermail/compiler-dev/2013-August/006956.html which contains gems like, "In general, it is better/simpler to change javac to conform to the spec. (Except when it is not.)" I suspect the reason this hasn't been fixed is: ``` FWIW, if we change javac such that the set of programs accepted by javac is changed, we have an process (currently Oracle internal) to get approval for such a change. So, we would not simply change javac on a whim to meet the spec; we would at least have other eyes looking at the behavioral change to determine if it is "acceptable". ``` from http://mail.openjdk.java.net/pipermail/compiler-dev/2013-August/006973.html	2017-04-28 09:52:14 -04:00
Guillaume Le Floch	382a617d34	Handle multiple aliases in _cat/aliases api (#23698 ) The alias parameter was documented as a list in our rest-spec, yet only the first value out of a list was getting read and processed. This commit adds support for multiple aliases to _cat/aliases Closes #23661	2017-04-28 15:21:44 +02:00
Kunal Kapoor	a5bd2012b6	Added validation for upserd request (#24282 ) The version on an update request is a syntactic sugar for get of a specific version, doc merge and a version index. This changes it to reject requests with both upsert and a version. If the upsert index request is versioned, we also reject the op.	2017-04-28 08:02:09 -04:00
Yannick Welsch	a72db191f2	Weaken assertion in ZenDiscovery.publish The previous commit (`35f78d098a`) introduced an assertion in ZenDiscovery that was overly restrictive - it could trip when a cluster state that was successfully published would not be applied locally because a master with a better cluster state came along in the meantime.	2017-04-28 11:33:53 +02:00
Yannick Welsch	35f78d098a	Separate publishing from applying cluster states (#24236 ) Separates cluster state publishing from applying cluster states: - ClusterService is split into two classes MasterService and ClusterApplierService. MasterService has the responsibility to calculate cluster state updates for actions that want to change the cluster state (create index, update shard routing table, etc.). ClusterApplierService has the responsibility to apply cluster states that have been successfully published and invokes the cluster state appliers and listeners. - ClusterApplierService keeps track of the last applied state, but MasterService is stateless and uses the last cluster state that is provided by the discovery module to calculate the next prospective state. The ClusterService class is still kept around, which now just delegates actions to ClusterApplierService and MasterService. - The discovery implementation is now responsible for managing the last cluster state that is used by the consensus layer and the master service. It also exposes the initial cluster state which is used by the ClusterApplierService. The discovery implementation is also responsible for adding the right cluster-level blocks to the initial state. - NoneDiscovery has been renamed to TribeDiscovery as it is exclusively used by TribeService. It adds the tribe blocks to the initial state. - ZenDiscovery is synchronized on state changes to the last cluster state that is used by the consensus layer and the master service, and does not submit cluster state update tasks anymore to make changes to the disco state (except when becoming master). Control flow for cluster state updates is now as follows: - State updates are sent to MasterService - MasterService gets the latest committed cluster state from the discovery implementation and calculates the next cluster state to publish - MasterService submits the new prospective cluster state to the discovery implementation for publishing - Discovery implementation publishes cluster states to all nodes and, once the state is committed, asks the ClusterApplierService to apply the newly committed state. - ClusterApplierService applies state to local node.	2017-04-28 09:34:31 +02:00
Toby McLaughlin	e4bb360ae0	Fix typo in node environment exception message This commit fixes a typo in an exception message when trying to create a node environment. Relates #24381	2017-04-28 00:48:52 -04:00
Zachary Tong	350573290f	Agg builder accessibility fixes (#24323 ) - Getters for DateHisto `interval` and `offset` should return a long, not double - Add getter for the filter in a FilterAgg - Add getters for subaggs / pipelines in base AggregationBuilder	2017-04-27 16:50:59 -04:00
Ryan Ernst	cdcc75dd2a	Plugins: Add support for platform specific plugins (#24265 ) This commit adds support for plugins having a platform specific variant. It also adds unit tests for all official and maven urls.	2017-04-27 11:27:29 -07:00
Ali Beyad	2facc42a55	Change snapshot status error to use generic SnapshotException (#24355 ) Changes the snapshot status read exception from the (misleading) IndexShardRestoreFailedException to the generic SnapshotException Closes #24225	2017-04-27 09:36:26 -04:00
Yannick Welsch	2fa1c9fff1	Provide target allocation id as part of start recovery request (#24333 ) This makes it possible for the recovery source to verify that it is talking to the shard it thinks it is talking to. Closes #24167	2017-04-27 14:45:44 +02:00
Tim Vernum	65f90b25e0	Pass Context to ConstructingObjectParser's function (#24230 ) Allow the `Context` to be used in the builder function used within ConstructingObjectParser. This facilitates scenarios where a constructor argument comes from a URL parameter, or from document id.	2017-04-27 20:26:10 +10:00
Clinton Gormley	8a8410b5ce	Added bwc indices for v2.4.5	2017-04-27 10:30:53 +02:00
Ryan Ernst	4a5c3c5a4a	Test: Write node ports file before starting tribe service (#24351 ) The tribe service can take a while to initialize, depending on how many cluster it needs to connect to. This change moves writing the ports file used by tests to before the tribe service is started.	2017-04-27 09:59:54 +02:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Koen De Groote	7f9d84cb1a	The parseObject method in DocumentParse can be void. There is no point in the code that actually expects the return, plus the variable created for it was never actually used. (#24350 )	2017-04-26 16:54:23 -06:00
Ali Beyad	d387dcfd6c	[TEST] fixes NPE in RoutingTableTests	2017-04-26 18:39:26 -04:00
Ali Beyad	0e74f5ddb1	[TEST] fixes shard count of source shard index in a restore shrink index test	2017-04-26 16:34:53 -04:00
Nik Everett	bc45d10e82	Remove most usages of 1-arg Script ctor (#24325 ) The one argument ctor for `Script` creates a script with the default language but most usages of are for testing and either don't care about the language or are for use with `MockScriptEngine`. This replaces most usages of the one argument ctor on `Script` with calls to `ESTestCase#mockScript` to make it clear that the tests don't need the default scripting language. I've also factored out some copy and pasted script generation code into a single place. I would have had to change that code to use `mockScript` anyway, so it was easier to perform the refactor. Relates to #16314	2017-04-26 16:04:38 -04:00
Luca Cavanna	149629fec6	Cross Cluster Search: propagate original indices per cluster (#24328 ) In case of a Cross Cluster Search, the coordinating node should split the original indices per cluster, and send over to each cluster only its own set of original indices, rather than the set taken from the original search request which contains all the indices. In fact, each remote cluster should not be aware of the indices belonging to other remote clusters.	2017-04-26 21:45:49 +02:00
Yannick Welsch	b7bf651738	[TEST] Fix cluster forming in testDynamicUpdateMinimumMasterNodes This test can run into a split-brain situation as minimum_master_nodes is not properly set. To prevent this, make sure that at least one of the two master nodes that are initially started has minimum_master_nodes correctly set.	2017-04-26 21:13:27 +02:00
Martijn van Groningen	ebe98f9d62	test: don't randomly wrap index reader	2017-04-26 21:07:56 +02:00
Ali Beyad	0e52e3420e	Fixes restore of a shrunken index when initial recovery node is gone (#24322 ) When an index is shrunk using the shrink APIs, the shrink operation adds some internal index settings to the shrink index, for example `index.shrink.source.name\|uuid` to denote the source index, as well as `index.routing.allocation.initial_recovery._id` to denote the node on which all shards for the source index resided when the shrunken index was created. However, this presents a problem when taking a snapshot of the shrunken index and restoring it to a cluster where the initial recovery node is not present, or restoring to the same cluster where the initial recovery node is offline or decomissioned. The restore operation fails to allocate the shard in the shrunken index to a node when the initial recovery node is not present, and a restore type of recovery will not go through the PrimaryShardAllocator, meaning that it will not have the chance to force allocate the primary to a node in the cluster. Rather, restore initiated shard allocation goes through the BalancedShardAllocator which does not attempt to force allocate a primary. This commit fixes the aforementioned problem by not requiring allocation to occur on the initial recovery node when the recovery type is a restore of a snapshot. This commit also ensures that the internal shrink index settings are recognized and not archived (which can trip an assertion in the restore scenario). Closes #24257	2017-04-26 14:48:10 -04:00
Koen De Groote	3187ed73fc	Removal of dead code in ScriptedMetricAggregationBuilder (#24346 ) This code removes a few lines of dead code from ScriptedMetricAggregationBuilder. Just completely dead code, it adds things to a Set that is then not used in any way.	2017-04-26 14:44:03 -04:00
Koen De Groote	4c0eb35c22	Removal of dead code from SnapshotsService (#24347 ) This code removes a few lines of dead code from SnapshotsService. Looks like a forgotten remnant of a past implementation.	2017-04-26 14:32:35 -04:00
Nik Everett	7c3efb829b	Move char filters into analysis-common (#24261 ) Another step down the road to dropping the lucene-analyzers-common dependency from core. Note that this removes some tests that no longer compile from core. I played around with adding them to the analysis-common module where they would compile but we already test these in the tests generated from the example usage in the documentation. I'm not super happy with the way that `requriesAnalysisSettings` works with regards to plugins. I think it'd be fairly bug-prone for plugin authors to use. But I'm making it visible as is for now and I'll rethink later. A part of #23658	2017-04-26 13:25:34 -04:00
Christoph Büscher	db1b243343	InternalPercentilesBucket should not rely on ordered percents array (#24336 ) Currently InternalPercentilesBucket#percentile() relies on the percent array passed in to be in sorted order. This changes the aggregation to store an internal lookup table that is constructed from the percent/percentiles arrays passed in that can be used to look up the percentile values. Closes #24331	2017-04-26 19:15:48 +02:00
Yannick Welsch	91b61ce569	[TEST] Do a reroute with retry_failed after a bridge partition on testAckedIndexing In case of a bridge partition, shard allocation can fail "index.allocation.max_retries" times if the master is the super-connected node and recovery source and target are on opposite sides of the bridge. This commit adds a reroute with retry_failed after healing the network partition so that the ensureGreen check succeeds.	2017-04-26 16:08:16 +02:00
Jay Modi	7f8fe8b81d	StreamInput throws exceptions instead of using assertions (#24294 ) StreamInput has methods such as readVInt that perform sanity checks on the data using assertions, which will catch bad data in tests but provide no safety when running as a node without assertions enabled. The use of assertions also make testing with invalid data difficult since we would need to handle assertion errors in the code using the stream input and errors like this should not be something we try to catch. This commit introduces a flag that will throw an IOException instead of using an assertion.	2017-04-26 07:23:07 -04:00
Martijn van Groningen	c17de49a6d	[percolator] Fix memory leak when percolator uses bitset or field data cache. The percolator doesn't close the IndexReader of the memory index any more. Prior to 2.x the percolator had its own SearchContext (PercolatorContext) that did this, but that was removed when the percolator was refactored as part of the 5.0 release. I think an alternative way to fix this is to let percolator not use the bitset and fielddata caches, that way we prevent the memory leak. Closes #24108	2017-04-26 11:08:15 +02:00
Koen De Groote	3c845727f8	Replace alternating regex with character classes This commit replaces two alternating regular expressions (that is, regular expressions that consist of the form a\|b where a and b are characters) with the equivalent regular expression rewritten as a character class (that is, [ab]) The reason this is an improvement is because a\|b involves backtracking while [ab] does not. Relates #24316	2017-04-25 22:15:00 -04:00
Guillaume Le Floch	739cb35d1b	Allow passing single scrollID in clear scroll API body (#24242 ) * Allow single scrollId in string format Closes #24233	2017-04-25 13:43:21 +02:00
Koen De Groote	88de33d43d	Minor changes to collection creation from enums (#24274 ) These changes are mainly cosmetic with minor perf advantages drawn from checkstyle.	2017-04-25 13:13:55 +02:00
Ryan Ernst	6ebf08759b	Templates: Add compileTemplate method to ScriptService for template consumers (#24280 ) This commit adds a compileTemplate method to the ScriptService. Eventually this will be used to easily cutover all consumers to a new TemplateService. relates #16314	2017-04-24 15:45:20 -07:00
Christoph Büscher	026bf2e3ee	Remove getCountAsString() from InternalStats and Stats interface (#24291 ) The `count` value in the stats aggregation represents a simple doc count that doesn't require a formatted version. We didn't render an "as_string" version for count in the rest response, so the method should also be removed in favour of just using String.valueOf(getCount()) if a string version of the count is needed. Closes #24287	2017-04-24 18:40:57 +02:00
Ali Beyad	c5b6f52ecc	Fixes maintaining the shards a snapshot is waiting on (#24289 ) There was a bug in the calculation of the shards that a snapshot must wait on, due to their relocating or initializing, before the snapshot can proceed safely to snapshot the shard data. In this bug, an incorrect key was used to look up the index of the waiting shards, resulting in the fact that each index would have at most one shard in the waiting state causing the snapshot to pause. This could be problematic if there are more than one shard in the relocating or initializing state, which would result in a snapshot prematurely starting because it thinks its only waiting on one relocating or initializing shard (when in fact there could be more than one). While not a common case and likely rare in practice, it is still problematic. This commit fixes the issue by ensuring the correct key is used to look up the waiting indices map as it is being built up, so the list of waiting shards for each index (those shards that are relocating or initializing) are aggregated for a given index instead of overwritten.	2017-04-24 10:59:08 -04:00
Martijn van Groningen	dabbf5d4f4	[TEST] Added unittests for InternalGeoCentroid Relates to #22278	2017-04-24 16:57:25 +02:00
Nilabh Sagar	373edee29a	Provide informative error message in case of unknown suggestion context. (#24241 ) Provide a list of available contexts when you send an unknown context to the completion suggester.	2017-04-24 10:35:14 -04:00
Jason Tedor	1500beafc7	Check for default.path.data included in path.data If the user explicitly configured path.data to include default.path.data, then we should not fail the node if we find indices in default.path.data. This commit addresses this. Relates #24285	2017-04-24 09:31:54 -04:00
Jason Tedor	a7947b404b	Fix hash code for AliasFilter This commit fixes the hash code for AliasFilter as the previous implementation was neglecting to take into consideration the fact that the aliases field is an array and thus a deep hash code of it should be computed rather than a shallow hash code on the reference. Relates #24286	2017-04-24 09:06:36 -04:00
Yannick Welsch	7c395070e2	[TEST] Wait for tribe node to be fully connected before shutting it down The tribe was being shutdown by the test while a publishing round (that adds the tribe node to a cluster) is not completed yet (i.e. the node itself knows that it became part of the cluster, and the test shuts the tribe node down, but another node has not applied the cluster state yet, which makes that node hang while trying to connect to the node that is shutting down (due to connect_timeout being 30 seconds), delaying publishing for 30 seconds, and subsequently tripping an assertion when another tribe instance wants to join. Relates to #23695	2017-04-24 12:27:41 +02:00

1 2 3 4 5 ...

7936 Commits