OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-03 09:29:11 +00:00

Author	SHA1	Message	Date
Ali Beyad	0e52e3420e	Fixes restore of a shrunken index when initial recovery node is gone (#24322 ) When an index is shrunk using the shrink APIs, the shrink operation adds some internal index settings to the shrink index, for example `index.shrink.source.name|uuid` to denote the source index, as well as `index.routing.allocation.initial_recovery._id` to denote the node on which all shards for the source index resided when the shrunken index was created. However, this presents a problem when taking a snapshot of the shrunken index and restoring it to a cluster where the initial recovery node is not present, or restoring to the same cluster where the initial recovery node is offline or decommissioned. The restore operation fails to allocate the shard in the shrunken index to a node when the initial recovery node is not present, and a restore type of recovery will not go through the PrimaryShardAllocator, meaning that it will not have the chance to force allocate the primary to a node in the cluster. Rather, restore initiated shard allocation goes through the BalancedShardAllocator which does not attempt to force allocate a primary. This commit fixes the aforementioned problem by not requiring allocation to occur on the initial recovery node when the recovery type is a restore of a snapshot. This commit also ensures that the internal shrink index settings are recognized and not archived (which can trip an assertion in the restore scenario). Closes #24257	2017-04-26 14:48:10 -04:00
Koen De Groote	3187ed73fc	Removal of dead code in ScriptedMetricAggregationBuilder (#24346 ) This code removes a few lines of dead code from ScriptedMetricAggregationBuilder. Just completely dead code, it adds things to a Set that is then not used in any way.	2017-04-26 14:44:03 -04:00
Koen De Groote	4c0eb35c22	Removal of dead code from SnapshotsService (#24347 ) This code removes a few lines of dead code from SnapshotsService. Looks like a forgotten remnant of a past implementation.	2017-04-26 14:32:35 -04:00
Nik Everett	7c3efb829b	Move char filters into analysis-common (#24261 ) Another step down the road to dropping the lucene-analyzers-common dependency from core. Note that this removes some tests that no longer compile from core. I played around with adding them to the analysis-common module where they would compile but we already test these in the tests generated from the example usage in the documentation. I'm not super happy with the way that `requriesAnalysisSettings` works with regards to plugins. I think it'd be fairly bug-prone for plugin authors to use. But I'm making it visible as is for now and I'll rethink later. A part of #23658	2017-04-26 13:25:34 -04:00
Christoph Büscher	db1b243343	InternalPercentilesBucket should not rely on ordered percents array (#24336 ) Currently InternalPercentilesBucket#percentile() relies on the percent array passed in to be in sorted order. This changes the aggregation to store an internal lookup table that is constructed from the percent/percentiles arrays passed in that can be used to look up the percentile values. Closes #24331	2017-04-26 19:15:48 +02:00
Yannick Welsch	91b61ce569	[TEST] Do a reroute with retry_failed after a bridge partition on testAckedIndexing In case of a bridge partition, shard allocation can fail "index.allocation.max_retries" times if the master is the super-connected node and recovery source and target are on opposite sides of the bridge. This commit adds a reroute with retry_failed after healing the network partition so that the ensureGreen check succeeds.	2017-04-26 16:08:16 +02:00
Jay Modi	7f8fe8b81d	StreamInput throws exceptions instead of using assertions (#24294 ) StreamInput has methods such as readVInt that perform sanity checks on the data using assertions, which will catch bad data in tests but provide no safety when running as a node without assertions enabled. The use of assertions also makes testing with invalid data difficult since we would need to handle assertion errors in the code using the stream input and errors like this should not be something we try to catch. This commit introduces a flag that will throw an IOException instead of using an assertion.	2017-04-26 07:23:07 -04:00
Martijn van Groningen	c17de49a6d	[percolator] Fix memory leak when percolator uses bitset or field data cache. The percolator doesn't close the IndexReader of the memory index any more. Prior to 2.x the percolator had its own SearchContext (PercolatorContext) that did this, but that was removed when the percolator was refactored as part of the 5.0 release. I think an alternative way to fix this is to let percolator not use the bitset and fielddata caches, that way we prevent the memory leak. Closes #24108	2017-04-26 11:08:15 +02:00
Koen De Groote	3c845727f8	Replace alternating regex with character classes This commit replaces two alternating regular expressions (that is, regular expressions that consist of the form a|b where a and b are characters) with the equivalent regular expression rewritten as a character class (that is, [ab]). The reason this is an improvement is because a|b involves backtracking while [ab] does not. Relates #24316	2017-04-25 22:15:00 -04:00
Guillaume Le Floch	739cb35d1b	Allow passing single scrollID in clear scroll API body (#24242 ) * Allow single scrollId in string format Closes #24233	2017-04-25 13:43:21 +02:00
Koen De Groote	88de33d43d	Minor changes to collection creation from enums (#24274 ) These changes are mainly cosmetic with minor perf advantages drawn from checkstyle.	2017-04-25 13:13:55 +02:00
Ryan Ernst	6ebf08759b	Templates: Add compileTemplate method to ScriptService for template consumers (#24280 ) This commit adds a compileTemplate method to the ScriptService. Eventually this will be used to easily cutover all consumers to a new TemplateService. relates #16314	2017-04-24 15:45:20 -07:00
Christoph Büscher	026bf2e3ee	Remove getCountAsString() from InternalStats and Stats interface (#24291 ) The `count` value in the stats aggregation represents a simple doc count that doesn't require a formatted version. We didn't render an "as_string" version for count in the rest response, so the method should also be removed in favour of just using String.valueOf(getCount()) if a string version of the count is needed. Closes #24287	2017-04-24 18:40:57 +02:00
Ali Beyad	c5b6f52ecc	Fixes maintaining the shards a snapshot is waiting on (#24289 ) There was a bug in the calculation of the shards that a snapshot must wait on, due to their relocating or initializing, before the snapshot can proceed safely to snapshot the shard data. In this bug, an incorrect key was used to look up the index of the waiting shards, with the result that each index would have at most one shard in the waiting state causing the snapshot to pause. This could be problematic if there is more than one shard in the relocating or initializing state, which would result in a snapshot prematurely starting because it thinks it's only waiting on one relocating or initializing shard (when in fact there could be more than one). While not a common case and likely rare in practice, it is still problematic. This commit fixes the issue by ensuring the correct key is used to look up the waiting indices map as it is being built up, so the list of waiting shards for each index (those shards that are relocating or initializing) is aggregated for a given index instead of overwritten.	2017-04-24 10:59:08 -04:00
Martijn van Groningen	dabbf5d4f4	[TEST] Added unittests for InternalGeoCentroid Relates to #22278	2017-04-24 16:57:25 +02:00
Nilabh Sagar	373edee29a	Provide informative error message in case of unknown suggestion context. (#24241 ) Provide a list of available contexts when you send an unknown context to the completion suggester.	2017-04-24 10:35:14 -04:00
Jason Tedor	1500beafc7	Check for default.path.data included in path.data If the user explicitly configured path.data to include default.path.data, then we should not fail the node if we find indices in default.path.data. This commit addresses this. Relates #24285	2017-04-24 09:31:54 -04:00
Jason Tedor	a7947b404b	Fix hash code for AliasFilter This commit fixes the hash code for AliasFilter as the previous implementation was neglecting to take into consideration the fact that the aliases field is an array and thus a deep hash code of it should be computed rather than a shallow hash code on the reference. Relates #24286	2017-04-24 09:06:36 -04:00
Yannick Welsch	7c395070e2	[TEST] Wait for tribe node to be fully connected before shutting it down The tribe was being shut down by the test while a publishing round (that adds the tribe node to a cluster) was not yet completed (i.e. the node itself knows that it became part of the cluster, and the test shuts the tribe node down, but another node has not applied the cluster state yet), which makes that node hang while trying to connect to the node that is shutting down (due to connect_timeout being 30 seconds), delaying publishing for 30 seconds, and subsequently tripping an assertion when another tribe instance wants to join. Relates to #23695	2017-04-24 12:27:41 +02:00
Colin Goodheart-Smithe	6d6a230f70	Makes StoredScriptSource implement ToXContentObject	2017-04-24 10:20:15 +01:00
Colin Goodheart-Smithe	d4a6ba8ec9	No longer add illegal content type option to stored search templates (#24251 ) When parsing StoredSearchScript we were adding a Content type option that was forbidden (by a check that threw an exception) by the parser that's used to parse the template when we read it from the cluster state. This was stopping Elasticsearch from starting after stored search templates had been added. This change no longer adds the content type option to the StoredScriptSource object when parsing from the put search template request. This is safe because the StoredScriptSource content is always JSON when it's stored in the cluster state since we do a conversion to JSON before this point. Also removes the check for the content type in the options when parsing StoredScriptSource so users who already have stored scripts can start Elasticsearch. Closes #24227	2017-04-22 13:37:04 -04:00
Ryan Ernst	473e98981b	Scripts: Remove unnecessary executable shortcut (#24264 ) ScriptService has two executable methods, one which takes a CompiledScript, which is similar to search, and one that takes a raw Script and both compiles and returns an ExecutableScript for it. The latter is not needed, and the call sites which used one or the other were mixed. This commit removes the extra executable method in favor of callers first calling compile, then executable.	2017-04-21 17:53:03 -07:00
Ryan Ernst	aadc33d260	Scripts: Remove unwrap method from executable scripts (#24263 ) The unwrap method was left over from supporting javascript and python. Since those languages are removed in 6.0, this commit removes the unwrap feature from scripts.	2017-04-21 17:50:22 -07:00
Nik Everett	447f307ebb	Fix _bulk response when it can't create an index (#24048 ) Before #22488 when an index couldn't be created during a `_bulk` operation we'd do all the other actions and return the index creation error on each failing action. In #22488 we accidentally changed it so that we now reject the entire bulk request if a single action cannot create an index that it must create to run. This reverts to the old behavior while still keeping the nicer error messages. Instead of failing the entire request we now only fail the portions of the request that can't work because the index doesn't exist. Closes #24028	2017-04-21 18:56:04 -04:00
Jason Tedor	fe91c72151	Use a marker file when removing a plugin Today when removing a plugin, we attempt to move the plugin directory to a temporary directory and then delete that directory from the filesystem. We do this to avoid a plugin being in a half-removed state. We previously tried an atomic move, and fell back to a non-atomic move if that failed. Atomic moves can fail on union filesystems when the plugin directory is not in the top layer of the filesystem. Interestingly, the regular move can fail as well. This is because when the JDK is executing such a move, it first tries to rename the source directory to the target directory and if this fails with EXDEV (as in the case of an atomic move failing), it falls back to copying the source to the target, and then attempts to rmdir the source. The bug here is that the JDK never deleted the contents of the source so the rmdir will always fail (except in the case of an empty directory). Given all this silliness, we were inspired to find a different strategy. The strategy is simple. We will add a marker file to the plugin directory that indicates the plugin is in a state of removal. This file will be the last file out the door during removal. If this file exists during startup, we fail startup. Relates #24252	2017-04-21 15:50:44 -04:00
Simon Willnauer	2ca7072b24	Fill missing sequence IDs up to max sequence ID when recovering from store (#24238 ) Today we might promote a primary and recover from store where after translog recovery the local checkpoint is still behind the maximum sequence ID seen. To fill the holes in the sequence ID history this PR adds a utility method that fills up all missing sequence IDs up to the maximum seen sequence ID with no-ops. Relates to #10708	2017-04-21 20:28:00 +02:00
Ryan Ernst	ba48674695	Build: Move plugin cli and tests to distribution tool (#24220 ) The plugin cli currently resides inside the elasticsearch jar. This commit moves it into a plugin-cli jar. This change alone is a no-op; it does not change anything about what is loaded at runtime. But it will allow easier testing (with fixtures in the future to test ES or maven installation), as well as eventually not loading these classes when starting elasticsearch.	2017-04-21 09:25:58 -07:00
Boaz Leskes	badb2be066	Peer Recovery: remove maxUnsafeAutoIdTimestamp hand off (#24243 ) With #24149 , it is now stored in the Lucene commit and is implicitly transferred in the file phase of the recovery.	2017-04-21 17:31:50 +02:00
Ali Beyad	63e5aff5d6	Adds version 5.3.2 and backwards compatibility indices for 5.3.1	2017-04-21 10:48:41 -04:00
Tanguy Leroux	480bf0996d	Add utility method to parse named XContent objects with typed prefix (#24240 ) This commit adds a XContentParserUtils.parseTypedKeysObject() method that can be used to parse named XContent objects identified by a field name containing a type identifier, a delimiter and the name of the object to parse.	2017-04-21 15:41:27 +02:00
Tanguy Leroux	251b6d452b	MultiBucketsAggregation.Bucket should not extend Writeable (#24216 ) The MultiBucketsAggregation.Bucket interface extends Writeable, forcing all implementation classes to implement writeTo(). This commit removes the Writeable from the interface and moves it down to the InternalBucket implementation.	2017-04-21 15:29:53 +02:00
Yannick Welsch	c2deb1c81d	Don't expose cleaned-up tasks as pending in PrioritizedEsThreadPoolExecutor (#24237 ) Changes in #24102 exposed the following oddity: PrioritizedEsThreadPoolExecutor.getPending() can return Pending entries where pending.task == null. This can happen for example when tasks are added to the pending list while they are in the clean up phase, i.e. TieBreakingPrioritizedRunnable#runAndClean has run already, but afterExecute has not removed the task yet. Instead of safeguarding consumers of the API (as was done before #24102) this changes the executor to not count these tasks as pending at all.	2017-04-21 15:25:19 +02:00
Colin Goodheart-Smithe	3c7c4bc824	Adds declareNamedObjects methods to ConstructingObjectParser (#24219 ) * Adds declareNamedObjects methods to ConstructingObjectParser * Addresses review comments	2017-04-21 09:50:30 +01:00
Christoph Büscher	c8ad26edc9	Tests: Extend InternalStatsTests (#24212 ) Currently we don't test for count = 0 which will make a difference when adding tests for parsing for the high level rest client. Also min/max/sum should also be tested with negative values and on a larger range.	2017-04-21 10:38:09 +02:00
Adrien Grand	81b64ed587	IndicesQueryCache should delegate the scorerSupplier method. (#24209 ) Otherwise the range improvements that we did on range queries would not work. This is similar to https://issues.apache.org/jira/browse/LUCENE-7749.	2017-04-21 10:33:02 +02:00
Adrien Grand	f322f537e4	Speed up parsing of large `terms` queries. (#24210 ) The addition of the normalization feature on keywords slowed down the parsing of large `terms` queries since all terms now have to go through normalization. However this can be avoided in the default case that the analyzer is a `keyword` analyzer since all that normalization will do is a UTF8 conversion. Using `Analyzer.normalize` for that is a bit overkill and could be skipped.	2017-04-21 10:32:33 +02:00
Jim Ferenczi	a4365971a0	[TEST] make sure that the random query_string query generator defines a default_field or a list of fields	2017-04-21 02:56:26 +02:00
Fabien Baligand	4a45579506	token_count type : add an option to count tokens (fix #23227 ) (#24175 ) Add option "enable_position_increments" with default value true. If option is set to false, indexed value is the number of tokens (not position increments count)	2017-04-21 00:53:28 +02:00
Jim Ferenczi	525101b64d	Query string default field (#24214 ) Currently any `query_string` query that uses a wildcard field with no matching field is rewritten with the `_all` field. For instance: ```` #creating test doc PUT testing/t/1 { "test": { "field_one": "hello", "field_two": "world" } } #searching abc.* (does not exist) -> hit GET testing/t/_search { "query": { "query_string": { "fields": [ "abc.*" ], "query": "hello" } } } ```` This bug first appeared in 5.0 after the query refactoring and impacts only users that use `_all` as default field. Indices created in 6.x will not have this problem since `_all` is deactivated in this version. This change fixes this bug by returning a MatchNoDocsQuery for any term that expands to an empty list of fields.	2017-04-20 22:12:20 +02:00
Luca Cavanna	82c678b5c7	Make Aggregations an abstract class rather than an interface (#24184 ) Some of the base methods that don't have to do with reduce phase and serialization can be moved to the base class which is no longer an interface. This will be reusable by the high level REST client further on the road. It also simplifies things, as having an interface with a single implementor is not that helpful.	2017-04-20 21:31:34 +02:00
Areek Zillur	077a6c3ee7	[TEST] ensure expected sequence no and version are set when index/delete engine operation has a document failure	2017-04-20 13:38:52 -04:00
Yannick Welsch	22e0795990	Extract batch executor out of cluster service (#24102 ) Refactoring that extracts the task batching functionality from ClusterService and makes it a reusable component that can be tested in isolation.	2017-04-20 17:28:43 +02:00
Tanguy Leroux	55a879ee8d	Align behavior of HDR percentiles iterator with percentile() method (#24206 )	2017-04-20 12:37:33 +02:00
Nik Everett	caf376c8af	Start building analysis-common module (#23614 ) Start moving built in analysis components into the new analysis-common module. The goal of this project is: 1. Remove core's dependency on lucene-analyzers-common.jar which should shrink the dependencies for transport client and high level rest client. 2. Prove that analysis plugins can do all the "built in" things by moving all "built in" behavior to a plugin. 3. Force tests not to depend on any oddball analyzer behavior. If tests need anything more than the standard analyzer they can use the mock analyzer provided by Lucene's test infrastructure.	2017-04-19 18:51:34 -04:00
Jason Tedor	4796557a30	Add primary term to doc write response This commit adds the primary term to the doc write response. Relates #24171	2017-04-19 14:44:22 -04:00
Ryan Ernst	c7e9231a86	Plugins: Remove leniency for missing plugins dir (#24173 ) This leniency was left in after plugin installer refactoring for 2.0 because some tests still relied on it. However, the need for this leniency no longer exists.	2017-04-19 09:09:34 -07:00
Christoph Büscher	a9657a5a09	Add BucketMetricValue interface (#24188 ) Unlike other implementations of InternalNumericMetricsAggregation.SingleValue, the InternalBucketMetricValue aggregation currently doesn't implement a specialized interface that exposes the `keys()` method. This change adds this so that clients can access the keys via the interface.	2017-04-19 16:27:33 +02:00
Jim Ferenczi	f05af0a382	Enable index-time sorting (#24055 ) This change adds an index setting to define how the documents should be sorted inside each Segment. It allows any numeric, date, boolean or keyword field inside a mapping to be used to sort the index on disk. It is not allowed to use `nested` fields inside an index that defines an index sorting since `nested` fields rely on the original sort of the index. This change does not add early termination capabilities in the search layer. This will be added in a follow up. Relates #6720	2017-04-19 14:36:11 +02:00
Boaz Leskes	8758c541b3	ElectMasterService.hasEnoughMasterNodes should return false if no masters were found This is a regression introduced in #20063	2017-04-19 09:52:06 +02:00
Tanguy Leroux	741c031384	[Test] Add unit tests for InternalHDRPercentilesTests (#24157 ) Related to #22278	2017-04-19 09:37:01 +02:00
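
Commit 3c845727f8 replaces single-character alternations with character classes. The log does not say which expressions were changed, so the patterns below are hypothetical stand-ins; this is only a minimal sketch showing that the two forms accept the same inputs.

```java
import java.util.regex.Pattern;

public class CharClassExample {
    // Alternation between single characters (the a|b form). Hypothetical pattern,
    // not one of the expressions touched by the commit.
    static final Pattern ALTERNATION = Pattern.compile("(?:a|b)+");

    // The same language written as a character class (the [ab] form).
    static final Pattern CHAR_CLASS = Pattern.compile("[ab]+");

    // Returns true when both patterns agree on whether the whole input matches.
    static boolean sameResult(String input) {
        return ALTERNATION.matcher(input).matches() == CHAR_CLASS.matcher(input).matches();
    }

    public static void main(String[] args) {
        for (String s : new String[] {"abba", "abc", "", "bbbb"}) {
            System.out.println("'" + s + "' matches=" + CHAR_CLASS.matcher(s).matches()
                    + " agree=" + sameResult(s));
        }
    }
}
```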
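
Commit a7947b404b describes computing a deep hash code for an array field instead of a shallow hash of the reference. The sketch below illustrates that distinction with a hypothetical `Filter` class; it is not the real AliasFilter, whose other fields and exact implementation are not shown in this log.

```java
import java.util.Arrays;

public class ArrayHashExample {
    // Hypothetical class with an array field, standing in for AliasFilter.
    static final class Filter {
        private final String[] aliases;

        Filter(String... aliases) {
            this.aliases = aliases;
        }

        @Override
        public boolean equals(Object o) {
            // Compare array contents, not references.
            return o instanceof Filter && Arrays.equals(aliases, ((Filter) o).aliases);
        }

        @Override
        public int hashCode() {
            // Deep hash over the elements. Calling aliases.hashCode() instead would
            // hash the array reference, so equal filters could get different hashes.
            return Arrays.hashCode(aliases);
        }
    }

    public static void main(String[] args) {
        Filter first = new Filter("a", "b");
        Filter second = new Filter("a", "b");
        System.out.println("equal: " + first.equals(second));
        System.out.println("same hash: " + (first.hashCode() == second.hashCode()));
    }
}
```

With the deep hash, two filters that are equal by content also agree on their hash code, which is what hash-based collections such as `HashMap` require.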
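
Commit c5b6f52ecc fixes a map of waiting shards in which entries for an index were overwritten rather than aggregated. The sketch below shows the general aggregate-per-key pattern only; `ShardId` and `groupByIndex` are hypothetical names, and the actual SnapshotsService code is not reproduced here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class WaitingShardsExample {
    // Hypothetical shard identifier: index name plus shard number.
    static final class ShardId {
        final String index;
        final int shard;

        ShardId(String index, int shard) {
            this.index = index;
            this.shard = shard;
        }
    }

    // Groups waiting shards under their index name. Using put(index, singletonList)
    // here would overwrite earlier shards of the same index; computeIfAbsent keeps
    // one list per index and appends to it.
    static Map<String, List<Integer>> groupByIndex(List<ShardId> waiting) {
        Map<String, List<Integer>> byIndex = new HashMap<>();
        for (ShardId id : waiting) {
            byIndex.computeIfAbsent(id.index, k -> new ArrayList<>()).add(id.shard);
        }
        return byIndex;
    }

    public static void main(String[] args) {
        List<ShardId> waiting = Arrays.asList(
                new ShardId("idx", 0), new ShardId("idx", 1), new ShardId("other", 0));
        System.out.println(groupByIndex(waiting));
    }
}
```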


7905 Commits