OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-28 07:59:10 +00:00

Author	SHA1	Message	Date
Christoph Büscher	d18c3d651d	Introduce an `include_type_name` constant (#37155 ) I started referring to this parameter name from various places in #37149 so I think it's a good idea to simplify things by referring to a common constant.	2019-01-07 10:41:40 +01:00
Christoph Büscher	12a105e5ef	Remove deprecated PutIndexTemplateRequestBuilder#setTemplate (#37151 ) The method has been removed since 6.0, there is a direct replacement and it is only used in tests still.	2019-01-07 10:41:04 +01:00
Luca Cavanna	2f4dafa69f	Add support for providing absolute start time to SearchRequest (#37142 ) We have recently added support for providing a local cluster alias to a SearchRequest through a package protected constructor. When executing cross-cluster search requests with local reduction on each cluster, the CCS coordinating node will have to provide such cluster alias to each remote cluster, as well as the absolute start time of the search action in milliseconds from the time epoch, to be used when evaluating date math expressions both while executing queries / scripts as well as when resolving index names. This commit adds support for providing the start time together with the cluster alias. It is a final member in the search request, which will only be set when using cross-cluster search with local reduction (also known as alternate execution mode). When not provided, the coordinating node will determine the current time and pass it through (by calling `System.currentTimeMillis`). Relates to #32125	2019-01-07 10:28:31 +01:00
Tanguy Leroux	6347461146	Rename ClusterBlocks.hasGlobalBlock methods (#36941 ) As suggested in #36775, this pull request renames the following methods: ClusterBlocks.hasGlobalBlock(int) ClusterBlocks.hasGlobalBlock(RestStatus) ClusterBlocks.hasGlobalBlock(ClusterBlockLevel) to something that better reflects the property of the ClusterBlock that is searched for: ClusterBlocks.hasGlobalBlockWithId(int) ClusterBlocks.hasGlobalBlockWithStatus(RestStatus) ClusterBlocks.hasGlobalBlockWithLevel(ClusterBlockLevel)	2019-01-07 09:42:27 +01:00
Jason Tedor	bf5bc88f50	Fix handling of fractional time value settings (#37171 ) This commit addresses an issue when setting a time value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a time value setting to a value of 1500ms is problematic because internally this is converted to the string "1.5s". When we go to get this setting, we try to parse "1.5s" back to a time value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the time value to the string and back to the time value.	2019-01-06 22:34:52 -08:00
Armin Braun	b34e7d4f19	SNAPSHOT+TESTS: Relax Assertion in DisruptionIT (#37144 ) * The retries on the failing master can lead to concurrently trying to create and delete a snapshot, catch this for now to fix this test * closes #36779	2019-01-05 17:52:24 +01:00
Simon Willnauer	0cc877026f	Subclass NIOFSDirectory instead of using FileSwitchDirectory (#37140 ) We don't want two FSDirectories manage pending deletes separately and optimize file listing. This confuses IndexWriter and causes exceptions when files are deleted twice but are pending for deletion. This change move to using a NIOFS subclass that only delegates to MMAP for opening files all metadata and pending deletes are managed on top. Closes #37111 Relates to #36668	2019-01-05 10:15:33 +01:00
Julie Tibshirani	0bac64fbd3	Deprecate the _type field in aggregations. (#37131 )	2019-01-04 13:05:52 -08:00
Michael Basnight	e40193ae66	HLRC: Fix Reindex from remote query logic (#36908 ) The query object was incorrectly added to the remote object in the xcontent. This fix moves the query back into the source, if it was passed in as part of the RemoteInfo. It also adds a IPv6 test for reindex from remote such that we can properly validate this.	2019-01-04 13:37:59 -06:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Simon Willnauer	b4f113d3ea	Don't block on peer recovery on the target side (#37076 ) Today we block using the generic thread-pool on the target side until the source side has fully executed the recovery. We still block on the source side executing the recovery in a blocking fashion but there is no reason to block on the target side. This will release generic threads early if there are many concurrent recoveries happen. Relates to #36195	2019-01-04 13:51:06 +01:00
Simon Willnauer	41d7e3a2fe	Expose `search.throttled` on `_cat/indices` (#37073 ) Today it's very difficult to see which indices are frozen or rather throttled via the commonly used monitoring APIs. This change adds a cell to the `_cat/indices` API to render if an index is `search.throttled` Relates to #34352	2019-01-04 13:49:40 +01:00
Luca Cavanna	21d52f0dab	Ensure that local cluster alias is never treated as remote (#37121 ) With #36997 we added support for providing a local cluster alias with a `SearchRequest`. We intended to make sure that when provided as part of a search request, the cluster alias would never be used for connection lookups. Yet due to a bug we would still end up looking up the connection from the remote ones. This commit adds a test to make sure that whenever we set the cluster alias to the `SearchRequest` (which can only be done at transport), such alias is used as index prefix in the returned hits. No errors are thrown despite no remote clusters are configured indicating that such alias is never used for connection look-ups. Also, we add explicit support for the empty cluster alias when printing out index names through `RemoteClusterAware#buildRemoteIndexName`. In fact we don't want to print out `:index` when the cluster alias is set to empty string, but rather `index`. Yet, the semantic of empty string is different compared to `null` as it will still disable final reduction. This will be used in CCS when searching against remote clusters as well as the local one, the local one will have empty prefix yet it will need to disable final reduction so that its results will be properly merged with the ones coming from the remote clusters.	2019-01-04 12:19:31 +01:00
David Turner	3f7d6a989a	[Zen2] Elect freshest master in upgrade (#37122 ) Today when electing a master in Zen2 we use the cluster state version to determine whether a node has a fresh-enough cluster state to become master. However the cluster state version is not a reliable measure of freshness in the Zen1 world; furthermore in 6.x the cluster state version is not persisted. This means that when upgrading from 6.x via a full cluster restart a cluster state update may be lost if a stale master wins the initial election. This change fixes this by using the metadata version as a measure of freshness when in term 0, since this is persisted in 6.x and does more reliably indicate the freshness of nodes. It also makes changes parallel to elastic/elasticsearch-formal-models#40 to support situations in which nodes accept cluster state versions in term 0: this does not happen in a pure Zen2 cluster, but can happen in mixed clusters and during upgrades.	2019-01-04 09:09:16 +00:00
Julie Tibshirani	ac1c6940d2	Stop automatically nesting mappings in index creation requests. (#36924 ) Now that we unwrap mappings in DocumentMapperParser#extractMappings, it is not necessary for the mapping definition to always be nested under the type. This leniency around the mapping format was added in 2341825358e740b0bea4c16d164c5acdf12fc6b3.	2019-01-03 17:41:28 -08:00
Armin Braun	7686ee7631	TESTS: Shutdown ThreadPool after TestNodes (#37123 ) * If the threadpool gets shut down before the testnodes we run into an error => fixed by moving to single `After` method * Relates #36976	2019-01-03 22:35:44 +01:00
Julie Tibshirani	54f53d2a51	Make sure to accept empty unnested mappings in create index requests. (#37089 )	2019-01-03 11:53:08 -08:00
Nick Knize	e613bcae43	Remove XLatLonShape classes (#37094 ) This commit removes local XLatLonShape classes and replaces with current LatLonShape classes in latest lucene snapshot	2019-01-03 12:48:36 -06:00
Nicholas Knize	de962b2f39	Revert "Adjust Lucene version for 6.7" This reverts commit b7f6ee72a68324b8fc6b523b1eafb991544d2278.	2019-01-03 11:52:31 -06:00
Armin Braun	675ea4c59c	TESTS: Remove Static Threadpool in TaskManagerTest (#36976 ) * The static threadpool leaks a lot of memory in these tests because it prevents things like the connect listeners from `org.elasticsearch.transport.TcpTransport#initiateConnection` to be GCed between tests (since they keep being referenced by the threadpool) which in turn reference channels and their underlying buffers * I could not find any slowdown in executing these tests from this change, if anything they are slightly faster now on my machine * Relates #36906 (which may be caused by slowness from leaking memory and also becomes testable in a loop by this change)	2019-01-03 15:19:21 +01:00
Christoph Büscher	046f86f274	Deprecate use of type in reindex request body (#36823 ) Types can be used both in the source and dest section of the body which will be translated to search and index requests respectively. Adding a deprecation warning for those cases and removing examples using more than one type in reindex since support for this is going to be removed.	2019-01-03 10:29:14 +01:00
Christoph Büscher	e21054d176	Remove two unused methods in Iterables (#37075 ) These helper methods are unused in the rest of the codebase.	2019-01-03 10:28:47 +01:00
Nhat Nguyen	b7f6ee72a6	Adjust Lucene version for 6.7 Relates #37088	2019-01-03 04:20:47 -05:00
Jim Ferenczi	78ba1889cf	Replace the TreeMap in the composite aggregation (#36675 ) The `composite` aggregation uses a TreeMap to keep track of the best buckets. This ensures a log(n) time cost to insert new buckets but also to retrieve buckets that are already present in the map. In order to speed up the retrieval of buckets this change replaces the TreeMap with a priority queue and a HashMap. The insertion cost is still log(n) but the retrieval of buckets through the HashMap is now done in constant time. This optimization can bring significant improvement since each document needs to check if its associated buckets are already present in the current best buckets.	2019-01-03 09:51:35 +01:00
Daniel Mitterdorfer	75f3443c62	Rename setting to enable mmap With this commit we rename `node.store.allow_mmapfs` to `node.store.allow_mmap`. Previously this setting has controlled whether `mmapfs` could be used as a store type. With the introduction of `hybridfs` which also relies on memory-mapping, `node.store.allow_mmapfs` also applies to `hybridfs` and thus we rename it in order to convey that it is actually used to allow memory-mapping but not a specific store type. Relates #36668 Relates #37070	2019-01-03 07:10:34 +01:00
Nick Knize	b2aa655f46	Upgrade master to lucene-8.0.0-snapshot-a1c6e642aa (#37091 ) Updates the master branch to the latest snapshot of Lucene 8.0.	2019-01-02 20:18:19 -06:00
David Findley	d4e7660248	Fix weighted_avg parser not found for RestHighLevelClient (#37027 ) Add integration test for weighted avg sub aggregation Add weighted avg parser to DefaultNamedXContents Fixes #36861	2019-01-02 15:53:21 -06:00
Ke Li	62ece69b92	Remove system property es.enforce_max_shards_per_node (#36968 ) The system property es.enforce_max_shards_per_node was not needed any more, because we always enforce cluster-wide shard limit now.	2019-01-02 14:01:12 -07:00
Alan Woodward	7a0047744d	`query_string` should use indexed prefixes (#36895 ) The QueryStringQueryBuilder does not currently delegate to the field mapper's prefixQuery method, so does not use indexed prefixes. This commit corrects this. It also fixes a bug where a query a* would not match the word a if indexed prefixes were used with a minchar setting of 2.	2019-01-02 20:12:24 +00:00
Luca Cavanna	42ea644903	Remove single shard optimization when suggesting shard_size (#37041 ) When executing terms aggregations we set the shard_size, meaning the number of buckets to collect on each shard, to a value that's higher than the number of requested buckets, to guarantee some basic level of precision. We have an optimization in place so that we leave shard_size set to size whenever we are searching against a single shard, in which case maximum precision is guaranteed by definition. Such optimization requires us access to the total number of shards that the search is executing against. In the context of cross-cluster search, once we will introduce multiple reduction steps (one per cluster) each cluster will only know the number of local shards, which is problematic as we should only optimize if we are searching against a single shard in a single cluster. It could be that we are searching against one shard per cluster in which case the current code would optimize number of terms causing a loss of precision. While discussing how to address the CCS scenario, we decided that we do not want to introduce further complexity caused by this single shard optimization, as it benefits only a minority of cases, especially when the benefits are not so great. This commit removes the single shard optimization, meaning that we will always have heuristic enabled on how many number of buckets to collect on the shards, even when searching against a single shard. This will cause more buckets to be collected when searching against a single shard compared to before. If that becomes a problem for some users, they can work around that by setting the shard_size equal to the size. Relates to #32125	2019-01-02 17:45:49 +01:00
Josh Soref	f6e4b9a014	Spelling: correct wrong spellings of likelihood (#37052 )	2019-01-02 17:37:03 +01:00
Josh Soref	5d5d5c26bc	Spelling: replace interruptable with interruptible (#37049 )	2019-01-02 17:35:53 +01:00
Josh Soref	1df66d21fe	Spelling: replace uknown with unknown (#37056 )	2019-01-02 17:33:02 +01:00
Przemyslaw Gomulka	15c4d5b184	Fix line length in org.elasticsearch.get (#37071 ) Remove the line suppression for this package and fix offedning lines relates #34844	2019-01-02 17:22:15 +01:00
Luca Cavanna	450d3014f6	[TEST] Fix testLimitConcurrentShardRequests failure With #36221 we introduced shards counting to address a rare failure. This caused a worse problem in this test when replicas were allocated and shards failures were randomly returned. The latch has to take into account additional attempts caused by the shard failures, which means that in order for run to be called, performPhaseOnShard will be called (numShards + numFailures) times. To address this, we need to decide upfront which shard is going to fail, making sure that at least one shards is successful otherwise the whole request fails. Closes #37074	2019-01-02 16:46:41 +01:00
Luca Cavanna	b10a33054e	Mute failing testLimitConcurrentShardRequests relates to #37074	2019-01-02 16:35:24 +01:00
Josh Soref	565a95c382	Spelling: replace respositories with repositories (#37053 )	2019-01-02 14:13:54 +01:00
Josh Soref	d3e98278c3	Spelling: replace cachable with cacheable (#37047 )	2019-01-02 14:10:30 +01:00
Daniel Mitterdorfer	f0052b1a7a	Add hybridfs store type With this commit we introduce a new store type `hybridfs` that is a hybrid between `mmapfs` and `niofs`. This store type chooses different strategies to read Lucene files based on the read access pattern (random or linear) in order to optimize performance. This store type has been available in earlier versions of Elasticsearch as `default_fs`. We have chosen a different name now in order to convey the intent of the store type instead of tying it to the fact whether it is the default choice. Relates #36668	2019-01-02 10:10:32 +01:00
Luca Cavanna	9e70696628	[TEST] Address rejected execution in SearchAsyncActionTests (#37028 ) SearchAsyncActionTests may fail with RejectedExecutionException as InitialSearchPhase may try to execute a runnable after the test has successfully completed, and the corresponding executor was already shut down. The latch was located in getNextPhase that is almost correct, but does not cover for the last finishAndRunNext round that gets executed after onShardResult is invoked. This commit moves the latch to count the number of shards, and allowing the test to count down later, after finishAndRunNext has been potentially forked. This way nothing else will be executed once the executor is shut down at the end of the tests. Closes #36221 Closes #33699	2019-01-02 09:47:13 +01:00
Alexis Wilke	35c09adbe1	Replaced the word 'shards' with 'replicas' in an error message. (#36234 ) (#36275 ) Closes #36234	2019-01-01 16:37:15 +01:00
Luca Cavanna	fd7cde88db	Mute failing RolloverIT#testRolloverWithDateMath Relates to #37037	2018-12-31 11:51:50 +01:00
Armin Braun	85be9d6a89	SNAPSHOT: Deterministic ClusterState Tests (#36644 ) * Use `DeterministicTaskQueue` infrastructure to test `SnapshotsService`	2018-12-31 11:17:21 +01:00
Luca Cavanna	adb957b5aa	Mute failing DateMathExpressionResolverTests tests Relates to #37037	2018-12-31 10:37:10 +01:00
Luca Cavanna	d3f1fe46d3	Increase await timeouts in RemoteClusterServiceTests Closes #33852	2018-12-28 17:03:40 +01:00
Armin Braun	4ac8fc6906	Force Refresh Listeners when Acquiring all Operation Permits (#36835 ) * Fixes the issue reproduced in the added tests: * When having open index requests on a shard that are waiting for a refresh, relocating that shard becomes blocked until that refresh happens (which could be never as in the test scenario).	2018-12-28 16:42:51 +01:00
Luca Cavanna	cb6bac3f88	Skip final reduction if SearchRequest holds a cluster alias (#37000 ) With #36997 we added the ability to provide a cluster alias with a SearchRequest. The next step is to disable the final reduction whenever a cluster alias is provided with the SearchRequest. A cluster alias will be provided when executing a cross-cluster search request with alternate execution mode, where each cluster does its own reduction locally. In order for the CCS node to be able to later perform an additional reduction of the results, we need to make sure that all the needed info stays available. This means that terms aggregations can be reduced but not pruned, and pipeline aggs should not be executed. The final reduction will happen later in the CCS coordinating node. Relates to #36997 & #32125	2018-12-28 14:58:20 +01:00
Armin Braun	34d22f378d	TESTS: Mute testSnapshotCanceledOnRemovedShard * relates #37005	2018-12-28 14:20:45 +01:00
Luca Cavanna	51fe20e0c3	Add support for local cluster alias to SearchRequest (#36997 ) With the upcoming cross-cluster search alternate execution mode, the CCS node will be able to split a CCS request into multiple search requests, one per remote cluster involved. In order to do that, the CCS node has to be able to signal to each remote cluster that such sub-requests are part of a CCS request. Each cluster does not know about the other clusters involved, and does not know either what alias it is given in the CCS node, hence the CCS coordinating node needs to be able to provide the alias as part of the search request so that it is used as index prefix in the returned search hits. The cluster alias is a notion that's already supported in the search shards iterator and search shard target, but it is currently used in CCS as both index prefix and connection lookup key when fanning out to all the shards. With CCS alternate execution mode the provided cluster alias needs to be used only as index prefix, as shards are local to each cluster hence no cluster alias should be used for connection lookups. The local cluster alias can be set to the SearchRequest at the transport layer only, and its constructor/getter methods are package private. Relates to #32125	2018-12-28 12:43:25 +01:00
Yannick Welsch	935c2e98b0	Zen2: Turn to follower on follower check when no state accepted yet from new leader (#37003 ) Improves on #36449 which did not cover the situation where a node had bumped its term during the election, and not when receiving the first follower check. This was uncovered while refactoring NodeJoinTests so that they don't need to access to an internal field of Coordinator anymore (which can now be made private).	2018-12-28 08:37:04 +01:00

1 2 3 4 5 ...

2131 Commits