OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-09 22:45:04 +00:00

Author	SHA1	Message	Date
Luca Cavanna	4b85769d24	Increase InternalHistogramTests coverage (#36004 ) In `InternalHistogramTests` we were randomizing different values but `minDocCount` was hardcoded to `1`. It's important to test other values, especially `0` as it's the default. To make this possible, the test needed some adapting in the way buckets are randomly generated: all aggs need to share the same `interval`, `minDocCount` and `emptyBucketInfo`. Also assertions need to take into account that more (or less) buckets are expected depending on `minDocCount`. This was originated by #35921 and its need to test adding empty buckets as part of the reduce phase. Also relates to #26856 as one more key comparison needed to use `Double.compare` to properly handle `NaN` values, which was triggered by the increased test coverage.	2018-11-28 20:06:40 +01:00
Nik Everett	0588dad80b	Tasks: Only require task permissions (#35667 ) Right now using the `GET /_tasks/<taskid>` API and causing a task to opt in to saving its result after being completed requires permissions on the `.tasks` index. When we built this we thought that that was fine, but we've since moved towards not leaking details like "persisting task results after the task is completed is done by saving them into an index named `.tasks`." A more modern way of doing this would be to save the tasks into the index "under the hood" and to have APIs to manage the saved tasks. This is the first step down that road: it drops the requirement to have permissions to interact with the `.tasks` index when fetching task statuses and when persisting statuses beyond the lifetime of the task. In particular, this moves the concept of the "origin" of an action into a more prominent place in the Elasticsearch server. The origin of an action is ignored by the server, but the security plugin uses the origin to make requests on behalf of a user in such a way that the user need not have permissions to perform these actions. It can be made to be fairly precise. More specifically, we can create an internal user just for the tasks API that just has permission to interact with the `.tasks` index. This change doesn't do that, instead, it uses the ubiquitus "xpack" user which has most permissions because it is simpler. Adding the tasks user is something I'd like to get to in a follow up change. Instead, the majority of this change is about moving the "origin" concept from the security portion of x-pack into the server. This should allow any code to use the origin. To keep the change managable I've also opted to deprecate rather than remove the "origin" helpers in the security code. Removing them is almost entirely mechanical and I'd like to that in a follow up as well. Relates to #35573	2018-11-28 09:28:27 -05:00
Christoph Büscher	51a7dc54ec	Fix custom AUTO issue with Fuzziness#toXContent (#35807 ) Currently when a Fuzziness instance with custom AUTO distance values gets written to XContent, the customized lower and upper distance values are ommited and can consequently not be parsed back. This changes this to write the String including the optional custom values when writing to XContent and fixes the tests that should have caught this in the first place, e.g. by adding the custom low and high distance values to the equality check.	2018-11-28 15:07:11 +01:00
Andrey Ershov	0b45fb98b9	[Zen2] Generate coordinationMetaData with different configs (#35991 ) This PR fixes test failure, which is caused by equal randomly generated lastAcceptedConfiguration and lastCommittedConfguration.	2018-11-28 14:49:07 +01:00
Yannick Welsch	5f0c036183	Disable testDeleteCreateInOneBulk on Zen2 This test needs adaptation to run with Zen2	2018-11-28 13:34:48 +01:00
Christoph Büscher	2f547bac65	Remove deprecated methods from QueryStringQueryBuilder (#35912 ) This change removes the deprecated useDisMax() and useAllFields() methods from the QueryStringQueryBuilder and related tests. The disMax parameter has already been a no-op since 6.0 and also the useAllFields has been deprecated since 6.0 and there is a direct replacement via defaultField.	2018-11-28 11:09:03 +01:00
Jeff Hajewski	49087f16f5	Adds deprecation logging to ScriptDocValues#getValues. (#34279 ) `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. Closes #22919	2018-11-27 14:30:13 -05:00
Julie Tibshirani	25c416b12d	Deprecate types in search and multi search templates. (#35669 ) This PR adds deprecation warnings to the relevant `RestAction` classes, plus tests in `RestActionTests`. No updates to REST tests, the Java HLRC, or documentation were necessary, since they didn't make use of types.	2018-11-27 10:19:19 -08:00
Simon Willnauer	ad1f0dccd4	Validate metdata on `_msearch` (#35938 ) MultiSearchRequests issues through `_msearch` now validate all keys in the metadata section. Previously unknown keys were ignored while now an exception is thrown. Closes #35869	2018-11-27 17:08:24 +01:00
Tim Brooks	cc1fa799c8	Remove `TcpChannel#setSoLinger` method (#35924 ) This commit removes the dedicated `setSoLinger` method. This simplifies the `TcpChannel` interface. This method has very little effect as the SO_LINGER is not set prior to the channels being closed in the abstract transport test case. We still will set SO_LINGER on the `MockNioTransport`. However we can do this manually.	2018-11-27 09:08:14 -07:00
Christophe Bismuth	adc0b560c0	Raise a 404 exception when document source is not found (#33384 ) (#34083 ) This pull request makes the `RestGetSourceAction` return a `ResourceNotFoundException` with a proper JSON response when source or document itself is missing (see issue #33384). Here is below a sample JSON output: ``` { "error": { "root_cause": [ { "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" } ], "type": "resource_not_found_exception", "reason": "Source not found [index1]/[_doc]/[1]" }, "status": 404 } ```	2018-11-27 10:35:45 -05:00
Andrey Ershov	0e283f9670	[Zen2] PersistedState interface implementation (#35819 ) Today GatewayMetaState is capable of atomically storing MetaData to disk. We've also moved fields that are needed to be persisted in Zen2 from ClusterState to ClusterState.MetaData.CoordinationMetaData. This commit implements PersistedState interface. version and currentTerm are persisted as a part of Manifest. GatewayMetaState now implements both ClusterStateApplier and PersistedState interfaces. We started with two descendants Zen1GatewayMetaState and Zen2GatewayMetaState, but it turned out to be not easy to glue it. GatewayMetaState now constructs previousClusterState (including MetaData) and previousManifest inside the constructor so that all PersistedState methods are usable as soon as GatewayMetaState instance is constructed. Also, loadMetaData is renamed to getMetaData, because it just returns previousClusterState.metaData(). Sadly, we don't have access to localNode (obtained from TransportService in the constructor, so getLastAcceptedState should be called, after setLocalNode method is invoked. Currently, when deciding whether to write IndexMetaData to disk, we're comparing current IndexMetaData version and received IndexMetaData version. This is not safe in Zen2 if the term has changed. So updateClusterState now accepts incremental write method parameter. When it's set to false, we always write IndexMetaData to disk. Things that are not covered by GatewayMetaStateTests are covered by GatewayMetaStatePersistedStateTests. This commit also adds an option to use GatewayMetaState instead of InMemoryPersistedState in TestZenDiscovery. However, by default InMemoryPersistedState is used and only one test in PersistedStateIT used GatewayMetaState. In order to use it for other tests, proper state recovery should be implemented.	2018-11-27 15:04:52 +01:00
Martijn van Groningen	447e5d212a	Changed versions in serialization code after backporting #35535	2018-11-27 08:00:06 +01:00
Gordon Brown	119835decd	Always enforce cluster-wide shard limit (#34892 ) This removes the option to run a cluster without enforcing the cluster-wide shard limit, making strict enforcement the default and only behavior. The limit can still be adjusted as desired using the cluster settings API.	2018-11-26 17:05:12 -07:00
Igor Motov	663563f64b	Geo: better handling of malformed geo_points (#35554 ) Improves handling of malformed geo_points when `ignore_malformed` is set to true Closes #35419	2018-11-26 09:44:42 -10:00
Jim Ferenczi	900caa20ef	Handles exists query in composite aggs (#35758 ) This commit adds the support for exists query in the sorted execution mode of the `composite` aggregation. We'll execute the aggregation from the sorted points and use early termination if the main query is an `exists` query over the first source of the `composite` aggregation.	2018-11-26 19:08:14 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Simon Willnauer	ca9b2b9931	Repsect indices options on _msearch (#35887 ) Today we don't respect the indices options when they are passed as request parameters to the `_msearch` endpoint. This is unintuitive and doesn't cause any errors. This changes uses the top-level indices options as the defaults for each sub search-request. Closes #35851	2018-11-26 14:26:39 +01:00
Christophe Bismuth	04ebc63e34	RoutingMissingException in more like this (#33974 ) More like this query allows to provide identifiers of documents to be retrieved as like/unlike items. It can happen that at retrieval time an error is thrown, for instance caused by missing routing value when `_routing` is set required in the mapping. Instead of ignoring such error and returning no documents for the query, the error should be re-thrown and returned to users. As part of this change also mget and mtermvectors are unified in the way they throw such exception like it happens in other places, so that a `RoutingMissingException` is raised. Closes #29678	2018-11-26 13:57:57 +01:00
Tanguy Leroux	9bdbba23f8	[Tests] Fix IndexShardTests.testAcquirePrimaryAllOperationsPermits() This test fails on CI because of an inappropriate assertion, which is I think a leftover and has no real value.	2018-11-26 13:44:12 +01:00
Luca Cavanna	e44390ac20	InitialSearchPhase minor cleanups (#35864 ) This commit simplifies the throttling logic in InitialSearchPhase and removes some asserts from it. Also, a few formatting changes are applied to its code and surrounding classes.	2018-11-26 13:42:41 +01:00
David Turner	a68a46450b	[Zen2] Add lag detector (#35685 ) A publication can succeed and complete before all nodes have applied the published state and acknowledged it, thanks to the publication timeout; however we need every node eventually either to apply the published state (or a later state) or be removed from the cluster. This change introduces the LagDetector which achieves this liveness property by removing any lagging nodes from the cluster.	2018-11-26 10:52:49 +00:00
iverase	401b814d1a	[CI] Muting method testOperationPermitOnReplicaShards in IndexShardTests Relates to #35850	2018-11-26 09:33:23 +01:00
Martijn van Groningen	7624734f14	Added wait_for_metadata_version parameter to cluster state api. (#35535 ) The `wait_for_metadata_version` parameter will instruct the cluster state api to only return a cluster state until the metadata's version is equal or greater than the version specified in `wait_for_metadata_version`. If the specified `wait_for_timeout` has expired then a timed out response is returned. (a response with no cluster state and wait for timed out flag set to true) In the case metadata's version is equal or higher than `wait_for_metadata_version` then the api will immediately return. This feature is useful to avoid external components from constantly polling the cluster state to whether somethings have changed in the cluster state's metadata.	2018-11-26 08:50:08 +01:00
Simon Willnauer	4711c5cdf3	Always return false from `refreshNeeded` on ReadOnlyEngine (#35837 ) Acquiring a searcher is unnecessary to determine if a refresh is necessary since read-only engines never refresh. Closes #35785	2018-11-24 09:25:42 +01:00
Simon Willnauer	e46e44ce38	Wrap can_match reader with ElasticsearchDirectoryReader (#35857 ) Code that operates on-top of the engine requires all readers returned to be unwrapped into ElasticsearchDirectoryReader. The special reader the FrozenEngine uses wasn't wrapped.	2018-11-24 09:23:53 +01:00
Andrey Ershov	f47636b254	[Zen2] Introduce VotingTombstone class (#35832 ) Today voting tombstones are stored in CoordinationMetaData as Set<DiscoveryNode>. DiscoveryNode is not a lightweight object and have a lot of fields. It also has toXContent method, but no fromXContent method and the output of toXContent is not enough to re-create DiscoveryNode object. And votingTombstone set should be persisted as a part of MetaData. On the other hand, the only thing required from the tombstone is the nodeId. This PR adds VotingTombstone class for voting tombstones, which consists of two fields for now - nodeId and nodeName. It could be extended/shrank in the future if needed. This PR also resolves TODO's related to the voting tombstones xcontent story. Example of CoordinationMetaData.toXContent with voting tombstones: { "term": 1, "last_committed_config": [ "fkwLdOBvXSlgRTBfgNAL", "tmQiPGHvUxXzPkkCDSJo", "HhOmtQBZAThpHIGWhxpz", "qZHWGpoDNPYRNIiqKsDl" ], "last_accepted_config": [ "lhqacKmriwhHGFZcvqbx", "MYysmBuROkvJRlDcusyd" ], "voting_tombstones": [ { "node_id": "McjbZbRkEz", "node_name": "pdKIWeNJUO" }, { "node_id": "cpXkVibGwo", "node_name": "UnCvFgdVsc" }, { "node_id": "EylRNOztbc", "node_name": "ohOhkbMWZX" } ] }	2018-11-23 18:34:06 +01:00
Yannick Welsch	51d2e986c5	Remove BWC conditions after backport of #35731 This PR was backported to 6.x, so the extra BWC conditions are not needed anymore	2018-11-23 17:11:06 +01:00
Adrien Grand	5b370316a6	Remove some legacy code from when indices could have multiple types. (#35815 ) This code is only necessary up to indices created with version 5.x while 7.0 only supports indices created with 6.x or 7.0.	2018-11-23 15:15:26 +01:00
Yannick Welsch	2970abfce9	Add read-only repository verification (#35731 ) Adds a verification mode for read-only repositories. It also makes the extra bucket check on repository creation obsolete, which fixes #35703.	2018-11-23 14:45:05 +01:00
Christoph Büscher	88d862e69f	[CI] Muting two methods in IndexShardTests Relates to #35850	2018-11-23 14:29:26 +01:00
David Turner	d01436de3c	Copy checkpoint atomically when rolling generation (#35407 ) Today when rolling a transog generation we copy the checkpoint from `translog.ckp` to `translog-nnnn.ckp` using a simple `Files.copy()` followed by appropriate `fsync()` calls. The copy operation is not atomic, so if we crash at the wrong moment we can leave an incomplete checkpoint file on disk. In practice the checkpoint is so small that it's either empty or fully written. However, we do not correctly handle the case where it's empty when the node restarts. In contrast, in `recoverFromFiles()` we _do_ copy the checkpoint atomically. This commit extracts the atomic copy operation from `recoverFromFiles()` and re-uses it in `rollGeneration()`.	2018-11-23 08:43:34 +00:00
Jim Ferenczi	be69a774df	Fix analyzed prefix query in query_string (#35756 ) This change fixes analyzed prefix queries in `query_string` to be ignored if all terms are removed during the analysis. Closes #31702	2018-11-23 09:42:23 +01:00
Tanguy Leroux	2e37f17a7d	Expose all permits acquisition in IndexShard and TransportReplicationAction (#35540 ) This pull request exposes two new methods in the IndexShard and TransportReplicationAction classes in order to allow transport replication actions to acquire all index shard operation permits for their execution. It first adds the acquireAllPrimaryOperationPermits() and the acquireAllReplicaOperationsPermits() methods to the IndexShard class which allow to acquire all operations permits on a shard while exposing a Releasable. It also refactors the TransportReplicationAction class to expose two protected methods (acquirePrimaryOperationPermit() and acquireReplicaOperationPermit()) that can be overridden when a transport replication action requires the acquisition of all permits on primary and/or replica shard during execution. Finally, it adds a TransportReplicationAllPermitsAcquisitionTests which illustrates how a transport replication action can grab all permits before adding a cluster block in the cluster state, making subsequent operations that requires a single permit to fail). Related to elastic #33888	2018-11-23 09:26:38 +01:00
Dimitris Athanasiou	43d6ec8bcd	Remove unnecessary throws IOException in CompressedXContent.string() (#35821 )	2018-11-22 15:08:46 +00:00
Jim Ferenczi	e37a0ef844	Upgrade to lucene-8.0.0-snapshot-67cdd21996 (#35816 )	2018-11-22 15:42:59 +01:00
Albert Zaharovits	4fc911a129	Mute test InternalEngineTests Relates #35823	2018-11-22 15:34:53 +02:00
Tanguy Leroux	f9f7261d60	Revert "Revert "[RCI] Check blocks while having index shard permit in TransportReplicationAction (#35332 )"" This reverts commit d3d7c01	2018-11-22 12:13:19 +01:00
Mayya Sharipova	b6014d971c	Forbid negative scores in functon_score query (#35709 ) * Forbid negative scores in functon_score query - Throw an exception when scores are negative in field_value_factor function - Throw an exception when scores are negative in script_score function Relates to #33309	2018-11-22 06:08:48 -05:00
Tanguy Leroux	11052b75c7	TransportResyncReplicationAction should not honour blocks (#35795 ) After #35332 has been merged, we noticed some test failures like #35597 in which one or more replica shards failed to be promoted as primaries because the primary replica re-synchronization never succeed. After some digging it appeared that the execution of the resync action was blocked because of the presence of a global cluster block in the cluster state (in this case, the "no master" block), making the resync action to fail when executed on the primary. Until #35332 such failures never happened because the TransportResyncReplicationAction is skipping the reroute phase, the only place where blocks were checked. Now with #35332 blocks are checked during reroute and also during the execution of the transport replication action on the primary. After some internal discussion, we decided that the TransportResyncReplicationAction should never be blocked. This action is part of the replica to primary promotion and makes sure that replicas are in sync and should not be blocked when the cluster state has no master or when the index is read only. This commit changes the TransportResyncReplicationAction to make obvious that it does not honor blocks. It also adds a simple test that fails if the resync action is blocked during the primary action execution. Closes #35597	2018-11-22 10:50:12 +01:00
David Turner	cfdf666672	[Zen2] Fix test failures in diff-based publishing (#35684 ) `testIncompatibleDiffResendsFullState` sometimes makes a 2-node cluster and then partitions one of the nodes from the leader, which makes the leader stand down. Then when the partition is removed the cluster re-forms but does so by sending full cluster states, not diffs, causing the test to fail. Additionally `testDiffBasedPublishing` sometimes fails if a publication is delivered out-of-order, wiping out a fresher last-received cluster state with a less-fresh one. This is fixed here by passing the received cluster state to the coordinator before recording it as the last-received one, relying on the coordinator's freshness checks.	2018-11-22 09:08:52 +00:00
Igor Motov	39789d0a10	GEO: More robust handling of ignore_malformed in geoshape parsing (#35603 ) Adds an XContent sub parser class that can to wrap another XContent parser at the beginning of an object and allow skiping all children in case of the parsing failure. It also uses this subparser to ignore the rest of the GeoJson shape if the parsing fails and we need to ignore the geoshape due to the ignore_malformed flag. Supersedes #34498 Closes #34047	2018-11-21 11:04:01 -10:00
Armin Braun	1a5553d495	MINOR: Cleanup Runnables in SnapshotsService (#35796 ) * Simplify complex `Runnable` by moving to `AbstractRunnable`	2018-11-21 19:58:51 +01:00
Nick Knize	3bee25cb70	[GEO] Add support to ShapeBuilders for building Lucene geometry (#35707 ) * [GEO] Add support to ShapeBuilders for building Lucene geometry This commit adds support for building lucene geometry from the ShapeBuilders. This is needed for integrating LatLonShape as the primary indexing approach for geo_shape field types. All unit and integration tests are updated to add randomization for testing both jts/s4j shapes and lucene shapes.	2018-11-21 11:15:01 -06:00
Yannick Welsch	c816347253	Disable testClusterJoinDespiteOfPublishingIssues for Zen2 This test is failing sometimes with Zen2 due to the lack of lag detection. Zen1 does not have this problem as it only considers a join as valid if the corresponding cluster state update is successfully published and committed on the joining node.	2018-11-21 17:21:44 +01:00
Andrey Ershov	a056bd8c1c	[Zen2] Move ClusterState fields to be persisted to ClusterState.MetaData (#35625 ) Today we have a way to atomically persist global MetaData and IndexMetaData to disk when new ClusterState is received. All other ClusterState fields are not persisted. However, there are other parts of ClusterState that should be persisted, namely: version term lastCommittedConfiguration lastAcceptedConfiguration votingTombstones version is changed frequently, other fields are not. We decided to group term, lastCommittedConfiguration, lastAcceptedConfiguration and votingTombstones into CoordinationMetaData class and make CoordinationMetaData a field inside MetaData. MetaData.toXContent and MetaData.fromXContent should take care of CoordinationMetaData. version stays as a top level field in ClusterState and will be persisted as part of Manifest in a follow-up commit. Also MetaData.isGlobalStateEquals should be extended to include coordinationMetaData in comparison. This commit favors exposing getters, such as getTerm directly in ClusterState to avoid massive code changes. An example of CoordinationMetaState.toXContent: { "term": 1, "last_committed_config": [ "TiIuBcbBtpuXyDDVHXeD", "ZIAoVbkjjLPLUuYLaTkw" ], "last_accepted_config": [ "OwkXbXZNOZPJqccdFHdz", "LouzsGYwmQzpeQMrboZe", "fCKGRZdjLTqzXAqPUtGL", "pLoxshjpJXwDhbgjfYJy", "SjINLwFIlIEFZCbjrSFo", "MDkVncJEVyZLJktopWje" ] }	2018-11-21 17:03:26 +01:00
Andrey Ershov	6ac0cb1842	Merge branch master into zen2 2 types of conflicts during the merge: 1) Line length fix 2) Classes no longer extend AbstractComponent	2018-11-21 15:36:49 +01:00
Yannick Welsch	8939a7894f	Zen2: Move disruption tests to Zen2 (#35724 ) - Moves disruption tests to Zen2 - Registers a few missing settings - Removes .put(TestZenDiscovery.USE_ZEN2.getKey(), true) from tests where Zen2 is now enabled by default through the parent test class - Moves QuorumGatewayIT back to Zen1, as it is not stable with Zen2 as it currently relies on dangling indices due to the lack of proper CS persistence, which triggers secondary failures	2018-11-21 14:43:33 +01:00
Armin Braun	bdf632b6f9	SNAPSHOTING+MINOR: Simplify SnapshortShardService (#35769 )	2018-11-21 13:50:17 +01:00
Christoph Büscher	6638708b56	Remove deprecated QueryStringQueryBuilder#splitOnWhiteSpace (#35763 ) This parameter has been deprecated and was ignored since 6.0, so its Java API methods can be removed.	2018-11-21 10:29:08 +01:00

... 3 4 5 6 7 ...

2032 Commits