OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-18 19:05:06 +00:00

Author	SHA1	Message	Date
Mayya Sharipova	f884b2b1cd	Deprecate types in index API (#36575 ) * Deprecate types in index API - deprecate type-based constructors of IndexRequest - update tests to use typeless IndexRequest constructors - no yaml tests as they have been already added in #35790 Relates to #35190	2018-12-18 08:53:49 -05:00
Boaz Leskes	5f76f39386	Rename seq# powered optimistic concurrency control parameters to ifSeqNo/ifPrimaryTerm (#36757 ) This PR renames the parameters previously introduce to the following: ### URL Parameters ``` PUT twitter/_doc/1?if_seq_no=501&if_primary_term=1 { "user" : "kimchy", "post_date" : "2009-11-15T14:12:12", "message" : "trying out Elasticsearch" } DELETE twitter/_doc/1?if_seq_no=501&if_primary_term=1 ``` ### Bulk API ``` POST _bulk { "index" : { "_index" : "test", "_type" : "_doc", "_id" : "1", "if_seq_no": 501, "if_primary_term": 1 } } { "field1" : "value1" } { "delete" : { "_index" : "test", "_type" : "_doc", "_id" : "2", "if_seq_no": 501, "if_primary_term": 1 } } ``` ### Java API ``` IndexRequest.ifSeqNo(long seqNo) IndexRequest.ifPrimaryTerm(long primaryTerm) DeleteRequest.ifSeqNo(long seqNo) DeleteRequest.ifPrimaryTerm(long primaryTerm) ``` Relates #36148 Relates #10708	2018-12-18 14:35:18 +01:00
Boaz Leskes	9087c98a5a	Expose Sequence Number based Optimistic Concurrency Control in the rest layer (#36721 ) Relates #36148 Relates #10708	2018-12-18 10:56:02 +01:00
Luca Cavanna	8a412c6a26	Ensure MapperService#getAllMetaFields elements order is deterministic (#36739 ) MapperService#getAllMetaFields returns an array, which is created out of an `ObjectHashSet`. Such set does not guarantee deterministic hash ordering. The array returned by its toArray may be sorted differently at each run. This caused some repeatability issues in our tests (see #29080) as we pick random fields from the array of possible metadata fields, but that won't be repeatable if the input array is sorted differently at every run. Once setting the tests seed, hppc picks that up and the sorting is deterministic, but failures don't repeat with the seed that gets printed out originally (as a seed was not originally set). See also https://issues.carrot2.org/projects/HPPC/issues/HPPC-173. With this commit, we simply create a static sorted array that is used for `getAllMetaFields`. The change is in production code but really affects only testing as the only production usage of this method was to iterate through all values when parsing fields in the high-level REST client code. Anyways, this seems like a good change as returning an array would imply that it's deterministically sorted.	2018-12-18 10:43:14 +01:00
Luca Cavanna	b57e12aa44	Add raw sort values to SearchSortValues transport serialization (#36617 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of each `SearchHit`. Sort values are already present but they are formatted according to the provided `DocValueFormat` provided. The CCS node needs to be able to reconstruct the lucene `FieldDoc` to include in the `TopFieldDocs` and `CollapseTopFieldDocs` which will feed the `mergeTopDocs` method used to reduce multiple search responses (one per cluster) into one. This commit adds such information to the `SearchSortValues` and exposes it through a new getter method added to `SearchHit` for retrieval. This info is only serialized at transport and never printed out at REST.	2018-12-18 09:20:51 +01:00
Nicholas Knize	96d279ed83	Revert "[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 )" This reverts commit 5bc7822562a6eefa4a64743233160cdc9f431adf.	2018-12-17 20:09:46 -06:00
Christoph Büscher	2f5300e3a6	Deprecate types in get_source and exist_source (#36426 ) This change adds a new untyped endpoint `{index}/_source/{id}` for both the GET and the HEAD methods to get the source of a document or check for its existance. It also adds deprecation warnings to RestGetSourceAction that emit a warning when the old deprecated "type" parameter is still used. Also updating documentation and tests where appropriate. Relates to #35190	2018-12-18 00:57:42 +01:00
Jason Tedor	f0f2b26159	Fix duplicate phrase in shrink/split error message (#36734 ) This commit removes a duplicate "must be a" from the shrink/split error messages.	2018-12-17 17:42:58 -05:00
Jake Landis	384757deff	ingest: support default pipelines + bulk upserts (#36618 ) This commit adds support to enable bulk upserts to use an index's default pipeline. Bulk upsert, doc_as_upsert, and script_as_upsert are all supported. However, bulk script_as_upsert has slightly surprising behavior since the pipeline is executed _before_ the script is evaluated. This means that the pipeline only has access the data found in the upsert field of the script_as_upsert. The non-bulk script_as_upsert (existing behavior) runs the pipeline _after_ the script is executed. This commit does _not_ attempt to consolidate the bulk and non-bulk behavior for script_as_upsert. This commit also adds additional testing for the non-bulk behavior, which remains unchanged with this commit. fixes #36219	2018-12-17 16:25:11 -06:00
Armin Braun	75bfbe92ee	TESTS:Debug Log. IndexStatsIT#testFilterCacheStats	2018-12-17 22:35:30 +01:00
Nick Knize	5bc7822562	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 ) This commit exposes lucene's LatLonShape field as the default type in GeoShapeFieldMapper. To use the new indexing approach, simply set "type" : "geo_shape" in the mappings without setting any of the strategy, precision, tree_levels, or distance_error_pct parameters. Note the following when using the new indexing approach: * geo_shape query does not support querying by MULTIPOINT. * LINESTRING and MULTILINESTRING queries do not yet support WITHIN relation. * CONTAINS relation is not yet supported. The tree, precision, tree_levels, distance_error_pct, and points_only parameters are deprecated.	2018-12-17 14:38:14 -06:00
Jake Landis	7bf822bbbb	ingest: fix on_failure with Drop processor (#36686 ) This commit allows a document to be dropped when a Drop processor is used in the on_failure fork of the processor chain. Fixes #36151	2018-12-17 14:10:13 -06:00
Armin Braun	4d0bb9dd0a	SNAPSHOTS: Adjust BwC Versions in Restore Logic (#36718 ) * Re-enables bwc tests with adjusted version conditions now that #36397 enables concurrent snapshots in 6.6+	2018-12-17 21:02:59 +01:00
Jason Tedor	cd632de116	Do not resolve addresses in remote connection info (#36671 ) The remote connection info API leads to resolving addresses of seed nodes when invoked. This is problematic because if a hostname fails to resolve, we would not display any remote connection info. Yet, a hostname not resolving can happen across remote clusters, especially in the modern world of cloud services with dynamically chaning IPs. Instead, the remote connection info API should be providing the configured seed nodes. This commit changes the remote connection info to display the configured seed nodes, avoiding a hostname resolution. Note that care was taken to preserve backwards compatibility with previous versions that expect the remote connection info to serialize a transport address instead of a string representing the hostname.	2018-12-17 13:24:31 -05:00
Evgenia Badyanova	27ad733dce	Fixing line length for EnvironmentTests and RecoveryTests (#36657 ) Relates #34884	2018-12-17 11:59:45 -05:00
Boaz Leskes	e356b8cb95	Add doc's sequence number + primary term to GetResult and use it for updates (#36680 ) This commit adds the last sequence number and primary term of the last operation that have modified a document to `GetResult` and uses it to power the Update API. Relates #36148 Relates #10708	2018-12-17 15:22:13 +01:00
David Turner	f7567b865e	Improve assertions in testStaleMasterNotHijackingMajority (#36658 ) Investigating #36556 was made a little trickier because the feedback from the failing assertion wasn't very informative, and the messages attached to other nearby assertions were misleading. This commit improves the feedback from these assertions and tidies up a few other issues in the test suite.	2018-12-17 08:34:34 +00:00
Armin Braun	e82a647740	TEST: Fix Assertion for Shard Allocation Counts (#36685 ) * If removing half the nodes completely removes a shard from the cluster we can't count it in the assertion * Also: * Remove unused logger parameter * Fix typo in var name * Closes #35365	2018-12-17 09:20:21 +01:00
Boaz Leskes	733a6d34c1	Add seq no powered optimistic locking support to the index and delete transport actions (#36619 ) This commit add support for using sequence numbers to power [optimistic concurrency control](http://en.wikipedia.org/wiki/Optimistic_concurrency_control) in the delete and index transport actions and requests. A follow up will come with adding sequence numbers to the update and get results. Relates #36148 Relates #10708	2018-12-15 17:59:57 +01:00
Tim Brooks	4e4022b7ef	Register the TCP_CONNECT_TIMEOUT setting (#36663 ) This commit registers the TCP_CONNECT_TIMEOUT setting in the ClusterSettings.	2018-12-14 18:06:48 -07:00
Tim Brooks	3065300434	Unify transport settings naming (#36623 ) This commit updates our transport settings for 7.0. It generally takes a few approaches. First, for normal transport settings, it usestransport. instead of transport.tcp. Second, it uses transport.tcp, http.tcp, or network.tcp for all settings that are proxies for OS level socket settings. Third, it marks the network.tcp.connect_timeout setting for removal. Network service level settings are only settings that apply to both the http and transport modules. There is no connect timeout in http. Fourth, it moves all the transport settings to a single class TransportSettings similar to the HttpTransportSettings class. This commit does not actually remove any settings. It just adds the new renamed settings and adds todos for settings that will be deprecated.	2018-12-14 14:41:04 -07:00
Michael Basnight	dae422fb2b	Update joda compat methods to use compat class (#36654 ) The existing joda compat methods isEquals isAfter and isBefore all took in a ZonedDateTime, but since all of the scripting is now using the new JodaCompatZonedDateTime, these are changed to take that in instead.	2018-12-14 15:38:51 -06:00
João Barbosa	276726aea2	Added keyed response to pipeline percentile aggregations 22302 (#36392 ) Closes #22302	2018-12-14 16:22:54 -05:00
Alan Woodward	c7ac9ef826	Upgrade to lucene snapshot 774e9aefbc (#36637 ) Includes LUCENE-8607: improvement to MatchAllDocsQuery	2018-12-14 20:30:07 +00:00
Julie Tibshirani	ccd1beb9b3	Deprecate types in update requests. (#36181 ) The following updates were made: * Add deprecation warnings to `RestUpdateAction`, plus a test in `RestUpdateActionTests`. * Deprecate relevant methods on the Java HLRC requests/ responses. * Add HLRC integration tests for the typed APIs. * Update documentation (for both the REST API and Java HLRC). * Fix failing integration tests. Because of an earlier PR, the REST yml tests were already updated (one version without types, and another legacy version that retains types).	2018-12-14 10:47:27 -08:00
Tim Brooks	fbf88b2ab7	Remove the `MockTcpTransport` (#36628 ) This commit removes all remaining usages of the `MockTcpTransport`. Additionally it removes the `MockTcpTransport` and its test case.	2018-12-14 10:59:07 -07:00
Luca Cavanna	bb3ae18da5	Increase coverage in SearchSortValuesTests (#36597 ) SearchSortValuesTests extends now `AbstractSerializingTestCase` which removes some code duplication and standardizes the way we test `fromXContent`, serialization and equals/hashcode. Also, we were never creating `SearchSortValues` through their public constructor that accept an array of `DocValueFormat` together with the array of raw sort values. That is covered now, which involved some conversion from `BytesRef` to String in the test. Also, the previous test was not using doing any equality check against the original and parsed versions in `testFromXContent` due to values being parsed with different types in some cases, which is now covered by converting those values using a new method added to `RandomObjects`. The code was already there as part of `randomStoredFieldValues`, but it is now exposed to be used in other scenarios.	2018-12-14 18:57:37 +01:00
Luca Cavanna	8f04536a35	Add copy constructor to SearchRequest (#36641 ) For cross cluster search alternate execution mode (see #32125), we will need to take a search request that spans across multiple clusters (based on index prefixes e.g. cluster1:index, cluster2:index etc.) and split it into multiple search requests to be sent to each cluster. A copy constructor added to `SearchRequest` would make that easy and well maintainable in the future. Something along the same lines already happens in `BulkByScrollParallelizationHelper`, but the corresponding code went outdated as some new fields were added to `SearchRequest` which were not added to the bulk by scroll code. A copy constructor helps making the task of copying a search request maintainable over time.	2018-12-14 18:30:29 +01:00
Alan Woodward	09bf93dc2a	Add intervals query (#36135 ) * Add IntervalQueryBuilder with support for match and combine intervals * Add relative intervals * feedback * YAML test - broekn * yaml test; begin to add block source * Add block; make disjunction its own source * WIP * Extract IntervalBuilder and add tests for it * Fix eq/hashcode in Disjunction * New yaml test * checkstyle * license headers * test fix * YAML format * YAML formatting again * yaml tests; javadoc * Add OR test -> requires fix from LUCENE-8586 * Add docs * Re-do API * Clint's API * Delete bash script * doc fixes * imports * docs * test fix * feedback * comma * docs fixes * Tidy up doc references to old rule	2018-12-14 15:14:00 +00:00
David Turner	278cc4c3de	[Zen2] Improve assertion when creating fake node ID (#36544 ) Today we assert that the fake node ID is greater than the real node's ID. In fact we want to assert that it's greater than _all_ proper UUIDs. This adds assertions to that effect.	2018-12-14 14:16:26 +00:00
Jeff Hajewski	f1f3b28f5c	Delete deprecated getValues from ScriptDocValues (#36183 ) * Adds deprecation logging to ScriptDocValues#getValues. First commit addressing issue #22919. `ScriptDocValues#getValues` was added for backwards compatibility but no longer needed. Scripts using the syntax `doc['foo'].values` when `doc['foo']` is a list should be using `doc['foo']` instead. * Fixes two build errors in #34279 * Removes unused import in ScriptDocValuesDatesTest * Removes used of `.values` in example in diversified-sampler-aggregation.asciidoc * Removes use of .values from painless test. Part of #34279 * Updates tests to use `doc[foo]` syntax rather than `doc[foo].values`. * Removes use of `getValues()` and replaces use of `doc[foo].values` with `doc[foo]`. * Indentation fix. * Remove unnecessary list construction at previous `getValues()` callsite in ScriptDocValues.GeoPoints. * Update migration doc and add link to `getValue` in ScriptDocValues javadoc. * Fix compile * Fix javadoc issue * Removes ScriptDocValues#getValues usage from painless whitelist.	2018-12-14 07:56:47 -05:00
David Turner	245cdd8c34	Register ResyncTask.Status as a NamedWriteable (#36610 ) Today, ResyncTask.Status is not registered, but appears as a task status sometimes, leading to `Failed to deserialize response from handler` exceptions: java.lang.IllegalArgumentException: Unknown NamedWriteable [org.elasticsearch.tasks.Task$Status][resync] This commit adds the missing registration.	2018-12-14 12:16:09 +00:00
David Turner	44ba9ab04d	Fix racy use of ConcurrentHashMap (#36603 ) ConcurrentHashMap does not always behave correctly if removing elements and concurrently checking for its emptyiness. Work around this by protecting all usages with a mutex (there was only one usage unprotected by the mutex anyway) and then we don't even need a ConcurrentHashMap at all.	2018-12-14 12:15:42 +00:00
Luca Cavanna	7dc3d3b78b	Add sort and collapse info to SearchHits transport serialization (#36555 ) In order for CCS alternate execution mode (see #32125) to be able to do the final reduction step on the CCS coordinating node, we need to serialize additional info in the transport layer as part of the `SearchHits`, specifically: - lucene `SortField[]` which contains info about the fields that sorting was performed on and their type, which depends on mappings (that the CCS node does not know about) - collapse field (`String`) that field collapsing was executed on, if requested - collapse values (`Object[]`) that field collapsing was based on, if requested This info is needed to be able to reconstruct the `TopFieldDocs` or `CollapseFieldTopDocs` in the CCS coordinating node to feed the `mergeTopDocs` method and reduce multiple search responses received (one per cluster) into one. This commit adds such information to the `SearchHits` class. It's nullable info that is not serialized through the REST layer. `SearchPhaseController` sets such info at the end of the hits reduction phase.	2018-12-14 12:22:54 +01:00
Armin Braun	c5b3ac5578	SNAPSHOTS: Allow Parallel Restore Operations (#36397 ) * Enable parallel restore operations * Add uuid to restore in progress entries to uniquely identify them * Adjust restore in progress entries to be a map in cluster state * Added tests for: * Parallel restore from two different snapshots * Parallel restore from a single snapshot to different indices to test uuid identifiers are correctly used by `RestoreService` and routing allocator * Parallel restore with waiting for completion to test transport actions correctly use uuid identifiers	2018-12-14 11:39:23 +01:00
Nhat Nguyen	1ae53e6b20	TEST: Disable merges in testRebuildLocalCheckpointTracker (#36625 ) In this test, we keep track of a list of index commits then verify that we reload exactly every operation from the safe commit. If a background merge is triggered, then we might have a new index commit which is not recorded in the tracking list. This change disables merges in the test. Closes #36470	2018-12-14 05:25:36 -05:00
Christoph Büscher	573b6325e7	[Tests] Fix edge case in SimpleQueryStringBuilderTests (#36611 ) A previous fix of a similar problem in #35201 wasn't general enough, we also need to catch cases where the randomly generated query string starts with some version of "now" and hits a date field. Closes #36595	2018-12-14 11:10:45 +01:00
Daniel Mitterdorfer	b7ba2fa74b	Fix line length for VersionTests Relates #34884 Relates #36585	2018-12-14 10:07:27 +01:00
Daniel Mitterdorfer	42f094952b	Fix line length for IndexLifecycleActionIT Relates #34884 Relates #36586	2018-12-14 10:00:37 +01:00
Ryan Ernst	254d1e8f22	Core: Add backcompat for joda time formats (#36531 ) This commit adds deprecation warnings when using format specifiers with joda data formats that will change with java time. It also adds the "8" prefix which may be used to force the new java time format parsing.	2018-12-13 12:26:51 -08:00
Christoph Büscher	b33ff16d62	Remove deprecated `useDisMax` from MultiMatchQuery (#36488 ) The getters and setters for useDisMax() have been deprecated since at least 6.0, also there hasn't been any reference to the query parameter in the documentation. Removing it from the builder and tests and replacing it with `tieBreaker(1.0f)` where necessary.	2018-12-13 20:13:16 +01:00
Tal Levy	b820d7c617	fix MultiValuesSourceFieldConfig toXContent (#36525 ) This commit turns MultiValuesSourceFieldConfig into a proper ToXContentObject for easy testing and verification of its to/from XContent methods. Closes #36474.	2018-12-13 08:17:36 -08:00
Tim Brooks	f998e04c34	Remove usages of `MockTcpTransport` from zen tests (#36579 ) We are attempting to replace the usage of the `MockTcpTransport` with the `MockNioTransport`. This commit replaces usages of `MockTcpTransport` in two zen test cases.	2018-12-13 09:10:18 -07:00
Jason Tedor	2afa7faefd	Override the JVM DNS cache policy (#36570 ) When a security manager is present, the JVM will cache positive hostname lookups indefinitely. This can be problematic, especially in the modern world with cloud services where DNS addresses can change, or environments using Docker containers where IP addresses could be considered ephemeral. This behavior impacts cluster discovery, cross-cluster replication and cross-cluster search, reindex from remote, snapshot repositories, webhooks in Watcher, external authentication mechanisms, and the Elastic Stack Monitoring Service. The experience of watching a DNS lookup change yet not be reflected within Elasticsearch is a poor experience for users. The reason the JVM has this is guard against DNS cache posioning attacks. Yet, there is already a defense in the modern world against such attacks: TLS. With proper certificate validation, even if a resolver falls prey to a DNS cache poisoning attack, using TLS would neuter the attack. Therefore we have a policy with dubious security value that significantly impacts usability. As such we make the usability/security tradeoff towards usability, since the security risks are very low. This commit introduces new system properties that Elasticsearch observes to override the JVM DNS cache policy.	2018-12-13 10:23:45 -05:00
Igor Motov	7446f75346	Geo: Adds a name of the field to geopoint parsing errors (#36529 ) Adds the field name and type to geo_point parsing errors. Closes #15965	2018-12-13 17:32:45 +04:00
Armin Braun	088d3f39c3	MINOR: Cleanup Dead Code in o.e.c.metadata Package (#36587 )	2018-12-13 12:54:22 +01:00
David Roberts	13cb0fb98b	Periodically try to reassign unassigned persistent tasks (#36069 ) Previously persistent task assignment was checked in the following situations: - Persistent tasks are changed - A node joins or leaves the cluster - The routing table is changed - Custom metadata in the cluster state is changed - A new master node is elected However, there could be situations when a persistent task that could not be assigned to a node could become assignable due to some other change, such as memory usage on the nodes. This change adds a timed recheck of persistent task assignment to account for such situations. The timer is suspended while checks triggered by cluster state changes are in-flight to avoid adding burden to an already busy cluster. Closes #35792	2018-12-13 09:15:27 +00:00
Armin Braun	34d7cc13c1	SETTINGS: Correctly Identify Noop Updates (#36560 ) * We should compare the target value with the to be applied value before interpreting the update as a change * This speeds up the test failing in #36496 considerably by preventing state updates on noop setting updates	2018-12-13 10:02:54 +01:00
Boaz Leskes	f6b5d7e013	Add sequence numbers based optimistic concurrency control support to Engine (#36467 ) This commit add support to engine operations for resolving and verifying the sequence number and primary term of the last modification to a document before performing an operation. This is infrastructure to move our (optimistic concurrency control)[http://en.wikipedia.org/wiki/Optimistic_concurrency_control] API to use sequence numbers instead of internal versioning. Relates #36148 Relates #10708	2018-12-13 08:08:40 +01:00
Tal Levy	cd1bec3a06	[refactor] add Environment in BootstrapContext (#36573 ) There are certain BootstrapCheck checks that may need access environment-specific values. Watcher's EncryptSensitiveDataBootstrapCheck passes in the node's environment via a constructor to bypass the shortcoming in BootstrapContext. This commit pulls in the node's environment into BootstrapContext. Another case is found in #36519, where it is useful to check the state of the data-path. Since PathUtils.get and Paths.get are forbidden APIs, we rely on the environment to retrieve references to things like node data paths. This means that the BootstrapContext will have the same Settings used in the Environment, which currently differs from the Node's settings.	2018-12-12 21:07:21 -08:00

1 2 3 4 5 ...

2026 Commits