OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-09 14:34:43 +00:00

Author	SHA1	Message	Date
Luca Cavanna	67f97e3686	Re-enable bwc tests Relates to #37187	2019-01-08 09:53:58 +01:00
Andrei Stefan	3fad9d25f6	SQL: fix COUNT DISTINCT filtering (#37176 ) * Use `_count` aggregation value only for not-DISTINCT COUNT function calls * COUNT DISTINCT will use the _exact_ version of a field (the `keyword` sub-field for example), if there is one	2019-01-08 08:47:35 +02:00
Tim Brooks	b0665963e8	Make remote ping and compress settings dynamic (#37200 ) Traditionally remote clusters can be configured dynamically. However, the compress and ping settings are not currently set to be configured dynamically. This commit changes that.	2019-01-07 23:35:22 -07:00
Jason Tedor	c8c596cead	Introduce retention lease expiration (#37195 ) This commit implements a straightforward approach to retention lease expiration. Namely, we inspect which leases are expired when obtaining the current leases through the replication tracker. At that moment, we clean the map that persists the retention leases in memory.	2019-01-07 22:03:52 -08:00
Julie Tibshirani	c5aac4705d	Revert "Stop automatically nesting mappings in index creation requests. (#36924 )" This reverts commit ac1c6940d20b9ac993422b279d8f3a8bd73a3527.	2019-01-07 17:56:40 -08:00
Ryan Ernst	55d3ca3aa8	Core: Rework epoch time parsing for java time (#36914 ) This commit converts the epoch time parsing implementation which uses the java time api to create DateTimeFormatters instead of DateFormatter implementations. This will allow multi formats for java time to be implemented in a single DateTimeFormatter in a future change.	2019-01-07 16:15:30 -08:00
Christoph Büscher	56e472bfbc	Removing unused methods in Numbers (#37186 ) Remove several unused helper methods. Most of them are one-liners and should be easier to be used from the corresponding primitive wrapper classes. The bytes array conversion methods are unused as well, it should be easy to re-create them if needed.	2019-01-08 00:14:50 +01:00
Jason Tedor	3b48b99861	Fix setting by time unit (#37192 ) This commit fixes an issue with a settings builder method that allows setting a duration by time unit. In particular, this method can suffer from a loss of precision. For example, if the input duration is 1500 microseconds then internally we are converting this to "1ms", demonstrating the loss of precision. Instead, we should internally convert this to a TimeValue that correctly represents the input duration, and then convert this to a string using a method that does not lose the unit. That is what this commit does.	2019-01-07 14:59:24 -08:00
lcawl	382e4d39ef	[DOCS] Cleans up xpackml attributes	2019-01-07 14:33:10 -08:00
Benjamin Trent	6b376a1ff4	ML: fix delayed data annotations on secured cluster (#37193 ) * changing executing context for writing annotation * adjusting user * removing unused import	2019-01-07 15:18:38 -06:00
Luca Cavanna	e34658edba	Update version in SearchRequest and related test Version needs to be updated after backporting #36997 & #37142 where we added support for providing and serializing localClusterAlias as well as absoluteStartMillis. Relates to #36997 & #37142	2019-01-07 18:47:17 +01:00
Lisa Cawley	f307847f29	[DOCS] Adds overview and API ref for cluster voting configurations (#36954 )	2019-01-07 09:11:14 -08:00
Benjamin Trent	1780ced82d	ML: changing JobResultsProvider.getForecastRequestStats to support > 1 index (#37157 ) * ML: changing JobResultsProvider.getForecastRequestStats to support more than one index * moving to use idsQuery()	2019-01-07 10:58:55 -06:00
Christophe Bismuth	9602d794c6	Separate out validation of groups of settings (#34184 ) Today, a setting can declare that its validity depends on the values of other related settings. However, the validity of a setting is not always checked against the correct values of its dependent settings because those settings' correct values may not be available when the validator runs. This commit separates the validation of a settings update into two phases, with separate methods on the `Setting.Validator` interface. In the first phase the setting's validity is checked in isolation, and in the second phase it is checked again against the values of its related settings. Most settings only use the first phase, and only the few settings with dependencies make use of the second phase.	2019-01-07 16:12:58 +00:00
David Turner	9d0e0eb0f3	[Zen2] Remove initial master node count setting (#37150 ) The `cluster.unsafe_initial_master_node_count` setting was introduced as a temporary measure while the design of `cluster.initial_master_nodes` was being finalised. This commit removes this temporary setting, replacing it with usages of `cluster.initial_master_nodes` where appropriate.	2019-01-07 16:05:00 +00:00
Jason Tedor	c0f8c89172	Introduce shard history retention leases (#37167 ) This commit is the first in a series which will culminate with fully-functional shard history retention leases. Shard history retention leases are aimed at preventing shard history consumers from having to fall back to expensive file copy operations if shard history is not available from a certain point. These consumers include following indices in cross-cluster replication, and local shard recoveries. A future consumer will be the changes API. Further, index lifecycle management requires coordinating with some of these consumers otherwise it could remove the source before all consumers have finished reading all operations. The notion of shard history retention leases that we are introducing here will also be used to address this problem. Shard history retention leases are a property of the replication group managed under the authority of the primary. A shard history retention lease is a combination of an identifier, a retaining sequence number, a timestamp indicating when the lease was acquired or renewed, and a string indicating the source of the lease. Being leases they have a limited lifespan that will expire if not renewed. The idea of these leases is that all operations above the minimum of all retaining sequence numbers will be retained during merges (which would otherwise clear away operations that are soft deleted). These leases will be periodically persisted to Lucene and restored during recovery, and broadcast to replicas under certain circumstances. This commit is merely putting the basics in place. This first commit only introduces the concept and integrates their use with the soft delete retention policy. We add some tests to demonstrate the basic management is correct, and that the soft delete policy is correctly influenced by the existence of any retention leases. We make no effort in this commit to implement any of the following: - timestamps - expiration - persistence to and recovery from Lucene - handoff during primary relocation - sharing retention leases with replicas - exposing leases in shard-level statistics - integration with cross-cluster replication These will occur individually in follow-up commits.	2019-01-07 07:43:57 -08:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Jason Tedor	a233db7367	Fix handling of fractional byte size value settings (#37172 ) This commit addresses an issue when setting a byte size value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a byte size value setting to a value of 1536 bytes is problematic because internally this is converted to the string "1.5k". When we go to get this setting, we try to parse "1.5k" back to a byte size value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the byte size value to the string and back to the byte size value.	2019-01-07 07:13:50 -08:00
Luca Cavanna	ad4e4fd9b8	Disable bwc tests until #36997 & #37142 backports are complete (#37187 )	2019-01-07 15:59:26 +01:00
Armin Braun	82b1f10eb1	SNAPSHOT+TESTS: Rem. Mock Atomic Writes Randomness (#37011 ) * Randomly doing non-atomic writes causes rare 0 byte reads from `index-N` files in tests * Removing this randomness fixes these random failures and is valid because it does not reproduce a real-world failure-mode: * Cloud-based Blob stores (S3, GCS, and Azure) do not have inconsistent partial reads of a blob, either you read a complete blob or nothing on them * For file system based blob stores the atomic move we do (to atomically write a file) by setting `java.nio.file.StandardCopyOption#ATOMIC_MOVE` would throw if the file system does not provide for atomic moves * Closes #37005	2019-01-07 15:24:23 +01:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Simon Willnauer	ac2e09b25a	Fix suite scope random initialization (#37163 ) The initialization of a suite scope cluster had some side effects on subsequent runs, which cause issues when tests must be reproduced. This moves the suite scope initialization to a private random context. Closes #36202	2019-01-07 14:20:17 +01:00
Alpar Torok	21996f7690	Expand on #37058 Use the wrapper	2019-01-07 14:06:06 +02:00
Armin Braun	617e294133	SNAPSHOT: Make Atomic Blob Writes Mandatory (#37168 ) * With #37066 introducing atomic writes to HDFS repository we can enforce atomic write capabilities on this interface * The overrides on the other three cloud implementations are ok because: * https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html states that "Amazon S3 never adds partial objects; if you receive a success response, Amazon S3 added the entire object to the bucket." * https://cloud.google.com/storage/docs/consistency states that GCS has strong read-after-write consistency * https://docs.microsoft.com/en-us/rest/api/storageservices/put-block#remarks Azure has the concept of committing blobs, so there's no partial content here either * Relates #37011	2019-01-07 12:11:19 +01:00
David Kyle	7cc749dced	Mute SearchStatsIT.testOpenContexts	2019-01-07 09:47:58 +00:00
Christoph Büscher	60742b12e9	Remove deprecated CardinalityAggregationBuilder#rehash method (#37154 ) It has been deprecated since at least 6.0, is a no-op and unused in the rest of our code.	2019-01-07 10:42:07 +01:00
Christoph Büscher	d18c3d651d	Introduce an `include_type_name` constant (#37155 ) I started referring to this parameter name from various places in #37149 so I think it's a good idea to simplify things by referring to a common constant.	2019-01-07 10:41:40 +01:00
Christoph Büscher	12a105e5ef	Remove deprecated PutIndexTemplateRequestBuilder#setTemplate (#37151 ) The method has been removed since 6.0, there is a direct replacement and it is only used in tests still.	2019-01-07 10:41:04 +01:00
Luca Cavanna	2f4dafa69f	Add support for providing absolute start time to SearchRequest (#37142 ) We have recently added support for providing a local cluster alias to a SearchRequest through a package protected constructor. When executing cross-cluster search requests with local reduction on each cluster, the CCS coordinating node will have to provide such cluster alias to each remote cluster, as well as the absolute start time of the search action in milliseconds from the time epoch, to be used when evaluating date math expressions both while executing queries / scripts as well as when resolving index names. This commit adds support for providing the start time together with the cluster alias. It is a final member in the search request, which will only be set when using cross-cluster search with local reduction (also known as alternate execution mode). When not provided, the coordinating node will determine the current time and pass it through (by calling `System.currentTimeMillis`). Relates to #32125	2019-01-07 10:28:31 +01:00
Tanguy Leroux	6347461146	Rename ClusterBlocks.hasGlobalBlock methods (#36941 ) As suggested in #36775, this pull request renames the following methods: ClusterBlocks.hasGlobalBlock(int) ClusterBlocks.hasGlobalBlock(RestStatus) ClusterBlocks.hasGlobalBlock(ClusterBlockLevel) to something that better reflects the property of the ClusterBlock that is searched for: ClusterBlocks.hasGlobalBlockWithId(int) ClusterBlocks.hasGlobalBlockWithStatus(RestStatus) ClusterBlocks.hasGlobalBlockWithLevel(ClusterBlockLevel)	2019-01-07 09:42:27 +01:00
Armin Braun	31c33fdb9b	MINOR: Remove some Deadcode in Gradle (#37160 )	2019-01-07 09:21:25 +01:00
Andrei Stefan	39a072389c	SQL: add sub-selects to the Limitations page (#37012 )	2019-01-07 10:08:51 +02:00
Marios Trivyzas	e778abaac5	SQL: Improve error message when unable to translate to ES query DSL (#37129 ) Improve error message returned to the client when an SQL statement cannot be translated to an ES query DSL. Cases: 1. WHERE clause evaluates to FALSE => No results returned 2. Missing FROM clause => Local execution, e.g.: SELECT SIN(PI()) 3. Special SQL command => Only valid for SQL iface, e.g.: SHOW TABLES Fixes: #37040	2019-01-07 09:21:23 +02:00
Jason Tedor	bf5bc88f50	Fix handling of fractional time value settings (#37171 ) This commit addresses an issue when setting a time value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a time value setting to a value of 1500ms is problematic because internally this is converted to the string "1.5s". When we go to get this setting, we try to parse "1.5s" back to a time value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the time value to the string and back to the time value.	2019-01-06 22:34:52 -08:00
Marios Trivyzas	da3d8fb5b7	SQL: Fix issue with wrong NULL optimization (#37124 ) Logical operators OR and AND as well as conditional functions (COALESCE, LEAST, GREATEST, etc.) cannot be folded to NULL if one of their children is NULL as is the case for most of the functions. Therefore, their nullable() implementation cannot return true. On the other hand, they cannot return false either: if they're wrapped within an IS NULL or IS NOT NULL expression, the expression will be folded to false and true respectively, leading to wrong results. Change the signature of nullable() method and add a third value UNKNOWN to handle these cases. Fixes: #35872	2019-01-06 18:29:34 +02:00
Armin Braun	b34e7d4f19	SNAPSHOT+TESTS: Relax Assertion in DisruptionIT (#37144 ) * The retries on the failing master can lead to concurrently trying to create and delete a snapshot, catch this for now to fix this test * closes #36779	2019-01-05 17:52:24 +01:00
Simon Willnauer	0cc877026f	Subclass NIOFSDirectory instead of using FileSwitchDirectory (#37140 ) We don't want two FSDirectories to manage pending deletes separately and optimize file listing. This confuses IndexWriter and causes exceptions when files are deleted twice but are pending for deletion. This change moves to using a NIOFS subclass that only delegates to MMAP for opening files; all metadata and pending deletes are managed on top. Closes #37111 Relates to #36668	2019-01-05 10:15:33 +01:00
Julie Tibshirani	0bac64fbd3	Deprecate the _type field in aggregations. (#37131 )	2019-01-04 13:05:52 -08:00
Michael Basnight	e40193ae66	HLRC: Fix Reindex from remote query logic (#36908 ) The query object was incorrectly added to the remote object in the xcontent. This fix moves the query back into the source, if it was passed in as part of the RemoteInfo. It also adds a IPv6 test for reindex from remote such that we can properly validate this.	2019-01-04 13:37:59 -06:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Adam Thomson	ac4aecc92d	[Docs] Update ingest-node.asciidoc (#37116 )	2019-01-04 19:33:06 +01:00
Simon Willnauer	b4f113d3ea	Don't block on peer recovery on the target side (#37076 ) Today we block using the generic thread-pool on the target side until the source side has fully executed the recovery. We still block on the source side executing the recovery in a blocking fashion but there is no reason to block on the target side. This will release generic threads early if many concurrent recoveries happen. Relates to #36195	2019-01-04 13:51:06 +01:00
Simon Willnauer	41d7e3a2fe	Expose `search.throttled` on `_cat/indices` (#37073 ) Today it's very difficult to see which indices are frozen or rather throttled via the commonly used monitoring APIs. This change adds a cell to the `_cat/indices` API to render if an index is `search.throttled` Relates to #34352	2019-01-04 13:49:40 +01:00
David Roberts	ff7df40b20	[ML] Uplift model memory limit on job migration (#37126 ) When a 6.1-6.3 job is opened in a later version we increase the model memory limit by 30% if it's below 0.5GB. The migration of jobs from cluster state to the config index changes the job version, so we need to also do this uplift as part of that config migration. Relates #36961	2019-01-04 12:21:28 +00:00
Luca Cavanna	21d52f0dab	Ensure that local cluster alias is never treated as remote (#37121 ) With #36997 we added support for providing a local cluster alias with a `SearchRequest`. We intended to make sure that when provided as part of a search request, the cluster alias would never be used for connection lookups. Yet due to a bug we would still end up looking up the connection from the remote ones. This commit adds a test to make sure that whenever we set the cluster alias to the `SearchRequest` (which can only be done at transport), such alias is used as index prefix in the returned hits. No errors are thrown even though no remote clusters are configured, indicating that such alias is never used for connection look-ups. Also, we add explicit support for the empty cluster alias when printing out index names through `RemoteClusterAware#buildRemoteIndexName`. In fact we don't want to print out `:index` when the cluster alias is set to empty string, but rather `index`. Yet, the semantic of empty string is different compared to `null` as it will still disable final reduction. This will be used in CCS when searching against remote clusters as well as the local one: the local one will have empty prefix yet it will need to disable final reduction so that its results will be properly merged with the ones coming from the remote clusters.	2019-01-04 12:19:31 +01:00
Dimitris Athanasiou	0fd27d4d6f	[ML] Unused state remover should also account for jobs in index (#37119 ) The unused state remover was never adjusted to account for jobs stored in the config index. The result was that when triggered it removed state for all jobs stored in the config index. This commit fixes the issue. Closes #37109	2019-01-04 12:43:44 +02:00
Luca Cavanna	bfe6f091da	[DOCS] fix typo in 7.0.0-alpha2 release notes	2019-01-04 10:38:34 +01:00
David Turner	3f7d6a989a	[Zen2] Elect freshest master in upgrade (#37122 ) Today when electing a master in Zen2 we use the cluster state version to determine whether a node has a fresh-enough cluster state to become master. However the cluster state version is not a reliable measure of freshness in the Zen1 world; furthermore in 6.x the cluster state version is not persisted. This means that when upgrading from 6.x via a full cluster restart a cluster state update may be lost if a stale master wins the initial election. This change fixes this by using the metadata version as a measure of freshness when in term 0, since this is persisted in 6.x and does more reliably indicate the freshness of nodes. It also makes changes parallel to elastic/elasticsearch-formal-models#40 to support situations in which nodes accept cluster state versions in term 0: this does not happen in a pure Zen2 cluster, but can happen in mixed clusters and during upgrades.	2019-01-04 09:09:16 +00:00
Dimitris Athanasiou	586453fef1	[ML] Remove types from datafeed (#36538 ) Closes #34265	2019-01-04 09:43:44 +02:00
lcawley	00997b4f60	[DOCS] Fixes broken links	2019-01-04 17:41:28 +10:00
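
Note on the unit round-trip fixes: three of the commits above (#37171, #37172 and #37192) describe the same failure mode, where a value is converted to a string that either carries a fractional component ("1.5s", "1.5k") or loses precision, and then cannot be read back as the original value. The sketch below reproduces the idea for time values only. It is a minimal illustration under stated assumptions, not the code from those commits: the class `UnitRoundTrip` and the methods `toFractionalString`, `toExactString` and `parseMillis` are hypothetical names invented here, and only milliseconds and seconds are handled.

```java
// Hypothetical, self-contained sketch; none of these names come from the repository.
final class UnitRoundTrip {

    // Lossy-for-parsing formatter: always scales to seconds once the value
    // reaches 1000 ms, so 1500 ms becomes the fractional string "1.5s".
    static String toFractionalString(long millis) {
        if (millis < 1000) {
            return millis + "ms";
        }
        if (millis % 1000 == 0) {
            return (millis / 1000) + "s";
        }
        return (millis / 1000.0) + "s";
    }

    // Unit-preserving formatter: switches to seconds only when the value is a
    // whole number of seconds, otherwise keeps the millisecond unit.
    static String toExactString(long millis) {
        if (millis != 0 && millis % 1000 == 0) {
            return (millis / 1000) + "s";
        }
        return millis + "ms";
    }

    // Strict parser that accepts whole numbers only, mirroring the constraint
    // described in the commit messages (fractional values are not supported).
    static long parseMillis(String text) {
        final long factor;
        final String number;
        if (text.endsWith("ms")) {
            factor = 1;
            number = text.substring(0, text.length() - 2);
        } else if (text.endsWith("s")) {
            factor = 1000;
            number = text.substring(0, text.length() - 1);
        } else {
            throw new IllegalArgumentException("unknown unit in [" + text + "]");
        }
        // Long.parseLong throws NumberFormatException for input such as "1.5".
        return Long.parseLong(number) * factor;
    }

    public static void main(String[] args) {
        long original = 1500;

        // The unit-preserving string can be parsed back to the same value.
        String exact = toExactString(original);
        System.out.println(exact + " parses back to " + parseMillis(exact) + " ms");

        // The fractional string is rejected by the strict parser.
        String fractional = toFractionalString(original);
        try {
            parseMillis(fractional);
        } catch (NumberFormatException e) {
            System.out.println(fractional + " cannot be parsed back");
        }
    }
}
```

The design point is that the formatter and the parser must agree on what is representable: if the parser rejects fractions, the formatter has to fall back to a smaller unit instead of emitting one.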


43589 Commits