OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	c0f8c89172	Introduce shard history retention leases (#37167 ) This commit is the first in a series which will culminate with fully-functional shard history retention leases. Shard history retention leases are aimed at preventing shard history consumers from having to fallback to expensive file copy operations if shard history is not available from a certain point. These consumers include following indices in cross-cluster replication, and local shard recoveries. A future consumer will be the changes API. Further, index lifecycle management requires coordinating with some of these consumers otherwise it could remove the source before all consumers have finished reading all operations. The notion of shard history retention leases that we are introducing here will also be used to address this problem. Shard history retention leases are a property of the replication group managed under the authority of the primary. A shard history retention lease is a combination of an identifier, a retaining sequence number, a timestamp indicating when the lease was acquired or renewed, and a string indicating the source of the lease. Being leases they have a limited lifespan that will expire if not renewed. The idea of these leases is that all operations above the minimum of all retaining sequence numbers will be retained during merges (which would otherwise clear away operations that are soft deleted). These leases will be periodically persisted to Lucene and restored during recovery, and broadcast to replicas under certain circumstances. This commit is merely putting the basics in place. This first commit only introduces the concept and integrates their use with the soft delete retention policy. We add some tests to demonstrate the basic management is correct, and that the soft delete policy is correctly influenced by the existence of any retention leases. We make no effort in this commit to implement any of the following: - timestamps - expiration - persistence to and recovery from Lucene - handoff during primary relocation - sharing retention leases with replicas - exposing leases in shard-level statistics - integration with cross-cluster replication These will occur individually in follow-up commits.	2019-01-07 07:43:57 -08:00
Alpar Torok	a7c3d5842a	Split third party audit exclusions by type (#36763 )	2019-01-07 17:24:19 +02:00
Jason Tedor	a233db7367	Fix handling of fractional byte size value settings (#37172 ) This commit addresses an issue when setting a byte size value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a byte size value setting to a value of 1536 bytes is problematic because internally this is converted to the string "1.5k". When we go to get this setting, we try to parse "1.5k" back to a byte size value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the byte size value to the string and back to the byte size value.	2019-01-07 07:13:50 -08:00
Luca Cavanna	ad4e4fd9b8	Disable bwc tests until #36997 & #37142 backports are complete (#37187 )	2019-01-07 15:59:26 +01:00
Armin Braun	82b1f10eb1	SNAPSHOT+TESTS: Rem. Mock Atomic Writes Randomness (#37011 ) * Randomly doing non-atomic writes causes rare 0 byte reads from `index-N` files in tests * Removing this randomness fixes these random failures and is valid because it does not reproduce a real-world failure-mode: * Cloud-based Blob stores (S3, GCS, and Azure) do not have inconsistent partial reads of a blob, either you read a complete blob or nothing on them * For file system based blob stores the atomic move we do (to atomically write a file) by setting `java.nio.file.StandardCopyOption#ATOMIC_MOVE` would throw if the file system does not provide for atomic moves * Closes #37005	2019-01-07 15:24:23 +01:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Simon Willnauer	ac2e09b25a	Fix suite scope random initializaation (#37163 ) The initialization of a suite scope cluster had some sideffects on subsequent runs which causes issues when tests must be reproduced. This moves the suite scope initialization to a privte random context. Closes #36202	2019-01-07 14:20:17 +01:00
Alpar Torok	21996f7690	Expand on #37058 Use the wrapper	2019-01-07 14:06:06 +02:00
Tanguy Leroux	f5af79b9cd	Merge branch 'master' into close-index-api-refactoring	2019-01-07 12:43:03 +01:00
Armin Braun	617e294133	SNAPSHOT: Make Atomic Blob Writes Mandatory (#37168 ) * With #37066 introducing atomic writes to HDFS repository we can enforce atomic write capabilities on this interface * The overrides on the other three cloud implementations are ok because: * https://docs.aws.amazon.com/AmazonS3/latest/API/RESTObjectPUT.html states that "Amazon S3 never adds partial objects; if you receive a success response, Amazon S3 added the entire object to the bucket." * https://cloud.google.com/storage/docs/consistency states that GCS has strong read-after-write consistency * https://docs.microsoft.com/en-us/rest/api/storageservices/put-block#remarks Azure has the concept of committing blobs, so there's no partial content here either * Relates #37011	2019-01-07 12:11:19 +01:00
David Kyle	7cc749dced	Mute SearchStatsIT.testOpenContexts	2019-01-07 09:47:58 +00:00
Christoph Büscher	60742b12e9	Remove deprecated CardinalityAggregationBuilder#rehash method (#37154 ) It has been deprecated since at least 6.0, is a no-op and unused in the rest of our code.	2019-01-07 10:42:07 +01:00
Christoph Büscher	d18c3d651d	Introduce an `include_type_name` constant (#37155 ) I started referring to this parameter name from various places in #37149 so I think it's a good idea to simplify things by referring to a common constant.	2019-01-07 10:41:40 +01:00
Christoph Büscher	12a105e5ef	Remove deprecated PutIndexTemplateRequestBuilder#setTemplate (#37151 ) The method has been removed since 6.0, there is a direct replacement and it is only used in tests still.	2019-01-07 10:41:04 +01:00
Luca Cavanna	2f4dafa69f	Add support for providing absolute start time to SearchRequest (#37142 ) We have recently added support for providing a local cluster alias to a SearchRequest through a package protected constructor. When executing cross-cluster search requests with local reduction on each cluster, the CCS coordinating node will have to provide such cluster alias to each remote cluster, as well as the absolute start time of the search action in milliseconds from the time epoch, to be used when evaluating date math expressions both while executing queries / scripts as well as when resolving index names. This commit adds support for providing the start time together with the cluster alias. It is a final member in the search request, which will only be set when using cross-cluster search with local reduction (also known as alternate execution mode). When not provided, the coordinating node will determine the current time and pass it through (by calling `System.currentTimeMillis`). Relates to #32125	2019-01-07 10:28:31 +01:00
Tanguy Leroux	19593884ee	[Close Index API] Propagate tasks ids between Freeze, Close and Verify Shard actions (#36630 ) This pull request changes the Freeze Index and Close Index actions so that these actions always requires a Task. The task's id is then propagated from the Freeze action to the Close action, and then to the Verify shard action. This way it is possible to track which Freeze task initiates the closing of an index, and which consecutive verifiy shard are executed for the index closing.	2019-01-07 09:43:50 +01:00
Tanguy Leroux	6347461146	Rename ClusterBlocks.hasGlobalBlock methods (#36941 ) As suggested in #36775, this pull request renames the following methods: ClusterBlocks.hasGlobalBlock(int) ClusterBlocks.hasGlobalBlock(RestStatus) ClusterBlocks.hasGlobalBlock(ClusterBlockLevel) to something that better reflects the property of the ClusterBlock that is searched for: ClusterBlocks.hasGlobalBlockWithId(int) ClusterBlocks.hasGlobalBlockWithStatus(RestStatus) ClusterBlocks.hasGlobalBlockWithLevel(ClusterBlockLevel)	2019-01-07 09:42:27 +01:00
Armin Braun	31c33fdb9b	MINOR: Remove some Deadcode in Gradle (#37160 )	2019-01-07 09:21:25 +01:00
Andrei Stefan	39a072389c	SQL: add sub-selects to the Limitations page (#37012 )	2019-01-07 10:08:51 +02:00
Marios Trivyzas	e778abaac5	SQL: Improve error message when unable to translate to ES query DSL (#37129 ) Improve error message returned to the client when an SQL statement cannot be translated to a ES query DSL. Cases: 1. WHERE clause evaluates to FALSE => No results returned 1. Missing FROM clause => Local execution, e.g.: SELECT SIN(PI()) 3. Special SQL command => Only valid of SQL iface, e.g.: SHOW TABLES Fixes: #37040	2019-01-07 09:21:23 +02:00
Jason Tedor	bf5bc88f50	Fix handling of fractional time value settings (#37171 ) This commit addresses an issue when setting a time value setting using a value that has a fractional component when converted to its string representation. For example, trying to set a time value setting to a value of 1500ms is problematic because internally this is converted to the string "1.5s". When we go to get this setting, we try to parse "1.5s" back to a time value, which does not support fractional values. The problem is that internally we are relying on a method which loses the unit when doing the string conversion. Instead, we are going to use a method that does not lose the unit and therefore we can roundtrip from the time value to the string and back to the time value.	2019-01-06 22:34:52 -08:00
Marios Trivyzas	da3d8fb5b7	SQL: Fix issue with wrong NULL optimization (#37124 ) Logical operators OR and AND as well as conditional functions (COALESCE, LEAST, GREATEST, etc.) cannot be folded to NULL if one of their children is NULL as is the case for most of the functions. Therefore, their nullable() implementation cannot return true. On the other hand they cannot return false as if they're wrapped within an IS NULL or IS NOT NULL expression, the expression will be folded to false and true respectively leading to wrong results. Change the signature of nullable() method and add a third value UKNOWN to handle these cases. Fixes: #35872	2019-01-06 18:29:34 +02:00
Armin Braun	b34e7d4f19	SNAPSHOT+TESTS: Relax Assertion in DisruptionIT (#37144 ) * The retries on the failing master can lead to concurrently trying to create and delete a snapshot, catch this for now to fix this test * closes #36779	2019-01-05 17:52:24 +01:00
Simon Willnauer	0cc877026f	Subclass NIOFSDirectory instead of using FileSwitchDirectory (#37140 ) We don't want two FSDirectories manage pending deletes separately and optimize file listing. This confuses IndexWriter and causes exceptions when files are deleted twice but are pending for deletion. This change move to using a NIOFS subclass that only delegates to MMAP for opening files all metadata and pending deletes are managed on top. Closes #37111 Relates to #36668	2019-01-05 10:15:33 +01:00
Julie Tibshirani	0bac64fbd3	Deprecate the _type field in aggregations. (#37131 )	2019-01-04 13:05:52 -08:00
Michael Basnight	e40193ae66	HLRC: Fix Reindex from remote query logic (#36908 ) The query object was incorrectly added to the remote object in the xcontent. This fix moves the query back into the source, if it was passed in as part of the RemoteInfo. It also adds a IPv6 test for reindex from remote such that we can properly validate this.	2019-01-04 13:37:59 -06:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Adam Thomson	ac4aecc92d	[Docs] Update ingest-node.asciidoc (#37116 )	2019-01-04 19:33:06 +01:00
Simon Willnauer	b4f113d3ea	Don't block on peer recovery on the target side (#37076 ) Today we block using the generic thread-pool on the target side until the source side has fully executed the recovery. We still block on the source side executing the recovery in a blocking fashion but there is no reason to block on the target side. This will release generic threads early if there are many concurrent recoveries happen. Relates to #36195	2019-01-04 13:51:06 +01:00
Simon Willnauer	41d7e3a2fe	Expose `search.throttled` on `_cat/indices` (#37073 ) Today it's very difficult to see which indices are frozen or rather throttled via the commonly used monitoring APIs. This change adds a cell to the `_cat/indices` API to render if an index is `search.throttled` Relates to #34352	2019-01-04 13:49:40 +01:00
David Roberts	ff7df40b20	[ML] Uplift model memory limit on job migration (#37126 ) When a 6.1-6.3 job is opened in a later version we increase the model memory limit by 30% if it's below 0.5GB. The migration of jobs from cluster state to the config index changes the job version, so we need to also do this uplift as part of that config migration. Relates #36961	2019-01-04 12:21:28 +00:00
Luca Cavanna	21d52f0dab	Ensure that local cluster alias is never treated as remote (#37121 ) With #36997 we added support for providing a local cluster alias with a `SearchRequest`. We intended to make sure that when provided as part of a search request, the cluster alias would never be used for connection lookups. Yet due to a bug we would still end up looking up the connection from the remote ones. This commit adds a test to make sure that whenever we set the cluster alias to the `SearchRequest` (which can only be done at transport), such alias is used as index prefix in the returned hits. No errors are thrown despite no remote clusters are configured indicating that such alias is never used for connection look-ups. Also, we add explicit support for the empty cluster alias when printing out index names through `RemoteClusterAware#buildRemoteIndexName`. In fact we don't want to print out `:index` when the cluster alias is set to empty string, but rather `index`. Yet, the semantic of empty string is different compared to `null` as it will still disable final reduction. This will be used in CCS when searching against remote clusters as well as the local one, the local one will have empty prefix yet it will need to disable final reduction so that its results will be properly merged with the ones coming from the remote clusters.	2019-01-04 12:19:31 +01:00
Dimitris Athanasiou	0fd27d4d6f	[ML] Unused state remover should also account for jobs in index (#37119 ) The unused state remover was never adjusted to account for jobs stored in the config index. The result was that when triggered it removed state for all jobs stored in the config index. This commit fixes the issue. Closes #37109	2019-01-04 12:43:44 +02:00
Luca Cavanna	bfe6f091da	[DOCS] fix typo in 7.0.0-alpha2 release notes	2019-01-04 10:38:34 +01:00
David Turner	3f7d6a989a	[Zen2] Elect freshest master in upgrade (#37122 ) Today when electing a master in Zen2 we use the cluster state version to determine whether a node has a fresh-enough cluster state to become master. However the cluster state version is not a reliable measure of freshness in the Zen1 world; furthermore in 6.x the cluster state version is not persisted. This means that when upgrading from 6.x via a full cluster restart a cluster state update may be lost if a stale master wins the initial election. This change fixes this by using the metadata version as a measure of freshness when in term 0, since this is persisted in 6.x and does more reliably indicate the freshness of nodes. It also makes changes parallel to elastic/elasticsearch-formal-models#40 to support situations in which nodes accept cluster state versions in term 0: this does not happen in a pure Zen2 cluster, but can happen in mixed clusters and during upgrades.	2019-01-04 09:09:16 +00:00
Dimitris Athanasiou	586453fef1	[ML] Remove types from datafeed (#36538 ) Closes #34265	2019-01-04 09:43:44 +02:00
lcawley	00997b4f60	[DOCS] Fixes broken links	2019-01-04 17:41:28 +10:00
Peter Dyson	7839cec301	subsequent fix to edit in recent cherry-pick	2019-01-04 17:34:24 +10:00
Peter Dyson	7cc9754d94	fix to edit in recent cherry-pick	2019-01-04 17:26:42 +10:00
Andrei Stefan	3578e69669	SQL: Handle the bwc Joda ZonedDateTime scripting class in Painless (#37024 ) * Handle the bwc Joda ZonedDateTime scripting class in Painless * Integrated the types checking in the already existent method	2019-01-04 09:19:24 +02:00
Peter Dyson	0ff2707c9f	Add Profile API to search speed tuning howto (#29489 ) * Add Profile API to search speed tuning howto Seemed useful to mention the Profile API in the context of tuning for search speed.	2019-01-04 16:49:12 +10:00
Ioannis Kakavas	3c12c372d0	[DOCS] Adjust role mapping docs for SAML (#37083 ) Explicitly mention that file based role mappings cannot be used with the SAML realm.	2019-01-04 08:26:41 +02:00
Julie Tibshirani	ac1c6940d2	Stop automatically nesting mappings in index creation requests. (#36924 ) Now that we unwrap mappings in DocumentMapperParser#extractMappings, it is not necessary for the mapping definition to always be nested under the type. This leniency around the mapping format was added in `2341825358`.	2019-01-03 17:41:28 -08:00
Tal Levy	eaeccd8401	[ILM] Add Freeze Action (#36910 ) This commit adds a new ILM Action for freezing indices in the cold phase. Closes #34630.	2019-01-03 15:00:40 -08:00
Costin Leau	40a30c6f5f	SQL: Preserve original source for each expression (#36912 ) Improve parsing to save the source for each token alongside the location of each Node/Expression for accurate reproducibility of an expression name and source Fix #36894	2019-01-04 00:57:50 +02:00
Armin Braun	7686ee7631	TESTS: Shutdown ThreadPool after TestNodes (#37123 ) * If the threadpool gets shut down before the testnodes we run into an error => fixed by moving to single `After` method * Relates #36976	2019-01-03 22:35:44 +01:00
Benjamin Trent	2c3ce5b9de	[ML] add min_version field to machine learning results index mapping (#37115 ) * ML: add min_version field to mapping * Adding min_version as reserved field name	2019-01-03 14:59:42 -06:00
Igor Motov	d6acd8e15f	Docs: add clarification about geohash use in geohashgrid agg (#36901 ) Adds an example on translating geohashes returned by geohashgrid agg as bucket keys into geo bounding box filters in elasticsearch as well as 3rd party applications. Closes #36413	2019-01-03 15:40:48 -05:00
Julie Tibshirani	54f53d2a51	Make sure to accept empty unnested mappings in create index requests. (#37089 )	2019-01-03 11:53:08 -08:00
Nick Knize	e613bcae43	Remove XLatLonShape classes (#37094 ) This commit removes local XLatLonShape classes and replaces with current LatLonShape classes in latest lucene snapshot	2019-01-03 12:48:36 -06:00

... 2 3 4 5 6 ...

43741 Commits All Branches Search

43741 Commits

All Branches