OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	858dbfc074	[ML][Data Frame] treat bulk index failures as an indexing failure (#44351 ) (#44427 ) * [ML][Data Frame] treat bulk index failures as an indexing failure * removing redundant public modifier * changing to an ElasticsearchException * fixing redundant public modifier	2019-07-16 10:04:28 -05:00
Nhat Nguyen	a2b4687d89	Make peer recovery send file chunks async (#44040 )	2019-07-16 10:43:46 -04:00
Przemysław Witek	34bf6bcec0	Treat big changes in searchCount as significant and persist the document after such changes (#44413 ) (#44424 )	2019-07-16 16:15:32 +02:00
Jake Landis	eb7d43f4cf	Log write failures for watcher history document. (#44129 ) (#44357 ) The failure is correctly getting propagated, this commit adds support to explicitly look for .watch-history failures using the same logging strategy as triggered watch failures.	2019-07-16 08:48:09 -05:00
Lee Hinman	fb0461ac76	[7.x] Add Snapshot Lifecycle Management (#44382 ) * Add Snapshot Lifecycle Management (#43934) * Add SnapshotLifecycleService and related CRUD APIs This commit adds `SnapshotLifecycleService` as a new service under the ilm plugin. This service handles snapshot lifecycle policies by scheduling based on the policies defined schedule. This also includes the get, put, and delete APIs for these policies Relates to #38461 * Make scheduledJobIds return an immutable set * Use Object.equals for SnapshotLifecyclePolicy * Remove unneeded TODO * Implement ToXContentFragment on SnapshotLifecyclePolicyItem * Copy contents of the scheduledJobIds * Handle snapshot lifecycle policy updates and deletions (#40062) (Note this is a PR against the `snapshot-lifecycle-management` feature branch) This adds logic to `SnapshotLifecycleService` to handle updates and deletes for snapshot policies. Policies with incremented versions have the old policy cancelled and the new one scheduled. Deleted policies have their schedules cancelled when they are no longer present in the cluster state metadata. Relates to #38461 * Take a snapshot for the policy when the SLM policy is triggered (#40383) (This is a PR for the `snapshot-lifecycle-management` branch) This commit fills in `SnapshotLifecycleTask` to actually perform the snapshotting when the policy is triggered. Currently there is no handling of the results (other than logging) as that will be added in subsequent work. This also adds unit tests and an integration test that schedules a policy and ensures that a snapshot is correctly taken. Relates to #38461 * Record most recent snapshot policy success/failure (#40619) Keeping a record of the results of the successes and failures will aid troubleshooting of policies and make users more confident that their snapshots are being taken as expected. This is the first step toward writing history in a more permanent fashion. 
* Validate snapshot lifecycle policies (#40654) (This is a PR against the `snapshot-lifecycle-management` branch) With the commit, we now validate the content of snapshot lifecycle policies when the policy is being created or updated. This checks for the validity of the id, name, schedule, and repository. Additionally, cluster state is checked to ensure that the repository exists prior to the lifecycle being added to the cluster state. Part of #38461 * Hook SLM into ILM's start and stop APIs (#40871) (This pull request is for the `snapshot-lifecycle-management` branch) This change allows the existing `/_ilm/stop` and `/_ilm/start` APIs to also manage snapshot lifecycle scheduling. When ILM is stopped all scheduled jobs are cancelled. Relates to #38461 * Add tests for SnapshotLifecyclePolicyItem (#40912) Adds serialization tests for SnapshotLifecyclePolicyItem. * Fix improper import in build.gradle after master merge * Add human readable version of modified date for snapshot lifecycle policy (#41035) * Add human readable version of modified date for snapshot lifecycle policy This small change changes it from: ``` ... "modified_date": 1554843903242, ... ``` To ``` ... "modified_date" : "2019-04-09T21:05:03.242Z", "modified_date_millis" : 1554843903242, ... ``` Including the `"modified_date"` field when the `?human` field is used. Relates to #38461 * Fix test * Add API to execute SLM policy on demand (#41038) This commit adds the ability to perform a snapshot on demand for a policy. This can be useful to take a snapshot immediately prior to performing some sort of maintenance. ```json PUT /_ilm/snapshot/<policy>/_execute ``` And it returns the response with the generated snapshot name: ```json { "snapshot_name" : "production-snap-2019.04.09-rfyv3j9qreixkdbnfuw0ug" } ``` Note that this does not allow waiting for the snapshot, and the snapshot could still fail. It does record this information into the cluster state similar to a regularly trigged SLM job. 
Relates to #38461 * Add next_execution to SLM policy metadata (#41221) * Add next_execution to SLM policy metadata This adds the next time a snapshot lifecycle policy will be executed when retriving a policy's metadata, for example: ```json GET /_ilm/snapshot?human { "production" : { "version" : 1, "modified_date" : "2019-04-15T21:16:21.865Z", "modified_date_millis" : 1555362981865, "policy" : { "name" : "<production-snap-{now/d}>", "schedule" : "/30 * * * ?", "repository" : "repo", "config" : { "indices" : [ "foo-", "important" ], "ignore_unavailable" : true, "include_global_state" : false } }, "next_execution" : "2019-04-15T21:16:30.000Z", "next_execution_millis" : 1555362990000 }, "other" : { "version" : 1, "modified_date" : "2019-04-15T21:12:19.959Z", "modified_date_millis" : 1555362739959, "policy" : { "name" : "<other-snap-{now/d}>", "schedule" : "0 30 2 * ?", "repository" : "repo", "config" : { "indices" : [ "other" ], "ignore_unavailable" : false, "include_global_state" : true } }, "next_execution" : "2019-04-16T02:30:00.000Z", "next_execution_millis" : 1555381800000 } } ``` Relates to #38461 * Fix and enhance tests * Figured out how to Cron * Change SLM endpoint from /_ilm/* to /_slm/* (#41320) This commit changes the endpoint for snapshot lifecycle management from: ``` GET /_ilm/snapshot/<policy> ``` to: ``` GET /_slm/policy/<policy> ``` It mimics the ILM path only using `slm` instead of `ilm`. Relates to #38461 * Add initial documentation for SLM (#41510) * Add initial documentation for SLM This adds the initial documentation for snapshot lifecycle management. It also includes the REST spec API json files since they're sort of documentation. Relates to #38461 * Add `manage_slm` and `read_slm` roles (#41607) * Add `manage_slm` and `read_slm` roles This adds two more built in roles - `manage_slm` which has permission to perform any of the SLM actions, as well as stopping, starting, and retrieving the operation status of ILM. 
`read_slm` which has permission to retrieve snapshot lifecycle policies as well as retrieving the operation status of ILM. Relates to #38461 * Add execute to the test * Fix ilm -> slm typo in test * Record SLM history into an index (#41707) It is useful to have a record of the actions that Snapshot Lifecycle Management takes, especially for the purposes of alerting when a snapshot fails or has not been taken successfully for a certain amount of time. This adds the infrastructure to record SLM actions into an index that can be queried at leisure, along with a lifecycle policy so that this history does not grow without bound. Additionally, SLM automatically setting up an index + lifecycle policy leads to `index_lifecycle` custom metadata in the cluster state, which some of the ML tests don't know how to deal with due to setting up custom `NamedXContentRegistry`s. Watcher would cause the same problem, but it is already disabled (for the same reason). * High Level Rest Client support for SLM (#41767) * High Level Rest Client support for SLM This commit adds HLRC support for SLM. Relates to #38461 * Fill out documentation tests with tags * Add more callouts and asciidoc for HLRC * Update javadoc links to real locations * Add security test testing SLM cluster privileges (#42678) * Add security test testing SLM cluster privileges This adds a test to `PermissionsIT` that uses the `manage_slm` and `read_slm` cluster privileges. Relates to #38461 * Don't redefine vars * Add Getting Started Guide for SLM (#42878) This commit adds a basic Getting Started Guide for SLM. * Include SLM policy name in Snapshot metadata (#43132) Keep track of which SLM policy in the metadata field of the Snapshots taken by SLM. This allows users to more easily understand where the snapshot came from, and will enable future SLM features such as retention policies. 
* Fix compilation after master merge * [TEST] Move exception wrapping for devious exception throwing Fixes an issue where an exception was created from one line and thrown in another. * Fix SLM for the change to AcknowledgedResponse * Add Snapshot Lifecycle Management Package Docs (#43535) * Fix compilation for transport actions now that task is required * Add a note mentioning the privileges needed for SLM (#43708) * Add a note mentioning the privileges needed for SLM This adds a note to the top of the "getting started with SLM" documentation mentioning that there are two built-in privileges to assist with creating roles for SLM users and administrators. Relates to #38461 * Mention that you can create snapshots for indices you can't read * Fix REST tests for new number of cluster privileges * Mute testThatNonExistingTemplatesAreAddedImmediately (#43951) * Fix SnapshotHistoryStoreTests after merge * Remove overridden newResponse functions that have been removed * Fix compilation for backport * Fix get snapshot output parsing in test * [DOCS] Add redirects for removed autogen anchors (#44380) * Switch <tt>...</tt> in javadocs for {@code ...}	2019-07-16 07:37:13 -06:00
Lucas Groenendaal	aa9dd313cf	Fix incorrect node name in docs (#43062 ) After starting up elasticsearch the documentation said that their node name was "6-bjhwl" but in the documentation's output I did not see that node name. Instead I saw the node name as `localhost.localdomain`	2019-07-16 14:58:42 +02:00
david raistrick	ae5a917efe	Add clarification around TESTSETUP docs and error message (#43306 )	2019-07-16 14:58:16 +02:00
Mark Walkom	4a5215d22a	[DOCS] Update id-field.asciidoc (#42482 ) Adding a note around the size limit for `_id`	2019-07-16 14:57:33 +02:00
Dan Fey	8a2d23671a	[DOCS] Update split-index.asciidoc: fix shards example (#41382 ) The max value should be 640 instead of 740 in the shard example:	2019-07-16 14:54:27 +02:00
Tanguy Buchier	078efc9ec4	[DOCS] Clarify refresh_interval new behavior (#43726 ) Update indexing-speed.asciidoc to clarify refresh_interval new behavior	2019-07-16 14:53:46 +02:00
Hendrik Muhs	6c1f740759	[ML-DataFrame] make checkpointing more robust (#44344 ) (#44414 ) make checkpointing more robust: - do not let checkpointing fail if indexes got deleted - treat missing seqNoStats as just created indices (checkpoint 0) - loglevel: do not treat failed updated checks as error fixes #43992	2019-07-16 13:43:13 +02:00
magnusram05	096c03945c	[Docs] Small update to getting-started.asciidoc (#40393 )	2019-07-16 13:40:54 +02:00
Przemysław Witek	3f3a3d3f2b	[7.x] Add DatafeedTimingStats.average_search_time_per_bucket_ms and TimingStats.total_bucket_processing_time_ms stats (#44125 ) (#44404 )	2019-07-16 12:51:29 +02:00
Armin Braun	4a79ccd324	Cleaner Exception Handling on Shard Delete (#44384 ) (#44407 ) * Follow up to #44165 * We should just catch all exceptions here and not return errors after the index-N update went through since a subsequent delete attempt by the user would fail with SnapshotMissingException since the snapshot now appears deleted. Also, `SnapshotException` isn't even thrown in the changed spot it seems in the first place and certainly not the only exception possible.	2019-07-16 12:20:52 +02:00
Armin Braun	940aa71930	Cleanup S3 BlobContainer Listing Logic (#43088 ) (#44406 ) * Cleanup duplication in creating and looping over IO Requests	2019-07-16 12:19:20 +02:00
David Turner	a09389c511	AwaitsFix GatewayIndexStateIT#testJustMasterNode Relates #44416.	2019-07-16 11:02:32 +01:00
Ryan Ernst	c4cf98c538	Convert core security actions to use writeable ActionType (#44359 ) (#44390 ) This commit converts all the StreamableResponseActionType security classes in xpack core to ActionType, implementing Writeable for their response classes. relates #34389	2019-07-16 01:11:13 -07:00
Jason Tedor	be98a12cd0	Do not swallow I/O exception getting authentication (#44398 ) When getting authentication info from the thread context, it might be that we encounter an I/O exception. Today we swallow this exception and return a null authentication info to the caller. Yet, this could be hiding bugs or errors. This commit adjusts this behavior so that we no longer swallow the exception.	2019-07-16 16:14:15 +09:00
Tim Vernum	4b50de2e2e	Document xpack.security.dls.bitset.cache settings (#44400 ) Two new settings were introduced in #43669 to control the behaviour of the Document Level Security BitSet cache. This change adds documentation for these 2 settings. Backport of: #44100	2019-07-16 16:22:25 +10:00
William Brafford	673c63bb00	Restrict testCreateEmptyDirNoPermissions to Unix (#44282 ) (#44297 ) The test EmptyDirTaskTests#testCreateEmptyDirNoPermissions may fail on Windows. However, the test is only meaningful for Unix permissions structures, so we should assume a Unix-family OS and skip the test on Windows. Fixes #44064	2019-07-16 08:48:12 +03:00
David Turner	8d68d1f54d	Cluster health should await events plus other things (#44348 ) Today a cluster health request can wait on a selection of conditions, but it does not guarantee that all of these conditions have ever held simultaneously when it returns. More specifically, if a request sets `waitForEvents()` along with some other conditions then Elasticsearch will respond when the master has processed all the expected pending tasks _and then_ the cluster satisfied the other conditions, but it may be that at the time the cluster satisfied the other conditions there were undesired pending tasks again. This commit adjusts the behaviour of `waitForEvents()` to wait for all the required events to be processed and then, if the resulting cluster state does not satisfy the other conditions, it will wait until there is a cluster state that does and then retry the wait-for-events too.	2019-07-16 06:34:02 +01:00
Armin Braun	5c8275cd2c	Fix Exceptions in EventHandler#postHandling Breaking Select Loop (#44347 ) (#44396 ) * Fix Exceptions in EventHandler#postHandling Breaking Select Loop * We can run into the `write` path for SSL channels when they are not fully registered (if registration fails and a close message is attempted to be written) and thus into NPEs from missing selection keys * This is a quick fix to quiet down tests, a cleaner solution will be incoming for #44343 * Relates #44343	2019-07-16 07:06:26 +02:00
Armin Braun	099d52f3b0	Prevent Confusing Blocked Thread Warnings in MockNioTransport (#44356 ) (#44376 ) * Prevent Confusing Blocked Thread Warnings in MockNioTransport * We can run into a race where the stacktrace collection and subsequent logging happens after the thread has already unblocked thus logging a confusing stacktrace of wherever the transport thread was after it became unblocked * Fixed this by comparing whether or not the recorded timestamp is still the same before and after the stacktrace was recorded and not logging if it already changed	2019-07-16 04:40:50 +02:00
Ryan Ernst	e0b82e92f3	Convert BaseNode(s) Request/Response classes to Writeable (#44301 ) (#44358 ) This commit converts all BaseNodeResponse and BaseNodesResponse subclasses to implement Writeable.Reader instead of Streamable. relates #34389	2019-07-15 18:07:52 -07:00
Ryan Ernst	7e06888bae	Convert testclusters to use distro download plugin (#44253 ) (#44362 ) Test clusters currently has its own set of logic for dealing with finding different versions of Elasticsearch, downloading them, and extracting them. This commit converts testclusters to use the DistributionDownloadPlugin.	2019-07-15 17:53:05 -07:00
Lisa Cawley	753da8feac	[DOCS] Updates terminology for alerting features (#43945 )	2019-07-15 14:47:33 -07:00
Jake Landis	c00b082701	add 7.2.1 release notes (#44367 )	2019-07-15 15:02:56 -05:00
Yannick Welsch	a848fc9bf4	Revert "Add usage stats for frozen indices (#44286 )" This reverts commit `5e73c49ec8`.	2019-07-15 21:41:25 +02:00
Yannick Welsch	7b68bfb4e6	Revert "Add frozen indices usage for all but transport client (#44286 )" This reverts commit `d2d40afc02`.	2019-07-15 21:41:21 +02:00
Yannick Welsch	d2d40afc02	Add frozen indices usage for all but transport client (#44286 ) Backport gone wrong.	2019-07-15 20:49:23 +02:00
Adrien Grand	3734356955	Update release notes.	2019-07-15 20:01:23 +02:00
Lisa Cawley	e7ea49e32f	[DOCS] Removes unnecessary resource definition pages (#44289 )	2019-07-15 10:03:53 -07:00
Julie Tibshirani	141d09ee15	Correct a formatting mistake in the _field_caps docs. (#44303 ) The 'indices' block that was recently added should appear in the top-level of the response, as opposed to being nested under 'fields'.	2019-07-15 09:46:02 -07:00
David Turner	86ee8eab3f	Allow RerouteService to reroute at lower priority (#44338 ) Today the `BatchedRerouteService` submits its delayed reroute task at `HIGH` priority, but in some cases a lower priority would be more appropriate. This commit adds the facility to submit delayed reroute tasks at different priorities, such that each submitted reroute task runs at a priority no lower than the one requested. It does not change the fact that all delayed reroute tasks are submitted at `HIGH` priority, but at least it makes this explicit.	2019-07-15 17:41:39 +01:00
Lisa Cawley	6c7f7d4a10	[DOCS] Adds ml-cpp PRs to release notes (#44354 )	2019-07-15 09:22:36 -07:00
Ryan Ernst	59658daef9	Separate streamable based master node actions (#44313 ) This commit creates new base classes for master node actions whose response types still implement Streamable. This simplifies both finding remaining classes to convert, as well as creating new master node actions that use Writeable for their responses. relates #34389	2019-07-15 09:20:20 -07:00
Yannick Welsch	5e73c49ec8	Add usage stats for frozen indices (#44286 ) Adds usage stats for frozen indices of the form: "frozen_indices" : { "available" : true, "enabled" : true, "indices_count" : 0 }	2019-07-15 17:34:46 +02:00
David Turner	e3d2af64c4	Throw TranslogCorruptedException in more cases (#44217 ) Today we do not throw a `TranslogCorruptedException` in certain cases of translog corruption, such as for a corrupted checkpoint file or when an expected file (either checkpoint or translog) is completely missing. This means that `elasticsearch-shard` will not truncate the translog in those cases. This commit strengthens the translog corruption tests to corrupt and/or delete both translog and checkpoint files, and ensures that a `TranslogCorruptedException` is thrown in all cases. It also sometimes simulates a recovery after a crash while rolling the translog generation, including cases where the rolled checkpoint contains incorrect data. It also adjusts (and renames) `RemoveCorruptedShardDataCommandIT.getDirs()` to return only a single path, since in practice this was the only thing that could happen and yet we were relying on its callers to verify this and not all callers were doing so.	2019-07-15 15:20:33 +01:00
David Kyle	2382701547	Wait for pending tasks in docs tests cleanup (#44123 ) ML and Data Frame tests should wait for pending tasks	2019-07-15 12:04:27 +01:00
Armin Braun	eb1106c465	Stronger Cleanup Shard Snapshot Directory on Delete (#44257 ) (#44337 ) * Stronger Cleanup Shard Snapshot Directory on Delete * Use `RepositoryData` to clean up unreferenced `snap-${uuid}.dat` blobs from shard directories (and index-N) and as a result also clean up data blobs that are only referenced by them * Stop cleaning up anything but index-N on shard snapshot creation to align behavior of shard index-N handling with root path index-N handling	2019-07-15 12:59:38 +02:00
Christoph Büscher	22dc125dad	AnalyzeAction.Response doesn't need to call super.readFrom() (#44331 ) The response's super.writeTo() method was removed in #44092, so the corresponding constructor that reads from a stream shouldn't call super itself, even though its implementation is currently empty.	2019-07-15 11:53:25 +02:00
Armin Braun	7f5d40d235	Avoid Needless Set Instantiation in InboundMessage (#44318 ) (#44329 ) * Avoid Needless Set Instantiation in InboundMessage * When `features` is empty (when there's no xpack) we constantly and needlessly instantiated a few objects here for the empty set on every message	2019-07-15 10:59:51 +02:00
Armin Braun	0cc94a457d	Remove non-SMILE Serialization from ChecksumBlobStoreFormat (#44278 ) (#44326 ) * At least all the way back to 6.x we never use anything but `SMILE` in production code with this class so I removed the more general constructor and removed the format leniency from the deserialization	2019-07-15 10:59:33 +02:00
Tanguy Leroux	76a96c3774	Remove ReusePeerRecoverySharedTest class (#44275 )	2019-07-15 10:29:29 +02:00
Armin Braun	d73e2f9c56	HLRC: Fix '+' Not Correctly Encoded in GET Req. (#33164 ) (#44324 ) * HLRC: Fix '+' Not Correctly Encoded in GET Req. * Encode `+` correctly as `%2B` in URL paths * Keep encoding `+` as space in URL parameters * Closes #33077	2019-07-15 10:21:54 +02:00
Ryan Ernst	fc6a31e141	Use specific version constant for wire bwc check (#44316 ) This commit modifies bwc behavior in FindFileStructureAction to check against a concrete version instead of Version.CURRENT. Checking against Version.CURRENT does not work since it is changing, in addition to it having different meanings on each branch. relates #42501	2019-07-14 19:05:14 -07:00
Nhat Nguyen	2203d447aa	Fail engine if hit document failure on replicas (#43523 ) An indexing on a replica should never fail after it was successfully indexed on a primary. Hence, we should fail an engine if we hit any failure (document level or tragic failure) when processing an indexing on a replica. Relates #43228 Closes #40435	2019-07-14 19:29:16 -04:00
Christoph Büscher	835b7a120d	Fix AnalyzeAction response serialization (#44284 ) Currently we lose information about whether a token list in an AnalyzeAction response is null or an empty list, because we write a 0 value to the stream in both cases and deserialize to a null value on the receiving side. This change fixes this so we write an additional flag indicating whether the value is null or not, followed by the size of the list and its content. Closes #44078	2019-07-14 10:35:11 +02:00
Yogesh Gaikwad	b40b6dd542	Disable repository-hdfs tests in FIPS jvm (#44283 ) Due to https://github.com/elastic/elasticsearch/issues/40079, we need to disable repository-hdfs tests in FIPS jvm.	2019-07-13 20:11:32 +10:00
Hendrik Muhs	33627ef410	re-enable bwc tests	2019-07-13 08:53:44 +02:00

46810 Commits, All Branches