OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-18 19:05:06 +00:00

Author	SHA1	Message	Date
David Turner	dd7410d8c2	Disable rebalancing in searchable snapshots tests (#61068 ) Fixes a test failure in which we allocated some shards and then relocated them elsewhere, invalidating an assertion about the recovery statistics which assumed that the shards stayed where they were originally allocated. Closes #61067.	2020-08-13 09:08:27 +01:00
Lee Hinman	e3df64a429	[7.x] Add data tiers (hot, warm, cold, frozen) as custom node roles (#60994 ) (#61045 ) This commit adds the `data_hot`, `data_warm`, `data_cold`, and `data_frozen` node roles to the x-pack plugin. These roles are intended to be the base for the formalization of data tiers in Elasticsearch. These roles all act as data nodes (meaning shards can be allocated to them). Nodes with the existing `data` role acts as though they have all of the roles configured (it is a hot, warm, cold, and frozen node). This also includes a custom `AllocationDecider` that allows the user to configure the following settings on a cluster level: - `cluster.routing.allocation.require._tier` - `cluster.routing.allocation.include._tier` - `cluster.routing.allocation.exclude._tier` And in index settings: - `index.routing.allocation.require._tier` - `index.routing.allocation.include._tier` - `index.routing.allocation.exclude._tier` Relates to #60848	2020-08-12 11:06:23 -06:00
Andrei Dan	32173a82c8	ILM: add frozen phase (#60983 ) (#61035 ) This adds a frozen phase to ILM that will allow the execution of the set_priority, unfollow, allocate, freeze and searchable_snapshot actions. The frozen phase will be executed after the cold and before the delete phase. (cherry picked from commit 6d0148001c3481290ed7e60dab588e0191346864) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-12 16:36:27 +01:00
Yannick Welsch	6644f2283d	Do not access snapshot repo on dedicated voting-only master node (#61016 ) Today a snapshot repository verification ensures that all master-eligible and data nodes have write access to the snapshot repository (and can see each other's data) since taking a snapshot requires data nodes and the currently elected master to write to the repository. However, a dedicated voting-only master-eligible node is not a data node and will never be the elected master so we should not require it to have write access to the repository. Closes #59649	2020-08-12 16:56:45 +02:00
Benjamin Trent	4275a715c9	[ML] adjusting inference processor to support foreach usage (#60915 ) (#61022 ) `foreach` processors store information within the `_ingest` metadata object. This commit adds the contents of the `_ingest` metadata (if it is not empty). And will append new inference results if the result field already exists. This allows a `foreach` to execute and multiple inference results being written to the same result field. closes https://github.com/elastic/elasticsearch/issues/60867	2020-08-12 08:34:18 -04:00
markharwood	66098e0bf4	Search fix: query_string regex/wildcard searches not working on wildcard fields (#60959 ) (#61010 ) The Query string parser was not delegating the construction of wildcard/regex queries to the underlying field type. The wildcard field has special data structures and queries that operate on them so cannot rely on the basic regex/wildcard queries that were being used for other fields. Closes #60957	2020-08-12 10:44:52 +01:00
Armin Braun	32423a486d	Simplify and Speed up some Compression Usage (#60953 ) (#61008 ) Use thread-local buffers and deflater and inflater instances to speed up compressing and decompressing from in-memory bytes. Not manually invoking `end()` on these should be safe since their off-heap memory will eventually be reclaimed by the finalizer thread which should not be an issue for thread-locals that are not instantiated at a high frequency. This significantly reduces the amount of byte copying and object creation relative to the previous approach which had to create a fresh temporary buffer (that was then resized multiple times during operations), copied bytes out of that buffer to a freshly allocated `byte[]`, used 4k stream buffers needlessly when working with bytes that are already in arrays (`writeTo` handles efficient writing to the compression logic now) etc. Relates #57284 which should be helped by this change to some degree. Also, I expect this change to speed up mapping/template updates a little as those make heavy use of these code paths.	2020-08-12 11:06:23 +02:00
Andrei Dan	35423a75af	Tests: don't fail if ILM executed the action already (#60916 ) (#60982 ) (cherry picked from commit 8c970ad20f4f55a9c0d6a256aa643ea037281e75) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-12 09:04:04 +01:00
Dimitris Athanasiou	2e18c0f2ac	[7.x][ML] Audit force stopping data frame analytics (#60973 ) (#61004 ) Audits a message when a data frame analytics job is force stopped. Backport of #60973	2020-08-12 07:45:26 +03:00
Yang Wang	c7b0290256	Mute kerberos tests for jdk 8u[262,271) (#60995 ) The Kerberos bug (JDK-8246193) is introduced in JDK 8u262 and fixed in 8u271. This PR mute for any possible releases between these two versions.	2020-08-12 11:50:48 +10:00
Nhat Nguyen	ceaa28e97b	Increase timeout in testFollowIndexWithConcurrentMappingChanges (#60534 ) The test failed because the leader was taking a lot of CPUs to process many mapping updates. This commit reduces the mapping updates, increases timeout, and adds more debug info. Closes #59832	2020-08-11 17:03:22 -04:00
Nhat Nguyen	bf7eecf1dc	Fix synchronization in ShardFollowNodeTask (#60490 ) The leader mapping, settings, and aliases versions in a shard follow-task are updated without proper synchronization and can go backward.	2020-08-11 14:52:52 -04:00
James Rodewig	929f1cc9f9	[DOCS] Remove search request body page (#60972 ) (#60977 )	2020-08-11 13:04:07 -04:00
Francisco Fernández Castaño	d544528c7b	Increase information on assertRecoveryStats assertion (#60960 ) Backport of #60952	2020-08-11 15:30:59 +02:00
Dimitris Athanasiou	6062672148	[7.x][ML] Monitor reindex response in DF analytics (#60911 ) (#60958 ) Examines the reindex response in order to report potential problems that occurred during the reindexing phase of data frame analytics jobs. Backport of #60911	2020-08-11 16:17:37 +03:00
Mark Tozzi	ab8518fb5b	[7.x] Extensibility for Composite Agg #59648 (#60842 )	2020-08-11 09:14:33 -04:00
Dan Hermann	839c6cdfc0	Un-mute data stream REST test (#60120 ) (#60939 )	2020-08-11 08:10:04 -05:00
David Kyle	18a65c5b9a	DFA Get Stats can return multiple responses if more than one error occurs (#60950 ) If the search for get stats with multiple job Ids fails the listener is called for each failure. This change waits for all responses then returns the first error if there was one.	2020-08-11 10:28:05 +01:00
Alan Woodward	54279212cf	Make MetadataFieldMapper extend ParametrizedFieldMapper (#59847 ) (#60924 ) This commit cuts over all metadata field mappers to parametrized format.	2020-08-11 09:02:28 +01:00
Benjamin Trent	66b3e89482	[ML] enable logging for test failures (#60902 ) (#60910 )	2020-08-10 12:36:30 -04:00
Francisco Fernández Castaño	2a4fd8329b	Avoid a race condition while waiting for pre warm to finish on SearchableSnapshotDirectoryTests (#60906 ) Backport of #60885. Closes #60813	2020-08-10 17:29:16 +02:00
Jim Ferenczi	f30f1f04e2	Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce (#60816 ) This commit removes the ability to test the top level result of an aggregator before it runs the final reduce. All aggregator tests that use AggregatorTestCase#search are rewritten with AggregatorTestCase#searchAndReduce in order to ensure that we test the final output (the one sent to the end user) rather than an intermediary result that could be different. This change also removes spurious commits triggered on top of a random index writer. These commits slow down the tests and are redundant with the commits that the random index writer performs.	2020-08-10 17:23:00 +02:00
David Roberts	dd02e9f31a	[TEST] Mute SearchableSnapshotActionIT testSearchableSnapshotForceMergesIndexToOneSegment (#60904 ) Due to https://github.com/elastic/elasticsearch/issues/60901	2020-08-10 15:25:39 +01:00
Henning Andersen	a155315ceb	Autoscaling decider and decision service (#59005 ) (#60884 ) Split the autoscaling decider into a service and configuration in order to enable having additional context information available in the service. Added AutoscalingDeciderContext holding generic information all deciders are expected to need. Implemented GET _autoscaling/decision	2020-08-10 15:28:52 +02:00
Andrei Dan	235e5ed3ea	[7.x] ILM: add force-merge step to searchable snapshots action (#60819 ) (#60882 ) This adds a force-merge step to the searchable snapshot action, enabled by default, but parameterizable using the `force_merge-index" optional boolean. eg. ``` PUT _ilm/policy/my_policy { "policy": { "phases": { "cold": { "actions": { "searchable_snapshot" : { "snapshot_repository" : "backing_repo", "force_merge_index": true } } } } } } ``` (cherry picked from commit d0a17b2d35f1b083b574246bdbf3e1929471a4a9) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-10 13:45:11 +01:00
Martijn van Groningen	64bb082f9b	Improve error message for non append-only writes that target data stream (#60874 ) Backport of #60809 to 7.x branch. Closes #60581	2020-08-10 13:18:59 +02:00
David Kyle	6b2ddf4453	Fix typo in DataHistogramGroupByIT name (#60880 ) (#60883 )	2020-08-10 11:55:01 +01:00
David Turner	f168bdac7d	Change transitive -> transient in ILM log message (#60871 ) "Transitive" is technically ok here but it's an overloaded word and it's not immediately clear which meaning is intended so this log message always makes me do a double-take. I think both "transient" and "transitory" are clearer, with "transient" being the usual choice.	2020-08-10 11:37:49 +01:00
David Turner	a2d5bfca2f	Even longer timeout for XPackRestIT (#60812 ) This suite is still occasionally failing with a timeout on macOS. Suggest further increasing this timeout until this suite is broken up. Relates #58071	2020-08-10 10:26:21 +01:00
James Rodewig	ff4ea4720a	[DOCS] Update example data stream names (#60783 ) (#60820 ) Uses `my-data-stream` in place of `logs` for data stream examples. This provides a more intuitive experience for users that copy/paste their own values into snippets.	2020-08-06 09:38:35 -04:00
Benjamin Trent	bc17afc535	[7.x] [ML] have DELETE analytics ignore stats failures and clean up unused stats (#60776 ) (#60784 ) * [ML] have DELETE analytics ignore stats failures and clean up unused stats (#60776) When deleting an analytics configuration, the request MIGHT fail if the .ml-stats index does not exist or is in strange state (shards unallocated). Instead of making the request fail, we should log that we were unable to delete the stats docs and then have them cleaned up in the 'delete_expire_data' janitorial process	2020-08-06 08:55:35 -04:00
David Turner	05b2a2db8b	AwaitsFix for #60781	2020-08-06 12:28:53 +01:00
David Turner	f24a3a4e81	AwaitsFix for 60781	2020-08-06 11:35:44 +01:00
Hendrik Muhs	b210aaf666	[Transform] remove wrong test (#60807 ) remove test, scripts are excluded in the change collector, the test is a leftover from a previous solution of #57332, which has been discarded relates #60724 fixes #60794	2020-08-06 11:56:19 +02:00
Dimitris Athanasiou	cedbe6968b	[7.x][ML] Include cause in logging during test inference (#60749 ) (#60805 ) When an exception is thrown during test inference we are not including the cause message in our logging. This commit addresses this issue. Backport of #60749	2020-08-06 11:45:59 +03:00
Ryan Ernst	d88098c1d5	Mute flaky transform pivot test see https://github.com/elastic/elasticsearch/issues/60794	2020-08-05 14:53:25 -07:00
James Rodewig	029869eb35	[DOCS] Fix metadata field refs (#60764 ) (#60769 )	2020-08-05 14:04:55 -04:00
Francisco Fernández Castaño	b4044004aa	Add recovery state tracking for Searchable Snapshots (#60751 ) This pull request adds recovery state tracking for Searchable Snapshots. In order to track recoveries for searchable snapshot backed indices, this pull request adds a new type of RecoveryState. This newRecoveryState instance is able to deal with the small differences that arise during Searchable snapshots recoveries. Those differences can be summarized as follows: - The Directory implementation that's provided by SearchableSnapshots mark the snapshot files as reused during recovery. In order to keep track of the recovery process as the cache is pre-warmed, those files shouldn't be marked as reused. - Once the shard is created, the cache starts its pre-warming phase, meaning that we should keep track of those downloads during that process and tie the recovery to this pre-warming phase. The shard is considered recovered once this pre-warming phase has finished. Backport of #60505	2020-08-05 17:41:49 +02:00
Hendrik Muhs	08f94c914b	[Transform] disable optimizations when using scripts in group_by (#60724 ) disable optimizations when using scripts in group_by, when scripts using scripts we can not predict the outcome and we have no query counterpart. Other optimizations for other group_by's are not affected. fixes #57332	2020-08-05 17:27:19 +02:00
Hendrik Muhs	2b6891b584	[7.x][Transform] implement test suite to test continuous transforms (#60725 ) implements a test suite for testing continuous transform with randomization in terms of mappings, index settings, transform configuration. Add a test case for terms and date histogram. The test covers: - continuous mode with several checkpoints created - correctness of results - optimizations (minimal necessary writes) - permutations of features (index settings, aggs, data types, index or data stream)	2020-08-05 16:56:01 +02:00
Albert Zaharovits	e5dce5e805	Use the Index Access Control from the scroll search context (#60640 ) When the RBACEngine authorizes scroll searches it sets the index access control to the very limiting IndicesAccessControl.ALLOW_NO_INDICES value. This change will set it to the value for the index access control that was produced during the authorization of the initial search that created the scroll, which is now stored in the scroll context.	2020-08-05 15:37:37 +03:00
Przemysław Witek	0afa1bd972	Deprecate allow_no_jobs and allow_no_datafeeds in favor of allow_no_match (#60601 ) (#60727 )	2020-08-05 13:39:40 +02:00
Yannick Welsch	9f6f66f156	Fail searchable snapshot shards on invalid license (#60722 ) Implements license degradation behavior for searchable snapshots. Snapshot-backed shards are failed when the license becomes invalid, and shards won't be reallocated. After valid license is put in place again, shards are allocated again.	2020-08-05 13:14:15 +02:00
Adrien Grand	67f6f34c23	Remove dataset.* fields. (#60720 ) These are being replaced by the `data_stream.*` fields.	2020-08-05 11:35:05 +02:00
Rory Hunter	43762f69d1	Move deprecation HTTP tests to deprecation plugin (#60523 ) Backport of #60298. This PR moves the deprecation HTTP tests under the deprecation plugin, as a precursor to adding further tests as part of #58924.	2020-08-05 09:54:34 +01:00
Adrien Grand	602d269059	Rename `datastream` to `data_stream`. (#60714 ) The name of the feature having a space: "data stream", the key should have an underscore.	2020-08-05 09:55:02 +02:00
Russ Cam	e9c0bf1566	Remove body from indices.create_data_stream REST spec (#60705 ) This commit removes the body property from the indices.create_data_stream.json REST API spec as the API does not support sending a body. Update the description of the API to remove that a data stream can be updated with the API - data streams can only be created with this API and attempting to update yields a `resource_already_exists_exception`. Closes #60704 (cherry picked from commit 2cab2e0ee094769852df31566dbe22b5df59d900)	2020-08-05 17:01:28 +10:00
Igor Motov	959690a64a	Refactor extendedBounds to use DoubleBounds (#60556 ) (#60681 ) Refactors extendedBounds to use DoubleBounds instead of 2 variables. This is a follow up for #59175	2020-08-04 16:45:47 -04:00
Francisco Fernández Castaño	b500b3d55a	Decrease restore rate limit value to enforce its usage on SearchableSnapshotsIntegTests#testMaxRestoreBytesPerSecIsUsed (#60650 ) Fixes #59287. Backport of #59592	2020-08-04 17:44:47 +02:00
Alan Woodward	b3ae5d26bd	Move mapper validation to the mappers themselves (#60072 ) (#60649 ) Currently, validation of mappers (checking that cross-references are correct, limits on field name lengths and object depths, multiple definitions, etc) is performed by the MapperService. This means that any mapper-specific validation, for example that done on the CompletionFieldMapper, needs to be called specifically from core server code, and so we can't add validation to mappers that live in plugins. This commit reworks the validation framework so that mapper-specific validation is done on the Mapper itself. Mapper gets a new `validate(MappingLookup)` method (already present on `MetadataFieldMapper` and now pulled up to the parent interface), which is called from a new `DocumentMapper.validate()` method. All the validation code currently living on `MapperService` moves either to individual mapper implementations (FieldAliasMapper, CompletionFieldMapper) or into `MappingLookup`, an altered `DocumentFieldMappers` which now knows about object fields and can check for duplicate definitions, or into DocumentMapper which handles soft limit checks.	2020-08-04 14:39:20 +01:00

... 2 3 4 5 6 ...

6188 Commits