OpenSearch

Commit Graph

Author	SHA1	Message	Date
Przemysław Witek	283a1f605c	Rename binary_soft_classification evaluation to outlier_detection (#59951 ) (#59970 )	2020-07-21 15:15:04 +02:00
Rene Groeschke	c6d3af35b9	Simplify gradle test task error reporting (#59760 ) (#59964 ) * Simplify test error reporting - avoid using extra plugin - avoid extra task listener (should be avoided related to #57918 ) - keep all logic in the listener	2020-07-21 14:13:35 +02:00
Yannick Welsch	07784a0b16	CCR recoveries using wrong setting for chunk sizes (#59597 ) The default chunk size for CCR file-based recoveries was wrongly set to 40MB instead of 1MB.	2020-07-21 13:56:06 +02:00
Rene Groeschke	60d46f1d13	Remove superflous enforce deprecation failure plugin (#59770 ) (#59898 ) - With enforcing the build to fail on all gradle deprecated api usage we do not need this tailored plugin anymore	2020-07-21 12:54:17 +02:00
Armin Braun	cefaa17c52	Simplify CheckSumBlobStoreFormat and make it more Reusable (#59888 ) (#59950 ) Refactored `CheckSumBlobStoreFormat` so it can more easily be reused in other functionality (i.e. upcoming repair logic). Simplified away constant `failIfAlreadyExists` parameter and removed the atomic write method and its tests. The atomic write method was only used in a single spot and that spot has now been adjusted to work the same way writing root level metadata works.	2020-07-21 11:20:56 +02:00
Armin Braun	5b92596fad	Cleanup and Optimize Multiple Serialization Spots (#59626 ) (#59936 ) Follow up to #59606 using some of the new infrastructure and making similar cleanups (and due to at times better handling of size hints and empty collections also optimizations in the stream utility methods this also means speedups) in various spots in the core codebase.	2020-07-21 10:06:56 +02:00
Julie Tibshirani	8647872a1e	Simplify structure for parsing points. (#59938 ) Previously we constructed a GeometryFormat object and delegated point parsing to it. This wasn't a good fit conceptually because each GeometryFormat instance didn't represent a distinct point format.	2020-07-20 17:11:43 -07:00
Lisa Cawley	fb212269ce	[DOCS] Changes level offset of anomaly detection pages (#59911 ) (#59940 )	2020-07-20 17:04:59 -07:00
Julie Tibshirani	8dc5880c3f	Add 'point' to the top-level field type docs. (#59731 ) Before it was missing from the list. This PR also renames the 'geo data types' section to 'spatial data types' and consolidates the geo and cartesian types into that section.	2020-07-20 16:30:12 -07:00
Ryan Ernst	b7d43c5eae	Add --force-yes to apt commands (#59557 ) We use -y to automate apt install commands within vagrant provisioning. However, this is sometimes insufficient, for example when a gpg signature signing the package expires. This commit adds the extra --force-yes flag, to tell apt-get we really mean business. closes #59495	2020-07-20 16:27:37 -07:00
Tal Levy	c9ac4bf7c8	Reduce memory usage of GeoGridTiler tests (#59921 ) This PR further reduces the memory footprint of the testGeoHashGridCircuitBreaker test such that only 0.26% of the randomized runs result in memory usage of between 500kb-1mb. where most of that those that are in that range produce ~650kb of usage. Before, 3% of the runs would use > 50mb of memory resulting in OOMs in CI Closes #59853.	2020-07-20 15:45:39 -07:00
Lisa Cawley	9633d503d8	[DOCS] Changes level offset for anomaly detection APIs (#59920 ) (#59928 )	2020-07-20 13:10:54 -07:00
Lisa Cawley	8f8d24b3c1	[DOCS] Changes level offset in data frame analytics APIs (#59919 ) (#59923 )	2020-07-20 13:06:29 -07:00
James Rodewig	ff8a042580	[DOCS] Reformat agg snippets to use two-space indents (#59912 ) (#59922 )	2020-07-20 15:59:00 -04:00
James Rodewig	bb80c56e04	[DOCS] Reformat Plugin snippets to use two-space indents (#59895 ) (#59916 )	2020-07-20 15:06:12 -04:00
James Rodewig	24fec52447	[DOCS] Add performance warning for scripts (#59890 ) (#59913 )	2020-07-20 15:05:33 -04:00
Armin Braun	e16e565c5e	Fix Snapshot Status API Docs Test (#59902 ) (#59908 ) The clock resolution for this API is our default 200ms. It is unlikely but possible that a shard snapshot starts and ends on separate clock ticks and that breaks the test. Just allowing any value here seems fine to me (seems we can't match for integer specifically).	2020-07-20 18:43:40 +02:00
Jay Modi	515b53d297	Fix race in SLM master/cluster state listeners (#59896 ) This change fixes two possible race conditions in SLM related to how local master changes and cluster state events are observed. When implementing the `LocalNodeMasterListener` interface, it is only recommended to execute on a separate threadpool if the operations are heavy and would block the cluster state thread. SLM specified that the listeners should run in the Snapshot thread pool, but the operations in the listener were lightweight. This had the side effect of causing master changes to be delayed if the Snapshot threads were all busy and could also potentially cause the `onMaster` and `offMaster` calls to race if both were queued and then executed concurrently. Additionally, the `SnapshotLifecycleService` is also a `ClusterStateListener` and there is currently no order of operations guarantee between `LocalNodeMasterListeners` and `ClusterStateListeners` so this could lead to incorrect behavior. The resolution for these two issues is that the SnapshotRetentionService now specifies the `SAME` executor for its implementation of the `LocalNodeMasterListener` interface. The `SnapshotLifecycleService` is no longer a `LocalNodeMasterListener` and instead tracks local master changes in its `ClusterStateListner`. Backport of #59801	2020-07-20 09:59:46 -06:00
Igor Motov	96a5284484	Add hard_bounds documentation (#59809 ) (#59883 ) Fixes #59774	2020-07-20 10:51:23 -04:00
Nik Everett	b2ca19484a	Allocate slightly less per bucket (#59740 ) (#59873 ) This replaces that data structure that we use to resolve bucket ids in bucketing aggs that are inside other bucketing aggs. This replaces the "legoed together" data structure with a purpose built `LongLongHash` with semantics similar to `LongHash`, except that it has two `long`s as keys instead of one. The microbenchmarks show a fairly substantial performance gain on the hot path, around 30%. Rally's higher level benchmarks show anywhere from 0 to 7% speed improvements. Not as much as I'd hoped, but nothing to sneeze at. And, after all, we all allocating slightly less data per owningBucketOrd, which is always nice.	2020-07-20 10:43:11 -04:00
Nik Everett	fcd8b5fe6e	Fix top_metrics when metric is missing (backport of #59471 ) (#59881 ) This fixes a null pointer exception when the metric is missing for the latest document returned by `top_metrics`. Closes #58926	2020-07-20 10:42:58 -04:00
Nik Everett	fe10141108	Document supported scenarios for CCS (#58120 ) (#59886 ) Documents the supported scenarios for CCS. Co-authored-by: Adam Locke <adam.locke@elastic.co>	2020-07-20 10:41:53 -04:00
Stéphane Campinas	bcebdfe5b1	fix handling of alias filter in SearchService#canMatch (#59368 ) The check against the alias filter should be done after the request is rewritten. Close #59367	2020-07-20 16:25:15 +02:00
David Turner	b75207a09f	Remove sporadic min/max usage estimates from stats (#59755 ) Today `GET _nodes/stats/fs` includes `{least,most}_usage_estimate` fields for some nodes. These fields have rather strange semantics. They are only reported on the elected master and on nodes that have been the elected master since they were last restarted; when a node stops being the elected master these stats remain in place but we stop updating them so they may become arbitrarily stale. This means that these statistics are pretty meaningless and impossible to use correctly. Even if they were kept up to date they're never reported for data-only nodes anyway, despite the fact that data nodes are the ones where we care most about disk usage. The information needed to compute the path with the least/most available space is already provided in the rest the stats output, so we can treat the inclusion of these stats as a bug and fix it by simply removing them in this commit. Since these stats were always optional and mostly omitted (for opaque reasons) this is not considered a breaking change.	2020-07-20 15:22:04 +01:00
James Rodewig	e7c7ed6493	[DOCS] Fix `requests_per_second` reindex param (#59871 ) (#59876 ) Corrects the `requests_per_second` query parameter used in the reindex, delete by query, and update by query API docs. The parameter defaults to `-1` (no throttle). `0` is not an allowed value.	2020-07-20 10:08:51 -04:00
James Rodewig	76b2dd23e2	[DOCS] Document data stream stats API (#59435 ) (#59874 )	2020-07-20 09:50:26 -04:00
James Rodewig	32c8df68ba	[DOCS] Fix erroneous data stream ref (#59805 ) (#59868 ) Removes an erroneous data stream reference added in #58513. While technically possible, we don't encourage using date math to name data streams.	2020-07-20 09:30:30 -04:00
James Rodewig	82a8d9aa0c	[DOCS] Fix keyword marker docs (#59834 ) (#59863 ) Co-authored-by: Rui Almeida <ruial@outlook.com>	2020-07-20 09:27:42 -04:00
James Rodewig	828aa6f640	[DOCS] EQL: Remove collapsible sections from EQL search docs (#59819 ) (#59861 )	2020-07-20 09:26:32 -04:00
James Rodewig	a160daa5d9	[DOCS] Remove collapsible examples (#59820 ) (#59857 ) Snippets are now visible without additional clicks.	2020-07-20 09:14:36 -04:00
Rene Groeschke	e31ebc96f9	Enforce fail on deprecated gradle usage (7.x backport) (#59758 ) * Enforce fail on deprecated gradle usage (#59598) * Fix branch specific deprecated gradle api usages * Fix archiveVersion property usage	2020-07-20 08:52:30 +02:00
Albert Zaharovits	3ffb20bdfc	Fix DLS/FLS permission for the submit async search action (#59693 ) The submit async search action should not populate the thread context DLS/FLS permission set, because it is not currently authorised as an "indices request" and hence the permission set that it builds is incomplete and it overrides the DLS/FLS permission set of the actual spawned search request (which is built correctly).	2020-07-20 09:37:26 +03:00
Nhat Nguyen	120fe96402	Log shard list in testIndexVersionPropagation Relates #59494	2020-07-18 23:23:57 -04:00
Costin Leau	9cc80621c3	EQL: Fix matching of tail/desc queries (#59827 ) When dealing with tail queries, data is returned descending for the base criterion yet the rest of the queries are ascending. This caused a problem during insertion since while in a page, the data is ASC, between pages the blocks of data is DESC. This caused incorrectly sorting inside a SequenceGroup which led to incorrect results. Further more in case of limit, since the data in a page is ASC, early return is not possible neither is desc matching. Thus the page needs to be consumed first before finding the final results. A future improvement could be to keep only the top N results dropping the rest during insertion time. (cherry picked from commit 77c88da054a1ce662a264f72cde5986d4ce37e3a)	2020-07-19 00:49:16 +03:00
Lee Hinman	8c7d414a3b	[7.x] Fix retrieving data stream stats for a DS with multiple backing indices (#59806 ) (#59810 ) Backports the following commits to 7.x: Fix retrieving data stream stats for a DS with multiple backing indices (#59806)	2020-07-17 16:56:07 -06:00
Ryan Ernst	e963918830	Ensure precommit runs as part of check (#59476 ) Precommit is setup to run as a dependency of the check task, but unfortunately this wiring was only happening when the java plugin (but not java-base plugin) was applied. This commit moves the wiring to occur whenever the check task exists, which is with the lifecycle-base plugin.	2020-07-17 15:41:37 -07:00
Nik Everett	514b2f3414	Clean up a few of vwh's rough edges (#59341 ) (#59807 ) This cleans up a few rough edged in the `variable_width_histogram`, mostly found by @wwang500: 1. Setting its tuning parameters in an unexpected order could cause the request to fail. 2. We checked that the maximum number of buckets was both less than 50000 and MAX_BUCKETS. This drops the 50000. 3. Fixes a divide by 0 that can occur of the `shard_size` is 1. 4. Fixes a divide by 0 that can occur if the `shard_size * 3` overflows a signed int. 5. Requires `shard_size * 3 / 4` to be at least `buckets`. If it is less than `buckets` we will very consistently return fewer buckets than requested. For the most part we expect folks to leave it at the default. If they change it, we expect it to be much bigger than `buckets`. 6. Allocate a smaller `mergeMap` in when initially bucketing requests that don't use the entire `shard_size * 3 / 4`. Its just a waste. 7. Default `shard_size` to `10 * buckets` rather than `100`. It looks like that was our intention the whole time. And it feels like it'd keep the algorithm humming along more smoothly. 8. Default the `initial_buffer` to `min(10 * shard_size, 50000)` like we've documented it rather than `5000`. Like the point above, this feels like the right thing to do to keep the algorithm happy. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-17 15:16:09 -04:00
Lee Hinman	f6b08a3115	[7.x] Allow simulating existing composable index template (#59733 ) (#59798 ) Backports the following commits to 7.x: Allow simulating existing composable index template (#59733)	2020-07-17 13:10:07 -06:00
Adam Locke	29ff05cbac	[7.x] [DOCS] Update snapshot/restore docs to align with API changes (#59730 ) (#59803 ) * [DOCS] Updating snapshot/restore pages to align with API changes (#59730) * Updating snapshot/restore pages to align with API changes. * Fixing texts in delete snapshot page. * Removing duplicate code sample and making editorial changes. * Change "deleted" to "delete" * Incorporating review feedback and making minor editorial changes. * Remove titleabbrev * Add paragraph break * Remove titleabbrev from restore page * Remove titleabbrev from create page * Change "Create" to lowercase * Change API names to lowercase * Remove extraneous delimiters * Change "Delete" to lowercase * Single-sourcing warning and clarifying warning text. * Fixing tests and removing erroneous example.	2020-07-17 14:33:18 -04:00
Nik Everett	95e6e4a452	Small cleanup for IndexFieldData (#59724 ) (#59800 ) This drops `IndexComponent` from `IndexFieldData` because it wasn't doing anything other than forcing us to perform a bunch of ceremony to build them.	2020-07-17 13:38:15 -04:00
Tal Levy	c9ab7bb651	Fix bug in circuit-breaker check for geoshape grid aggregations (#57962 ) (#59741 ) There was a bug in the geoshape circuit-breaker check where the hash values array was being allocated before its new size was accounted for by the circuit breaker. Fixes #57847.	2020-07-17 09:26:00 -07:00
Dan Hermann	48df9b1a0e	Update regex file for es user agent node processor (#59697 ) (#59794 )	2020-07-17 11:04:01 -05:00
James Rodewig	40f319a1f6	[DOCS] Reformat Painless snippets to use two-space indents (#59776 ) (#59791 )	2020-07-17 11:31:37 -04:00
Christoph Büscher	f4ff5fe93b	Add `zero_terms_query` support to `match_phrase_prefix` (#58822 ) (#59784 ) Currently `match_phrase_prefix` doesn't support `zero_terms_query` like the other match-type queries. This change adds this support. Closes #58468	2020-07-17 17:23:23 +02:00
Adam Locke	6ccf3548e7	Fix Snapshot Status API Docs Test (#59775 ) (#59787 ) Introduce a fix to tests by snapshotting a single index+shard in the snapshot that we get the status for and verifying consistency instead of equality for total file counts. Co-authored-by: Armin Braun <me@obrown.io>	2020-07-17 11:11:23 -04:00
James Rodewig	a672a2a2d4	[DOCS] Move highlighting docs to separate page (#59768 ) (#59781 ) Moves the highlighting docs from the deprecated 'Request Body Search' chapter to the new subpage of the 'Run a search chapter' section. No substantive changes were made to the content.	2020-07-17 10:57:00 -04:00
Benjamin Trent	b7f30fc929	[7.x] Adding new `require_alias` option to indexing requests (#58917 ) (#59769 ) * Adding new `require_alias` option to indexing requests (#58917) This commit adds the `require_alias` flag to requests that create new documents. This flag, when `true` prevents the request from automatically creating an index. Instead, the destination of the request MUST be an alias. When the flag is not set, or `false`, the behavior defaults to the `action.auto_create_index` settings. This is useful when an alias is required instead of a concrete index. closes https://github.com/elastic/elasticsearch/issues/55267	2020-07-17 10:24:58 -04:00
Alan Woodward	65f6fb8e94	Shortcut mapping update if the incoming mapping version is the same as the current mapping version (#59517 ) (#59772 ) Currently, when we apply a cluster state change to a shard on a non-master node, we check to see if the mappings need to be updated by comparing the decompressed serialized mappings from the update against the serialized version of the shard's existing mappings. However, we already have a much simpler way of checking this, by comparing mapping versions on the index metadata of the old and new states. This commit adds a shortcut to MapperService.updateMappings() that compares these mapping versions, and ignores the merge if they are equal.	2020-07-17 14:53:09 +01:00
Alan Woodward	b29d368b52	Convert DateFieldMapper to parametrized format (#59429 ) (#59759 ) This commit makes DateFieldMapper extend ParametrizedFieldMapper, declaring its parameters explicitly. As well as changes to DateFieldMapper itself, there are some changes to dynamic mapping code to ensure that dynamically detected date formats are passed through to new date mapper builders.	2020-07-17 12:46:18 +01:00
Andrei Dan	301d61a98e	Tests: fix TimeSeriesDataStreamsIT.testShrinkActionInPolicyWithoutHotPhase (#59603 ) (#59689 ) The ILM policy for the source and shrunk indices run separately (ie. they are two separate managed indices). This fixes the test which exhibited some flakiness by allowing some time for the ILM policy for the source index to finish executing. (cherry picked from commit c78d5e8499fc5ca2ca1314f97bcc6b55ba06e2e7) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-07-17 11:26:06 +01:00

... 6 7 8 9 10 ...

53150 Commits All Branches Search

53150 Commits

All Branches