OpenSearch

Commit Graph

Author	SHA1	Message	Date
Armin Braun	c06c9fb966	Fix BwC Snapshot INIT Path (#60006) There were two subtle bugs here from backporting #56911 to 7.x. 1. We passed `null` for the `shards` map which isn't nullable any longer when creating `SnapshotsInProgress.Entry`, fixed by just passing an empty map like the `null` handling did in the past. 2. The removal of a failed `INIT` state snapshot from the cluster state tried removing it from the finalization loop (the set of repository names that are currently finalizing). This will trip an assertion since the snapshot failed before its repository was put into the set. I made the logic ignore the set in case we remove a failed `INIT` state snapshot to restore the old logic to exactly as it was before the concurrent snapshots backport to be on the safe side here. Also, added tests that explicitly call the old code paths because, as can be seen from initially missing this, the BwC tests will only run in the configuration new version master, old version nodes every so often and having a deterministic test for the old state machine seems the safest bet here. Closes #59986	2020-07-22 10:09:55 +02:00
Rene Groeschke	b210af8389	Update Gradle configurations section in CONTRIBUTING (#59906 )	2020-07-22 09:15:32 +02:00
Rene Groeschke	3fe6635b92	Remove stale gradle plugin descriptor	2020-07-22 09:10:01 +02:00
James Baiera	1c1a4297e0	Track backing indices in data streams stats from cluster state (#59817 ) (#60015 ) If shard level results are incomplete in the data streams stats call, it is possible to get inaccurate counts of the number of backing indices, despite this data being accurate and available in the cluster state.	2020-07-21 23:21:33 -04:00
Emily Li	5f27a95346	Fix grammar mistake in SQL data type docs. (#60028 ) Remove an extra 'when'.	2020-07-21 16:15:06 -07:00
Jake Landis	55216dabb4	[7.x] Per processor description for verbose simulate (#58207 ) (#60008 ) For ingest node processors a per processor description was recently added. This commit displays that description in the verbose output of the pipeline simulation. related #57906	2020-07-21 17:32:45 -05:00
James Rodewig	293cb8d48c	[DOCS] Fix typo in thread pools docs (#59944 ) (#60019 ) Fix typo where available processors should be allocated processors. Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>	2020-07-21 17:04:36 -04:00
James Rodewig	401e12dc2b	[DOCS] Fix data stream docs (#59818 ) (#60010 )	2020-07-21 17:04:13 -04:00
James Rodewig	04c68ba740	[DOCS] Update search docs to use `my-index` dataset (#60005 ) (#60012 )	2020-07-21 16:14:44 -04:00
Nik Everett	49f365ddfd	Fix bug in deep pipeline agg serialization (#59984 ) In #54716 I removed pipeline aggregators from the aggregation result tree and caused us to read them from the request. This saves a bunch of round trip bytes, which is neat. But there was a bug in the backwards compatibility logic. You see, we still have to give the pipeline aggregations to nodes older than 7.8 over the wire because that is how they know what pipelines to run. They have the pipelines in the request but they don't read them. They use the ones in the response tree. Anyway, we had a bug where we were never sending pipelines defined two levels down. So while you are upgrading the pipeline wouldn't run. Sometimes. If the data node of the "first" result was post-7.8 and the coordinating node was pre-7.8. This fixes the bug.	2020-07-21 16:03:15 -04:00
James Baiera	b3363cf8f9	[7.x] Remove unneeded rest params from Data Stream Stats (#59575 ) (#59661 ) This PR removes the expand_wildcards and forbid_closed_indices parameters from the Data Streams Stats REST endpoint. These options are required for broadcast requests, but are not needed for anything in terms of resolving data streams. Instead, we just set a default set of IndicesOptions on the transport request.	2020-07-21 15:59:16 -04:00
James Rodewig	b302b09b85	[DOCS] Reformat snippets to use two-space indents (#59973 ) (#59994 )	2020-07-21 15:49:58 -04:00
David Roberts	606b7ea139	[DOCS] Adds extra ml-cpp PRs to release notes (#59967 )	2020-07-21 11:47:36 -07:00
Tim Brooks	ed315442ac	Update thread pool docs about WRITE queue size (#59643 ) This commit updates the thread pool documentation to reflect the change in the WRITE thread pool default queue size.	2020-07-21 12:38:03 -06:00
James Rodewig	32d7fa1541	[DOCS] Introduce basic ECS logs test (#59713 ) (#59997 ) Adds a new `my-index-00001` REST test for docs snippets. This test can serve as a lightweight replacement for our existing `twitter` REST tests. The new dataset is: * Based on Apache logs, which is better aligned with Elastic use cases * Compliant with ECS * Similar to the existing `twitter` data set, containing the same field data types * Lightweight, which should keep existing test runtimes roughly the same Also updates the search API reference docs to use the new test.	2020-07-21 13:25:53 -04:00
Armin Braun	5613e4b00b	Increase Timeout in testSLMRetentionAfterRestore (#59979 ) (#59991 ) This test failed by hitting the 10s default busy assert timeout. Given how involved the retention run is (multiple disk reads, CS updates etc.) we should have a higher timeout here. Also, removed the pointless delete call for the snapshot that we just asserted is gone, at the end of the test. Closes #59956	2020-07-21 18:19:18 +02:00
David Turner	dde568caf7	Fix scheduling of ClusterInfoService#refresh (#59880 ) Today the `InternalClusterInfoService` uses the `LocalNodeMasterListener` interface to start/stop its operations. Since the `onMaster` and `offMaster` methods are called on the `MANAGEMENT` threadpool, there's no guarantee that they run in the correct sequence, which could result in an elected master failing to regularly update the cluster info. Since this service is also a `ClusterStateListener` we may as well drop the usage of the `LocalNodeMasterListener` interface and simply update the status of the local node on the applier thread in `clusterChanged` to ensure consistency. Additionally, today the `InternalClusterInfoService` uses a simple flag to track whether the local node is the elected master or not. If the node stops being the master and then starts again within a few seconds then the scheduled updates from the old mastership might carry on running in addition to the ones for the new mastership. This commit addresses that by tracking the identity of the scheduled update job and creating a new job for each mastership.	2020-07-21 17:14:49 +01:00
James Rodewig	fb40ccf8a4	[DOCS] Mark data stream stats API as stable (#59978 ) (#59987 ) Removes experimental admon from data stream stats API. Relates to #59860.	2020-07-21 11:22:36 -04:00
malpani	0555fef799	Support ignore_keywords flag for word delimiter graph token filter (#59563 ) This commit allows customizing the word delimiter token filters to skip processing tokens tagged as keyword through the `ignore_keywords` flag Lucene's WordDelimiterGraphFilter already exposes. Fix for #59491	2020-07-21 16:11:55 +01:00
Alan Woodward	a0ad1a196b	Wrap up building parametrized TypeParsers (#59977 ) The TypeParser implementations of all ParametrizedFieldMapper descendant classes are essentially the same - stateless, requiring the construction of a Builder object, and calling parse on it before returning it. We can make this easier (and less error-prone) to implement by wrapping the logic up into a final class, which takes a function to produce the Builder from a name and parser context.	2020-07-21 16:00:11 +01:00
James Rodewig	4d646ca819	[DOCS] Fix typo in LDAP config docs (#59953 ) (#59974 ) Co-authored-by: AndyHunt66 <andrew.hunt@elastic.co>	2020-07-21 10:48:08 -04:00
Nik Everett	6f6076e208	Drop some params from IndexFieldData.Builder (backport of #59934 ) (#59972 ) We never used the `IndexSettings` parameter and we only used the `MappedFieldType` parameter to get the name of the field which we already know everywhere where we build the `IFD.Builder`. This allows us to drop a fair bit of ceremony from a couple of tests.	2020-07-21 10:28:59 -04:00
Howard	466e947b0e	[DOCS] Fix missing punctuation in agg docs (#59823 )	2020-07-21 10:19:29 -04:00
Luca Cavanna	5e17f00ecf	Tweak toXContent implementation of ParametrizedFieldMapper (#59968 ) ParametrizedFieldMapper overrides `toXContent` from `FieldMapper`, yet it could override `doXContentBody` and rely on the `toXContent` from the base class. Additionally, this allows to make `doXContentBody` final. Also, toXContent is still overridden only to make it final.	2020-07-21 16:01:51 +02:00
Przemyslaw Gomulka	19fe3e511f	Deprecate camel case date format backport(#59555 ) (#59948 ) Camel case date formats are deprecated and snake case should be used instead. backports #59555	2020-07-21 15:56:44 +02:00
Armin Braun	e37bfe8a5f	Stop Checking if Segment Data Blob Exists before Write (#59905 ) (#59971 ) With uuid named segment data blobs there is no reason to ensure no overwrites are happening for these blobs when writing. On the contrary, at least on Azure this check can conflict with the SDK's retrying and cause upload failures randomly.	2020-07-21 15:23:42 +02:00
Przemysław Witek	283a1f605c	Rename binary_soft_classification evaluation to outlier_detection (#59951 ) (#59970 )	2020-07-21 15:15:04 +02:00
Rene Groeschke	c6d3af35b9	Simplify gradle test task error reporting (#59760 ) (#59964 ) * Simplify test error reporting - avoid using extra plugin - avoid extra task listener (should be avoided related to #57918 ) - keep all logic in the listener	2020-07-21 14:13:35 +02:00
Yannick Welsch	07784a0b16	CCR recoveries using wrong setting for chunk sizes (#59597 ) The default chunk size for CCR file-based recoveries was wrongly set to 40MB instead of 1MB.	2020-07-21 13:56:06 +02:00
Rene Groeschke	60d46f1d13	Remove superfluous enforce deprecation failure plugin (#59770) (#59898) - With enforcing the build to fail on all gradle deprecated api usage we do not need this tailored plugin anymore	2020-07-21 12:54:17 +02:00
Armin Braun	cefaa17c52	Simplify CheckSumBlobStoreFormat and make it more Reusable (#59888 ) (#59950 ) Refactored `CheckSumBlobStoreFormat` so it can more easily be reused in other functionality (i.e. upcoming repair logic). Simplified away constant `failIfAlreadyExists` parameter and removed the atomic write method and its tests. The atomic write method was only used in a single spot and that spot has now been adjusted to work the same way writing root level metadata works.	2020-07-21 11:20:56 +02:00
Armin Braun	5b92596fad	Cleanup and Optimize Multiple Serialization Spots (#59626 ) (#59936 ) Follow up to #59606 using some of the new infrastructure and making similar cleanups (and due to at times better handling of size hints and empty collections also optimizations in the stream utility methods this also means speedups) in various spots in the core codebase.	2020-07-21 10:06:56 +02:00
Julie Tibshirani	8647872a1e	Simplify structure for parsing points. (#59938 ) Previously we constructed a GeometryFormat object and delegated point parsing to it. This wasn't a good fit conceptually because each GeometryFormat instance didn't represent a distinct point format.	2020-07-20 17:11:43 -07:00
Lisa Cawley	fb212269ce	[DOCS] Changes level offset of anomaly detection pages (#59911 ) (#59940 )	2020-07-20 17:04:59 -07:00
Julie Tibshirani	8dc5880c3f	Add 'point' to the top-level field type docs. (#59731 ) Before it was missing from the list. This PR also renames the 'geo data types' section to 'spatial data types' and consolidates the geo and cartesian types into that section.	2020-07-20 16:30:12 -07:00
Ryan Ernst	b7d43c5eae	Add --force-yes to apt commands (#59557 ) We use -y to automate apt install commands within vagrant provisioning. However, this is sometimes insufficient, for example when a gpg signature signing the package expires. This commit adds the extra --force-yes flag, to tell apt-get we really mean business. closes #59495	2020-07-20 16:27:37 -07:00
Tal Levy	c9ac4bf7c8	Reduce memory usage of GeoGridTiler tests (#59921) This PR further reduces the memory footprint of the testGeoHashGridCircuitBreaker test such that only 0.26% of the randomized runs result in memory usage between 500kb and 1mb, and most of the runs in that range produce ~650kb of usage. Before, 3% of the runs would use > 50mb of memory, resulting in OOMs in CI. Closes #59853.	2020-07-20 15:45:39 -07:00
Lisa Cawley	9633d503d8	[DOCS] Changes level offset for anomaly detection APIs (#59920 ) (#59928 )	2020-07-20 13:10:54 -07:00
Lisa Cawley	8f8d24b3c1	[DOCS] Changes level offset in data frame analytics APIs (#59919 ) (#59923 )	2020-07-20 13:06:29 -07:00
James Rodewig	ff8a042580	[DOCS] Reformat agg snippets to use two-space indents (#59912 ) (#59922 )	2020-07-20 15:59:00 -04:00
James Rodewig	bb80c56e04	[DOCS] Reformat Plugin snippets to use two-space indents (#59895 ) (#59916 )	2020-07-20 15:06:12 -04:00
James Rodewig	24fec52447	[DOCS] Add performance warning for scripts (#59890 ) (#59913 )	2020-07-20 15:05:33 -04:00
Armin Braun	e16e565c5e	Fix Snapshot Status API Docs Test (#59902 ) (#59908 ) The clock resolution for this API is our default 200ms. It is unlikely but possible that a shard snapshot starts and ends on separate clock ticks and that breaks the test. Just allowing any value here seems fine to me (seems we can't match for integer specifically).	2020-07-20 18:43:40 +02:00
Jay Modi	515b53d297	Fix race in SLM master/cluster state listeners (#59896) This change fixes two possible race conditions in SLM related to how local master changes and cluster state events are observed. When implementing the `LocalNodeMasterListener` interface, it is only recommended to execute on a separate threadpool if the operations are heavy and would block the cluster state thread. SLM specified that the listeners should run in the Snapshot thread pool, but the operations in the listener were lightweight. This had the side effect of causing master changes to be delayed if the Snapshot threads were all busy and could also potentially cause the `onMaster` and `offMaster` calls to race if both were queued and then executed concurrently. Additionally, the `SnapshotLifecycleService` is also a `ClusterStateListener` and there is currently no order of operations guarantee between `LocalNodeMasterListeners` and `ClusterStateListeners` so this could lead to incorrect behavior. The resolution for these two issues is that the SnapshotRetentionService now specifies the `SAME` executor for its implementation of the `LocalNodeMasterListener` interface. The `SnapshotLifecycleService` is no longer a `LocalNodeMasterListener` and instead tracks local master changes in its `ClusterStateListener`. Backport of #59801	2020-07-20 09:59:46 -06:00
Igor Motov	96a5284484	Add hard_bounds documentation (#59809 ) (#59883 ) Fixes #59774	2020-07-20 10:51:23 -04:00
Nik Everett	b2ca19484a	Allocate slightly less per bucket (#59740) (#59873) This replaces the data structure that we use to resolve bucket ids in bucketing aggs that are inside other bucketing aggs. This replaces the "legoed together" data structure with a purpose built `LongLongHash` with semantics similar to `LongHash`, except that it has two `long`s as keys instead of one. The microbenchmarks show a fairly substantial performance gain on the hot path, around 30%. Rally's higher level benchmarks show anywhere from 0 to 7% speed improvements. Not as much as I'd hoped, but nothing to sneeze at. And, after all, we are allocating slightly less data per owningBucketOrd, which is always nice.	2020-07-20 10:43:11 -04:00
Nik Everett	fcd8b5fe6e	Fix top_metrics when metric is missing (backport of #59471 ) (#59881 ) This fixes a null pointer exception when the metric is missing for the latest document returned by `top_metrics`. Closes #58926	2020-07-20 10:42:58 -04:00
Nik Everett	fe10141108	Document supported scenarios for CCS (#58120 ) (#59886 ) Documents the supported scenarios for CCS. Co-authored-by: Adam Locke <adam.locke@elastic.co>	2020-07-20 10:41:53 -04:00
Stéphane Campinas	bcebdfe5b1	fix handling of alias filter in SearchService#canMatch (#59368 ) The check against the alias filter should be done after the request is rewritten. Close #59367	2020-07-20 16:25:15 +02:00
David Turner	b75207a09f	Remove sporadic min/max usage estimates from stats (#59755) Today `GET _nodes/stats/fs` includes `{least,most}_usage_estimate` fields for some nodes. These fields have rather strange semantics. They are only reported on the elected master and on nodes that have been the elected master since they were last restarted; when a node stops being the elected master these stats remain in place but we stop updating them so they may become arbitrarily stale. This means that these statistics are pretty meaningless and impossible to use correctly. Even if they were kept up to date they're never reported for data-only nodes anyway, despite the fact that data nodes are the ones where we care most about disk usage. The information needed to compute the path with the least/most available space is already provided in the rest of the stats output, so we can treat the inclusion of these stats as a bug and fix it by simply removing them in this commit. Since these stats were always optional and mostly omitted (for opaque reasons) this is not considered a breaking change.	2020-07-20 15:22:04 +01:00
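
Note on commit dde568caf7 (Fix scheduling of ClusterInfoService#refresh): the message describes replacing a simple "am I master" flag with tracking the identity of the scheduled update job, so that updates left over from an earlier mastership stop running. The Python sketch below illustrates only that general pattern. It is not the Elasticsearch implementation, and every name in it (`RefreshScheduler`, `on_master`, `off_master`, `run_refresh`) is hypothetical.

```python
class RefreshScheduler:
    """Toy model: each mastership gets its own job token (hypothetical names)."""

    def __init__(self):
        self._current_job = None  # identity of the job for the current mastership
        self.refresh_count = 0

    def on_master(self):
        # Create a fresh token per mastership instead of flipping a boolean flag.
        job = object()
        self._current_job = job
        return job

    def off_master(self):
        # Dropping the token invalidates any updates still scheduled for it.
        self._current_job = None

    def run_refresh(self, job):
        # A scheduled update runs only if it belongs to the current mastership.
        if job is not self._current_job:
            return False
        self.refresh_count += 1
        return True


scheduler = RefreshScheduler()
old_job = scheduler.on_master()
scheduler.off_master()
new_job = scheduler.on_master()
stale_ran = scheduler.run_refresh(old_job)  # leftover job from the old mastership
fresh_ran = scheduler.run_refresh(new_job)  # job for the current mastership
```

With a plain boolean flag, the leftover job would see "is master" as true again after the quick re-election and keep running alongside the new one; comparing job identity avoids that.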

53076 Commits