OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-07 13:38:49 +00:00

Author	SHA1	Message	Date
Lisa Cawley	2665bfffce	[DOCS] Fix security links in machine learning APIs (#60098 ) (#60152 )	2020-07-23 16:43:10 -07:00
Nhat Nguyen	bc65b3a590	Increase timeout in AutoFollowIT (#60004 ) It can take more than 10 seconds to auto-follow and create a follow-task on a slow CI. This commit increases timeout in AutoFollowIT by replacing assertBusy with assertLongBusy. Closes #59952	2020-07-23 16:36:53 -04:00
Nhat Nguyen	0fe4d5df67	Increase timeout testFollowIndexWithConcurrentMappingChanges Fixes #59273	2020-07-23 16:22:58 -04:00
Nhat Nguyen	0031dea9cc	Fix race in testSendSnapshotSendsOps (#59831 ) There is a race between increasing and getting the global checkpoint in the test as indexTranslogOperations can be executed concurrently. Closes #59492	2020-07-23 16:22:40 -04:00
Lisa Cawley	cc6edc39a1	[DOCS] Refresh transform screenshots with histograms (#59264 ) (#60145 )	2020-07-23 11:14:50 -07:00
James Rodewig	2e01f652c1	[DOCS] Move search sort docs to separate page (#60123 ) (#60142 ) Moves the search sort docs from the deprecated 'Request Body Search' page to a new subpage of 'Run a search'. No substantive changes were made to the content.	2020-07-23 13:44:47 -04:00
Albert Zaharovits	2eaf5e1c25	[DOCS] Mapping updates are deprecated for ingestion privileges (#60024 ) This PR contains the deprecation notice that `create`, `create_doc`, `index` and `write` ingest privileges do not permit mapping updates in version 8. It also updates the docs description of said privileges. This should've been part of #58784	2020-07-23 19:49:23 +03:00
James Rodewig	988e8c8fc6	[DOCS] Swap `[float]` for `[discrete]` (#60134 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 12:42:33 -04:00
Adrien Grand	716a3d5a21	Mention how CCR can help optimize indexing throughput. (#54870 )	2020-07-23 18:40:40 +02:00
Dimitris Athanasiou	6b9a362ec2	[7.x][ML] Skip test inference if DFA task has been stopped (#60116 ) (#60127 ) If the job is stopped before starting inference on test data, we should skip inference entirely. Backport of #60116	2020-07-23 18:34:09 +03:00
Dan Hermann	ca25f6ae6f	Include the resolve index action in the view_index_metadata privilege (#59785 ) (#60112 )	2020-07-23 08:13:56 -05:00
Dan Hermann	fe12217c7f	[7.x] Move REST specs for data streams (#60111 )	2020-07-23 08:10:54 -05:00
Ignacio Vera	db183c89ed	Refactor HyperLogLogPlusPlus to separate algorithms and internal data representation (#60104 ) (#60109 )	2020-07-23 15:07:05 +02:00
Albert Zaharovits	3ad3a7d268	DOCS audit attributes for API Key authn (#60033 ) This PR describes the new audit entry attributes api_key.id, api_key.name and authentication.type, as well as the meaning of existing attributes when authentication is performed using API keys. This should've been part of #58928	2020-07-23 15:51:40 +03:00
Martijn Laarman	890d35f74d	[DOCS] note breaking change from 7.8.1 in migration guide (#59642 ) Co-authored-by: Adam Locke <adam.locke@elastic.co> (cherry picked from commit b0c34020ed6a10fda8e6efa9af343bd283954ec5)	2020-07-23 13:18:46 +02:00
David Turner	bf7e53a91e	Remove node-level canAllocate override (#59389 ) Today there is a node-level `canAllocate` override which the balancer uses to ignore certain nodes to which it is certain no more shards can be allocated. In fact this override only ignores nodes which have hit the rarely-used `cluster.routing.allocation.total_shards_per_node` limit, so this optimization doesn't have a meaningful impact on real clusters. This commit removes this unnecessary fast path from the balancer, and also removes all the machinery needed to support it.	2020-07-23 08:48:59 +01:00
Armin Braun	43a6ff5eb1	Optimize some Spots around Closing Resources (#60049 ) (#60096 ) The single element `close` calls go through a very inefficient path that includes creating a one element list. `releaseOnce` is only called with a single non-null input in production in two spots so no need for varargs and any complexity here. `ReleasableBytesStreamOutput` does not require any `releaseOnce` wrapping because we already have that kind of logic implemented in `org.elasticsearch.common.util.AbstractArray` (which we were wrapping here).	2020-07-23 08:49:06 +02:00
Julie Tibshirani	aa57bbd422	Consolidate validation for 'docvalue_fields'. (#60065 ) This improves modularity and also fixes some issues when `docvalue_fields` is used within `inner_hits` or the `top_hits` agg: * We previously didn't resolve wildcards in field names. * We also forgot to enforce the limit `index.max_docvalue_fields_search`.	2020-07-22 17:26:58 -07:00
James Rodewig	67b07ec386	[DOCS] Remove SQL access settings page (#60078 ) (#60089 ) This page previously documented `xpack.sql.enabled`. However, in 7.8 and above, `xpack.sql.enabled` is always enabled and the setting has no effect. There is no reason to maintain this page.	2020-07-22 16:59:21 -04:00
James Rodewig	f8976505cb	[DOCS] Correct the default value of `ignore_throttled` param (#60036 ) (#60086 ) Co-authored-by: bellengao <gbl_long@163.com>	2020-07-22 16:53:18 -04:00
James Rodewig	0c9791798d	[7.x] [DOCS] Reformat snippets to use two-space indents (#60080 )	2020-07-22 15:57:49 -04:00
Armin Braun	ebb6677815	Formalize and Streamline Buffer Sizes used by Repositories (#59771 ) (#60051 ) Due to complicated access checks (reads and writes execute in their own access context) on some repositories (GCS, Azure, HDFS), using a hard coded buffer size of 4k for restores was needlessly inefficient. By the same token, the use of stream copying with the default 8k buffer size for blob writes was inefficient as well. We also had dedicated, undocumented buffer size settings for HDFS and FS repositories. For these two we would use a 100k buffer by default. We did not have such a setting for e.g. GCS though, which would only use an 8k read buffer which is needlessly small for reading from a raw `URLConnection`. This commit adds an undocumented setting that sets the default buffer size to `128k` for all repositories. It removes wasteful allocation of such a large buffer for small writes and reads in case of HDFS and FS repositories (i.e. still using the smaller buffer to write metadata) but uses a large buffer for doing restores and uploading segment blobs. This should speed up Azure and GCS restores and snapshots in a non-trivial way as well as save some memory when reading small blobs on FS and HDFS repositories.	2020-07-22 21:06:31 +02:00
Lisa Cawley	9ba017f699	[DOCS] Changes level offset of transform pages (#60066 ) (#60075 )	2020-07-22 11:22:57 -07:00
Tim Brooks	ba01540d7e	Implement human readable indexing pressure stats (#60058 ) The indexing pressure stats do not currently have human readable variants. This commit adds human readable variants and updates the documentation.	2020-07-22 12:07:59 -06:00
James Rodewig	ed10d7407c	[DOCS] Fix shrink index API prereqs (#59985 ) (#60067 )	2020-07-22 14:06:40 -04:00
James Rodewig	74a34777d1	[DOCS] Fix outdated Kibana UI refs and screenshots in security docs (#60023 ) (#60059 )	2020-07-22 13:08:22 -04:00
Tim Brooks	ceb54ed655	Add indexing pressure documentation (#59456 ) This commit adds documentation about the new indexing pressure memory limit setting and exposure of these metrics in node stats.	2020-07-22 10:09:18 -06:00
Adam Locke	0a73225cd8	[DOCS] Adding new page for restore snapshot API (#59937 ) (#60055 ) * Adding new page for restore snapshot API. * Improving test cases, lots of edits, and streamlining content. * Incorporating review suggestions and feedback. * Specify `index alias` vs `alias` * Change parameter order * Provide clarity around regular expression * Add link to SLM parameters * Split sentences in example * Adding link to master node page.	2020-07-22 12:08:55 -04:00
Lisa Cawley	46d33b1586	[DOCS] 7.9.0 release notes (#60053 )	2020-07-22 08:40:59 -07:00
Larry Gregory	a686ccc9b2	[Backport][7.x] Introduce reserved_ml_apm_user kibana privilege (#59854 ) (#60047 )	2020-07-22 11:06:10 -04:00
Jay Modi	c8ef2e18f7	Thread safe clean up of LocalNodeModeListeners (#60007 ) This commit continues on the work in #59801 and makes other implementors of the LocalNodeMasterListener interface thread safe in that they will no longer allow the callbacks to run on different threads and possibly race each other. This also helps address other issues where these events could be queued to wait for execution while the service keeps moving forward thinking it is the master even when that is not the case. In order to accomplish this, the LocalNodeMasterListener no longer has the executorName() method to prevent future uses that could encounter this surprising behavior. Each use was inspected and if the class was also a ClusterStateListener, the implementation of LocalNodeMasterListener was removed in favor of a single listener that combined the logic. A single listener is used and there is currently no guarantee on execution order between ClusterStateListeners and LocalNodeMasterListeners, so a future change there could cause undesired consequences. For other classes, the implementations of the callbacks were inspected and if the operations were lightweight, the overridden executorName method was removed to use the default, which runs on the same thread. Backport of #59932	2020-07-22 08:02:18 -06:00
Luca Cavanna	702c997819	ParametrizedFieldMapper to run validators against default value (#60042 ) Sometimes there is the need to make a field required in the mappings, and validate that a value has been provided for it. This can be done through a validator when using ParametrizedFieldMapper, but validators need to run also when a value for a field has not been specified. Relates to #59332	2020-07-22 14:12:38 +02:00
Dimitris Athanasiou	7e652ca873	[7.x][ML] Include same fields during test inference as in training (#… (#60034 ) In #58877, when we switched test inference on java, we just use the doc's `_source` as features. However, this could be missing out on features that were used during training, e.g. alias fields, etc. This commit addresses this by extracting fields to use as features during inference the same way they are extracted in `DataFrameDataExtractor` when they are used for training. Backport of #59963	2020-07-22 12:54:13 +03:00
David Roberts	7358f9fb05	[ML] Mute ForecastIT.testOverflowToDisk in EAR builds (#60040 ) Due to https://github.com/elastic/elasticsearch/issues/58806	2020-07-22 10:17:37 +01:00
Armin Braun	c06c9fb966	Fix BwC Snapshot INIT Path (#60006 ) There were two subtle bugs here from backporting #56911 to 7.x. 1. We passed `null` for the `shards` map which isn't nullable any longer when creating `SnapshotsInProgress.Entry`, fixed by just passing an empty map like the `null` handling did in the past. 2. The removal of a failed `INIT` state snapshot from the cluster state tried removing it from the finalization loop (the set of repository names that are currently finalizing). This will trip an assertion since the snapshot failed before its repository was put into the set. I made the logic ignore the set in case we remove a failed `INIT` state snapshot to restore the old logic to exactly as it was before the concurrent snapshots backport to be on the safe side here. Also, added tests that explicitly call the old code paths because as can be seen from initially missing this, the BwC tests will only run in the configuration new version master, old version nodes every so often and having a deterministic test for the old state machine seems the safest bet here. Closes #59986	2020-07-22 10:09:55 +02:00
Rene Groeschke	b210af8389	Update Gradle configurations section in CONTRIBUTING (#59906 )	2020-07-22 09:15:32 +02:00
Rene Groeschke	3fe6635b92	Remove stale gradle plugin descriptor	2020-07-22 09:10:01 +02:00
James Baiera	1c1a4297e0	Track backing indices in data streams stats from cluster state (#59817 ) (#60015 ) If shard level results are incomplete in the data streams stats call, it is possible to get inaccurate counts of the number of backing indices, despite this data being accurate and available in the cluster state.	2020-07-21 23:21:33 -04:00
Emily Li	5f27a95346	Fix grammar mistake in SQL data type docs. (#60028 ) Remove an extra 'when'.	2020-07-21 16:15:06 -07:00
Jake Landis	55216dabb4	[7.x] Per processor description for verbose simulate (#58207 ) (#60008 ) For ingest node processors a per processor description was recently added. This commit displays that description in the verbose output of the pipeline simulation. related #57906	2020-07-21 17:32:45 -05:00
James Rodewig	293cb8d48c	[DOCS] Fix typo in thread pools docs (#59944 ) (#60019 ) Fix typo where available processors should be allocated processors. Co-authored-by: Leaf-Lin <39002973+Leaf-Lin@users.noreply.github.com>	2020-07-21 17:04:36 -04:00
James Rodewig	401e12dc2b	[DOCS] Fix data stream docs (#59818 ) (#60010 )	2020-07-21 17:04:13 -04:00
James Rodewig	04c68ba740	[DOCS] Update search docs to use `my-index` dataset (#60005 ) (#60012 )	2020-07-21 16:14:44 -04:00
Nik Everett	49f365ddfd	Fix bug in deep pipeline agg serialization (#59984 ) In #54716 I removed pipeline aggregators from the aggregation result tree and caused us to read them from the request. This saves a bunch of round trip bytes, which is neat. But there was a bug in the backwards compatibility logic. You see, we still have to give the pipeline aggregations to nodes older than 7.8 over the wire because that is how they know what pipelines to run. They have the pipelines in the request but they don't read them. They use the ones in the response tree. Anyway, we had a bug where we were never sending pipelines defined two levels down. So while you are upgrading the pipeline wouldn't run. Sometimes. If the data node of the "first" result was post-7.8 and the coordinating node was pre-7.8. This fixes the bug.	2020-07-21 16:03:15 -04:00
James Baiera	b3363cf8f9	[7.x] Remove unneeded rest params from Data Stream Stats (#59575 ) (#59661 ) This PR removes the expand_wildcards and forbid_closed_indices parameters from the Data Streams Stats REST endpoint. These options are required for broadcast requests, but are not needed for anything in terms of resolving data streams. Instead, we just set a default set of IndicesOptions on the transport request.	2020-07-21 15:59:16 -04:00
James Rodewig	b302b09b85	[DOCS] Reformat snippets to use two-space indents (#59973 ) (#59994 )	2020-07-21 15:49:58 -04:00
David Roberts	606b7ea139	[DOCS] Adds extra ml-cpp PRs to release notes (#59967 )	2020-07-21 11:47:36 -07:00
Tim Brooks	ed315442ac	Update thread pool docs about WRITE queue size (#59643 ) This commit updates the thread pool documentation to reflect the change in the WRITE thread pool default queue size.	2020-07-21 12:38:03 -06:00
James Rodewig	32d7fa1541	[DOCS] Introduce basic ECS logs test (#59713 ) (#59997 ) Adds a new `my-index-00001` REST test for docs snippets. This test can serve as a lightweight replacement for our existing `twitter` REST tests. The new dataset is: * Based on Apache logs, which is better aligned with Elastic use cases * Compliant with ECS * Similar to the existing `twitter` data set, containing the same field data types * Lightweight, which should keep existing test runtimes roughly the same Also updates the search API reference docs to use the new test.	2020-07-21 13:25:53 -04:00
Armin Braun	5613e4b00b	Increase Timeout in testSLMRetentionAfterRestore (#59979 ) (#59991 ) This test failed by hitting the 10s default busy assert timeout. Given how involved the retention run is (multiple disk reads, CS updates etc.) we should have a higher timeout here. Also, removed the pointless delete call for the snapshot that we just asserted is gone, at the end of the test. Closes #59956	2020-07-21 18:19:18 +02:00

52860 Commits