OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-19 19:35:02 +00:00

Author	SHA1	Message	Date
Arne Welzel	f642baa9fb	[DOCS] Remove extra "when" (#48926 )	2019-11-11 10:11:02 +01:00
Yannick Welsch	87862868c6	Allow realtime get to read from translog (#48843 ) The realtime GET API currently has erratic performance in cases where a document is accessed that has just been indexed but not refreshed yet, as the implementation will currently force an internal refresh in that case. Refreshing can be an expensive operation, and also will block the thread that executes the GET operation, preventing other GETs from being processed. In case of frequent access of recently indexed documents, this can lead to a refresh storm and terrible GET performance. While older versions of Elasticsearch (2.x and older) did not trigger refreshes and instead opted to read from the translog in case of realtime GET API or update API, this was removed in 5.0 (#20102) to avoid inconsistencies between values that were returned from the translog and those returned by the index. This was partially reverted in 6.3 (#29264) to allow _update and upsert to read from the translog again as it was easier to guarantee consistency for these, and also brought back more predictable performance characteristics of this API. Calls to the realtime GET API, however, would still always do a refresh if necessary to return consistent results. This means that users who were calling realtime GET APIs to coordinate updates on the client side (realtime GET + CAS for conditional index of updated doc) would still see very erratic performance. This PR (together with #48707) resolves the inconsistencies between reading from translog and index. In particular it fixes the inconsistencies that happen when requesting stored fields, which were not available when reading from translog. In cases where stored fields are requested, this PR will reparse the _source from the translog and derive the stored fields to be returned. 
With this, it changes the realtime GET API to allow reading from the translog again, avoid refresh storms and blocking the GET threadpool, and provide overall much better and predictable performance for this API.	2019-11-09 17:47:50 +01:00
Julian Simioni	5e4501eb3f	[Docs] Consolidate single example into a single line (#48904 ) The first example of splitting rules for the `word_delimiter` token filter was spread across two bullet points. This makes it look like they are two separate splitting rules.	2019-11-08 15:12:45 -05:00
Yannick Welsch	af887be3e5	Hide orphaned tasks from follower stats (#48901 ) CCR follower stats can return information for persistent tasks that are in the process of being cleaned up. This is problematic for tests where CCR follower indices have been deleted, but their persistent follower task is only cleaned up asynchronously afterwards. If one of the following tests then accesses the follower stats, it might still get the stats for that follower task. In addition, some tests were not cleaning up their auto-follow patterns, leaving orphaned patterns behind. Other tests cleaned up their auto-follow patterns. As the same name was always used, it just depended on the test execution order whether this led to a failure or not. This commit fixes the offending tests, and will also automatically remove auto-follow-patterns at the end of tests, like we do for many other features. Closes #48700	2019-11-08 13:56:53 +01:00
bellengao	bdc7057d58	[DOCS] Correct typo in split index API docs (#48894 )	2019-11-07 15:27:27 -05:00
bellengao	293902c6a5	[DOCS] Fix shard type in CCR overview doc (#48882 ) Closes #48875	2019-11-07 10:09:45 -05:00
Tanguy Leroux	552381d7f9	Add mention to Pause Auto-Follower API in Upgrade Clusters docs (#48764 ) Relates #46665	2019-11-06 09:48:44 -05:00
István Zoltán Szabó	3c9bd13dca	[DOCS] Adds classification type DFA API docs and ml-shared.asciidoc (#48241 )	2019-11-06 07:41:38 -05:00
István Zoltán Szabó	70765dfb05	[DOCS] Adds classification type evaluation docs to the DFA evaluation API (#47657 )	2019-11-06 07:38:33 -05:00
glerb	baabc21a04	[DOCS] Correct typo in Discovery docs (#48494 )	2019-11-05 08:48:43 -05:00
James Rodewig	700a316bb3	[DOCS] Reformat decimal digit token filter docs (#48722 )	2019-11-01 12:38:14 -04:00
James Rodewig	680999f246	[DOCS] List `indices.lifecycle.poll_interval` as cluster-level (#48813 ) Lists `indices.lifecycle.poll_interval` with other cluster-level ILM settings. Previously, it was included under index-level settings.	2019-11-01 11:54:46 -04:00
pulysak	9a0a7ab95a	[DOCS] Fix typo in Index API reference docs (#48760 )	2019-11-01 09:16:11 -04:00
debadair	b9f4b32892	[DOCS] Fix cross-doc link. (#48783 ) * [DOCS] Fix cross-doc link. * Fixed xref	2019-10-31 18:59:17 -07:00
Lisa Cawley	40834c229f	[7.x][DOCS] Copies ESMS monitoring details to Elasticsearch Reference (#48780 )	2019-10-31 18:22:08 -07:00
debadair	457379e74e	[DOCS] Edited Docker install & tweaked Docker compose file. (#47715 ) * [DOCS] Edited Docker install & tweaked Docker compose file. * Synced with Docker GS in SO * Incorporated review comments	2019-10-31 18:12:39 -07:00
Tal Levy	4be54402de	[7.x] Add ingest info to Cluster Stats (#48485 ) (#48661 ) * Add ingest info to Cluster Stats (#48485) This commit enhances the ClusterStatsNodes response to include global processor usage stats on a per-processor basis. example output: ``` ... "processor_stats": { "gsub": { "count": 0, "failed": 0, "current": 0, "time_in_millis": 0 }, "script": { "count": 0, "failed": 0, "current": 0, "time_in_millis": 0 } } ... ``` The purpose for this enhancement is to make it easier to collect stats on how specific processors are being used across the cluster beyond the per-node usage statistics that currently exist in node stats. Closes #46146. * fix BWC of ingest stats The introduction of processor types into IngestStats had a bug. It was set to `null` and set as the key to the map. This would throw an NPE. This commit resolves this by setting all the processor types from previous versions that are not serializing it out to `_NOT_AVAILABLE`.	2019-10-31 14:36:54 -07:00
Deb Adair	6412d0f528	[DOCS] Remove coming tag from 7.4.2 RN backport.	2019-10-31 09:43:26 -07:00
Lisa Cawley	b7559f23cc	[DOCS] Fixes PR#48055 in release notes (#48726 )	2019-10-31 07:37:44 -07:00
Peter Johnson	3f7aafa421	[DOCS] Fix typo in synonym token filter docs (#48691 )	2019-10-31 09:12:24 -04:00
James Rodewig	3d5b1725a9	[DOCS] Remove unneeded filter from common grams analyze ex (#48748 )	2019-10-31 09:08:14 -04:00
Brandon Morelli	aa02174d53	[DOCS] Fix typo in ILM policy definition docs (#48723 ) Removes an extra "by".	2019-10-31 08:30:54 -04:00
Andrei Dan	ffe5d5417f	ILM Make the `check-rollover-ready` step retryable (#48256 ) (#48740 ) This adds the infrastructure to be able to retry the execution of retryable steps and makes the `check-rollover-ready` retryable as an initial step to make the rollover action more resilient to transient errors. (cherry picked from commit 454020ac8acb147eae97acb4ccd6fb470d1e5f48) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-10-31 11:28:55 +00:00
debadair	a876760848	[DOCS] Add placeholder for 7.4.2 release notes (#48724 )	2019-10-30 16:09:29 -07:00
Jason Tedor	13043219ac	Fix specification for cluster.remote.connect (#48690 ) The docs specify that cluster.remote.connect disables cross-cluster search. This is correct, but not fully accurate as it disables any functionality that relies on remote cluster connections: cross-cluster search, remote data feeds, and cross-cluster replication. This commit updates the docs to reflect this.	2019-10-30 11:26:15 -04:00
James Rodewig	0b062bbc82	[DOCS] Correct required file ext for user agent ingest processor (#48688 ) For the user agent ingest processor, custom regex files must end with the `.yml` file extension. This corrects the docs which said the `.yaml` extension was required.	2019-10-30 11:11:29 -04:00
Dan Hermann	dbc05cd808	Add option to split processor for preserving trailing empty fields (#48685 )	2019-10-30 08:25:03 -05:00
James Rodewig	77acbc4fa9	[DOCS] Reformat common grams token filter (#48426 )	2019-10-30 08:40:56 -04:00
Yannick Welsch	356066ce6a	Revert "Mute get-ccr-stats doctest (#48375 )" This reverts commit f861927e8b8fc987949ce996a131a2d272b9646e.	2019-10-30 11:13:29 +01:00
Julie Tibshirani	89c65752dc	Update the signature of vector script functions. (#48653 ) Previously the functions accepted a doc values reference, whereas they now accept the name of the vector field. Here's an example of how a vector function was called before and after the change. ``` Before: cosineSimilarity(params.query_vector, doc['field']) After: cosineSimilarity(params.query_vector, 'field') ``` This seems more intuitive, since we don't allow direct access to vector doc values and the meaning of `doc['field']` is unclear. The PR makes the following changes (broken into distinct commits): * Add new function signatures of the form `function(params.query_vector, 'field')` and deprecates the old ones. Because Painless doesn't allow two methods with the same name and number of arguments, we allow a generic `Object` to be passed in to the function and decide on the behavior through an `instanceof` check. * Refactor the class bindings so that the document field is passed to the constructor instead of the instance method. This allows us to avoid retrieving the vector doc values on every function invocation, which gives a tiny speed-up in benchmarks. Note that this PR adds new signatures for the sparse vector functions too, even though sparse vectors are deprecated. It seemed simplest to understand (for both us and users) to keep everything symmetric between dense and sparse vectors.	2019-10-29 15:46:05 -07:00
James Rodewig	7002ce1e9c	[DOCS] Replace `_uid` refs in reindex slicing docs (#48649 ) PR #25543 removed the `_uid` field in favor of the `_id` field, including for use in slicing. This removes an outdated reference to `_uid` in our reindex docs.	2019-10-29 16:41:53 -04:00
Christoph Büscher	1de49d8a70	Remove Ranking Evaluation API experimental status (#48603 ) The API has been released long enough to remove the experimental status.	2019-10-29 20:57:39 +01:00
Lisa Cawley	c6f4662038	[DOCS] Updates ML PRs in 7.4.1 release notes (#48600 )	2019-10-29 09:35:11 -07:00
Daniel Andion	d0cbbf9d58	SQL: [Docs] Typo in HAVING section (#48609 ) `HAVING` section code states `GROUP BY` instead of the appropriate keyword. (cherry picked from commit 9d505dc3db51e250fdf1b44e4d952dcd97bf1bc1)	2019-10-29 16:37:39 +01:00
lgypro	abddf51672	[Docs] Fix syntax error leading to wrong doc ID (#48554 ) In order to index a document with id 2, the "&" should be replaced by "?"	2019-10-29 10:27:23 +01:00
Ian Danforth	82e25c4ac7	[Docs] Fix typo in suggesters search API doc (#48477 )	2019-10-29 09:58:05 +01:00
Ian Danforth	4a076f5e92	[Doc] Fix typo in indices module docs (#48598 )	2019-10-28 21:40:09 +01:00
Julie Tibshirani	605500df7e	Add sparse vector deprecation to 7.6 migration docs. (#48435 ) This note was accidentally omitted from the deprecation PR.	2019-10-28 11:57:20 -07:00
Benjamin Trent	6ea59dd428	[ML][Transforms] add wait_for_checkpoint flag to stop (#47935 ) (#48591 ) Adds `wait_for_checkpoint` for `_stop` API.	2019-10-28 13:02:57 -04:00
Lisa Cawley	13ce179706	[DOCS] Re-enable code snippet testing in close anomaly detection job API (#48259 ) (#48585 )	2019-10-28 08:42:09 -07:00
Shaunak Kashyap	d27a307379	[DOCS] Remove extraneous comma in Enrich Stats API's JSON response (#48539 )	2019-10-25 12:35:50 -04:00
James Rodewig	e9c8e4f6d1	[DOCS] Fix note format in index suggestion docs (#48536 )	2019-10-25 11:31:47 -04:00
Christoph Büscher	055a0800eb	[Docs] Mention reserved completion suggestion characters (#48445 ) We currently don't mention the three reserved characters anywhere. This change adds a short note mentioning them. Closes #48341	2019-10-25 16:58:23 +02:00
Julie Tibshirani	b2974e3816	Correct outdated information in _index docs. (#48436 ) This PR makes the following updates: * Update the supported query types to include `prefix` and `wildcard`. * Specify that queries accept index aliases. * Clarify that when querying on a remote index name, the separator `:` must be present.	2019-10-24 11:02:25 -07:00
Hendrik Muhs	5ecfcdb162	update warning about index names after transform rename (#48457 )	2019-10-24 15:17:20 +02:00
Julie Tibshirani	4375316b9d	Make sure to list the 7.5 migration docs.	2019-10-23 18:52:22 -07:00
Julie Tibshirani	2664cbd20b	Deprecate the sparse_vector field type. (#48368 ) We have not seen much adoption of this experimental field type, and don't see a clear use case as it's currently designed. This PR deprecates the field type in 7.x. It will be removed from 8.0 in a follow-up PR.	2019-10-23 16:35:03 -07:00
James Rodewig	06dc1fbd96	[DOCS] Reformat ASCII folding token filter docs (#48143 )	2019-10-23 15:06:55 -05:00
Jim Ferenczi	96556d72cc	Add a known issue to the release notes of 7.4.0 (#48373 ) A [bug](https://github.com/elastic/elasticsearch/issues/48358) in 7.4.0 prevents the activation of the search slow log. This change adds an entry in the release notes to warn users to not activate it in this version. Relates #48358	2019-10-23 19:57:37 +02:00
James Rodewig	640d7416b1	[DOCS] Change prev version to 7.5 in upgrade docs (#48415 )	2019-10-23 12:09:26 -05:00
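
The dispatch approach described in commit 89c65752dc (a single entry point that accepts a generic object and branches on its runtime type, with the field bound once at construction) can be sketched as follows. This is a minimal Python illustration, not the Painless or Java code from that commit: `DocValuesRef`, `CosineSimilarity`, and the dict-based `doc` are hypothetical names chosen for the example, and vectors are assumed to be non-zero.

```python
import math
import warnings
from dataclasses import dataclass


@dataclass
class DocValuesRef:
    """Hypothetical stand-in for the old-style `doc['field']` argument."""
    field_name: str


class CosineSimilarity:
    """Hypothetical binding: the field is resolved once, in the constructor."""

    def __init__(self, query_vector, field):
        # Both call styles share one name and one arity, so the choice
        # between them is made with a runtime type check.
        if isinstance(field, DocValuesRef):
            warnings.warn(
                "passing doc['field'] is deprecated; pass the field name",
                DeprecationWarning,
            )
            field = field.field_name
        elif not isinstance(field, str):
            raise TypeError("field must be a field name or a DocValuesRef")
        self.field = field
        self.query_vector = list(query_vector)
        # Computed once rather than on every invocation.
        self.query_norm = math.sqrt(sum(x * x for x in self.query_vector))

    def __call__(self, doc):
        # `doc` is assumed to be a mapping from field name to a vector.
        vector = doc[self.field]
        dot = sum(q * v for q, v in zip(self.query_vector, vector))
        norm = math.sqrt(sum(v * v for v in vector))
        return dot / (self.query_norm * norm)
```

Under these assumptions, `CosineSimilarity(query, 'field')` and `CosineSimilarity(query, DocValuesRef('field'))` produce the same scorer, and only the second emits a deprecation warning.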

6527 Commits