OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-06 13:08:29 +00:00

Author	SHA1	Message	Date
Lisa Cawley	d77ba58cfd	[DOCS] Add ml-cpp PRs to 7.9.0 release notes (#60689 ) Co-authored-by: David Roberts <dave.roberts@elastic.co>	2020-08-05 10:12:11 -07:00
lcawl	a27d630bdf	[DOCS] Removes coming tag	2020-07-27 07:55:18 -07:00
James Rodewig	988e8c8fc6	[DOCS] Swap `[float]` for `[discrete]` (#60134 ) Changes instances of `[float]` in our docs for `[discrete]`. Asciidoctor prefers the `[discrete]` tag for floating headings: https://asciidoctor.org/docs/asciidoc-asciidoctor-diffs/#blocks	2020-07-23 12:42:33 -04:00
Lisa Cawley	46d33b1586	[DOCS] 7.9.0 release notes (#60053 )	2020-07-22 08:40:59 -07:00
David Roberts	606b7ea139	[DOCS] Adds extra ml-cpp PRs to release notes (#59967 )	2020-07-21 11:47:36 -07:00
David Turner	b75207a09f	Remove sporadic min/max usage estimates from stats (#59755 ) Today `GET _nodes/stats/fs` includes `{least,most}_usage_estimate` fields for some nodes. These fields have rather strange semantics. They are only reported on the elected master and on nodes that have been the elected master since they were last restarted; when a node stops being the elected master these stats remain in place but we stop updating them so they may become arbitrarily stale. This means that these statistics are pretty meaningless and impossible to use correctly. Even if they were kept up to date they're never reported for data-only nodes anyway, despite the fact that data nodes are the ones where we care most about disk usage. The information needed to compute the path with the least/most available space is already provided in the rest the stats output, so we can treat the inclusion of these stats as a bug and fix it by simply removing them in this commit. Since these stats were always optional and mostly omitted (for opaque reasons) this is not considered a breaking change.	2020-07-20 15:22:04 +01:00
James Rodewig	fa2167af0a	[7.x] [DOCS] Update upgrade docs and release highlights for 7.9 (#59674 )	2020-07-16 15:58:40 -04:00
lcawl	f2b530dbdb	[DOCS] Re-adds coming macro in release notes	2020-07-16 09:12:39 -07:00
lcawl	4ad8bef33b	[DOCS] Removes docs PR from release notes	2020-07-15 16:07:43 -07:00
Rory Hunter	b8d73a1e7e	Default gateway.auto_import_dangling_indices to false (#59302 ) Backport of #58898. Part of #48366. Now that there is a dedicated API for dangling indices, the auto-import behaviour can default to off. Also add a note to the breaking changes for 7.9.0.	2020-07-15 17:10:42 +01:00
Martijn Laarman	a699c89133	[DOCS] Add release notes for 7.8.1 (#59594 ) (cherry picked from commit f43a233948f13e487d4d0f4be668687c404a71f4)	2020-07-15 11:42:03 +02:00
Tim Brooks	a46e5e0f04	Increase default write queue size (#59464 ) This commit increases the default write queue size to 10000. This is to allow a greater number of pending indexing requests. This work is safe as we have added additional memory limits. Relates to #59263.	2020-07-14 10:35:25 -06:00
David Roberts	2f9d4a1c7a	[DOCS] Adds extra ml-cpp PRs to release notes (#59354 ) Following the rebuild of 7.8.1 two extra ml-cpp PRs will now be released in 7.8.1.	2020-07-13 09:36:21 +01:00
Lisa Cawley	233857ef6e	[DOCS] Adds ml-cpp PRs to release notes (#59188 )	2020-07-07 11:56:40 -07:00
Yannick Welsch	15c85b29fd	Account for recovery throttling when restoring snapshot (#58658 ) (#58811 ) Restoring from a snapshot (which is a particular form of recovery) does not currently take recovery throttling into account (i.e. the `indices.recovery.max_bytes_per_sec` setting). While restores are subject to their own throttling (repository setting `max_restore_bytes_per_sec`), this repository setting does not allow for values to be configured differently on a per-node basis. As restores are very similar in nature to peer recoveries (streaming bytes to the node), it makes sense to configure throttling in a single place. The `max_restore_bytes_per_sec` setting is also changed to default to unlimited now, whereas previously it was set to `40mb`, which is the current default of `indices.recovery.max_bytes_per_sec`). This means that no behavioral change will be observed by clusters where the recovery and restore settings were not adapted. Relates https://github.com/elastic/elasticsearch/issues/57023 Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-07-01 12:19:29 +02:00
markharwood	837f2643eb	Docs - Added field capabilities breaking change (#58509 )	2020-06-24 18:39:01 +01:00
Lisa Cawley	6680271691	[DOCS] Updates pull and issue release attributes (#58348 )	2020-06-18 12:55:02 -07:00
Stuart Tettemer	20abba8433	Scripting: Deprecate general cache settings (#55753 ) (#58283 ) Backport: ef543b0	2020-06-18 11:54:23 -06:00
Przemyslaw Gomulka	9894d90e0b	[doc] known issues - week based patterns not working in 7.6 (#58099 ) (#58227 ) relates #57128 # Conflicts: # docs/reference/release-notes/7.6.asciidoc	2020-06-17 10:54:22 +02:00
debadair	cfef2b2bec	[DOCS] Removed unused pages (#58209 )	2020-06-16 15:55:56 -07:00
Stuart Tettemer	01795d1925	Revert "Scripting: Deprecate general cache settings (#55753 )" (#58201 ) This reverts commit 88e8b34fc2d672060a82979cb782b8cf491a3985.	2020-06-16 14:58:18 -06:00
Stuart Tettemer	88e8b34fc2	Scripting: Deprecate general cache settings (#55753 ) Backport: ef543b0	2020-06-16 13:06:59 -06:00
debadair	2edcd064fe	[DOCS] Fix bad xref (#58150 )	2020-06-15 15:50:49 -07:00
debadair	80524098fc	[DOCS] Reformat release highlights as What's new. (#58073 )	2020-06-15 13:26:03 -07:00
Rory Hunter	e840ffa300	Add release notes for 7.8.0 (#56340 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: István Zoltán Szabó <istvan.szabo@elastic.co> Co-authored-by: Tim Vernum <tim@adjective.org> Co-authored-by: lcawl <lcawley@elastic.co>	2020-06-12 10:03:41 -04:00
Martijn van Groningen	4b5c4b7966	[DOCS] Add 7.8 release notes entry for auto create index change (#57582 ) The create index action name (`indices:admin/create`) can no longer be used to grant privileges to auto create indices and instead the `create_index` builtin privilege should be used. Relates to #55858 Co-authored-by: Jake Landis <jake.landis@elastic.co>	2020-06-04 07:36:07 +02:00
William Brafford	6e67f1b3dc	[DOCS] Add release notes for 7.7.1 (#57566 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co>	2020-06-03 08:54:32 -04:00
Zachary Tong	4dc12633cf	Better description of real-memory breaker changes to aggs (#57306 ) The old description mentions a setting that we ended up not merging. The periodic real-memory checks are automatic and do not require the user to configure any setting.	2020-06-01 14:58:20 -04:00
Lisa Cawley	b5542e0480	[DOCS] Adds ml-cpp PRs to release notes (#57444 )	2020-06-01 09:56:32 -07:00
Bogdan Pintea	ee437bef27	Docs: forward port release docs of 7.7.0 (#56706 ) Forward port the release docs of 7.7.0: breaking changes, release notes, release highlights.	2020-05-13 20:08:14 +02:00
István Zoltán Szabó	5813dfdcc7	[7.x][DOCS] Adds ML related items to release highlights (#55652 )	2020-04-23 11:58:32 +02:00
Paul Sanwald	0f7917b94b	add release notes for 7.5.2 (#51259 ) Adds release notes for 7.5.2	2020-04-21 08:19:46 -04:00
Adrien Grand	0cb6a1f089	Document the index corruption bug that gets fixed via Lucene 8.5.1. (#55232 ) Using soft deletes on shrunk indices may cause corruption.	2020-04-17 13:37:37 +02:00
Lisa Cawley	cf5278f771	[DOCS] Add ml-cpp PRs to 7.7 release notes (#55264 ) Co-Authored-By: David Roberts <dave.roberts@elastic.co>	2020-04-16 11:28:34 -07:00
Bogdan Pintea	b88dd47de3	Docs: add the change log for 7.7 (#55019 ) * Add the change log for 7.7 Add the change log for 7.7 * Update rel. notes to latest state (BC5) Update the release notes to current state (i.e. BC5). * Update docs/reference/release-notes/7.7.asciidoc Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-04-15 15:25:08 -04:00
Nik Everett	b99a50bcb9	value_count Aggregation optimization (backport of #54854 ) (#55076 ) We found some problems during the test. Data: 200Million docs, 1 shard, 0 replica hits \| avg \| sum \| value_count \| ----------- \| ------- \| ------- \| ----------- \| 20,000 \| .038s \| .033s \| .063s \| 200,000 \| .127s \| .125s \| .334s \| 2,000,000 \| .789s \| .729s \| 3.176s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| The performance of `avg`, `sum` and other is very close when performing statistics, but the performance of `value_count` has always been poor, even not on an order of magnitude. Based on some common-sense knowledge, we think that `value_count` and sum are similar operations, and the time consumed should be the same. Therefore, we have discussed the agg of `value_count`. The principle of counting in es is to traverse the field of each document. If the field is an ordinary value, the count value is increased by 1. If it is an array type, the count value is increased by n. However, the problem lies in traversing each document and taking out the field, which changes from disk to an object in the Java language. We summarize its current problems with Elasticsearch as: - Number cast to string overhead, and GC problems caused by a large number of strings - After the number type is converted to string, sorting and other unnecessary operations are performed Here is the proof of type conversion overhead. ``` // Java long to string source code, getChars is very time-consuming. public static String toString(long i) { int size = stringSize(i); if (COMPACT_STRINGS) { byte[] buf = new byte[size]; getChars(i, size, buf); return new String(buf, LATIN1); } else { byte[] buf = new byte[size * 2]; StringUTF16.getChars(i, size, buf); return new String(buf, UTF16); } } ``` test type \| average \| min \| max \| sum ------------ \| ------- \| ---- \| ----------- \| ------- double->long \| 32.2ns \| 28ns \| 0.024ms \| 3.22s long->double \| 31.9ns \| 28ns \| 0.036ms \| 3.19s long->String \| 163.8ns \| 93ns \| 1921 ms \| 16.3s particularly serious. Our optimization code is actually very simple. It is to manage different types separately, instead of uniformly converting to string unified processing. We added type identification in ValueCountAggregator, and made special treatment for number and geopoint types to cancel their type conversion. Because the string type is reduced and the string constant is reduced, the improvement effect is very obvious. hits \| avg \| sum \| value_count \| value_count \| value_count \| value_count \| value_count \| value_count \| \| \| \| double \| double \| keyword \| keyword \| geo_point \| geo_point \| \| \| \| before \| after \| before \| after \| before \| after \| ----------- \| ------- \| ------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| 20,000 \| 38s \| .033s \| .063s \| .026s \| .030s \| .030s \| .038s \| .015s \| 200,000 \| 127s \| .125s \| .334s \| .078s \| .116s \| .099s \| .278s \| .031s \| 2,000,000 \| 789s \| .729s \| 3.176s \| .439s \| .348s \| .386s \| 3.365s \| .178s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 2.700s \| 2.500s \| 2.600s \| 25.192s \| 1.278s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| 18.990s \| 19.000s \| 20.000s \| 168.971s \| 9.093s \| - The results are more in line with common sense. `value_count` is about the same as `avg`, `sum`, etc., or even lower than these. Previously, `value_count` was much larger than avg and sum, and it was not even an order of magnitude when the amount of data was large. - When calculating numeric types such as `double` and `long`, the performance is improved by about 8 to 9 times; when calculating the `geo_point` type, the performance is improved by 18 to 20 times.	2020-04-10 13:16:39 -04:00
qiye	de8e0200fe	[DOCS] Correct `shape` field release in 7.5 release highlights (#54631 ) The `shape` field was added in 7.4, not 7.3. This corrects a small error in the 7.5 release highlights.	2020-04-02 09:19:40 -04:00
lcawl	2cd35bf696	[DOCS] Adds release highlights placeholder	2020-04-01 09:22:20 -07:00
Lisa Cawley	f5ccf939d9	[DOCS] Clarifies API key breaking change (#54522 )	2020-04-01 08:58:15 -07:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
James Rodewig	7401191019	[DOCS] Include 7.7.0 release notes (#54529 ) Includes the 7.7.0 release notes so they render in the HTML docs. Also removes a few legacy `coming[7.6.0]` tags.	2020-03-31 16:23:49 -04:00
Nik Everett	16e4bd50e2	Add breaking change note for #53669	2020-03-25 09:31:14 -04:00
Jim Ferenczi	55f2e8bff0	[DOCS] Add 7.6.2 release notes (#53720 ) Co-authored-by: James Rodewig <james.rodewig@elastic.co> Co-authored-by: lcawl <lcawley@elastic.co>	2020-03-24 22:42:25 +01:00
Przemyslaw Gomulka	015ad019d5	[docs] Known issue about joda patterns on 7.6 (#53957 )	2020-03-23 10:28:55 +01:00
Przemyslaw Gomulka	412e163cf6	[Doc] migration guide joda (#51986 ) The joda to java.time migration requires users to upgrade their mappings. We allow them to still use 6.x created indices with joda patterns in 7 but ask them to upgrade their patterns in 7.x. This migration guide is to help them understand how they could be affected and what needs to be changed in their mappings. closes #51614 closes #51236	2020-03-23 08:29:01 +01:00
Mayya Sharipova	7e2a9f58ee	script_score query errors on negative scores (#53133 ) 7.5 and 7.6 had a regression that allowed for script_score queries to have negative scores. We have corrected this regression in #52478. This is an addition to #52478 that adds a test and release notes.	2020-03-05 14:23:39 -05:00
Yannick Welsch	d1e7951e00	[DOCS] Add 7.6.1. release notes (#52874 ) Adds the release notes for 7.6.1.	2020-03-04 15:47:54 +01:00
Martijn van Groningen	6aa9aaa2c6	Add validation for dynamic templates (#52890 ) Backport of #51233 to the seven dot x branch. Tries to load a `Mapper` instance for the mapping snippet of a dynamic template. This should catch things like using an analyzer that is undefined or mapping attributes that are unused. This is best effort: * If `{{name}}` placeholder is used in the mapping snippet then validation is skipped. * If `match_mapping_type` is not specified then validation is performed for all mapping types. If parsing succeeds with a single mapping type then this the dynamic mapping is considered valid. If is detected that a dynamic template mapping snippet is invalid at mapping update time then the mapping update is failed for indices created on 8.0.0-alpha1 and later. For indices created on prior version a deprecation warning is omitted instead. In 7.x clusters the mapping update will never fail in case of an invalid dynamic template mapping snippet and a deprecation warning will always be omitted. Closes #17411 Closes #24419 Co-authored-by: Adrien Grand <jpountz@gmail.com>	2020-02-28 10:35:04 +01:00
Mayya Sharipova	3840a763d8	Correct release notes for 7.5 (#52660 ) Remove a mention to a feature that was not merged, as its corresponding PR was closed.	2020-02-21 14:59:46 -05:00
Lisa Cawley	e77e49e956	[DOCS] Adds machine learning highlights (#52334 )	2020-02-14 08:51:55 -08:00

1 2 3 4

169 Commits