OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	76d0edd1a4	Add prefix intervals source (#43635 ) This commit adds a prefix intervals source, allowing you to search for intervals that contain terms starting with a given prefix. The source can make use of the index_prefixes mapping option. Relates to #43198	2019-06-26 16:22:12 +01:00
Benjamin Trent	c121b00c98	[7.x] [ML][Data Frame] Add support for allow_no_match for endpoints (#43490 ) (#43637 ) * [ML][Data Frame] Add support for allow_no_match for endpoints (#43490) * [ML][Data Frame] Add support for allow_no_match parameter in endpoints Adds support for: * Get Transforms * Get Transforms stats * stop transforms * Update DataFrameTransformDocumentationIT.java	2019-06-26 10:09:56 -05:00
Stuart Tettemer	500205e8c5	Add painless method getByPath, get value from nested collections with dotted path (#43170 ) (#43606 ) Given a nested structure composed of Lists and Maps, getByPath will return the value keyed by path. getByPath is a method on Lists and Maps. The path is string Map keys and integer List indices separated by dot. An optional third argument returns a default value if the path lookup fails due to a missing value. Eg. ['key0': ['a', 'b'], 'key1': ['c', 'd']].getByPath('key1') = ['c', 'd'] ['key0': ['a', 'b'], 'key1': ['c', 'd']].getByPath('key1.0') = 'c' ['key0': ['a', 'b'], 'key1': ['c', 'd']].getByPath('key2', 'x') = 'x' [['key0': 'value0'], ['key1': 'value1']].getByPath('1.key1') = 'value1' Throws IllegalArgumentException if an item cannot be found and a default is not given. Throws NumberFormatException if a path element operating on a List is not an integer. Fixes #42769	2019-06-26 09:06:34 -06:00
Jake Landis	51161a4b0e	add 7.2.0 release notes	2019-06-26 08:50:11 -05:00
Armin Braun	83067968ca	Add SAS Token Authentication Support to Azure Repo Plugin (#42982 ) (#43618 ) * Added setting for SAS token * Added support for the token in tests * Relates #42117	2019-06-26 13:43:32 +02:00
David Roberts	558e323c89	[ML] Introduce a setting for the process connect timeout (#43234 ) This change introduces a new setting, xpack.ml.process_connect_timeout, to enable the timeout for one of the external ML processes to connect to the ES JVM to be increased. The timeout may need to be increased if many processes are being started simultaneously on the same machine. This is unlikely in clusters with many ML nodes, as we balance the processes across the ML nodes, but can happen in clusters with a single ML node and a high value for xpack.ml.node_concurrent_job_allocations.	2019-06-26 09:22:04 +01:00
Yannick Welsch	2049f715b3	Add voting-only master node (#43410 ) A voting-only master-eligible node is a node that can participate in master elections but will not act as a master in the cluster. In particular, a voting-only node can help elect another master-eligible node as master, and can serve as a tiebreaker in elections. High availability (HA) clusters require at least three master-eligible nodes, so that if one of the three nodes is down, then the remaining two can still elect a master amongst them-selves. This only requires one of the two remaining nodes to have the capability to act as master, but both need to have voting powers. This means that one of the three master-eligible nodes can be made as voting-only. If this voting-only node is a dedicated master, a less powerful machine or a smaller heap-size can be chosen for this node. Alternatively, a voting-only non-dedicated master node can play the role of the third master-eligible node, which allows running an HA cluster with only two dedicated master nodes. Closes #14340 Co-authored-by: David Turner <david.turner@elastic.co>	2019-06-26 08:07:56 +02:00
James Rodewig	50eac875e4	[DOCS] Rewrite `range` query (#43282 )	2019-06-25 15:25:48 -04:00
Dimitris Athanasiou	126c2fd2d5	[7.x][ML] Machine learning data frame analytics (#43544 ) (#43592 ) This merges the initial work that adds a framework for performing machine learning analytics on data frames. The feature is currently experimental and requires a platinum license. Note that the original commits can be found in the `feature-ml-data-frame-analytics` branch. A new set of APIs is added which allows the creation of data frame analytics jobs. Configuration allows specifying different types of analysis to be performed on a data frame. At first there is support for outlier detection. The APIs are: - PUT _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id}/_stats - POST _ml/data_frame/analysis/{id}/_start - POST _ml/data_frame/analysis/{id}/_stop - DELETE _ml/data_frame/analysis/{id} When a data frame analytics job is started a persistent task is created and started. The main steps of the task are: 1. reindex the source index into the dest index 2. analyze the data through the data_frame_analyzer c++ process 3. merge the results of the process back into the destination index In addition, an evaluation API is added which packages commonly used metrics that provide evaluation of various analysis: - POST _ml/data_frame/_evaluate	2019-06-25 20:29:11 +03:00
James Rodewig	b598701198	[DOCS] Add redirect for painless examples anchor	2019-06-25 12:34:18 -04:00
rbayet	66693c2706	Fixing backquote in fail_on_unsupported_field (#43572 )	2019-06-25 16:34:38 +02:00
Ernesto Reig	c594a956e2	Default number of shards is now 1 instead of 5 (#43573 ) As specified in the [Breaking changes for 7.X](https://www.elastic.co/guide/en/elasticsearch/reference/7.1/breaking-changes-7.0.html#breaking_70_indices_changes), the default number of shards for an index is now `1` instead of `5`.	2019-06-25 14:51:07 +02:00
debadair	df42fac9ac	[DOCS] Edited title/subtitle. (#43552 )	2019-06-24 15:31:19 -07:00
Lisa Cawley	8ffd9c6981	[DOCS] Adds administering section (#43493 )	2019-06-24 10:15:25 -07:00
David Roberts	6728e63619	[DOCS] Rename "job" to "transform" in data frame transform docs (#43534 )	2019-06-24 09:11:24 -07:00
Tanguy Leroux	9794409ca0	Fix broken link	2019-06-24 16:19:57 +02:00
Tanguy Leroux	a4dfa7c29b	Add release highlight for replicated closed indices on 7.2.0 (#43530 )	2019-06-24 15:54:36 +02:00
Matthew Adams	0bcadbf846	Clarify storage location of ML Snapshots (#43437 ) The existing language was misleading about the model snapshots and where they are located. Saying "to disk" sounds like files external to Elasticsearch IMO. It raises the obvious question, where on disk? which node? Is it in the Elasticsearch snapshot repo? The model snapshots are held in an internal index.	2019-06-24 09:14:12 +01:00
Igor Motov	6162471d2e	Docs: Add description of the coerce parameter in geo_shape mapper (#43340 ) Explains the effect of the coerce parameter on the geo_shape field. Relates #35059	2019-06-21 12:30:20 -04:00
James Rodewig	014fd19abd	[DOCS] Rewrite `constant_score` query (#43374 )	2019-06-21 12:04:00 -04:00
James Rodewig	359b103f87	[DOCS] Rewrite term-level queries overview (#43337 )	2019-06-21 11:55:02 -04:00
Luiz Guilherme Pais dos Santos	eeb1812510	Example of how to set slow logs dynamically per-index (#42384 ) * Example of how to set slow logs dynamically per-index * Make _settings API example more explicit Co-Authored-By: James Rodewig <james.rodewig@elastic.co> * Add TEST directive to fix CI Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-06-21 09:30:53 -04:00
David Kyle	d1280339a8	specifies which index to search in docs for various queries (#43307 ) (#43428 ) the geo-bounding-box and phrase-suggest docs were susceptible to failing due to other indices in the cluster. This change restricts the queries to the index that is set up for the test. relates to #43271.	2019-06-21 10:15:51 +01:00
Yu	c88f2f23a5	Make Recovery API support `detailed` params (#29076 ) Properly forwards the `detailed` parameter to show the recovery stats details. Closes #28910	2019-06-21 09:05:33 +02:00
Ryan Ernst	7b0a259b2c	Clarify unsupported secure settings behavior (#43454 ) This commit tweaks the docs for secure settings to ensure the user is aware adding non secure settings to the keystore will result in elasticsearch not starting. fixes #43328 Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-06-20 14:27:27 -07:00
Deb Adair	6b1e45b5b3	[DOCS] Updated the URL for starting in the cloud.	2019-06-20 13:09:21 -07:00
debadair	2319fe74c3	[DOCS] Fixed path to install directory. (#43443 )	2019-06-20 10:36:28 -07:00
Lisa Cawley	5f8db95d60	[DOCS] Describe setup for monitoring logs (#42655 )	2019-06-20 08:17:27 -07:00
debadair	7b740b4ea3	[DOCS] Add brew install instructions. Closes #42914 (#42915 )	2019-06-20 07:56:49 -07:00
David Kyle	12bc38d9e6	Mute put-transform docs test Relates to #43271	2019-06-20 15:54:24 +01:00
Christoph Büscher	adab7eae71	[Docs] Remove boost parameter from intervals-query example (#43331 ) The boost factor doesn't seem to be needed and can be removed.	2019-06-20 10:34:14 +02:00
Andrei Stefan	d684119618	Remove mentions of "fields with the same name in the same index" (#43077 ) Together with types removal, any mention of "fields with the same name in the same index" doesn't make sense anymore. (cherry picked from commit c5190106cbd4c007945156249cce462956933326)	2019-06-20 11:26:12 +03:00
Benjamin Trent	b333ced5a7	[7.x] [ML][Data Frame] adds new pipeline field to dest config (#43124 ) (#43388 ) * [ML][Data Frame] adds new pipeline field to dest config (#43124) * [ML][Data Frame] adds new pipeline field to dest config * Adding pipeline support to _preview * removing unused import * moving towards extracting _source from pipeline simulation * fixing permission requirement, adding _index entry to doc * adjusting for java 8 compatibility * adjusting bwc serialization version to 7.3.0	2019-06-19 16:18:27 -05:00
Jason Tedor	bf74d38782	Fix GeoIP custom database directory in docs (#43383 ) These docs were misleading for package installations of Elasticsearch. Instead, we should refer to $ES_CONFIG/ingest-geoip as the path to place the custom database files. For non-package installations, this is the same as $ES_HOME/config, but for package installations this is not the case as the config directory for package installations is /etc/elasticsearch, and is not relative to $ES_HOME. This commit corrects the docs.	2019-06-19 13:26:07 -04:00
Paul Sanwald	8578aba654	[backport] Adds a minimum interval to `auto_date_histogram`. (#42814 ) (#43285 ) Backports minimum interval to date histogram	2019-06-19 07:06:45 -04:00
Mayya Sharipova	aa6248d4d7	Move dense_vector and sparse_vector to module (#43280 ) (#43333 )	2019-06-18 11:56:04 -04:00
caminsha	11ef5e63ae	[DOCS] Added a new use case for transport.port (#42126 )	2019-06-18 09:52:36 -04:00
Colin Goodheart-Smithe	818a709377	Fixes formatting of CCS compatibility table (#43231 )	2019-06-18 13:28:27 +01:00
debadair	3204e0255c	[DOCS] Sewing SME says it should be "size 70" needle.	2019-06-17 20:30:52 -07:00
debadair	e524e45aed	[DOCS] Fix typo: extraneous {es}	2019-06-17 19:20:11 -07:00
debadair	9767fc2c95	[DOCS] Add introduction to Elasticsearch. (#43075 ) * [DOCS] Add introduction to Elasticsearch. * [DOCS] Incorporated review comments. * [DOCS] Minor edits to add an abbreviated title and cross refs. * [DOCS] Added sizing tips & link to quantatative sizing video.	2019-06-17 17:12:37 -07:00
Jack Conradson	04a7c84e8b	Add Painless Docs for Datetime Inputs (#43128 ) This changes add documentation for accessing datetimes in Painless scripts from the three most common inputs of params, _source, and doc.	2019-06-17 10:59:28 -07:00
lcawl	7ed23088c1	[DOCS] Fixes formatting of 7.2 breaking changes	2019-06-17 10:08:08 -07:00
István Zoltán Szabó	e9e8243faa	[DOCS] Simplifies wording. (#43226 ) This PR simplifies the wording of the TOC and eventually makes it shorter.	2019-06-17 09:37:21 +02:00
Przemysław Witek	b2613a123d	[7.x] Report exponential_avg_bucket_processing_time which gives more weight to recent buckets (#43189 ) (#43263 )	2019-06-17 08:58:26 +02:00
Lisa Cawley	982a23f8c3	[DOCS] Adds size and from parameters to data frame APIs (#43212 )	2019-06-14 09:11:12 -07:00
Marios Trivyzas	9cd89c3453	SQL: Increase hard limit for sorting on aggregates (#43220 ) To be consistent with the `search.max_buckets` default setting, set the hard limit of the PriorityQueue used for in memory sorting, when sorting on an aggregate function, to 10000. Fixes: #43168 (cherry picked from commit 079e012fdea68ea0a7daae078359495047e9c407)	2019-06-14 13:51:38 +02:00
lcawl	8a341a3ea5	[DOCS] Fix link to ML node description	2019-06-13 13:56:06 -07:00
Lisa Cawley	7b90ceae0c	[DOCS] Update node descriptions for default distribution (#42812 )	2019-06-13 13:55:56 -07:00
Jason Tedor	5bc3b7f741	Enable node roles to be pluggable (#43175 ) This commit introduces the possibility for a plugin to introduce additional node roles.	2019-06-13 15:15:48 -04:00
Ryan Ernst	c3ce3f6891	Add native code info to ML info api (#43172 ) The machine learning feature of xpack has native binaries with a different commit id than the rest of code. It is currently exposed in the xpack info api. This commit adds that commit information to the ML info api, so that it may be removed from the info api.	2019-06-13 11:38:58 -07:00
Luca Cavanna	a28569462f	Add 6.8 to the remote clusters compatibility table (#42389 ) The table does not include 6.8 as it was written before we knew we were releasing it. This commit adds it.	2019-06-13 11:30:35 +02:00
Mirek Svoboda	afbb791969	Document wildcard for network interfaces (#28839 ) With this commit we mention how Elasticsearch behaves when either `0` or `0.0.0.0` is used for `network.host`.	2019-06-13 10:18:49 +02:00
Lisa Cawley	7c9acdb0ac	[DOCS] Adds ML release highlights (#43169 )	2019-06-12 13:44:59 -07:00
James Baiera	51618af056	shrink may full copy when using multi data paths (#42913 ) (#42961 ) Additional scenario for full segment copy if hard link cannot work across disks.	2019-06-12 14:34:31 -04:00
Lisa Cawley	7f2f0b7620	[DOCS] Adds dataframe authorization details (#43009 )	2019-06-12 10:17:24 -07:00
Shaunak Kashyap	5ae2460782	[7.x] Metricbeat monitoring Elasticsearch: Reorder/remove steps (#42917 ) (#43130 )	2019-06-12 06:25:30 -07:00
Luca Cavanna	4da0fadedc	[DOCS] Clarify phrase suggester docs smoothing parameter (#42947 ) Closes #28512	2019-06-12 11:25:03 +02:00
Luca Cavanna	e538592652	Update max_concurrent_shard_request parameter docs (#42227 ) Some of the docs were outdated as they did not mention that the limit is not per node. Also, The default value changed. Relates to #31206	2019-06-12 11:25:03 +02:00
markharwood	a75964d8fd	Docs change for exists query. (#43092 ) Now emphasises the test is for indexed values. Previous documentation only mentioned the state of the input JSON doc (null values) but this is only one of several reasons why an indexed value may not exist. Closes #24256	2019-06-12 09:28:18 +01:00
Ryan Ernst	172cd4dbfa	Remove description from xpack feature sets (#43065 ) The description field of xpack featuresets is optionally part of the xpack info api, when using the verbose flag. However, this information is unnecessary, as it is better left for documentation (and the existing descriptions describe anything meaningful). This commit removes the description field from feature sets.	2019-06-11 09:22:58 -07:00
markharwood	b17fbe2933	Docs enhancement for quote_field_suffix. (#43093 ) * Docs enhancement for quote_field_suffix. Mentions the use of a fall-back field when specified field is missing. Closes #40778	2019-06-11 16:33:12 +01:00
Andrei Stefan	8de65daa45	Rename TESTRESPONSE[_cat] to TESTRESPONSE[non_json] (#43087 ) (cherry picked from commit 897b24e0563f59c03e85096fdb64cbc1dd1a5d60)	2019-06-11 12:40:00 +03:00
Andrei Stefan	5b35ec1d9b	Restructure the SQL Language section to have proper sub-sections (#43007 ) Rest docs page update - have the section be on separate pages - add an Overview page - add other formats examples (cherry picked from commit 309bd691ff3f8625f67ca09fc1dd8e265f7e6c92)	2019-06-11 12:39:59 +03:00
Andrei Stefan	4a3287836d	SQL: Clarify that the connections the jdbc driver creates are not pooled (#42992 ) (cherry picked from commit 406d5281bdfe682fb7ec9fefcdb61cce1b9e7270)	2019-06-11 12:39:58 +03:00
Benjamin Trent	79052050bf	[ML] Adding support for geo_shape, geo_centroid, geo_point in datafeeds (#42969 ) (#43069 ) * [ML] Adding support for geo_shape, geo_centroid, geo_point in datafeeds * only supporting doc_values for geo_point fields * moving validation into GeoPointField ctor	2019-06-10 21:52:53 -05:00
James Rodewig	5913723788	[DOCS] Change `// TESTRESPONSE[_cat]` to `// TESTRESPONSE[non_json]` (#43006 )	2019-06-10 09:53:05 -04:00
Mayya Sharipova	81a3b6e2fe	Improve documentation for smart_cn analyzer (#42822 )	2019-06-10 08:59:30 -04:00
Sachin Frayne	44aedcf97a	Correct the description of generate_word_parts (#43026 )	2019-06-10 11:36:31 +01:00
Sam Mingo	12962ee0a7	Update search-settings.asciidoc (#43016 ) Grammar and spelling fixes	2019-06-10 10:14:03 +01:00
Shubham Vipul Majmudar	b2e7045b50	Update regexp-syntax.asciidoc (#43021 ) Corrects a typo.	2019-06-10 10:13:54 +01:00
Andrei Stefan	90485c6028	Since SQL is GA, remove the sql language plugin from this list (#41533 ) (cherry picked from commit f715d722e8df54b3d3fe84d3ff57dfd6a198a2ac)	2019-06-10 09:25:55 +03:00
Jason Tedor	b96ed1f9f7	Add note to CCR docs about mapping/alias updates This commit adds a note to the docs clarifying that it is not possible to manually update the mapping nor the aliases of a follower index.	2019-06-09 22:57:23 -04:00
Jason Tedor	25ca315d78	Add note to CCR docs regarding alias replication This commit adds a note to the docs regarding the automatic replication of aliases by a follower index from its leader index.	2019-06-09 22:55:20 -04:00
James Rodewig	5342616a23	[DOCS] Add explicit `articles_case` parameter to Elision Token Filter example (#42987 )	2019-06-07 11:24:43 -04:00
Henning Andersen	dea935ac31	Reindex max_docs parameter name (#42942 ) Previously, a reindex request had two different size specifications in the body: * Outer level, determining the maximum documents to process * Inside the source element, determining the scroll/batch size. The outer level size has now been renamed to max_docs to avoid confusion and clarify its semantics, with backwards compatibility and deprecation warnings for using size. Similarly, the size parameter has been renamed to max_docs for update/delete-by-query to keep the 3 interfaces consistent. Finally, all 3 endpoints now support max_docs in both body and URL. Relates #24344	2019-06-07 12:16:36 +02:00
James Rodewig	2de919e3a8	[DOCS] Move 'Scripting' section to top-level navigation. (#42939 )	2019-06-06 10:46:02 -04:00
James Rodewig	ed186b4485	[DOCS] Rewrite terms query (#42889 )	2019-06-06 08:33:52 -04:00
David Roberts	b202a59f88	[ML] Add earliest and latest timestamps to field stats (#42890 ) This change adds the earliest and latest timestamps into the field stats for fields of type "date" in the output of the ML find_file_structure endpoint. This will enable the cards for date fields in the file data visualizer in the UI to be made to look more similar to the cards for date fields in the index data visualizer in the UI.	2019-06-06 08:58:35 +01:00
Gordon Brown	6eb4600e93	Add custom metadata to snapshots (#41281 ) Adds a metadata field to snapshots which can be used to store arbitrary key-value information. This may be useful for attaching a description of why a snapshot was taken, tagging snapshots to make categorization easier, or identifying the source of automatically-created snapshots.	2019-06-05 17:30:31 -06:00
Christoph Büscher	99542e66a6	[Docs] Clarify caveats for phonetic filters replace option (#42807 ) The `replace` option in the phonetic token filter can have suprising side effects, e.g. such as described in #26921. This PR adds a note to be mindful about such scenarios and offers alternatives to using the `replace` option. Closes #26921	2019-06-05 22:03:54 +02:00
Lisa Cawley	757c6a45a0	[DOCS] Adds discovery.type (#42823 ) Co-Authored-By: David Turner <david.turner@elastic.co>	2019-06-05 12:37:17 -07:00
Jack Conradson	790d2124f6	Clean Up Painless Datetime Docs (#42869 ) This change abstracts the specific types away from the different representations of datetime as a datetime representation in code can be all kinds of different things. This defines the three most common types of datetimes as numeric, string, and complex while outlining the type most typically used for these as long, String, and ZonedDateTime, respectively. Documentation uses the definitions while examples use the types. This makes the documentation easier to consume especially for people from a non-Java background.	2019-06-05 10:22:00 -07:00
Dimitrios Liappis	00f01aaece	Clarify heap setting in Docker docs (#42754 ) Add note in the Docker docs that even when container memory is limited, we still require specifying -Xms/-Xmx using one of the supported methods.	2019-06-05 09:44:43 +03:00
Jason Tedor	117df87b2b	Replicate aliases in cross-cluster replication (#42875 ) This commit adds functionality so that aliases that are manipulated on leader indices are replicated by the shard follow tasks to the follower indices. Note that we ignore write indices. This is due to the fact that follower indices do not receive direct writes so the concept is not useful. Relates #41815	2019-06-04 20:36:24 -04:00
James Rodewig	783159dcbc	[DOCS] Fix typo in bucket script aggregation link	2019-06-04 09:40:38 -04:00
James Rodewig	d050c52fd1	[DOCS] Fix broken bucket script agg link	2019-06-04 08:43:38 -04:00
Christoph Büscher	d9c582e66b	[Docs] Add to preference parameter docs (#42797 ) Adding notes to the existing docs about how using `preference` might increase request cache utilization but also add warning about the downsides. Closes #24278	2019-06-04 14:38:18 +02:00
Benjamin Trent	32eae0dfe9	[ML] [Data Frame] Adding supported aggs in docs (#42728 ) (#42842 ) * [ML] [Data Frame] Adding supported aggs in docs * [DOCS] Moves pivot to definitions list	2019-06-04 07:19:58 -05:00
David Turner	9f470c20ed	More improvements to cluster coordination docs (#42799 ) This commit addresses a few more frequently-asked questions: * clarifies that bootstrapping doesn't happen even after a full cluster restart. * removes the example that uses IP addresses, to try and further encourage the use of node names for bootstrapping. * clarifies that auto-bootstrapping might form different clusters on different hosts, and gives a process for starting again if this wasn't what you wanted. * adds the "do not stop half-or-more of the master-eligible nodes" slogan that was notably absent. * reformats one of the console examples to a narrower width	2019-06-04 08:25:41 +01:00
Marios Trivyzas	eab88354f2	[Docs] Fix reference to `boost` and `slop` params (#42803 ) For `multi_match` query: link `boost` param to the generic reference for query usage and `slop` to the `match_phrase` query where its usage is documented. Fixes: #40091 (cherry picked from commit 69993049a8bd9e7f042935729fe69a8266d95a0a)	2019-06-03 22:57:19 +02:00
Jack Conradson	de72fe344c	Add Basic Date Docs to Painless (#42544 )	2019-06-03 13:39:03 -07:00
Marios Trivyzas	3b42dde64f	[Docs] Add note for date patterns used for index search. (#42810 ) Add an explanatory NOTE section to draw attention to the difference between small and capital letters used for the index date patterns. e.g.: HH vs hh, MM vs mm. Closes: #22322 (cherry picked from commit c8125417dc33215651f9bb76c9b1ffaf25f41caf)	2019-06-03 22:27:19 +02:00
Marios Trivyzas	6c50246a58	SQL: [Docs] Fix links syntax (#42806 ) Fix a couple of wrong links because of the order of the anchor and the usage of backquotes. (cherry picked from commit 4e0c6525153b60a57202937c2ae57968c8e35285)	2019-06-03 17:51:19 +02:00
David Roberts	b61202b0a8	[ML] Add a limit on line merging in find_file_structure (#42501 ) When analysing a semi-structured text file the find_file_structure endpoint merges lines to form multi-line messages using the assumption that the first line in each message contains the timestamp. However, if the timestamp is misdetected then this can lead to excessive numbers of lines being merged to form massive messages. This commit adds a line_merge_size_limit setting (default 10000 characters) that halts the analysis if a message bigger than this is created. This prevents significant CPU time being spent subsequently trying to determine the internal structure of the huge bogus messages.	2019-06-03 13:45:51 +01:00
Christoph Büscher	9a9ee9abed	[Docs] Add example to reimplement stempel analyzer (#42676 ) Adding an example of how to re-implement the polish stempel analyzer in case a user want to modify or extend it. In order for the analyzer to be able to use polish stopwords, also registering a polish_stop filter for the stempel plugin. Closes #13150	2019-06-03 13:22:44 +02:00
Alan Woodward	2129d06643	Create client-only AnalyzeRequest/AnalyzeResponse classes (#42197 ) This commit clones the existing AnalyzeRequest/AnalyzeResponse classes to the high-level rest client, and adjusts request converters to use these new classes. This is a prerequisite to removing the Streamable interface from the internal server version of these classes.	2019-06-03 09:46:36 +01:00
Christian Kotzbauer	929215c0d5	Update release-notes.asciidoc (#42779 )	2019-06-01 08:18:00 -04:00
Julie Tibshirani	3a00d08c50	Clarify that inner_hits must be used to access nested fields. (#42724 ) This PR updates the docs for `docvalue_fields` and `stored_fields` to clarify that nested fields must be accessed through `inner_hits`. It also tweaks the nested fields documentation to make this point more visible. Addresses #23766.	2019-05-31 10:06:11 -07:00
James Rodewig	f51f8ed04c	[DOCS] Remove unneeded options from `[source,sql]` code blocks (#42759 ) In AsciiDoc, `subs="attributes,callouts,macros"` options were required to render `include-tagged::` in a code block. With elastic/docs#827, Elasticsearch Reference documentation migrated from AsciiDoc to Asciidoctor. In Asciidoctor, the `subs="attributes,callouts,macros"` options are no longer needed to render `include-tagged::` in a code block. This commit removes those unneeded options. Resolves #41589	2019-05-31 13:05:13 -04:00
James Rodewig	0a37dd7a86	[DOCS] Remove unneeded `ifdef::asciidoctor[]` conditionals (#42758 ) Several `ifdef::asciidoctor` conditionals were added so that AsciiDoc and Asciidoctor doc builds rendered consistently. With https://github.com/elastic/docs/pull/827, Elasticsearch Reference documentation migrated completely to Asciidoctor. We no longer need to support AsciiDoc so we can remove these conditionals. Resolves #41722	2019-05-31 11:08:54 -04:00
James Rodewig	478919c0bb	[DOCS] Remove unneeded `ifdef::asciidoctor[]` conditionals (#42758 ) Several `ifdef::asciidoctor` conditionals were added so that AsciiDoc and Asciidoctor doc builds rendered consistently. With https://github.com/elastic/docs/pull/827, Elasticsearch Reference documentation migrated completely to Asciidoctor. We no longer need to support AsciiDoc so we can remove these conditionals. Resolves #41722	2019-05-31 11:05:44 -04:00
Marios Trivyzas	01446ff4bd	[Docs] Mention search related deprecations (#42751 ) Add deprecation entries for 7.3 regarding `common` query and `cutoff_frequency` parameter. Follows: #42691	2019-05-31 12:56:07 +02:00
Alex Pang	5f9382acc2	Fix docs typo in the certutil CSR mode (#42593 ) Changes the mention of `cert` to `csr`. Co-Authored-By: Alex Pang <pangyikhei+github@gmail.com>	2019-05-31 01:03:43 +03:00
Lisa Cawley	d83b91d56a	[DOCS] Disable Metricbeat system module (#42601 )	2019-05-30 12:19:48 -07:00
Julie Tibshirani	1bb505c70d	Clarify the settings around limiting nested mappings. (#42686 ) * Previously, we mentioned multiple times that each nested object was indexed as its own document. This is repetitive, and is also a bit confusing in the context of `index.mapping.nested_fields.limit`, as that applies to the number of distinct `nested` types in the mappings, not the number of nested objects. We now just describe the issue once at the beginning of the section, to illustrate why `nested` types can be expensive. * Reference the ongoing example to clarify the meaning of the two settings. Addresses #28363.	2019-05-30 10:36:38 -07:00
Marios Trivyzas	ce30afcd01	Deprecate CommonTermsQuery and cutoff_frequency (#42619 ) (#42691 ) Since the max_score optimization landed in Elasticsearch 7, the CommonTermsQuery is redundant and slower. Moreover the cutoff_frequency parameter for MatchQuery and MultiMatchQuery is redundant. Relates to #27096 (cherry picked from commit 04b74497314eeec076753a33b3b6cc11549646e8)	2019-05-30 18:04:47 +02:00
Mayya Sharipova	5a76f46ac6	Fix error with mapping in docs Related to #39630	2019-05-30 10:28:09 -04:00
Peter Dyson	b84b5525e1	[DOCS] path_hierarchy tokenizer examples (#39630 ) Closes #17138	2019-05-30 09:17:55 -04:00
James Rodewig	67326252d8	[DOCS] Rewrite 'wildcard' query (#42670 )	2019-05-30 08:31:27 -04:00
Mayya Sharipova	5e02dc6878	Add warning scores are floats (#42667 )	2019-05-29 16:49:04 -04:00
lcawl	78f280de9c	[DOCS] Adds more monitoring tagged regions	2019-05-29 11:21:13 -07:00
James Rodewig	3193dfa8e6	[DOCS] Set explicit anchors for TLS/SSL settings (#42524 )	2019-05-29 08:25:37 -04:00
Hendrik Muhs	345ff21ae5	[ML-DataFrame] rewrite start and stop to answer with acknowledged (#42589 ) rewrite start and stop to answer with acknowledged fixes #42450	2019-05-29 11:14:32 +02:00
Julie Tibshirani	8b325164f9	Fix a callout in the field alias docs.	2019-05-28 17:49:44 -07:00
James Rodewig	e54e74852a	[DOCS] Fix X-Pack tag for Asciidoctor (#42443 )	2019-05-28 15:19:31 -04:00
James Rodewig	54d194409e	[DOCS] Set explicit anchors for Asciidoctor (#42521 )	2019-05-28 14:21:00 -04:00
James Rodewig	ee1e4db266	[DOCS] Set literal anchors for Asciidoctor (#42462 )	2019-05-28 14:16:18 -04:00
Lisa Cawley	77fc7b2107	[DOCS] Reorg monitoring configuration for re-use (#42547 )	2019-05-28 09:13:00 -07:00
lcawl	8ff37e99f5	[DOCS] Removes coming tags	2019-05-28 08:58:41 -07:00
Benjamin Trent	d06618a70d	[ML] adding delayed_data_check_config to datafeed update docs (#42095 ) (#42626 ) * [ML] adding delayed_data_check_config to datafeed update docs * [DOCS] Edits delayed data configuration details	2019-05-28 11:36:30 -04:00
James Rodewig	31d2bdca37	[DOCS] Fix Moving Avg Aggregation `deprecated` macro for Asciidoctor (#42405 )	2019-05-28 08:56:50 -04:00
James Rodewig	b30ca8da28	[DOCS] Fix API Quick Reference rollup attribute for Asciidoctor (#42403 )	2019-05-28 08:53:20 -04:00
James Rodewig	3079d2d295	[DOCS] Escape cross-ref link comma for Asciidoctor (#42402 )	2019-05-28 08:47:51 -04:00
Travis Steel	381e100217	Fixed typo in docker.asciidoc (#42455 )	2019-05-27 11:54:56 +02:00
bellengao	380f296631	Update script-fields.asciidoc (#42490 )	2019-05-27 11:48:37 +02:00
Julie Tibshirani	3a6c2525ca	Deprecate support for chained multi-fields. (#42330 ) This PR contains a straight backport of #41926, and also updates the migration documentation and deprecation info API for 7.x.	2019-05-24 15:55:06 -07:00
James Rodewig	d521a88e19	[DOCS] Move callouts to end of line for Asciidoctor migration (#42356 )	2019-05-24 15:03:46 -04:00
David Roberts	09e8910b0f	[DOCS] Adding ML-specific prerequisites to setup docs (#42529 )	2019-05-24 10:49:41 -07:00
James Rodewig	43dd081e22	[DOCS] Fix nested def list for Asciidoctor (#42353 )	2019-05-24 13:39:49 -04:00
Simon Willnauer	46ccfba808	Remove IndexStore and DirectoryService (#42446 ) Both of these classes are basically a bloated wrapper around a simple construct that can simply be a DirectoryFactory interface. This change removes both classes and replaces them with a simple stateless interface that creates a new `Directory` per shard. The concept of `index.store` is preserved since it makes sense from a configuration perspective.	2019-05-24 12:14:56 +02:00
David Roberts	f472186b9f	[ML] Improve file structure finder timestamp format determination (#41948 ) This change contains a major refactoring of the timestamp format determination code used by the ML find file structure endpoint. Previously timestamp format determination was done separately for each piece of text supplied to the timestamp format finder. This had the drawback that it was not possible to distinguish dd/MM and MM/dd in the case where both numbers were 12 or less. In order to do this sensibly it is best to look across all the available timestamps and see if one of the numbers is greater than 12 in any of them. This necessitates making the timestamp format finder an instantiable class that can accumulate evidence over time. Another problem with the previous approach was that it was only possible to override the timestamp format to one of a limited set of timestamp formats. There was no way out if a file to be analysed had a timestamp that was sane yet not in the supported set. This is now changed to allow any timestamp format that can be parsed by a combination of these Java date/time formats: yy, yyyy, M, MM, MMM, MMMM, d, dd, EEE, EEEE, H, HH, h, mm, ss, a, XX, XXX, zzz Additionally S letter groups (fractional seconds) are supported providing they occur after ss and separated from the ss by a dot, comma or colon. Spacing and punctuation is also permitted with the exception of the question mark, newline and carriage return characters, together with literal text enclosed in single quotes. The full list of changes/improvements in this refactor is: - Make TimestampFormatFinder an instantiable class - Overrides must be specified in Java date/time format - Joda format is no longer accepted - Joda timestamp formats in outputs are now derived from the determined or overridden Java timestamp formats, not stored separately - Functionality for determining the "best" timestamp format in a set of lines has been moved from TextLogFileStructureFinder to TimestampFormatFinder, taking advantage of the fact that TimestampFormatFinder is now an instantiable class with state - The functionality to quickly rule out some possible Grok patterns when looking for timestamp formats has been changed from using simple regular expressions to the much faster approach of using the Shift-And method of sub-string search, but using an "alphabet" consisting of just 1 (representing any digit) and 0 (representing non-digits) - Timestamp format overrides are now much more flexible - Timestamp format overrides that do not correspond to a built-in Grok pattern are mapped to a %{CUSTOM_TIMESTAMP} Grok pattern whose definition is included within the date processor in the ingest pipeline - Grok patterns that correspond to multiple Java date/time patterns are now handled better - the Grok pattern is accepted as matching broadly, and the required set of Java date/time patterns is built up considering all observed samples - As a result of the more flexible acceptance of Grok patterns, when looking for the "best" timestamp in a set of lines timestamps are considered different if they are preceded by a different sequence of punctuation characters (to prevent timestamps far into some lines being considered similar to timestamps near the beginning of other lines) - Out-of-the-box Grok patterns that are considered now include %{DATE} and %{DATESTAMP}, which have indeterminate day/month ordering - The order of day/month in formats with indeterminate day/month order is determined by considering all observed samples (plus the server locale if the observed samples still do not suggest an ordering) Relates #38086 Closes #35137 Closes #35132	2019-05-24 09:10:08 +01:00
Adrien Grand	f3c33d6d96	Add 7.1.1 release notes.	2019-05-24 09:26:04 +02:00
Costin Leau	9fdf4215dd	Docs: Documentation for the upcoming SQL support of frozen indices (#41863 ) (cherry picked from commit a3cc03eb1503df24c1706a721fcc9af38c3b2873) (cherry picked from commit f42dcf2ffd7bd25f3f91aa6127515f393cd1860f)	2019-05-23 21:16:16 +03:00
Yannick Welsch	f57fdc57e9	Deprecate max_local_storage_nodes (#42426 ) Allows this setting to be removed in 8.0, see #42428	2019-05-23 15:59:55 +02:00
Jim Ferenczi	4ca5649a0d	Upgrade to lucene 8.1.0-snapshot-e460356abe (#40952 )	2019-05-23 11:45:33 +02:00
Jake Landis	496fee3333	bump to 7.3 (#42365 )	2019-05-22 11:57:07 -05:00
swstepp	4181c5ccf5	Fix grammar problem in stemming reference. (#42148 )	2019-05-22 09:50:30 -07:00
Julie Tibshirani	a3caed2bee	Fix a rendering issue in the geo envelope docs. (#42332 ) Previously the formatting information didn't display in the docs, and the sentence just rendered as "bounding rectangle in the format :".	2019-05-22 09:49:58 -07:00
Luca Cavanna	e747326b04	Adapt low-level REST client to java 8 (#41537 ) As a follow-up to #38540 we can use lambda functions and method references where convenient in the low-level REST client. Also, we need to update the docs to state that the minimum java version required is 1.8.	2019-05-22 18:47:54 +02:00
Alpar Torok	eb1639c5fc	TestClusters: Convert docs (#42100 ) * TestClusters: Convert docs	2019-05-22 14:44:08 +03:00
David Turner	b1c413ea63	Rework discovery-ec2 docs (#41630 ) This commit reworks and clarifies the docs for the `discovery-ec2` plugin: - folds the tiny "Getting started with AWS" into the page on configuration - spells out the name of each setting in full instead of noting the `discovery.ec2` prefix at the top of the page. - replaces each `(Secure)` marker with a sentence describing what that means in situ - notes some missing defaults - clarifies the behaviour of `discovery.ec2.groups` (dependent on `.any_group`) - clarifies what `discovery.ec2.host_type` is for - adds `discovery.ec2.tag.TAGNAME` as a (meta-)setting rather than describing it in a separate section - notes that the tags mentioned in `discovery.ec2.tag.TAGNAME` cannot contain colons (see #38406) - clarifies the EC2-specific interface names and what they're for - reorders and rewords the recommendations for storage - expands on why you should not span a cluster across regions - adds a suggestion on protecting instances against termination during scale-in - reformat to 80 columns where possible Fixes #38406	2019-05-22 09:46:56 +01:00
Jack Conradson	813db163d8	Reorganize Painless doc structure (#42303 )	2019-05-21 10:50:21 -07:00
Glen Smith	a6204a5eaf	Remove stray back tick that's messing up table format (#41705 )	2019-05-21 09:00:06 -04:00
Mayya Sharipova	216c74d10a	Add experimental and warnings to vector functions (#42205 )	2019-05-21 06:39:05 -04:00
David Turner	7abeaba8bb	Prevent in-place downgrades and invalid upgrades (#41731 ) Downgrading an Elasticsearch node to an earlier version is unsupported, because we do not make any attempt to guarantee that a node can read any of the on-disk data written by a future version. Yet today we do not actively prevent downgrades, and sometimes users will attempt to roll back a failed upgrade with an in-place downgrade and get into an unrecoverable state. This change adds the current version of the node to the node metadata file, and checks the version found in this file against the current version at startup. If the node cannot be sure of its ability to read the on-disk data then it refuses to start, preserving any on-disk data in its upgraded state. This change also adds a command-line tool to overwrite the node metadata file without performing any version checks, to unsafely bypass these checks and recover the historical and lenient behaviour.	2019-05-21 08:04:30 +01:00
Jake Landis	df8fef3c1a	fix assumption that 6.7 is last 6.x release (#42255 )	2019-05-20 14:35:28 -05:00
Jake Landis	87bff89500	7.1.0 release notes forward port (#42252 ) Forward port of #42208	2019-05-20 14:39:17 -04:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (e.g. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
Jay Modi	dbbdcea128	Update ciphers for TLSv1.3 and JDK11 if available (#42082 ) This commit updates the default ciphers and TLS protocols that are used when the runtime JDK supports them. New cipher support has been introduced in JDK 11 and 12 along with performance fixes for AES GCM. The ciphers are ordered with PFS ciphers being most preferred, then AEAD ciphers, and finally those with mainstream hardware support. When available stronger encryption is preferred for a given cipher. This is a backport of #41385 and #41808. There are known JDK bugs with TLSv1.3 that have been fixed in various versions. These are: 1. The JDK's bundled HttpsServer will endless loop under JDK11 and JDK 12.0 (Fixed in 12.0.1) based on the way the Apache HttpClient performs a close (half close). 2. In all versions of JDK 11 and 12, the HttpsServer will endless loop when certificates are not trusted or another handshake error occurs. An email has been sent to the openjdk security-dev list and #38646 is open to track this. 3. In JDK 11.0.2 and prior there is a race condition with session resumption that leads to handshake errors when multiple concurrent handshakes are going on between the same client and server. This bug does not appear when client authentication is in use. This is JDK-8213202, which was fixed in 11.0.3 and 12.0. 4. In JDK 11.0.2 and prior there is a bug where resumed TLS sessions do not retain peer certificate information. This is JDK-8212885. The way these issues are addressed is that the current java version is checked and used to determine the supported protocols for tests that provoke these issues.	2019-05-20 09:45:36 -04:00

1 2 3 4 5 ...

6928 Commits