OpenSearch

Commit Graph

Author	SHA1	Message	Date
Przemko Robakowski	4619834b97	[7.x] CSV ingest processor (#49509 ) (#50083 ) * CSV ingest processor (#49509) This change adds new ingest processor that breaks line from CSV file into separate fields. By default it conforms to RFC 4180 but can be tweaked. Closes #49113	2019-12-11 23:06:05 +01:00
Wilder Pereira	8ff809af2d	[DOCS] Replace interval notation with plain English in match query docs (#47334 ) As we discussed in #36371, interval notation is confusing to some users. This makes the intention clearer by just explaining inclusivity and exclusivity in the docs.	2019-12-11 09:58:28 -05:00
Patryk Krawaczyński	df558aa0ca	[DOCS] Document `index.queries.cache.enabled` as a static setting (#49886 )	2019-12-10 14:24:03 -05:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
Peter Johnson	1a6e5bf220	[Docs] Fix typo in function-score-query.asciidoc (#50030 )	2019-12-10 17:33:03 +01:00
Lisa Cawley	15f27d8c54	[DOCS] Removes realm type security setting (#50001 )	2019-12-10 08:09:07 -08:00
James Rodewig	3f5678ca79	[DOCS] Remove shadow replica reference (#50029 ) Removes a reference to shadow replicas from the cat shards API docs and a comment in cluster/routing/UnassignedInfo.java. Shadow replicas were removed with #23906.	2019-12-10 09:30:51 -05:00
Dimitris Athanasiou	8891f4db88	[7.x][ML] Introduce randomize_seed setting for regression and classification (#49990 ) (#50023 ) This adds a new `randomize_seed` for regression and classification. When not explicitly set, the seed is randomly generated. One can reuse the seed in a similar job in order to ensure the same docs are picked for training. Backport of #49990	2019-12-10 15:29:19 +02:00
James Rodewig	33594380c7	[DOCS] Skip synced flush docs tests (#49986 ) The current snippets in the synced flush docs can cause conflicts with other background syncs, such as the global checkpoint sync or retention lease sync, in the docs tests. This skips tests for those snippets to avoid conflicts.	2019-12-09 13:17:38 -05:00
James Rodewig	322dabe3de	[DOCS] Correct `for in` example in Painless docs (#49991 ) Adds a needed `def` keyword to the `for in` example in the Painless docs.	2019-12-09 11:05:12 -05:00
Artur Carvalho	d073bccaad	[Docs] Fix typo in getting-started.asciidoc (#49985 )	2019-12-09 16:24:30 +01:00
James Rodewig	1918a21baf	[DOCS] Correct inline shape snippets in shape query docs (#49921 ) In the shape query docs, the index mapping snippet uses the "geometry" shape field mapping. However, the doc index snippet uses the "location" property. This changes the "location" property to "geometry". It also adds a comment containing the search result snippet. This should prevent similar issues in the future.	2019-12-09 08:47:59 -05:00
Ryan Ernst	e66cfc4369	Fix incorrect use of multiline NOTE in rpm docs (#49962 ) This was a copy/paste error from #49893. This commit converts the NOTE to use inline style instead of one needing closing linebreak.	2019-12-06 17:43:51 -08:00
Ryan Ernst	d29f04209b	Disable repo configuration for rpm based systems (#49893 ) This commit changes the recommended repository file for rpm based systems to be disabled by default. This is a safer practice so upgrades of the system do no accidentally upgrade elasticsearch itself. closes #30660	2019-12-06 15:56:18 -08:00
Przemko Robakowski	d7083a84f4	Allow list of IPs in geoip ingest processor (#49573 ) (#49947 ) * Allow list of IPs in geoip ingest processor This change lets you use array of IPs in addition to string in geoip processor source field. It will set array containing geoip data for each element in source, unless first_only parameter option is enabled, then only first found will be returned. Closes #46193	2019-12-07 00:19:09 +01:00
István Zoltán Szabó	63d3933787	[DOCS] Fixes classification evaluation example response. (#49905 )	2019-12-06 13:25:40 +01:00
István Zoltán Szabó	bb91291273	[DOCS] Fixes attribute in transforms overview. (#49898 )	2019-12-06 10:24:29 +01:00
Hendrik Muhs	b17cfc93e3	[Transform][DOCS]rewrite client ip example to use continuous transform (#49822 ) adapt the transform example for suspicious client ips to use continuous transform	2019-12-06 08:20:48 +01:00
Orhan Toy	1641fcd488	[DOCS] Minor typo fixes in reindex.asciidoc (#49863 )	2019-12-05 20:25:11 +01:00
István Zoltán Szabó	f4b3bb7d6b	[DOCS] Adds an example of preprocessing actions to the PUT DFA API docs (#49831 )	2019-12-05 14:16:38 +01:00
István Zoltán Szabó	04e99ff1ee	[DOCS] Fixes typo in the ML anomaly detection time functions docs. (#49834 )	2019-12-05 09:58:30 +01:00
James Rodewig	42f902977d	[DOCS] Document `minimum_should_match` defaults for `bool` query (#48865 ) Adds documentation for the `minimum_should_match` parameter to the `bool` query docs. Includes docs for the default values: - `1` if the `bool` query includes at least one `should` clause and no `must` or `filter` clauses - `0` otherwise	2019-12-04 12:45:38 -05:00
James Rodewig	87a73b6bdf	[DOCS] Reformat length token filter docs (#49805 ) * Adds a title abbreviation * Updates the description and adds a Lucene link * Reformats the parameters section * Adds analyze, custom analyzer, and custom filter snippets Relates to #44726.	2019-12-04 09:59:08 -05:00
Alexander Reelsen	6e751f5536	Docs: Fix & test more grok processor documentation (#49447 ) The documentation contained a small error, as bytes and duration was not properly converted to a number and thus remained a string. The documentation is now also properly tested by providing a full blown simulate pipeline example.	2019-12-03 11:55:49 +01:00
Colin Goodheart-Smithe	0592b3c726	Removes PR that was not in 7.5.0 release	2019-12-03 10:20:05 +00:00
cachedout	c4cc90be1c	Recommend Metricbeat for 7.x (#49758 ) * Recommend Metricbeat for 7.x * Fix typo in link to configuring-metricbeat * [DOCS] Fixes build error and some terminology * Add to local exporter page per review feedback	2019-12-02 21:31:47 +00:00
James Rodewig	f1fd41cb53	[DOCS] Document CCR compatibility requirements (#49776 ) * Creates a prerequisites section in the cross-cluster replication (CCR) overview. * Adds concise definitions for local and remote cluster in a CCR context. * Documents that the ES version of the local cluster must be the same or a newer compatible version as the remote cluster.	2019-12-02 15:53:00 -05:00
Paul Sanwald	ebc13ca498	re-categorize things that appeared in multiple area labels (#49777 )	2019-12-02 15:03:38 -05:00
Hendrik Muhs	a5dc6e062e	Document issue 49730 in release notes for 7.5.0 (#49733 ) document low severity issue about transform audit index potentially disappearing during rolling upgrade See #49730 for details	2019-12-02 20:54:36 +01:00
lcawl	96f14fcfbd	[DOCS] Removes coming tags	2019-12-02 08:17:10 -08:00
Jim Ferenczi	e6dc5bf9c2	Add release highlights for 7.5.0 (#49320 )	2019-12-02 07:45:12 -08:00
Andrei Stefan	e2982b2110	SQL: handle NULL arithmetic operations with INTERVALs (#49633 ) (cherry picked from commit ce727615c08cf5ae422feb77f69ea24fb53cd9d1)	2019-12-02 17:31:05 +02:00
James Rodewig	ade72b97b7	[DOCS] Reformat keep types and keep words token filter docs (#49604 ) * Adds title abbreviations * Updates the descriptions and adds Lucene links * Reformats parameter definitions * Adds analyze and custom analyzer snippets * Adds explanations of token types to keep types token filter and tokenizer docs	2019-12-02 09:40:50 -05:00
David Turner	86a40f6d8b	Drop snapshot instructions for autobootstrap fix (#49755 ) The "Restore any snapshots as required" step is a trap: it's somewhere between tricky and impossible to restore multiple clusters into a single one. Also add a note about configuring discovery during a rolling upgrade to proscribe any rare cases where you might accidentally autobootstrap during the upgrade.	2019-12-02 14:33:42 +00:00
James Rodewig	3d44c1163a	[DOCS] Explicitly document enrich `target_field` includes `match_field` (#49407 ) When the enrich processor appends enrich data to an incoming document, it adds a `target_field` to contain the enrich data. This `target_field` contains both the `match_field` AND `enrich_fields` specified in the enrich policy. Previously, this was reflected in the documented example but not explicitly stated. This adds several explicit statements to the docs.	2019-12-02 09:13:24 -05:00
Henning Andersen	5adb33ec17	Deprecate sorting in reindex (#49458 ) (#49738 ) Reindex sort never gave a guarantee about the order of documents being indexed into the destination, though it could give a sense of locality of source data. It prevents us from doing resilient reindex and other optimizations and it has therefore been deprecated. Related to #47567	2019-12-01 19:24:27 +01:00
Henning Andersen	1d745f1e5c	Revert "Deprecate sorting in reindex (#49458 )" This reverts commit `27d45c9f1f`.	2019-11-29 22:08:19 +01:00
Mayya Sharipova	7cf170830c	Optimize sort on numeric long and date fields. (#49732 ) This rewrites long sort as a `DistanceFeatureQuery`, which can efficiently skip non-competitive blocks and segments of documents. Depending on the dataset, the speedups can be 2 - 10 times. The optimization can be disabled with setting the system property `es.search.rewrite_sort` to `false`. Optimization is skipped when an index has 50% or more data with the same value. Optimization is done through: 1. Rewriting sort as `DistanceFeatureQuery` which can efficiently skip non-competitive blocks and segments of documents. 2. Sorting segments according to the primary numeric sort field(#44021) This allows to skip non-competitive segments. 3. Using collector manager. When we optimize sort, we sort segments by their min/max value. As a collector expects to have segments in order, we can not use a single collector for sorted segments. We use collectorManager, where for every segment a dedicated collector will be created. 4. Using Lucene's shared TopFieldCollector manager This collector manager is able to exchange minimum competitive score between collectors, which allows us to efficiently skip the whole segments that don't contain competitive scores. 5. When index is force merged to a single segment, #48533 interleaving old and new segments allows for this optimization as well, as blocks with non-competitive docs can be skipped. Backport for #48804 Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>	2019-11-29 15:37:40 -05:00
Henning Andersen	27d45c9f1f	Deprecate sorting in reindex (#49458 ) Reindex sort never gave a guarantee about the order of documents being indexed into the destination, though it could give a sense of locality of source data. It prevents us from doing resilient reindex and other optimizations and it has therefore been deprecated. Related to #47567	2019-11-29 21:35:11 +01:00
Tugberk Ugurlu	dcb9d5177c	[Docs] Fix typo in templates.asciidoc (#49726 )	2019-11-29 18:43:13 +01:00
Dimitris Athanasiou	4edb2e7bb6	[7.x][ML] Add optional source filtering during data frame reindexing (#49690 ) (#49718 ) This adds a `_source` setting under the `source` setting of a data frame analytics config. The new `_source` is reusing the structure of a `FetchSourceContext` like `analyzed_fields` does. Specifying includes and excludes for source allows selecting which fields will get reindexed and will be available in the destination index. Closes #49531 Backport of #49690	2019-11-29 16:10:44 +02:00
Tim Vernum	e6f530c167	Improved diagnostics for TLS trust failures (#49669 ) - Improves HTTP client hostname verification failure messages - Adds "DiagnosticTrustManager" which logs certificate information when trust cannot be established (hostname failure, CA path failure, etc) These diagnostic messages are designed so that many common TLS problems can be diagnosed based solely (or primarily) on the elasticsearch logs. These diagnostics can be disabled by setting xpack.security.ssl.diagnose.trust: false Backport of: #48911	2019-11-29 15:01:20 +11:00
Tim Vernum	31f13e839c	Correct the documentation for create_doc privilege (#49354 ) The documentation was added in #47584 but those docs did not reflect the up-to-date behavior of the feature. Backport of: #47784	2019-11-29 12:59:16 +11:00
Mayya Sharipova	2dafecc398	Upgrade lucene to 8.4.0-snapshot-e648d601efb (#49641 )	2019-11-28 11:59:58 -05:00
Marios Trivyzas	d5842aebab	[Docs] Enhance rolling upgrade guide (#49686 ) Add a couple of pointers for the user to check the overall cluster health and the version of ES running on every node. Fixes: #49670 (cherry picked from commit 8ca11f54cd839f41632c556601e94da67e91a3d1)	2019-11-28 17:02:36 +01:00
Ignacio Vera	326fe7566e	New Histogram field mapper that supports percentiles aggregations. (#48580 ) (#49683 ) This commit adds a new histogram field mapper that consists in a pre-aggregated format of numerical data to be used in percentiles aggregations.	2019-11-28 15:06:26 +01:00
Jim Ferenczi	d6445fae4b	Add a cluster setting to disallow loading fielddata on _id field (#49166 ) This change adds a dynamic cluster setting named `indices.id_field_data.enabled`. When set to `false` any attempt to load the fielddata for the `_id` field will fail with an exception. The default value in this change is set to `false` in order to prevent fielddata usage on this field for future versions but it will be set to `true` when backporting to 7x. When the setting is set to true (manually or by default in 7x) the loading will also issue a deprecation warning since we want to disallow fielddata entirely when https://github.com/elastic/elasticsearch/issues/26472 is implemented. Closes #43599	2019-11-28 09:35:28 +01:00
Ryan Ernst	f288696040	Remove legacy referene to file scripts (#49339 ) This commit removes outdated documentation about a path setting for file scripts which no longer exist. closes #45827	2019-11-27 10:42:33 -08:00
Ryan Ernst	297efa8324	Add JAVA_HOME env override location to docs (#49565 ) This commit clarifies how to override JAVA_HOME from the bundled jdk for deb and rpm installs, which each have their own file that is sourced upon service startup. closes #49068	2019-11-27 10:40:30 -08:00
Martijn van Groningen	0a42395dfa	Backport: add templating support to pipeline processor (#49643 ) Backport of #49030 This commit adds templating support to the pipeline processor's `name` option. Closes #39955	2019-11-27 15:53:40 +01:00

1 2 3 4 5 ...

7681 Commits