OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	049d854360	[ML][Inference] adjust so target_field always has inference result and optionally allow new top classes field in the classification config (#49923 ) (#49982 )	2019-12-09 08:29:45 -05:00
Armin Braun	62e128f02d	Cleanup Old index-N Blobs in Repository Cleanup (#49862 ) (#49902 ) * Cleanup Old index-N Blobs in Repository Cleanup Repository cleanup didn't deal with old index-N, this change adds cleaning up all old index-N found in the repository.	2019-12-09 12:05:55 +01:00
Dimitris Athanasiou	e4f838e764	[7.x][ML] Update expected mem estimate in explain API integ test (#49924 ) (#49979 ) Work in progress in the c++ side is increasing memory estimates a bit and this test fails. At the time of this commit the mem estimate when there is no source query is about 2Mb. So I am relaxing the test to assert memory estimate is less than 1Mb instead of 500Kb. Backport of #49924	2019-12-09 11:52:06 +02:00
cachedout	549b103458	[7.x] APM system_user (#47668 ) (#49912 ) * Add test for APM beats index perms * Grant monitoring index privs to apm_system user * Review feedback * Fix compilation problem	2019-12-09 08:25:03 +00:00
Armin Braun	ac2774c9fa	Use Cluster State to Track Repository Generation (#49729 ) (#49976 ) Step on the road to #49060. This commit adds the logic to keep track of a repository's generation across repository operations. See changes to package level Javadoc for the concrete changes in the distributed state machine. It updates the write side of new repository generations to be fully consistent via the cluster state. With this change, no `index-N` will be overwritten for the same repository ever. So eventual consistency issues around conflicting updates to the same `index-N` are not a possibility any longer. With this change the read side will still use listing of repository contents instead of relying solely on the cluster state contents. The logic for that will be introduced in #49060. This retains the ability to externally delete the contents of a repository and continue using it afterwards for the time being. In #49060 the use of listing to determine the repository generation will be removed in all cases (except for full-cluster restart) as the last step in this effort.	2019-12-09 09:02:57 +01:00
Yannick Welsch	7a2e35caa0	Properly fake corrupted translog (#49918 ) The fake translog corruption in the test sometimes generates invalid translog files where some assertions do not hold (e.g. minSeqNo <= maxSeqNo or minTranslogGen <= translogGen) Closes #49909	2019-12-09 08:33:40 +01:00
Yannick Welsch	01d36afa4b	Randomly run CCR tests with _source disabled (#49922 ) Makes sure that CCR also properly works with _source disabled. Changes one exception in LuceneChangesSnapshot as the case of missing _recovery_source because of a missing lease was not properly bubbled up to CCR (testIndexFallBehind was failing).	2019-12-09 08:33:40 +01:00
Armin Braun	f768f8ddab	Fix TimedRunnable Executing onAfter Twice (#49910 ) (#49930 ) If we have a nested `AbstractRunnable` inside of `TimedRunnable` it's executed twice on `run` (once when its own `run` method is invoked and once when the `onAfter` in the `TimedRunnable` is executed). Simply removing the `onAfter` override in `TimedRunnable` makes sure that the `onAfter` is only called once by the `run` on the nested `AbstractRunnable` itself. Same was done for `onFailure` as it was double-triggering as well on exceptions in the inner `onFailure`.	2019-12-08 17:36:05 +01:00
Armin Braun	8ae11e176a	Cleanup some in o.e.transport (#49901 ) (#49971 ) Cleaning up some obvious compile warnings and dead code.	2019-12-08 16:14:20 +01:00
Costin Leau	5b896c5bb5	SQL: Refactor usage of NamedExpression (#49693 ) To recap, Attributes form the properties of a derived table. Each LogicalPlan has Attributes as output since each one can be part of a query and as such its results are sent to its consumer. This change essentially removes the name id comparison so any changes applied to existing expressions should work as long as the said expressions are semantically equivalent. This change enforces the hashCode and equals which has the side-effect of using hashCode as identifiers for each expression. By removing any property from an Attribute, the various components need to look at the original source for comparison which, while annoying, should prevent a reference from getting out of sync with its source due to optimizations. Essentially going forward there are only 3 types of NamedExpressions: Alias - user-defined (implicit or explicit) name FieldAttribute - field backed by Elasticsearch ReferenceAttribute - a reference to another source acting as an Attribute. Typically the Attribute of an Alias. * Remove the usage of NamedExpression as basis for all Expressions. Instead, restrict their use only for named context, such as projections by using Aliasing instead. * Remove different types of Attributes and allow only FieldAttribute, UnresolvedAttribute and ReferenceAttribute. To avoid issues with rewrites, resolve the references inside the QueryContainer so the information always stays on the source. * Side-effect, simplify the rules as the state for InnerAggs doesn't have to be contained anymore. * Improve ResolveMissingRef rule to handle references to named non-singular expression tree against the same expression used up the tree. #49693 backport to 7.x (cherry picked from commit 5d095e2173bcbf120f534a6f2a584185a7879b57)	2019-12-07 11:02:14 +02:00
Ryan Ernst	e66cfc4369	Fix incorrect use of multiline NOTE in rpm docs (#49962 ) This was a copy/paste error from #49893. This commit converts the NOTE to use inline style instead of one needing closing linebreak.	2019-12-06 17:43:51 -08:00
Ryan Ernst	d29f04209b	Disable repo configuration for rpm based systems (#49893 ) This commit changes the recommended repository file for rpm based systems to be disabled by default. This is a safer practice so upgrades of the system do not accidentally upgrade elasticsearch itself. closes #30660	2019-12-06 15:56:18 -08:00
Ryan Ernst	401c75d8b5	Dump wildfly log on start failure (#49892 ) When testing wildfly with Elasticsearch, we currently dump the wildfly log if the test fails. However, when starting wildfly we may fail to find the port number wildfly started on, and fail with no output. This change dumps the wildfly log when failing to find the http or management ports. relates #49374	2019-12-06 15:55:01 -08:00
Przemko Robakowski	d7083a84f4	Allow list of IPs in geoip ingest processor (#49573 ) (#49947 ) * Allow list of IPs in geoip ingest processor This change lets you use array of IPs in addition to string in geoip processor source field. It will set array containing geoip data for each element in source, unless first_only parameter option is enabled, then only first found will be returned. Closes #46193	2019-12-07 00:19:09 +01:00
Stuart Tettemer	17cda5b2c0	Scripting: Groundwork for caching script results (#49895 ) (#49944 ) In order to cache script results in the query shard cache, we need to check if scripts are deterministic. This change adds a default method to the script factories, `isResultDeterministic() -> false` which is used by the `QueryShardContext`. Script results were never cached and that does not change here. Future changes will implement this method based on whether the results of the scripts are deterministic or not and therefore cacheable. Refs: #49466 Backport	2019-12-06 15:08:05 -07:00
Lee Hinman	8205cdd423	[7.x] Refactor IndexLifecycleRunner to split state modificatio… (#49936 ) This commit refactors the `IndexLifecycleRunner` to split out and consolidate the number of methods that change state from within ILM. It adds a new class `IndexLifecycleTransition` that contains a number of static methods used to modify ILM's state. These methods all return new cluster states rather than making changes themselves (they can be thought of as helpers for modifying ILM state). Rather than having multiple ways to move an index to a particular step (like `moveClusterStateToStep`, `moveClusterStateToNextStep`, `moveClusterStateToPreviouslyFailedStep`, etc (there are others)) this now consolidates those into three with (hopefully) useful names: - `moveClusterStateToStep` - `moveClusterStateToErrorStep` - `moveClusterStateToPreviouslyFailedStep` In the move, I was also able to consolidate duplicate or redundant arguments to these functions. Prior to this commit there were many calls that provided duplicate information (both `IndexMetaData` and `LifecycleExecutionState` for example) where the duplicate argument could be derived from a previous argument with no problems. With this split, `IndexLifecycleRunner` now contains the methods used to actually run steps as well as the methods that kick off cluster state updates for state transitions. `IndexLifecycleTransition` contains only the helpers for constructing new states from given scenarios. This also adds Javadocs to all methods in both `IndexLifecycleRunner` and `IndexLifecycleTransition` (this accounts for almost all of the increase in code lines for this commit). It also makes all methods be as restrictive in visibility, to limit the scope of where they are used. This refactoring is part of work towards capturing actions and transitions that ILM makes, by consolidating and simplifying the places we make state changes, it will make adding operation auditing easier.	2019-12-06 12:55:16 -07:00
Jake Landis	1c5a139968	Update jackson-databind to 2.8.11.4 (#49347 ) (#49937 )	2019-12-06 13:39:33 -06:00
David Roberts	17fa9d5844	[TEST] Mute ConnectionManagerTests.testConcurrentConnectsAndDisconnects Due to https://github.com/elastic/elasticsearch/issues/49903	2019-12-06 17:06:34 +00:00
Alexander Reelsen	d299bf5760	Add tests for ingesting CBOR data attachments (#49715 ) Our docs specifically mention that CBOR is supported when ingesting attachments. However this is not tested anywhere. This adds a test, that uses specifically CBOR format in its IndexRequest and another one that behaves like CBOR in the ingest attachment unit tests.	2019-12-06 14:33:39 +01:00
Przemysław Witek	e60837aa3b	[7.x] Log whole analytics stats when the state assertion fails (#49906 ) (#49911 )	2019-12-06 14:31:17 +01:00
István Zoltán Szabó	63d3933787	[DOCS] Fixes classification evaluation example response. (#49905 )	2019-12-06 13:25:40 +01:00
István Zoltán Szabó	bb91291273	[DOCS] Fixes attribute in transforms overview. (#49898 )	2019-12-06 10:24:29 +01:00
Henning Andersen	1d3feaf18e	Reindex sort deprecation warning take 2 (#49855 ) (#49899 ) Moved the deprecation warning to ReindexValidator to ensure it runs early and works with resilient reindex. Also check that the warning is reported back for wait_for_completion=false. Follow-up to #49458	2019-12-06 09:44:36 +01:00
Hendrik Muhs	b17cfc93e3	[Transform][DOCS]rewrite client ip example to use continuous transform (#49822 ) adapt the transform example for suspicious client ips to use continuous transform	2019-12-06 08:20:48 +01:00
Jack Conradson	cd3744c0b7	Add nodes to handle types (#49785 ) This PR adds 3 nodes to handle types defined by a front-end creating a Painless AST. These types are decided with data immutability in mind - hence the reason for more than a single node.	2019-12-05 17:09:19 -08:00
Mark Vieira	abd6fa149c	Move BuildParams class to 'minimumRuntime' source set (#49890 ) Move BuildParams class to 'minimumRuntime' source set to retain compatibility with build-tools for builds using a Java 8 runtime. Closes #49766 (cherry picked from commit 1059f823acdfa7a2f1f9bff21c7256dae4f3e23c)	2019-12-05 16:32:28 -08:00
Orhan Toy	0f02e02d77	Consistent case in CLI option descriptions (#49635 ) This commit improves the casing of messages in the CLI help descriptions.	2019-12-05 13:36:11 -08:00
Zachary Tong	fec882a457	Decouple pipeline reductions from final agg reduction (#45796 ) Historically only two things happened in the final reduction: empty buckets were filled, and pipeline aggs were reduced (since it was the final reduction, this was safe). Usage of the final reduction is growing however. Auto-date-histo might need to perform many reductions on final-reduce to merge down buckets, CCS may need to side-step the final reduction if sending to a different cluster, etc. Having pipelines generate their output in the final reduce was convenient, but is becoming increasingly difficult to manage as the rest of the agg framework advances. This commit decouples pipeline aggs from the final reduction by introducing a new "top level" reduce, which should be called at the beginning of the reduce cycle (e.g. from the SearchPhaseController). This will only reduce pipeline aggs on the final reduce after the non-pipeline agg tree has been fully reduced. By separating pipeline reduction into their own set of methods, aggregations are free to use the final reduction for whatever purpose without worrying about generating pipeline results which are non-reducible.	2019-12-05 16:11:54 -05:00
Ryan Ernst	721a8b3d9c	Fix external integ test zip dep to expect a zip (#49813 ) When external plugin authors use build-tools, their integ tests depend on the integ-test-zip artifact. However, this dependency was broken in 7.5.0 by accidentally removing the `@zip` qualifier on the maven dependency, which works around the fact the pom for the integ-test-zip claims the artifact is a pom instead of zip packaging. This commit restores the workaround of using `@zip` until the pom can be fixed. closes #49787	2019-12-05 13:04:07 -08:00
Jack Conradson	687c6648d9	Minor Painless Clean Up (#49844 ) This cleans up two minor things. - Cleans up style of == false - Pulls maxLoopCounter into a member variable instead of accessing CompilerSettings multiple times in the SFunction node	2019-12-05 12:20:07 -08:00
Orhan Toy	1641fcd488	[DOCS] Minor typo fixes in reindex.asciidoc (#49863 )	2019-12-05 20:25:11 +01:00
James Baiera	1d97cee315	Support es7 node http publish_address format (#49279 ) (#49839 ) Add parsing support to node http publish_address format cname/ip:port.	2019-12-05 13:53:42 -05:00
Tim Brooks	b281d64e89	Ensure remote strategy settings can be updated (#49812 ) This is related to #49067. As part of this work a new sniff number of node connections setting, a simple addresses setting, and a simple number of sockets setting have been added. This commit ensures that these settings are properly hooked up to support dynamic updates.	2019-12-05 10:39:57 -07:00
István Zoltán Szabó	f4b3bb7d6b	[DOCS] Adds an example of preprocessing actions to the PUT DFA API docs (#49831 )	2019-12-05 14:16:38 +01:00
Jim Ferenczi	495762486d	Fix concurrent issue in SearchPhaseController (#49829 ) The list used by the search progress listener can be nullified by another thread that reports a query result. This change replaces the usage of this list with a new array that is synchronously modified. Closes #49778	2019-12-05 13:09:25 +01:00
István Zoltán Szabó	04e99ff1ee	[DOCS] Fixes typo in the ML anomaly detection time functions docs. (#49834 )	2019-12-05 09:58:30 +01:00
Stuart Tettemer	426c7a5e8f	Scripting: add available languages & contexts API (#49652 ) (#49815 ) Adds `GET /_script_language` to support Kibana dynamic scripting language selection. Response contains whether `inline` and/or `stored` scripts are enabled as determined by the `script.allowed_types` settings. For each scripting language registered, such as `painless`, `expression`, `mustache` or custom, available contexts for the language are included as determined by the `script.allowed_contexts` setting. Response format: ``` { "types_allowed": [ "inline", "stored" ], "language_contexts": [ { "language": "expression", "contexts": [ "aggregation_selector", "aggs" ... ] }, { "language": "painless", "contexts": [ "aggregation_selector", "aggs", "aggs_combine", ... ] } ... ] } ``` Fixes: #49463 Backport	2019-12-04 16:18:22 -07:00
Jack Conradson	dbf6183469	Remove extraneous pass (#49797 ) This removes the storeSettings pass where nodes in the AST could store information they needed out of CompilerSettings for use during later passes. CompilerSettings is part of ScriptRoot which is available during the analysis pass making the storeSettings pass redundant.	2019-12-04 12:18:04 -08:00
Ryan Ernst	ce2ca3bd3d	Fix task input for docker build (#49814 ) The docker build task depends on the docker context being built, but it was not explicitly setup as an input. This commit adds the task as an input to the docker build. relates #49613	2019-12-04 11:57:09 -08:00
Armin Braun	91ac87d75b	Stop Allocating Buffers in CopyBytesSocketChannel (#49825 ) (#49832 ) * Stop Allocating Buffers in CopyBytesSocketChannel (#49825) The way things currently work, we read up to 1M from the channel and then potentially force all of it into the `ByteBuf` passed by Netty. Since that `ByteBuf` tends to by default be `64k` in size, large reads will force the buffer to grow, completely circumventing the logic of `allocHandle`. This seems like it could break `io.netty.channel.RecvByteBufAllocator.Handle#continueReading` since that method for the fixed-size allocator does check whether the last read was equal to the attempted read size. So if we set `64k` because that's what the buffer size is, then write `1M` to the buffer we will stop reading on the IO loop, even though the channel may still have bytes that we can read right away. More importantly though, this can lead to running OOM quite easily under IO pressure as we are forcing the heap buffers passed to the read to `reallocate`. Closes #49699	2019-12-04 19:36:52 +01:00
James Rodewig	42f902977d	[DOCS] Document `minimum_should_match` defaults for `bool` query (#48865 ) Adds documentation for the `minimum_should_match` parameter to the `bool` query docs. Includes docs for the default values: - `1` if the `bool` query includes at least one `should` clause and no `must` or `filter` clauses - `0` otherwise	2019-12-04 12:45:38 -05:00
Dimitris Athanasiou	e3c959b7f1	[7.x][ML][HLRC] DF analytics setVersion and setCreateTime should not be public (#49826 ) (#49833 ) `version` and `create_time` are assigned from the action itself and thus should not be able to be set from the client. Backport of #49826	2019-12-04 18:49:08 +02:00
James Rodewig	87a73b6bdf	[DOCS] Reformat length token filter docs (#49805 ) * Adds a title abbreviation * Updates the description and adds a Lucene link * Reformats the parameters section * Adds analyze, custom analyzer, and custom filter snippets Relates to #44726.	2019-12-04 09:59:08 -05:00
Yannick Welsch	6dcb7fa50e	Add SecureSM support for newer IDEA versions (#49747 ) IntelliJ IDEA moved their JUnit runner to a different package. While this does not break running tests in IDEA, it leads to an ugly exception being thrown at the end of the tests: Exception in thread "main" java.lang.SecurityException: java.lang.System#exit(0) calls are not allowed at org.elasticsearch.secure_sm.SecureSM$2.run(SecureSM.java:248) at org.elasticsearch.secure_sm.SecureSM$2.run(SecureSM.java:215) at java.base/java.security.AccessController.doPrivileged(AccessController.java:310) at org.elasticsearch.secure_sm.SecureSM.innerCheckExit(SecureSM.java:215) at org.elasticsearch.secure_sm.SecureSM.checkExit(SecureSM.java:206) at java.base/java.lang.Runtime.exit(Runtime.java:111) at java.base/java.lang.System.exit(System.java:1781) at com.intellij.rt.junit.JUnitStarter.main(JUnitStarter.java:59) This commit adds support for newer IDEA versions in SecureSM.	2019-12-04 13:50:06 +01:00
Alan Woodward	aa443c6362	[CI] Interval queries cannot be cached if they use scripts (#49824 ) not adjust testCacheability(), which now fails occasionally when given a random interval source containing a script. This commit overrides testCacheability() to explicitly test sources with and without script filters. Fixes #49821	2019-12-04 12:18:04 +00:00
Alan Woodward	312190266e	Improve coverage of equals/hashCode tests for IntervalQueryBuilder (#49820 ) By default, AbstractQueryTestCase only changes name and boost in its mutateInstance method, used when checking equals and hashcode implementations. This commit adds a mutateInstance method to IntervalQueryBuilderTests that will check hashcode and equality when the field or intervals source are changed.	2019-12-04 11:33:24 +00:00
jimczi	53d801c0d7	#49566 Fix non-deterministic sort order in testHighlightingWithKeywordIgnoreBoundaryScanner	2019-12-04 12:23:43 +01:00
Ignacio Vera	44e94555ee	Add reusable HistogramValue object (#49799 ) (#49823 ) Adds a reusable implementation of HistogramValue so we do not create an object per document.	2019-12-04 11:51:53 +01:00
jimczi	1d522c6605	add missing change after backport of #49566	2019-12-04 11:25:47 +01:00
Jim Ferenczi	691421f287	Fix invalid break iterator highlighting on keyword field (#49566 ) By default the unified highlighter splits the input into passages using a sentence break iterator. However we don't check if the field is tokenized or not so `keyword` field also applies the break iterator even though they can only match on the entire content. This means that by default we'll split the content of a `keyword` field on sentence break if the requested number of fragments is set to a value different than 0 (default to 5). This commit changes this behavior to ignore the break iterator on non-tokenized fields (keyword) in order to always highlight the entire values. The number of requested fragments controls how many matched values are returned but the boundary_scanner_type is now ignored. Note that this is the behavior in 6x but some refactoring of the Lucene's highlighter exposed this bug in Elasticsearch 7x.	2019-12-04 11:14:44 +01:00

49221 Commits