OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	8e8d7667cb	[Tests] Fix type inference issue (#44063 )	2019-07-08 17:34:35 +02:00
David Kyle	5fc12917c3	Data frame task failure does not make a 500 response (#44058 ) Data frame task responses had logic to return an HTTP 500 status code if there were any node or task failures, even if other tasks in the same request reported correctly. This differs from how other task responses are handled, where a 200 is always returned, leaving the client to check for failures. Returning a 500 also breaks the high level rest client, so always return a 200. Closes #44011	2019-07-08 11:53:11 +01:00
Ioannis Kakavas	9beb51fc44	Revert "Mute testEnableDisableBehaviour (#42929 )" This reverts commit `6ee578c6eb`.	2019-07-08 08:52:21 +03:00
Nhat Nguyen	9089820d8f	Enable indexing optimization using sequence numbers on replicas (#43616 ) This PR enables the indexing optimization using sequence numbers on replicas. With this optimization, indexing on replicas should be faster and use less memory as it can forgo the version lookup when possible. This change also deactivates the append-only optimization on replicas. Relates #34099	2019-07-05 22:12:08 -04:00
Dimitris Athanasiou	d3ddedf9fc	[7.x][ML] Add missing doc links to df-analytics rest spec and HLRC javadocs (#44025 ) (#44033 )	2019-07-06 02:03:29 +03:00
Mayya Sharipova	37e1ad7062	Forbid empty doc values on vector functions (#43944 ) Currently when a document misses a vector value, vector function returns 0 as a score for this document. We think this is incorrect behaviour. With this change, an error will be thrown if vector functions are used with docs that are missing vector doc values. Also VectorScriptDocValues is modified to allow size() function, which can be used to check if a document has a value for the vector field.	2019-07-05 18:09:06 -04:00
Dimitris Athanasiou	a1a62fded3	[7.x][ML] Stop df-analytics action request should filter tasks (#44016 ) (#44023 ) As a `BaseTasksRequest`, `StopDataFrameAnalyticsAction.Request` should implement a `match` method that makes sure only df-analytics tasks are applied.	2019-07-05 23:10:45 +03:00
Yannick Welsch	504a43d43a	Move ConnectionManager to async APIs (#42636 ) This commit converts the ConnectionManager's openConnection and connectToNode methods to async-style. This will allow us to not block threads anymore when opening connections. This PR also adapts the cluster coordination subsystem to make use of the new async APIs, allowing to remove some hacks in the test infrastructure that had to account for the previous synchronous nature of the connection APIs.	2019-07-05 20:40:22 +02:00
Yannick Welsch	1220ff5b6d	Publish to self through transport (#43994 ) This commit ensures that cluster state publications to self also go through the transport layer. This allows voting-only nodes to intercept the publication to self. Fixes an issue discovered by a test failure where a voting-only node, which was the only bootstrapped node, would not step down as master after state transfer because publishing to self would succeed. Closes #43631	2019-07-05 13:00:52 +02:00
Dimitris Athanasiou	30b20920b9	[7.x][ML] Report correct count for df-analytics get-stats API (#43969 ) (#43981 ) The count should match the number of all df-analytics that matched the id in the request. However, we set the count to the number of df-analytics returned which was bound to the `size` parameter. This commit fixes this by setting the count to the count of the `get` response.	2019-07-05 10:28:57 +03:00
Jim Ferenczi	cdf55cb5c5	Refactor index engines to manage readers instead of searchers (#43860 ) This commit changes the way we manage refreshes in the index engines. Instead of relying on a SearcherManager, this change uses a ReaderManager that creates ElasticsearchDirectoryReader when needed. Searchers are now created on-demand (when acquireSearcher is called) from the current ElasticsearchDirectoryReader. It also slightly changes the Engine.Searcher to extend IndexSearcher in order to simplify the usage in the consumer.	2019-07-04 22:49:43 +02:00
Hendrik Muhs	4128b9b4f7	audit message missing for autostop call onStop when auto stopping (#43984) fixes #43977	2019-07-04 21:40:42 +02:00
Benjamin Trent	36f7259737	[ML] Fix datafeed checks when a concrete remote index is present (#43923 ) A bug was introduced in 6.6.0 when we added support for rollup indices. Rollup caps does NOT support looking at remote indices, consequently, since we always look up rollup caps, the datafeed fails with an error if its config includes a concrete remote index. (When all remote indices in a datafeed config are wildcards the problem did not occur.) The rollups feature does not support remote indices, so if there is any remote index in a datafeed config (wildcarded or not), we can skip the rollup cap checks. This PR implements that change.	2019-07-04 13:31:45 +01:00
Alan Woodward	4b99255fed	Add name() method to TokenizerFactory (#43909 ) This brings TokenizerFactory into line with CharFilterFactory and TokenFilterFactory, and removes the need to pass around tokenizer names when building custom analyzers. As this means that TokenizerFactory is no longer a functional interface, the commit also adds a factory method to TokenizerFactory to make construction simpler.	2019-07-04 11:28:55 +01:00
Alpar Torok	1b6109517a	Mute failing test Tracking in #43960	2019-07-04 12:13:02 +03:00
Benjamin Trent	7063a40411	[7.x] [ML][Data Frame] Adding bwc tests for pivot transform (#43506 ) (#43929 ) * [ML][Data Frame] Adding bwc tests for pivot transform (#43506) * [ML][Data Frame] Adding bwc tests for pivot transform * adding continuous transforms * adding continuous dataframes to bwc * adding continuous data frame tests * Adding rolling upgrade tests for continuous df * Fixing test * Adjusting indices used in BWC, and handling NPE for seq_no_stats * updating and muting specific bwc test * Adjusting bwc tests for backport	2019-07-03 16:39:38 -05:00
Christoph Büscher	662f517f4e	Add _reload_search_analyzers endpoint to HLRC (#43733 ) This change adds the new endpoint that allows reloading of search analyzers to the high-level java rest client. Relates to #43313	2019-07-03 12:05:59 +02:00
Dimitris Athanasiou	96b0b27f18	[7.x][ML] Set df-analytics task state to failed when appropriate (#43880 ) (#43906 ) This introduces a `failed` state to which the data frame analytics persistent task is set when something unexpected fails. It could be the process crashing, the results processor hitting some error, etc. The failure message is then captured and set on the task state. From there, it becomes available via the _stats API as `failure_reason`. The df-analytics stop API now has a `force` boolean parameter. This allows the user to call it for a failed task in order to reset it to `stopped` after we have ensured the failure has been communicated to the user. This commit also adds the analytics version in the persistent task params as this allows us to prevent tasks from running on unsuitable nodes in the future.	2019-07-03 12:41:56 +03:00
Jay Modi	1e0f67fb38	Deprecate transport profile security type setting (#43237 ) This commit deprecates the `transport.profiles.*.xpack.security.type` setting. This setting is used to configure a profile that would only allow client actions. With the upcoming removal of the transport client the setting should also be deprecated so that it may be removed in a future version.	2019-07-03 19:31:55 +10:00
Alexander Reelsen	9077c4402f	Watcher: Allow to execute actions for each element in array (#41997 ) This adds the ability to execute an action for each element that occurs in an array, for example you could send a dedicated slack action for each search hit returned from a search. There is also a limit for the number of actions executed, which is hardcoded to 100 right now, to prevent having watches run forever. The watch history logs each action result and the total number of actions that were executed. Relates #34546	2019-07-03 11:28:50 +02:00
Tim Vernum	2a8f30eb9a	Support builtin privileges in get privileges API (#43901 ) Adds a new "/_security/privilege/_builtin" endpoint so that builtin index and cluster privileges can be retrieved via the Rest API Backport of: #42134	2019-07-03 19:08:28 +10:00
Tim Vernum	deacc2038e	Always attach system user to internal actions (#43902 ) All valid licenses permit security, and the only license state where we don't support security is when there is a missing license. However, for safety we should attach the system (or xpack/security) user to internally originated actions even if the license is missing (or, more strictly, doesn't support security). This allows all nodes to communicate and send internal actions (shard state, handshake/pings, etc) even if a license is transitioning between a broken state and a valid state. Relates: #42215 Backport of: #43468	2019-07-03 19:07:16 +10:00
Tim Vernum	31b19bd022	Use separate BitSet cache in Doc Level Security (#43899 ) Document level security was depending on the shared "BitsetFilterCache" which (by design) never expires its entries. However, when using DLS queries - particularly templated ones - the number (and memory usage) of generated bitsets can be significant. This change introduces a new cache specifically for BitSets used in DLS queries, that has memory usage constraints and access time expiry. The whole cache is automatically cleared if the role cache is cleared. Individual bitsets are cleared when the corresponding lucene index reader is closed. The cache defaults to 50MB, and entries expire if unused for 7 days. Backport of: #43669	2019-07-03 18:04:06 +10:00
Tim Vernum	461aa39daf	Switch WriteActionsTests.testBulk to use hamcrest (#43897 ) If an item in the bulk request fails, that could be for a variety of reasons - it may be that the underlying behaviour of security has changed, or it may just be a transient failure during testing. Simply asserting a `true`/`false` value produces failure messages that are difficult to diagnose and debug. Using hamcrest (`assertThat`) will make it easier to understand the causes of failures in this test. Backport of: #43725	2019-07-03 16:29:28 +10:00
Tim Vernum	14884c871f	Document API-Key APIs require manage_api_key priv (#43869 ) Add the "Authorization" section to the API key API docs. These APIs require the new manage_api_key cluster privilege. Relates: #43865 Backport of: #43811	2019-07-03 13:51:44 +10:00
Jake Landis	6e9ccda2c5	ilm test - allow more time for policy completion (#43844 )	2019-07-02 22:05:18 -05:00
Jake Landis	0a79f4ca70	Extend timeout for TimeSeriesLifecycleActionsIT> testFullPolicy (#43891 )	2019-07-02 22:05:04 -05:00
Mayya Sharipova	756c42f99f	Add dims parameter to dense_vector mapping (#43444 ) (#43895 ) Typically, dense vectors of both documents and queries must have the same number of dimensions. A different number of dimensions among documents or query vectors indicates an error. This PR enforces that all vectors for the same field have the same number of dimensions. It also enforces that query vectors have the same number of dimensions.	2019-07-02 21:14:16 -04:00
Benjamin Trent	fb825a6470	[7.x] [ML][Data Frame] add node attr to GET _stats (#43842 ) (#43894 ) * [ML][Data Frame] add node attr to GET _stats (#43842) * [ML][Data Frame] add node attr to GET _stats * addressing testing issues with node.attributes * adjusting for backport	2019-07-02 19:35:37 -05:00
Benjamin Trent	2c97e26ce8	[ML][Data Frame] fix progress measurement for continuous transforms (#43838 ) (#43887 ) * [ML][Data Frame] fix progress measurement for continuous transforms * Update DataFrameIndexer.java	2019-07-02 19:35:09 -05:00
Jake Landis	eb73bed40d	7x watcher backport testfixes (#43848 ) * fix org.elasticsearch.xpack.watcher.test.integration.RejectedExecutionTests (#41777) This commit un-mutes org.elasticsearch.xpack.watcher.test.integration.RejectedExecutionTests which was failing intermittently due to a logic bug. It is not possible to use the real Watcher scheduler (which is needed for this test) and reliably count the .triggered-watches since the current count of documents in the .triggered-watches index is based on the timing of the scheduler and the ability to delete based on the Watcher and Write thread pools. This commit simply removes the .triggered-watch check and relies solely on the .watcher-history index as an indication that operations can occur when the Watcher threadpool is rejecting. closes #41734 * fix unlikely bug that can prevent Watcher from restarting (#42030) The bug fixed here is unlikely to happen. It requires ES to be started with ILM disabled, Watcher enabled, and Watcher explicitly stopped and restarted. Due to template validation Watcher does not fully start and can result in a partially started state. This is an unlikely scenario outside of the testing framework. Note - this bug was introduced while the test that would have caught it was muted. The test remains muted since the underlying cause of the random failures has not been identified. When this test is un-muted it will now work.	2019-07-02 12:16:06 -05:00
Christoph Büscher	31cf96e7bf	Return reloaded analyzers in _reload_search_analyzers response (#43813 ) Currently the response of the "_reload_search_analyzers" endpoint contains the index names and nodeIds of indices where analyzer reloading was triggered. This change adds the names of the search-time analyzers that were reloaded. Closes #43804	2019-07-02 18:51:15 +02:00
Dimitris Athanasiou	1ea53979b5	[7.x][ML] Get df-analytics action should require monitor privilege (#43831 ) (#43866 )	2019-07-02 16:00:54 +03:00
Tim Vernum	8d099dad38	Add "manage_api_key" cluster privilege (#43865 ) This adds a new cluster privilege for manage_api_key. Users with this privilege are able to create new API keys (as a child of their own user identity) and may also get and invalidate any/all API keys (including those owned by other users). Backport of: #43728	2019-07-02 21:57:42 +10:00
Benjamin Trent	b95ee7ebb2	[7.x] [ML][Data Frame] using transform creation version for node assignment (#43764 ) (#43843 ) * [ML][Data Frame] using transform creation version for node assignment (#43764) * [ML][Data Frame] using transform creation version for node assignment * removing unused imports * Addressing PR comment * adjusting for backport	2019-07-02 06:52:34 -05:00
Benjamin Trent	82c1ddc117	[7.x] [ML][Data Frame] Add deduced mappings to _preview response payload (#43742 ) (#43849 ) * [ML][Data Frame] Add deduced mappings to _preview response payload (#43742) * [ML][Data Frame] Add deduced mappings to _preview response payload * updating preview docs * fixing code for backport	2019-07-02 06:52:14 -05:00
Tanguy Leroux	b977f019b8	Expose translog stats in ReadOnlyEngine (#43752 ) (#43823 ) Backport of #43752 for 7.x.	2019-07-02 13:39:00 +02:00
Ioannis Kakavas	c8ed271937	Use URLEncoder#encode(String, String) as URLEncoder#encode(String, Charset) is only available since Java 10	2019-07-02 14:20:29 +03:00
Ioannis Kakavas	4ea17b76dc	Fix credentials encoding for OIDC token request (#43808 ) As defined in https://tools.ietf.org/html/rfc6749#section-2.3.1 both client id and client secret need to be encoded with the application/x-www-form-urlencoded encoding algorithm when used as credentials for HTTP Basic Authentication in requests to the OP. Resolves #43709	2019-07-02 13:36:00 +03:00
Tomas Della Vedova	4cdb24bceb	Use explicit string keys in data_frame test (#43854 )	2019-07-02 11:06:29 +02:00
Albert Zaharovits	4eb89a6912	UserRoleMapper non-null groups and metadata (#43836 ) This is an odd backport of #41774 UserRoleMapper.UserData is constructed by each realm and it is used to "match" role mapping expressions that eventually supply the role names of the principal. This PR filters out `null` collection values (lists and maps), for the groups and metadata, which get to take part in the role mapping, in preparation for using Java 9 collection APIs. It filters them as soon as possible, during the construction.	2019-07-02 00:10:15 +03:00
Christoph Büscher	fe3f9f0c6b	Yet another `the the` cleanup (#43815 )	2019-07-01 20:22:19 +02:00
Yogesh Gaikwad	031d5e96ac	HLRC changes for kerberos grant type (#43642 ) (#43822 ) The TODO from the last PR for the kerberos grant type was missed. This commit adds the changes for kerberos grant type in HLRC.	2019-07-02 00:55:02 +10:00
Benjamin Trent	8108834534	[ML][Data Frame] account for delay in writing stats docs (#43703 ) (#43819 )	2019-07-01 09:14:44 -05:00
Benjamin Trent	4c95c0c456	[ML][Data Frame] reduce audit frequency, change log msg, and level (#43771 ) (#43818 )	2019-07-01 09:14:26 -05:00
Mark Vieira	13887c01cc	Remove compile-time dependency on test fixtures (#43651 )	2019-07-01 14:59:41 +03:00
Julie Tibshirani	ffa5919d7c	Add support for 'flattened object' fields. (#43762 ) This commit merges the `object-fields` feature branch. The new 'flattened object' field type allows an entire JSON object to be indexed into a field, and provides limited search functionality over the field's contents.	2019-07-01 12:08:50 +03:00
Hendrik Muhs	a58d231f4d	relax trigger count for transform stats test (#43753 ) relax trigger count test as we can not guarantee it due to async behaviour	2019-07-01 10:30:40 +02:00
Alpar Torok	717d14a7e2	Backport: convert x pack qa (#43763 ) * Revert "Revert "Test clusters: convert x-pack qa tests (#43283)" (#43549)" This reverts commit `8d9a971259`. * Fix failing test	2019-07-01 10:38:56 +03:00
Dimitris Athanasiou	3bdb9d5f08	[7.x][ML] Correct df-analytics version introduced to 7.3.0 (#43784 ) (#43795 )	2019-07-01 09:19:04 +03:00
Ryan Ernst	3a2c698ce0	Rename Action to ActionType (#43778 ) Action is a class that encapsulates meta information about an action that allows it to be called remotely, specifically the action name and response type. With recent refactoring, the action class can now be constructed as a static constant, instead of needing to create a subclass. This makes the old pattern of creating a singleton INSTANCE both misnamed and lacking a common placement. This commit renames Action to ActionType, thus allowing the old INSTANCE naming pattern to be TYPE on the transport action itself. ActionType also conveys that this class is also not the action itself, although this change does not rename any concrete classes as those will be removed organically as they are converted to TYPE constants. relates #34389	2019-06-30 22:00:17 -07:00
Dimitris Athanasiou	8f49d01113	[7.x][ML] Rename df-analytics `_id_copy` to `ml__id_copy` (#43754 ) (#43783 ) Renames `_id_copy` to `ml__id_copy` as field names starting with underscore are deprecated. The new field name `ml__id_copy` was chosen as an obscure enough field that users won't have in their data. Otherwise, this field is only intended to be used by df-analytics.	2019-06-30 19:37:00 +03:00
Albert Zaharovits	5e17bc5dcc	Consistent Secure Settings #40416 Introduces a new `ConsistentSecureSettingsValidatorService` service that exposes a single public method, namely `allSecureSettingsConsistent`. The method returns `true` if the local node's secure settings (inside the keystore) are equal to the master's, and `false` otherwise. Technically, the local node has to have exactly the same secure settings - setting names should not be missing or in surplus - for all `SecureSetting` instances that are flagged with the newly introduced `Property.Consistent`. It is worth highlighting that the `allSecureSettingsConsistent` is not a consensus view across the cluster, but rather the local node's perspective in relation to the master.	2019-06-29 23:26:17 +03:00
David Roberts	b599c68d23	[ML] Assert that a no-op job creates no results nor state (#43681 ) If a job is opened and then closed and does nothing in between then it should not persist any results or state documents. This change adapts the no-op job test to assert no results in addition to no state, and to log any documents that cause this assertion to fail. Relates elastic/ml-cpp#512 Relates #43680	2019-06-29 14:57:49 +01:00
Ryan Ernst	28ab77a023	Add StreamableResponseAction to aid in deprecation of Streamable (#43770 ) The Action base class currently works for both Streamable and Writeable response types. This commit introduces StreamableResponseAction, which only the legacy Action implementations that provide newResponse() will extend. This eliminates the need for overriding newResponse() with an UnsupportedOperationException. relates #34389	2019-06-28 21:40:00 -07:00
David Roberts	7951c63b91	[ML] Mark ml-cpp dependency as regularly changing (#43760 ) Since #41817 was merged the ml-cpp zip file for any given version has been cached indefinitely by Gradle. This is problematic, particularly in the case of the master branch where the version 8.0.0-SNAPSHOT will be in use for more than a year. This change tells Gradle that the ml-cpp zip file is a "changing" dependency, and to check whether it has changed every two hours. Two hours is a compromise between checking on every build and annoying developers with slow internet connections and checking rarely causing bug fixes in the ml-cpp code to take a long time to propagate through to elasticsearch PRs that rely on them.	2019-06-28 21:21:18 +01:00
Benjamin Trent	67a3c656c3	[7.x] [ML][Data Frame] removing format support (#43659 ) (#43747 ) * [ML][Data Frame] removing format support (#43659) * Fixing conflicts	2019-06-28 10:02:37 -05:00
Jim Ferenczi	7ca69db83f	Refactor IndexSearcherWrapper to disallow the wrapping of IndexSearcher (#43645 ) This change removes the ability to wrap an IndexSearcher in plugins. The IndexSearcherWrapper is replaced by an IndexReaderWrapper and allows to wrap the DirectoryReader only. This simplifies the creation of the context IndexSearcher that is used on a per request basis. This change also moves the optimization that was implemented in the security index searcher wrapper to the ContextIndexSearcher that now checks the live docs to determine how the search should be executed. If the underlying live docs is a sparse bit set the searcher will compute the intersection between the query and the live docs instead of checking the live docs on every document that matches the query.	2019-06-28 16:28:02 +02:00
Alpar Torok	d1a4d8866d	Add missing dependencies so we can build in parallel (#43672 )	2019-06-28 16:41:18 +03:00
Dimitris Athanasiou	86c853a7c2	[7.x][ML] Rename outlier score setting to feature_influence_threshold (#43705 ) (#43734 ) Renames outlier score setting `minimum_score_to_write_feature_influence` to `feature_influence_threshold`.	2019-06-28 13:28:25 +03:00
Dimitris Athanasiou	cab879118d	[7.x][ML] Support multiple source indices for df-analytics (#43702 ) (#43731 ) This commit adds support for multiple source indices. In order to deal with multiple indices having different mappings, it attempts a best-effort approach to merge the mappings assuming there are no conflicts. If conflicts exist, an error will be returned. To allow users to create custom mappings for special use cases, the destination index is now allowed to exist before the analytics job runs. In addition, settings are no longer copied except for the `index.number_of_shards` and `index.number_of_replicas`.	2019-06-28 13:28:03 +03:00
Christoph Büscher	2cc7f5a744	Allow reloading of search time analyzers (#43313 ) Currently changing resources (like dictionaries, synonym files etc...) of search time analyzers is only possible by closing an index, changing the underlying resource (e.g. synonym files) and then re-opening the index for the change to take effect. This PR adds a new API endpoint that allows triggering reloading of certain analysis resources (currently token filters) that will then pick up changes in underlying file resources. To achieve this we introduce a new type of custom analyzer (ReloadableCustomAnalyzer) that uses a ReuseStrategy that allows swapping out analysis components. Custom analyzers that contain filters that are marked as "updateable" will automatically choose this implementation. This PR also adds this capability to `synonym` token filters for use in search time analyzers. Relates to #29051	2019-06-28 09:55:40 +02:00
Przemysław Witek	94f18da5df	Add version and create_time to data frame analytics config (#43683 ) (#43712 )	2019-06-28 07:37:21 +02:00
Ryan Ernst	5b4089e57e	Remove nodeId from BaseNodeRequest (#43658 ) TransportNodesAction provides a mechanism to easily broadcast a request to many nodes, and collect the responses into a high level response. Each node has its own request type, with a base class of BaseNodeRequest. This base request requires passing the nodeId to which the request will be sent. However, that nodeId is not used anywhere. It is private to the base class, yet serialized to each node, where the node could just as easily find the nodeId of the node it is on locally. This commit removes passing the nodeId through to the node request creation, and guards its serialization so that we can remove the base request class altogether in the future.	2019-06-27 18:45:14 -07:00
Igor Motov	3607876a71	Geo: Makes coordinate validator in libs/geo pluggable (#43657 ) Moves coordinate validation from Geometry constructors into parser. Relates #43644	2019-06-27 19:53:41 -04:00
Nhat Nguyen	ce8771feb7	Do not use MockInternalEngine in GatewayIndexStateIT (#43716 ) GatewayIndexStateIT#testRecoverBrokenIndexMetadata relies on flushing on shutdown. This behaviour, however, can be randomly disabled in MockInternalEngine. Closes #43034	2019-06-27 18:28:04 -04:00
Przemysław Witek	68dbbd8793	Deduplicate two similar TimeUtils classes. (#43697 ) * Deduplicate org.elasticsearch.xpack.core.dataframe.utils.TimeUtils and org.elasticsearch.xpack.core.ml.utils.time.TimeUtils into a common class: org.elasticsearch.xpack.core.common.time.TimeUtils. * Add unit tests for parseTimeField and parseTimeFieldToInstant methods	2019-06-27 18:51:48 +02:00
Yannick Welsch	6744344ef2	Handle situation where only voting-only nodes are bootstrapped (#43628 ) Adds support for the situation where only voting-only nodes are bootstrapped. In that case, they will still try to become elected and bring full master nodes into the cluster.	2019-06-27 18:10:15 +02:00
David Roberts	f39619d182	[ML] Don't write timing stats on no-op (#43680 ) Similar to elastic/ml-cpp#512, if a job opens and closes and does nothing in between we shouldn't write timing stats to the results index.	2019-06-27 16:37:54 +01:00
Jim Ferenczi	329d05f61e	Fix UOE on search requests that match a sparse role query (#43668 ) Search requests executed through the SecurityIndexSearcherWrapper throw an UnsupportedOperationException if they match a sparse role query. When low level cancellation is activated (which is the default since #42857), the context index searcher creates a weight that doesn't handle #scorer. This change fixes this bug and adds a test to ensure that we check this case.	2019-06-27 16:56:56 +02:00
Przemysław Witek	ba518722a2	[7.x] [ML] Tag destination index with data frame metadata (#43567 ) (#43660 )	2019-06-27 08:08:39 +02:00
Benjamin Trent	d05593c3ad	[ML][Data Frame] adds tests for continuous DF (#43601 ) (#43654 )	2019-06-26 14:59:19 -05:00
Benjamin Trent	52e26bbc42	[ML][Data Frame] improve pivot nested field validations (#43548 ) (#43636 ) * [ML][Data Frame] improve pivot nested field validations * addressing pr comments	2019-06-26 13:35:51 -05:00
Armin Braun	c00e305d79	Optimize Selector Wakeups (#43515 ) (#43650 ) * Use atomic boolean to guard wakeups * Don't trigger wakeups from the select loops thread itself for registering and closing channels * Don't needlessly queue writes Co-authored-by: Tim Brooks <tim@uncontended.net>	2019-06-26 20:00:42 +02:00
David Kyle	e1f761dfc7	[Ml Data Frame] Size the GET stats search by number of Ids requested (#43206 ) Set the size of the search request to the number of ids limited by 10,000	2019-06-26 17:01:12 +01:00
Benjamin Trent	c121b00c98	[7.x] [ML][Data Frame] Add support for allow_no_match for endpoints (#43490 ) (#43637 ) * [ML][Data Frame] Add support for allow_no_match for endpoints (#43490) * [ML][Data Frame] Add support for allow_no_match parameter in endpoints Adds support for: * Get Transforms * Get Transforms stats * stop transforms * Update DataFrameTransformDocumentationIT.java	2019-06-26 10:09:56 -05:00
David Roberts	31dc5b7d3a	[TEST] Wait for replicas before stopping nodes in ML distributed test (#43622 ) If we stop a node before replicas exist then the test can fail because we lose a whole index if we stop the node with the primary on.	2019-06-26 11:52:53 +01:00
David Roberts	558e323c89	[ML] Introduce a setting for the process connect timeout (#43234 ) This change introduces a new setting, xpack.ml.process_connect_timeout, to enable the timeout for one of the external ML processes to connect to the ES JVM to be increased. The timeout may need to be increased if many processes are being started simultaneously on the same machine. This is unlikely in clusters with many ML nodes, as we balance the processes across the ML nodes, but can happen in clusters with a single ML node and a high value for xpack.ml.node_concurrent_job_allocations.	2019-06-26 09:22:04 +01:00
Yannick Welsch	2049f715b3	Add voting-only master node (#43410 ) A voting-only master-eligible node is a node that can participate in master elections but will not act as a master in the cluster. In particular, a voting-only node can help elect another master-eligible node as master, and can serve as a tiebreaker in elections. High availability (HA) clusters require at least three master-eligible nodes, so that if one of the three nodes is down, then the remaining two can still elect a master amongst themselves. This only requires one of the two remaining nodes to have the capability to act as master, but both need to have voting powers. This means that one of the three master-eligible nodes can be made voting-only. If this voting-only node is a dedicated master, a less powerful machine or a smaller heap-size can be chosen for this node. Alternatively, a voting-only non-dedicated master node can play the role of the third master-eligible node, which allows running an HA cluster with only two dedicated master nodes. Closes #14340 Co-authored-by: David Turner <david.turner@elastic.co>	2019-06-26 08:07:56 +02:00
Yogesh Gaikwad	480453aa24	Make role descriptors optional when creating API keys (#43481 ) (#43614 ) This commit changes the `role_descriptors` field from required to optional when creating API key. The default behavior in .NET ES client is to omit properties with `null` value requiring additional workarounds. The behavior for the API does not change. Field names (`id`, `name`) in the invalidate api keys API documentation have been corrected where they were wrong. Closes #42053	2019-06-26 14:30:51 +10:00
Yogesh Gaikwad	58179af5af	Enable Kerberos tests (#43519 ) (#43612 ) Now that the fix for the krb5-kdc fixture (entropy problem in docker container) is in and the conversion of `kerberos-tests` to testclusters is done, this enables the kerberos-tests. Closes #40678	2019-06-26 12:55:41 +10:00
Przemysław Witek	76a750a0a0	Remove unused mapStringsOrdered method (#42513 ) (#43585 )	2019-06-25 20:43:38 +02:00
Tanguy Leroux	0dc1c12f13	Fix indices shown in _cat/indices (#43286 ) After two recent changes (#38824 and #33888), the _cat/indices API no longer reports information for active recovering indices and non-replicated closed indices. It also misreports replicated closed indices that are potentially not authorized for the user. This commit changes how the cat action works by first using the Get Settings API in order to resolve authorized indices. It then uses the Cluster State, Cluster Health and Indices Stats APIs to retrieve information about the indices. Closes #39933	2019-06-25 20:02:34 +02:00
Dimitris Athanasiou	126c2fd2d5	[7.x][ML] Machine learning data frame analytics (#43544 ) (#43592 ) This merges the initial work that adds a framework for performing machine learning analytics on data frames. The feature is currently experimental and requires a platinum license. Note that the original commits can be found in the `feature-ml-data-frame-analytics` branch. A new set of APIs is added which allows the creation of data frame analytics jobs. Configuration allows specifying different types of analysis to be performed on a data frame. At first there is support for outlier detection. The APIs are: - PUT _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id}/_stats - POST _ml/data_frame/analysis/{id}/_start - POST _ml/data_frame/analysis/{id}/_stop - DELETE _ml/data_frame/analysis/{id} When a data frame analytics job is started a persistent task is created and started. The main steps of the task are: 1. reindex the source index into the dest index 2. analyze the data through the data_frame_analyzer c++ process 3. merge the results of the process back into the destination index In addition, an evaluation API is added which packages commonly used metrics that provide evaluation of various analysis: - POST _ml/data_frame/_evaluate	2019-06-25 20:29:11 +03:00
Benjamin Trent	970e157eac	[ML][Data Frame] Adjusting error message (#43455 ) (#43580 ) * Adjusting error message * Update TransportPutDataFrameTransformAction.java * Update TransportPutDataFrameTransformAction.java	2019-06-25 10:09:39 -05:00
Przemysław Witek	c702cd7415	[7.x] Implement XContentParser.genericMap and XContentParser.genericMapOrdered methods (#42059 ) (#43575 )	2019-06-25 16:04:54 +02:00
Przemysław Witek	b15e40ffad	Extract TimingStats-related functionality into TimingStatsReporter (#43371 ) (#43557 )	2019-06-25 15:48:39 +02:00
David Roberts	9c285ddbab	[ML] Improve message when native controller cannot connect (#43565 ) The error message if the native controller failed to run (for example due to running Elasticsearch on an unsupported platform) was not easy to understand. This change removes pointless detail from the message and adds some hints about likely causes. Fixes #42341	2019-06-25 12:06:54 +01:00
Lee Hinman	8d9a971259	Revert "Test clusters: convert x-pack qa tests (#43283 )" (#43549 ) This reverts commit `ccaa8c33ba`.	2019-06-24 17:16:29 -06:00
Tim Brooks	38516a4dd5	Move nio ip filter rule to be a channel handler (#43507 ) Currently nio implements ip filtering at the channel context level. This is kind of a hack as the application logic should be implemented at the handler level. This commit moves the ip filtering into a channel handler. This requires adding an indicator to the channel handler to show when a channel should be closed.	2019-06-24 10:03:24 -06:00
Gordon Brown	fac7efba9a	[7.x] Account for node versions during allocation in ILM Shrink (#43300 ) This commit ensures that ILM's Shrink action will take node versions into account when choosing which node to allocate to when shrinking an index. Prior to this change, ILM could pick a node with a lower version than some shards are already allocated to, which causes the new allocation to fail as shards can't be relocated onto a node with a lower version than they are already on. As part of this, when making the decision about which node to allocate to prior to Shrink, all shards in the index are considered, rather than choosing a random shard to consider. Further, the unit tests for the logic that chooses a node to allocate shards to pre-shrink have been improved to validate the behavior in more realistic and varied initial conditions.	2019-06-24 10:02:49 -06:00
Mayya Sharipova	813551e070	Fix eclipse build gradle for vectors project Closes #43496	2019-06-24 09:22:48 -04:00
Martijn van Groningen	101cf384ba	Replace Streamable w/ Writeable in AcknowledgedResponse and subclasses (backport 7.x) (#43525 ) This commit replaces usages of Streamable with Writeable for the AcknowledgedResponse and its subclasses, plus associated actions. Note that where possible response fields were made final and default constructors were removed. This is a large PR, but the change is mostly mechanical. Relates to #34389 Backport of #43414	2019-06-24 13:47:37 +02:00
Alpar Torok	ccaa8c33ba	Test clusters: convert x-pack qa tests (#43283 )	2019-06-24 12:20:46 +03:00
Alpar Torok	ea44da6069	Testclusters: convert remaining x-pack (#43335 ) Convert x-pack tests	2019-06-24 12:07:42 +03:00
Benjamin Trent	f4b75d6d14	[7.x] [ML][Data Frame] Add version and create_time to transform config (#43384 ) (#43480 ) * [ML][Data Frame] Add version and create_time to transform config (#43384) * [ML][Data Frame] Add version and create_time to transform config * s/transform_version/version s/Date/Instant * fixing getter/setter for version * adjusting for backport	2019-06-21 09:11:44 -05:00
David Kyle	73221d2265	[ML] Resolve NetworkDisruptionIT (#43441 ) After the network disruption, a partition is created, one side of which can form a cluster while the other can't. Ensure requests are sent to a node on the correct side of the cluster.	2019-06-21 10:24:02 +01:00
Simon Willnauer	424ef4f158	SecurityIndexSearcherWrapper doesn't always carry over caches and similarity (#43436 ) If DocumentLevelSecurity is enabled SecurityIndexSearcherWrapper doesn't carry over the cache, cache policy and similarity from the incoming searcher.	2019-06-21 10:19:10 +02:00
Tim Vernum	059eb55108	Use SecureString for password length validation (#43465 ) This replaces the use of char[] in the password length validation code with the use of SecureString. Although the use of char[] is not in itself problematic, using a SecureString encourages callers to think about the lifetime of the password object and to clear it after use. Backport of: #42884	2019-06-21 17:11:07 +10:00
Armin Braun	21515b9ff1	Fix IpFilteringIntegrationTests (#43019 ) (#43434 ) * Increase timeout to 5s since we saw 500ms+ GC pauses on CI * closes #40689	2019-06-20 22:31:59 +02:00


3467 Commits