OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	2a70df424d	[TEST][ML] Fix assertion after starting df-analytics job (#43957 ) (#43967 ) In MachineLearningIT.testStopDataFrameAnalytics we call start and then assert the state is `started`. However, if things go fast enough, the state could have already changed to `reindexing` or `analyzing`. The test has been failing occasionally due to the state being `reindexing`. We fix this by simply asserting the state is either of `started`, `reindexing` or `analyzing`. Closes #43924	2019-07-04 15:17:36 +03:00
Alpar Torok	3250cc53f0	Mute failing test Tracked in #43924	2019-07-03 17:43:40 +03:00
Christoph Büscher	662f517f4e	Add _reload_search_analyzers endpoint to HLRC (#43733 ) This change adds the new endpoint that allows reloading of search analyzers to the high-level java rest client. Relates to #43313	2019-07-03 12:05:59 +02:00
Dimitris Athanasiou	96b0b27f18	[7.x][ML] Set df-analytics task state to failed when appropriate (#43880 ) (#43906 ) This introduces a `failed` state to which the data frame analytics persistent task is set to when something unexpected fails. It could be the process crashing, the results processor hitting some error, etc. The failure message is then captured and set on the task state. From there, it becomes available via the _stats API as `failure_reason`. The df-analytics stop API now has a `force` boolean parameter. This allows the user to call it for a failed task in order to reset it to `stopped` after we have ensured the failure has been communicated to the user. This commit also adds the analytics version in the persistent task params as this allows us to prevent tasks to run on unsuitable nodes in the future.	2019-07-03 12:41:56 +03:00
Tim Vernum	2a8f30eb9a	Support builtin privileges in get privileges API (#43901 ) Adds a new "/_security/privilege/_builtin" endpoint so that builtin index and cluster privileges can be retrieved via the Rest API Backport of: #42134	2019-07-03 19:08:28 +10:00
Benjamin Trent	fb825a6470	[7.x] [ML][Data Frame] add node attr to GET _stats (#43842 ) (#43894 ) * [ML][Data Frame] add node attr to GET _stats (#43842) * [ML][Data Frame] add node attr to GET _stats * addressing testing issues with node.attributes * adjusting for backport	2019-07-02 19:35:37 -05:00
David Roberts	8e44f5d845	[ML-Data Frame] Add data frame transform cluster privileges to HLRC (#43879 ) Adds the monitor_data_frame_transforms and manage_data_frame_transforms cluster privileges to the high level rest client. The ALL_ARRAY variable is only used in randomized tests at the within the Elasticsearch code, so it's not a major problem that these cluster privileges weren't added from the start. But since ALL_ARRAY is public HLRC users may be using it to find out which cluster privileges exist, so it's best that it contains them all.	2019-07-02 17:52:15 +01:00
Benjamin Trent	82c1ddc117	[7.x] [ML][Data Frame] Add deduced mappings to _preview response payload (#43742 ) (#43849 ) * [ML][Data Frame] Add deduced mappings to _preview response payload (#43742) * [ML][Data Frame] Add deduced mappings to _preview response payload * updating preview docs * fixing code for backport	2019-07-02 06:52:14 -05:00
Yogesh Gaikwad	031d5e96ac	HLRC changes for kerberos grant type (#43642 ) (#43822 ) The TODO from last PR for kerbero grant type was missed. This commit adds the changes for kerberos grant type in HLRC.	2019-07-02 00:55:02 +10:00
Ryan Ernst	3a2c698ce0	Rename Action to ActionType (#43778 ) Action is a class that encapsulates meta information about an action that allows it to be called remotely, specifically the action name and response type. With recent refactoring, the action class can now be constructed as a static constant, instead of needing to create a subclass. This makes the old pattern of creating a singleton INSTANCE both misnamed and lacking a common placement. This commit renames Action to ActionType, thus allowing the old INSTANCE naming pattern to be TYPE on the transport action itself. ActionType also conveys that this class is also not the action itself, although this change does not rename any concrete classes as those will be removed organically as they are converted to TYPE constants. relates #34389	2019-06-30 22:00:17 -07:00
Ryan Ernst	28ab77a023	Add StreamableResponseAction to aid in deprecation of Streamable (#43770 ) The Action base class currently works for both Streamable and Writeable response types. This commit intorduces StreamableResponseAction, for which only the legacy Action implementions which provide newResponse() will extend. This eliminates the need for overriding newResponse() with an UnsupportedOperationException. relates #34389	2019-06-28 21:40:00 -07:00
Martijn van Groningen	9d5c66be41	Migrate watcher hlrc response tests to use AbstractResponseTestCase (#43478 ) Relates to #43472	2019-06-28 21:38:44 +02:00
Benjamin Trent	67a3c656c3	[7.x] [ML][Data Frame] removing format support (#43659 ) (#43747 ) * [ML][Data Frame] removing format support (#43659) * Fixing conflicts	2019-06-28 10:02:37 -05:00
Dimitris Athanasiou	86c853a7c2	[7.x][ML] Rename outlier score setting to feature_influence_threshold (#43705 ) (#43734 ) Renames outlier score setting `minimum_score_to_write_feature_influence` to `feature_influence_threshold`.	2019-06-28 13:28:25 +03:00
Dimitris Athanasiou	cab879118d	[7.x][ML] Support multiple source indices for df-analytics (#43702 ) (#43731 ) This commit adds support for multiple source indices. In order to deal with multiple indices having different mappings, it attempts a best-effort approach to merge the mappings assuming there are no conflicts. In case conflicts exists an error will be returned. To allow users creating custom mappings for special use cases, the destination index is now allowed to exist before the analytics job runs. In addition, settings are no longer copied except for the `index.number_of_shards` and `index.number_of_replicas`.	2019-06-28 13:28:03 +03:00
Christoph Büscher	2cc7f5a744	Allow reloading of search time analyzers (#43313 ) Currently changing resources (like dictionaries, synonym files etc...) of search time analyzers is only possible by closing an index, changing the underlying resource (e.g. synonym files) and then re-opening the index for the change to take effect. This PR adds a new API endpoint that allows triggering reloading of certain analysis resources (currently token filters) that will then pick up changes in underlying file resources. To achieve this we introduce a new type of custom analyzer (ReloadableCustomAnalyzer) that uses a ReuseStrategy that allows swapping out analysis components. Custom analyzers that contain filters that are markes as "updateable" will automatically choose this implementation. This PR also adds this capability to `synonym` token filters for use in search time analyzers. Relates to #29051	2019-06-28 09:55:40 +02:00
Przemysław Witek	94f18da5df	Add version and create_time to data frame analytics config (#43683 ) (#43712 )	2019-06-28 07:37:21 +02:00
Benjamin Trent	34a86cc321	[ML] Allowing stopped status in HLRC testStartStop (#43710 ) (#43719 )	2019-06-27 20:42:43 -05:00
James Rodewig	87566c9324	[DOCS] Change 'X-Pack APIs' section to 'REST APIs' (#43451 )	2019-06-26 13:46:12 -04:00
Benjamin Trent	c121b00c98	[7.x] [ML][Data Frame] Add support for allow_no_match for endpoints (#43490 ) (#43637 ) * [ML][Data Frame] Add support for allow_no_match for endpoints (#43490) * [ML][Data Frame] Add support for allow_no_match parameter in endpoints Adds support for: * Get Transforms * Get Transforms stats * stop transforms * Update DataFrameTransformDocumentationIT.java	2019-06-26 10:09:56 -05:00
Dimitris Athanasiou	126c2fd2d5	[7.x][ML] Machine learning data frame analytics (#43544 ) (#43592 ) This merges the initial work that adds a framework for performing machine learning analytics on data frames. The feature is currently experimental and requires a platinum license. Note that the original commits can be found in the `feature-ml-data-frame-analytics` branch. A new set of APIs is added which allows the creation of data frame analytics jobs. Configuration allows specifying different types of analysis to be performed on a data frame. At first there is support for outlier detection. The APIs are: - PUT _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id} - GET _ml/data_frame/analysis/{id}/_stats - POST _ml/data_frame/analysis/{id}/_start - POST _ml/data_frame/analysis/{id}/_stop - DELETE _ml/data_frame/analysis/{id} When a data frame analytics job is started a persistent task is created and started. The main steps of the task are: 1. reindex the source index into the dest index 2. analyze the data through the data_frame_analyzer c++ process 3. merge the results of the process back into the destination index In addition, an evaluation API is added which packages commonly used metrics that provide evaluation of various analysis: - POST _ml/data_frame/_evaluate	2019-06-25 20:29:11 +03:00
Alpar Torok	09695decb3	Fix failing LicensingDocumentationIT test (#43533 ) This PR brings corrections for cluster name after migrating to testclusters. Not sure how this slipped trough the cracks when converting. Closes #43504	2019-06-25 18:37:36 +03:00
Benjamin Trent	bfd82012e8	[ML][Data Frame] fixing some data frame hlrc tests (#43446 ) (#43491 ) * [ML][Data Frame] fixing some data frame hlrc tests * adding task\|indexer state checks back	2019-06-25 07:29:44 -05:00
Armin Braun	b4ed7f463a	Fix CreateRepository Requeset in HLRC (#43522 ) (#43566 ) * verify = false is the non-default case for this request -> adjusted the code accordingly and expanded the test to cover this case * Closes #43521	2019-06-25 13:04:43 +02:00
Przemysław Witek	e4738587c0	Implement factory methods for ValidationException (#41993 ) Implement factory methods for ValidationException to make the client code more concise (1 LOC vs 3 LOC for a single error scenario)	2019-06-25 13:24:42 +03:00
Martijn van Groningen	101cf384ba	Replace Streamable w/ Writable in AcknowledgedResponse and subclasses (backport 7.x) (#43525 ) This commit replaces usages of Streamable with Writeable for the AcknowledgedResponse and its subclasses, plus associated actions. Note that where possible response fields were made final and default constructors were removed. This is a large PR, but the change is mostly mechanical. Relates to #34389 Backport of #43414	2019-06-24 13:47:37 +02:00
Alpar Torok	ea44da6069	Testclusters: conver remaining x-pack (#43335 ) Convert x-pack tests	2019-06-24 12:07:42 +03:00
Benjamin Trent	f4b75d6d14	[7.x] [ML][Data Frame] Add version and create_time to transform config (#43384 ) (#43480 ) * [ML][Data Frame] Add version and create_time to transform config (#43384) * [ML][Data Frame] Add version and create_time to transform config * s/transform_version/version s/Date/Instant * fixing getter/setter for version * adjusting for backport	2019-06-21 09:11:44 -05:00
Benjamin Trent	77ce3260dd	[ML][Data Frame] make response.count be total count of hits (#43241 ) (#43389 ) * [ML][Data Frame] make response.count be total count of hits * addressing line length check * changing response count for filters * adjusting serialization, variable name, and total count logic * making count mandatory for creation	2019-06-19 16:19:06 -05:00
Benjamin Trent	b333ced5a7	[7.x] [ML][Data Frame] adds new pipeline field to dest config (#43124 ) (#43388 ) * [ML][Data Frame] adds new pipeline field to dest config (#43124) * [ML][Data Frame] adds new pipeline field to dest config * Adding pipeline support to _preview * removing unused import * moving towards extracting _source from pipeline simulation * fixing permission requirement, adding _index entry to doc * adjusting for java 8 compatibility * adjusting bwc serialization version to 7.3.0	2019-06-19 16:18:27 -05:00
James Baiera	1dde6ba1db	Muting DataFrameTransformIT.testGetStats See #43324	2019-06-19 13:58:13 -04:00
Yogesh Gaikwad	2f173402ec	Add kerberos grant_type to get token in exchange for Kerberos ticket (#42847 ) (#43355 ) Kibana wants to create access_token/refresh_token pair using Token management APIs in exchange for kerberos tickets. `client_credentials` grant_type requires every user to have `cluster:admin/xpack/security/token/create` cluster privilege. This commit introduces `_kerberos` grant_type for generating `access_token` and `refresh_token` in exchange for a valid base64 encoded kerberos ticket. In addition, `kibana_user` role now has cluster privilege to create tokens. This allows Kibana to create access_token/refresh_token pair in exchange for kerberos tickets. Note: The lifetime from the kerberos ticket is not used in ES and so even after it expires the access_token/refresh_token pair will be valid. Care must be taken to invalidate such tokens using token management APIs if required. Closes #41943	2019-06-19 18:26:52 +10:00
Ryan Ernst	0a79bf431a	Deprecate native code info in xpack info api (#43297 ) The xpack info api currently returns native code info within each feature. This commit deprecates retrieving that info, which is now available directly in the ML info api.	2019-06-18 07:23:27 -07:00
Przemysław Witek	b2613a123d	[7.x] Report exponential_avg_bucket_processing_time which gives more weight to recent buckets (#43189 ) (#43263 )	2019-06-17 08:58:26 +02:00
Przemysław Witek	65a584b6fb	[7.x] Report timing stats as part of the Job stats response (#42709 ) (#43193 )	2019-06-14 09:03:14 +02:00
Ryan Ernst	5be0fb32f8	Move painless context api spec to test local (#43122 ) The painless context api is internal and currently meant only for use in generating docs. This commit moves the spec file for the api so that it is only used by the test for this api, and not externally by any clients building from the public rest spec.	2019-06-12 08:19:45 -07:00
Ryan Ernst	172cd4dbfa	Remove description from xpack feature sets (#43065 ) The description field of xpack featuresets is optionally part of the xpack info api, when using the verbose flag. However, this information is unnecessary, as it is better left for documentation (and the existing descriptions describe anything meaningful). This commit removes the description field from feature sets.	2019-06-11 09:22:58 -07:00
Henning Andersen	dea935ac31	Reindex max_docs parameter name (#42942 ) Previously, a reindex request had two different size specifications in the body: * Outer level, determining the maximum documents to process * Inside the source element, determining the scroll/batch size. The outer level size has now been renamed to max_docs to avoid confusion and clarify its semantics, with backwards compatibility and deprecation warnings for using size. Similarly, the size parameter has been renamed to max_docs for update/delete-by-query to keep the 3 interfaces consistent. Finally, all 3 endpoints now support max_docs in both body and URL. Relates #24344	2019-06-07 12:16:36 +02:00
Benjamin Trent	02e6acf2d2	[ML] [Data Frame] Adding pending task wait to the hlrc cleanup (#42907 ) (#42930 )	2019-06-06 08:33:49 -05:00
David Roberts	b202a59f88	[ML] Add earliest and latest timestamps to field stats (#42890 ) This change adds the earliest and latest timestamps into the field stats for fields of type "date" in the output of the ML find_file_structure endpoint. This will enable the cards for date fields in the file data visualizer in the UI to be made to look more similar to the cards for date fields in the index data visualizer in the UI.	2019-06-06 08:58:35 +01:00
Gordon Brown	6eb4600e93	Add custom metadata to snapshots (#41281 ) Adds a metadata field to snapshots which can be used to store arbitrary key-value information. This may be useful for attaching a description of why a snapshot was taken, tagging snapshots to make categorization easier, or identifying the source of automatically-created snapshots.	2019-06-05 17:30:31 -06:00
Jason Tedor	117df87b2b	Replicate aliases in cross-cluster replication (#42875 ) This commit adds functionality so that aliases that are manipulated on leader indices are replicated by the shard follow tasks to the follower indices. Note that we ignore write indices. This is due to the fact that follower indices do not receive direct writes so the concept is not useful. Relates #41815	2019-06-04 20:36:24 -04:00
Jason Tedor	aad1b3a2a0	Fix version parsing in various tests (#42871 ) This commit fixes the version parsing in various tests. The issue here is that the parsing was relying on java.version. However, java.version can contain additional characters such as -ea for early access builds. See JEP 233: Name Syntax ------------------------------ -------------- java.version $VNUM(\-$PRE)? java.runtime.version $VSTR java.vm.version $VSTR java.specification.version $VNUM java.vm.specification.version $VNUM Instead, we want java.specification.version.	2019-06-04 18:22:20 -04:00
Mark Vieira	e44b8b1e2e	[Backport] Remove dependency substitutions 7.x (#42866 ) * Remove unnecessary usage of Gradle dependency substitution rules (#42773) (cherry picked from commit 12d583dbf6f7d44f00aa365e34fc7e937c3c61f7)	2019-06-04 13:50:23 -07:00
David Roberts	b61202b0a8	[ML] Add a limit on line merging in find_file_structure (#42501 ) When analysing a semi-structured text file the find_file_structure endpoint merges lines to form multi-line messages using the assumption that the first line in each message contains the timestamp. However, if the timestamp is misdetected then this can lead to excessive numbers of lines being merged to form massive messages. This commit adds a line_merge_size_limit setting (default 10000 characters) that halts the analysis if a message bigger than this is created. This prevents significant CPU time being spent subsequently trying to determine the internal structure of the huge bogus messages.	2019-06-03 13:45:51 +01:00
Alan Woodward	2129d06643	Create client-only AnalyzeRequest/AnalyzeResponse classes (#42197 ) This commit clones the existing AnalyzeRequest/AnalyzeResponse classes to the high-level rest client, and adjusts request converters to use these new classes. This is a prerequisite to removing the Streamable interface from the internal server version of these classes.	2019-06-03 09:46:36 +01:00
Przemyslaw Gomulka	d5061a151a	Remove suppresions for "unchecked" for hamcrest varargs methods Backport(41528) #42749 In hamcrest 2.1 warnings for unchecked varargs were fixed by hamcrest using @SafeVarargs for those matchers where this warning occurred. This PR is aimed to remove these annotations when Matchers.contains ,Matchers.containsInAnyOrder or Matchers.hasItems was used backport #41528	2019-05-31 13:58:49 +02:00
Gordon Brown	e0dbf6e82a	Refactor HLRC RequestConverters parameters to be more explicit (#42128 ) The existing `RequestConverters.Params` is confusing, because it wraps an underlying request object and mutations of the `Params` object actually mutate the `Request` that was used in the construction of the `Params`. This leads to a situation where we create a `RequestConverter.Params` object, mutate it, and then it appears nothing happens to it - it appears to be unused. What happens behind the scenes is that the Request object is mutated when methods on `Params` are invoked. This results in unclear, confusing code where mutating one object changes another with no obvious connection. This commit refactors `RequestConverters.Params` to be a simple helper class to produce a `Map` which must be passed explicitly to a Request object. This makes it apparent that the `Params` are actually used, and that they have an effect on the `request` object explicit and easier to understand. Co-authored-by: Ojas Gulati <ojasgulati100@gmail.com>	2019-05-29 17:08:46 -06:00
kevin fuksman	7c612af6d2	Added param ignore_throttled=false when indicesOptions.ignoreThrottled() is false (#42393 ) and fixed test RequestConvertersTests and added ignore_throttled on all request	2019-05-29 13:45:14 +02:00
Hendrik Muhs	345ff21ae5	[ML-DataFrame] rewrite start and stop to answer with acknowledged (#42589 ) rewrite start and stop to answer with acknowledged fixes #42450	2019-05-29 11:14:32 +02:00
Armin Braun	6166fed6f1	Fix BulkProcessorRetryIT (#41700 ) (#42618 ) * Now that we process the bulk requests themselves on the WRITE threadpool, they can run out of retries too like the item requests even when backoff is active * Fixes #41324 by using the same logic that checks failed item requests for their retry status for the top level bulk requests as well	2019-05-28 17:58:00 +02:00
Hendrik Muhs	6d47ee9268	[ML-DataFrame] add support for fixed_interval, calendar_interval, remove interval (#42427 ) * add support for fixed_interval, calendar_interval, remove interval * adapt HLRC * checkstyle * add a hlrc to server test * adapt yml test * improve naming and doc * improve interface and add test code for hlrc to server * address review comments * repair merge conflict * fix date patterns * address review comments * remove assert for warning * improve exception message * use constants	2019-05-24 20:30:17 +02:00
Luca Cavanna	c2af62455f	Cut over SearchResponse and SearchTemplateResponse to Writeable (#41855 ) Relates to #34389	2019-05-22 18:47:54 +02:00
Luca Cavanna	e747326b04	Adapt low-level REST client to java 8 (#41537 ) As a follow-up to #38540 we can use lambda functions and method references where convenient in the low-level REST client. Also, we need to update the docs to state that the minimum java version required is 1.8.	2019-05-22 18:47:54 +02:00
Guillaume Darmont	3e231bbad6	StackOverflowError when calling BulkRequest#add (#41672 ) Removing of payload in BulkRequest (#39843) had a side effect of making `BulkRequest.add(DocWriteRequest<?>...)` (with varargs) recursive, thus leading to StackOverflowError. This PR adds a small change in RequestConvertersTests to show the error and the corresponding fix in `BulkRequest`. Fixes #41668	2019-05-22 11:22:14 -05:00
Ioannis Kakavas	cdf9485e33	Allow Kibana user to use the OpenID Connect APIs (#42305 ) Add the manage_oidc privilege to the kibana user and to the role privileges list	2019-05-22 09:44:37 +03:00
David Kyle	0fd42ce1f5	[ML Data Frame] Start directly data frame rather than via the scheduler (#42224 ) Trigger indexer start directly to put the indexer in INDEXING state immediately	2019-05-21 15:48:45 +01:00
David Kyle	24144aead2	[ML] Complete the Data Frame task on stop (#41752 ) (#42063 ) Wait for indexer to stop then complete the persistent task on stop. If the wait_for_completion is true the request will not return until stopped.	2019-05-21 10:24:20 +01:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (e.g. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
Jay Modi	dbbdcea128	Update ciphers for TLSv1.3 and JDK11 if available (#42082 ) This commit updates the default ciphers and TLS protocols that are used when the runtime JDK supports them. New cipher support has been introduced in JDK 11 and 12 along with performance fixes for AES GCM. The ciphers are ordered with PFS ciphers being most preferred, then AEAD ciphers, and finally those with mainstream hardware support. When available stronger encryption is preferred for a given cipher. This is a backport of #41385 and #41808. There are known JDK bugs with TLSv1.3 that have been fixed in various versions. These are: 1. The JDK's bundled HttpsServer will endless loop under JDK11 and JDK 12.0 (Fixed in 12.0.1) based on the way the Apache HttpClient performs a close (half close). 2. In all versions of JDK 11 and 12, the HttpsServer will endless loop when certificates are not trusted or another handshake error occurs. An email has been sent to the openjdk security-dev list and #38646 is open to track this. 3. In JDK 11.0.2 and prior there is a race condition with session resumption that leads to handshake errors when multiple concurrent handshakes are going on between the same client and server. This bug does not appear when client authentication is in use. This is JDK-8213202, which was fixed in 11.0.3 and 12.0. 4. In JDK 11.0.2 and prior there is a bug where resumed TLS sessions do not retain peer certificate information. This is JDK-8212885. The way these issues are addressed is that the current java version is checked and used to determine the supported protocols for tests that provoke these issues.	2019-05-20 09:45:36 -04:00
Ed Savage	a68b04e47b	[ML] Improve hard_limit audit message (#42086 ) Improve the hard_limit memory audit message by reporting how many bytes over the configured memory limit the job was at the point of the last allocation failure. Previously the model memory usage was reported, however this was inaccurate and hence of limited use - primarily because the total memory used by the model can decrease significantly after the models status is changed to hard_limit but before the model size stats are reported from autodetect to ES. While this PR contains the changes to the format of the hard_limit audit message it is dependent on modifications to the ml-cpp backend to send additional data fields in the model size stats message. These changes will follow in a subsequent PR. It is worth noting that this PR must be merged prior to the ml-cpp one, to keep CI tests happy.	2019-05-17 17:40:08 -04:00
Benjamin Trent	febee07dcc	[ML] adding pivot.max_search_page_size option for setting paging size (#41920 ) (#42079 ) * [ML] adding pivot.size option for setting paging size * Changing field name to address PR comments * fixing ctor usage * adjust hlrc for field name change	2019-05-10 13:22:31 -05:00
Hendrik Muhs	8823cb65f7	[ML-DataFrame] migrate to PageParams for get and stats, move PageParams into core (#41851 ) migrate hlrc dataframe get and _stats to use PageParams, moves PageParams into core for common usage, fix possible NPE in PageParams	2019-05-07 16:16:22 +02:00
Hendrik Muhs	0d9797847a	remove validation methods in client (#41754 ) remove validation methods in client (#41754)	2019-05-02 20:07:29 +02:00
Benjamin Trent	bc333a5cbf	[ML] data frame, adding builder classes for complex config classes (#41638 ) (#41704 ) * [ML] data frame, adding builder classes for complex config classes * Addressing PR comments, adding some java docs * cleaning up constructor * fixing indentation * change constructors to be package-private	2019-05-01 06:44:29 -05:00
Benjamin Trent	a0990ca239	[ML] cleanup + adding description field to transforms (#41554 ) (#41605 ) * [ML] cleanup + adding description field to transforms * making description length have a max of 1k	2019-04-26 16:50:59 -05:00
David Kyle	1f00cec36f	[Ml-Dataframe] Update URLs in Data frame client java doc (#41539 )	2019-04-26 12:04:18 +01:00
Christoph Büscher	52495843cc	[Docs] Fix common word repetitions (#39703 )	2019-04-25 20:47:47 +02:00
Benjamin Trent	08843ba62b	[ML] Adds progress reporting for transforms (#41278 ) (#41529 ) * [ML] Adds progress reporting for transforms * fixing after master merge * Addressing PR comments * removing unused imports * Adjusting afterKey handling and percentage to be 100* * Making sure it is a linked hashmap for serialization * removing unused import * addressing PR comments * removing unused import * simplifying code, only storing total docs and decrementing * adjusting for rewrite * removing initial progress gathering from executor	2019-04-25 11:23:12 -05:00
Jim Ferenczi	6184efaff6	Handle unmapped fields in _field_caps API (#34071 ) (#41426 ) Today the `_field_caps` API returns the list of indices where a field is present only if this field has different types within the requested indices. However if the request is an index pattern (or an alias, or both...) there is no way to infer the indices if the response contains only fields that have the same type in all indices. This commit changes the response to always return the list of indices in the response. It also adds a way to retrieve unmapped field in a specific section per field called `unmapped`. This section is created for each field that is present in some indices but not all if the parameter `include_unmapped` is set to true in the request (defaults to false).	2019-04-25 18:13:48 +02:00
Luca Cavanna	8a0e5f7b87	Deprecate support for first line empty in msearch API (#41442 ) In order to support empty action metadata in the first msearch item, we need to remove support for prepending msearch request body with an empty line, which prevents us from parsing the empty line as action metadata for the first search item. Relates to #41011	2019-04-25 12:45:18 +02:00
Ryan Ernst	7e3875d781	Upgrade hamcrest to 2.1 (#41464 ) hamcrest has some improvements in newer versions, like FileMatchers that make assertions regarding file exists cleaner. This commit upgrades to the latest version of hamcrest so we can start using new and improved matchers.	2019-04-24 23:40:03 -07:00
Armin Braun	381b8e2ece	Fix BulkProcessor Retry ITs (#41338 ) (#41472 ) * The test fails for the retry backoff enabled case because the retry handler in the bulk processor hasn't been adjusted to account for #40866 which now might lead to an outright rejection of the request instead of its items individually * Fixed by adding retry functionality to the top level request as well * Also fixed the duplicate test for the HLRC that wasn't handling the non-backoff case yet the same way the non-client IT did * closes #41324	2019-04-24 13:46:32 +02:00
Armin Braun	389a13b68e	Mute BulkProcessorRetryIT#testBulkRejectionLoadWithBackoff (#41325 ) (#41331 ) * For #41324	2019-04-18 11:55:28 +02:00
David Kyle	1d2365f5b6	[ML-DataFrame] Refactorings and tidying (#41248 ) Remove unnecessary generic params from SingleGroupSource and unused code from the HLRC	2019-04-17 14:58:26 +01:00
David Roberts	6cc35d3724	[ML] Unmute MachineLearningIT.testDeleteExpiredData (#41186 ) The cause of failure was fixed by elastic/ml-cpp#459, so all that remains on the Java side is to unmute the test that was failing. Closes #41070	2019-04-16 16:38:46 +01:00
Christoph Büscher	2980a6c70f	Clarify some ToXContent implementations behaviour (#41000 ) This change adds either ToXContentObject or ToXContentFragment to classes directly implementing ToXContent currently. This helps in reasoning about whether those implementations output full xcontent object or just fragments. Relates to #16347	2019-04-15 09:42:08 +02:00
Martijn van Groningen	1eff8976a8	Deprecate AbstractHlrc* and AbstractHlrcStreamable* base test classes (#41014 ) * moved hlrc parsing tests from xpack to hlrc module and removed dependency on hlrc from xpack core * deprecated old base test class * added deprecated jdoc tag * split test between xpack-core part and hlrc part * added lang-mustache test dependency, this previously came in via hlrc dependency. * added hlrc dependency on a qa module * duplicated ClusterPrivilegeName class in xpack-core, since x-pack core no longer has a dependency on hlrc. * replace ClusterPrivilegeName usages with string literals * moved tests to dedicated to hlrc packages in order to remove Hlrc part from the name and make sure to use imports instead of full qualified class where possible * remove ESTestCase. from method invocation and use method directly, because these tests indirectly extend from ESTestCase	2019-04-10 16:29:17 +02:00
Ed Savage	722362e402	Mute MachineLearningIt#testDeleteExpiredData Tracked in #41070	2019-04-10 13:07:53 +01:00
Hendrik Muhs	f9018ab11b	[ML-DataFrame] create checkpoints on every new run (#40725 ) Use the checkpoint service to create a checkpoint on every new run. Expose checkpoints stats on _stats endpoint.	2019-04-10 09:14:11 +02:00
Martijn van Groningen	46b0fdae33	Add realistic hlrc request serialization test base class and (#40362 ) changed hlrc ccr request tests to use AbstractRequestTestCase base class. This way the request classes are tested in a more realistic setting. Note this change also adds a test dependency on xpack core module. Similar to #39844 but then for hlrc request serialization tests. Removed iterators from hlrc parsing tests. Use empty xcontent registries. Relates to #39745	2019-04-10 08:00:01 +02:00
Mark Vieira	1287c7d91f	[Backport] Replace usages RandomizedTestingTask with built-in Gradle Test (#40978 ) (#40993 ) * Replace usages RandomizedTestingTask with built-in Gradle Test (#40978) This commit replaces the existing RandomizedTestingTask and supporting code with Gradle's built-in JUnit support via the Test task type. Additionally, the previous workaround to disable all tasks named "test" and create new unit testing tasks named "unitTest" has been removed such that the "test" task now runs unit tests as per the normal Gradle Java plugin conventions. (cherry picked from commit 323f312bbc829a63056a79ebe45adced5099f6e6) * Fix forking JVM runner * Don't bump shadow plugin version	2019-04-09 11:52:50 -07:00
Ed Savage	fdc1bdd4d3	[ML][TEST] Fix randomly failing HLRC test (#40973 ) Made changes to ensure that unique IDs are generated for model snapshots used by the deleteExpiredDataTest test in the MachineLearningIT suite. Previously a sleep of 1s was performed between jobs under the assumption that this would be sufficient to guarantee that the timestamps used in the composition of the snapshot IDs would be different. The new approach is to wait on the condition that the old and new timestamps are in fact different (to 1s resolution).	2019-04-09 16:55:35 +01:00
Martijn van Groningen	040a4961c7	Revert "Revert "Change HLRC CCR response tests to use AbstractResponseTestCase base class. (#40257 )"" (#40971 ) This reverts commit df91237a94fd3d3ae954eb1845c434dda692d087.	2019-04-09 15:18:33 +02:00
Martijn van Groningen	84a410d5a8	Revert "Change HLRC CCR response tests to use AbstractResponseTestCase base class. (#40257 )" This reverts commit `c29027d99e`.	2019-04-08 11:24:06 +02:00
Martijn van Groningen	c29027d99e	Change HLRC CCR response tests to use AbstractResponseTestCase base class. (#40257 ) This way the response classes are tested in a more realistic setting. Relates to #39745	2019-04-08 07:09:28 +02:00
Jay Modi	f34663282c	Update apache httpclient to version 4.5.8 (#40875 ) This change updates our version of httpclient to version 4.5.8, which contains the fix for HTTPCLIENT-1968, which is a bug where the client started re-writing paths that contained encoded reserved characters with their unreserved form.	2019-04-05 13:48:10 -06:00
Martijn van Groningen	809a5f13a4	Make -try xlint warning disabled by default. (#40833 ) Many gradle projects specifically use the -try exclude flag, because there are many cases where auto-closeable resource ignore is never referenced in body of corresponding try statement. Suppressing this warning specifically in each case that it happens using `@SuppressWarnings("try")` would be very verbose. This change removes `-try` from any gradle project and adds it to the build plugin. Also this change removes exclude flags from gradle projects that is already specified in build plugin (for example -deprecation). Relates to #40366	2019-04-05 08:02:26 +02:00
roy	075078e7e0	HLRC: fix uri encode bug when url path starts with '/' (#34436 ) This commit sets the authority of a URI to blank such that it does not misinterpret slashes in the path as the authority. Closes #34433	2019-04-04 12:59:59 -05:00
Michael Basnight	fb5a0652a8	HLRC: Convert xpack methods to client side objects (#40705 ) This commit fixes a problem with BWC that was brought up in #40511. A newer version of the code was emitting a new value for an enum to an older version, and the older version could not handle that. It caused the response to error. The MainResponse is now relaxed, and will accept whatever values the server expose, and holds most of them as Strings instead of complex objects. Fixes #40511	2019-04-04 11:06:44 -05:00
Hendrik Muhs	31e79a73d7	add HLRC protocol tests for transform state and stats (#40766 ) adds HLRC protocol tests for state and stats hrlc clients	2019-04-03 12:51:15 +02:00
Hendrik Muhs	1f947054ff	add reason to DataFrameTransformState and add hlrc protocol tests (#40736 ) add field "reason" to DataFrameTransformState, add hlrc protocol tests and allow unknown fields for DataFrameTransformState	2019-04-03 07:35:07 +02:00
Tim Vernum	2c770ba3cb	Support mustache templates in role mappings (#40571 ) This adds a new `role_templates` field to role mappings that is an alternative to the existing roles field. These templates are evaluated at runtime to determine which roles should be granted to a user. For example, it is possible to specify: "role_templates": [ { "template":{ "source": "_user_{{username}}" } } ] which would mean that every user is assigned to their own role based on their username. You may not specify both roles and role_templates in the same role mapping. This commit adds support for templates to the role mapping API, the role mapping engine, the Java high level rest client, and Elasticsearch documentation. Due to the lack of caching in our role mapping store, it is currently inefficient to use a large number of templated role mappings. This will be addressed in a future change. Backport of: #39984, #40504	2019-04-02 20:55:10 +11:00
Adrien Grand	7f7d09af2e	Deprecate types in `_graph/explore` calls. (#40466 ) (#40513 ) Any call that uses a path that sets a type will trigger a deprecation warning.	2019-03-28 09:32:26 +01:00
Adrien Grand	65a35c985c	Remove type from VersionConflictEngineException. (#37490 ) (#40514 ) It initially mentioned the type in the exception because the type used to be required to uniquely identify a document. This is not necessary anymore given that indices have at most one type.	2019-03-28 09:32:09 +01:00
Andy Bristol	c0c6d702a2	ignore 409 conflict in reindex responses (#39543 ) The reindex family of APIs (reindex, update-by-query, delete-by-query) can sometimes return responses that have an error status code (409 Conflict in this case) but still have a body in the usual BulkByScrollResponse format. When the HLRC tries to handle such responses, it blows up because it tris to parse it expecting the error format that errors in general use. This change prompts the HLRC to parse the response using the expected BulkByScrollResponse format.	2019-03-27 13:27:17 -07:00
David Kyle	c990b30019	[ML] Data Frame HLRC Get API (#40509 )	2019-03-27 12:40:39 +00:00
Benjamin Trent	12943c5d2c	[ML] Add data frame task state object and field (#40169 ) (#40490 ) * [ML] Add data frame task state object and field * A new state item is added so that the overall task state can be accoutned for * A new FAILED state and reason have been added as well so that failures can be shown to the user for optional correction * Addressing PR comments * adjusting after master merge * addressing pr comment * Adjusting auditor usage with failure state * Refactor, renamed state items to task_state and indexer_state * Adding todo and removing redundant auditor call * Address HLRC changes and PR comment * adjusting hlrc IT test	2019-03-27 06:53:58 -05:00
David Kyle	1354696db9	[ML] Data Frame HLRC Get Stats API (#40443 )	2019-03-26 11:17:13 +00:00
Benjamin Trent	7b4f964708	[ML] make source and dest objects in the transform config (#40337 ) (#40396 ) * [ML] make source and dest objects in the transform config * addressing PR comments * Fixing compilation post merge * adding comment for Arrays.hashCode * addressing changes for moving dest to object * fixing data_frame yml tests * fixing API test	2019-03-25 07:16:41 -05:00

1 2 3 4 5 ...

1192 Commits