OpenSearch

Commit Graph

Author	SHA1	Message	Date
Dimitris Athanasiou	c6a5a6d924	[ML] Put ML filter API response should contain the filter (#31362 )	2018-06-15 21:15:35 +01:00
David Kyle	ca00deb8ad	[ML] Re-enable tests muted in #30982	2018-06-15 10:54:13 +01:00
Tanguy Leroux	992c7889ee	Uncouple persistent task state and status (#31031 ) This pull request removes the relationship between the state of persistent task (as stored in the cluster state) and the status of the task (as reported by the Task APIs and used in various places) that have been confusing for some time (#29608). In order to do that, a new PersistentTaskState interface is added. This interface represents the persisted state of a persistent task. The methods used to update the state of persistent tasks are renamed: updatePersistentStatus() becomes updatePersistentTaskState() and now takes a PersistentTaskState as a parameter. The Task.Status type as been changed to PersistentTaskState in all places were it make sense (in persistent task customs in cluster state and all other methods that deal with the state of an allocated persistent task).	2018-06-15 09:26:47 +02:00
Dimitris Athanasiou	9b293275af	[ML] Add description to ML filters (#31330 ) This adds a `description` to ML filters in order to allow users to describe their filters in a human readable form which is also editable (filter updates to be added shortly).	2018-06-14 16:52:32 +01:00
Luca Cavanna	ce245a7320	Remove RestGetAllAliasesAction (#31308 ) We currently have a specific REST action to retrieve all aliaes, which uses internally the get index API. This doesn't seem to be required anymore though as the existing RestGetAliaesAction could as well take the requests with no indices and aliases specified. This commit removes the RestGetAllAliasesAction in favour of using RestGetAliasesAction also for requests that don't specify indices nor aliases. Similar to #31129.	2018-06-14 11:21:16 +02:00
Tanguy Leroux	4d7447cb5e	Reenable Checkstyle's unused import rule (#31270 )	2018-06-14 09:52:46 +02:00
Tom Veasey	66f7dd2c4d	[ML] Update test thresholds to account for changes to memory control (#31289 ) To avoid temporary failures, this also disables these tests until elastic/ml-cpp#122 is committed.	2018-06-13 13:12:53 +01:00
Dimitris Athanasiou	5c77ebe89d	[ML] Implement new rules design (#31110 ) Rules allow users to supply a detector with domain knowledge that can improve the quality of the results. The model detects statistically anomalous results but it has no knowledge of the meaning of the values being modelled. For example, a detector that performs a population analysis over IP addresses could benefit from a list of IP addresses that the user knows to be safe. Then anomalous results for those IP addresses will not be created and will not affect the quantiles either. Another example would be a detector looking for anomalies in the median value of CPU utilization. A user might want to inform the detector that any results where the actual value is less than 5 is not interesting. This commit introduces a `custom_rules` field to the `Detector`. A detector may have multiple rules which are combined with `or`. A rule has 3 fields: `actions`, `scope` and `conditions`. Actions is a list of what should happen when the rule applies. The current options include `skip_result` and `skip_model_update`. The default value for `actions` is the `skip_result` action. Scope is optional and allows for applying filters on any of the partition/over/by field. When not defined the rule applies to all series. The `filter_id` needs to be specified to match the id of the filter to be used. Optionally, the `filter_type` can be specified as either `include` (default) or `exclude`. When set to `include` the rule applies to entities that are in the filter. When set to `exclude` the rule only applies to entities not in the filter. There may be zero or more conditions. A condition requires `applies_to`, `operator` and `value` to be specified. The `applies_to` value can be either `actual`, `typical` or `diff_from_typical` and it specifies the numerical value to which the condition applies. The `operator` (`lt`, `lte`, `gt`, `gte`) and `value` complete the definition. Conditions are combined with `and` and allow to specify numerical conditions for when a rule applies. A rule must either have a scope or one or more conditions. Finally, a rule with scope and conditions applies when all of them apply.	2018-06-13 11:20:38 +01:00
Jason Tedor	0bfd18cc8b	Revert upgrade to Netty 4.1.25.Final (#31282 ) This reverts upgrading to Netty 4.1.25.Final until we have a cleaner solution to dealing with the object cleaner thread.	2018-06-12 19:26:18 -04:00
Dimitris Athanasiou	5f84e18c72	[ML][TEST] Mute tests using rules (#31204 ) This is in preparation of pushing the new rules design in the `ml-cpp` side. These tests will be switched on again after merging in the new rules implementation.	2018-06-12 11:36:26 +01:00
Jason Tedor	563141c6c9	Upgrade to Netty 4.1.25.Final (#31232 ) This commit upgrades us to Netty 4.1.25. This upgrade is more challenging than past upgrades, all because of a new object cleaner thread that they have added. This thread requires an additional security permission (set context class loader, needed to avoid leaks in certain scenarios). Additionally, there is not a clean way to shutdown this thread which means that the thread can fail thread leak control during tests. As such, we have to filter this thread from thread leak control.	2018-06-11 16:55:07 -04:00
Jason Tedor	cb952bd9ec	Enable custom credentials for core REST tests (#31235 ) The core REST tests with security currently use a hardcoded username and password. This is not amenable to running these tests in scenarios where the user controls the creation of the cluster and owns the credentials for this cluster. This commit enables running the core REST tests with security with a custom username and password.	2018-06-11 16:53:40 -04:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Tanguy Leroux	a1916658a9	[Tests] Fix self-referencing tests This commit adapts some test after #31044 has been merged.	2018-06-11 12:45:27 +02:00
Jason Tedor	65c107b47d	Fix unknown licenses (#31223 ) The goal of this commit is to address unknown licenses when producing the dependencies info report. We have two different checks that we run on licenses. The first check is whether or not we have stashed a copy of the license text for a dependency in the repository. The second is to map every dependency to a license type (e.g., BSD 3-clause). The problem here is that the way we were handling licenses in the second check differs from how we handle licenses in the first check. The first check works by finding a license file with the name of the artifact followed by the text -LICENSE.txt. Yet in some cases we allow mapping an artifact name to another name used to check for the license (e.g., we map lucene-.* to lucene, and opensaml-.* to shibboleth. The second check understood the first way of looking for a license file but not the second way. So in this commit we teach the second check about the mappings from artifact names to license names. We do this by copying the configuration from the dependencyLicenses task to the dependenciesInfo task and then reusing the code from the first check in the second check. There were some other challenges here though. For example, dependenciesInfo was checking too many dependencies. For now, we should only be checking direct dependencies and leaving transitive dependencies from another org.elasticsearch artifact to that artifact (we want to do this differently in a follow-up). We also want to disable dependenciesInfo for projects that we do not publish, users only care about licenses they might be exposed to if they use our assembled products. With all of the changes in this commit we have eliminated all unknown licenses. A follow-up will enforce that when we add a new dependency it does not get mapped to unknown, these will be forbidden in the future. Therefore, with this change and earlier changes are left having no unknown licenses and two custom licenses; custom here means it does not map to an SPDX license type. Those two licenses are xz and ldapsdk. A future change will not allow additional custom licenses unless they are explicitly whitelisted. This ensures that if a new dependency is added it is mapped to an SPDX license or mapped to custom because it does not have an SPDX license.	2018-06-09 07:28:41 -04:00
Igor Motov	01140a3ad8	SQL: Make a single JDBC driver jar (#31012 ) Replaces zip archive containing multiple jars with a single JDBC driver jar that shades all external dependencies. Closes #29856	2018-06-08 10:15:28 -04:00
Hendrik Muhs	253b998681	flush job to ensure all results have been written (#31187 ) flush ml job to ensure all results have been written fixes #31173	2018-06-08 07:51:45 +02:00
Nik Everett	dfcc939ef8	QA: Better seed nodes for rolling restart Use all running nodes as unicast seeds in the rolling restart tests to avoid a race between pinging and the tests. Without this if the tests are too fast then when a new node comes up and pings its single configured seed node that node might not have a ping from the other running node.	2018-06-07 13:30:37 -04:00
Nik Everett	56207ea43d	QA: Set better node names on rolling restart tests These should help with debugging failures.	2018-06-07 11:25:41 -04:00
Nik Everett	dc4bb62a78	QA: Remove mistaken timeout I pushed a test that `assertBusy`s for a whole hour accidentally. I was testing something and forgot to revert my local hack but caught it on backport. This removes it.	2018-06-06 13:51:54 -04:00
Nik Everett	7c59e7690e	QA: Switch xpack rolling upgrades to three nodes (#31112 ) This is much more realistic and can find more issues. This causes the "mixed cluster" tests to be run twice so I had to fix the tests to work in that case. In most cases I did as little as possible to get them working but in a few cases I went a little beyond that to make them easier for me to debug while getting them to work. My test changes: 1. Remove the "basic indexing" tests and replace them with a copy of the tests used in the OSS. We have no way of sharing code between these two projects so for now I copy. 2. Skip the a few tests in the "one third" upgraded scenario: * creating a scroll to be reused when the cluster is fully upgraded * creating some ml data to be used when the cluster is fully ugpraded 3. Drop many "assert yellow and that the cluster has two nodes" assertions. These assertions duplicate those made by the wait condition and they fail now that we have three nodes. 4. Switch many "assert green and that the cluster has two nodes" to 3 nodes. These assertions are unique from the wait condition and, while I imagine they aren't required in all cases, now is not the time to find that out. Thus, I made them work. 5. Rework the index audit trail test so it is more obvious that it is the same test expecting different numbers based on the shape of the cluster. The conditions for which number are expected are fairly complex because the index audit trail is shut down until the template for it is upgraded and the template is upgraded when a master node is elected that has the new version of the software. 6. Add some more information to debug the index audit trail test because it helped me figure out what was going on. I also dropped the `waitCondition` from the `rolling-upgrade-basic` tests because it wasn't needed. Closes #25336	2018-06-06 11:59:16 -04:00
Hendrik Muhs	5e48ba7cbd	run overflow forecast a 2nd time as regression test for elastic/ml-cpp#110 (#30969 ) Improve test to run overflow forecast a 2nd time as regression test for elastic/ml-cpp#110	2018-06-05 08:52:06 +02:00
Dimitris Athanasiou	9141108334	[ML][TEST] Fix bucket count assertion in all tests in ModelPlotsIT (#31026 ) This fixes the last remaining test that was missed in #30717. Closes #30715	2018-06-01 10:51:12 +01:00
Luca Cavanna	70749e01c4	Cross Cluster Search: preserve remote status code (#30976 ) In case an error is returned when calling search_shards on a remote cluster, which will lead to throwing an exception in the coordinating node, we should make sure that the status code returned by the coordinating node is the same as the one returned by the remote cluster. Up until now a 500 - Internal Server Error was always returned. This commit changes this behaviour so that for instance if an index is not found, which causes an 404, a 404 is also returned by the coordinating node to the client. Closes #27461	2018-06-01 08:53:53 +02:00
Nik Everett	b225f5e5c6	HLRest: Allow caller to set per request options (#30490 ) This modifies the high level rest client to allow calling code to customize per request options for the bulk API. You do the actual customization by passing a `RequestOptions` object to the API call which is set on the `Request` that is generated by the high level client. It also makes the `RequestOptions` a thing in the low level rest client. For now that just means you use it to customize the headers and the `httpAsyncResponseConsumerFactory` and we'll add node selectors and per request timeouts in a follow up. I only implemented this on the bulk API because it is the first one in the list alphabetically and I wanted to keep the change small enough to review. I'll convert the remaining APIs in a followup.	2018-05-31 13:59:52 -04:00
Ryan Ernst	46e8d97813	Core: Remove RequestBuilder from Action (#30966 ) This commit removes the RequestBuilder generic type from Action. It was needed to be used by the newRequest method, which in turn was used by client.prepareExecute. Both of these methods are now removed, along with the existing users of prepareExecute constructing the appropriate builder directly.	2018-05-31 16:15:00 +02:00
David Roberts	0ff2c605b6	[CI] Mute Ml rolling upgrade test for mixed cluster too It can fail in either the mixed cluster or the upgraded cluster, so it needs to be muted in both. Tracked by #30982	2018-05-31 11:17:18 +01:00
Igor Motov	8e4ab82e3d	[CI] Mute Ml rolling upgrade tests Tracked by #30982	2018-05-30 21:30:18 -04:00
Christoph Büscher	1ea9f11b03	Change ScriptException status to 400 (bad request) (#30861 ) Currently failures to compile a script usually lead to a ScriptException, which inherits the 500 INTERNAL_SERVER_ERROR from ElasticsearchException if it does not contain another root cause. Instead, this should be a 400 Bad Request error. This PR changes this more generally for script compilation errors by changing ScriptException to return 400 (bad request) as status code. Closes #12315	2018-05-30 14:00:07 +02:00
Andy Bristol	ba8bb1d4a1	[test] packaging test logging for suse distros	2018-05-29 11:06:57 -07:00
Ioannis Kakavas	a8faf9768a	Limit the scope of BouncyCastle dependency (#30358 ) Limits the scope of the runtime dependency on BouncyCastle so that it can be eventually removed. * Splits functionality related to reading and generating certificates and keys in two utility classes so that reading certificates and keys doesn't require BouncyCastle. * Implements a class for parsing PEM Encoded key material (which also adds support for reading PKCS8 encoded encrypted private keys). * Removes BouncyCastle dependency for all of our test suites(except for the tests that explicitly test certificate generation) by using pre-generated keys/certificates/keystores.	2018-05-29 19:11:09 +03:00
Igor Motov	dbb2e8143c	SQL: Remove the last remaining server dependencies from jdbc (#30771 ) Removes the last remaining server dependencies from jdbc client. In order to do that it introduces the new project sql-shared-proto that contains only XContent-serializable classes. HTTP Client and JDBC now depend only on sql-shared-proto. I had to keep the original sql-proto project since it is used as a dependency by sql-cli and security integration tests. Relates #29856	2018-05-25 15:41:41 -04:00
David Roberts	aafcd85f50	Move persistent task registrations to core (#30755 ) Persistent tasks was moved from X-Pack to core in #28455. However, registration of the named writables and named X-content was left in X-Pack. This change moves the registration of the named writables and named X-content into core. Additionally, the persistent task actions are no longer registered in the X-Pack client plugin, as they are already registered in ActionModule.	2018-05-24 09:17:17 +01:00
Adrien Grand	a19df4ab3b	Add a `format` option to `docvalue_fields`. (#29639 ) This commit adds the ability to configure how a docvalue field should be formatted, so that it would be possible eg. to return a date field formatted as the number of milliseconds since Epoch. Closes #27740	2018-05-23 14:39:04 +02:00
Luca Cavanna	a17d6cab98	Replace Request#setHeaders with addHeader (#30588 ) Adding headers rather than setting them all at once seems more user-friendly and we already do it in a similar way for parameters (see Request#addParameter).	2018-05-22 20:32:30 +02:00
Costin Leau	dcf0f9f8dd	SQL: Preserve scoring in bool queries (#30730 ) Make all bool constructs use match/should (that is a query context) as that is controlled and changed to a filter context by ES automatically based on the sort order (_doc, field vs _sort) and trackScores. Fix #29685	2018-05-21 21:50:06 +03:00
David Roberts	2b72adc8ac	[TEST] Reduce forecast overflow to disk test memory limit (#30727 ) By default ML native processes are only allowed to use 30% of RAM, so the previous 2GB setting prevented the test passing on VMs with only 4GB RAM. This change reduces the limit to 1200MB, which means it can now pass on VMs with 4GB RAM.	2018-05-18 19:01:43 +01:00
Ryan Ernst	b3f3a4312b	Plugins: Remove meta plugins (#30670 ) Meta plugins existed only for a short time, in order to enable breaking up x-pack into multiple plugins. However, now that x-pack is no longer installed as a plugin, the need for them has disappeared. This commit removes the meta plugins infrastructure.	2018-05-18 10:56:08 -07:00
Dimitris Athanasiou	6bb2a1da22	[ML][TEST] Fix bucket count assertion in ModelPlotsIT (#30717 ) As the first record is random, there's a chance it will be aligned on a bucket start. Thus we need to check the bucket count is in [23, 24]. Closes #30715	2018-05-18 17:59:01 +03:00
Dimitris Athanasiou	1484a31be5	[ML][TEST] Make AutodetectMemoryLimitIT less fragile (#30716 ) These tests aim to check the set model memory limit is respected. Additionally, it was asserting counts of partition, by, over fields in an attempt to check that the used memory is spent meaningfully. However, this made the tests fragile, as changes in the ml-cpp could lead to CI failures. This commit removes those assertions. We are working on adding tests in ml-cpp that will compensate.	2018-05-18 17:57:20 +03:00
Hendrik Muhs	6c313a9871	This implementation lazily (on 1st forecast request) checks for available diskspace and creates a subfolder for storing data outside of Lucene indexes, but as part of the ES data paths. Details: - tmp storage is managed and does not allow allocation if disk space is below a threshold (5GB at the moment) - tmp storage is supposed to be managed by the native component but in case this fails cleanup is provided: - on job close - on process crash - after node crash, on restart - available space is re-checked for every forecast call (the native component has to check again before writing) Note: The 1st path that has enough space is chosen on job open (job close/reopen triggers a new search)	2018-05-18 14:04:09 +02:00
Dimitris Athanasiou	75665a2d3e	[ML] Clean left behind model state docs (#30659 ) It is possible for state documents to be left behind in the state index. This may be because of bugs or uncontrollable scenarios. In any case, those documents may take up quite some disk space when they add up. This commit adds a step in the expired data deletion that is part of the daily maintenance service. The new step searches for state documents that do not belong to any of the current jobs and deletes them. Closes #30551	2018-05-17 17:51:26 +03:00
David Roberts	ef0daee850	[TEST] Account for increase in ML C++ memory usage (#30675 ) Recent changes to the ML C++ have resulted in higher memory usage, so fewer "by" fields can be analyzed in a given amount of model memory.	2018-05-17 12:59:20 +01:00
Tim Vernum	9f824c4aa8	Add detailed assert message to IndexAuditUpgradeIT (#30669 ) Print out the returned buckets if the size does not match the expectation.	2018-05-17 21:36:13 +10:00
Alexander Reelsen	11d776ecf0	Watcher: Fix watch history template for dynamic slack attachments (#30172 ) The part of the history template responsible for slack attachments had a dynamic mapping configured which could lead to problems, when a string value looking like a date was configured in the value field of an attachment. This commit fixes the template by setting this field always to text. This also requires a change in the template numbering to be sure this will be applied properly when starting watcher.	2018-05-17 11:57:54 +02:00
Ryan Ernst	a4c9c2fa2a	Make xpack modules instead of a meta plugin (#30589 ) This commit removes xpack from being a meta-plugin-as-a-module. It also fixes a couple tests which were missing task dependencies, which failed once the gradle execution order changed.	2018-05-16 15:35:57 -07:00
David Kyle	16f5a515f3	[ML] Wait for ML indices in rolling upgrade tests (#30615 )	2018-05-16 09:52:25 +01:00
Ryan Ernst	c7d82b378b	Build: Add task interdependencies for ssl configuration (#30633 ) This commit fixes the tasks creating ssl certs for tests to have correct dependsOn to ensure the right tasks are run before tests run.	2018-05-15 16:09:15 -07:00
Yannick Welsch	af2b9dd779	Revert "Mute ML upgrade test (#30458 )" This reverts commit `4b36ea7433`.	2018-05-15 11:20:57 +02:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00

1 2

95 Commits