OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	a1e7429ccc	Allow sha512 checksum without filename for maven plugins (#52668 ) When installing plugins from remote sources, either the Elastic download service, or maven, a checksum file is downloaded and checked against the downloaded zip. The current format for official plugins is to use a sha512 checksum which includes the zip filename. This format matches that from sha512sum, and allows using the --check argument there to verify the checksum manually. However, when generating checksum files with maven and gradle, the filename is not included. This commit relaxes the requirement that the filename exist within the sha512 checksum file for maven plugins. We continue to strictly enforce that official plugins have the existing format of the file. closes #52413	2020-02-24 13:38:20 -08:00
David Roberts	4cae4ded4b	[TEST] Unmute DebMetadataTests.test05CheckLintian (#52719 ) The underlying problem was fixed in elastic/ml-cpp#1019 Backport of #52696	2020-02-24 21:35:32 +00:00
lcawl	c6e35b460e	[DOCS] Adds anchor for custom rules	2020-02-24 11:39:15 -08:00
Nik Everett	a7fe3329cb	Fix some top_metrics tests (#52575 ) (#52726 ) These tests didn't work properly when run against multi-shard indices. The `_score` based sorting test expects fairly specific scores which isn't going to happen with multiple shards so this disables multiple shards for that test. The other tests were failing due to a fairly sneaky race condition around `_bulk` and type inference. This fixes them by always sending metric values as floating point numbers so Elasticsearch always infers them to be doubles.	2020-02-24 14:30:37 -05:00
Ryan Ernst	5fba8cbc7b	Rename local Environment var in Node to avoid confusion (#52602 ) When the Node class is being constructed, an initial environment is passed in with the initial settings for the node. Once the plugin service is initialized, the final Environment+Settings are created, at which point the initial environment should no longer be used. This commit renames the constructor arg to avoid naming clashes with the final environment variable.	2020-02-24 11:14:46 -08:00
Ryan Ernst	8c295cdc87	Fix sql cli sourcing of x-pack-env (#52613 ) The sql-cli script sources x-pack-env, but it does so assuming the current directory is ES_HOME. This commit alters the source command to use ES_HOME which is available after running elasticsearch-env. closes #47803	2020-02-24 11:13:31 -08:00
Lee Hinman	7d9de8412a	[7.x] fix npe in RestPluginsAction (#52620 ) (de56de9a) (#52721 ) Relates #45321 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Kaihong.Wang <kyra.wkh@alibaba-inc.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-24 11:57:01 -07:00
Mayya Sharipova	034b1c0ba3	Correct boost calculation in script_score query (#52478 ) (#52724 ) Before boost in script_score query was wrongly applied only to the subquery. This commit makes sure that the boost is applied to the whole score that comes out of script. Closes #48465	2020-02-24 13:48:21 -05:00
Aleksandr Maus	a7bdb0b456	EQL: Add integration tests harness to test EQL feature parity with original implementation (#52248 ) (#52675 ) The tests use the original test queries from https://github.com/endgameinc/eql/blob/master/eql/etc/test_queries.toml for EQL implementation correctness validation. The file test_queries_unsupported.toml serves as a "blacklist" for the queries that we do not support. Currently all of the queries are blacklisted. Over time, the expectation is to eventually have an empty "blacklist" when all of the queries are fully supported. The tests use the original test vector from https://raw.githubusercontent.com/endgameinc/eql/master/eql/etc/test_data.json. Only one EQL and the response is stubbed for now to match the expected output from that query. This part would need some tweaking after EQL is fully wired. Related to https://github.com/elastic/elasticsearch/issues/49581	2020-02-24 12:46:59 -05:00
Przemko Robakowski	e72cb79476	Add docs for errors in GetAlias API (#51850 ) (#52716 ) Closes #31499 Co-authored-by: Maxim <timonin.maksim@mail.ru>	2020-02-24 18:22:09 +01:00
Adrien Grand	f993ef80f8	Move the terms index of `_id` off-heap. (#52518 ) In #42838 we moved the terms index of all fields off-heap except the `_id` field because we were worried it might make indexing slower. In general, the indexing rate is only affected if explicit IDs are used, as otherwise Elasticsearch almost never performs lookups in the terms dictionary for the purpose of indexing. So it's quite wasteful to require the terms index of `_id` to be loaded on-heap for users who have append-only workloads. Furthermore I've been conducting benchmarks when indexing with explicit ids on the http_logs dataset that suggest that the slowdown is low enough that it's probably not worth forcing the terms index to be kept on-heap. Here are some numbers for the median indexing rate in docs/s: \| Run \| Master \| Patch \| \| --- \| ------- \| ------- \| \| 1 \| 45851.2 \| 46401.4 \| \| 2 \| 45192.6 \| 44561.0 \| \| 3 \| 45635.2 \| 44137.0 \| \| 4 \| 46435.0 \| 44692.8 \| \| 5 \| 45829.0 \| 44949.0 \| And now heap usage in MB for segments: \| Run \| Master \| Patch \| \| --- \| ------- \| -------- \| \| 1 \| 41.1720 \| 0.352083 \| \| 2 \| 45.1545 \| 0.382534 \| \| 3 \| 41.7746 \| 0.381285 \| \| 4 \| 45.3673 \| 0.412737 \| \| 5 \| 45.4616 \| 0.375063 \| Indexing rate decreased by 1.8% on average, while memory usage decreased by more than 100x. The `http_logs` dataset contains small documents and has a simple indexing chain. More complex indexing chains, e.g. with more fields, ingest pipelines, etc. would see an even lower decrease of indexing rate.	2020-02-24 18:14:12 +01:00
David Kyle	de3d674bb7	Revert "Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart" This reverts commit `c4d91143ac`.	2020-02-24 15:22:49 +00:00
David Kyle	044a4e127a	[ML] Add reason to DataFrameAnalyticsTask setFailed log message (#52659 ) (#52707 )	2020-02-24 15:21:51 +00:00
James Rodewig	5e48811585	[DOCS] Document CCS-supported APIs (#52708 ) Explicitly notes the Elasticsearch API endpoints that support CCS. This should deter users from attempting to use CCS with other API endpoints, such as `GET <index>/_doc/<_id>`.	2020-02-24 09:59:08 -05:00
Alan Woodward	7dc41a3b83	Use BoostQuery rather than FunctionScoreQuery for query-time indices_boost (#52272 ) This is a trivial change, but it should result in a slightly more efficient query boost.	2020-02-24 14:41:46 +00:00
Albert Zaharovits	33131e2dcd	Logfile audit settings validation (#52537 ) Add validation for the following logfile audit settings: xpack.security.audit.logfile.events.include xpack.security.audit.logfile.events.exclude xpack.security.audit.logfile.events.ignore_filters..users xpack.security.audit.logfile.events.ignore_filters..realms xpack.security.audit.logfile.events.ignore_filters..roles xpack.security.audit.logfile.events.ignore_filters..indices Closes #52357 Relates #47711 #47038 Follows the example from #47246	2020-02-24 16:38:16 +02:00
Ignacio Vera	ba9d3c6389	Add support for multipoint shape queries (#52564 ) (#52705 )	2020-02-24 13:46:51 +01:00
James Rodewig	98bcf06bae	[DOCS] Correct multi search API docs (#52523 ) * Adds an example request to the top of the page. * Relocates several parameters erroneously listed under "Request body" to the appropriate "Query parameters" section. * Updates the "Request body" section to better document the NDJSON structure of msearch requests.	2020-02-24 07:43:10 -05:00
Marios Trivyzas	c03f51f68f	[Docs] Clarify default value for `allow_no_indices` (#52635 ) (#52697 ) Add default value to each one of the usages of `allow_no_indices` since it differs between different APIs. Relates to: #52534 (cherry picked from commit 2eb986488ac326d6da6ab8ad0203a94e08684a36)	2020-02-24 11:57:32 +01:00
Martijn van Groningen	225d841212	Improve watcher test by preventing a npe when closing the http client.	2020-02-24 10:23:45 +01:00
Yang Wang	7cefba78c5	License removal leads back to a basic license (#52407 ) (#52683 ) A new basic license will be generated when existing license is deleted. In addition, deleting an existing basic license is a no-op. Resolves: #45022	2020-02-24 11:02:40 +11:00
Nik Everett	d26d7721ea	Continue realizing sorting by aggregations (backport of #52298 ) (#52667 ) This drops more of the `instanceof`s from `AggregationPath`. There are still a couple in `AggregationPath`. And I ended up moving two into `BucketsAggregator`, but I think this is still an improvement!	2020-02-23 17:13:55 -05:00
Mark Vieira	a0aa808c83	Fix broken BWC builds	2020-02-23 08:46:07 -08:00
Mark Vieira	72a2d0f9d8	Skip 'setupPorts' tasks when Docker is unavailable (#52679 )	2020-02-22 18:31:36 -08:00
bellengao	02cb5b6c0e	Return 429 status code on read_only_allow_delete index block (#50166 ) We consider index level read_only_allow_delete blocks temporary since the DiskThresholdMonitor can automatically release those when an index is no longer allocated on nodes above high threshold. The rest status has therefore been changed to 429 when encountering this index block to signal retryability to clients. Related to #49393	2020-02-22 16:24:25 +01:00
Jason Tedor	1685cbe504	Add messages for CCR on license state changes (#52470 ) When a license expires, or license state changes, functionality might be disabled. This commit adds messages for CCR to inform users that CCR functionality will be disabled when a license expires, or when license state changes to a license level lower than trial/platinum/enterprise.	2020-02-22 09:09:42 -05:00
Benjamin Trent	afd90647c9	[ML] Adds feature importance to option to inference processor (#52218 ) (#52666 ) This adds machine learning model feature importance calculations to the inference processor. The new flag in the configuration matches the analytics parameter name: `num_top_feature_importance_values` Example: ``` "inference": { "field_mappings": {}, "model_id": "my_model", "inference_config": { "regression": { "num_top_feature_importance_values": 3 } } } ``` This will write to the document as follows: ``` "inference" : { "feature_importance" : { "FlightTimeMin" : -76.90955548511226, "FlightDelayType" : 114.13514762158526, "DistanceMiles" : 13.731580450792187 }, "predicted_value" : 108.33165831875137, "model_id" : "my_model" } ``` This is done through calculating the [SHAP values](https://arxiv.org/abs/1802.03888). It requires that models have populated `number_samples` for each tree node. This is not available to models that were created before 7.7. Additionally, if the inference config is requesting feature_importance, and not all nodes have been upgraded yet, it will not allow the pipeline to be created. This is to safe-guard in a mixed-version environment where only some ingest nodes have been upgraded. NOTE: the algorithm is a Java port of the one laid out in ml-cpp: https://github.com/elastic/ml-cpp/blob/master/lib/maths/CTreeShapFeatureImportance.cc usability blocked by: https://github.com/elastic/ml-cpp/pull/991	2020-02-21 18:42:31 -05:00
Mark Vieira	f06d692706	[Backport] Consolidate docker availability logic (#52656 )	2020-02-21 15:24:05 -08:00
Jay Modi	8abfda0b59	Rename assertThrows to prevent naming clash (#52651 ) This commit renames ElasticsearchAssertions#assertThrows to assertRequestBuilderThrows and assertFutureThrows to avoid a naming clash with JUnit 4.13+ and static imports of these methods. Additionally, these methods have been updated to make use of expectThrows internally to avoid duplicating the logic there. Relates #51787 Backport of #52582	2020-02-21 13:30:11 -07:00
Mayya Sharipova	3840a763d8	Correct release notes for 7.5 (#52660 ) Remove a mention to a feature that was not merged, as its corresponding PR was closed.	2020-02-21 14:59:46 -05:00
Rory Hunter	ce7ebb2d39	Limit _FILE env var support to specific vars (#52645 ) Backport of #52525. Closes #52503. Implement a list of `_FILE` env vars that will be used to populate env vars with file content, instead of processing all `_FILE` vars in the environment.	2020-02-21 19:36:15 +00:00
Stuart Tettemer	376932a47d	Scripting: split out compile limits and caching (#52498 ) (#52652 ) Phase 1 of adding compilation limits per context. * Refactor rate limiting and caching into separate class, `ScriptCache`, which will be used per context. * Disable compilation limit for certain tests. Backport of 0866031 Refs: #50152	2020-02-21 12:10:51 -07:00
Lisa Cawley	56efd8b44d	[DOCS] Adds certutil http command to TLS setup steps (#51241 ) Co-Authored-By: Ioannis Kakavas <ikakavas@protonmail.com> Co-Authored-By: Tim Vernum <tim@adjective.org>	2020-02-21 10:11:59 -08:00
Jack Conradson	c4d91143ac	Mute RunDataFrameAnalyticsIT.testOutlierDetectionStopAndRestart Relates: #52654	2020-02-21 09:32:19 -08:00
Nik Richers	101bca86d2	[DOCS] Switch to standard ESS trial links (#52552 ) Switches ESS trial sign-up links over to a standard attribute. This provides better metrics for how effective these links are.	2020-02-21 12:07:10 -05:00
Lisa Cawley	4ff78e8a00	[7.x][DOCS] Adds X-Pack usage API (#52592 )	2020-02-21 06:57:11 -08:00
Jay Modi	f3f6ff97ee	Single instance of the IndexNameExpressionResolver (#52604 ) This commit modifies the codebase so that our production code uses a single instance of the IndexNameExpressionResolver class. This change is being made in preparation for allowing name expression resolution to be augmented by a plugin. In order to remove some instances of IndexNameExpressionResolver, the single instance is added as a parameter of Plugin#createComponents and PersistentTaskPlugin#getPersistentTasksExecutor. Backport of #52596	2020-02-21 07:50:02 -07:00
Nik Everett	ed957f35a9	Cover missing case in top_metrics test (#52517 ) The top_metrics test assumed that it'd never end up only reducing unmapped results. But, rarely, it does. This handles that case in the test. Closes #52462	2020-02-21 09:49:17 -05:00
Igor Motov	e5b21a3fc6	Add HLRC for EQL search (#52550 ) Adds EQL HLRC client with the search method. Relates to #51961	2020-02-21 08:44:08 -05:00
James Rodewig	068181b0b6	[DOCS] Add missing `indices` parms returned by `_nodes/stats` (#52055 ) Adds several human-readable `indices` parameters returned by the `_nodes/stats` API.	2020-02-21 08:15:59 -05:00
Hendrik Muhs	288ccae23b	[Transform] add support for filter aggregation (#52483 ) add support for filter aggregations, refactor code for sub-aggregation support in mapping deduction fixes #52151	2020-02-21 14:05:11 +01:00
Andrei Stefan	7fe2843a9e	SQL: specify command to run the CLI on a remote machine without Elasticsearch (#52626 ) (cherry picked from commit 477b0eda8322c5dcb6861bd262bfeec17ff133fe)	2020-02-21 13:29:58 +02:00
James Rodewig	80b77e92d4	[7.x] [DOCS] Re-add redirects for API relocation (#52628 ) Re-adds several redirects removed with #50510. These redirects were related to the relocation of several API docs to new pages under the 'REST APIs' chapter. We've since decided to only remove such redirects with major releases.	2020-02-21 05:32:10 -05:00
markharwood	96d603979b	Upgrade Lucene to 8.5.0-snapshot-b01d7cb (#52584 ) Upgrading 7x to same Lucene 8.5 version used in master	2020-02-21 10:25:03 +00:00
Armin Braun	5a7db0c520	Fix GCS Test testReadLargeBlobWithRetries (#52619 ) (#52624 ) The countdown didn't work well here because it only returns `true` once the countdown reaches `0` but can on subsequent executions return `false` again if a countdown at `0` is counted down again, leading to more than the expected number of simulated failures. Closes #52607	2020-02-21 10:34:53 +01:00
Sean Story	5017bb094e	[Docs]: Fix typo 'Got' -> 'Go' (#52603 ) Fix typo 'Got' -> 'Go' (cherry picked from commit cf7eca270db964c9c474a70da647cb8396f677ba)	2020-02-21 10:25:13 +01:00
Armin Braun	1662cd45a4	Add Region and Signer Algorithm Overrides to S3 Repos (#52112 ) (#52562 ) Exposes S3 SDK signing region and algorithm override settings as requested in #51861. Closes #51861	2020-02-21 10:21:20 +01:00
Armin Braun	0a09e15959	Add Caching for RepositoryData in BlobStoreRepository (#52341 ) (#52566 ) Cache latest `RepositoryData` on heap when it's absolutely safe to do so (i.e. when the repository is in strictly consistent mode). `RepositoryData` can safely be assumed to not grow to a size that would cause trouble because we often have at least two copies of it loaded at the same time when doing repository operations. Also, concurrent snapshot API status requests currently load it independently of each other and so on, making it safe to cache on heap and assume as "small" IMO. The benefits of this move are: * Much faster repository status API calls * listing all snapshot names becomes instant * Other operations are sped up massively too because they mostly operate in two steps: load repository data then load multiple other blobs to get the additional data * Additional cloud cost savings * Better resiliency, saving another spot where an IO issue could break the snapshot * We can simplify a number of spots in the current code that currently pass around the repository data in tricky ways to avoid loading it multiple times in follow ups.	2020-02-21 10:20:07 +01:00
Przemko Robakowski	aff693bc9f	Make FreezeStep retryable (#52540 ) (#52559 ) * Make FreezeStep retryable This change marks `FreezeStep` as retryable and adds test to make sure we can really run it again. * refactor tests Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-02-21 10:11:35 +01:00
Armin Braun	4bb780bc37	Refactor Inflexible Snapshot Repository BwC (#52365 ) (#52557 ) * Refactor Inflexible Snapshot Repository BwC (#52365) Transport the version to use for a snapshot instead of whether to use shard generations in the snapshots in progress entry. This allows making upcoming repository metadata changes in a flexible manner in an analogous way to how we handle serialization BwC elsewhere. Also, exposing the version at the repository API level will make it easier to do BwC relevant changes in derived repositories like source only or encrypted.	2020-02-21 09:14:34 +01:00
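
Commit a1e7429ccc above describes accepting a sha512 checksum file either in the `sha512sum` format (hash followed by the zip filename) or, for maven plugins only, as a bare hash. The following is a minimal Python sketch of that idea, not the actual Elasticsearch implementation (which is Java). The function name `verify_sha512` and the `require_filename` flag are hypothetical, and the parsing assumes filenames without spaces.

```python
import hashlib


def verify_sha512(checksum_text, zip_bytes, zip_name, require_filename):
    """Check zip_bytes against the contents of a .sha512 file.

    Hypothetical helper for illustration only. Accepts the sha512sum
    format ("<hex>  <filename>") and, when require_filename is False,
    also a bare "<hex>" as produced by some maven/gradle tooling.
    """
    fields = checksum_text.strip().split()
    if len(fields) == 2:
        expected, name = fields
        # sha512sum marks binary mode with a leading '*' on the filename
        if name.lstrip("*") != zip_name:
            return False
    elif len(fields) == 1 and not require_filename:
        expected = fields[0]
    else:
        # Empty file, extra fields, or a missing filename when one is required
        return False
    actual = hashlib.sha512(zip_bytes).hexdigest()
    return actual == expected.lower()
```

In this sketch, an official plugin would be checked with `require_filename=True`, while a maven plugin would pass `require_filename=False` so that both checksum formats are accepted.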

50124 Commits

All Branches