OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Roberts	3f8d16304c	Add ML admin permissions to the kibana_system role (#58172 ) As part of the "ML in Spaces" project, access to the ML UI in Kibana is migrating to being controlled by Kibana privileges. The ML UI will check whether the logged-in user has permission to do something ML-related using Kibana privileges, and if they do will call the relevant ML Elasticsearch API using the Kibana system user. In order for this to work the kibana_system role needs to have administrative access to ML. Backport of #58061	2020-06-17 17:03:32 +01:00
Benjamin Trent	2de242f80e	[ML] rename EnsembleSizeInfo#inputFieldNameLengths to this.featureNameLengths (#58241 ) (#58253 )	2020-06-17 10:08:55 -04:00
Benjamin Trent	69338b03d7	[ML] expand data_streams when assigning datafeed to node (#58175 ) (#58242 )	2020-06-17 08:34:34 -04:00
Ignacio Vera	2d3d7ab387	mute CentroidCalculatorTests#testPolygonAsPoint (#58249 ) (#58250 )	2020-06-17 14:32:13 +02:00
Jason Tedor	b78b3edeea	Upgrade to JNA 5.5.0 (#58183 ) This commit bumps our JNA dependency from 4.5.1 to 5.5.0, so that we are now on the latest maintained line, and pick up a large collection of bug fixes that have accumulated.	2020-06-17 07:35:08 -04:00
Dimitris Athanasiou	36dbf08d47	[7.x][ML] Improve stability of stratified splitter tests (#58180 ) (#58224 ) The main improvement here is that the total expected count of training rows in the test is calculated as the sum of the training fraction times the cardinality of each class (instead of the training fraction times the total doc count). Also relaxes slightly the error bound on the uniformity test from 0.12 to 0.13. Closes #54122 Backport of #58180	2020-06-17 12:40:21 +03:00
Andrei Dan	e17c51151b	[7.x] ILM: don't take snapshot of a data stream's write index (#58159 ) (#58222 ) We don't allow converting a data stream's writeable index into a searchable snapshot. We are currently preventing swapping a data stream's write index with the restored index. This adds another step that will not proceed with the searchable snapshot action until the managed index is not the write index of a data stream anymore. (cherry picked from commit ccd618ead7cf7f5a74b9fb34524d00024de1479a) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-06-17 09:45:16 +01:00
Ignacio Vera	7080ba5b05	Check for degenerated lines when calculating the centroid (#58216 )	2020-06-17 09:34:49 +02:00
Przemysław Witek	b22e91cefc	[7.x] Delete auto-generated annotations when job is deleted. (#58169 ) (#58219 )	2020-06-17 09:17:20 +02:00
Stuart Tettemer	01795d1925	Revert "Scripting: Deprecate general cache settings (#55753 )" (#58201 ) This reverts commit `88e8b34fc2`.	2020-06-16 14:58:18 -06:00
Stuart Tettemer	88e8b34fc2	Scripting: Deprecate general cache settings (#55753 ) Backport: ef543b0	2020-06-16 13:06:59 -06:00
Benjamin Trent	081da09c72	Allow GET <pattern>/_rollup/data to expand data streams (#58173 ) (#58177 )	2020-06-16 14:01:54 -04:00
Benjamin Trent	3309817d18	[ML] fixing tree inference ctor to allow target_type to be optional (#58132 ) (#58165 ) The tree trained model object will set its target_type to be regression by default. This updates the inference object to behave the same way.	2020-06-16 13:29:11 -04:00
Benjamin Trent	6c03d97419	Mute TimeSeriesDataStreamsIT.testSearchableSnapshotAction (#58127 ) (#58181 ) Co-authored-by: Andrei Dan <andrei.dan@elastic.co>	2020-06-16 12:40:38 -04:00
Alan Woodward	12a3f6dfca	MappedFieldType should not extend FieldType (#58160 ) MappedFieldType is a combination of two concerns: * an extension of lucene's FieldType, defining how a field should be indexed * a set of query factory methods, defining how a field should be searched We want to break these two concerns apart. This commit is a first step to doing this, breaking the inheritance relationship between MappedFieldType and FieldType. MappedFieldType instead has a series of boolean flags defining whether or not the field is searchable or aggregatable, and FieldMapper has a separate FieldType passed to its constructor defining how indexing should be done. Relates to #56814	2020-06-16 16:56:43 +01:00
Dan Hermann	7079a3b09f	[7.x] Prohibit freezing the write index of a data stream (#58168 )	2020-06-16 09:37:32 -05:00
Yannick Welsch	1e235a7f55	Fix off-by-one on CCR lease (#58158 ) The leases issued by CCR keep one extra operation around on the leader shards. This is not harmful to the leader cluster, but means that there's potentially one delete that can't be cleaned up.	2020-06-16 14:04:58 +02:00
David Turner	423697f414	Default to zero replicas for searchable snapshots (#57802 ) Today a mounted searchable snapshot defaults to having the same replica configuration as the index that was snapshotted. This commit changes this behaviour so that we default to zero replicas on these indices, but allow the user to override this in the mount request. Relates #50999	2020-06-16 10:12:23 +01:00
Tal Levy	69d5e044af	Add optional description parameter to ingest processors. (#57906 ) (#58152 ) This commit adds an optional field, `description`, to all ingest processors so that users can explain the purpose of the specific processor instance. Closes #56000.	2020-06-15 19:27:57 -07:00
markharwood	03dd73dc0d	Fix for wildcard fields that returned ByteRefs not Strings to scripts. (#58060 ) (#58109 ) This need some reorg of BinaryDV field data classes to allow specialisation of scripted doc values. Moved common logic to a new abstract base class and added a new subclass to return string-based representations to scripts. Closes #58044	2020-06-15 14:52:56 +01:00
Alejandro Fernández Haro	3d0c8da66d	Add monitor and view_index_metadata to the built-in `kibana_system` role (#57755 ) Allows the kibana user to collect data telemetry in a background task by giving the kibana_system built-in role the view_index_metadata and monitoring privileges over all indices (*).	2020-06-15 14:40:27 +03:00
Shaunak Kashyap	5e2faad783	Add ILM policy PUT and GET for remote_monitoring_agent built-in role (#57963 ) Without this fix, users who try to use Metricbeat for Stack Monitoring today see the following error repeatedly in their Metricbeat log. Due to this error Metricbeat is unwilling to proceed further and, thus, no Stack Monitoring data is indexed into the Elasticsearch cluster. Co-authored-by: Albert Zaharovits <albert.zaharovits@elastic.co>	2020-06-15 14:35:30 +03:00
Rene Groeschke	01e9126588	Remove deprecated usage of testCompile configuration (#57921 ) (#58083 ) * Remove usage of deprecated testCompile configuration * Replace testCompile usage by testImplementation * Make testImplementation non transitive by default (as we did for testCompile) * Update CONTRIBUTING about using testImplementation for test dependencies * Fail on testCompile configuration usage	2020-06-14 22:30:44 +02:00
Jason Tedor	dcf4131f00	Revert "Add JNA license to SQL CLI dependency licenses" This reverts commit `076b32d4f3`.	2020-06-12 17:04:39 -04:00
Dan Hermann	17f3318732	[7.x] Resolve index API (#58037 )	2020-06-12 15:41:32 -05:00
Jason Tedor	076b32d4f3	Add JNA license to SQL CLI dependency licenses Previously we excluded requiring licenses for dependencies with the group name org.elasticsearch under the assumption that these use the top-level Elasticsearch license. This is not always correct, for example, for the org.elasticsearch:jna dependency as this is merely a wrapper around the upstream JNA project, and that is the license that we should be including. A recent change modified this check from using the group name to checking only if the dependency is a project dependency. This exposed the use of JNA in SQL CLI to this check, but the license for it was not added. This commit addresses this by adding the license. Relates #58015	2020-06-12 16:38:23 -04:00
Benjamin Trent	79c784932f	[ML] allow feature_names to be optional in ensemble inference model (#58059 ) (#58067 ) This has `EnsembleInferenceModel` not parse feature_names from the XContent. Instead, it will rely on `rewriteFeatureIndices` to be called ahead time. Consequently, protections are made for a fail fast path if `rewriteFeatureIndices` has not been called before `infer`.	2020-06-12 16:33:54 -04:00
Ignacio Vera	c518670f83	Fix Geo grid aggregation circuit breaker tests (#58028 ) (#58042 ) This commit makes sure we create index with only one shard.	2020-06-12 15:39:27 +02:00
Martijn van Groningen	01d8bb8cfa	Enforce valid field mapping exists for timestamp_field in templates. (#58036 ) Backport of #57741 to 7.x branch. Relates to #53100	2020-06-12 15:24:42 +02:00
David Roberts	93b693527a	[7.x][ML] Add categorizer stats ML result type (#58001 ) This type of result will store stats about how well categorization is performing. When per-partition categorization is in use, separate documents will be written for every partition so that it is possible to see if categorization is working well for some partitions but not others. This PR is a minimal implementation to allow the C++ side changes to be made. More Java side changes related to per-partition categorization will be in followup PRs. However, even in the long term I do not see a major benefit in introducing dedicated APIs for querying categorizer stats. Like forecast request stats the categorizer stats can be read directly from the job's results alias. Backport of #57978	2020-06-12 12:08:07 +01:00
markharwood	2da8e57f59	Search - add range query support to wildcard field (#57881 ) (#57988 ) Backport to add range query support to wildcard field Closes #57816	2020-06-12 11:30:54 +01:00
David Kyle	39020f3900	HLRC for delete expired data by job Id (#57722 ) (#57975 ) High level rest client changes for #57337	2020-06-12 09:44:17 +01:00
Mark Tozzi	36f551bdb4	Make ValuesSourceConfig behave like a config object (#57762 ) (#58012 )	2020-06-11 17:23:55 -04:00
Benjamin Trent	2881995a45	[ML] adding new inference model size estimate handling from native process (#57930 ) (#57999 ) Adds support for reading in `model_size_info` objects. These objects contain numeric values indicating the model definition size and complexity. Additionally, these objects are not stored or serialized to any other node. They are to be used for calculating and storing model metadata. They are much smaller on heap than the true model definition and should help prevent the analytics process from using too much memory. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-06-11 15:59:23 -04:00
Alan Woodward	16e230dcb8	Update to lucene snapshot e7c625430ed (#57981 ) Includes LUCENE-9148 and LUCENE-9398, which splits the BKD metadata, index and data into separate files and keeps the index off-heap.	2020-06-11 14:51:53 +01:00
David Roberts	54d4f2a623	[ML] Refresh annotations index on job flush and close (#57979 ) Now that annotations are part of the anomaly detection job results the annotations index should be refreshed on flushing and closing the job so that flush and close continue to fulfil their contracts that immediately after returning all results the job generated up to that point are searchable.	2020-06-11 12:29:04 +01:00
David Kyle	b87b147704	Add models for search to ModelLoadingService (#57592 ) (#57919 ) ModelLoadingService only caches models if they are referenced by an ingest pipeline. For models used in search we want to always cache the models and rely on TTL to evict them. Additionally when an ingest pipeline is deleted the model it references should not be evicted if it is used in search.	2020-06-11 10:48:37 +01:00
David Kyle	2905a2f623	Use Search After job iterators (#57875 ) (#57923 ) Search after is a better choice for the delete expired data iterators where processing takes a long time as unlike scroll a context does not have to be kept alive. Also changes the delete expired data endpoint to 404 if the job is unknown	2020-06-11 10:06:18 +01:00
Costin Leau	ff0ea62cb8	EQL: Fix casing for tiebreaker field (#57943 ) Use tiebreaker instead of tieBreaker (cherry picked from commit 3c774948a5d5e10fac267cb9a54f5d0559a00c1d)	2020-06-11 00:10:19 +03:00
Albert Zaharovits	c57ccd99f7	Just log 401 stacktraces (#55774 ) Ensure stacktraces of 401 errors for unauthenticated users are logged but not returned in the response body.	2020-06-10 20:39:32 +03:00
Valeriy Khakhutskyy	c0f368bbf3	[7.x][ML] Adjust assertion for job case memory usage estimates (#57929 ) Since we change the memory estimates for data frame analytics jobs from worst case to a realistic case, the strict less-than assertion in the test does not hold anymore. I replaced it with a less-or-equal-than assertion. Backport or #57882	2020-06-10 15:17:16 +02:00
Aleksandr Maus	ec60335496	EQL: implement case sensitivity for indexOf and endsWith string functions (#57707 ) (#57908 ) * EQL: implement case sensitivity for indexOf and endsWith string functions	2020-06-10 08:55:49 -04:00
Andrei Dan	9f280621ba	[7.x] ILM add data stream support to searchable snapshot action (#57873 ) (#57916 ) (cherry picked from commit 34856a90532c6c62a53817bb395399c8a8c17c0f) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-06-10 10:16:57 +01:00
Yannick Welsch	80f221e920	Use clean thread context for transport and applier service (#57792 ) (#57914 ) Adds assertions to Netty to make sure that its threads are not polluted by thread contexts (and also that thread contexts are not leaked). Moves the ClusterApplierService to use the system context (same as we do for MasterService), which allows to remove a hack from TemplateUgradeService and makes it clearer that applying CS updates is fully executing under system context.	2020-06-10 10:30:28 +02:00
Hendrik Muhs	95bd7b63b0	[Transform] fix page size return in cat transform, add dps (#57871 ) fixes the page size reported after moving page size to settings(#56007) and adds documents per second(throttling) to the output. fixes #56498	2020-06-10 08:10:25 +02:00
Yang Wang	72a6441a88	Revert "Resolve anonymous roles and deduplicate roles during authentication (#53453 ) (#55995 )" (#57858 ) This reverts commit `84a2f1adf2`.	2020-06-10 10:42:52 +10:00
Jake Landis	a370d5eead	[7.x] Ensure Joni warning are logged at debug (#57302 ) (#57897 ) When Joni, the regex engine that powers grok emits a warning it does so by default to System.err. System.err logs are all bucketed together in the server log at WARN level. When Joni emits a warning, it can be extremely verbose, logging a message for each execution again that pattern. For ingest node that means for every document that is run that through Grok. Fortunately, Joni provides a call back hook to push these warnings to a custom location. This commit implements Joni's callback hook to push the Joni warning to the Elasticsearch server logger (logger.org.elasticsearch.ingest.common.GrokProcessor) at debug level. Generally these warning indicate a possible issue with the regular expression and upon creation of the Grok processor will do a "test run" of the expression and log the result (if any) at WARN level. This WARN level log should only occur on pipeline creation which is a much lower frequency then every document. Additionally, the documentation is updated with instructions for how to set the logger to debug level.	2020-06-09 17:06:29 -05:00
Yannick Welsch	9eec819c5b	Revert "Use clean thread context for transport and applier service (#57792 )" This reverts commit `259be236cf`.	2020-06-09 22:24:54 +02:00
Costin Leau	439205d1ea	EQL: Introduce tie breaker support (#57787 ) Allow a field inside the data to be used as a tie breaker for events that have the same timestamp. The field is optional by default. If used, the tie-breaker always requires a non-null value since it is used inside `search_after` which requires a non-null value. Fix #56824 (cherry picked from commit e5719ecb474b32730d93afdbb6834a32b0b2df8b)	2020-06-09 22:50:19 +03:00
Andrei Dan	3945712c72	[7.x] ILM add data stream support to the Shrink action (#57616 ) (#57884 ) The shrink action creates a shrunken index with the target number of shards. This makes the shrink action data stream aware. If the ILM managed index is part of a data stream the shrink action will make sure to swap the original managed index with the shrunken one as part of the data stream's backing indices and then delete the original index. (cherry picked from commit 99aeed6acf4ae7cbdd97a3bcfe54c5d37ab7a574) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-06-09 19:45:22 +01:00

1 2 3 4 5 ...

4976 Commits