OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-19 19:35:02 +00:00

Author	SHA1	Message	Date
Jack Conradson	a780ec14f0	Painless: Upgrade ASM to 7.2 (#49263 ) This upgrades Painless to use the latest ASM libraries providing support up to Java 14. Note the library is not published with the latest versions in an "all" package, so we pick up each lib independently that's required. There were some changes to the getType method that require descriptors to be used in place of internal class names.	2019-11-20 07:09:47 -08:00
Jim Ferenczi	81548df2d9	Disable caching when queries are profiled (#48195 ) This change disables the query and request cache when profile is set to true in the request. This means that profiled queries will not check caches to execute the query and the result will never be added in the cache either. Closes #33298	2019-11-20 16:02:59 +01:00
Armin Braun	1cde4a6364	Make SnapshotsService#getRepositoryData Async (#49322 ) (#49358 ) * Make SnapshotsService#getRepositoryData Async (#49322) Follow up to #49299 removing the blocking step for the snapshot status APIs as well.	2019-11-20 15:22:10 +01:00
David Roberts	20558cf61c	[ML] Fix simultaneous stop and force stop datafeed (#49367 ) If a datafeed is stopped normally and force stopped at the same time then it is possible that the force stop removes the persistent task while the normal stop is performing actions. Currently this causes the normal stop to error, but since stopping a stopped datafeed is not an error this doesn't make sense. Instead the force stop should just take precedence. This is a followup to #49191 and should really have been included in the changes in that PR.	2019-11-20 12:52:47 +00:00
Mayya Sharipova	e3da60c23d	Increase the number of vector dims to 2048 (#46895 )	2019-11-20 07:47:33 -05:00
Tanguy Leroux	6bad28a835	Mute AzureBlobStoreRepositoryTests (#49364 ) Relates #48978	2019-11-20 11:16:16 +01:00
Christoph Büscher	4ffa050735	Allow custom characters in token_chars of ngram tokenizers (#49250 ) Currently the `token_chars` setting in both `edgeNGram` and `ngram` tokenizers only allows for a list of predefined character classes, which might not fit every use case. For example, including underscore "_" in a token would currently require the `punctuation` class which comes with a lot of other characters. This change adds an additional "custom" option to the `token_chars` setting, which requires an additional `custom_token_chars` setting to be present and which will be interpreted as a set of characters to include in a token. Closes #25894	2019-11-20 10:37:12 +01:00
Alan Woodward	c6b31162ba	Refactor percolator's QueryAnalyzer to use QueryVisitors Lucene now allows us to explore the structure of a query using QueryVisitors, delegating the knowledge of how to recurse through and collect terms to the query implementations themselves. The percolator currently has a home-grown external version of this API to construct sets of matching terms that must be present in a document in order for it to possibly match the query. This commit removes the home-grown implementation in favour of one using QueryVisitor. This has the added benefit of making interval queries available for percolator pre-filtering. Due to a bug in multi-term intervals (LUCENE-9050) it also includes a clone of some of the lucene intervals logic, that can be removed once upstream has been fixed. Closes #45639	2019-11-20 09:21:01 +00:00
Przemysław Witek	9c0ec7ce23	[7.x] Make AnalyticsProcessManager class more robust (#49282 ) (#49356 )	2019-11-20 10:08:16 +01:00
Tanguy Leroux	f753fa2265	HttpHandlers should return correct list of objects (#49283 ) This commit fixes the server side logic of "List Objects" operations of Azure and S3 fixtures. Until today, the fixtures were returning a "flat" view of stored objects and were not correctly handling the delimiter parameter. This causes some object listings to be wrongly interpreted by the snapshot deletion logic in Elasticsearch which relies on the ability to list child containers of BlobContainer (#42653) to correctly delete stale indices. As a consequence, the blobs were not correctly deleted from the emulated storage service and stayed in heap until they got garbage collected, causing CI failures like #48978. This commit fixes the server side logic of the Azure and S3 fixtures when listing objects so that it now returns correct common blob prefixes as expected by the snapshot deletion process. It also adds an after-test check to ensure that tests leave the repository empty (besides the root index files). Closes #48978	2019-11-20 09:26:42 +01:00
Dimitris Athanasiou	4d6e037e90	[7.x][ML] Extract creation of DFA field extractor into a factory (#49315 ) (#49329 ) This commit moves the async calls required to retrieve the components that make up `ExtractedFieldsExtractor` out of `DataFrameDataExtractorFactory` and into a dedicated `ExtractorFieldsExtractorFactory` class. A few more refactorings are performed: - The detector no longer needs the results field. Instead, it knows whether to use it or not based on whether the task is restarting. - We pass more accurately whether the task is restarting or not. - The validation of whether fields that have a cardinality limit are valid is now performed in the detector after retrieving the respective cardinalities. Backport of #49315	2019-11-20 10:02:42 +02:00
Dimitris Athanasiou	543f5f4faf	[7.x][ML][HLRC] Add FAILED state for data frame analytics (#49326 ) (#49327 ) Backport of #49326	2019-11-20 09:58:13 +02:00
Mathew Davis	92a1faf545	Fixing a typo in the stop SLM api request header.	2019-11-19 23:06:49 -07:00
Lisa Cawley	2b9fb7ebe2	[DOCS] Merges security overview pages (#49342 )	2019-11-19 16:19:02 -08:00
Przemysław Witek	42bb8ae525	[7.x] Extract indexData method out of RegressionIT tests (#49306 ) (#49313 )	2019-11-19 22:47:12 +01:00
Mark Tozzi	17358b5af7	(refactor) Extract Empty/Script/Missing ValuesSource behavior to an interface (#48320 ) (#49330 ) This is a pure code rearrangement refactor. Logic for what specific ValuesSource instance to use for a given type (e.g. script or field) moved out of ValuesSourceConfig and into CoreValuesSourceType (previously just ValueSourceType; we extract an interface for future extensibility). ValueSourceConfig still selects which case to use, and then the ValuesSourceType instance knows how to construct the ValuesSource for that case.	2019-11-19 16:44:29 -05:00
Benjamin Trent	d068818b16	[ML][Inference] document new settings (#49309 ) (#49336 ) * [ML][Inference] document new settings * [DOCS] Minor edits	2019-11-19 16:43:19 -05:00
James Rodewig	62a3154d0e	[DOCS] [7.x] Add high-level docs for enrich processor and policies (#49194 ) (#49331 )	2019-11-19 16:38:13 -05:00
Lisa Cawley	75f1f612c2	[DOCS] Merges duplicate pages for Active Directory realms (#49205 )	2019-11-19 13:18:01 -08:00
Jay Modi	eed4cd25eb	ThreadPool and ThreadContext are not closeable (#43249 ) (#49273 ) This commit changes the ThreadContext to just use a regular ThreadLocal over the lucene CloseableThreadLocal. The CloseableThreadLocal solves issues with ThreadLocals that are no longer needed during runtime but in the case of the ThreadContext, we need it for the runtime of the node and it is typically not closed until the node closes, so we miss out on the benefits that this class provides. Additionally by removing the close logic, we simplify code in other places that deal with exceptions and tracking to see if it happens when the node is closing. Closes #42577	2019-11-19 13:15:16 -07:00
Lisa Cawley	c4c8a7a43c	[DOCS] Merges duplicate pages for PKI realms (#49206 )	2019-11-19 10:51:09 -08:00
Ryan Ernst	c6a8913c38	Fix java home validation usage by tasks (#49204 ) Tasks intending to use a particular java home provided by JAVA<N>_HOME use the getJavaHome method, which verifies the given java home is available, or will be if the task will run. However, the verification logic was broken, in addition to unnecessarily delaying retrieving the java home until runtime. This commit fixes the verification logic to run at either config time, delaying verification, or at runtime which immediately checks if java home is available. closes #49153	2019-11-19 10:30:19 -08:00
Jack Conradson	14d2e795ae	make dim files mmapped (#49272 ) This change mmaps dim files in HybridDirectory to take advantage of off- heap BKD trees. This is based off of (#48509) via (https://issues.apache.org/jira/browse/LUCENE-8932).	2019-11-19 10:22:30 -08:00
Lisa Cawley	2f5acae4a9	[DOCS] Groups pages related to encrypting communications (#49324 )	2019-11-19 10:10:39 -08:00
Lisa Cawley	62bbe419d3	[DOCS] Removes Beats security page (#49276 )	2019-11-19 09:15:30 -08:00
Andrei Dan	19780e20ba	Handle failure to retrieve ILM policy step better (#49193 ) (#49316 ) This commit wraps the calls to retrieve the current step in a try/catch so that the exception does not bubble up. Instead, step info is added containing the exception to the existing step. Semi-related to #49128 (cherry picked from commit 72530f8a7f40ae1fca3704effb38cf92daf29057) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2019-11-19 17:14:46 +00:00
Lisa Cawley	97cdfd2848	[DOCS] Clarify ML job closure prerequisites (#49265 )	2019-11-19 08:36:50 -08:00
James Rodewig	a26916cc23	[DOCS] Reformat elision token filter docs (#49262 )	2019-11-19 10:55:22 -05:00
James Rodewig	8639ddab5e	[DOCS] Reformat fingerprint token filter docs (#49311 )	2019-11-19 10:55:21 -05:00
Armin Braun	0acba44a2e	Make Repository.getRepositoryData an Async API (#49299 ) (#49312 ) This API call in most implementations is fairly IO heavy and slow so it is more natural to be async in the first place. Concretely though, this change is a prerequisite of #49060 since determining the repository generation from the cluster state introduces situations where this call would have to wait for other operations to finish. Doing so in a blocking manner would break `SnapshotResiliencyTests` and waste a thread. Also, this sets up the possibility to in the future make use of async IO where provided by the underlying Repository implementation. In a follow-up `SnapshotsService#getRepositoryData` will be made async as well (did not do it here, since it's another huge change to do so). Note: This change for now does not alter the threading behaviour in any way (since `Repository#getRepositoryData` isn't forking) and is purely mechanical.	2019-11-19 16:49:12 +01:00
jimczi	cb5169ae37	update release notes for 7.5.0 after respin	2019-11-19 16:24:04 +01:00
Henning Andersen	bc29c9877a	Reindex search response fix (#49301 ) Fixed test case to also accept another error message, now that reindex does not allow searching against red shards. Closes #49295	2019-11-19 14:38:05 +01:00
Marios Trivyzas	fd1bb4a33a	SQL: Fix issue with mins & hours for DATEDIFF (#49252 ) Previously, DATEDIFF for minutes and hours was doing a rounding calculation using all the time fields (secs, msecs/micros/nanos). Instead it should first truncate the 2 dates to the respective field (mins or hours) zeroing out all the more detailed time fields and then make the subtraction. (cherry picked from commit 124cd18e20429e19d52fd8dc383827ea5132d428)	2019-11-19 14:25:28 +01:00
Benjamin Trent	19602fd573	[ML][Inference] changing setting to be memorySizeSettting (#49259 ) (#49302 )	2019-11-19 07:56:40 -05:00
Przemysław Witek	38aec2e298	Relax assertions related to datafeed timing stats in .yml test (#49285 ) (#49291 )	2019-11-19 12:50:14 +01:00
Tanguy Leroux	abed869ec6	Mute ReindexFailureTests.testResponseOnSearchFailure (#49298 ) Relates #49295	2019-11-19 12:38:54 +01:00
David Roberts	a5204c1c80	[ML] Fixes for stop datafeed edge cases (#49284 ) The following edge cases were fixed: 1. A request to force-stop a stopping datafeed is no longer ignored. Force-stop is an important recovery mechanism if normal stop doesn't work for some reason, and needs to operate on a datafeed in any state other than stopped. 2. If the node that a datafeed is running on is removed from the cluster during a normal stop then the stop request is retried (and will likely succeed on this retry by simply cancelling the persistent task for the affected datafeed). 3. If there are multiple simultaneous force-stop requests for the same datafeed we no longer fail the one that is processed second. The previous behaviour was wrong as stopping a stopped datafeed is not an error, so stopping a datafeed twice simultaneously should not be either. Backport of #49191	2019-11-19 10:51:46 +00:00
Armin Braun	9c00648314	Make Snapshot Delete Concurrency Exception Consistent (#49266 ) (#49281 ) We shouldn't be throwing `RepositoryException` when the repository wasn't concurrently modified in an unexpected fashion (i.e. on the blob/file level). When we know that the known repo gen moved higher in terms of the generation tracked in master memory we should throw the concurrent snapshot exception. This change makes concurrent snapshot create and delete always throw the same exception, prevents unnecessary listings when the generation is known to be off and prevents future test failures in SLM tests that assume the concurrent snapshot exception is always thrown here. Without this change, the newly added test randomly fails the `instanceOf` assertion by running into a `RepositoryException`.	2019-11-19 09:50:52 +01:00
Lisa Cawley	abd4a70b10	[DOCS] Merges duplicate pages for Kerberos realms (#49207 )	2019-11-18 15:23:06 -08:00
Lisa Cawley	b4f82c9cdb	[DOCS] Merges duplicate pages for LDAP realms (#49203 )	2019-11-18 14:09:24 -08:00
Julie Tibshirani	81a9d98a47	Remove the 'experimental' marking from vector fields. (#49120 ) We wrapped up the API changes we wanted to make, and vector fields can now be considered GA.	2019-11-18 12:42:46 -08:00
Julie Tibshirani	a0ee6c8f7e	Add telemetry for flattened fields. (#48972 ) (#49125 ) Currently we just record the number of flattened fields defined in the mappings.	2019-11-18 12:29:42 -08:00
Henning Andersen	2ac38fd315	Reindex and friends fail on RED shards (#45830 ) Reindex, update by query and delete by query would silently disregard RED/unavailable shards, thus not copying, updating or deleting matching data in those shards. Now use `allow_partial_search_results=false` to ensure these operations fail if the search crosses an unavailable shard. Added the option to explicitly specify `allow_partial_search_results=true` for reindex only (seemed too strange for update/delete by query). Relates #45739 and #42612	2019-11-18 21:23:08 +01:00
Lisa Cawley	b0054eecd6	[DOCS] Merges duplicate pages for file realms (#49200 )	2019-11-18 12:02:18 -08:00
Benjamin Trent	eefe7688ce	[7.x][ML] ML Model Inference Ingest Processor (#49052 ) (#49257 ) * [ML] ML Model Inference Ingest Processor (#49052) * [ML][Inference] adds lazy model loader and inference (#47410) This adds a couple of things: - A model loader service that is accessible via transport calls. This service will load in models and cache them. They will stay loaded until a processor no longer references them - A Model class and its first sub-class LocalModel. Used to cache model information and run inference. - Transport action and handler for requests to infer against a local model Related Feature PRs: * [ML][Inference] Adjust inference configuration option API (#47812) * [ML][Inference] adds logistic_regression output aggregator (#48075) * [ML][Inference] Adding read/del trained models (#47882) * [ML][Inference] Adding inference ingest processor (#47859) * [ML][Inference] fixing classification inference for ensemble (#48463) * [ML][Inference] Adding model memory estimations (#48323) * [ML][Inference] adding more options to inference processor (#48545) * [ML][Inference] handle string values better in feature extraction (#48584) * [ML][Inference] Adding _stats endpoint for inference (#48492) * [ML][Inference] add inference processors and trained models to usage (#47869) * [ML][Inference] add new flag for optionally including model definition (#48718) * [ML][Inference] adding license checks (#49056) * [ML][Inference] Adding memory and compute estimates to inference (#48955) * fixing version of indexed docs for model inference	2019-11-18 13:19:17 -05:00
Lisa Cawley	48f53efd9a	[DOCS] Merges duplicate pages for SAML realms (#49209 )	2019-11-18 10:09:29 -08:00
Lisa Cawley	b0b5fcc4f6	[DOCS] Removes closed security PRs from release notes (#49256 )	2019-11-18 09:19:11 -08:00
gpaimla	7d20b50f45	Implement Lucene EstonianAnalyzer, Stemmer (#49149 ) This PR adds a new analyzer and stemmer for the Estonian language. Closes #48895	2019-11-18 17:24:21 +01:00
Armin Braun	25cc8e3663	Fix RepoCleanup not Removed on Master-Failover (#49217 ) (#49239 ) The logic for `cleanupInProgress()` was backwards everywhere (method itself and all but one user). Also, we weren't checking it when removing a repository. This led to a bug (in the one spot that didn't use the method backwards) that prevented the cleanup cluster state entry from ever being removed from the cluster state if master failed over during the cleanup process. This change corrects the backwards logic, adds a test that makes sure the cleanup is always removed and adds a check that prevents repository removal during cleanup to the repositories service. Also, the failure handling logic in the cleanup action was broken. Repeated invocation would lead to the cleanup being removed from the cluster state even if it was in progress. Fixed by adding a flag that indicates whether or not any removal of the cleanup task from the cluster state must be executed. Sorry for mixing this in here, but I had to fix it in the same PR, as the first test (for master-failover) otherwise would often just delete the blocked cleanup action as a result of a transport master action retry.	2019-11-18 16:44:09 +01:00
Armin Braun	e4f6eaeaf5	Fix Runtime Java Path for OSX in Gradle (#49245 ) (#49246 ) On OSX the `bin` directory is nested under `Contents/Home` relative to where it is for the other platforms.	2019-11-18 16:43:47 +01:00
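
Illustration for commit fd1bb4a33a (SQL: Fix issue with mins & hours for DATEDIFF): the message says both dates should first be truncated to the requested unit and only then subtracted. The following is a minimal Python sketch of that rule, not the actual Elasticsearch SQL implementation. The function names `datediff_minutes` and `datediff_hours` are hypothetical and chosen only for this example.

```python
from datetime import datetime, timedelta


def datediff_minutes(start: datetime, end: datetime) -> int:
    """Difference in minutes after zeroing seconds and sub-second fields."""
    # Truncate both timestamps to the minute before subtracting.
    s = start.replace(second=0, microsecond=0)
    e = end.replace(second=0, microsecond=0)
    return (e - s) // timedelta(minutes=1)


def datediff_hours(start: datetime, end: datetime) -> int:
    """Difference in hours after zeroing minutes, seconds and sub-second fields."""
    # Truncate both timestamps to the hour before subtracting.
    s = start.replace(minute=0, second=0, microsecond=0)
    e = end.replace(minute=0, second=0, microsecond=0)
    return (e - s) // timedelta(hours=1)
```

Under this rule, two timestamps one second apart that straddle a minute boundary differ by one minute, while two timestamps 59 seconds apart within the same minute differ by zero.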

48942 Commits