OpenSearch

Commit Graph

Author	SHA1	Message	Date
James Rodewig	e0a8adb5b2	[DOCS] Reformat `stemmer` token filter (#55693 ) Makes the following changes to the `stemmer` token filter docs: * Adds detailed analyze example * Rewrites parameter definitions * Adds custom analyzer example * Adds a `language` value for the `estonian` stemmer * Reorders the `language` values to show recommended algorithms first, followed by other values alphabetically	2020-04-24 11:25:01 -04:00
James Rodewig	96285b90c1	[DOCS] Add stemming concept docs (#55156 ) Adds conceptual documentation for stemming, including: * An overview of why stemming is helpful in search * Algorithmic vs. dictionary stemming * Token filters used to control stemming, such as `stemmer_override`, `keyword_marker`, and `conditional`	2020-04-24 11:01:28 -04:00
Christoph Büscher	f95a741ad3	[Docs] Fix fuzziness example in match-query.asciidoc (#55715 ) The example looks the same as in the previous section although it should use the "fuzziness" parameter. This seems to be okay on 6.8 and master and was probably only forgotten to port to 7.x branches.	2020-04-24 16:21:40 +02:00
Dimitris Athanasiou	210b7f1b76	[7.x][ML] Remove parsing of old progress format in DF Analytics (#55711 ) (#55720 ) Since #55580 we've introduced a new format for parsing progress from the data frame analytics process. As the process is now writing out progress in this new way, we can remove the parsing of the old format. Backport of #55711	2020-04-24 16:50:56 +03:00
David Turner	aa9a2bce37	Avoid accidental contiguous read (#55713 ) If we choose to read from two random positions that are 1024 bytes apart then this counts as a contiguous read for stats purposes, failing this test. This commit ensures that we always perform a non-contiguous read.	2020-04-24 11:50:31 +01:00
Rory Hunter	6d2a5378a0	Rename docker context artifacts to satisfy release-manager (#55692 ) Our release tool expects artifacts to have a certain naming convention. Rename the Docker context artifacts to match this convention.	2020-04-24 10:48:18 +01:00
David Turner	de30550aea	Relax elapsed time stats assertion (#55710 ) `SearchableSnapshotDirectoryStatsTests#testCachedBytesReadsAndWrites` asserts that each write takes one clock tick, but we now permit concurrent reads and writes so each write might take longer. This commit relaxes the assertion to match. Closes #55707	2020-04-24 10:21:08 +01:00
Przemysław Witek	c89917c799	Register DFA jobs on putAnalytics rather than via a separate method (#55458 ) (#55708 )	2020-04-24 10:59:32 +02:00
Dimitris Athanasiou	b8379872a7	[7.x][ML] Logs error when DFA task is set to failed (#55545 ) (#55668 ) Also unmutes the integ test that stops and restarts an outlier detection job with the hope of learning more of the failure in #55068. Backport of #55545 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-24 11:06:07 +03:00
Jim Ferenczi	0a6c74b7d3	AsyncSearchMaintenanceService should stop when closing a node (#55651 ) This change turns the AsyncSearchMaintenanceService into an AbstractLifecycleComponent and ensures that the service is stopped when a node is closing. Closes #55646	2020-04-24 09:38:40 +02:00
Hendrik Muhs	b213209f0c	[Rollup] improve stopping tests (#55666 ) improve tests related to stopping using a client that answers and can be synchronized with the test thread in order to test special situations relates #55011	2020-04-24 08:48:36 +02:00
Jay Modi	30f8c326fe	Test: fix SSLReloadDuringStartupIntegTests (#55637 ) This commit fixes reproducible test failures with the SSLReloadDuringStartupIntegTests on the 7.x branch. The failures only occur on 7.x due to the existence of the transport client and its usage in our test infrastructure. This change removes the randomized usage of transport clients when retrieving a client from a node in the internal cluster. Transport clients do not support the reloading of files for TLS configuration changes but if we build one from the nodes settings and attempt to use it after the files have been changed, the client will not know about the changes and the TLS connection will fail. Closes #55524	2020-04-23 21:36:43 -06:00
Ryan Ernst	97c4b64fb1	Add isAllowed license utility (#55424 ) (#55700 ) License state is currently made up of boolean methods that check whether a particular feature is allowed by the current license state. Each new feature must copy/past boiler plate code. While that has gotten easier with utilities like isAllowedByLicense, this is still more cumbersome than should be necessary. This commit adds a general purpose isAllowed method which takes a new Feature enum, where each value of the enum defines the minimum license mode and whether the license must be active to be allowed. Only security features are converted in this PR, in order to keep the commit size relatively small. The rest of the features will be converted in a followup.	2020-04-23 16:28:28 -07:00
Zachary Tong	715c90bf7d	Aggs must specify a `field` or `script` (or both) (#52226 ) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early)	2020-04-23 19:23:41 -04:00
jimczi	c857adf603	Fix AsyncSearchTaskTests#testWithFetchFailures Fix usage of a possible invalid random range [1, 0]. Relates #55688	2020-04-24 00:45:17 +02:00
Jim Ferenczi	31d1727698	Fix (de)serialization of async search failures (#55688 ) The (de)serialization code of the async search response cannot handle exceptions that extend ElasticsearchException (e.g. ScriptException). This commit fixes this bug by serializing the error with the more generic StreamInput#writeException.	2020-04-24 00:44:43 +02:00
Igor Motov	8c7ef2417f	Make AsyncSearchIndexService reusable (#55598 ) EQL will require very similar functionality to async search. This PR refactors AsyncSearchIndexService to make it reusable for EQL. Supersedes #55119 Relates to #49638	2020-04-23 18:02:17 -04:00
Nick Knize	96a02089c2	Refactor GeoShape DocValues in spatial xpack (#55691 ) This commit refactors geo_shape doc values, fielddata, and utility classes from the single mapper package in x-pack spatial plugin to a package structure that is consistent with the server module.	2020-04-23 15:32:23 -05:00
David Roberts	46be9959a0	[ML] Audit when unassigned datafeeds are stopped (#55667 ) Previously audit messages were indexed when datafeeds that were assigned to a node were stopped, but not datafeeds that were unassigned at the time they were stopped. This change adds auditing for the unassigned case. Backport of #55656	2020-04-23 20:46:35 +01:00
Dan Hermann	dd5c96c2ed	[7.x] Rollover for data streams	2020-04-23 12:04:34 -05:00
Ioannis Kakavas	55485cfa17	Print runtime.java in test failures (#55515 ) (#55624 ) This commit adds `runtime.java` as a system property in our nonInputProperties so that it will be available to be printed upon test failure by ReproduceInfoPrinter.	2020-04-23 18:53:58 +03:00
Zachary Tong	4f483ac370	Fix half-float range in SupportedTypeTests (#55409 ) Also adds a comment to the half-float number field type tests indicating why 70000 is used instead of 65504	2020-04-23 11:36:37 -04:00
James Rodewig	e74fdacabd	[DOCS] Add admonition for EQL exact matches on text fields (#53402 ) (#55670 ) Adds a important admonition to the EQL syntax page noting that the equal (`==`) operator should not be used to match `text` field values. Relates to #52709 and #53020	2020-04-23 10:59:50 -04:00
Armin Braun	dc899781f2	Fix Broken ExistingStoreRecoverySource Deserialization (#55657 ) (#55665 ) We are using `FORCE_STALE_PRIMARY_INSTANCE` in instance equality checks `==` but were creating new instances of `ExistingStoreRecoverySource` when reading from the wire. This could break these checks in corner cases, causing `org.elasticsearch.cluster.routing.allocation.IndexMetadataUpdater#shardStarted` to not remove the force allocation fake id when starting a shard. Closes #55513	2020-04-23 15:56:48 +02:00
Dimitris Athanasiou	4b11adf074	[7.x][ML] Do not fail DFA task that is stopped during reindexing (#55659 ) (#55663 ) While we were catching `TaskCancelledException` while we wait for reindexing to complete, we missed the fact that this exception may be wrapped in a multi-node cluster. This is the reason we may still fail the task when stop is called while reindexing. Some times we're lucky and the exception is thrown by the same node that runs the job. Then the exception is not wrapped and things work fine. But when that is not the case the exception is wrapped, we fail to catch it, and set the task to failed. The fix is to simply unwrap the exception when we check it it is `TaskCancelledException`. Closes #55068 Backport of #55659	2020-04-23 15:57:01 +03:00
Tanguy Leroux	8669766a81	Reduce contention in CacheFile.fileLock() method (#55662 ) The CacheFile.fileLock() method is used to acquire a lock on a cache file so that the file can't be deleted (or its file handle closed) during the execution of a read or a write operation. Today this lock is obtained by first acquiring the eviction lock (the write lock of the readwrite lock), then by checking if the cache file is evicted and the file channel still open, and finally by obtaining the file lock (the read lock of the readwrite lock). Acquiring the read lock while the eviction lock is held ensures that the cache file eviction cannot start in the meanwhile. But eviction starts (and terminations) also acquire the eviction lock; and this lock cannot be obtained while a read lock is held (the write lock of a readwrite lock is exclusive). If we were acquiring a read lock and checking the eviction flag and file channel existence while holding the read lock we know that no eviction can start or finish until the read lock is released.	2020-04-23 14:40:27 +02:00
Christoph Büscher	ed0ced9290	Don't expand default_field in query_string before required (#55158 ) (#55650 ) Currently QueryStringQueryParser already checks if the field limit is breached at construction time, which e.g. leads to errors if the default field is set to "*" or the default isn't used and there are more fields than the limit, even if the query itself does not use all these fields. This change moves this check to happen after query parsing. QueryStringQueryParser now keeps track of the fields that are actually resolved while parsing. The size of that set is later used to check against the limit set by the `indices.query.bool.max_clause_count` setting. Backport of #55158	2020-04-23 13:25:23 +02:00
Armin Braun	033e870d97	Cleanup AbstractSnapshotIntegTest (#55579 ) (#55655 ) * Dry up repository creation in spots where we aren't using any custom settings * `BlobStoreFormatIT` doesn't have to be an `IT` it's just a unit test	2020-04-23 12:38:44 +02:00
István Zoltán Szabó	5813dfdcc7	[7.x][DOCS] Adds ML related items to release highlights (#55652 )	2020-04-23 11:58:32 +02:00
Rory Hunter	d66af46724	Always use deprecateAndMaybeLog for deprecation warnings (#55319 ) Backport of #55115. Replace calls to deprecate(String,Object...) with deprecateAndMaybeLog(...), with an appropriate key, so that all messages can potentially be deduplicated.	2020-04-23 09:20:54 +01:00
David Roberts	87f4751eca	[ML] Make find_file_structure recognize Kibana CSV report timestamps (#55609 ) The Kibana CSV export feature uses a non-standard timestamp format. This change adds it to the formats the find_file_structure endpoint recognizes out-of-the-box, to make round-tripping data from Kibana back to Kibana via CSV files easier. Fixes #55586	2020-04-23 08:39:07 +01:00
Lee Hinman	86129fb6b7	[7.x] Merge V2 index/component template mappings in specific manner (#55607 ) (#55619 ) This commit changes the way that V2 index, component, and request mappings are merged. Specifically: - Fields are merged in a "replacement" manner, meaning that the entire definition is replaced rather than merging the interior configuration - Mapping metadata (all fields outside of `properties`) are merged recursively. The merging for V1 templates does not change. Relates to #53101	2020-04-22 14:33:15 -06:00
Jake Landis	25ea6a74f0	[7.x] Validate REST specs against schema (#55117 ) (#55563 ) A JSON schema was recently introduced for the REST API specification. #54252 This PR introduces a 3rd party validation tool to ensure that the REST specification conforms to the schema. The task is applied to the 3 projects that contain REST API specifications. The plugin wires this task into the precommit commit task, and should be considered as part of the public API for the build tools for any plugin developer to contribute their plugin's specification. An ignore parameter has been introduced for the task to allow specific file to be ignored from the validation. The ignored files in this PR will soon get issues logged and a link so they can be fixed. Closes #54314	2020-04-22 14:14:03 -05:00
Albert Zaharovits	82ed0ab420	Update the audit logfile list of system users (#55578 ) Out of the box "access granted" audit events are not logged for system users. The list of system users was stale and included only the _system and _xpack users. This commit expands this list with _xpack_security and _async_search, effectively reducing the auditing noise by not logging the audit events of these system users out of the box. Closes #37924	2020-04-22 21:59:31 +03:00
Tal Levy	c370b83bd7	Fix locale lowercase test issue in GenerateSnapshotNameStepTests (#55597 ) (#55605 ) The testPerformAction test has been failing periodically due to how Hamcrest's containsStringIgnoringCase does not lowercase using the same Locale set in the test infrastructure. This commit falls back to explicitly lowercasing using the root locale	2020-04-22 11:29:57 -07:00
Tal Levy	f27ce69f0c	[backport] Add geo_bounds aggregation support for geo_shape (#55328 ) (#55600 ) This commit adds a new GeoShapeBoundsAggregator to the spatial plugin and registers it with the GeoShapeValuesSourceType. This enables geo_bounds aggregations on geo_shape fields	2020-04-22 11:29:35 -07:00
Lisa Cawley	314ca78e31	[7.x][DOCS] Update example and nesting in get data frame analytics job stats API (#55612 )	2020-04-22 10:58:26 -07:00
Przemko Robakowski	6e1b958069	Fix updating Index Templates V2 (#55556 ) (#55610 ) This change fixes problem with updating Index Templates V2. Validatation added in #54933 didn't filter list of conflicting templates correctly so new template was always clashing with itself unless patterns were not changed completely.	2020-04-22 19:42:32 +02:00
Stuart Tettemer	41748f02a5	Test: don't modify defaultConfig on upgrade (#55560 ) (#55599 ) Backport: 58ec9c3	2020-04-22 11:07:27 -06:00
Igor Motov	3504755f44	Add InstantiatingObjectParser (#55483 ) (#55604 ) Introduces InstantiatingObjectParser which is similar to the ConstructingObjectParser, but instantiates the object using its constructor instead of a builder function. Closes #52499	2020-04-22 12:28:52 -04:00
Tal Levy	0844455505	Add geo_shape mapper supporting doc-values in Spatial Plugin (#55037 ) (#55500 ) After #53562, the `geo_shape` field mapper is registered within a module. This opens the door for introducing a new `geo_shape` field mapper into the Spatial Plugin that has doc-values support. This is very much an extension of server's GeoShapeFieldMapper, but with the addition of the doc values implementation.	2020-04-22 08:12:54 -07:00
James Rodewig	8d05d7dace	[DOCS] Add collapsible sections to 7.x breaking changes (#55334 ) Adds collapsible sections and new format to the 7.x breaking changes. Relates to #53229.	2020-04-22 10:56:38 -04:00
James Rodewig	6f9513915d	[DOCS] Add 'how to' doc about avoiding oversharding (#55480 ) Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>	2020-04-22 10:44:16 -04:00
James Rodewig	414f9c98f3	[DOCS] Document missing bulk API response parameters (#55414 ) Documents several parameters missing from the bulk API's response body docs. Also moves several response-related chunks of text to the response body section. Relates to #55237	2020-04-22 09:48:03 -04:00
Dimitris Athanasiou	50a5afed15	[7.x][ML] Prepare parsing phase_progress from DFA process (#55580 ) (#55587 ) Data frame analytics process currently reports progress as an integer `progress_percent`. We parse that and report it from the _stats API as the progress of the `analyzing` phase. However, we want to allow the DFA process to report progress for more than one phase. This commit prepares for this by parsing `phase_progress` from the process, an object that contains the `phase` name plus the `progress_percent` for that phase. Backport of #55580	2020-04-22 16:38:32 +03:00
Benjamin Trent	7c81cd7833	[ML] explicitly disallow partial results in datafeed extractors (#55537 ) (#55585 ) Instead of doing our own checks against REST status, shard counts, and shard failures, this commit changes all our extractor search requests to set `.setAllowPartialSearchResults(false)`. - Scrolls are automatically cleared when a search failure occurs with `.setAllowPartialSearchResults(false)` set. - Code error handling is simplified closes https://github.com/elastic/elasticsearch/issues/40793	2020-04-22 09:07:44 -04:00
David Roberts	810caf5ffe	[ML] Test that audit message is written when closing unassigned job (#55582 ) Issue #55521 suggested that audit messages were not written when closing an unassigned job. This is not the case, but we didn't have a test to prove it. Backport of #55571	2020-04-22 13:23:43 +01:00
Armin Braun	250a51bca1	Fix TransportAddVotingConfigExclusionsActionTests Leaking CS Observers (#55549 ) (#55584 ) There is no guarantee the observer and subsequent CS update will execute before we move on to the next test here and we ahve to wait for the observer + CS update cycle to complete before moving on to the next test. closes #55481	2020-04-22 13:49:24 +02:00
David Roberts	2dc5586afe	[ML] Add effective max model memory limit to ML info (#55581 ) The ML info endpoint returns the max_model_memory_limit setting if one is configured. However, it is still possible to create a job that cannot run anywhere in the current cluster because no node in the cluster has enough memory to accommodate it. This change adds an extra piece of information, limits.effective_max_model_memory_limit, to the ML info response that returns the biggest model memory limit that could be run in the current cluster assuming no other jobs were running. The idea is that the ML UI will be able to warn users who try to create jobs with higher model memory limits that their jobs will not be able to start unless they add a bigger ML node to their cluster. Backport of #55529	2020-04-22 12:28:50 +01:00
David Roberts	da5aeb8be7	[ML] Return assigned node in start/open job/datafeed response (#55570 ) Adds a "node" field to the response from the following endpoints: 1. Open anomaly detection job 2. Start datafeed 3. Start data frame analytics job If the job or datafeed is assigned to a node immediately then this field will return the ID of that node. In the case where a job or datafeed is opened or started lazily the node field will contain an empty string. Clients that want to test whether a job or datafeed was opened or started lazily can therefore check for this. Backport of #55473	2020-04-22 12:06:53 +01:00

1 2 3 4 5 ...

51381 Commits All Branches Search

51381 Commits

All Branches