OpenSearch

Commit Graph

Author	SHA1	Message	Date
Benjamin Trent	4275a715c9	[ML] adjusting inference processor to support foreach usage (#60915 ) (#61022 ) `foreach` processors store information within the `_ingest` metadata object. This commit adds the contents of the `_ingest` metadata (if it is not empty). And will append new inference results if the result field already exists. This allows a `foreach` to execute and multiple inference results being written to the same result field. closes https://github.com/elastic/elasticsearch/issues/60867	2020-08-12 08:34:18 -04:00
Alan Woodward	c81dc2b8b7	Convert KeywordFieldMapper to parametrized form (#60645 ) This makes KeywordFieldMapper extend ParametrizedFieldMapper, with explicitly defined parameters. In addition, we add a new option to Parameter, restrictedStringParam, which accepts a restricted set of string options.	2020-08-12 11:41:11 +01:00
Armin Braun	3a046e125d	Speed up MockSinglePrioritizingExecutor (#61011 ) (#61012 ) Found this while checking if I can speed up SnapshotResiliencyTests to get more coverage/time. Turns out throwing a new instance here on every task was taking 9% of the CPU wall-time in those tests. With this change it's 4% of the overall.	2020-08-12 12:24:04 +02:00
markharwood	66098e0bf4	Search fix: query_string regex/wildcard searches not working on wildcard fields (#60959 ) (#61010 ) The Query string parser was not delegating the construction of wildcard/regex queries to the underlying field type. The wildcard field has special data structures and queries that operate on them so cannot rely on the basic regex/wildcard queries that were being used for other fields. Closes #60957	2020-08-12 10:44:52 +01:00
Armin Braun	32423a486d	Simplify and Speed up some Compression Usage (#60953 ) (#61008 ) Use thread-local buffers and deflater and inflater instances to speed up compressing and decompressing from in-memory bytes. Not manually invoking `end()` on these should be safe since their off-heap memory will eventually be reclaimed by the finalizer thread which should not be an issue for thread-locals that are not instantiated at a high frequency. This significantly reduces the amount of byte copying and object creation relative to the previous approach which had to create a fresh temporary buffer (that was then resized multiple times during operations), copied bytes out of that buffer to a freshly allocated `byte[]`, used 4k stream buffers needlessly when working with bytes that are already in arrays (`writeTo` handles efficient writing to the compression logic now) etc. Relates #57284 which should be helped by this change to some degree. Also, I expect this change to speed up mapping/template updates a little as those make heavy use of these code paths.	2020-08-12 11:06:23 +02:00
Andrei Dan	35423a75af	Tests: don't fail if ILM executed the action already (#60916 ) (#60982 ) (cherry picked from commit 8c970ad20f4f55a9c0d6a256aa643ea037281e75) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-12 09:04:04 +01:00
Dimitris Athanasiou	2e18c0f2ac	[7.x][ML] Audit force stopping data frame analytics (#60973 ) (#61004 ) Audits a message when a data frame analytics job is force stopped. Backport of #60973	2020-08-12 07:45:26 +03:00
Yang Wang	c7b0290256	Mute kerberos tests for jdk 8u[262,271) (#60995 ) The Kerberos bug (JDK-8246193) is introduced in JDK 8u262 and fixed in 8u271. This PR mute for any possible releases between these two versions.	2020-08-12 11:50:48 +10:00
Nik Everett	664ba0a80a	Fix the parent join aggregator test case (#60991 ) The test was putting parent and child documents into different segments which is unrealistic and was causing errors. Closes #60980	2020-08-11 17:53:15 -04:00
Nik Everett	ce9c5f0e46	Fix diversified sample tests The test assumed that the aggregator only ran once but we turned that off. This turns it back on.	2020-08-11 17:49:43 -04:00
Nhat Nguyen	ceaa28e97b	Increase timeout in testFollowIndexWithConcurrentMappingChanges (#60534 ) The test failed because the leader was taking a lot of CPUs to process many mapping updates. This commit reduces the mapping updates, increases timeout, and adds more debug info. Closes #59832	2020-08-11 17:03:22 -04:00
Nhat Nguyen	bf7eecf1dc	Fix synchronization in ShardFollowNodeTask (#60490 ) The leader mapping, settings, and aliases versions in a shard follow-task are updated without proper synchronization and can go backward.	2020-08-11 14:52:52 -04:00
Jay Modi	2fa6448a15	System index reads in separate threadpool (#60927 ) This commit introduces a new thread pool, `system_read`, which is intended for use by system indices for all read operations (get and search). The `system_read` pool is a fixed thread pool with a maximum number of threads equal to lesser of half of the available processors or 5. Given the combination of both get and read operations in this thread pool, the queue size has been set to 2000. The motivation for this change is to allow system read operations to be serviced in spite of the number of user searches. In order to avoid a significant performance hit due to pattern matching on all search requests, a new metadata flag is added to mark indices as system or non-system. Previously created system indices will have flag added to their metadata upon upgrade to a version with this capability. Additionally, this change also introduces a new class, `SystemIndices`, which encapsulates logic around system indices. Currently, the class provides a method to check if an index is a system index and a method to find a matching index descriptor given the name of an index. Relates #50251 Relates #37867 Backport of #57936	2020-08-11 12:16:34 -06:00
Julie Tibshirani	a93be8d577	Handle nested arrays in field retrieval. (#60981 ) We accept _source values with multiple levels of arrays, such as `"field": [[[1, 2]]]`. This PR ensures that field retrieval can handle nested arrays by unwrapping the arrays before parsing the values.	2020-08-11 10:22:16 -07:00
James Rodewig	929f1cc9f9	[DOCS] Remove search request body page (#60972 ) (#60977 )	2020-08-11 13:04:07 -04:00
Nhat Nguyen	4bdf283619	Mute ChildrenToParentAggregatorTests Tracked at #60980	2020-08-11 12:56:29 -04:00
James Rodewig	7d4117426a	[DOCS] Remove unneeded word in EQL docs	2020-08-11 12:19:08 -04:00
James Rodewig	c0fa582df4	[DOCS] Make EQL example snippets more realistic (#60971 ) (#60974 )	2020-08-11 12:01:31 -04:00
James Rodewig	a1100bb770	[DOCS] Add CBOR example to ingest attachment docs (#60919 ) (#60964 )	2020-08-11 10:28:22 -04:00
Francisco Fernández Castaño	d544528c7b	Increase information on assertRecoveryStats assertion (#60960 ) Backport of #60952	2020-08-11 15:30:59 +02:00
Dimitris Athanasiou	6062672148	[7.x][ML] Monitor reindex response in DF analytics (#60911 ) (#60958 ) Examines the reindex response in order to report potential problems that occurred during the reindexing phase of data frame analytics jobs. Backport of #60911	2020-08-11 16:17:37 +03:00
Mark Tozzi	ab8518fb5b	[7.x] Extensibility for Composite Agg #59648 (#60842 )	2020-08-11 09:14:33 -04:00
Dan Hermann	839c6cdfc0	Un-mute data stream REST test (#60120 ) (#60939 )	2020-08-11 08:10:04 -05:00
James Rodewig	4aae278d1d	[DOCS] Move post filter/rescore content to new page (#60903 ) (#60961 )	2020-08-11 09:06:59 -04:00
David Kyle	18a65c5b9a	DFA Get Stats can return multiple responses if more than one error occurs (#60950 ) If the search for get stats with multiple job Ids fails the listener is called for each failure. This change waits for all responses then returns the first error if there was one.	2020-08-11 10:28:05 +01:00
Rene Groeschke	a5ef38ca40	Update gradle wrapper to 6.6 (#59909 ) (#60949 )	2020-08-11 11:03:19 +02:00
Henning Andersen	a0b54b53fc	Rest high level ReindexIT fix (#60834 ) ReindexIT would rethrottle any delete or update by query task, fixed to more precisely match the task started by the test. Closes #60811	2020-08-11 10:35:15 +02:00
Alan Woodward	54279212cf	Make MetadataFieldMapper extend ParametrizedFieldMapper (#59847 ) (#60924 ) This commit cuts over all metadata field mappers to parametrized format.	2020-08-11 09:02:28 +01:00
Armin Braun	3e2dfc6eac	Remove GCS Bucket Exists Check (#60899 ) (#60914 ) Same as https://github.com/elastic/elasticsearch/pull/43288 for GCS. We don't need to do the bucket exists check before using the repo, that just needlessly increases the necessary permissions for using the GCS repository.	2020-08-11 09:54:27 +02:00
Julie Tibshirani	d51eae6e9f	Prevent loading 'fields' with stored fields disabled. (#60938 ) Because the 'fields' option loads from _source (which is a stored field), it is not possible to retrieve 'fields' when stored_fields are disabled. This also fixes #60912, where setting stored_fields: _none_ prevented the _ignored fields from being loaded and caused a parsing exception.	2020-08-10 15:40:27 -07:00
debadair	063518ca2b	[DOCS] Mention that inline scripts need to be enabled for Kibana (#60633 ) (#60798 )	2020-08-10 13:28:59 -07:00
Nik Everett	0286d0a769	Move distance_feature query building into MFT (#60614 ) (#60846 ) This moves the `distance_feature` query building out of `DistanceFeatureQueryBuilder` and into subclasses of `MappedFieldType`. Without this we don't have a chance of supporting this for runtime fields. In general I'm not sad to see the `instanceof`s go. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-08-10 16:05:17 -04:00
James Rodewig	1b2a015734	[DOCS] Cross-link `copy_to` and search speed docs (#60926 ) (#60928 )	2020-08-10 15:35:10 -04:00
Julie Tibshirani	b216340f50	Make `FetchPhase` logic more readable. (#60779 ) * Factor out FieldsVisitor#postProcess call. * Swap logical order for normal and nested documents. * Extract the method createStoredFieldsVisitor.	2020-08-10 11:04:54 -07:00
James Rodewig	877ecd5b66	[DOCS] Add PUT example to `Date math in index names` (#60908 ) (#60920 ) Previously, all examples in this section were GET requests. This demonstrates that other CRUD operations are also supported.	2020-08-10 12:46:10 -04:00
Nik Everett	dfd502f9ca	Rework checking if a year is a leap year (#60585 ) (#60790 ) This way is faster, saving about 8% on the microbenchmark that rounds to the nearest month. That is in the hot path for `date_histogram` which is a very popular aggregation so it seems worth it to at least try and speed it up a little. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-08-10 12:45:34 -04:00
Benjamin Trent	66b3e89482	[ML] enable logging for test failures (#60902 ) (#60910 )	2020-08-10 12:36:30 -04:00
Francisco Fernández Castaño	2a4fd8329b	Avoid a race condition while waiting for pre warm to finish on SearchableSnapshotDirectoryTests (#60906 ) Backport of #60885. Closes #60813	2020-08-10 17:29:16 +02:00
Jim Ferenczi	f30f1f04e2	Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce (#60816 ) This commit removes the ability to test the top level result of an aggregator before it runs the final reduce. All aggregator tests that use AggregatorTestCase#search are rewritten with AggregatorTestCase#searchAndReduce in order to ensure that we test the final output (the one sent to the end user) rather than an intermediary result that could be different. This change also removes spurious commits triggered on top of a random index writer. These commits slow down the tests and are redundant with the commits that the random index writer performs.	2020-08-10 17:23:00 +02:00
David Roberts	dd02e9f31a	[TEST] Mute SearchableSnapshotActionIT testSearchableSnapshotForceMergesIndexToOneSegment (#60904 ) Due to https://github.com/elastic/elasticsearch/issues/60901	2020-08-10 15:25:39 +01:00
James Rodewig	739097a56c	[DOCS] Move `min_score` docs to search API page (#60895 ) (#60896 ) Reformats the `min_score` docs as a param definition on the search API reference page.	2020-08-10 09:43:07 -04:00
Henning Andersen	a155315ceb	Autoscaling decider and decision service (#59005 ) (#60884 ) Split the autoscaling decider into a service and configuration in order to enable having additional context information available in the service. Added AutoscalingDeciderContext holding generic information all deciders are expected to need. Implemented GET _autoscaling/decision	2020-08-10 15:28:52 +02:00
James Rodewig	8a0f1d8746	[DOCS] Combine highlighting docs files (#60849 ) (#60892 )	2020-08-10 09:05:49 -04:00
Dan Hermann	192dc9dd3d	[DOCS] Update get data stream API (#60862 )	2020-08-10 08:03:17 -05:00
David Turner	f44c28b595	Deprecate and ignore join timeout (#60872 ) There is no point in timing out a join attempt any more once a cluster is entirely in 7.x. Timing out and retrying with the same master is pointless, and an in-flight join attempt to one master no longer blocks attempts to join other masters. This commit deprecates this unnecessary setting and removes its effect from the joining process. Relates #60873 which removes this setting in master.	2020-08-10 13:57:41 +01:00
Andrei Dan	235e5ed3ea	[7.x] ILM: add force-merge step to searchable snapshots action (#60819 ) (#60882 ) This adds a force-merge step to the searchable snapshot action, enabled by default, but parameterizable using the `force_merge-index" optional boolean. eg. ``` PUT _ilm/policy/my_policy { "policy": { "phases": { "cold": { "actions": { "searchable_snapshot" : { "snapshot_repository" : "backing_repo", "force_merge_index": true } } } } } } ``` (cherry picked from commit d0a17b2d35f1b083b574246bdbf3e1929471a4a9) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-10 13:45:11 +01:00
Rene Groeschke	e03993d238	Make jdk download repo not consumable (#60875 ) - fixes #60860	2020-08-10 14:22:42 +02:00
Martijn van Groningen	64bb082f9b	Improve error message for non append-only writes that target data stream (#60874 ) Backport of #60809 to 7.x branch. Closes #60581	2020-08-10 13:18:59 +02:00
David Kyle	6b2ddf4453	Fix typo in DataHistogramGroupByIT name (#60880 ) (#60883 )	2020-08-10 11:55:01 +01:00
David Turner	f168bdac7d	Change transitive -> transient in ILM log message (#60871 ) "Transitive" is technically ok here but it's an overloaded word and it's not immediately clear which meaning is intended so this log message always makes me do a double-take. I think both "transient" and "transitory" are clearer, with "transient" being the usual choice.	2020-08-10 11:37:49 +01:00

... 6 7 8 9 10 ...

53488 Commits All Branches Search

53488 Commits

All Branches