OpenSearch

Commit Graph

Author	SHA1	Message	Date
Costin Leau	614f4c13a5	EQL: Introduce case-sensitive equality (#63121 ) Introduce : operator for doing case insensitive string comparisons. Recognizes "*" for wildcard matches in string literals. Restricted only to string types. Relates #62941 (cherry picked from commit 201e577e65f26a9b958a6197fe6c7268da39de29)	2020-10-02 00:23:08 +03:00
Igor Motov	fc13b72cea	Extract histogramFieldDocValues into an utility class (#63100 ) (#63148 ) This function will be needed in the upcoming rate aggs tests.	2020-10-01 15:44:37 -04:00
Marios Trivyzas	3ad4b00c7e	EQL: Clean grammar from `fork` (#63094 ) (#63138 ) Since `fork` is not used, is undocumented in Python EQL and there is no plan at the moment to implement it in the future, removing it from the grammar. User will get parsing exceptions instead of higher level messages about unsupported features which can lead to wrong expectations. (cherry picked from commit f6a0f8f01c1b1893bab86629d1de73e9f9dae8dc)	2020-10-01 21:14:41 +02:00
Lee Hinman	f0f0da2188	[7.x] Add telemetry for data tiers (#63031 ) (#63140 ) Backports the following commits to 7.x: Add telemetry for data tiers (#63031)	2020-10-01 12:37:32 -06:00
Benjamin Trent	95242eccee	[ML] adding `baseline` field to total_feature_importance objects (#63098 ) (#63125 ) This adds a new `baseline` field to the feature importance values. This field contains the baseline importance for a given feature and class.	2020-10-01 09:48:07 -04:00
Dimitris Athanasiou	46c3973400	[7.x][ML] Remove direct access to system index from filter_crud REST test (#63111 ) (#63115 ) This test accesses system indices for 2 reasons. First, it creates a filter that has a different type. This was done to assert that filter is not returned from the APIs. However, now that access to the `.ml-meta` index is restricted, it is not really a concern. Second, it creates a `.ml-meta` index without mappings to test the get API does not fail due to lack of mappings on a sorted field, namely the `filter_id`. Once again, this test is less useful once system indices have restricted access. Relates #62501 Backport of #63111	2020-10-01 15:15:34 +03:00
Costin Leau	c2992ea287	EQL: Fix NPE from incorrect use of ids search (#63032 ) This fixes a bug introduced when moving from mget to ids query. While mget returns all the ids given, id query is a search query and thus by default returns only 10 documents. The fix correctly sets the expected size so all the information is returned inside the response. Fix #63030 (cherry picked from commit 09ba85548a0142a1fe8376efea9cc4e7764a207c)	2020-10-01 13:49:58 +03:00
Hendrik Muhs	e001b4c021	[Transform] fix time rounding in TransformContinuousIT (#63113 ) fix a time rounding problem in the test, due to rounding down to epoch seconds instead of epoch millis fixes #62951	2020-10-01 11:43:50 +02:00
Ignacio Vera	ba5574935e	Remove dependency of Geometry queries with mapped type names (#63077 ) (#63110 ) It extracts the query capabilities from AbstractGeometryFieldType into two new interfaces, GeoshapeQueryable and ShapeQueryable. Those interfaces are implemented by the final mappers.	2020-10-01 10:49:12 +02:00
Howard	8c6e197f51	Remove allocation id from engine (#62680 ) We no longer need the allocation id in Engine.	2020-09-30 15:28:27 -04:00
Marios Trivyzas	f69d268500	SQL: Allow skip of bwc tests on `check` task (#62936 ) (#63089 ) Bwc tests can consume much time to build and to run so it's nice to be able to skip them when running the `check` task on the SQL module. Introduce a new task `checkNoBwc` so one can use: ``` ./gradlew -p x-pack/plugin/sql checkNoBwc ``` to skip them. (cherry picked from commit a52e1846f338f6869273181c6f248579581fa68c)	2020-09-30 20:03:19 +02:00
Marios Trivyzas	0ebaf8a3ec	EQL: Allow escaped backquote in identifiers (#62932 ) (#63082 ) Previously, backquote couldn't not be used inside an escaped identifier, e.g.: ``` `my`identifier` = "some_value" ``` was not allowed. Introduce escaping of the backquote by using a double backquote: ``` `my``identifier` = "some_value" ``` (cherry picked from commit 49514121486f42a58674b3e5901de4021fda5c15)	2020-09-30 19:10:09 +02:00
Alan Woodward	675d18f6ea	Convert dense/sparse vector field mappers to Parametrized form (#62992 ) Also adds a proper MapperTestCase test for dense vectors. Relates to #62988	2020-09-30 16:55:28 +01:00
Dimitris Athanasiou	e09074d382	[7.x][ML] Fix online updates with custom rules referencing filters (#63057 ) (#63064 ) When an opened anomaly detection job is updated with a detection rule that references a filter, apart from updating the c++ process with the rule, we also need to update it with the referenced filter. This commit fixes a bug which led to the job not applying such updates on-the-fly. Fixes #62948 Backport of #63057	2020-09-30 16:01:06 +03:00
Costin Leau	a6b903b783	EQL: Remove unused classes from reponse API (#62134 ) Remove Count class and related artifacts since that functionality is not (yet) available. Update parser name for better error reporting. Fix #62131 (cherry picked from commit 060f500346788c4c5d0b3b9c045facec5d677d3d)	2020-09-30 15:45:30 +03:00
Mayya Sharipova	f221349593	Fix UnsignedLongTests test failure (#63056 ) Test testSortDifferentFormatsShouldFail was occasionally failing for 2 reasons: 1) Documents on "idx2" were not available for search before a search request started 2) Running a test multiple times was causing occasional ResourceAlreadyExistsException for idx2, as idx2 was not deleted for a test. This patch makes the following fixes: 1) Sets up immediate refresh policy for docs in the index"idx2" 2) Creates an index idx2 only once per cluster Closes: #62997	2020-09-30 08:41:31 -04:00
Yang Wang	e31bef4032	Fix API key role descriptors rewrite bug for upgraded clusters (#62917 ) (#63042 ) This PR ensures that API key role descriptors are always rewritten to a target node compatible format before a request is sent.	2020-09-30 22:16:39 +10:00
Benjamin Trent	0860746bf2	[ML] changing ngram loop order for minor performance improvement (#63033 ) (#63059 ) This is a very minor optimization but trivial to implement, so might as well. ``` Benchmark (nGramStrs) Mode Cnt Score Error Units NGramProcessorBenchmark.ngramInnerLoop 1,2,3 avgt 20 4415092.443 ± 31302.115 ns/op NGramProcessorBenchmark.ngramOuterLoop 1,2,3 avgt 20 4235550.340 ± 103393.465 ns/op ``` This measurement is in nanoseconds, consequently, the overall performance of inference is dominated by other factors (i.e. map#put). But, this optimization adds up overtime and is simple.	2020-09-30 07:51:31 -04:00
Benjamin Trent	b7c47b1717	[ML] Add data frame analytics bwc testing (#63012 ) This commit adds bwc testing for data frame analytics. The bwc tests only go back to the 7.9.0. Meaning, initially only rolling upgrades from 7.9.x -> 7.10.0 are tested. Since the feature was experimental in < 7.9.0, this is acceptable.	2020-09-30 07:13:40 -04:00
Przemysław Witek	4366d58564	[7.x] [ML] Implement AucRoc metric for classification (#60502 ) (#63051 )	2020-09-30 12:55:52 +02:00
Dimitris Athanasiou	179fe9cc0e	[7.x][ML] Delete dest index and reindex if incompatible (#62960 ) (#63050 ) Data frame analytics results format changed in version `7.10.0`. If existing jobs that were not completed are restarted, it is possible the destination index had already been created. That index's mappings are not suitable for the new results format. This commit checks the version of the destination index and deletes it when the version is outdated. The job will then continue by recreating the destination index and reindexing. Backport of #62960	2020-09-30 12:57:48 +03:00
Hendrik Muhs	df93f46888	[Transform] fix issue in TransformIndexerStateTests.testStopAtCheckpoint (#63006 ) fix a test issue by improving counting the number of times the deferred listener is called fixes #62996	2020-09-30 08:54:45 +02:00
David Roberts	05427c2bb2	[ML] Add timeouts to named pipe connections (#63022 ) This PR adds timeouts to the named pipe connections of the autodetect, normalize and data_frame_analyzer processes. This argument requires the changes of elastic/ml-cpp#1514 in order to work, so that PR will be merged before this one. (The controller process already had a different mechanism, tied to the ES JVM lifetime.) Backport of #62993	2020-09-29 18:04:02 +01:00
Costin Leau	3bee28056f	EQL: Fix bug in sequences with any pattern (#63007 ) Fix query creation inside sequences with any queries due to lacking a clause to combine, which lead to an invalid request being created. Fix #62967 (cherry picked from commit ff59d8823919a6e70928816e5c3687308ebde33f)	2020-09-29 18:19:25 +03:00
Benjamin Trent	0b3af242d4	[ML] fixing classification feature importance parsing (#63003 ) (#63015 ) Classification feature importance supports various types in the class name: - string - boolean - numerical The xcontent parsing on the server side and the HLRC side should support and test these types.	2020-09-29 10:54:35 -04:00
Yang Wang	068f605040	Use compilation as validation for painless role template (#62845 ) (#63010 ) * Use compilation as validation for painless role template (#62845) Role template validation now performs only compilation if the script is painless. It no longer attempts to execute the script with empty input which is problematic. The compliation process will catch things like invalid syntax, undefined variables, which still provide certain level of protection against ill-defined role templates. Behaviour for Mustache script is unchanged. * Checkstyle	2020-09-30 00:37:41 +10:00
Alan Woodward	de08ba58bf	Convert percolator, murmur3 and histogram mappers to parametrized form (#63004 ) Relates to #62988	2020-09-29 14:42:26 +01:00
Dimitris Athanasiou	facf9ede0a	[ML] Fix binary classification importance in LegacyFeatureImportanceTests (#63000 ) Fixes #62991	2020-09-29 15:53:34 +03:00
Benjamin Trent	2b9032a07d	[7.x] [ML] fixing testTwoJobsWithSameRandomizeSeedUseSameTrainingSet tests (#62976 ) (#62999 ) * [ML] fixing testTwoJobsWithSameRandomizeSeedUseSameTrainingSet tests (#62976) This fixes the two test failures. The shard failure seems to be due to the .ml-stats index being in the middle of being created.	2020-09-29 08:12:20 -04:00
Hendrik Muhs	154a0c00b7	[Transform] add debug logging to investigate #62951 (#62990 )	2020-09-29 12:06:35 +02:00
Mayya Sharipova	ca42726a99	Ensure consistent ordering of hits in test (#62977 ) 50_script_values/Script query fails sometimes as resulting hits will be ordered differently from expected. This patch ensures consisten ordering of hits. Closes #62975	2020-09-29 06:00:34 -04:00
Armin Braun	678688dc84	Avoid Redundantly Loading Monitoring Templates on CS Applier Thread (#62913 ) (#62979 ) This refactors the loading of monitoring templates slightly so that they aren't loaded over and over again (from disk) on CS updates. This isn't an important optimization in production for obvious reasons since it only affects the install stage, but this turned out to cause some slow CS applies in tests. Relates #62853	2020-09-29 11:45:22 +02:00
David Kyle	f23603dafd	[ML][Transform] Filter null objects from field caps request (#62945 ) (#62971 ) If the transform grouping is a script then exclude the field from the source index mappings fields caps request. A null object caused an NPE in the serialisation of FieldCapabilitiesIndexRequest.	2020-09-29 09:07:01 +01:00
Dimitris Athanasiou	7f6c1ff5b4	[7.x][ML] Remove top level importance from classification inference results (#62486 ) (#62964 ) As we have decided top level importance for classification is not useful, it has been removed from the results from the training job. This commit also removes them from inference. Backport of #62486	2020-09-29 10:58:48 +03:00
Mayya Sharipova	4c8c3c8df6	Upgrade lucene to lucene-8.7.0-snapshot-3b59906 (#62978 ) Backport for #62970	2020-09-28 16:52:31 -04:00
Benjamin Trent	a054e62bc4	[ML] allow datafeeds to run if there are any concrete indices (#62827 ) (#62965 ) This commit allows a datafeed to be assigned to a node if only one index pattern has concrete indices.	2020-09-28 12:58:07 -04:00
Hendrik Muhs	be5edcfb26	[Transform] fix possible NPE if transform task has no node assigned (#62946 ) ignore transform tasks that do not have a node assigned when collecting nodes to forward the request for _stop, _stats and _update fixes #62847	2020-09-28 15:25:38 +02:00
Alan Woodward	a3ba24123e	Refactor PointParser to not take FieldMapper as a parameter (#62950 ) Passing FieldMappers to point parsing functions makes trying to build source-only fields from MappedFieldTypes more complicated. This small refactoring changes things so that the relevant parsing and factory functions from AbstractGeometryFieldMapper are instead passed as lambdas to the PointParser constructor.	2020-09-28 13:45:13 +01:00
Costin Leau	ef7a6ce4b2	EQL: Refactor testing infrastructure (#62928 ) Extract reusable methods inside QL TestUtils Rename abstract base classes for clarity Clean-up EQL DataLoader (cherry picked from commit 48db3f285aa8976ead5a9f5d071a9c1046d7bd31)	2020-09-28 14:22:56 +03:00
Hendrik Muhs	b1a8437d0b	[7.x][Transform] Improve robustness when saving state (#62927 ) refactor how state is persisted, call doSaveState only from the indexer thread, except there is none. fixes #60781 fixes #52931 fixes #51629 fixes #52035	2020-09-28 10:12:51 +02:00
Tim Brooks	43a4882951	Move CorsHandler to server (#62007 ) Currently we duplicate our specialized cors logic in all transport plugins. This is unnecessary as it could be implemented in a single place. This commit moves the logic to server. Additionally it fixes a but where we are incorrectly closing http channels on early Cors responses.	2020-09-24 16:32:59 -06:00
Mayya Sharipova	54064a1eec	Unsigned long 64bits(#62892 ) Introduce 64-bit unsigned long field type This field type supports - indexing of integer values from [0, 18446744073709551615] - precise queries (term, range) - precise sort and terms aggregations - other aggregations are based on conversion of long values to double and can be imprecise for large values. Backport for #60050 Closes #32434	2020-09-24 16:51:47 -04:00
Andrei Stefan	a43f29cfc9	EQL: data streams tests for PIT and EQL sequences (#62850 ) (#62889 ) * PIT should run well with data streams (cherry picked from commit 0a89a7db848b015b797c7678874b5c9e33bbd650)	2020-09-24 23:37:46 +03:00
Alan Woodward	e28750b001	Add parameter update and conflict tests to MapperTestCase (#62828 ) (#62902 ) This commit adds a mechanism to MapperTestCase that allows implementing test classes to check that their parameters can be updated, or throw conflict errors as advertised. Child classes override the registerParameters method and tell the passed-in UpdateChecker class about their parameters. Simple conflicts can be checked, using the existing minimal mappings as a base to compare against, or alternatively a particular initial mapping can be provided to check edge cases (eg, norms can be updated from true to false, but not vice versa). Updates are registered with a predicate that checks that the update has in fact been applied to the resulting FieldMapper. Fixes #61631	2020-09-24 20:38:12 +01:00
Armin Braun	4b9ddb48b6	Add Missing Netty Runtime Proc Property to Security Tests (#62846 ) (#62890 ) Same as in the normal Netty tests we have to disable the runtime proc setting in the normal tests task just like we do for the internal cluster tests. Closes #61919 Closes #62298	2020-09-24 20:48:38 +02:00
Jim Ferenczi	78a93dc18f	Request-level circuit breaker support on coordinating nodes (#62884 ) This commit allows coordinating node to account the memory used to perform partial and final reduce of aggregations in the request circuit breaker. The search coordinator adds the memory that it used to save and reduce the results of shard aggregations in the request circuit breaker. Before any partial or final reduce, the memory needed to reduce the aggregations is estimated and a CircuitBreakingException} is thrown if exceeds the maximum memory allowed in this breaker. This size is estimated as roughly 1.5 times the size of the serialized aggregations that need to be reduced. This estimation can be completely off for some aggregations but it is corrected with the real size after the reduce completes. If the reduce is successful, we update the circuit breaker to remove the size of the source aggregations and replace the estimation with the serialized size of the newly reduced result. As a follow up we could trigger partial reduces based on the memory accounted in the circuit breaker instead of relying on a static number of shard responses. A simpler follow up that could be done in the mean time is to [reduce the default batch reduce size](https://github.com/elastic/elasticsearch/issues/51857) of blocking search request to a more sane number. Closes #37182	2020-09-24 18:59:28 +02:00
Benjamin Trent	c56424f740	[ML] write deprecation warning when include_model_definition parameter is used (#62834 ) (#62885 ) for get trained models include_model_definition is now deprecated. This commit writes a deprecation warning if that parameter is used and suggests the caller to utilize the replacement	2020-09-24 11:38:54 -04:00
Stuart Tettemer	8d69334c2f	Scripting: Watcher defaults to unlimited compile rate (#62655 ) (#62671 ) Backport of #62655	2020-09-24 10:22:50 -05:00
Martijn van Groningen	8ca33feffd	Fail with correct error if first backing index exists when auto creating data stream (#62862 ) Backport #62825 to 7.x branch. Today if a data stream is auto created, but an index with same name as the first backing index already exists then internally that error is ignored, which then result that later in the execution of a bulk request, the bulk item fails due to that the data stream hasn't been auto created. This situation can only occur if an index with same is created that will be the backing index of a data stream prior to the creation of the data stream. Co-authored-by: Dan Hermann <danhermann@users.noreply.github.com>	2020-09-24 17:16:34 +02:00
Armin Braun	83ec8dd4e2	Upgrade GCS SDK to 1.113.1 (#62848 ) (#62864 ) Just staying on top of upgrades to the SDK and its dependencies.	2020-09-24 15:43:21 +02:00
Daniel Mitterdorfer	d2166030d1	Mute failing test case in DeleteExpiredDataIT (#62870 ) (#62871 ) Relates #62699	2020-09-24 15:42:52 +02:00
Andrei Dan	e323c5245b	[7.x] ILM: migrate action configures the _tier_preference setting (#62829 ) (#62860 ) The `migrate` action will now configure the `index.routing.allocation.include._tier_preference` setting to the corresponding tiers. For the HOT phase it will configure `data_hot`, for the WARM phase it will configure `data_warm,data_hot` and for the COLD phase `data_cold,data_warm,data_cold`. (cherry picked from commit 9dbf0e6f0c267e40c5bcfb568bb2254da103ae40) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-24 13:37:09 +01:00
Rory Hunter	7771d8b6fa	Tweak the ECS fields in DeprecatedMessage (#62855 ) Backport of #62855. Follow-up to #61484.	2020-09-24 12:07:48 +01:00
Costin Leau	71b92f8699	QL: Optimize Like/Rlike all (#62682 ) Replace common Like and RLike queries that match all characters with IsNotNull (exists) queries Fix #62585 (cherry picked from commit 4c23fad0468a9edd7325b06c6a96f7af37625dbf)	2020-09-24 13:44:53 +03:00
Martijn van Groningen	8d73379493	Adjust skip version in data stream yaml test. (#62831 ) (#62851 ) Relates to #62766	2020-09-24 11:00:02 +02:00
Hendrik Muhs	a70389015d	[Transform] Return parsed count for get transform stats (#62809 ) In case of more than 500 transforms, get and stats return paged results which can be requested using page parameters. For >500 transforms count wasn't parsed out of the server response but taken from size of the list of transforms. The change also adds client/server hlrc tests and fixes a wrong type for count in get. fixes #56245	2020-09-24 08:38:07 +02:00
Nhat Nguyen	38c8a55df8	Better UUID for reader context (#62799 ) We can use a single and stronger UUID for all reader contexts created by the same SearchService. Backport of #62715	2020-09-23 12:50:18 -04:00
Martijn van Groningen	0baefc8ddc	Always validate that only a create op is allowed in bulk api for data streams (#62820 ) Backport #62766 to 7.x branch. The bulk api cache the resolved concrete indices when resolving the user provided index name into the actual index name. The validation that prevents write ops other than create from being executed in a data stream was only performed if the result wasn't cached. In case of cached resolvings, the validation never occurs. The validation would be skipped for all bulk items for a data stream after a create operation for that same data stream. This commit ensures that the validation is always performed for all bulk items (whether the concrete index resolution has been cached or not cached). Closes #62762	2020-09-23 16:27:54 +02:00
Dimitris Athanasiou	7de5201291	[7.x][ML] Handle data frame analytics state spreading over multiple docs (#62564 ) (#62824 ) When state persistence was first implemented for data frame analytics we had the assumption that state would always fit in a single document. However this is not the case any more. This commit adds handling of state that spreads over multiple documents. Backport of #62564	2020-09-23 16:16:34 +03:00
James Rodewig	e3d5915566	[DOCS] Fix JSON spec linnk for PIT API (#61783 )	2020-09-23 14:29:06 +02:00
Dimitris Athanasiou	69e72656fa	[7.x][ML] Reset reindexing progress when DFA job resumes with incomplete reindexing (#62772 ) (#62816 ) This fixes reindexing progress in the scenario when a DFA job that had not finished reindexing is resumed (either because the user called stop and start or because the job was reassigned in the middle of reindexing). Before the fix reindexing progress stays to the value it had reached before until it surpasses that value. When we resume a data frame analytics job we want to preserve reindexing progress and reset all other phases. Except for when reindexing was not completed. In that case we are deleting the destination index and starting reindexing from scratch. Thus we need to reset reindexing progress too. Backport of #62772	2020-09-23 14:09:04 +03:00
Christoph Büscher	054a950ceb	Align version field plugin naming (#62757 ) To better align the plugin naming with other mapper plugins under x-pack (e.g. mapper-flattened) this PR changes the plugin name and the containing directory to "mapper-version"	2020-09-23 11:50:15 +02:00
Christoph Büscher	29074e7055	Add case insensitive prefix and wildcard to 'version' field (#62754 ) (#62782 ) This change adds support for the recently introduced case insensitivity flag for wildcard and prefix queries. Since version field values are encoded differently we need to adapt our own AutomatonQuery variation to add both cases if case insensitivity is turned on.	2020-09-23 11:48:34 +02:00
Luca Cavanna	862fab06d3	Share same existsQuery impl throughout mappers (#57607 ) Most of our field types have the same implementation for their `existsQuery` method which relies on doc_values if present, otherwise it queries norms if available or uses a term query against the _field_names meta field. This standard implementation is repeated in many different mappers. There are field types that only query doc_values, because they always have them, and field types that always query _field_names, because they never have norms nor doc_values. We could apply the same standard logic to all of these field types as `MappedFieldType` has the knowledge about what data structures are available. This commit introduces a standard implementation that does the right thing depending on the data structure that is available. With that only field types that require a different behaviour need to override the existsQuery method. At the same time, this no longer forces subclasses to override `existsQuery`, which could be forgotten when needed. To address this we introduced a new test method in `MapperTestCase` that verifies the `existsQuery` being generated and its consistency with the available data structures.	2020-09-23 11:00:53 +02:00
David Kyle	bc34ecc581	[ML] Mute annotations index upgrade mapping test (#62814 ) For #61908	2020-09-23 09:37:04 +01:00
Luca Cavanna	5ca86d541c	Move stored flag from TextSearchInfo to MappedFieldType (#62717 ) (#62770 )	2020-09-23 09:40:34 +02:00
Albert Zaharovits	b4ec821067	Fix doc-update interceptor for indices with DLS and FLS (#61516 ) This fixes the protection against updates (and bulk updates) for indices with DLS and/or FLS, when the request uses date math expressions.	2020-09-23 08:55:22 +03:00
Nhat Nguyen	663b85b98f	Make keep alive optional in PointInTimeBuilder (#62720 ) Remove the keepAlive parameter from the constructor of PointInTimeBuilder as it's optional.	2020-09-22 18:52:54 -04:00
Nik Everett	fa13585fae	Fix Eclipse build (#62733 ) (#62786 ) Eclipse was confused for two reasons: 1. `:x-pack:plugin` depended on itself. 2. `ql`, `sql`, and `eql` couldn't see some methods. I fixed problem 1 by only adding the "depends on itself" configuration outside of eclipse. I fixed problem 2 by making a `test` sub-project in `ql` that contains test utilities and depending on those where possible.	2020-09-22 17:44:25 -04:00
Jay Modi	cb1dc5260f	Dedicated threadpool for system index writes (#62792 ) This commit adds a dedicated threadpool for system index write operations. The dedicated resources for system index writes serves as a means to ensure that user activity does not block important system operations from occurring such as the management of users and roles. Backport of #61655	2020-09-22 15:31:38 -06:00
Benjamin Trent	77bfb32635	[7.x] [ML] changing to not use global bulk indexing parameters in conjunction with add(object) calls (#62694 ) (#62784 ) * [ML] changing to not use global bulk indexing parameters in conjunction with add(object) calls (#62694) * [ML] changing to not use global bulk indexing parameters in conjunction with add(object) calls global parameters, outside of the global index, are ignored for internal callers in certain cases. If the interal caller is adding requests via the following methods: ``` - BulkRequest#add(IndexRequest) - BulkRequest#add(UpdateRequest) - BulkRequest#add(DocWriteRequest) - BulkRequest#add(DocWriteRequest[]) ``` It is better to specifically set the desired parameters on the requests before they are added to the bulk request object. This commit addresses this issue for the ML plugin * unmuting test	2020-09-22 15:07:08 -04:00
Marios Trivyzas	1e72144847	EQL: Remove support for `=` for comparisons (#62756 ) (#62775 ) Since `=` is rarely used and is undocumented we its support for equality comparisons keeping `==` as the only option. `=` is now only used for assignments like in `maxspan=10m`. Closes: #62650 (cherry picked from commit ad5ae4d887b5c2feca2d0e874d7bdf738e3fd54e)	2020-09-22 20:56:04 +02:00
Nik Everett	39a617773d	Raname grok's built-in patterns (backport of #62735 ) (#62765 ) This reworks the code around grok's built-in patterns to name things more like the rest of the code. Its not a big deal, but I'm just more used to having `public static final` constants in SHOUTING_SNAKE_CASE.	2020-09-22 13:06:43 -04:00
Lisa Cawley	7e97f17845	[DOCS] Add SLM security privileges (#62737 )	2020-09-22 08:44:18 -07:00
James Rodewig	c0e611e0a7	[DOCS] Fix typo: NamedID -> NameID (#62721 ) (#62767 ) Co-authored-by: Greg Back <1045796+gtback@users.noreply.github.com>	2020-09-22 10:30:35 -04:00
markharwood	a0df0fb074	Search - add case insensitive flag for "term" family of queries #61596 (#62661 ) Backport of fe9145f Closes #61546	2020-09-22 13:56:51 +01:00
Andrei Dan	0be89bcd7f	Mute RegressionIT.testTwoJobsWithSameRandomizeSeedUseSameTrainingSet (#62763 )	2020-09-22 13:43:15 +01:00
David Kyle	31fbc6800f	[7.x] [ML] Add upgrade mappings assertions to full cluster restart tests (#62293 ) (#62305 ) Refactors the index mapping checks in the rolling upgrade tests and use that shared code in the full cluster restart tests.	2020-09-22 13:09:51 +01:00
Luca Cavanna	9ae29713fd	Dense vector field type minor fixes (#62631 ) The dense vector field is not aggregatable although it produces fielddata through its BinaryDocValuesField. It should pass up hasDocValues set to true to its parent class in its constructor, and return isAggregatable false. Same for the sparse vector field (only in 7.x). This may not have consequences today, but it will be important once we try to share the same exists query implementation throughout all of the mappers with #57607.	2020-09-22 10:40:51 +02:00
Christoph Büscher	593511e5c9	VersionFieldIT should register transportClientPlugins (#62734 )	2020-09-22 10:10:44 +02:00
Yang Wang	28503f04f7	Fix privilege requirement for CCS with Point In Time reader (#62261 ) (#62696 ) When target indices are remote only, CCS does not require user to have privileges on the local cluster. This PR ensure Point-In-Time reader follows the same pattern. Relates: #61827	2020-09-22 12:51:51 +10:00
Yang Wang	897d2e8a02	Fix ccs permission for search with a scroll id (#62053 ) (#62695 ) CCS with remote indices only does not require any privileges on the local cluster. This PR ensures that search with scroll follow the permission model.	2020-09-22 11:49:40 +10:00
Andrei Dan	79d0c4ed18	ILM: allow check-migration step to continue if tier setting unset (#62636 ) (#62724 ) This allows the `check-migration` step to move past the allocation check if the tier routing settings are manually unset. This helps a user unblock ILM in case a tier is removed (ie. if the warm tier is decommissioned this will allow users to resume the ILM policies stuck in `check-migration` waiting for the warm nodes to become available and the managed index to allocate. this allows the index to allocate on the other available tiers) (cherry picked from commit d7a1eaa7f51d0972d10c0df1d3cd77d6b755dd41) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-21 20:40:01 +01:00
Marios Trivyzas	1f612cccbb	SQL: Implement FORMAT function (#55454 ) (#62701 ) Implement FORMAT according to the SQL Server spec: https://docs.microsoft.com/en-us/sql/t-sql/functions/format-transact-sql?view=sql-server-ver15#ExampleD by translating to the java.time patterns used in DATETIME_FORMAT. Closes: #54965 Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com> Co-authored-by: Bogdan Pintea <bogdan.pintea@elastic.co> Co-authored-by: Andrei Stefan <astefan@users.noreply.github.com> (cherry picked from commit da511f4e033db6e8a6aa2a54b23e906b5e026845)	2020-09-21 19:22:04 +02:00
Alan Woodward	1dde4983f6	Convert ConstantKeywordFieldMapper to parametrized form (#62688 ) As part of the conversion, adds the ability to customize merge validation - in this case, we allow an update to the constant value if it is currently set to null, but refuse further updates once it has been set once. This commit also converts ParametrizedMapperTests to use MapperServiceTestCase.	2020-09-21 15:22:56 +01:00
David Roberts	4537561692	[TEST] Mute IPFilterTests.testThatNodeStartsWithIPFilterDisabled Due to https://github.com/elastic/elasticsearch/issues/62298	2020-09-21 14:49:43 +01:00
Christoph Büscher	d6e977f87b	Muting VersionFieldIT.testTermsAggregation	2020-09-21 15:46:48 +02:00
Christoph Büscher	803f78ef05	Add field type for version strings (#59773 ) (#62692 ) This PR adds a new 'version' field type that allows indexing string values representing software versions similar to the ones defined in the Semantic Versioning definition (semver.org). The field behaves very similar to a 'keyword' field but allows efficient sorting and range queries that take into accound the special ordering needed for version strings. For example, the main version parts are sorted numerically (ie 2.0.0 < 11.0.0) whereas this wouldn't be possible with 'keyword' fields today. Valid version values are similar to the Semantic Versioning definition, with the notable exception that in addition to the "main" version consiting of major.minor.patch, we allow less or more than three numeric identifiers, i.e. "1.2" or "1.4.6.123.12" are treated as valid too. Relates to #48878	2020-09-21 14:25:42 +02:00
Tanguy Leroux	f775cf8594	Add test for snapshot incrementality of snapshot-backed indices (#62641 ) This commit adds a test that verifies that snapshots incrementality is respected when a snapshot-backed index is snapshotted. This test mounts a snapshot as a snapshot-backed index, creates a new snapshot from it and then verifies that no new data blobs were added to the repository.	2020-09-21 12:06:47 +02:00
Christos Soulios	ad79a2b6a1	[7.x] Histogram field type support for min/max aggregations (#62689 ) Implement min/max aggregations for histogram fields. Backports #62532	2020-09-21 12:53:56 +03:00
Henning Andersen	2eeb1bddde	Autoscaling decision return absolute capacity (#61575 ) (#62670 ) The autoscaling decision API now returns an absolute capacity, and leaves the actual decision of whether a scale up or down is needed to the orchestration system. The decision API now returns both a tier and node level required and current capacity as wells as a decider level breakdown of the same though with in particular current memory still not populated.	2020-09-19 09:05:23 +02:00
Luca Cavanna	1580fc70bd	Minor FlatObjectFieldMapper fix (#62633 ) The splitQueriesOnWhitespace instance field can be made final, and setter and getter are not always needed.	2020-09-19 00:24:44 +02:00
William Brafford	8aeab0bec9	Add refresh policy to logstash plugin write requests (#62583 ) (#62665 )	2020-09-18 17:44:53 -04:00
Lee Hinman	4a08928c47	[7.x] Add index.routing.allocation.include._tier_preference setting (#62589 ) (#62667 ) This commit adds the `index.routing.allocation.prefer._tier` setting to the `DataTierAllocationDecider`. This special-purpose allocation setting lets a user specify a preference-based list of tiers for an index to be assigned to. For example, if the setting were set to: ``` "index.routing.allocation.prefer._tier": "data_hot,data_warm,data_content" ``` If the cluster contains any nodes with the `data_hot` role, the decider will only allow them to be allocated on the `data_hot` node(s). If there are no `data_hot` nodes, but there are `data_warm` and `data_content` nodes, then the index will be allowed to be allocated on `data_warm` nodes. This allows us to specify an index's preference for tier(s) without causing the index to be unassigned if no nodes of a preferred tier are available. Subsequent work will change the ILM migration to make additional use of this setting. Relates to #60848	2020-09-18 15:41:36 -06:00
Christos Soulios	6a298970fd	[7.x] Allow metadata fields in the _source (#62616 ) Backports #61590 to 7.x So far we don't allow metadata fields in the document _source. However, in the case of the _doc_count field mapper (#58339) we want to be able to set This PR adds a method to the metadata field parsers that exposes if the field can be included in the document source or not. This way each metadata field can configure if it can be included in the document _source	2020-09-18 19:56:41 +03:00
Benjamin Trent	0f142c6afc	[ML] all multiple wildcard values for GET Calendars, Events, and DELETE forecasts (#62563 ) (#62629 ) This commit adjusts the following APIs so now they not only support an `_all` case, but wildcard patterned Ids as well. - `GET _ml/calendars/<calendar_id>/events` - `GET _ml/calendars/<calendar_id>` - `GET _ml/anomaly_detectors/<job_id>/model_snapshots/<snapshot_id>` - `DELETE _ml/anomaly_detectors/<job_id>/_forecast/<forecast_id>`	2020-09-18 11:06:07 -04:00
Benjamin Trent	e163559e4c	[7.x] [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922 ) (#62620 ) * [ML] Add new include flag to GET inference/<model_id> API for model training metadata (#61922) Adds new flag include to the get trained models API The flag initially has two valid values: definition, total_feature_importance. Consequently, the old include_model_definition flag is now deprecated. When total_feature_importance is included, the total_feature_importance field is included in the model metadata object. Including definition is the same as previously setting include_model_definition=true. * fixing test * Update x-pack/plugin/core/src/test/java/org/elasticsearch/xpack/core/ml/action/GetTrainedModelsRequestTests.java	2020-09-18 10:07:35 -04:00
Nhat Nguyen	8bea6b3711	Increase keep alive of point in time in async search tests (#62593 ) Async search tests can take more than one minute due to the excessive trace logs. And the point in time in the tests can be expired the midway. Closes #62451	2020-09-18 08:01:57 -04:00
Martijn van Groningen	4c949b0869	Adjust allowed warnings in data stream yaml test. (#62610 )	2020-09-18 12:44:57 +02:00
Marios Trivyzas	b072de4ce0	EQL: Disallow chained comparisons (#62567 ) (#62601 ) Expressions like `1 = 2 = 3 = 4` or `1 < 2 = 3 >= 4` were treated with leftmost priority: ((1 = 2) = 3) = 4 which can lead to confusing results. Since such expressions don't make so much change for EQL filters we disallow them in the parser to prevent unexpected results from their bad usage. Major DBs like PostgreSQL and Oracle also disallow them in their SQL syntax. (counter example would be MySQL which interprets them as we did before with leftmost priority). Fixes: #61654 (cherry picked from commit 8f94981bb093f104228d267b532e0a3d5b7f6a38)	2020-09-18 10:48:14 +02:00
Costin Leau	81f2f84177	EQL: Allow requests with size 0 (#62537 ) The purpose for this change is to allow validation of queries without having to actually execute them. The optimizer already picks up this case. Fix #62494 (cherry picked from commit 675889559b2f96a0c1faa6fc84fd537148ba2cce)	2020-09-18 11:24:39 +03:00
Martijn van Groningen	5190b0961d	adjust skip reason	2020-09-18 10:10:00 +02:00
Adrien Grand	4de8579455	Upgrade to lucene-8.7.0-snapshot-830bd186a8d. (#62596 )	2020-09-18 09:51:34 +02:00
Martijn van Groningen	c83d8ce78e	Adjust skip version data stream test (#62597 ) after #62527 was backported.	2020-09-18 09:28:16 +02:00
David Turner	7324ee1044	Remove unused upgrade actions (#62552 ) These actions were almost completely removed in #40075 but a couple of classes were left in place. This commit completes their removal.	2020-09-18 08:16:13 +01:00
Ignacio Vera	6a3d731be1	Only call reduce on a single InternalAggregation when needed (#62525 ) (#62594 ) Adds a new abstract method in InternalAggregation that flags the framework if it needs to reduce on a single InternalAggregation.	2020-09-18 08:43:58 +02:00
Jake Landis	5b7246157f	[7.x] Fix projects that failed to build within Intellij (#62258 ) (#62408 ) This commit address some build failures from the perspective of Intellij. These changes include: * changing an order of a dependency definition that seems to can cause Intellij build to fail. * introduction of an abstract class out of the test source set (seems to be an issue sharing classes cross projects with non-standard source sets. * a couple of missing dependency definitions (not sure how the command line worked prior to this)	2020-09-17 17:45:12 -05:00
William Brafford	b764f8977e	Copy Key Certs for javaRestTest (#62584 )	2020-09-17 17:45:42 -04:00
Tim Vernum	ab427534f7	[DOCS] Add warning about derived API keys to docs (#62351 )	2020-09-17 13:46:25 -07:00
Dimitris Athanasiou	7118ff7976	[7.x][ML] Remove model snapshot legacy doc ids (#62434 ) (#62569 ) Removes methods that were no longer used regarding version 5.4 doc ids of ModelState. Also adds clean up of 5.4 model state and quantile docs in the daily maintenance. Backport of #62434	2020-09-17 23:43:28 +03:00
Lee Hinman	9bb7ce0b22	[7.x] Allocate new indices on "hot" or "content" tier depending on data stream inclusion (#62338 ) (#62557 ) Backports the following commits to 7.x: Allocate new indices on "hot" or "content" tier depending on data stream inclusion (#62338)	2020-09-17 13:29:23 -06:00
William Brafford	5a0dca2491	Deprecate xpack.eql.enabled setting and make it a no-op (#61375 ) (#62491 ) * Deprecate xpack.eql.enabled and make it a no-op * Remove uses of xpack.eql.enabled	2020-09-17 14:17:27 -04:00
Martijn van Groningen	5f643433c6	Prohibit the usage of create index api in namespaces managed by data stream templates (#62574 ) Backport of #62527 to 7.x branch. This commit adds validation that prohibits the creation of regular indices in the namespace of templates with data streams enabled. It shouldn't be possible to create ordinary indices when the name of the index matches with a composable index template that enables data streams. Auto creation has logic that creates data streams instead of regular indices. However validation logic for the create index api was missing.	2020-09-17 20:10:42 +02:00
Jim Ferenczi	df93b31b15	Faster sequential access for stored fields (#62509 ) (#62573 ) Faster sequential access for stored fields Spinoff of #61806 Today retrieving stored fields at search time is optimized for random access. So we make no effort to keep state in order to not decompress the same data multiple times because two documents might be in the same compressed block. This strategy is acceptable when retrieving a top N sorted by score since there is no guarantee that documents will be on the same block. However, we have some use cases where the document to retrieve might be completely sequential: Scrolls or normal search sorted by document id. Queries on Runtime fields that extract from _source. This commit exposes a sequential stored fields reader in the custom leaf reader that we use at search time. That allows to leverage the merge instances of stored fields readers that are optimized for sequential access. This change focuses on the fetch phase for now and leverages the merge instances for stored fields only if all documents to retrieve are adjacent. Applying the same logic in the source lookup of runtime fields should be trivial but will be done in a follow up. The speedup on queries sorted by doc id is significant. I played with the scroll task of the http_logs rally track on my laptop and had the following result: \| Metric \| Task \| Baseline \| Contender \| Diff \| Unit \| \|--------------------------------------------------------------:\|-------:\|------------:\|------------:\|---------:\|--------:\| \| Total Young Gen GC \| \| 0.199 \| 0.231 \| 0.032 \| s \| \| Total Old Gen GC \| \| 0 \| 0 \| 0 \| s \| \| Store size \| \| 17.9704 \| 17.9704 \| 0 \| GB \| \| Translog size \| \| 2.04891e-06 \| 2.04891e-06 \| 0 \| GB \| \| Heap used for segments \| \| 0.820332 \| 0.820332 \| 0 \| MB \| \| Heap used for doc values \| \| 0.113979 \| 0.113979 \| 0 \| MB \| \| Heap used for terms \| \| 0.37973 \| 0.37973 \| 0 \| MB \| \| Heap used for norms \| \| 0.03302 \| 0.03302 \| 0 \| MB \| \| Heap used for points \| \| 0 \| 0 \| 0 \| MB \| \| Heap used for stored fields \| \| 0.293602 \| 0.293602 \| 0 \| MB \| \| Segment count \| \| 541 \| 541 \| 0 \| \| \| Min Throughput \| scroll \| 12.7872 \| 12.8747 \| 0.08758 \| pages/s \| \| Median Throughput \| scroll \| 12.9679 \| 13.0556 \| 0.08776 \| pages/s \| \| Max Throughput \| scroll \| 13.4001 \| 13.5705 \| 0.17046 \| pages/s \| \| 50th percentile latency \| scroll \| 524.966 \| 251.396 \| -273.57 \| ms \| \| 90th percentile latency \| scroll \| 577.593 \| 271.066 \| -306.527 \| ms \| \| 100th percentile latency \| scroll \| 664.73 \| 272.734 \| -391.997 \| ms \| \| 50th percentile service time \| scroll \| 522.387 \| 248.776 \| -273.612 \| ms \| \| 90th percentile service time \| scroll \| 573.118 \| 267.79 \| -305.328 \| ms \| \| 100th percentile service time \| scroll \| 660.642 \| 268.963 \| -391.678 \| ms \| \| error rate \| scroll \| 0 \| 0 \| 0 \| % \| Closes #62024	2020-09-17 19:58:18 +02:00
Andrei Dan	3753682877	Fix AllocationRoutedStep equals and hashcode (#62548 ) (#62559 ) (cherry picked from commit 79039e16305c7fb71ee012e693219a0d2b77e97b) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-17 17:40:21 +01:00
Alan Woodward	5421a743a7	Move SearchLookup into FetchContext (#62549 ) FetchSubPhase#getProcessor currently takes a SearchLookup parameter. This however is only needed by a couple of subphases, and will almost certainly change in future as we want to simplify how fetch phases retrieve values for individual hits. To future-proof against further signature changes, this commit moves the SearchLookup reference into FetchContext instead.	2020-09-17 17:39:02 +01:00
Dimitris Athanasiou	f5c28e2054	[7.x][ML] Do not start data frame analytics when too many docs are analyzed (#62547 ) (#62558 ) The data frame structure in c++ has a limit on 2^32 documents. This commit adds a check that the number of documents involved in the analysis are less than that and fails to start otherwise. That saves the cost of reindexing when it is unnecessary. Backport of #62547	2020-09-17 19:06:38 +03:00
Mark Vieira	7d36393b09	Disable composePull task on idp-fixture project due to error (#62510 )	2020-09-17 08:55:47 -07:00
Nik Everett	4d272a2a00	Runtime fields: fix a test name (#62498 ) This fixes the name of a test method so we actually run it. I broke it a few commits ago without realizing it.	2020-09-17 11:17:44 -04:00
Lee Hinman	3081b3827b	[7.x] Add host.ip and observer.ip fields to the synthetics-- mappings (#62412 ) (#62553 ) We need to ensure these are mapped as 'ip' instead of a keyword, even if they do end up not being used. Relates to #62193	2020-09-17 09:01:53 -06:00
Lee Hinman	a636d106bf	[7.x] Remove data_frozen node role (tier) and frozen ILM phase (#62403 ) (#62465 ) Backports the following commits to 7.x: Remove data_frozen node role (tier) and frozen ILM phase (#62403)	2020-09-17 08:58:07 -06:00
Andrei Dan	fe1194d58f	[7.x] ILM migrate data between tiers (#61377 ) (#62536 ) This adds ILM support for automatically migrating the managed indices between data tiers. This proposal makes use of a MigrateAction that is injected (similar to how the Unfollow action is injected) in phases that don't define index allocation rules using the AllocateAction or don't explicitly define the MigrateAction itself (regardless if it's enabled or disabled). (cherry picked from commit c1746afffd61048d0c12d3a77e6d8191a804ed49) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-09-17 15:08:31 +01:00
Adam Locke	db9dd9f7e1	[DOCS] Updating CCR setup to be more tutorial focused (#62256 ) (#62499 ) * Applying some initial changes. * Updating intro and screenshots. * Removing unnecessary links, streamlining content, and adding GIF. * Adding what's next section. * Removing what's next. * Minor edits. * Apply suggestions from code review Co-authored-by: debadair <debadair@elastic.co> * Incorporating review feedback. * Moving CCR user privileges to another page, plus more edits. * Apply suggestions from code review Co-authored-by: debadair <debadair@elastic.co> * Incorporating more review feedback. * Adding TESTSETUP to fix build errors. * Update docs/reference/ccr/getting-started.asciidoc Co-authored-by: debadair <debadair@elastic.co> * Swapping GIF for mp4 hosted on web team CMS. * Removing GIF in favor of mp4. Co-authored-by: debadair <debadair@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: debadair <debadair@elastic.co> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-09-17 09:50:21 -04:00
Luca Cavanna	3cf559bf9c	Minor cleanup of runtime fields classes (#62531 ) This commit addresses some compiler warnings in the runtime fields classes	2020-09-17 15:47:05 +02:00
David Kyle	417ce9396d	[ML] Add datafeed run time fields integration test (#62535 ) (#62538 )	2020-09-17 13:41:07 +01:00
Fernando Briano	d3bdff6bbf	Adds quotes to timestamp values in runtime_fields/40_date YAML test (#62526 )	2020-09-17 12:09:10 +01:00
Christoph Büscher	aba86d7d29	Fix condition in ILM step that cannot be met (#62377 ) (#62528 ) ReplaceDataStreamBackingIndexStep#performAction seems to perform an equality check on an original Index and the write indexes names, but because this compares an Index instance to a String, the condition can never be met. This PR changes this comparison.	2020-09-17 12:38:05 +02:00
Tanguy Leroux	a39ddb8a0f	Remove ShardClusterSnapshotRestoreIT and fold test in DataStreamsSnapshotsIT (#62470 ) (#62523 ) ShardClusterSnapshotRestoreIT is confusing as we already have a very complete SharedClusterSnapshotRestoreIT test suite. This commit removes ShardClusterSnapshotRestoreIT and folds its unique test in DataStreamsSnapshotsIT.	2020-09-17 12:37:17 +02:00
Tim Vernum	fe3bf86620	Fix RestrictedTrustManagerTests on Zulu8 (#62436 ) Since #61857 we test using BCJSSE (Bouncy Castle SSL) when running on Zulu8 because Azul have backported SSL changes from Java11 into their Java8 JRE which prevents us from using Sun JSSE in FIPS mode. BCJSSE uses different exception messages than Sun JSSE, so we needed to update RestrictedTrustManagerTests.testThatDelegateTrustManagerIsRespected to reflect the fact that sometimes we might be receive BCJSSE error messages on a Java8 JVM Resolves: #62281	2020-09-17 20:00:42 +10:00
Marios Trivyzas	abce04888f	EQL: Forbid usage of ['] for string literals (#62458 ) (#62496 ) The usage of single quotes to wrap a string literal is forbidden and an error encouraging the user to user double quotes is returned. Tests are properly adjusted. Relates to #61659 (cherry picked from commit 8be400b77370bf4cf68c89f492c2d235f3cce43c)	2020-09-17 11:29:09 +02:00
Dimitris Athanasiou	d091c12e0c	[7.x] Generalize AsyncTwoPhaseIndexer first phase (#61739 ) (#62482 ) Current implementations of the indexer are using aggregations. Thus each search step executes a search action. However, we can generalize that to allow for any action that returns a `SearchResponse`. This commit abstracts the search phase from the search action. Backport of #61739	2020-09-17 11:57:22 +03:00
Adrien Grand	9a8225bbc1	Upgrade to lucene-8.7.0-snapshot-9cd3af50f80. (#62450 ) (#62476 ) This new snapshot contains the following JIRAs that we're interested in: - [LUCENE-9525](https://issues.apache.org/jira/browse/LUCENE-9525) Better handling of small documents. This should improve retrieval times when documents are less than ~1kB. - [LUCENE-9510](https://issues.apache.org/jira/browse/LUCENE-9510) Faster flushes when index sorting is enabled by not compressing the temporary files that store stored fields and term vectors.	2020-09-17 10:28:20 +02:00
Luca Cavanna	26388fe22e	Runtime fields: rename fielddata and mapped field type classes (#62483 ) With this commit we rename all of the fielddata, doc_values and mapped field type classes for runtime fields to not start with the Script prefix but rather their runtime type (e.g. Boolean) and only then Script	2020-09-17 09:14:30 +02:00
Ignacio Vera	2d3ca9c155	Introduce a sparse HyperLogLogPlusPlus class for cloning and serializing low cardinality buckets (#62480 ) (#62520 ) Reduces the memory footprint of an HLL++ structure that uses Linear counting when cloning or deserialising the data structure.	2020-09-17 08:54:50 +02:00
Costin Leau	ceaf96061c	EQL: Fetch sequence documents using Point-In-Time (#62469 ) To preserve the PIT semantics, the retrieval of results has moved from using multi-get to using an idsQuery. (cherry picked from commit 1c2362fcf2be62ce568b3772924abce7331ef23c)	2020-09-17 00:12:19 +03:00
Luca Cavanna	1e352fdb7f	Runtime fields: rename script classes (#62448 ) With this commit we rename the script classes used for each mapped field type used for runtime fields. The new naming is a shorter version of the previous one: from e.g. BooleanScriptFieldScrip to BooleanScript . We also move such classes to the existing mapper package.	2020-09-16 18:00:06 +02:00
James Rodewig	6394629b99	[DOCS] Document `toJSON` function for role query (#62257 ) (#62462 )	2020-09-16 10:38:56 -04:00
Christoph Büscher	f8634e5bea	Muting SimpleSecurityNetty4ServerTransportTests	2020-09-16 15:14:08 +02:00
Benjamin Trent	341eeae6e7	[ML] fixes testWatchdog test verifying matcher is interrupted on timeout (#62391 ) (#62447 ) Constructing the timout checker FIRST and THEN registering the watcher allows the test to have a race condition. The timeout value could be reached BEFORE the matcher is added. To prevent the matcher never being interrupted, a new timedOut value is added to the watcher thread entry. Then when a new matcher is registered, if the thread was previously timedout, we interrupt the matcher immediately. closes #48861	2020-09-16 09:13:22 -04:00
Lyudmila Fokina	167172a057	Update authc failure headers on license change (#61734 ) (#62442 ) Backport of #61734	2020-09-16 14:37:03 +02:00
Benjamin Trent	8d89a28126	[ML] unmuting test for testTooManyPartitions memory check on windows (#62393 ) (#62405 ) This commit unmutes the windows check for testTooManyPartitions test. The assertion has since changed to include a soft_limit check. This coupled with changes over the past years means the test should be enabled again. related to: #32033	2020-09-16 07:03:10 -04:00
Christoph Büscher	6a016fb755	Muting LogstashSystemIndexIT.testPipelineCRUD	2020-09-16 11:04:41 +02:00
Hendrik Muhs	8566e9e3e7	[Transform] Make pivot validation sub-agg aware (#62381 ) With the addition of sub aggregations like filter, the validation could fail if 2 sub aggs use the same output name. This change makes validation sub-agg aware. fixes #57814	2020-09-16 07:55:58 +02:00
Yang Wang	a11dfbe031	Oidc additional client auth types (#58708 ) (#62289 ) The OpenID Connect specification defines a number of ways for a client (RP) to authenticate itself to the OP when accessing the Token Endpoint. We currently only support `client_secret_basic`. This change introduces support for 2 additional authentication methods, namely `client_secret_post` (where the client credentials are passed in the body of the POST request to the OP) and `client_secret_jwt` where the client constructs a JWT and signs it using the the client secret as a key. Support for the above, and especially `client_secret_jwt` in our integration tests meant that the OP we use ( Connect2id server ) should be able to validate the JWT that we send it from the RP. Since we run the OP in docker and it listens on an ephemeral port we would have no way of knowing the port so that we can configure the ES running via the testcluster to know the "correct" Token Endpoint, and even if we did, this would not be the Token Endpoint URL that the OP would think it listens on. To alleviate this, we run an ES single node cluster in docker, alongside the OP so that we can configured it with the correct hostname and port within the docker network. Co-authored-by: Ioannis Kakavas <ioannis@elastic.co>	2020-09-16 14:29:09 +10:00
Nik Everett	24a24d050a	Implement fields fetch for runtime fields (backport of #61995 ) (#62416 ) This implements the `fields` API in `_search` for runtime fields using doc values. Most of that implementation is stolen from the `docvalue_fields` fetch sub-phase, just moved into the same API that the `fields` API uses. At this point the `docvalue_fields` fetch phase looks like a special case of the `fields` API. While I was at it I moved the "which doc values sub-implementation should I use for fetching?" question from a bunch of `instanceof`s to a method on `LeafFieldData` so we can be much more flexible with what is returned and we're not forced to extend certain classes just to make the fetch phase happy. Relates to #59332	2020-09-15 20:24:10 -04:00
Nik Everett	e5ad3a41f1	Check for runtime field loops in queries (backport of #61927 ) (#62421 ) We were checking for loops in queries before, but we had an "off by one" error where we wouldn't notice the "top level" runtime field when detecting a loop. So the error message would be wrong. I also caught a few bugs with query generation caused by missing `@Override` annotations and fixed a few of them. There is a bug with `regexp` queries with match options that I'm not fixing in this PR but will get to later. Relates to #59332	2020-09-15 17:24:19 -04:00
Costin Leau	b2e85d5639	SQL: Do not resolve self-referencing aliases (#62382 ) Prevent the analyzer for trying to resolve aliases on expressions that reference themselves (or fields within themselves) as that causes infinite recursion. Fix #62296 (cherry picked from commit 021d27815b03e92e02859bc9c0c8eec78f30c72e)	2020-09-15 20:53:28 +03:00
Armin Braun	9ac4ee9c44	Increase Flaky Timeout in testIlmHistoryIndexCanRollover (#62353 ) (#62402 ) This busy assert easily takes about 5s on a very fast work station so the default of 10s is not sufficient here at all.	2020-09-15 19:50:45 +02:00
Nik Everett	771a8893a6	Add more debugging information for cardinality agg (#62317 ) (#62397 ) This adds two extra bits of info to the profiler: 1. Count of the number of different types of collectors. This lets us figure out if we're using the optimization for segment ordinals. It adds a few more similar counters just for good measure. 2. Profiles the `getLeafCollector` and `postCollection` methods. These are non-trivial for some aggregations, like cardinality.	2020-09-15 13:21:11 -04:00
William Brafford	af64e46065	Add logstash system index APIs (#53350 ) (#62347 ) We want Logstash indices to be system indices, but the logstash service will still need to be able to manage its indices. This PR adds special system index APIs to the logstash plugin so that logstash can manage its pipelines without direct access to the underlying indices. * Add logstash module with dedicated logstash APIs * merge with x-pack plugin * add system index access allowance * Break out serialization tests into distinct classes * Log failures for partial multiget failure * Move LogstashSystemIndexIT to javaRestTest task Co-authored-by: William Brafford <william.brafford@elastic.co> Co-authored-by: Jay Modi <jaymode@users.noreply.github.com>	2020-09-15 12:42:14 -04:00

1 2 3 4 5 ...

6455 Commits