OpenSearch

Commit Graph

Author	SHA1	Message	Date
Yannick Welsch	a610513ec7	Provide repository-level stats for searchable snapshots (#55051 ) Provides basic repository-level stats that will allow us to get some insight into how many requests are actually being made by the underlying SDK. Currently only tracks GET and LIST calls for S3 repositories. Most of the code is unfortunately boiler plate to add a new endpoint that will help us better understand some of the low-level dynamics of searchable snapshots.	2020-04-14 14:34:08 +02:00
Alan Woodward	16ebbff3b6	Mute CancellableTasksIT (#55152 ) Test failures are tracked in #55106	2020-04-14 12:55:20 +01:00
Yannick Welsch	73c522320a	Fix DanglingIndicesIT wait conditions (#55105 ) Closes #55105	2020-04-14 13:54:33 +02:00
William Brafford	52bebec51f	NodeInfo response should use a collection rather than fields (#54460 ) (#55132 ) This is a first cut at giving NodeInfo the ability to carry a flexible list of heterogeneous info responses. The trick is to be able to serialize and deserialize an arbitrary list of blocks of information. It is convenient to be able to deserialize into usable Java objects so that we can aggregate nodes stats for the cluster stats endpoint. In order to provide a little bit of clarity about which objects can and can't be used as info blocks, I've introduced a new interface called "ReportingService." I have removed the hard-coded getters (e.g., getOs()) in favor of a flexible method that can return heterogeneous kinds of info blocks (e.g., getInfo(OsInfo.class)). Taking a class as an argument removes the need to cast in the client code.	2020-04-13 17:18:39 -04:00
Nhat Nguyen	96bb1164f0	Support hierarchical task cancellation (#54757 ) With this change, when a task is canceled, the task manager will cancel not only its direct child tasks but all also its descendant tasks. Closes #50990	2020-04-13 12:35:21 -04:00
Igor Motov	51c6f69e02	[7.x] Add support for filters to T-Test aggregation (#54980 ) (#55066 ) Adds support for filters to T-Test aggregation. The filters can be used to select populations based on some criteria and use values from the same or different fields. Closes #53692	2020-04-13 12:28:58 -04:00
Ioannis Kakavas	7a8a66d9ae	[7.x] Fix ReloadSecureSettings API to consume password (#54771 ) (#55059 ) The secure_settings_password was never taken into consideration in the ReloadSecureSettings API. This commit fixes that and adds necessary REST layer testing. Doing so, it also: - Allows TestClusters to have a password protected keystore so that it can be set for tests. - Adds a parameter to the run task so that elastisearch can be run with a password protected keystore from source.	2020-04-13 09:50:55 +03:00
Yang Wang	862799956c	Deprecate local parameter for get field mapping request (#55014 ) (#55099 ) The usage of local parameter for GetFieldMappingRequest has been removed from the underlying transport action since v2.0. This PR deprecates the parameter from rest layer. It will be removed in next major version.	2020-04-12 13:48:47 +10:00
Nik Everett	c00811f3a3	Make some agg tests easier to read (#54954 ) (#55079 ) We added a fancy method to provide random realistic test data to the reduction tests in #54910. This uses that to remove some of the more esoteric machinations in the agg tests. This will marginally increase the coverage of the serialiation tests and, more importantly, remove some mysterious value generation code that only really made sense for random reduction tests but was used all over the place. It doesn't, on the other hand, make the tests shorter. Just hopefully more clear. I only cleaned up a few tests this way. If we like this it'd probably be worth grabbing others.	2020-04-10 14:15:30 -04:00
Nik Everett	b99a50bcb9	value_count Aggregation optimization (backport of #54854 ) (#55076 ) We found some problems during the test. Data: 200Million docs, 1 shard, 0 replica hits \| avg \| sum \| value_count \| ----------- \| ------- \| ------- \| ----------- \| 20,000 \| .038s \| .033s \| .063s \| 200,000 \| .127s \| .125s \| .334s \| 2,000,000 \| .789s \| .729s \| 3.176s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| The performance of `avg`, `sum` and other is very close when performing statistics, but the performance of `value_count` has always been poor, even not on an order of magnitude. Based on some common-sense knowledge, we think that `value_count` and sum are similar operations, and the time consumed should be the same. Therefore, we have discussed the agg of `value_count`. The principle of counting in es is to traverse the field of each document. If the field is an ordinary value, the count value is increased by 1. If it is an array type, the count value is increased by n. However, the problem lies in traversing each document and taking out the field, which changes from disk to an object in the Java language. We summarize its current problems with Elasticsearch as: - Number cast to string overhead, and GC problems caused by a large number of strings - After the number type is converted to string, sorting and other unnecessary operations are performed Here is the proof of type conversion overhead. ``` // Java long to string source code, getChars is very time-consuming. public static String toString(long i) { int size = stringSize(i); if (COMPACT_STRINGS) { byte[] buf = new byte[size]; getChars(i, size, buf); return new String(buf, LATIN1); } else { byte[] buf = new byte[size * 2]; StringUTF16.getChars(i, size, buf); return new String(buf, UTF16); } } ``` test type \| average \| min \| max \| sum ------------ \| ------- \| ---- \| ----------- \| ------- double->long \| 32.2ns \| 28ns \| 0.024ms \| 3.22s long->double \| 31.9ns \| 28ns \| 0.036ms \| 3.19s long->String \| 163.8ns \| 93ns \| 1921 ms \| 16.3s particularly serious. Our optimization code is actually very simple. It is to manage different types separately, instead of uniformly converting to string unified processing. We added type identification in ValueCountAggregator, and made special treatment for number and geopoint types to cancel their type conversion. Because the string type is reduced and the string constant is reduced, the improvement effect is very obvious. hits \| avg \| sum \| value_count \| value_count \| value_count \| value_count \| value_count \| value_count \| \| \| \| double \| double \| keyword \| keyword \| geo_point \| geo_point \| \| \| \| before \| after \| before \| after \| before \| after \| ----------- \| ------- \| ------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| ----------- \| 20,000 \| 38s \| .033s \| .063s \| .026s \| .030s \| .030s \| .038s \| .015s \| 200,000 \| 127s \| .125s \| .334s \| .078s \| .116s \| .099s \| .278s \| .031s \| 2,000,000 \| 789s \| .729s \| 3.176s \| .439s \| .348s \| .386s \| 3.365s \| .178s \| 20,000,000 \| 4.200s \| 3.239s \| 22.787s \| 2.700s \| 2.500s \| 2.600s \| 25.192s \| 1.278s \| 200,000,000 \| 21.000s \| 22.000s \| 154.917s \| 18.990s \| 19.000s \| 20.000s \| 168.971s \| 9.093s \| - The results are more in line with common sense. `value_count` is about the same as `avg`, `sum`, etc., or even lower than these. Previously, `value_count` was much larger than avg and sum, and it was not even an order of magnitude when the amount of data was large. - When calculating numeric types such as `double` and `long`, the performance is improved by about 8 to 9 times; when calculating the `geo_point` type, the performance is improved by 18 to 20 times.	2020-04-10 13:16:39 -04:00
Tim Brooks	98fba92022	Fail sniff process if no connections opened (#54934 ) Currently the remote cluster sniff connection process can succeed even if no connections are opened. This commit fixes this by failing the connection process if no connections are successfully opened.	2020-04-10 10:06:45 -06:00
Jim Ferenczi	d14ed34577	Explicitly test rewrite of date histogram's time zones on date_nanos (#54402 ) This commit adds an explicit test of time zone rewrite on date nanos field. Today this is working but we need tests to ensure that we don't break it unintentionally.	2020-04-10 17:37:59 +02:00
Igor Motov	da976d247f	Improve robustness of Query Result serializations (#54692 ) (#55028 ) Makes query result serialization more robust by propagating possible IOExceptions that can occur during shard level result serialization to the caller instead of throwing AssertionError that is not intercepted. Fixes #54665	2020-04-10 10:29:01 -04:00
Jason Tedor	9eeae59a83	Clarify available processors (#54907 ) The use of available processors, the terminology, and the settings around it have evolved over time. This commit cleans up some places in the codes and in the docs to adjust to the current terminology.	2020-04-10 08:48:27 -04:00
Przemko Robakowski	35c195b224	Prevent putting V2 index template when overlapping with existing template (#54933 ) (#55042 ) * Prevent putting V2 index template when overlapping with existing template This change prevents putting V2 index template when it would overlap with existing V2 template of the same priority Relates to #53101	2020-04-10 10:31:37 +02:00
Nik Everett	62d6bc31bf	Reduce memory for big aggs run against many shards (#54758 ) (#55024 ) This changes the behavior of aggregations when search is performed against enough shards to enable "batch reduce" mode. In this case we force always store aggregations in serialized form rather than a traditional java reference. This should shrink the memory usage of large aggregations at the cost of slightly slowing down aggregations where the coordinating node is also a data node. Because we're only doing this when there are many shards this is likely to be fairly rare. As a side effect this lets us add logs for the memory usage of the aggs buffer: ``` [2020-04-03T17:03:57,052][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [1320->448] max [1320] [2020-04-03T17:03:57,089][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [1328->448] max [1328] [2020-04-03T17:03:57,102][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [1328->448] max [1328] [2020-04-03T17:03:57,103][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [1328->448] max [1328] [2020-04-03T17:03:57,105][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs final reduction [888] max [1328] ``` These are useful, but you need to keep some things in mind before trusting them: 1. The buffers are oversized ala Lucene's ArrayUtils. This means that we are using more space than we need, but probably not much more. 2. Before they are merged the aggregations are inflated into their traditional Java objects which probably take up a lot more space than the serialized form. That is, after all, the reason why we store them in serialized form in the first place. And, just because I can, here is another example of the log: ``` [2020-04-03T17:06:18,731][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [147528->49176] max [147528] [2020-04-03T17:06:18,750][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [147528->49176] max [147528] [2020-04-03T17:06:18,809][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [147528->49176] max [147528] [2020-04-03T17:06:18,827][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs partial reduction [147528->49176] max [147528] [2020-04-03T17:06:18,829][TRACE][o.e.a.s.SearchPhaseController] [runTask-0] aggs final reduction [98352] max [147528] ``` I got that last one by building a ten shard index with a million docs in it and running a `sum` in three layers of `terms` aggregations, all on `long` fields, and with a `batched_reduce_size` of `3`.	2020-04-09 14:58:42 -04:00
Julie Tibshirani	850ea7c0be	Correct the name of the docvalues_fields object parser.	2020-04-09 11:36:28 -07:00
Nik Everett	83c328f125	Deprecate serializing PipelineAggregators (#54926 ) (#55025 ) `PipelineAggregator`s are only sent across the wire for backwards compatibility with 7.7.0. `PipelineAggregator` needs to continue to implement `NamedWriteable` for backwards compatibility but pipeline aggregations created after 7.7.0 need not implement any of the methods in that interface because we'll never attempt to call them. So this creates implementations in `PipelineAggregator` (the base class) that just throw exceptions.	2020-04-09 14:13:47 -04:00
Przemko Robakowski	adc6e880cf	Fix NPE in MetadataIndexTemplateService#findV2Template (#54945 ) (#55001 ) This commit fixes potential NPE when there's V2 template with `null` priority. This is done by using `null`-safe comparator.	2020-04-09 11:34:20 +02:00
Przemko Robakowski	afa3467957	[7.x] HLRC support for Index Templates V2 (#54838 ) (#54932 ) * HLRC support for Index Templates V2 (#54838) * HLRC support for Index Templates V2 This change adds High Level Rest Client support for Index Templates V2. Relates to #53101 * fixed compilation error Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-09 07:43:13 +02:00
Dan Hermann	c7f9a27d2d	Delete backing indices with data stream (#54693 ) (#54976 )	2020-04-08 15:18:12 -05:00
Lee Hinman	3b879b0821	[7.x] Use V2 templates when reading duplicate aliases and inge… (#54973 ) When a new index is rolled over, we check to see whether there are any duplicate alias configurations in the index template configuration. Additionally, when a new index is created from a bulk action, we check the templates to see if there are any ingest pipelines that need to be applied to the index that will be newly created. Both of these actions previously checked the v1 templates for their settings, they now also check the v2 index templates, with the v2 index templates taking precendence similar to the way they do when creating an index. Relates to #53101	2020-04-08 13:33:14 -06:00
Jay Modi	3600c9862f	Reintroduce system index APIs for Kibana (#54935 ) This change reintroduces the system index APIs for Kibana without the changes made for marking what system indices could be accessed using these APIs. In essence, this is a partial revert of #53912. The changes for marking what system indices should be allowed access will be handled in a separate change. The APIs introduced here are wrapped versions of the existing REST endpoints. A new setting is also introduced since the Kibana system indices' names are allowed to be changed by a user in case multiple instances of Kibana use the same instance of Elasticsearch. Relates #52385 Backport of #54858	2020-04-08 09:08:49 -06:00
Jason Tedor	6d29da05c3	Defer node environment construction (#54919 ) Today we construct the node environment relatively early in the node construction process, before we have even constructed the final environment, which means before the final settings are available. Rather, we should defer constructing the node environment until the final environment is available. This commit does that. This helps delay node environment construction until after the node roles are properly determined, which is important since the node environment does some checks on the basis of whether or not the node is neither a data nor a master node (such nodes should not have index metadata nor shard data on disk). Note that a consequence of this is that the initial log line that displays the node name, node ID, and cluster name does not appear until later in startup (after we have loaded plugins). This seems okay.	2020-04-08 09:23:19 -04:00
Ryan Ernst	37795d259a	Remove guava from transitive compile classpath (#54309 ) (#54695 ) Guava was removed from Elasticsearch many years ago, but remnants of it remain due to transitive dependencies. When a dependency pulls guava into the compile classpath, devs can inadvertently begin using methods from guava without realizing it. This commit moves guava to a runtime dependency in the modules that it is needed. Note that one special case is the html sanitizer in watcher. The third party dep uses guava in the PolicyFactory class signature. However, only calling a method on the PolicyFactory actually causes the class to be loaded, a reference alone does not trigger compilation to look at the class implementation. There we utilize a MethodHandle for invoking the relevant method at runtime, where guava will continue to exist.	2020-04-07 23:20:17 -07:00
Nhat Nguyen	65713743c2	Update translog policy before the next safe commit (#54839 ) IndexShardIT#testMaybeFlush relies on the assumption that the safe commit and translog deletion policy have advanced after IndexShard#sync returns . This assumption does not hold if there's a race with the global checkpoint sync. Closes #52223	2020-04-07 21:55:54 -04:00
Tal Levy	254d1e3543	[7.x] Create new `geo` module and migrate geo_shape registration (#53562 ) (#54924 ) This commit introduces a new `geo` module that is intended to be contain all the geo-spatial-specific features in server. As a first step, the responsibility of registering the geo_shape field mapper is moved to this module. Co-authored-by: Nicholas Knize <nknize@gmail.com>	2020-04-07 16:30:58 -07:00
Tim Brooks	619028c33e	Implement transport circuit breaking in aggregator (#54927 ) This commit moves the action name validation and circuit breaking into the InboundAggregator. This work is valuable because it lays the groundwork for incrementally circuit breaking as data is received. This PR includes the follow behavioral change: Handshakes contribute to circuit breaking, but cannot be broken. They currently do not contribute nor are they broken.	2020-04-07 17:10:31 -06:00
Julie Tibshirani	475b210eec	Improve guidance on removing default mappings. (#54915 ) In 7.x, an index template will fail to apply if it contains a `_default_` mapping. Several users have expressed confusion over the fact that loading the template doesn't show any default mappings. This docs change clarifies that in order to see all mappings in the template, you must pass `include_type_name`.	2020-04-07 15:18:13 -07:00
Tim Brooks	c7053ef824	Use TransportChannel in TransportHandshaker (#54921 ) Currently the TransportHandshaker has a specialized codepath for sending a response. In other work, we are going to start having handshakes contribute to circuit breaking (while not being breakable). This commit moves in that direction by allowing the handshaker to responding using a standard TcpTransportChannel similar to other requests.	2020-04-07 15:37:15 -06:00
Nik Everett	ce7ae4a7d1	Remove pipline aggs from agg result tree (backport of #54716 ) (#54920 ) This removes pipeline aggregators from the aggregation result tree except for a single field used for backwards compatibility with pre-7.8 versions of Elasticsearch. That field isn't populated unless we are serializing to pre-7.8 Elasticsearch. So, good news! We no longer build pipeline aggregators on the data node. Most of the time.	2020-04-07 17:22:23 -04:00
Tim Brooks	9cf2406cf1	Move network stats marking into InboundPipeline (#54908 ) This is a follow-up to #48263. It moves the inbound stats tracking inside of the InboundPipeline.	2020-04-07 13:34:05 -06:00
Nik Everett	1798d6722b	Allow terms agg to default to depth first (#54845 ) (#54885 ) If you didn't explictly set `global_ordinals` execution mode we were never collecting the information that we needed to select `depth_first` based on the request so we were always defaulting to `breadth_first`. This fixes it so we collect the information.	2020-04-07 14:11:34 -04:00
Armin Braun	37abc411dc	Remove Unused Snapshot Status Values (#54893 ) (#54906 ) * Remove Unused Snapshot Status Values This is a left-over from before #41940 when we used the same status enum for the shards and the snapshots overall. The two removed values were never used on the shard level so we can simply remove them here.	2020-04-07 19:16:13 +02:00
Lee Hinman	72f8457f52	[7.x] Only allow retrieving a single index or component templa… (#54896 ) * Only allow retrieving a single index or component template This changes the Index Template v2 APIs to only allow retrieving a single "named" entity, where the named entity can be nothing (return everything), a wildcard (return the ones that match), or the name of a template. Relates to #53101 * Throw exception when resource is not found Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-04-07 08:55:44 -06:00
Nik Everett	3c56e0de42	Fix scripted metric in ccs (backport of #54776 ) (#54888 ) `scripted_metric` did not work with cross cluster search because it assumed that you'd never perform a partial reduction, serialize the results, and then perform a final reduction. That serialized-after-partial-reduction step was broken. This is also required to support #54758.	2020-04-07 10:43:00 -04:00
Nik Everett	915092dc28	More pipeline aggregation cleanup (backport of #54298 ) (#54890 ) This replaces the last bit of validation that pipeline aggregations performed on the data nodes with explicit checks in a few `PipelineAggregationBuilders`. We were already catching these validation errors for pipeline aggregations that require that their parent be squentially ordered. This just adds validation for pipelines that require any parent like `bucket_selector` and `bucket_sort`.	2020-04-07 10:40:34 -04:00
Tanguy Leroux	4d36917e52	Merge feature/searchable-snapshots branch into 7.x (#54803 ) (#54825 ) This is a backport of #54803 for 7.x. This pull request cherry picks the squashed commit from #54803 with the additional commits: 6f50c92 which adjusts master code to 7.x a114549 to mute a failing ILM test (#54818) 48cbca1 and 50186b2 that cleans up and fixes the previous test aae12bb that adds a missing feature flag (#54861) 6f330e3 that adds missing serialization bits (#54864) bf72c02 that adjust the version in YAML tests a51955f that adds some plumbing for the transport client used in integration tests Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Yannick Welsch <yannick@welsch.lu> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> Co-authored-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-07 13:28:53 +02:00
Jason Tedor	f3a0018175	Update link to JDK 14 compiler bug This commit updates the link to the JDK 14 compiler bug that we have found. At the time that we committed the workaround, we had a submission ID, but not yet the public bug URL. This commit adds the public bug URL.	2020-04-07 06:26:14 -04:00
Nhat Nguyen	22be925f58	Add more assertion for testRecoverLocallyUpToGlobalCheckpoint Tracked at #54829	2020-04-06 19:42:26 -04:00
Przemko Robakowski	416267c038	Remove RestController from tests where it's not needed (#53782 ) (#54833 )	2020-04-06 21:12:05 +02:00
Przemko Robakowski	7b1bb9952a	[7.x] HLRC support for Component Templates APIs (#54635 ) (#54828 ) * HLRC support for Component Templates APIs (#54635)	2020-04-06 20:24:23 +02:00
Nhat Nguyen	2fdbed7797	Broadcast cancellation to only nodes have outstanding child tasks (#54312 ) Today when canceling a task we broadcast ban/unban requests to all nodes in the cluster. This strategy does not scale well for hierarchical cancellation. With this change, we will track outstanding child requests and broadcast the cancellation to only nodes that have outstanding child tasks. This change also prevents a parent task from sending child requests once it got canceled. Relates #50990 Supersedes #51157 Co-authored-by: Igor Motov <igor@motovs.org> Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-04-06 11:11:29 -04:00
David Turner	2b8a91b7be	Ensure correct no-master block applied on restart (#54800 ) This commit addresses a long-standing `// TODO` in the coordinator tests to ensure that the correct no-master block is applied when a node restarts while disconnected from the cluster. It also strengthens this test to check that the no-master block is applied correctly on all nodes, not just the previous master.	2020-04-06 13:25:51 +01:00
David Turner	63de8c0730	Reinstate commented-out CoordinatorTest (#54784 ) Test `testAckListenerReceivesNacksFromFollowerInHigherTerm` was suppressed as when it was written it didn't work due the lack of proper term bumping. We added term bumping but never got around to implementing this test. This commit addresses this.	2020-04-06 10:27:13 +01:00
Armin Braun	baff7bfa14	Rationalize some ThreadPool Use in Snapshot Transport Actions (#54772 ) (#54782 ) Removing a few spots where we clearly don't have to fork to the generic or management pool since either we only interpret the current cluster state or fork-off directly to some other pool in the transport action logic anyway.	2020-04-06 09:57:41 +02:00
Nhat Nguyen	4ecc7dcca5	Avoid StackOverflowError if write circular reference exception (#54147 ) We should never write a circular reference exception as we will fail a node with StackOverflowError. However, we have one in #53589. I tried but failed to find its location. With this commit, we will avoid StackOverflowError in production and detect circular exceptions in tests. Closes #53589	2020-04-04 13:42:27 -04:00
Jason Tedor	05c5529b2d	Clean up a few instances of "MetaData" We recently cleaned up the use of the word "metadata" across the codebase. A few additional uses have trickled in, likely from in-progress work. This commit cleans up these last few instances. Relates #54519	2020-04-04 10:55:09 -04:00
Lee Hinman	814c248819	[7.x] Use V2 index templates during index creation (#54669 ) (#54750 ) * Use V2 index templates during index creation This commit changes our index creation code to use (and favor!) V2 index templates during index creation. The creation precedence goes like so, in order of precedence: - Existing source `IndexMetadata` - for example, when recovering from a peer or a shrink/split/clone where index templates should not be applied - A matching V2 index template, if one is found - When a V2 template is found, all component templates (in the `composed_of` field) are applied in the order that they appear, with the index template having the 2nd highest precedence (the create index request always has the top priority when it comes to index settings) - All matching V1 templates (the old style) This also adds index template validation when `PUT`-ing a new v2 index template (because this was required) and ensures that all index and component templates specify no top-level mapping type (it is automatically added when the template is added to the cluster state). This does not yet implement fine-grained component template merging of mappings, where we favor merging only a single field's configuration, that will be done in subsequent work. This also keeps the existing hidden index behavior present for v1 templates, where a hidden index will match v2 index templates unless they are global (`*`) templates. Relates to #53101	2020-04-03 14:46:15 -06:00
James Baiera	548145f4a3	Cat tasks output should respect time display settings (#54536 ) (#54735 )	2020-04-03 15:34:44 -04:00
Dan Hermann	18fef3de2a	Get data stream accepts single search parameter	2020-04-03 10:36:26 -05:00
Christoph Büscher	8c9ac14a98	Rename field name constants in AbstractBuilderTestCase (#53234 ) Some field name constants were not updaten when we moved from "string" to "text" and "keyword" fields. Renaming them makes it easier and faster to know which field type is used in test subclassing this base test case.	2020-04-03 17:28:22 +02:00
Nik Everett	195345b09e	Fix InternalAutoDateHistogramTests (#54602 ) (#54687 ) The test had errors around time units that have different length - think leap years or months that aren't 30 days. This fixes those errors. In the proces I've changed a bunch of things to debug the problem: * Replace `currentTimeMillis` with a random time. Now the test fails randomly! Wonderful. Much better than on random days of the month. * Generate buckets "closer together" to test random reduction. Without this we were super frequently getting stuck in the "year of century" rounding because some of the of the buckets we built were far apart. This generates a much greater variety of tests. * Implement `toString` on `RoundingInfo` so I can debug without going crazy. * Switch keys in the bucket assertions from epoch millis to `Instant`s so we can read the failures. Closes #54540 Closes #39497	2020-04-03 08:22:08 -04:00
markharwood	2da2305587	Backport of lowercase normalizer PR #53882 A pre-configured normalizer for lower-casing. Closes #53872	2020-04-03 11:43:40 +01:00
Martijn van Groningen	ec0bbda52f	Changed itv2 and data streams feature flag naming (#54431 ) (#54500 ) from `*_flag_registered` to `#_feature_enabled`. This previous name indicated that a flag was registered, whilst the feature flag actually controls whether a feature is enabled.	2020-04-03 10:12:00 +02:00
Jason Tedor	f2590b9984	Workaround JDK 14 compiler bug (#54689 ) This commit workarounds a bug in the JDK 14 compiler. It is choking on a method reference, so we substitute a lambda expression instead. The JDK bug ID is 9064309.	2020-04-02 19:45:52 -04:00
Julie Tibshirani	5fb7602227	Disallow changing 'enabled' on the root mapper. (#54681 ) In #33933 we disallowed changing the `enabled` parameter in object mappings. However, the fix didn't cover the root object mapper. This PR adjusts the change to also include the root mapper and clarifies the error message.	2020-04-02 15:28:48 -07:00
Dan Hermann	39c4ec6821	[7.x] Create first backing index when creating data stream	2020-04-02 17:19:35 -05:00
Nik Everett	54ea4f4f50	Begin to drop pipeline aggs from the result tree (backport of #54311 ) (#54659 ) Removes pipeline aggregations from the aggregation result tree as they are no longer used. This stops us from building the pipeline aggregators at all on data nodes except for backwards compatibility serialization. This will save a tiny bit of space in the aggregation tree which is lovely, but the biggest benefit is that it is a step towards simplifying pipeline aggregators. This only does about half of the work to remove the pipeline aggs from the tree. Removing all of it would, well, double the size of the change and make it harder to review.	2020-04-02 16:45:12 -04:00
Nik Everett	cc6468a0cb	Fix BWC error on pipeline aggs (#54672 ) I derped out on a last minute bug fix when backporting #54282 and it only causes the tests to fail about half the time. So I didn't catch it until after merging. Great! This fixes it.	2020-04-02 14:51:30 -04:00
Zachary Tong	20d67720aa	Refactor Percentiles/Ranks aggregation builders and factories (#51887 ) (#54537 ) - Consolidates HDR/TDigest factories into a single factory - Consolidates most HDR/TDigest builder into an abstract builder - Deprecates method(), compression(), numSigFig() in favor of a new unified PercentileConfig object - Disallows setting algo options that don't apply to current algo The unified config method carries both the method and algo-specific setting. This provides a mechanism to reject settings that apply to the wrong algorithm. For BWC the old methods are retained but marked as deprecated, and can be removed in future versions. Co-authored-by: Mark Tozzi <mark.tozzi@gmail.com> Co-authored-by: Mark Tozzi <mark.tozzi@gmail.com>	2020-04-02 10:39:41 -04:00
Nik Everett	a5adac0d1e	Fix pipeline agg serialization for ccs (backport of #54282 ) (#54468 ) This fixes pipeline aggregations used in cross cluster search from an older version of Elasticsearch to a newer version of Elasticsearch. I broke this in #53730 when I was too aggressive in shutting off serialization of pipeline aggs. In particular, this comes up when the coordinating node is pre-7.8.0 and the gateway node is on or after 7.8.0. The fix is another step down the line to remove pipeline aggregators from the aggregation tree. Sort of. It create a new `List<PipelineAggregator>` member in `InternalAggregation` but it is only used for bwc serialization and it is fed by the mechanism established in #53730 to read the pipelines from the	2020-04-02 10:35:40 -04:00
Nik Everett	b4feda84e8	Add scroll info to search task description (backport of #54606 ) (#54612 ) Right now you can't tell from the task description whether or not the search is a scroll. This adds that information to the description which is super useful if you are trying to debug a cluster that is running out of scroll contexts.	2020-04-02 09:04:49 -04:00
Jason Tedor	18b602280c	Add validation to the usage service (#54617 ) Today the usage service can let in some issues, such as handlers that do not have a name, where the errors do not manifest until later (calling the usage API), or conflicting handlers with the same name. This commit addresses this by adding some validation to the usage service.	2020-04-02 08:56:28 -04:00
Andy Bristol	eb14635f1f	add tests to StatsAggregatorTests (#53768 ) Adds tests for supported ValuesSourceTypes, unmapped fields, scripting, and the missing param. The tests for unmapped fields and scripting are migrated from the StatsIT integration test	2020-04-01 17:07:51 -07:00
Andy Bristol	c87b830d06	migrate tests from MissingIT to agg tests (#53448 ) Move the remaining tests for the missing aggregation into its AggregatorTestCase out of its integration test and remove the IT	2020-04-01 17:05:44 -07:00
Andy Bristol	ec76e7306e	supported field type tests for max agg (#53701 ) Adds test hooks for testing supported ValuesSource types for the max aggregation	2020-04-01 15:24:53 -07:00
Andy Bristol	5d0351ea00	add tests to SumAggregatorTests (#53568 ) This adds tests for supported ValuesSourceTypes, unmapped fields, scripting, and the missing param. The tests for unmapped fields and scripting are migrated from the SumIT integration test	2020-04-01 15:24:21 -07:00
Andy Bristol	62a52465fc	aggregator and yaml tests for missing agg (#53214 ) Tests for unmapped fields, the missing parameter, scripting, and correct ValuesSource types in MissingAggregatorTests. Basic yaml tests for the missing agg For #42949	2020-04-01 15:23:08 -07:00
William Brafford	958e9d1b78	Refactor nodes stats request builders to match requests (#54363 ) (#54604 ) * Refactor nodes stats request builders to match requests (#54363) * Remove hard-coded setters from NodesInfoRequestBuilder * Remove hard-coded setters from NodesStatsRequest * Use static imports to reduce clutter * Remove uses of old info APIs	2020-04-01 17:03:04 -04:00
Gordon Brown	f0cb8a56a9	Handle -1 gc_threshold settings explicitly (#54546 ) Because -1 is technically a valid TimeValue (as a sentinel value), that is now explicitly checked for when validating gc_thresholds. The tests are also adjusted to test this case separately from other negative values.	2020-04-01 13:56:50 -06:00
Mayya Sharipova	bf4857d9e0	Search hit refactoring (#41656 ) (#54584 ) Refactor SearchHit to have separate document and meta fields. This is a part of bigger refactoring of issue #24422 to remove dependency on MapperService to check if a field is metafield. Relates to PR: #38373 Relates to issue #24422 Co-authored-by: sandmannn <bohdanpukalskyi@gmail.com>	2020-04-01 15:19:00 -04:00
jimczi	7787603d56	Add 7.6.3 version	2020-04-01 16:23:28 +02:00
David Turner	6d976e1468	Resolve some coordination-layer TODOs (#54511 ) This commit removes a handful of TODO comments in the cluster coordination layer that no longer apply. Relates #32006	2020-04-01 12:36:18 +01:00
David Turner	5e3b6ab82b	Use VotingConfiguration#of where possible (#54507 ) This resolves a longstanding TODO in the cluster coordination subsystem. Relates #32006	2020-04-01 09:30:42 +01:00
Nhat Nguyen	c2506af8a6	Enable engine debug log for testMaybeFlush Relates #52223	2020-03-31 23:40:14 -04:00
Jason Tedor	63e5f2b765	Rename META_DATA to METADATA This is a follow up to a previous commit that renamed MetaData to Metadata in all of the places. In that commit in master, we renamed META_DATA to METADATA, but lost this on the backport. This commit addresses that.	2020-03-31 17:30:51 -04:00
Jason Tedor	5fcda57b37	Rename MetaData to Metadata in all of the places (#54519 ) This is a simple naming change PR, to fix the fact that "metadata" is a single English word, and for too long we have not followed general naming conventions for it. We are also not consistent about it, for example, METADATA instead of META_DATA if we were trying to be consistent with MetaData (although METADATA is correct when considered in the context of "metadata"). This was a simple find and replace across the code base, only taking a few minutes to fix this naming issue forever.	2020-03-31 17:24:38 -04:00
Zachary Tong	c9db2de41d	[7.x] Comprehensively test supported/unsupported field type:agg combinations (#54451 ) * Comprehensively test supported/unsupported field type:agg combinations (#52493) This adds a test to AggregatorTestCase that allows us to programmatically verify that an aggregator supports or does not support a particular field type. It fetches the list of registered field type parsers, creates a MappedFieldType from the parser and then attempts to run a basic agg against the field. A supplied list of supported VSTypes are then compared against the output (success or exception) and suceeds or fails the test accordingly. Co-Authored-By: Mark Tozzi <mark.tozzi@gmail.com> * Skip fields that are not aggregatable * Use newIndexSearcher() to avoid incompatible readers (#52723) Lucene's `newSearcher()` can generate readers like ParallelCompositeReader which we can't use. We need to instead use our helper `newIndexSearcher`	2020-03-31 14:35:03 -04:00
Jake Landis	9b1fe93363	[7.x] introduce 6.8.9 as a version (#53817 )	2020-03-31 13:03:28 -05:00
Armin Braun	c38e125425	Remove Redundant Documentation on SnapshotsService (#54482 ) (#54505 ) The docs here add nothing compared to those in the package. If anything they are somewhat confusing since they don't give all necessary details to understand the snapshot process. => remove them and link to the complete docs at the package level	2020-03-31 17:07:48 +02:00
Yannick Welsch	597dfa8481	Avoid holding onto bulk items until all completed (#54407 ) Bulk requests currently keep a reference to all bulk item requests until every one of them has completed. There is no need to do so, however, and, in case of large bulks, can mean unnecessary holding onto memory that might be better used elsewhere. More so as different shard-level bulks can complete at different speeds, and one slow shard-level request should not require holding onto every other shard-level request.	2020-03-31 16:19:07 +02:00
Dan Hermann	2ede8662e1	Bump multi-release JARs to Java 11	2020-03-31 06:48:46 -05:00
Tim Brooks	915435bbe4	Fix issue with pipeline releasing bytes early (#54474 ) Currently there is an issue with the InboundPipeline releasing bytes earlier than appropriate. This can lead to the bytes being reused before the message is handled. This commit fixes that issue and adds a test to detect when it is occurring.	2020-03-30 22:39:15 -06:00
Lee Hinman	a3d1945254	[7.x] Add warnings/errors when V2 templates would match same i… (#54449 ) * Add warnings/errors when V2 templates would match same indices… (#54367) * Add warnings/errors when V2 templates would match same indices as V1 With the introduction of V2 index templates, we want to warn users that templates they put in place might not take precedence (because v2 templates are going to "win"). This adds this validation at `PUT` time for both V1 and V2 templates with the following rules: ** When creating or updating a V2 template - If the v2 template would match indices for an existing v1 template or templates, provide a warning (through the deprecation logging so it shows up to the client) as well as logging the warning The v2 warning looks like: ``` index template [my-v2-template] has index patterns [foo-] matching patterns from existing older templates [old-v1-template,match-all-template] with patterns (old-v1-template => [foo],match-all-template => []); this template [my-v2-template] will take precedence during new index creation ``` * When creating a V1 template - If the v1 template is for index patterns of `""` and a v2 template exists, warn that the v2 template may take precedence - If the v1 template is for index patterns other than all indices, and a v2 template exists that would match, throw an error preventing creation of the v1 template * When updating a V1 template (without changing its existing `index_patterns`!) - If the v1 template is for index patterns that would match an existing v2 template, warn that the v2 template may take precedence. The v1 warning looks like: ``` template [my-v1-template] has index patterns [] matching patterns from existing index templates [existing-v2-template] with patterns (existing-v2-template => [foo]); this template [my-v1-template] may be ignored in favor of an index template at index creation time ``` And the v1 error looks like: ``` template [my-v1-template] has index patterns [foo] matching patterns from existing index templates [existing-v2-template] with patterns (existing-v2-template => [f]), use index templates (/_index_template) instead ``` Relates to #53101 * Remove v2 index and component templates when cleaning up tests * Finish half-finished comment sentence * Guard template removal and ignore for earlier versions of ES Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> * Also ignore 500 errors when clearing index template v2 templates Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-30 13:25:50 -06:00
Mark Tozzi	529622d4f4	Unit tests for Range and DateRange aggs (#52380 ) (#54455 )	2020-03-30 15:07:43 -04:00
Mark Tozzi	10e0e59561	Tests for agg missing values (#51068 ) (#54452 )	2020-03-30 15:05:38 -04:00
Ryan Ernst	3a24fe9d37	Move keystore-cli to its own tools project (#40787 ) (#54294 ) This commit moves the keystore cli into its own project, so that the test dependencies can be isolated from the rest of server.	2020-03-30 11:20:07 -07:00
Nik Everett	56047f74be	Fix auto_date_histogram serialization bug (#54447 ) This fixes a serialization bug in `auto_date_histogram` that comes up in a cluster mixed between pre-7.3.0 and post-7.3.0. Includes #54429 to keep 7.x looking like master for simpler backports. Closes #54382	2020-03-30 13:49:38 -04:00
Nik Everett	e58ad9fed3	Clean up how pipeline aggs check for multi-bucket (backport of #54161 ) (#54379 ) Pipeline aggregations like `stats_bucket`, `sum_bucket`, and `percentiles_bucket` only operate on buckets that have multiple buckets. This adds support for those aggregations to `geo_distance`, `ip_range`, `auto_date_histogram`, and `rare_terms`. This all happened because we used a marker interface to mark compatible aggs, `MultiBucketAggregationBuilder` and it was fairly easy to forget to implement the interface. This replaces the marker interface with an abstract method in `AggregationBuilder`, `bucketCardinality` which makes you return `NONE`, `ONE`, or `MANY`. The `bucket` aggregations can check for `MANY`. At this point `ONE` and `NONE` amount to about the same thing, but I suspect that'll be a useful distinction when validating bucket sorts. Closes #53215	2020-03-30 10:44:55 -04:00
Andrei Dan	d5320d9d29	Read the index.number_of_replicas from template so that wait_for_active_shards is interpreted correctly (#54231 ) (#54413 ) This commit takes into account the index.number_of_replicas (defaults to 0 - no replicas- ) value when setting an index template. This change enables the index.wait_for_active_shards value to be interpreted correctly (cherry picked from commit 07026ac3d56dc9fae69467adfda7eaed7ea3ca00) Signed-off-by: Andrei Dan <andrei.dan@elastic.co> Co-authored-by: tninokehoe <62655306+tninokehoe@users.noreply.github.com>	2020-03-30 14:34:49 +01:00
Armin Braun	9392fca36a	Improve Snapshot Abort Behavior (#54256 ) (#54410 ) This commit improves the behavior of aborting snapshots and by that fixes some extremely rare test failures. Improvements: 1. When aborting a snapshot while it is in the `INIT` stage we do not need to ever delete anything from the repository because nothing is written to the repo during INIT any more (in the past running deletes for these snapshots made sense because we were writing `snap-` and `meta-` blobs during the `INIT` step). 2. Do not try to finalize snapshots that never moved past `INIT`. Same reason as with the first step. If we never moved past `INIT` no data was written to the repo so no need to now write a useless entry for the aborted snapshot to `index-N`. This is especially true, since the reason the snapshot was aborted during `INIT` was a delete call so the useless empty snapshot just added to `index-N` would be removed by the subsequent delete that is still waiting anyway. 3. if after aborting a snapshot we wait for it to finish we should not try deleting it if it failed. If the snapshot failed it means it did not become part of the most recent `RepositoryData` so a delete for it will needlessly fail with a confusing message about that snapshot being missing or concurrent repository modification. I moved to throw the snapshot missing exception here because that seems the most user friendly. This allows the user to simply ignore `404` returns from the delete API when using it to make sure a snapshot is aborted+deleted. Marking this as a non-issue since it doesn't have any negative repercussions other than confusing exceptions on some snapshot aborts. Closes #52843	2020-03-30 15:08:18 +02:00
Jim Ferenczi	12cfdc24b0	Fixed rewrite of time zone without DST (#54398 ) We try to rewrite time zones to fixed offsets in the date histogram aggregation if the data in the shard is within a single transition. However this optimization is not applied on time zones that don't apply daylight saving changes but had some random transitions in the past (e.g. Australia/Brisbane or Asia/Katmandu). This changes fixes the rewrite of such time zones to fixed offsets.	2020-03-30 13:18:57 +02:00
Martijn van Groningen	4b4fbc160d	Refactor AliasOrIndex abstraction. (#54394 ) Backport of #53982 In order to prepare the `AliasOrIndex` abstraction for the introduction of data streams, the abstraction needs to be made more flexible, because currently it really can be only an alias or an index. * Renamed `AliasOrIndex` to `IndexAbstraction`. * Introduced a `IndexAbstraction.Type` enum to indicate what a `IndexAbstraction` instance is. * Replaced the `isAlias()` method that returns a boolean with the `getType()` method that returns the new Type enum. * Moved `getWriteIndex()` up from the `IndexAbstraction.Alias` to the `IndexAbstraction` interface. * Moved `getAliasName()` up from the `IndexAbstraction.Alias` to the `IndexAbstraction` interface and renamed it to `getName()`. * Removed unnecessary casting to `IndexAbstraction.Alias` by just checking the `getType()` method. Relates to #53100	2020-03-30 10:12:16 +02:00
Nhat Nguyen	6e025c12f0	Add debug logging for testRunningTasksCount Relates #53594	2020-03-29 18:34:41 -04:00
Jason Tedor	f0033783db	Deprecate node local storage setting (#54374 ) This setting is not documented and has dubious value since it means there can be nodes in the cluster (non-data and non-master nodes) that do not have persistent node IDs. This does not have any use cases so this commit removes the setting.	2020-03-28 14:36:41 -04:00
Jason Tedor	60437b474d	Fix line-length violation in DiscoveryNodeRole This commit fixes a line-length checkstyle violation in DiscoveryNodeRole.java.	2020-03-28 13:06:20 -04:00
Jason Tedor	03cab96b2d	Fix imports in discovery node classes This commit fixes some imports that were leftover after resolving some merge conflicts on a backport.	2020-03-28 12:56:22 -04:00
Jason Tedor	c3be3206ce	Decouple environment from DiscoveryNode (#54373 ) Today Environment is coupled to DiscoveryNode via the node.local_storage setting. This commit decouples Environment from this setting.	2020-03-28 12:52:47 -04:00
Jason Tedor	37b59a357f	Ensure that the output of node roles are sorted (#54376 ) This commit ensures that node roles are sorted by node role name, which makes the output easier to consume, and also makes it easier to rely on the behavior of the output in assertions.	2020-03-28 12:51:21 -04:00
Tim Brooks	2ccddbfa88	Move transport decoding and aggregation to server (#54360 ) Currently all of our transport protocol decoding and aggregation occurs in the individual transport modules. This means that each implementation (test, netty, nio) must implement this logic. Additionally, it means that the entire message has been read from the network before the server package receives it. This commit creates a pipeline in server which can be passed arbitrary bytes to handle. Internally, the pipeline will decode, decompress, and aggregate the messages. Additionally, this allows us to run many megabytes of bytes through the pipeline in tests to ensure that the logic works. This work will enable future work: Circuit breaking or backoff logic based on message type and byte in the content aggregator. Sharing bytes with the application layer using the ref counted releasable network bytes. Improved network monitoring based specifically on channels. Finally, this fixes the bug where we do not circuit break on the correct message size when compression is enabled.	2020-03-27 14:13:10 -06:00
Stuart Tettemer	1630de4a42	Scripting: stats per context in nodes stats (#54008 ) (#54357 ) Adds script cache stats to `_node/stats`. If using the general cache: ``` "script_cache": { "sum": { "compilations": 12, "cache_evictions": 9, "compilation_limit_triggered": 5 } } ``` If using context caches: ``` "script_cache": { "sum": { "compilations": 13, "cache_evictions": 9, "compilation_limit_triggered": 5 }, "contexts": [ { "context": "aggregation_selector", "compilations": 8, "cache_evictions": 6, "compilation_limit_triggered": 3 }, { "context": "aggs", "compilations": 5, "cache_evictions": 3, "compilation_limit_triggered": 2 }, ``` Backport of: 32f46f2 Refs: #50152	2020-03-27 12:26:00 -06:00
Tim Brooks	f5b4020819	Remove netty BytesReference implementations (#54355 ) Elasticsearch has a number of different BytesReference implementations. These implementations can all implement the interface in different ways with subtly different behavior and performance characteristics. On the other-hand, the JVM only represents bytes as an array or a direct byte buffer. This commit deletes the specialized Netty implementations and moves to using a generic ByteBuffer reference type. This will allow us to focus on standardizing performance and behave around a smaller number of implementations that can be used by all components in Elasticsearch.	2020-03-27 11:01:33 -06:00
Lee Hinman	f2cc2b1127	[7.x] Add REST APIs for IndexTemplateV2Metadata CRUD (#54039 ) (#54347 ) * Add REST APIs for IndexTemplateV2Metadata CRUD (#54039) * Add REST APIs for IndexTemplateV2Metadata CRUD This commit adds the get/put/delete APIs for interacting with the now v2 versions of index templates. These APIs are behind the existing `es.itv2_feature_flag_registered` system property feature flag. Relates to #53101 * Add exceptions for HLRC tests * Add skips for 7.x versions * Use index_template instead of template_v2 in action names * Add test for MetaDataIndexTemplateService.addIndexTemplateV2 * Move removal to static method and add test * Add unit tests for request classes (implement hashCode & equals) Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> * Fix compilation Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-27 10:47:22 -06:00
Dan Hermann	1690e78646	Validation for data stream creation	2020-03-27 10:07:46 -05:00
Alan Woodward	461f1307d6	Add XContentHelper.childBytes() method (#54287 ) We have a number of places where we want to read a fairly complex object from XContent, but aren't interested in its contents; for example, mappings are often serialized and deserialized between several objects before they are actually built into a MappingMetaData object. This means that potentially large maps of maps are constructed several times, only to immediately be re-serialized again. This commit adds a new helper method to XContentHelper that reads the children of an xcontent object directly to a BytesReference, serialized via the same xcontenttype as the parent parser, avoiding the construction of intermediary maps or lists.	2020-03-27 14:21:56 +00:00
Armin Braun	14b5daad7c	Fix Snapshot Completion Listener Lost on Master Failover (#54286 ) (#54330 ) * Fix Snapshot Completion Listener Lost on Master Failover If master fails over before (or we run into any other exception) when removing the snapshot from the CS we must still resolve all the completion listeners for the snapshot.	2020-03-27 14:11:13 +01:00
Gordon Brown	0d30b48613	Disallow negative TimeValues (#53913 ) This commit causes negative TimeValues, other than -1 which is sometimes used as a sentinel value, to be rejected during parsing. Also introduces a hack to allow ILM to load policies which were written to the cluster state with a negative min_age, treating those values as 0, which should match the behavior of prior versions.	2020-03-26 13:30:35 -06:00
William Brafford	14204f8381	Use set-based interface for NodesStatsRequest (#53637 ) (#54141 ) The NodesStatsRequest class uses a set of strings for its internal serialization. This commit updates the class's interface so that we no longer use hard-coded getters and setters, but rather methods that add strings directly. For example, the old way of adding "os" metrics to a request would be to call request.os(true). The new way of doing this is to call request.addMetric("os"). For the time being, the canonical list of metrics is an enum in NodesStatsRequest. This will eventually be replaced with something pluggable.	2020-03-26 14:41:49 -04:00
Christoph Büscher	da404bbce2	HLRC: Don't send defaults for SubmitAsyncSearchRequest (#54200 ) (#54266 ) Currently we set the defaults for ccsMinimizeRoundtrips, preFilterShardSize and requestCache on the HLRC SubmitAsyncSearchRequest in the constructor. This is no longer needed since we now only send the parameters along with the rest request that are supported (omitting e.g. ccsMinimizeRoundtrips) and the correct defaults are set on the client side. This change removes setting and sending these defaults where possible, leaving only the overwrite of batchedReduceSize with a default value of 5, since the default used in the vanilla SearchRequest is 512. However, we don't need to send this value along as a request parameter if its the default since the correct one will be set on the receiving end if no value is specified. Also adding tests for RestSubmitAsyncSearchAction that check the correct defaults are set when parameters are missing on the server side. Backport of #54200	2020-03-26 19:01:17 +01:00
Nik Everett	b6a8de0d89	Fix compilation in Eclipse (backport of #54275 ) (#54284 ) These mock calls cause Eclipse to think that `Exception` can be thrown because `CheckedFunction`'s lower bound is `Exception`. This makes Eclipse happy.	2020-03-26 12:53:13 -04:00
Lee Hinman	3e1fc8a5c9	[7.x] #32478 fixed cluster stats return 8EB (#32480 ) (32972311) (#54273 ) Co-authored-by: Darby.Han <darby.han@navercorp.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: 한우람 <hgword@gmail.com> Co-authored-by: Darby.Han <darby.han@navercorp.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-26 09:12:09 -06:00
Henning Andersen	5bfaa20dd4	Rollover: refactor out cluster state update (#53965 ) (#54269 ) Make it possible to reuse the cluster state update of rollover for simulation purposes by extracting it. Also now run the full rollover in the pre-rollover phase and the actual rollover phase, allowing a dedicated exception in case of concurrent rollovers as well as a more thorough pre-check.	2020-03-26 16:06:13 +01:00
Yannick Welsch	8176f1c7f8	Delegate toXContent logic from ClusterState to its member classes (#54192 ) Backport of #52743 Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2020-03-26 15:08:30 +01:00
Alan Woodward	048797389a	Explicitly set mappings in SearchAfterIT (#54262 ) We have some very occasional failures in SearchAfterIT, where a search throws an exception because a shard does not have the mapping for the requested sort field. The field should have been added in a dynamic mapping update after an index event, but it seems that there can sometimes be a small delay in propagating this update to the shards. This commit changes the test to explicitly define the relevant field at index creation time. Fixes #51900	2020-03-26 13:05:19 +00:00
Jason Tedor	d8f745736b	Clarify the remove keystore command can handle many (#54244 ) The remove keystore command can handle multiple settings. In a few places, we were not consistent about mentioning this. This commit addreses this, in the CLI help, and the docs.	2020-03-26 08:49:43 -04:00
William Brafford	b11960e3e6	Create set-based interface for NodesInfoRequest (#53410 ) (#54223 ) This commit begins the work of removing the "hard-coded" metric getters and setters from the NodesInfoRequest classes. We start by providing new flexible getters and setters. We then update the test classes to remove the old getters, and then remove those getters.	2020-03-26 07:28:09 -04:00
Yannick Welsch	1ba6783780	Schedule commands in current thread context (#54187 ) Changes ThreadPool's schedule method to run the schedule task in the context of the thread that scheduled the task. This is the more sensible default for this method, and eliminates a range of bugs where the current thread context is mistakenly dropped. Closes #17143	2020-03-26 10:07:59 +01:00
Jason Tedor	6af89e62d1	Allow keystore add-file to handle multiple settings (#54240 ) Today the keystore add-file command can only handle adding a single setting/file pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add-file keystore command to accept adding multiple settings in a single invocation.	2020-03-26 00:07:05 -04:00
Jason Tedor	fe8257d981	Allow keystore add to handle multiple settings (#54229 ) Today the keystore add command can only handle adding a single setting/value pair in a single invocation. This incurs the startup costs of the JVM many times, which in some environments can be expensive. This commit teaches the add keystore command to accept adding multiple settings in a single invocation.	2020-03-25 22:58:20 -04:00
Nik Everett	8f40f1435a	Save a little space in agg tree (backport of #53730 ) (#54213 ) This drop the "top level" pipeline aggregators from the aggregation result tree which should save a little memory and a few serialization bytes. Perhaps more imporantly, this provides a mechanism by which we can remove all pipelines from the aggregation result tree. This will save quite a bit of space when pipelines are deep in the tree. Sadly, doing this isn't simple because of backwards compatibility. Nodes before 7.7.0 need those pipelines. We provide them by setting passing a `Supplier<PipelineTree>` into the root of the aggregation tree that we only call if we need to serialize to a version before 7.7.0. This solution works for cross cluster search because we always reduce the aggregations in each remote cluster and then forward them back to the coordinating node. Its quite possible that the coordinating node needs the pipeline (say it is version 7.1.0) and the gateway node in the remote cluster doesn't (version 7.7.0). In that case the data nodes won't send the pipeline aggregations back to the gateway node. Critically, the gateway node will send the pipeline aggregations back to the coordinating node. This is all managed with that `Supplier<PipelineTree>`, but how it is managed is a bit tricky.	2020-03-25 15:51:16 -04:00
Martijn van Groningen	66861a82a1	Data stream should refer to the backing indices using the Index class (#54199 ) Backport of #54189	2020-03-25 20:18:15 +01:00
Bogdan Pintea	77da9dd040	Add version 7.8.0 Add version 7.8.0	2020-03-25 18:10:30 +01:00
jimczi	04fabead14	Fix small typo in SearchService#executeQueryPhase Relates #54044	2020-03-25 16:16:30 +01:00
Armin Braun	32fa90c9ba	Fix ClusterHealthIT.testHealthOnMasterFailover (#54170 ) (#54177 ) We can run into a state where there's no more events to wait for temporarily but the cluster still isn't green. I added the wait for green flag to the request so the assertion for green cluster health below doesn't fail. Closes #53457	2020-03-25 14:33:31 +01:00
Armin Braun	0a70250201	Fix BlobStoreIncrementalityIT Assertion (#54149 ) (#54155 ) We are using this assertion for identical shard snapshots for situations where `snapshot1` wasn't the first snapshot for the tested shard. Hence, we can't assume that it will not share any files with previous snapshots. This showed up in failing tests when `snapshot1` was equivalent to a previous snapshot because no documents were deleted from the repo randomly in the failing test but even if documents are deleted there is no guarantee that no files will be shared. => I removed this assertion since its immaterial for what is tested here anyway. Closes #54034	2020-03-25 12:13:52 +01:00
jimczi	e380a5a8c3	Fix off-by one error in TransportSearchActionTests Closes #54156	2020-03-25 11:41:02 +01:00
Tanguy Leroux	b6e482295b	Mute TransportSearchActionTests.testShouldPreFilterSearchShards (#54158 ) Relates #53873 Relates #54156	2020-03-25 11:25:06 +01:00
Jim Ferenczi	3b4751bdb7	Avoid I/O operations when rewriting shard search request (#54044 ) (#54139 ) This commit ensures that we rewrite the shard request with a short-lived can_match searcher. This is required for frozen indices since the high level rewrite is now performed on a network thread where we don't want to perform I/O. Closes #53985	2020-03-25 09:02:36 +01:00
Jason Tedor	381d7586e4	Introduce formal role for remote cluster client (#54138 ) This commit introduce a formal role for identifying nodes that are capable of making connections to remote clusters. Relates #53924	2020-03-24 21:59:43 -04:00
Henning Andersen	7ce7aff66e	Reindex negative TimeValue fix (#54057 ) Reindex would use timeValueNanos(System.nanoTime()). The intended use for TimeValue is as a duration, not as absolute time. In particular, this could result in negative TimeValue's, being unsupported in #53913. Modified to use the bare long nano-second value.	2020-03-24 22:29:09 +01:00
Przemko Robakowski	fc498f625a	[7.x] Add validation for component templates (#54023 ) (#54118 ) * Add validation for component templates (#54023) * Add validation for component templates This change adds validation to make sure that settings and mappings are correct in component template. It's done the same way as in index templates - code is reused. Reletes to #53101 * Fix checkstyle violation * Update server/src/main/java/org/elasticsearch/cluster/metadata/MetaDataIndexTemplateService.java Co-Authored-By: Lee Hinman <dakrone@users.noreply.github.com> * Update server/src/test/java/org/elasticsearch/cluster/metadata/MetaDataIndexTemplateServiceTests.java Co-Authored-By: Lee Hinman <dakrone@users.noreply.github.com> * Update server/src/test/java/org/elasticsearch/cluster/metadata/MetaDataIndexTemplateServiceTests.java Co-Authored-By: Lee Hinman <dakrone@users.noreply.github.com> * Update server/src/test/java/org/elasticsearch/cluster/metadata/MetaDataIndexTemplateServiceTests.java Co-Authored-By: Lee Hinman <dakrone@users.noreply.github.com> * Update server/src/main/java/org/elasticsearch/cluster/metadata/MetaDataIndexTemplateService.java Co-Authored-By: Lee Hinman <dakrone@users.noreply.github.com> Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com> * Adjusted to 7.7 * unused import fixed * npe fixeD * change exception type Co-authored-by: Lee Hinman <dakrone@users.noreply.github.com>	2020-03-24 22:20:34 +01:00
Jim Ferenczi	025857d949	Fix ShardSearchRequest cache key (#54071 ) This commit ensures that we don't use the non-deterministic canReturnNullResponseIfMatchNoDocs boolean in the cache key of the ShardSearchRequest. The value of this boolean has no influence on the cacheability of the request. Closes #32827	2020-03-24 22:15:52 +01:00
Przemko Robakowski	5594d57727	/_cat/shards support path stats (#53461 ) (#54119 ) * _cat/shards support path stats * fix some style case * fix some style case * fix rest-api-spec cat.shards error * fix rest-api-spec cat.shards bwc error Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com> Co-authored-by: weizijun <weizijun1989@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-03-24 20:46:05 +01:00
Tim Brooks	b21b7fb09b	Allow proxy mode server name to be updated (#54107 ) Currently there is a bug where the proxy strategy will not be rebuilt if the server_name is dynamically updated. This commit fixes this issue.	2020-03-24 11:54:23 -06:00
David Turner	21afc788f8	Reduce log level for pipeline failure (#54097 ) Today we log `failed to execute pipeline for a bulk request` at `ERROR` level if an attempt to run an ingest pipeline fails. A failure here is commonly due to an `EsRejectedExecutionException`. We also feed such failures back to the client and record the rejection in the threadpool statistics. In line with #51459 there is no need to log failures within actions so noisily and with such urgency. It is better to leave it up to the client to react accordingly. Typically an `EsRejectedExecutionException` should result in the client backing off and retrying, so a failure here is not normally fatal enough to justify an `ERROR` log at all. This commit reduces the log level for this message to `DEBUG`.	2020-03-24 17:41:35 +00:00
markharwood	6a60f85bba	Wildcard field - add normalizer support (#53851 ) (#54109 ) Backport support for normalisation to wildcard field Closes #53603	2020-03-24 17:37:47 +00:00
Yannick Welsch	e006d1f6cf	Use special XContent registry for node tool (#54050 ) Fixes an issue where the elasticsearch-node command-line tools would not work correctly because PersistentTasksCustomMetaData contains named XContent from plugins. This PR makes it so that the parsing for all custom metadata is skipped, even if the core system would know how to handle it. Closes #53549	2020-03-24 17:40:51 +01:00
Nik Everett	42be39177b	Remove ceremony declaring aggs (backport of #53990 ) (#54099 ) This removes some more ceremony when declaring agg parsers. You no longer need a static `parse` method, instead you can just make the `PARSER` public in most cases. There are still a few aggs with the `parse` method, but those `parse` methods are a little more complex to untangle.	2020-03-24 12:29:52 -04:00
Tim Brooks	caefa78513	Align remote info api with new settings (#54102 ) Currently the remote info api has added a number of possible fields (proxy, num_socket_connections, etc) that are available in proxy mode. These fields are not aligned with what the settings are named. This commit modifies this API to align with the settings.	2020-03-24 10:27:24 -06:00
Christoph Büscher	1c1730facd	Mask wildcard query special characters on keyword queries (#53127 ) (#53512 ) Wildcard queries on keyword fields get normalized, however this normalization step should exclude the two special characters * and ? in order to keep the wildcard query itself intact. Closes #46300	2020-03-24 17:22:29 +01:00
Alan Woodward	39d7d0dc10	Upgrade to lucene 8.5.0 release (#54077 ) Upgrades our lucene dependency to the released 8.5.0 version.	2020-03-24 13:45:50 +00:00
Dan Hermann	30105a5ab5	[7.x] Cluster state and CRUD operations for data streams (#54073 )	2020-03-24 07:58:52 -05:00
Armin Braun	4e462db2ed	Fix BlobStoreIncrementalityIT (#54055 ) (#54060 ) The snapshot stats response list of snapshot statuses is not ordered according to the given list of snapshot names so randomly we could mix up snapshot1 and snapshot2 when asserting on the stats. Fixed by getting each snapshot's stats individually. Closes #54034	2020-03-24 11:46:40 +01:00
muachilin	b33fbe7026	Deprecate alternatives to the hot threads API (#52930 ) This commit deprecates various undocumented alternatives to the hot threads API.	2020-03-23 23:24:40 -04:00
Jim Ferenczi	9e3f7f4575	Add heuristics to compute pre_filter_shard_size when unspecified (#53873 ) (#54007 ) This commit changes the pre_filter_shard_size default from 128 to unspecified. This allows to apply heuristics based on the request and the target indices when deciding whether the can match phase should run or not. When unspecified, this pr runs the can match phase automatically if one of these conditions is met: * The request targets more than 128 shards. * The request contains read-only indices. * The primary sort of the query targets an indexed field. Users can opt-out from this behavior by setting the `pre_filter_shard_size` to a static value. Closes #39835	2020-03-24 02:05:15 +01:00
Nik Everett	4734c645f1	Fix serialization bug for aggs (#54029 ) I created this bug today in #53793. When a `DelayableWriteable` that references an existing object serializes itself it wasn't taking the version of the node on the other side of the wire into account. This fixes that.	2020-03-23 19:00:47 -04:00
Jason Tedor	5c96a7e210	Fix compilation in RemoteClusterServiceTests This commit fixes an issue when a JDK collection convenience method not available in JDK 8 was backported to 7.x.	2020-03-23 18:41:17 -04:00
Jason Tedor	d3cc5bff17	Give helpful message on remote connections disabled (#53690 ) Today when cluster.remote.connect is set to false, and some aspect of the codebase tries to get a remote client, today we return a no such remote cluster exception. This can be quite perplexing to users, especially if the remote cluster is actually defined in their cluster state, it is only that the local node is not a remote cluter client. This commit addresses this by providing a dedicated error message when a remote cluster is not available because the local node is not a remote cluster client.	2020-03-23 18:32:38 -04:00
Mark Vieira	70cfedf542	Refactor global build info plugin to leverage JavaInstallationRegistry (#54026 ) This commit removes the configuration time vs execution time distinction with regards to certain BuildParms properties. Because of the cost of determining Java versions for configuration JDK locations we deferred this until execution time. This had two main downsides. First, we had to implement all this build logic in tasks, which required a bunch of additional plumbing and complexity. Second, because some information wasn't known during configuration time, we had to nest any build logic that depended on this in awkward callbacks. We now defer to the JavaInstallationRegistry recently added in Gradle. This utility uses a much more efficient method for probing Java installations vs our jrunscript implementation. This, combined with some optimizations to avoid probing the current JVM as well as deferring some evaluation via Providers when probing installations for BWC builds we can maintain effectively the same configuration time performance while removing a bunch of complexity and runtime cost (snapshotting inputs for the GenerateGlobalBuildInfoTask was very expensive). The end result should be a much more responsive build execution in almost all scenarios. (cherry picked from commit ecdbd37f2e0f0447ed574b306adb64c19adc3ce1)	2020-03-23 15:30:10 -07:00

1 2 3 4 5 ...

4616 Commits