OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	18bfbeda29	Move merge compatibility logic from MappedFieldType to FieldMapper (#56915 ) Merging logic is currently split between FieldMapper, with its merge() method, and MappedFieldType, which checks for merging compatibility. The compatibility checks are called from a third class, MappingMergeValidator. This makes it difficult to reason about what is or is not compatible in updates, and even what is in fact updateable - we have a number of tests that check compatibility on changes in mapping configuration that are not in fact possible. This commit refactors the compatibility logic so that it all sits on FieldMapper, and makes it called at merge time. It adds a new FieldMapperTestCase base class that FieldMapper tests can extend, and moves the compatibility testing machinery from FieldTypeTestCase to here. Relates to #56814	2020-05-20 09:43:13 +01:00
Nik Everett	8b9c4eb3e0	Save memory when date_histogram is not on top (#56921 ) (#56960 ) When `date_histogram` is a sub-aggregator it used to allocate a bunch of objects for every one of its parent's buckets. This uses the data structures that we built in #55873 to rework the `date_histogram` aggregator instead of all of the allocation. Part of #56487	2020-05-19 17:36:55 -04:00
Lee Hinman	e208925465	[7.x] Add template simulation API for simulating template composition (#56842 ) (#56924 )	2020-05-19 08:12:21 -06:00
Tim Brooks	57c3a61535	Create HttpRequest earlier in pipeline (#56393 ) Elasticsearch requires that a HttpRequest abstraction be implemented by http modules before server processing. This abstraction controls when underlying resources are released. This commit moves this abstraction to be created immediately after content aggregation. This change will enable follow-up work including moving Cors logic into the server package and tracking bytes as they are aggregated from the network level.	2020-05-18 14:54:01 -06:00
Armin Braun	46e5c37267	Remove Dead Conditional from RoutingTable (#56870 ) (#56914 ) `delta` is always positive here. Co-authored-by: Howard <danielhuang@tencent.com>	2020-05-18 17:18:26 +02:00
David Turner	9ba897fbd6	Random iterations in testDataOnlyNodePersistence (#56906 ) PR #56893 was supposed to randomise the iteration count in `testDataOnlyNodePersistence` but this change was mistakenly omitted. This commit addresses this.	2020-05-18 15:16:22 +01:00
David Turner	64280b489b	Fix testDataOnlyNodePersistence (#56893 ) This test failed if all 1000 top-level `rarely()` calls in the loop returned `false`, because then we would never set the term of the persisted state. This commit fixes this by adding an earlier call to `persistedState#setCurrentTerm`. It also changes the test to clean up the threadpools it starts whether it passes or fails.	2020-05-18 13:57:36 +01:00
Armin Braun	e75a6f13a1	Stop Redundantly Serializing ShardId in BulkShardResponse (#56094 ) (#56866 ) When reading/writing the individual doc responses in the context of a bulk shard response there is no need to serialize the `ShardId` over and over. This can waste a lot of memory when handling large bulk requests.	2020-05-17 10:27:17 +02:00
Armin Braun	31f54c934e	Relax Assertion About SnapshotsService Listeners (#56608 ) (#56863 ) This assertion is too strict. A snapshot will be removed from the cluster state on the CS thread before it is removed from the listeners map on the snapshot thread pool. Throughout the removal from the cluster state and listener map, the snapshot is tracked in `endingSnapshots` though, so we can relax the assertion accordingly and are still able to catch leaked listeners. Closes #56607	2020-05-17 09:17:41 +02:00
Armin Braun	b9614558b9	Fix SnapshotStatusApisIT (#56859 ) (#56861 ) In the unlikely event that the data nodes started snapshotting the shards already (and hence got blocked on the data blobs) before the master has applied the cluster state to its own `SnapshotsService` on the CS applier thread, we can get a `SnapshotMissingException` here which breaks the busy assert loop so we have to deal with it explicitly. Closes #56858	2020-05-16 21:50:25 +02:00
Tim Brooks	195a5247d4	Prevent connection races in testEnsureWeReconnect (#56654 ) Currently it is possible that a sniff connection round is occurring as we enter another test loop in testEnsureWeReconnect. The problem is that once we enter another loop, closing the connection manually can cause this pre-existing connection round to fail. This round failing can fail the test. This commit fixes the issue by ensuring that there are no in-progress connections before entering another loop.	2020-05-15 14:58:46 -06:00
Nik Everett	f3e962707b	Mute TaskManagerTests#testTrackingChannelTask It fails sometimes. Tracked by #56746.	2020-05-15 16:48:33 -04:00
Nik Everett	7b626826eb	Fix sum test It was relying on the compensated sum working but the test framework was dodging it. This forces the accuracy tests to come from a single shard where we get the proper compensated sum. Closes #56757	2020-05-15 16:16:30 -04:00
Jason Tedor	da833d6cd3	Use settings infrastructure for shards and replicas (#56801 ) We get the number of shards and replicas with our bare hands in index metadata, rather than letting the settings infrastructure do the work for us. This commit switches to using the settings infrastructure.	2020-05-15 15:59:30 -04:00
David Turner	a3e845cbad	Suppress cluster UUID logs in 6.8/7.x upgrade (#56835 ) Today a 7.x node logs `cluster UUID set to [...]` on every cluster state update received from a 6.8 master, because 6.8 nodes are not able to commit the cluster UUID properly. We could try and deduplicate these logs somehow, but that would introduce a good deal of complexity. Instead, this commit suppresses these logs entirely when receiving cluster state updates from a 6.8 master.	2020-05-15 19:45:32 +01:00
Dan Hermann	66871c5342	[7.x] Rename endpoint from plural "_data_streams" to singular "_data_stream" (#56825 )	2020-05-15 10:27:53 -05:00
Alan Woodward	d33d13f2be	Simplify generics on Mapper.Builder (#56747 ) Mapper.Builder currently has some complex generics on it to allow fluent builder construction. However, the second parameter, a return type from the build() method, is unnecessary, as we can use covariant return types. This commit removes this second generic parameter.	2020-05-15 12:14:49 +01:00
Ryan Ernst	9fb80d3827	Move publishing configuration to a separate plugin (#56727 ) This is another part of the breakup of the massive BuildPlugin. This PR moves the code for configuring publications to a separate plugin. Most of the time these publications are jar files, but this also supports the zip publication we have for integ tests.	2020-05-14 20:23:07 -07:00
Tal Levy	5e90ff32f7	Add Normalize Pipeline Aggregation (#56399 ) (#56792 ) This aggregation will perform normalizations of metrics for a given series of data in the form of bucket values. The aggregation supports the following normalizations: - rescale 0-1 - rescale 0-100 - percentage of sum - mean normalization - z-score normalization - softmax normalization The normalization to use is specified in the normalize agg's `normalizer` field. For example: ``` { "normalize": { "buckets_path": <>, "normalizer": "percent" } } ```	2020-05-14 17:40:15 -07:00
Lee Hinman	a73d7d9e2b	[7.x] Don't allow invalid template combinations (#56397 ) (#56795 ) Backports the following commits to 7.x: - Don't allow invalid template combinations (#56397)	2020-05-14 16:20:53 -06:00
Mark Tozzi	b718193a01	Clean up DocValuesIndexFieldData (#56372 ) (#56684 )	2020-05-14 12:42:37 -04:00
Nhat Nguyen	044ee380e8	Use ConcurrentSet in testTrackingChannelTask (#56775 ) We need to use a ConcurrentSet to track the canceled tasks as cancelTaskAndDescendants can be called concurrently. Closes #56746	2020-05-14 12:22:59 -04:00
David Turner	f0c2c25527	AwaitsFix for #56746 (and #56751 )	2020-05-14 12:46:32 +01:00
David Turner	63cc53e512	AwaitsFix for #56757	2020-05-14 12:00:15 +01:00
Martijn van Groningen	b87aeb09f7	Allow more apis to resolve data streams (#56743 ) Backporting #56683 to 7.x branch. Allow get settings, cluster state and field caps apis to resolve data streams.	2020-05-14 10:57:13 +02:00
Nhat Nguyen	ac432f6612	Reduce test load in TaskManagerTests	2020-05-13 23:52:48 -04:00
Nhat Nguyen	566b23c42c	Cancel task and descendants on channel disconnects (#56620 ) If a channel gets disconnected, then we should cancel the tasks associated with that channel as their results won't be retrieved. Closes #56327 Relates #56619 Backport of #56620	2020-05-13 22:09:58 -04:00
Jason Tedor	7c8860b7e6	Update number of replicas when removing setting (#56723 ) We previously rejected removing the number of replicas setting, which prevents users from reverting this setting to its default the natural way. To fix this, we put back the setting with the default value in the cases that the user is trying to remove it. Yet, we also need to do the work of updating the routing table and so on appropriately. This case was missed because when the setting is being removed, we were defaulting to -1 in this code path, which is treated as not being updated. Instead, we must treat the case when we are removing this setting as if the setting is being updated, too. This commit does that.	2020-05-13 20:13:25 -04:00
David Roberts	ab40466bfb	Prevent unexpected native controller output hanging the process (#56685 ) In normal operation native controllers are not expected to write anything to stdout or stderr. However, if due to an error or something unexpected with the environment a native controller does write something to stdout or stderr then it will block if nothing is reading that output. This change makes the stdout and stderr of native controllers reuse the same stdout and stderr as the Elasticsearch JVM (which are by default redirected to es.stdout.log and es.stderr.log) so that if something unexpected is written to native controller output then: 1. The native controller process does not block, waiting for something to read the output 2. We can see what the output was, making it easier to debug obscure environmental problems Backport of #56491 Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-05-13 22:57:00 +01:00
Nik Everett	b98b260048	Merge significant_terms into the terms package (backport of #56699 ) (#56715 ) This merges the code for the `significant_terms` agg into the package for the code for the `terms` agg. They are super entangled already, this mostly just admits that to ourselves. Precondition for the terms work in #56487	2020-05-13 17:36:21 -04:00
Luca Cavanna	34410814b9	Don't omit empty arrays when filtering _source (#56527 ) When using source filtering exclusions, empty arrays are not preserved in documents, and no empty arrays are returned if arrays are empty after applying exclusions. We have special treatment to make sure that we preserve empty objects, but the behaviour for arrays is different. It looks like this regression was introduced by #22593, shortly after we refactored source filtering to use automata (#20736). Note that this change affects what the search API returns when using source exclusions, as well as what gets indexed when using source exclusions for the _source field. Closes #23796	2020-05-13 23:24:21 +02:00
Nik Everett	126619ae3c	Add list of deferred aggregations to the profiler (backport of #56208 ) (#56682 ) This adds a few things to the `breakdown` of the profiler: * `histogram` aggregations now contain `total_buckets` which is the count of buckets that they collected. This could be useful when debugging a histogram inside of another bucketing agg that is fairly selective. * All bucketing aggs that can delay their sub-aggregations will now add a list of delayed sub-aggregations. This is useful because we sometimes have fairly involved logic around which sub-aggregations get delayed and this will save you from having to guess. * Aggregations wrapped in the `MultiBucketAggregatorWrapper` can't accurately add anything to the breakdown. Instead the wrapper adds a marker entry `"multi_bucket_aggregator_wrapper": true` so we can quickly pick out such aggregations when debugging. It also fixes a bug where `_count` breakdown entries were contributing to the overall `time_in_nanos`. They didn't add a large amount of time so it is unlikely that this caused a big problem, but I was there. To support the arbitrary breakdown data this reworks the profiler so that the `breakdown` can contain any data that is supported by `StreamOutput#writeGenericValue(Object)` and `XContentBuilder#value(Object)`.	2020-05-13 16:33:22 -04:00
Julie Tibshirani	1ad83c37c4	Use index sort range query when possible. (#56710 ) This PR proposes to use `IndexSortSortedNumericDocValuesRangeQuery` when possible to speed up certain range queries. Points-based queries are already very efficient, the only time this query makes a difference is when the range matches a large number of documents. Relates to #48665.	2020-05-13 13:24:45 -07:00
Jason Tedor	5ca2ea2dde	Allow removing replicas setting on closed indices (#56680 ) This is similar to a previous change that allowed removing the number of replicas settings (so setting it to its default) on open indices. This commit allows the same for closed indices. It is unfortunate that we have separate branches for handling open and closed indices here, but I do not see a clean way to merge these two together without making a rather unnatural method (note that they invoke different methods for doing the settings updates). For now, we leave this as-is even though it led to the miss here.	2020-05-13 15:56:58 -04:00
Mark Vieira	e3be18a443	Add version 6.8.10	2020-05-13 11:27:40 -07:00
Bogdan Pintea	2f0663c490	Add the 7.7.1 Version Add the bumped 7.7 branch new version, 7.7.1	2020-05-13 18:46:07 +02:00
Ignacio Vera	b4521d5183	upgrade to Lucene 8.6.0 snapshot (#56661 )	2020-05-13 14:25:16 +02:00
Jason Tedor	4394235c63	Allow removing index.number_of_replicas setting (#56656 ) Today a user can create an index without setting the index.number_of_replicas setting even though the index metadata requires that the setting has a value. We do this when creating an index by explicitly setting index.number_of_replicas to a default value if one is not provided. However, if a user updates the number of replicas, and then wants to return to the default value, they are naturally inclined to try setting this setting to null, as the agreed upon way to return a setting to its default. Since the index metadata requires that this setting has a non-null value, we blow up when a user attempts to make this change. This is because we are not taking the same action when updating a setting on an index that we take when creating an index. Namely, we are not explicitly setting index.number_of_replicas if the request does not carry a value for this setting. This would happen when nulling the setting, which we want to support. This commit addresses this by setting index.number_of_replicas to the default if the value for this setting is null when updating the settings for an index.	2020-05-13 06:25:43 -04:00
Christoph Büscher	73b64908b2	Fix `time_zone` on `query_string` and date fields (#55881 ) (#56668 ) Currently the `time_zone` parameter in `query_string` queries gets applied correctly only when using the range syntax, e.g. `date:[2020-01-02 TO 2020-01-05]`. When a date field gets searched without explicit range syntax, e.g. `date:"2020-01-01"`, we internally create a range query that uses the specified date as start date and rounds up to the next underspecified units for the end date (e.g. here 2020-01-01T23:59:59) without considering the `time_zone` settings. This change adds a check in QueryStringQueryParser to detect this scenario early where we have access to the time zone information and directly create a range query using it. Closes #55813	2020-05-13 11:20:25 +02:00
Henning Andersen	48a8c7eb88	Ensure search contexts are removed on index delete (#56335 ) (#56617 ) In a race condition, a search context could remain enlisted in SearchService when an index is deleted, potentially causing the index folder to not be cleaned up (for either lengthy searches or scrolls with timeouts > 30 minutes or if the scroll is kept active).	2020-05-13 09:41:02 +02:00
Jake Landis	a56fb6192e	[7.x] Fix ingest simulate verbose on failure with conditional (#56478 ) (#56635 ) If a conditional is added to a processor, and that processor fails, and that processor has an on_failure handler, the full trace of all of the executed processors may not be displayed in simulate verbose. The information is correct, but misses displaying some of the steps used to get there. This happens because a conditional processor is a wrapper around the real processor and a processor with an on_failure handler is also a wrapper around the processor(s). When decorating for simulation we treat compound processor specially, but if a compound processor is wrapped by a conditional processor that compound processor's processors can be missed for decoration resulting in the missing displayed steps. The fix to this is to treat the conditional processor specially and explicitly separate it from the processor it is wrapping. This requires us to keep track of 2 processors: a possible conditional processor and the actual processor it may be wrapping. related: #56004	2020-05-12 15:41:05 -05:00
Armin Braun	0a879b95d1	Save Bounds Checks in BytesReference (#56577 ) (#56621 ) Two spots that allow for some optimization: * We are often creating a composite reference of just a single item in the transport layer => special cased via static constructor to make sure we never do that * Also removed the pointless case of an empty composite bytes ref * `ByteBufferReference` is practically always created from a heap buffer these days so there is no point of dealing with all the bounds checks and extra references to sliced buffers from that and we can just use the underlying array directly	2020-05-12 20:33:45 +02:00
Jason Tedor	f7b8f0b2f4	Adjust warning for heap size bootstrap check (#56565 ) Today the heap size check warns the user about two issues why they might care about the heap size check: resize pauses, and if memory locking is enabled. Yet, we unconditionally make mention of the memory locking reason, even if memory locking is not enabled. This can confuse some users, so we adjust the warning about memory locking to only display if memory locking is enabled.	2020-05-12 14:31:21 -04:00
Martijn van Groningen	0c61bc63e4	Backport: auto create data streams using index templates v2 (#56596 ) Backport: #55377 This commit adds the ability to auto create data streams using index templates v2. Index templates (v2) now have a data_stream field that includes a timestamp field. If it is provided and the index name matches that template, then a data stream (plus first backing index) is auto created. Relates to #53100	2020-05-12 17:01:15 +02:00
Dan Hermann	dfdd7e4fce	Report used memory as zero when total memory cannot be obtained (#56412 )	2020-05-12 07:43:51 -05:00
Ignacio Vera	222ee721ec	Add moving percentiles pipeline aggregation (#55441 ) (#56575 ) Similar to what the moving function aggregation does, except merging windows of percentiles sketches together instead of cumulatively merging final metrics	2020-05-12 11:35:23 +02:00
Martijn van Groningen	7b1f978931	Move data stream test (#56505 ) (#56570 ) Move data stream resolvability test from IndicesOptionsIntegrationIT to DataStreamIT class. Whether a transport action supports data streams is no longer controlled via indices options.	2020-05-12 10:44:13 +02:00
Armin Braun	2d08ef729c	Deduplicate Strings in REST Bulk Request Parsing (#56506 ) (#56568 ) We can save a little memory here since these strings might live for quite a while on the coordinating node.	2020-05-12 09:52:44 +02:00
Ryan Ernst	902fc546bd	Migrate remaining ESIntegTestCases to internalClusterTest (#56479 ) (#56563 ) This commit migrates the ESIntegTestCase tests in x-pack to the internalClusterTest source set.	2020-05-11 21:06:04 -07:00
Nik Everett	137df274ab	Add support for numeric range keys (#56452 ) (#56552 ) This adds support for parsing numbers as range keys. They get converted into a string, but we allow numbers. While I was there I replaced the parser for `Range` with a `ConstructingObjectParser` which will automatically add support for "did you mean" style corrections on errors. Closes #56402	2020-05-11 19:48:59 -04:00
Nick Knize	9b64149ad2	[Geo] Refactor Point Field Mappers (#56060 ) (#56540 ) This commit refactors the following: * GeoPointFieldMapper and PointFieldMapper to AbstractPointGeometryFieldMapper derived from AbstractGeometryFieldMapper. * .setupFieldType moved up to AbstractGeometryFieldMapper * lucene indexing moved up to AbstractGeometryFieldMapper.parse * new addStoredFields, addDocValuesFields abstract methods for implementing stored field and doc values field indexing in the concrete field mappers This refactor is the next phase for setting up a framework for extending spatial field mapper functionality in x-pack.	2020-05-11 17:11:36 -05:00
Benjamin Trent	1d6b2f074e	[Transform] adds geotile_grid support in group_by (#56514 ) (#56549 ) This adds support for grouping by geo points. This uses the agg [geotile_grid](https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-bucket-geotilegrid-aggregation.html). I am opting to store the tile results of group_by as a `geo_shape` so that users can query the results. Additionally, the shapes could be visualized and filtered in the kibana maps app. relates to https://github.com/elastic/elasticsearch/issues/56121	2020-05-11 17:02:40 -04:00
Lee Hinman	1337b35572	Remove prefer_v2_templates query string parameter (#56545 ) This commit removes the `prefer_v2_templates` flag and setting. This was a brief setting that allowed specifying whether V1 or V2 template should be used when an index is created. It has been removed in favor of V2 templates always having priority. Relates to #53101 Resolves #56528 This is not a breaking change because this flag was never in a released version.	2020-05-11 14:56:42 -06:00
zhenxianyimeng	8e96e5c936	Use CollectionUtils.isEmpty where appropriate (#55910 ) This commit uses the isEmpty utility method for arrays in place of null and greater than zero checks.	2020-05-11 09:55:57 -07:00
Martijn van Groningen	32471abc0e	Check whether data stream feature flag is enabled before deleting all data streams, (#56517 ) (#56520 ) this will fix the release build.	2020-05-11 18:34:50 +02:00
Martijn van Groningen	9ae09570d8	Allow a number of broadcast transport actions to resolve data streams (#55726 ) (#56502 ) Change TransportBroadcastByNodeAction and TransportBroadcastReplicationAction to be able to resolve data streams by default. Implementations can change this ability. This change allows the following APIs to resolve data streams: flush, refresh (already supported data streams), force merge, clear indices cache, indices stats (already supported data streams), segments, upgrade stats, upgrade, validate query, searchable snapshots stats, clear searchable snapshots cache and reload analyzers APIs. Relates to #53100	2020-05-11 12:48:35 +02:00
Nik Everett	2823300bdf	Speed up rounding in auto_date_histogram (#56384 ) (#56486 ) This wires `auto_date_histogram` into the rounding optimization that I built in #55559. This should significantly speed up any `auto_date_histogram`s with `time_zone`s on them.	2020-05-09 11:37:30 -04:00
Nik Everett	2f38aeb5e2	Save memory when numeric terms agg is not top (#55873 ) (#56454 ) Right now all implementations of the `terms` agg allocate a new `Aggregator` per bucket. This uses a bunch of memory. Exactly how much isn't clear but each `Aggregator` ends up making its own objects to read doc values which have non-trivial buffers. And it forces all of its sub-aggregations to do the same. We allocate a new `Aggregator` per bucket for two reasons: 1. We didn't have an appropriate data structure to track the sub-ordinals of each parent bucket. 2. You can only make a single call to `runDeferredCollections(long...)` per `Aggregator` which was the only way to delay collection of sub-aggregations. This change switches the method that builds aggregation results from building them one at a time to building all of the results for the entire aggregator at the same time. It also adds a fairly simplistic data structure to track the sub-ordinals for `long`-keyed buckets. It uses both of those to power numeric `terms` aggregations and removes the per-bucket allocation of their `Aggregator`. This fairly substantially reduces memory consumption of numeric `terms` aggregations that are not the "top level", especially when those aggregations contain many sub-aggregations. It also is a pretty big speed up, especially when the aggregation is under a non-selective aggregation like the `date_histogram`. I picked numeric `terms` aggregations because those have the simplest implementation. At least, I could kind of fit it in my head. And I haven't fully understood the "bytes"-based terms aggregations, but I imagine I'll be able to make similar optimizations to them in follow up changes.	2020-05-08 20:38:53 -04:00
Nik Everett	bd4b9dd10e	Speed up time interval rounding around dst (backport #56371 ) (#56396 ) When an index spans a daylight savings time transition we can't use our optimization that rewrites the requested time zone to a fixed time zone and instead we used to fall back to a java.time based rounding implementation. In #55559 we optimized "time unit" rounding. This optimizes "time interval" rounding. The java.time based implementation is about 1650% slower than the rounding implementation for a fixed time zone. This replaces it with a similar optimization that is only about 30% slower than the fixed time zone. The java.time implementation allocates a ton of short lived objects but the optimized implementation doesn't. So it might end up being faster than the microbenchmarks imply.	2020-05-08 13:39:27 -04:00
Armin Braun	b18d242300	Fix Simulate Template Endpoint Temporary Index Handling (#56406 ) (#56432 ) Use proper facility for creating temporary index service for the simulation that does not add itself to the `IndicesService` unnecessarily (breaking an assertion about the internal consistency of the cluster state and the `IndicesService`). Closes #56298	2020-05-08 18:05:24 +02:00
Martijn van Groningen	83739b5806	Backport: allow cluster health api to resolve data streams (#56425 ) Backport of: #56413 Allow cluster health api to resolve data streams and automatically remove data streams after each test in test cases extending from `ESIntegTestCase` Relates to #53100	2020-05-08 17:16:25 +02:00
Tanguy Leroux	8e9b69bfd7	Use snapshot information to build searchable snapshot store MetadataSnapshot (#56289 ) (#56403 ) While investigating possible optimizations to speed up searchable snapshots shard restores, we noticed that Elasticsearch builds the list of shard files on local disk in order to compare it with the list of files contained in the snapshot to restore. This list of files is materialized with a MetadataSnapshot object whose construction involves reading the footer checksum of every file of the shard using the Store.checksumFromLuceneFile() method. Further investigation shows that a MetadataSnapshot object is also created for other types of operations like building the list of files to recover in a peer recovery (and primary shard relocation) or in order to assign a shard to a node. These operations use the Store.getMetadata(IndexCommit) method to build the list of files and checksums. In the case of searchable snapshots, building the MetadataSnapshot object can potentially trigger cache misses, which in turn can cause the download and the writing in cache of the last range of the file in order to check the 16 bytes footer. This in turn can cause more evictions. Since searchable snapshots already contain the footer information of every file in BlobStoreIndexShardSnapshot, they can directly read the checksum from it and avoid using the cache at all to create a MetadataSnapshot for the operations mentioned above. This commit adds a shortcut to the SearchableSnapshotDirectory.openInput() method - similarly to what already exists for segment infos - so that it creates a specific IndexInput for checksum reading operation.	2020-05-08 14:16:19 +02:00
Armin Braun	085ff8c404	Add More Trace Logging to BlobStoreRepository (#56336 ) (#56401 ) Adding more trace logging that would be helpful in understanding the precise order of blob-level operations if needed.	2020-05-08 08:31:32 +02:00
Tal Levy	13944b1bf9	Fix max-int limit for number of points reduced in geo_centroid (#56370 ) A bug in InternalGeoCentroid#reduce existed that summed up the aggregation's long-valued counts into a local integer variable. Since it is definitely possible to reduce more than Integer.MAX_VALUE points, this change simply updates that variable to be a long-valued number. Closes #55992.	2020-05-07 14:30:29 -07:00
Tal Levy	6e0178fb68	Move CumulativeSumPipelineAgg to use ConstructingObjectParser parsing (#55990 ) (#56380 ) As part of #52776, this refactors the aggregation to use the context parser to parse its parameters.	2020-05-07 12:34:54 -07:00
Tim Brooks	b84d1e2577	Improve logging around SniffConnectionStrategy (#56378 ) Currently, the logging around the SniffConnectionStrategy is limited. The log messages are inconsistent and sometimes wrong. This commit cleans up these log messages to describe when connections are happening and what failed if a step fails. Additionally, this commit enables TRACE logging for a problematic test (testEnsureWeReconnect).	2020-05-07 13:11:56 -06:00
Tim Brooks	9d076364d7	Fix testCollectNodes test assertion (#56294 ) Currently when a connection closes a new sniff round begins. The testCollectNodes test closes four transports before triggering the method to collect the remote nodes. This leads to a race where there are a number of reasons the collect nodes call might fail. This commit fixes that issue by changing the test assertion to include a potential failure condition. Fixes #55292.	2020-05-07 11:52:43 -06:00
Nik Everett	b5e385fa56	Fix auto_date_histogram interval (#56252 ) (#56341 ) `auto_date_histogram` was returning the incorrect `interval` because of a combination of two things: 1. When pipeline aggregations rewrote `auto_date_histogram` we reset the interval to 1. Oops. Fixed that. 2. Every bucket aggregation was rewriting its buckets as though there was a pipeline aggregation even if there aren't any. This is a bit silly so we skip that too. Closes #56116	2020-05-07 10:27:40 -04:00
Nhat Nguyen	bd0e0f41a0	Ensure unregister child node if failed to register task (#56254 ) We fail to unregister the child node in registerAndExecute if the parent task is being canceled. This leads to a bug where a cancel request never completes. Closes #55875 Relates #54312	2020-05-07 10:10:13 -04:00
Nik Everett	e35919d3b8	Optimize date_histograms across daylight savings time (backport of #55559 ) (#56334 ) Rounding dates on a shard that contains a daylight savings time transition is currently something like 1400% slower than when a shard contains dates only on one side of the DST transition. And it makes a ton of short lived garbage. This replaces that implementation with one that benchmarks to having around 30% overhead instead of the 1400%. And it doesn't generate any garbage per search hit. Some background: There are two ways to round in ES: * Round to the nearest time unit (Day/Hour/Week/Month/etc) * Round to the nearest time interval (3 days/2 weeks/etc) I'm only optimizing the first one in this change and plan to do the second in a follow up. It turns out that rounding to the nearest unit really is two problems: when the unit rounds to midnight (day/week/month/year) and when it doesn't (hour/minute/second). Rounding to midnight is consistently about 25% faster than rounding to individual hours or minutes. This optimization relies on being able to usually figure out what the minimum and maximum dates are on the shard. This is similar to an existing optimization where we rewrite time zones that aren't fixed (think America/New_York and its daylight savings time transitions) into fixed time zones so long as there isn't a daylight savings time transition on the shard (UTC-5 or UTC-4 for America/New_York). Once I implement time interval rounding the time zone rewriting optimization should no longer be needed. This optimization doesn't come into play for `composite` or `auto_date_histogram` aggs because neither have been migrated to the new `DATE` `ValuesSourceType` which is where that range lookup happens. When they are they will be able to pick up the optimization without much work. I expect this to be substantial for `auto_date_histogram` but less so for `composite` because it deals with fewer values. 
Note: My 30% overhead figure comes from small numbers of daylight savings time transitions. That overhead gets higher when there are more transitions in logarithmic fashion. When there are two thousand years worth of transitions my algorithm ends up being 250% slower than rounding without a time zone, but java time is 47000% slower at that point, allocating memory as fast as it possibly can.	2020-05-07 09:10:51 -04:00
Armin Braun	3bad5b3c01	Fix Noisy Logging during Snapshot Delete (#56264 ) (#56329 ) We were logging the cleanup of the snap- and meta- blobs for every snapshot delete which is needlessly noisy and confusing to users. We should only log actual stale/unexpected blobs here.	2020-05-07 13:48:53 +02:00
Ryan Ernst	33d6a55d1d	Create plugin for internalClusterTest task (#56067 ) This commit creates a new gradle plugin to provide a separate task name and source set for running ESIntegTestCase tests. The only project converted to use the new plugin in this PR is server, as an example. The remaining cases in x-pack will be handled in followups. backport of #55896	2020-05-06 17:20:52 -07:00
Mark Vieira	f28a12cbba	Add version 7.9.0 This reverts commit `350e930e`	2020-05-06 13:34:57 -07:00
Julie Tibshirani	e852bb29b7	Simplify signature of FieldMapper#parseCreateField. (#56144 ) `FieldMapper#parseCreateField` accepts the parse context, plus a list of fields as an output parameter. These fields are immediately added to the document through `ParseContext#doc()`. This commit simplifies the signature by removing the list of fields, and having the mappers add the fields directly to `ParseContext#doc()`. I think this is nicer for implementors, because previously fields could be added either through the list, or the context (through `add`, `addWithKey`, etc.)	2020-05-06 11:12:09 -07:00
Rory Hunter	350e930e55	Revert "Add version 7.9.0" This reverts commit `b8b4ebd089`.	2020-05-06 14:24:55 +01:00
Rory Hunter	b8b4ebd089	Add version 7.9.0	2020-05-06 09:12:14 +01:00
Tanguy Leroux	131a3911eb	Replace BlobContainerWrapper by FilterBlobContainer (#56200 ) A FilterBlobContainer class was introduced in #55952 and it delegates its behavior to a given BlobContainer while allowing to override only necessary methods. This commit replaces the existing BlobContainerWrapper class from the test framework with the new FilterBlobContainer from core.	2020-05-06 10:05:43 +02:00
Dan Hermann	6674f14fb3	[7.x] Get index includes parent data stream for backing indices (#56238 )	2020-05-05 15:43:42 -05:00
Mark Tozzi	33da086d7b	[7.x] Wire up DiversifiedAggregation (#56145 ) (#56222 )	2020-05-05 13:11:49 -04:00
Tal Levy	e4f2c3105d	Add geo_shape support for geotile_grid and geohash_grid (#55966 ) (#56228 ) this commit adds aggregation support for the geo_shape field type on geo*_grid aggregations. it introduces a Tiler for both tiles and hashes that enables a new type of ValuesSource to replace the GeoPoint's CellIdSource. This makes it possible for the existing Aggregator to be re-used, so no new implementations of the grid aggregators are added.	2020-05-05 09:54:14 -07:00
Lee Hinman	b77c0bbe26	[7.x] Validate V2 templates more strictly (#56170 ) (#56226 ) Backports the following commits to 7.x: - Validate V2 templates more strictly (#56170)	2020-05-05 10:34:56 -06:00
Igor Motov	94b349cd18	[7.x] Simplify the ValuesSourceRegistry structure (#56154 ) (#56197 ) Follow up to #55747.	2020-05-05 10:37:02 -04:00
Tanguy Leroux	b9636713b1	Searchable Snapshots should respect max_restore_bytes_per_sec (#55952 ) (#56199 ) This commit changes searchable snapshots so that it now respects the repository's max_restore_bytes_per_sec setting when it downloads blobs. Backport of #55952 for 7.x	2020-05-05 15:43:06 +02:00
Jason Tedor	c38388c506	Fix compiling in TransportValidateQueryActionTests This arose after a backport where we do not have the niceties of the Java 11 diamond operator. This commit fixes it by adding the proper type parameter.	2020-05-05 07:36:40 -04:00
Jason Tedor	410eb29937	Fix validate query listener invocation bug (#56157 ) When the index we are validating a query does not exist, we try to send back a response letting the client know that the index does not exist. Yet, we accidentally fallthrough into the case that the validation failed for some other reason. This means that we end up notifying the channel twice. Sometimes the notification occurs after the failure has been written out and the channel closed (so the second invocation leads to a silent failed to write to a closed channel issue), and sometimes the response does end up in the channel, creating garbled responses to the client. This commit fixes that issue by avoiding the fallthrough.	2020-05-05 07:26:02 -04:00
Nhat Nguyen	60d097e262	Avoid copying file chunks in peer recovery (#56072 ) (#56172 ) A follow-up of #55353 to avoid copying file chunks before sending them to the network layer. Relates #55353	2020-05-04 23:39:34 -04:00
Ryan Ernst	39ba06cbb2	Add dummy file for new client example snippets location (#56152 ) This file is added simply to ensure the new directory exists, so it can be added to the docs configuration.	2020-05-04 15:48:56 -07:00
Lee Hinman	8fa14b333d	[7.x] Validate non-negative priorities for V2 index templates (#56139 ) (#56163 ) Backports the following commits to 7.x: - Validate non-negative priorities for V2 index templates (#56139)	2020-05-04 16:19:13 -06:00
Martijn van Groningen	2ac32db607	Move includeDataStream flag from IndicesOptions to IndexNameExpressionResolver.Context (#56151 ) Backport of #56034. Move includeDataStream flag from an IndicesOptions to IndexNameExpressionResolver.Context as a dedicated field that callers to IndexNameExpressionResolver can set. Also alter indices stats api to support data streams. The rollover api uses this api and otherwise rolling over a data stream would no longer work. Relates to #53100	2020-05-04 22:38:33 +02:00
Lee Hinman	3cefe192a2	[7.x] Remove Index Templates V2 feature flag (#56123 ) (#56141 ) Backports the following commits to 7.x: - Remove Index Templates V2 feature flag (#56123)	2020-05-04 13:15:51 -06:00
Martijn van Groningen	6d03081560	Add auto create action (#56122 ) Backport of #55858 to 7.x branch. Currently the TransportBulkAction detects whether an index is missing and then decides whether it should be auto created. The coordination of the index creation also happens in the TransportBulkAction on the coordinating node. This change adds a new transport action that the TransportBulkAction delegates to if missing indices need to be created. The reasons for this change: * Auto creation of data streams can't occur on the coordinating node. Based on the index template (v2) either a regular index or a data stream should be created. However if the coordinating node is slow in processing cluster state updates then it may be unaware of the existence of certain index templates, which then can lead to the TransportBulkAction creating an index instead of a data stream. Therefore the coordination of creating an index or data stream should occur on the master node. See #55377 * From a security perspective it is useful to know whether index creation originates from the create index api or from auto creating a new index via the bulk or index api. For example a user would be allowed to auto create an index, but not to use the create index api. The auto create action will allow security to distinguish these two different patterns of index creation. This change adds the following new transport actions: AutoCreateAction, the TransportBulkAction redirects to this action and this action will actually create the index (instead of the TransportCreateIndexAction). Later via #55377, can improve the AutoCreateAction to also determine whether an index or data stream should be created. The create_index index privilege is also modified, so that if this permission is granted then a user is also allowed to auto create indices. This change does not yet add an auto_create index privilege. A future change can introduce this new index privilege or modify an existing index / write index privilege. 
Relates to #53100	2020-05-04 19:10:09 +02:00
Julie Tibshirani	6b5cf1b031	For constant_keyword, make sure exists query handles missing values. (#55757 ) It's possible for a constant_keyword to have a 'null' value before any documents are seen that contain a value for the field. In this case, no documents have a value for the field, and 'exists' queries should return no documents.	2020-05-04 09:41:52 -07:00
Armin Braun	e8ef44ce78	Allow Bulk Snapshot Deletes to Abort (#56009 ) (#56111 ) Making use of #55773 to simplify snapshot state machine. 1. Deletes with no in-progress snapshot now add the delete entry to the cluster state right away instead of doing a second CS update after the first update was a NOOP. 2. If a bulk delete matches in-progress as well as completed snapshots, abort the in-progress snapshot and then move on to delete from the repository.	2020-05-04 16:21:00 +02:00
Christos Soulios	c65f828cb7	[7.x] Histogram field type support for ValueCount and Avg aggregations (#56099 ) Backports #55933 to 7.x Implements value_count and avg aggregations over Histogram fields as discussed in #53285 - value_count returns the sum of all counts array of the histograms - avg computes a weighted average of the values array of the histogram by multiplying each value with its associated element in the counts array	2020-05-04 13:23:02 +03:00
Armin Braun	e01b999ef0	Add Functionality to Consistently Read RepositoryData For CS Updates (#55773 ) (#56091 ) Using optimistic locking, add the ability to run a repository state update task with a consistent view of the current repository data. Allows for a follow-up to remove the snapshot INIT state.	2020-05-04 08:13:14 +02:00
Armin Braun	3a64ecb6bf	Allow Deleting Multiple Snapshots at Once (#55474 ) (#56083 ) * Allow Deleting Multiple Snapshots at Once (#55474) Adds deleting multiple snapshots in one go without significantly changing the mechanics of snapshot deletes otherwise. This change does not yet allow mixing snapshot delete and abort. Abort is still only allowed for a single snapshot delete by exact name.	2020-05-03 20:30:58 +02:00
David Turner	69f50fe79f	Improve same-shard allocation explanations (#56010 ) I see occasional confusion about the explanations emitted by the same-shard allocation decider, particularly amongst new users setting up a single-node cluster and trying to determine why their cluster has `yellow` health. For example: the shard cannot be allocated to the same node on which a copy of the shard already exists This is technically correct but it's quite a complicated sentence. Also, by starting with "the shard cannot be allocated" it makes it sound like this is the problem, whereas in fact this message is a good thing and users should typically focus their attention elsewhere. This commit simplifies the wording of these messages and makes them sound more positive, for example: a copy of this shard is already allocated to this node	2020-05-01 10:07:14 +01:00
Mark Tozzi	d8eb51ed63	Wire up GeoDistanceAggregation (#55975 ) (#56042 )	2020-04-30 15:43:27 -04:00
Tim Brooks	54dbea6c65	Improve RemoteConnectionManager consistency (#55759 ) In order to iterate through remote connections, the remote connection manager maintains a local cache of connected nodes. Unfortunately this is difficult in relationship with testing as it is inherently racy in comparison to the parent connection manager map of connections. This commit improves the relationship by only returning a cached connection if it is still registered with the parent. If the connection is not open, we will go to the slow path of allocating an iterator directly from the parent.	2020-04-30 12:13:06 -06:00
Igor Motov	d8f9df771d	Expose agg usage in Feature Usage API (#55732 ) (#56048 ) Counts usage of the aggs and exposes them on the _nodes/usage/. Closes #53746	2020-04-30 12:53:36 -04:00
Lee Hinman	3dada1e2d3	[7.x] Handle merging dotted object names when merging V2 template mappings (#55982 ) (#56041 ) Backports the following commits to 7.x: - Handle merging dotted object names when merging V2 template mappings (#55982)	2020-04-30 10:51:43 -06:00
Przemko Robakowski	797f63e743	[7.x] Emit deprecation warning if multiple v1 templates match with a new index (#55558 ) (#56038 ) * Emit deprecation warning if multiple v1 templates match with a new index (#55558) * Emit deprecation warning if multiple v1 templates match with a new index * DEPRECATION_LOGGER rename	2020-04-30 17:36:17 +02:00
Luca Cavanna	fc6422ffcc	Consolidate DelayableWriteable (#55932 ) This commit includes a number of minor improvements around `DelayableWriteable`: javadocs were expanded and reworded, `get` was renamed to `expand` and `DelayableWriteable` no longer implements `Supplier`. Also a couple of methods are now private instead of package private.	2020-04-30 17:16:58 +02:00
Andrei Dan	68985bc1ca	Add HLRC support for simulate index template api (#55936 ) (#56029 ) (cherry picked from commit 475790c34e0bab95d352132d6be63c4f5b219fb1) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-30 15:40:48 +01:00
Nhat Nguyen	2fd257add2	Ensure no circular reference in translog tragic exception (#55959 ) We generate a circular reference exception in translog in 6.8 in the following scenario: - The first rollGeneration hits "too many open files" exception when it's copying a checkpoint file. We will set the tragic exception and close the translog - The second rollGeneration hits AlreadyClosedException as the current writer is closed. We will suppress the ACE to the current tragic exception. Unfortunately, this leads to a circular reference as ACE already suppresses the tragic exception. Other factors that help to manifest this bug: - We do not fail the engine on AlreadyClosedException in flush - We do not check for ensureOpen before rolling a new generation Closes #55893	2020-04-30 08:40:19 -04:00
Christoph Büscher	c2afbf20de	Fix ExistsQueryBuilder#testToQuery failure (#56006 ) Occasionally this test can fail when the randomized index.version.created is before 6.1. In this case we don't check that if mappedFields.size() == 0 we expect a MatchNoDocsQuery query being returned, which we do for other versions. This fails only occasionally but with the seed provided on the original issue. It also shouldn't be an issue on master since we shouldn't test with these pre-7 index versions there. Closes #55950	2020-04-30 12:28:05 +02:00
Dan Hermann	9bf254fe36	REST test for rolling data streams	2020-04-29 17:34:52 -05:00
Nhat Nguyen	b1136948b4	Mute CancellableTasksIT.testDoNotWaitForCompletion	2020-04-29 18:13:03 -04:00
Dan Hermann	bf89e485fc	[7.x] Delete index API properly handles backing indices for data streams (#55971 )	2020-04-29 16:32:59 -05:00
Christos Soulios	43dab77186	[7.x] Modified searchAndReduce() to return empty agg when no docs exist (#55967 ) Backports #55826 to 7.x Modified AggregatorTestCase.searchAndReduce() method so that it returns an empty aggregation result when no documents have been inserted. Also refactored several aggregation tests so they do not re-implement method AggregatorTestCase.testCase() Fixes #55824	2020-04-30 00:28:32 +03:00
Mark Tozzi	9cd3175bbb	[7.x] Wire up AutoDateHistogram to the ValuesSourceRegistry (#55687 ) (#55870 )	2020-04-29 16:26:09 -04:00
Nhat Nguyen	c547a92ac6	Revert "Mute CancellableTasksIT.testDoNotWaitForCompletion" This reverts commit `0c095bbd0c`.	2020-04-29 16:21:42 -04:00
Mark Vieira	0c095bbd0c	Mute CancellableTasksIT.testDoNotWaitForCompletion	2020-04-29 12:59:07 -07:00
Nhat Nguyen	edbaa19a5d	Add trace log for task cancellation (#55940 ) Adding trace logs to the task cancellation and its tests to debug the test failure in #55875. Relates #55875	2020-04-29 15:37:37 -04:00
Mark Vieira	144e8ce092	Update Lucene version for Elasticsearch 6.8.9 (#55963 )	2020-04-29 12:36:37 -07:00
Tim Brooks	9eb6736500	Fix NullPointer when message shortcircuited (#55945 ) Currently if we shortcircuit a message the breaker release is null since there is nothing to be broken. However, the TcpTransportChannel infrastructure still expects it. This commit resolves this issue by returning a no-op breaker release.	2020-04-29 10:11:39 -06:00
Christoph Büscher	57409fccbd	Remove unnecessary instance variable in QueryStringQueryParser (#55915 ) Currently `currentFieldType` is an instance variable that is first set and then used by all methods referring to it. We can make it local to each method instead, avoiding possible state problems and improving readability of the code.	2020-04-29 16:30:48 +02:00
Andrei Dan	6b886b0b7a	[7.x] Add simulate template composition API _index_template/_simulate_index/{name} (#55686 ) (#55922 ) This adds a new api to simulate matching the given index name against the index templates in the system. The syntax for the new API takes the following form: POST _index_template/_simulate_index/{index_name} { "index_patterns": ["logs-*"], "priority": 15, "template": { "settings": { "number_of_shards": 3 } ... } } Where the body is optional, but we support the entire body used by the PUT _index_template/{name} api. When the body is specified we'll simulate matching the given index against a system that'd have the given index template together with the index templates that exist in the system. The response, in both cases, will return the matching template's resolved settings, mappings and aliases, together with a special field that'll print any overlapping templates and their corresponding index patterns. (cherry picked from commit 1a5845edce1f445c58e094e9a3b6792e21e543b0) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-04-29 14:57:44 +01:00
Christos Soulios	02bf0c586a	[7.x] Histogram field type support for Sum aggregation (#55916 ) Implements Sum aggregation over Histogram fields by summing the value of each bucket multiplied by their count as requested in #53285 Backports #55681 to 7.x	2020-04-29 15:06:12 +03:00
Yang Cheng	06b3345787	Avoid double-recovery when state recovery delayed Today if state recovery is delayed by the `gateway.recover_after_*` settings then we may end up performing state recovery twice: once when enough nodes have joined the cluster, and again when the timeout elapses. The second state recovery reinitializes the routing table, effectively discarding all recovered/recovering shards and starting again from scratch. This commit adds a check to prevent this second state recovery. Closes #55564	2020-04-29 11:55:28 +01:00
Armin Braun	b96db2ee2b	Increase Timeout in ClusterDisruptionIT.testRestartNodeWhileIndexing (#55877 ) (#55880 ) The test failed in #55869 but the `docId` was never stuck, it just moved slowly upwards. => increasing the timeout. Closes #55869	2020-04-29 06:47:00 +02:00
Tim Brooks	8d1595698b	Improve start_recovery check in IndexRecoveryIT (#55867 ) Currently the testTransientErrorsDuringRecoveryAreRetried validates that the expected peer recovery starts only once. This check is coarse and is executed on all nodes and indexes. This commit modifies this check to only be performed on the expected index. Additionally this commit removes the disruption behavior from the "blue" node where it is not relevant. Finally, this commit improves the logging for this test.	2020-04-28 16:40:03 -06:00
Nik Everett	a5d0409a8f	Save memory on aggs in async search (#55683 ) (#55879 ) This replaces a reference to the result of partially reducing aggregations that async search keeps with a reference to the serialized form of the result of the partial reduction which we need to keep anyway.	2020-04-28 16:23:30 -04:00
Ryan Ernst	fed296ebb7	Add method to check if object is generically writeable in stream (#54936 ) (#55561 ) When calling scripts in metric aggregation, the returned metric state is passed along to the coordinating node to do the final reduce. However, it is possible the object could contain nested state which is unknown to StreamOutput/StreamInput. This would then result in the node crashing as exceptions are not expected in the middle of serialization. This commit adds a method to StreamOutput that can determine if an object is writeable by the stream. It uses the same logic writeGenericValue, special casing each of the supported collection types to recursively determine if each contained value is itself writeable. relates #54708	2020-04-28 13:08:41 -07:00
Tim Brooks	9e376589a6	Fully stop RetryableAction when cancelled (#55614 ) Currently cancelling the RetryableAction does not stop one last run from being executed. This commit makes a best effort attempt to cancel a scheduled retry and guards future executions from the action already being completed.	2020-04-28 13:54:00 -06:00
Tim Brooks	cd228095df	Retry failed peer recovery due to transient errors (#55883 ) Currently a failed peer recovery action will fail a recovery. This includes when the recovery fails due to potentially short lived transient issues such as rejected exceptions or circuit breaking errors. This commit adds the concept of a retryable action. A retryable action will be retried in the face of certain errors. The action will be retried after an exponentially increasing backoff period. After a defined time, the action will time out. This commit only implements retries for responses that indicate the target node has NOT executed the action.	2020-04-28 13:52:49 -06:00
Nhat Nguyen	ad6221c0cb	Fix testKeepTranslogAfterGlobalCheckpoint (#55868 ) If we advance the global checkpoint during commit and sync that checkpoint after commit, then the assertions in the test won't hold because the deletion policy did not see the latest global checkpoint but only the value before committing. Closes #55680	2020-04-28 12:50:41 -04:00
Henning Andersen	cab7bcc156	Disk decider respect watermarks for single data node (#55805 ) (#55847 ) The disk decider had special handling for the single data node case, allowing any allocation (skipping watermark checks) for such clusters. This special handling can now be avoided via a setting.	2020-04-28 18:46:22 +02:00
Lee Hinman	777caf0725	[7.x] Add support for V2 index templates to /_cat/templates (#55829 ) (#55866 ) Backports the following commits to 7.x: - Add support for V2 index templates to /_cat/templates (#55829)	2020-04-28 10:14:19 -06:00
Mark Tozzi	bebbc375ae	Wire up IpRangeAggregation to ValuesSourceRegistry (#55831 ) (#55859 )	2020-04-28 12:10:21 -04:00
Armin Braun	f38385ee25	Fix Leaking Listener When Closing NodeClient (#55676 ) (#55864 ) If a node client (or rather its underlying node) is closed then any executions on it will just quietly fail as happens in #55660 via closing the nodes on the test thread and asynchronously using a node client. Closes #55660	2020-04-28 17:27:58 +02:00
Lee Hinman	3b211c1212	Downgrade template update error to a warning for v1 templates (#55611 ) For 7.x, we already implemented the `?prefer_v2_templates` flag and made V2 templates opt-in, so we can relax the error when updating V1 templates to just a warning. This will still be a hard error for 8.0+ Relates to #53101	2020-04-28 09:16:08 -06:00
Armin Braun	51a94102e8	Improve some Byte Array Handling Spots (#55844 ) (#55856 ) Some small memory-saving improvements in `byte[]` handling.	2020-04-28 16:38:48 +02:00
Christos Soulios	fae9ec13dd	Removed ValuesSourceRegistry.registerAny() (#55846 ) * Backports #55747 to 7.x * All ValuesSourceTypes must be registered explicitly * Removed lambdas in ValuesSourceRegistry	2020-04-28 15:44:42 +03:00
Adrien Grand	58c3bb5ae1	Repurpose `ignore_throttled` to be only about frozen indices. (#55047 ) (#55852 ) This has no practical impact on users since frozen indices are the only throttled indices today. However this has an impact on upcoming features that would use search throttling. Filtering out throttled indices made sense a couple years ago, but as we're now improving support for slow requests with `_async_search` and exploring ways to reduce storage costs, this feature has most likely become a trap, that we'd like to not have with upcoming features that would use search throttling. Relates #54058	2020-04-28 14:31:54 +02:00
Tim Brooks	80662f31a1	Introduce mechanism to stub request handling (#55832 ) Currently there is a clear mechanism to stub sending a request through the transport. However, this is limited to testing exceptions on the sender side. This commit reworks our transport related testing infrastructure to allow stubbing request handling on the receiving side.	2020-04-27 16:57:15 -06:00
Igor Motov	2ff858b290	Fix error message for unknown value type (#55821 ) (#55825 ) Fixes confusing error message when unknown value type is specified in a terms aggregation. Adds support for parsing "numeric" and "number" value types. Fixes #55727	2020-04-27 18:34:43 -04:00
weizijun	08d328333a	Append indices to update index setting task name (#55714 ) This change adds index names to the name of the update index setting task so we have more information about the pending tasks.	2020-04-27 17:50:36 -04:00
Julie Tibshirani	4bfd65a375	Remove TODO around aggregating on _index. The _index field can in fact be used in aggregations.	2020-04-27 12:48:20 -07:00
Tal Levy	6ba5148ead	Add geo_shape support for the geo_centroid aggregation (#55602 ) (#55819 ) this commit leverages the new geo_shape doc values to register a new geo_centroid aggregator that works on geo_shape field.	2020-04-27 12:16:10 -07:00
Mark Tozzi	22a98ec279	Aggregation support for Value Scripts that change types (#54830 ) (#55752 )	2020-04-27 09:57:05 -04:00
Jim Ferenczi	b5916ac455	Ignore closed exception on refresh pending location listener (#55799 ) This newly added listener should catch closed exceptions when accessing the internal engine. Closes #55792	2020-04-27 15:06:35 +02:00
Armin Braun	fe9904fbea	More Efficient Blobstore Metadata IO (#55777 ) (#55788 ) No need to copy all these bytes multiple times, especially not when writing a multiple MB global cluster state snapshot through this method.	2020-04-27 11:48:53 +02:00
Adrien Grand	0753d9a35c	Exists queries to MatchNoneQueryBuilder when the field is unmapped (#55785 ) Co-authored-by: Sivagurunathan Velayutham <sivadeva.93@gmail.com> Closes #54062	2020-04-27 11:06:50 +02:00
Armin Braun	4403b69048	Fix NPE in Partial Snapshot Without Global State (#55776 ) (#55783 ) We make sure to filter shard generations for indices that are missing from the metadata when finalizing a partial snapshot (from concurrent index deletion) but we failed to account for the case where we manually build a fake metadata instance for snapshots without the global state. Fixed this by handling missing indices by skipping, same way we do it for filtering the shard generations. Relates #50234	2020-04-27 10:07:09 +02:00
Nhat Nguyen	1a3f9e5a07	Return true for can_match on idle search shards (#55428 ) With this change, we will always return true for can_match requests on idle search shards; otherwise, some shards will never get refreshed if all search requests perform the can_match phase (i.e., total shards > pre_filter_shard_size). Relates #27500 Relates #50043	2020-04-26 22:21:42 -04:00
Nick Knize	b0e8a8a4d1	[Backport] Refactor Spatial Field Mappers (#55696 ) This commit refactors all spatial Field Mappers to a common AbstractGeometryFieldMapper that implements shared parameter functionality (e.g., ignore_malformed, ignore_z_value) and provides a common framework for overriding type parsing, and building in xpack. Common shape functionality is implemented in a new AbstractShapeGeometryFieldMapper that is reused and overridden in GeoShapeFieldMapper, GeoShapeFieldMapperWithDocValues, LegacyGeoShapeFieldMapper, and ShapeFieldMapper. This abstraction provides a reusable foundation for adding new xpack features; such as coordinate reference system support.	2020-04-24 14:05:16 -05:00
Mark Tozzi	87b4979c24	[7.x] Make ValuesSourceRegistry immutable after initialization #55493 (#55697 )	2020-04-24 13:33:38 -04:00
Zachary Tong	715c90bf7d	Aggs must specify a `field` or `script` (or both) (#52226 ) This adds a validation to VSParserHelper to ensure that a field or script or both are specified by the user. This is technically required today already, but throws an exception much deeper in the agg framework and has a very unintuitive error for the user (as well as eating more resources instead of failing early)	2020-04-23 19:23:41 -04:00
Jim Ferenczi	31d1727698	Fix (de)serialization of async search failures (#55688 ) The (de)serialization code of the async search response cannot handle exceptions that extend ElasticsearchException (e.g. ScriptException). This commit fixes this bug by serializing the error with the more generic StreamInput#writeException.	2020-04-24 00:44:43 +02:00

4834 Commits