OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nhat Nguyen	a3906dcef3	Enable cancellation for msearch requests (#61337 ) Today multi-search requests are not cancellable because we create regular tasks instead of cancellable ones for them.	2020-08-19 16:59:17 -04:00
Nik Everett	9789e6d154	Migrate some field mapper tests to ESTestCase (#61301 ) (#61346 ) This switches a few tests for field mappers from `ESSingleNodeTestCase` to `ESTestCase` because, in general, we prefer to avoid `ESSingleNodeTestCase` when we can because it is slow and "big". "Big" here means that it pulls in an entire node, making it difficult to reason about what you are testing.	2020-08-19 15:43:49 -04:00
Armin Braun	4a53ae203e	Fix SharedClusterSnapshotRestoreIT.testThrottling (#61323 ) (#61328 ) We have to set the recovery setting to `0` if we don't want throttling from recoveries. Otherwise the randomized value used for this setting in tests can lead to throttling unexpectedly. Closes #61311	2020-08-19 15:26:32 +02:00
Nik Everett	70128e022b	fix stats aggregator tests With #60683 we stopped forcing aggregating all docs using a single Aggregator which made some of our accuracy assumptions about the stats aggregator incorrect. This adds a test that does the forcing and asserts the old accuracy and adds a test without the forcing with much looser accuracy guarantees. Closes #61132	2020-08-19 08:55:43 -04:00
Alan Woodward	b1aa0d8731	Fix fieldnames field type for pre-6.1 indexes (#61322 ) The FieldNamesFieldMapper field has different behaviour for indexes created in clusters earlier than v6.1, and the code to deal with this was still using the vestigial FieldType field of FieldMapper in its indexing path. This meant that documents added after an upgrade were not correctly indexing their field names field. This commit corrects the parseCreateField method to use the default field type. Fixes #61305	2020-08-19 12:59:09 +01:00
David Turner	389f7779e7	Report more details of unobtainable ShardLock (#61255 ) Today a common reason for a `ShardLockObtainFailedException` is when a shard is removed from a node and then assigned straight back to it again before the node has had a chance to shut the previous shard instance down. For instance, this can happen if a node briefly leaves the cluster holding a primary with no in-sync replicas. The message in this case is typically as follows: obtaining shard lock timed out after 5000ms, previous lock details: [shard creation] trying to lock for [shard creation] This is pretty hard to interpret, and doesn't raise the important question: "why didn't the shard shut down sooner?" With this change we reword the message a bit, report the age of the shard lock, and adjust the details to report that the lock is held by a closing shard: obtaining shard lock for [starting shard] timed out after [5000ms], lock already held for [closing shard] with age [12345ms] Relates #38807	2020-08-19 06:36:28 +01:00
Nhat Nguyen	08b0e78ef4	Log more info when search ops higher than expected (#61108 ) We have seen a situation where the total search operations are higher than expected. Unfortunately, we did not have enough info to figure it out. This commit adds the failures to the error to provide more context and adjusts the log level in case of failure to debug.	2020-08-18 15:20:41 -04:00
Rory Hunter	bd7236cd65	Version bump for 7.9.0 release	2020-08-18 16:07:43 +01:00
Dimitrios Liappis	c870640cbd	[7.x] Introduce 6.8.13 as a version (#61198 ) Introduce version 6.8.13 to branch 7.x	2020-08-18 17:07:16 +03:00
Armin Braun	58d07b2ffc	Remove Unused ByteBufferReference (#61116 ) (#61250 ) We only work with heap byte buffers at this point and those we can and do unwrap the `byte[]` ourselves and use `BytesArray` instead of a needless level of indirection via `ByteBuffer`.	2020-08-18 10:53:40 +02:00
Armin Braun	6ffa7f0737	Fix testConcurrentSnapshotDeleteAndDeleteIndex (#61228 ) (#61249 ) There is a corner case here in which during a partial snapshot the index is deleted right between starting the snapshot in the CS and the data node getting to work on it, causing the data node to fail that shard snapshot and making the snapshot `PARTIAL`. Closes #61208	2020-08-18 10:45:30 +02:00
Mark Tozzi	db1df6cc30	[7.x] Remove a bunch of type boilerplate from Aggs (#60852 ) (#61031 )	2020-08-17 12:13:05 -04:00
Nik Everett	1b7bbafd81	Add method to make random DateFormatter pattern (backport of #60613 ) (#61213 ) Adds a method to make a random date `DateFormatter` pattern. We expect this'll be useful for runtime fields to compare their formatting with the standard date field.	2020-08-17 10:57:52 -04:00
Christoph Büscher	6866396e1d	Improve 'ignore_malformed' handling for dates (#60211 ) Currently we occasionally can get ArithmeticException from parsing bad input values on 'date' fields that are passed on even if 'ignore_malformed' is set. This change adds this exception to the ones we already catch for malformed values. Closes #52634	2020-08-17 16:18:08 +02:00
David Turner	b21cb7f466	Reduce allocations when persisting cluster state (#61159 ) Today we allocate a new `byte[]` for each document written to the cluster state. Some of these documents may be quite large. We need a buffer that's at least as large as the largest document, but there's no need to use a fresh buffer for each document. With this commit we re-use the same `byte[]` much more, only allocating it afresh if we need a larger one, and using the buffer needed for one round of persistence as a hint for the size needed for the next one.	2020-08-17 13:45:31 +01:00
David Turner	f3e0c60896	Restrict testing of legacy discovery to tests (#61178 ) The 7.x branch preserves the legacy discovery mechanism from 6.x purely for running internal cluster tests; this mechanism is otherwise completely untested and unsupported. However it is still technically possible to use it outside of the test suite if you dig through the source code to work out what settings need to be set. With this change we make it impossible to use this mechanism in production. Closes #61177	2020-08-17 11:05:27 +01:00
Ryan Ernst	9cb45dafab	Unwrap transport exception when using transport client (#60801 ) The ReloadSecureSettingsIT makes requests to the reload settings apis. In 7.x, the client used from the integ test infrastructure may be a transport client. In that case, the expected exception type is wrapped in a remote exception, which causes the test to fail (though it will hang indefinitely due to not counting down the latch, see https://github.com/elastic/elasticsearch/pull/60800). This commit adds unwrapping of the remote exception to get the underlying expected exception. closes #51546	2020-08-13 10:24:04 -07:00
Ryan Ernst	c73ab0b16f	Ensure hotthreads do not produce node failures (#61073 ) This commit adds an assertion that no sub-nodes requests within hot threads failed. relates #58842	2020-08-13 10:22:19 -07:00
Armin Braun	3143b5ea47	Stabilize testSnapshotDeleteRelocatingPrimaryIndex (#61088 ) (#61096 ) Use transport blocking to make relocation take forever instead of relying on the relocation to take long enough to clash with the snapshot. Closes #61069	2020-08-13 16:26:56 +02:00
Yannick Welsch	8e775394ac	Fix testNoMasterActionsMetadataWriteMasterBlock (#60605 ) We can't assert on the specific exception, unfortunately.	2020-08-13 10:48:16 +02:00
David Turner	c6276ae177	Fail invalid incremental cluster state writes (#61030 ) It is disastrous if we commit an incremental cluster state update without having written the full state first. We assert that this doesn't happen, but it is hard to fully test the myriad ways that things might fail in a messy production environment. Given the disastrous consequences it is worth erring on the side of caution in this area. This commit fails invalid writes even if assertions are disabled.	2020-08-12 19:46:19 +01:00
Lee Hinman	e3df64a429	[7.x] Add data tiers (hot, warm, cold, frozen) as custom node roles (#60994 ) (#61045 ) This commit adds the `data_hot`, `data_warm`, `data_cold`, and `data_frozen` node roles to the x-pack plugin. These roles are intended to be the base for the formalization of data tiers in Elasticsearch. These roles all act as data nodes (meaning shards can be allocated to them). Nodes with the existing `data` role act as though they have all of the roles configured (each is a hot, warm, cold, and frozen node). This also includes a custom `AllocationDecider` that allows the user to configure the following settings on a cluster level: - `cluster.routing.allocation.require._tier` - `cluster.routing.allocation.include._tier` - `cluster.routing.allocation.exclude._tier` And in index settings: - `index.routing.allocation.require._tier` - `index.routing.allocation.include._tier` - `index.routing.allocation.exclude._tier` Relates to #60848	2020-08-12 11:06:23 -06:00
Alan Woodward	5b3c10c379	Fix serialization of AllFieldMapper (#61044 ) Converting AllFieldMapper to parametrized form ended up not being run through BWC testing, resulting in an incorrect implementation being committed. This commit fixes the serialization, and adds unit tests as well as unmuting the BWC test that uncovered the bug. Fixes #60986	2020-08-12 17:32:55 +01:00
Yannick Welsch	8c488de576	Gracefully handle null in checkSettingsForTerminalDeprecation Fixes a test failure after backport to 7.x	2020-08-12 18:03:52 +02:00
Yannick Welsch	25404cbe3d	Provide option to allow writes when master is down (#60605 ) Elasticsearch currently blocks writes by default when a master is unavailable. The cluster.no_master_block setting allows a user to change this behavior to also block reads when a master is unavailable. This PR introduces a way to now also still allow writes when a master is offline. Writes will continue to work as long as routing table changes are not needed (as those require the master for consistency), or if dynamic mapping updates are not required (as again, these require the master for consistency). Eventually we should switch the default of cluster.no_master_block to this new mode.	2020-08-12 16:56:45 +02:00
Yannick Welsch	6644f2283d	Do not access snapshot repo on dedicated voting-only master node (#61016 ) Today a snapshot repository verification ensures that all master-eligible and data nodes have write access to the snapshot repository (and can see each other's data) since taking a snapshot requires data nodes and the currently elected master to write to the repository. However, a dedicated voting-only master-eligible node is not a data node and will never be the elected master so we should not require it to have write access to the repository. Closes #59649	2020-08-12 16:56:45 +02:00
Yannick Welsch	af519be9cb	Ensure repo not in use for wildcard repo deletes (#60947 ) Repositories can't be unregistered when they are actively being used for snapshots or restores. Wildcard repository deletes could silently bypass the "repo in use" checks however, which is now fixed.	2020-08-12 16:38:06 +02:00
Dan Hermann	538c93c923	Adding Hit counts and Miss counts for QueryCache exposed through REST api. (#60114 ) (#60993 )	2020-08-12 08:21:09 -05:00
Alan Woodward	c81dc2b8b7	Convert KeywordFieldMapper to parametrized form (#60645 ) This makes KeywordFieldMapper extend ParametrizedFieldMapper, with explicitly defined parameters. In addition, we add a new option to Parameter, restrictedStringParam, which accepts a restricted set of string options.	2020-08-12 11:41:11 +01:00
markharwood	66098e0bf4	Search fix: query_string regex/wildcard searches not working on wildcard fields (#60959 ) (#61010 ) The Query string parser was not delegating the construction of wildcard/regex queries to the underlying field type. The wildcard field has special data structures and queries that operate on them so cannot rely on the basic regex/wildcard queries that were being used for other fields. Closes #60957	2020-08-12 10:44:52 +01:00
Armin Braun	32423a486d	Simplify and Speed up some Compression Usage (#60953 ) (#61008 ) Use thread-local buffers and deflater and inflater instances to speed up compressing and decompressing from in-memory bytes. Not manually invoking `end()` on these should be safe since their off-heap memory will eventually be reclaimed by the finalizer thread which should not be an issue for thread-locals that are not instantiated at a high frequency. This significantly reduces the amount of byte copying and object creation relative to the previous approach which had to create a fresh temporary buffer (that was then resized multiple times during operations), copied bytes out of that buffer to a freshly allocated `byte[]`, used 4k stream buffers needlessly when working with bytes that are already in arrays (`writeTo` handles efficient writing to the compression logic now) etc. Relates #57284 which should be helped by this change to some degree. Also, I expect this change to speed up mapping/template updates a little as those make heavy use of these code paths.	2020-08-12 11:06:23 +02:00
Nik Everett	ce9c5f0e46	Fix diversified sample tests The test assumed that the aggregator only ran once but we turned that off. This turns it back on.	2020-08-11 17:49:43 -04:00
Jay Modi	2fa6448a15	System index reads in separate threadpool (#60927 ) This commit introduces a new thread pool, `system_read`, which is intended for use by system indices for all read operations (get and search). The `system_read` pool is a fixed thread pool with a maximum number of threads equal to the lesser of half of the available processors or 5. Given the combination of both get and read operations in this thread pool, the queue size has been set to 2000. The motivation for this change is to allow system read operations to be serviced in spite of the number of user searches. In order to avoid a significant performance hit due to pattern matching on all search requests, a new metadata flag is added to mark indices as system or non-system. Previously created system indices will have the flag added to their metadata upon upgrade to a version with this capability. Additionally, this change also introduces a new class, `SystemIndices`, which encapsulates logic around system indices. Currently, the class provides a method to check if an index is a system index and a method to find a matching index descriptor given the name of an index. Relates #50251 Relates #37867 Backport of #57936	2020-08-11 12:16:34 -06:00
Julie Tibshirani	a93be8d577	Handle nested arrays in field retrieval. (#60981 ) We accept _source values with multiple levels of arrays, such as `"field": [[[1, 2]]]`. This PR ensures that field retrieval can handle nested arrays by unwrapping the arrays before parsing the values.	2020-08-11 10:22:16 -07:00
Mark Tozzi	ab8518fb5b	[7.x] Extensibility for Composite Agg #59648 (#60842 )	2020-08-11 09:14:33 -04:00
Alan Woodward	54279212cf	Make MetadataFieldMapper extend ParametrizedFieldMapper (#59847 ) (#60924 ) This commit cuts over all metadata field mappers to parametrized format.	2020-08-11 09:02:28 +01:00
Armin Braun	3e2dfc6eac	Remove GCS Bucket Exists Check (#60899 ) (#60914 ) Same as https://github.com/elastic/elasticsearch/pull/43288 for GCS. We don't need to do the bucket exists check before using the repo, that just needlessly increases the necessary permissions for using the GCS repository.	2020-08-11 09:54:27 +02:00
Julie Tibshirani	d51eae6e9f	Prevent loading 'fields' with stored fields disabled. (#60938 ) Because the 'fields' option loads from _source (which is a stored field), it is not possible to retrieve 'fields' when stored_fields are disabled. This also fixes #60912, where setting stored_fields: _none_ prevented the _ignored fields from being loaded and caused a parsing exception.	2020-08-10 15:40:27 -07:00
Nik Everett	0286d0a769	Move distance_feature query building into MFT (#60614 ) (#60846 ) This moves the `distance_feature` query building out of `DistanceFeatureQueryBuilder` and into subclasses of `MappedFieldType`. Without this we don't have a chance of supporting this for runtime fields. In general I'm not sad to see the `instanceof`s go. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-08-10 16:05:17 -04:00
Julie Tibshirani	b216340f50	Make `FetchPhase` logic more readable. (#60779 ) * Factor out FieldsVisitor#postProcess call. * Swap logical order for normal and nested documents. * Extract the method createStoredFieldsVisitor.	2020-08-10 11:04:54 -07:00
Nik Everett	dfd502f9ca	Rework checking if a year is a leap year (#60585 ) (#60790 ) This way is faster, saving about 8% on the microbenchmark that rounds to the nearest month. That is in the hot path for `date_histogram` which is a very popular aggregation so it seems worth it to at least try and speed it up a little. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-08-10 12:45:34 -04:00
Jim Ferenczi	f30f1f04e2	Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce (#60816 ) This commit removes the ability to test the top level result of an aggregator before it runs the final reduce. All aggregator tests that use AggregatorTestCase#search are rewritten with AggregatorTestCase#searchAndReduce in order to ensure that we test the final output (the one sent to the end user) rather than an intermediary result that could be different. This change also removes spurious commits triggered on top of a random index writer. These commits slow down the tests and are redundant with the commits that the random index writer performs.	2020-08-10 17:23:00 +02:00
David Turner	f44c28b595	Deprecate and ignore join timeout (#60872 ) There is no point in timing out a join attempt any more once a cluster is entirely in 7.x. Timing out and retrying with the same master is pointless, and an in-flight join attempt to one master no longer blocks attempts to join other masters. This commit deprecates this unnecessary setting and removes its effect from the joining process. Relates #60873 which removes this setting in master.	2020-08-10 13:57:41 +01:00
Martijn van Groningen	64bb082f9b	Improve error message for non append-only writes that target data stream (#60874 ) Backport of #60809 to 7.x branch. Closes #60581	2020-08-10 13:18:59 +02:00
Alan Woodward	e8d9185045	Cut over IPFieldMapper to parametrized form (#60602 ) This commit makes IpFieldMapper extend ParametrizedFieldMapper. It also updates the IpFieldMapper docs to add the ignore_malformed parameter, which was not previously documented.	2020-08-10 11:01:10 +01:00
David Turner	1f49e0b9d0	Fix testRerouteOccursOnDiskPassingHighWatermark (#60869 ) Sometimes this test would refresh the disk stats so quickly that it hit the refresh rate limiter even though it was almost completely disabled. This commit allows the rate limiter to be completely disabled. Closes #60587	2020-08-10 09:39:44 +01:00
Ryan Ernst	ddcfbec569	Add assert message for multiple lines in osprobe (#60796 ) Several /proc files are expected to contain a single line. We assert on this in tests, but the contents of the file are lost and the assertion therefore lacks important information to debug why the file appeared to have multiple lines. This commit dumps the contents of the file on assertion failure. relates #59284	2020-08-06 15:53:30 -07:00
Ryan Ernst	fc38af363e	Ensure latch is counted down when assertion trips (#60800 ) The ReloadSecureSettingsIT uses latches to ensure coordination across requests to the underlying in memory cluster. However, in the case of an expected failure, if the assertion fails, the latch will never be counted down, and will cause the test to hang indefinitely. This commit ensures the latch is always counted down with a try/finally. relates #51546	2020-08-06 15:33:46 -07:00
Jim Ferenczi	98119578a1	Disable sort optimization on search collapsing (#60838 ) Collapse search queries that sort by a field can throw an ArrayStoreException due to a bug in the [sort optimization](https://github.com/elastic/elasticsearch/pull/51852) introduced in 7.7.0. Collapsed searches were not supposed to be eligible for this sort optimization, so this change explicitly filters them from this new feature.	2020-08-06 21:37:12 +02:00
Jim Ferenczi	14980ff97e	Fix AOOBE when setting min_doc_count to 0 in significant_terms (#60823 ) This commit fixes the computation of the subset size on empty buckets (doc count of 0). The aggregator test refactoring in #60683 revealed this bug.	2020-08-06 18:57:09 +02:00
David Turner	721198c29e	Increase logging in testRerouteOccursOnDiskPassingHighWatermark (#60817 ) Relates #60587	2020-08-06 14:08:09 +01:00
Armin Braun	a2c7991e96	Fix CompressibleBytesOutputStreamTests (#60815 ) (#60822 ) Since #60730 the `bytes` field can be `null`. This adds the missing `null` check to the test override. Closes #60814	2020-08-06 15:07:48 +02:00
David Turner	273a6f916d	AwaitsFix for #60814	2020-08-06 12:56:28 +01:00
Tim Brooks	2f76c48ea7	Propagate forceExecution when acquiring permit (#60634 ) Currently the transport replication action does not propagate the force execution parameter when acquiring the indexing permit. The logic to acquire the index permit supports force execution, so this parameter should be propagated. Fixes #60359.	2020-08-05 15:57:40 -06:00
Francisco Fernández Castaño	b4044004aa	Add recovery state tracking for Searchable Snapshots (#60751 ) This pull request adds recovery state tracking for Searchable Snapshots. In order to track recoveries for searchable snapshot backed indices, this pull request adds a new type of RecoveryState. This new RecoveryState instance is able to deal with the small differences that arise during searchable snapshot recoveries. Those differences can be summarized as follows: - The Directory implementation that's provided by SearchableSnapshots marks the snapshot files as reused during recovery. In order to keep track of the recovery process as the cache is pre-warmed, those files shouldn't be marked as reused. - Once the shard is created, the cache starts its pre-warming phase, meaning that we should keep track of those downloads during that process and tie the recovery to this pre-warming phase. The shard is considered recovered once this pre-warming phase has finished. Backport of #60505	2020-08-05 17:41:49 +02:00
Jake Landis	f3752ba1d5	7.x support new path for re-index java-api doc (#60319 ) This commit uses the new location for the reindex java-api documentation. Temporary files have been left behind to pacify the docs build. related #60339	2020-08-05 09:05:07 -05:00
Armin Braun	ebfb93ff26	Improve some BytesStreamOutput Usage (#60730 ) (#60736 ) * Stop redundantly creating a `0` length `ByteArray` that is never used * Add efficient way to get a minimal size copy of the bytes in a `BytesStreamOutput` * Avoid multiple redundant `byte[]` copies in search cache key creation	2020-08-05 15:51:06 +02:00
Yannick Welsch	9f6f66f156	Fail searchable snapshot shards on invalid license (#60722 ) Implements license degradation behavior for searchable snapshots. Snapshot-backed shards are failed when the license becomes invalid, and shards won't be reallocated. After a valid license is put in place again, shards are allocated again.	2020-08-05 13:14:15 +02:00
Igor Motov	959690a64a	Refactor extendedBounds to use DoubleBounds (#60556 ) (#60681 ) Refactors extendedBounds to use DoubleBounds instead of 2 variables. This is a follow up for #59175	2020-08-04 16:45:47 -04:00
Alan Woodward	b3ae5d26bd	Move mapper validation to the mappers themselves (#60072 ) (#60649 ) Currently, validation of mappers (checking that cross-references are correct, limits on field name lengths and object depths, multiple definitions, etc) is performed by the MapperService. This means that any mapper-specific validation, for example that done on the CompletionFieldMapper, needs to be called specifically from core server code, and so we can't add validation to mappers that live in plugins. This commit reworks the validation framework so that mapper-specific validation is done on the Mapper itself. Mapper gets a new `validate(MappingLookup)` method (already present on `MetadataFieldMapper` and now pulled up to the parent interface), which is called from a new `DocumentMapper.validate()` method. All the validation code currently living on `MapperService` moves either to individual mapper implementations (FieldAliasMapper, CompletionFieldMapper) or into `MappingLookup`, an altered `DocumentFieldMappers` which now knows about object fields and can check for duplicate definitions, or into DocumentMapper which handles soft limit checks.	2020-08-04 14:39:20 +01:00
Armin Braun	212ce22d15	Optimize CS Persistence Stream Use (#60643 ) (#60647 ) In the metadata persistence logic we failed to override the bulk write method on the FilterOutputStream resulting in all the writes to it running byte-by-byte in a loop adding a large number of bounds checks needlessly.	2020-08-04 15:06:57 +02:00
Armin Braun	859ad761bb	Fix Broken Stream Close in writeRawValue (#60625 ) (#60644 ) Small oversight in #56078 that only showed up during backporting where a stream copy was turned from a non-closing to a closing one. Enhanced part of a test in this PR to make it show up in master also even though we practically never use this method with stream targets that actually close.	2020-08-04 13:39:52 +02:00
Armin Braun	7ae9dc2092	Unify Stream Copy Buffer Usage (#56078 ) (#60608 ) We have various ways of copying between two streams and handling thread-local buffers throughout the codebase. This commit unifies a number of them and removes buffer allocations in many spots.	2020-08-04 09:54:52 +02:00
Julie Tibshirani	f99584c6f3	Avoid reloading _source for every inner hit. (#60632 ) Previously if an inner_hits block required _source, we would reload and parse the root document's source for every hit. This PR adds a shared SourceLookup to the inner hits context that allows inner hits to reuse parsed source if it's already available. This matches our approach for sharing the root document ID. Relates to #32818.	2020-08-03 17:12:27 -07:00
Julie Tibshirani	fc63f8224f	Simplify class hierarchy for ordinals field data. (#60606 ) This PR simplifies the hierarchy for ordinals field data classes: * Remove `AbstractIndexFieldData`, since only `AbstractIndexOrdinalsFieldData` inherits directly from it. * Make `SortedSetOrdinalsIndexFieldData` extend `AbstractIndexOrdinalsFieldData`. This lets us remove some redundant code.	2020-08-03 09:58:29 -07:00
Yannick Welsch	3409e019d2	Ignore shutdown when retrying recoveries (#60586 ) Avoids failures when shutting down a node.	2020-08-03 15:14:38 +02:00
Nik Everett	2cde43b799	Allows nanosecond resolution in search_after (backport of #60328 ) (#60426 ) Allows nanosecond resolution in search_after (#60328) This fixes `search_after` to properly parse string formatted dates that have nanosecond resolution. Closes #52424	2020-08-03 08:17:48 -04:00
David Turner	d2ddf8cd6a	Improve deserialization failure logging (#60577 ) Today when a node fails to properly deserialize a transport message with a parent task we log the following relatively uninformative message: java.lang.IllegalStateException: Message not fully read (response) for requestId [9999], handler [org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler/org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler/org.elasticsearch.transport.TransportService$6@abcdefgh], error [false]; resetting In particular, the wrapping of the listener in the `TransportService` obscures all clues as to the source of the problem, e.g. the action name or the identity of the underlying listener. This commit exposes the inner listener to the logs. Also if the listener is wrapped with `ContextPreservingActionListener` then its identity is similarly hidden. This commit also exposes the wrapped listener in this case. Relates #38939	2020-08-03 11:51:01 +01:00
Armin Braun	3270cb3088	More Efficient Writes for Snapshot Shard Generations (#60458 ) (#60575 ) Same as #59905 but for shard level metadata. Since we want to retain the ability to do safe+atomic writes for non-uuid shard generations this PR has to create two separate write paths for both kinds of shard generations.	2020-08-03 11:11:36 +02:00
Armin Braun	204efe9387	Add Repository Setting to Disable Writing index.latest (#60448 ) (#60576 ) Writing the `index.latest` blob is unnecessary unless the contents of the repository are to be used as a URL-repository. Also, in some edge cases, the fact that `index.latest` is the only blob in the repository that regularly gets overwritten was causing compatibility issues with some backing blobstores (Azure no-overwrite policy, Hitachi S3 equivalent). This commit changes behavior so that snapshots do not fail if writing `index.latest` fails, and adds a setting to disable writing `index.latest`.	2020-08-03 11:11:24 +02:00
Andrei Dan	ac258f10d6	Data streams: throw ResourceAlreadyExists exception (#60518 ) (#60536 ) For consistency reasons (and reducing the overload of IllegalArgumentException) this changes the exception thrown when trying to create a data stream that already exists. (cherry picked from commit ac2184c4614bba0f3ee377da49aea0daed98bab4) Signed-off-by: Andrei Dan <andrei.dan@elastic.co>	2020-08-01 16:31:09 +01:00
Julie Tibshirani	f1d4fd8c3e	Correct name of IndexFieldData#loadGlobalDirect. (#60492 ) It seems 'localGlobalDirect' was just a typo.	2020-07-31 10:53:21 -07:00
Jim Ferenczi	8db896d290	Fix race condition in SearchPhaseControllerTests#testPartialMergeFailure (#60488 ) This change ensures that we call the listener for partial merge failure before calling the completion listener in order to avoid race condition in tests. Closes #60446	2020-07-31 16:29:20 +02:00
Julie Tibshirani	8ac81a3447	Remove IndexFieldData#clear since it is unused. (#60475 ) This method was never called. It also seemed tricky that calling a method on `IndexFieldData` could clear the contents of a shared cache.	2020-07-30 14:07:55 -07:00
Julie Tibshirani	dfd7f226f0	Clarify SourceLookup sharing across fetch subphases. (#60484 ) The `SourceLookup` class provides access to the _source for a particular document, specified through `SourceLookup#setSegmentAndDocument`. Previously the search context contained a single `SourceLookup` that was shared between different fetch subphases. It was hard to reason about its state: is `SourceLookup` set to the expected document? Is the _source already loaded and available? Instead of using a global source lookup, the fetch hit context now provides access to a lookup that is set to load from the hit document. This refactor closes #31000, since the same `SourceLookup` is no longer shared between the 'fetch _source phase' and script execution.	2020-07-30 13:22:31 -07:00
Dan Hermann	5e5503ac28	Change severity of negative stats messages from WARN to DEBUG (#60375 ) (#60444 )	2020-07-30 06:06:13 -05:00
Armin Braun	3bf4c01d8e	Don't Allocate Redundant Pages in BigArrays (#60201 ) (#60441 ) The oversize algorithm was allocating more pages than necessary to accommodate `minTargetSize`. An example would be that a 16k page size and 15k `minTargetSize` would result in a new size of 32k (2 pages). The difference between the minimum number of necessary pages and the estimated size then keeps growing as sizes increase. I don't think there is much value in preemptively allocating pages by over-sizing aggressively since the behavior of the system is quite different from that of a single array where over-sizing avoids copying once the minimum target size is more than a single page. Relates #60173 which led me to this when `BytesStreamOutput` would allocate a large number of never used pages during serialization of repository metadata.	2020-07-30 11:09:58 +02:00
Armin Braun	a2c49a4f02	Reduce Heap Use during Shard Snapshot (#60370 ) (#60440 ) Instances of `BlobStoreIndexShardSnapshots` can be of non-trivial size. In case of snapshotting a larger number of shards the previous execution order would lead to memory use proportional to the number of shards for these objects. With this change, the number of these objects on heap is bounded by the size of the snapshot pool (except for in the BwC format path). This PR makes it so that they are written to the repository at the earliest possible point in time so that they can be garbage collected. If shard generations are used, we can safely write these right at the beginning of the shard snapshot. If shard generations are not used we can only write them at the end of the shard snapshot after all other blobs have been written. Closes #60173	2020-07-30 10:45:00 +02:00
Igor Motov	00a1949852	Streamline GeoJSON to map serialization (#60413 ) (#60429 ) Optimizes GeoJSON to map serialization when retrieving spatial data through fields. Closes #60259	2020-07-29 17:56:56 -04:00
Julie Tibshirani	5359417ec3	Minor clean-up around search highlight context. (#60422 ) * Rename SearchContextHighlight -> SearchHighlightContext. * Rename HighlighterContext to FieldHighlightContext. * Make the search highlight context immutable. * Avoid storing SearchHighlightContext on HighlighterContext.	2020-07-29 11:39:17 -07:00
Tim Brooks	85fdf959ad	Add configured indexing memory limit to node stats (#60414 ) This commit adds the configured memory limit to the node stats API.	2020-07-29 12:28:21 -06:00
Nhat Nguyen	9d4a64e749	Allow CCR on nodes with legacy roles only (#60093 ) CCR will stop functioning if the master node is on 7.8, but data nodes are before that version because the master node considers that all data nodes do not have the remote cluster client role. This commit allows CCR to work on data nodes with legacy roles only. Relates #54146 Relates #59375	2020-07-29 10:57:31 -04:00
Armin Braun	8429b4ace8	Fix Queued Snapshot Deletes After Finalization Failure (#60285 ) (#60379 ) This fixes the behavior of the snapshot state machine in the following edge case: 1. Snapshot is running 2. Delete/abort for the snapshot is started 3. Snapshot fails to finalize We were not removing the failed snapshot id from the list of snapshots to delete in the delete. This led to an error in the repository, which throws if we try to delete a non-existing snapshot. This commit updates the deletions in progress by removing the failed snapshot id. The fact that this could lead to snapshot delete entries without any snapshot ids is not optimized on purpose because it allows for another attempt at writing clean `RepositoryData` and will run basic cleanup on the repository (root level blobs and stale indices) and thus bring the repository back into a clean state after a failed finalization. Closes #60274	2020-07-29 15:54:18 +02:00
Armin Braun	381cec2ba9	Fix ConcurrentSnapshotsIT.testMasterFailOverWithQueuedDeletes (#60307 ) (#60376 ) The test assumed that the master fail-over would always work out as a single step. This is not guaranteed however and we can randomly see master failing over twice, in which case the transport listener will be failed on the node that stops being leader and we have to catch an exception for the deletes as well just like we do for the snapshot. Closes #60262	2020-07-29 15:54:00 +02:00
Armin Braun	0778274b72	Fix IPV6 Scope Id in InetAddressesTests (#60368 ) (#60369 ) Follow up to #60360, turns out at times the name of an interface that isn't loopback is not a valid scope id.	2020-07-29 13:16:12 +02:00
Armin Braun	1f6a3765e4	Fix NPE in SnapshotsInProgress Constructor (#60355 ) Merge oversight between cleanups that removed `null` for `shards` and this corner case spot of no indices in a snapshot. Closes #60330	2020-07-29 10:47:28 +02:00
Armin Braun	4307a45153	Fix IPV6 Scope ID Test (#60360 ) (#60363 ) Use real scope id from first available interface instead of `lo` which might not exist on non-Linux platforms. Closes #60332	2020-07-29 09:55:37 +02:00
Armin Braun	753fd4f6bc	Cleanup and optimize More Serialization Spots (#59959 ) (#60331 ) Same as #59626 for a few more spots.	2020-07-29 07:20:44 +02:00
Zachary Tong	e3d85feecd	Mute testForStringIPv6WithScopeIdInput test Tracking issue: https://github.com/elastic/elasticsearch/issues/60332	2020-07-28 15:05:19 -04:00
Igor Motov	0dd53b76bd	Add aggregation list to node info (#60074 ) (#60256 ) Adds a full list of supported aggregations to the node info API. This list will be used in transform tests and telemetry mapping tests that will be added as follow-up PRs. Fixes #59774	2020-07-28 14:06:12 -04:00
Julie Tibshirani	c7bfb5de41	Add search `fields` parameter to support high-level field retrieval. (#60258 ) This feature adds a new `fields` parameter to the search request, which consults both the document `_source` and the mappings to fetch fields in a consistent way. The PR merges the `field-retrieval` feature branch. Addresses #49028 and #55363.	2020-07-28 10:58:20 -07:00
James Rodewig	025e7bee80	[DOCS] Fix allowed values for numeric sort types (#60176 ) (#60299 ) Co-authored-by: Philippus Baalman <philippus@gmail.com>	2020-07-28 13:51:59 -04:00
Howard	11b86b3f88	Remove unused clusterService instance in ActionModule. (#59826 )	2020-07-28 10:36:04 -07:00
jimczi	4e4ed6ee48	fix race condition in SearchPhaseControllerTests#consumerTestCase	2020-07-28 18:27:39 +02:00
David Turner	9450ea08b4	Log and track open/close of transport connections (#60297 ) Transport connections between nodes remain in place until one or other node shuts down or the connection is disrupted by a flaky network. Today it is very difficult to demonstrate that transient failures and cluster instability are caused by the network even though this is often the case. In particular, transport connections open and close without logging anything, even at `DEBUG` level, making it very hard to quantify the scale of the problem or to correlate the networking problems with external events. This commit adds the missing `DEBUG`-level logging when transport connections open and close, and also tracks the total number of transport connections a node has opened as a measure of the stability of the underlying network.	2020-07-28 17:08:04 +01:00
Armin Braun	9222070f22	Fix Test Failure in testCorrectCountsForDoneShards (#60254 ) (#60286 ) * Fix Test Failure in testCorrectCountsForDoneShards Fixing the freak edge case where the node shard status request returns before the node was able to send the state update request to master and update the cluster state. Without this change, the snapshot shard status would report as `DONE` once the data node has finished updating the shard in the cluster state. If the data node then drops out of the cluster before the state has been updated, then the status will jump to "FAILURE" because the master updates the state once the data node leaves the cluster. Closes #60247	2020-07-28 15:46:18 +02:00
David Turner	b78caa5c00	Add more useful toString on cluster state observers (#60277 ) Today if a cluster state observer's listener takes a long time to process a notification then we log the following rather useless warning message: [notifying listener [org.elasticsearch.cluster.ClusterStateObserver$ObserverClusterStateListener@12345678]] took [34567ms] This commit adds a handful of simple `toString()` implementations in order to identify the owner of the listener in question.	2020-07-28 12:56:58 +01:00
Jim Ferenczi	1144534093	Executes incremental reduce in the search thread pool (#58461 ) (#60275 ) This change forks the execution of partial reduces in the coordinating node to the search thread pool. It also ensures that partial reduces are executed sequentially and asynchronously in order to limit the memory and cpu that a single search request can use but also to avoid blocking a network thread. If a partial reduce fails with an exception, the search request is cancelled and the reporting of the error is delayed to the start of the fetch phase (when the final reduce is performed). This ensures that we cleanup the in-flight search requests before returning an error to the user. Closes #53411 Relates #51857	2020-07-28 13:40:47 +02:00
Armin Braun	d39622e17e	Stop Serializing RepositoryData Twice when Writing (#60107 ) (#60269 ) We can save one round of serializing `RepositoryData` on the write path. This also leads to somewhat better compression because we compress larger chunks in one go potentially when compared to serializing and compressing in one go. Also, fixed the double wrapping of collections when copying the repository data instance via the `withGenId`.	2020-07-28 11:42:14 +02:00
Yannick Welsch	a55c869aab	Properly document keepalive and other tcp options (#60216 ) Keepalive options are not well-documented (only in transport section, although also available at http and network level). Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: James Rodewig <40268737+jrodewig@users.noreply.github.com>	2020-07-28 11:10:04 +02:00
Yannick Welsch	ffe114b890	Set specific keepalive options by default on supported platforms (#59278 ) keepalives tell any intermediate devices that the connection remains alive, which helps with overzealous firewalls that are killing idle connections. keepalives are enabled by default in Elasticsearch, but use system defaults for their configuration, which often times do not have reasonable defaults (e.g. 7200s for TCP_KEEP_IDLE) in the context of distributed systems such as Elasticsearch. This PR sets the socket-level keep_alive options for network.tcp.{keep_idle,keep_interval} to 5 minutes on configurations that support it (>= Java 11 & (MacOS \|\| Linux)) and where the system defaults are set to something higher than 5 minutes. This helps keep the connections alive while not interfering with system defaults or user-specified settings unless they are deemed to be set too high by providing better out-of-the-box defaults.	2020-07-28 11:10:04 +02:00
Armin Braun	fac5953d13	Let `isInetAddress` utility understand the scope ID on ipv6 (#60172 ) (#60263 ) Make `isInetAddress` utility method understand the scope ID on ipv6. Fixes #60115 Co-authored-by: Yang Cheng <chengyang2048@163.com>	2020-07-28 09:37:39 +02:00
James Rodewig	cb4c21fa7b	[DOCS] Fix typo in adapt auto expand replica comments (#60187 ) (#60239 ) Co-authored-by: Howard <danielhuang@tencent.com>	2020-07-27 14:18:53 -04:00
weizijun	5df043d0e0	Fix wait_for_no_initializing_shards params (#58379 )	2020-07-27 14:03:26 -04:00
Adrien Grand	f1f275c91b	Add 6.8.12 and 7.8.2 version constants.	2020-07-27 19:26:22 +02:00
Tim Brooks	df0f68da23	Identify the operation type in rejected exception (#60138 ) Currently, we do not categorize the operation type in the rejection exception message when we reject an indexing operation for indexing memory limits. This commit fixes this to ensure that it is identified as coordinating, primary, or replica.	2020-07-27 10:09:46 -06:00
Tim Brooks	47922c9e4a	Fix indexing pressure replica rejections logic (#60150 ) Currently the logic to reject replica operations is evaluated before adding the additional bytes of the current operation. This means that the first replica operation which should be rejected will be allowed to proceed. This commit fixes this logic and adds a unit-level test to ensure indexing pressure behavior is correct.	2020-07-27 10:00:01 -06:00
Nik Everett	a451dd87aa	Reduce merge map memory overhead in the Variable Width Histogram Aggregation (#59366 ) (#60171 ) When a document which is distant from existing buckets gets collected, the `variable_width_histogram` will create a new bucket and then insert it into the ordered list of buckets. Currently, a new merge map array is created to move this bucket. This is very expensive as there might be thousands of buckets. This PR creates `mergeBuckets(UnaryOperator<Long> mergeMap)` methods in `BucketsAggregator` and `MergingBucketsDefferingCollector`, and updates the `variable_width_histogram` to use them. This eliminates the need to create an entire merge map array for each new bucket and reduces the memory overhead of the algorithm. Co-authored-by: James Dorfman <jamesdorfman@users.noreply.github.com>	2020-07-27 09:23:06 -04:00
Armin Braun	196ed6b90e	Remove Mostly Redundant Deleting in FsBlobContainer (#60117 ) (#60195 ) In almost all cases we write uuid named files via this method. Preemptively deleting just wastes IO ops, we can delete after a write failed and retry the write to cover the few cases where we actually do an overwrite.	2020-07-27 14:05:41 +02:00
David Roberts	89466eefa5	Don't require separate privilege for internal detail of put pipeline (#60190 ) Putting an ingest pipeline used to require that the user calling it had permission to get nodes info as well as permission to manage ingest. This was due to an internal implementation detail that was not visible to the end user. This change alters the behaviour so that a user with the manage_pipeline cluster privilege can put an ingest pipeline regardless of whether they have the separate privilege to get nodes info. The internal implementation detail now runs as the internal _xpack user when security is enabled. Backport of #60106	2020-07-27 10:44:48 +01:00
Armin Braun	25a75d05c0	Fix Test Failure in testConcurrentlyChangeRepositoryContentsInBwCMode (#60095 ) There is a very unlikely but possible test failure in this test. The `SnapshotsService` continues iterating over queued operations after resolving the transport listener. This can lead to a situation where the moved repository data is not picked up when running the delete (even though we have the concurrent modifications BwC mode activated) concurrently. I fixed this in the test so that the test still verifies that this setting works. Technically speaking, one could add logic to the way we queue and execute repo operations to address this special case. Since this case only comes about with the concurrent modifications setting enabled (and the setting is gone in master already) I don't really see a reason to improve the logic here since we should always fail queued up repo operations on concurrent modification for safety reasons.	2020-07-27 09:33:38 +02:00
Nhat Nguyen	0031dea9cc	Fix race in testSendSnapshotSendsOps (#59831 ) There is a race between increasing and getting the global checkpoint in the test as indexTranslogOperations can be executed concurrently. Closes #59492	2020-07-23 16:22:40 -04:00
Ignacio Vera	db183c89ed	Refactor HyperLogLogPlusPlus to separate algorithms and internal data representation (#60104 ) (#60109 )	2020-07-23 15:07:05 +02:00
David Turner	bf7e53a91e	Remove node-level canAllocate override (#59389 ) Today there is a node-level `canAllocate` override which the balancer uses to ignore certain nodes to which it is certain no more shards can be allocated. In fact this override only ignores nodes which have hit the rarely-used `cluster.routing.allocation.total_shards_per_node` limit, so this optimization doesn't have a meaningful impact on real clusters. This commit removes this unnecessary fast path from the balancer, and also removes all the machinery needed to support it.	2020-07-23 08:48:59 +01:00
Armin Braun	43a6ff5eb1	Optimize some Spots around Closing Resources (#60049 ) (#60096 ) The single element `close` calls go through a very inefficient path that includes creating a one element list. `releaseOnce` is only with a single non-null input in production in two spots so no need for varargs and any complexity here. `ReleasableBytesStreamOutput` does not require any `releaseOnce` wrapping because we already have that kind of logic implemented in `org.elasticsearch.common.util.AbstractArray` (which we were wrapping here) already.	2020-07-23 08:49:06 +02:00
Julie Tibshirani	aa57bbd422	Consolidate validation for 'docvalue_fields'. (#60065 ) This improves modularity and also fixes some issues when `docvalue_fields` is used within `inner_hits` or the `top_hits` agg: * We previously didn't resolve wildcards in field names. * We also forgot to enforce the limit `index.max_docvalue_fields_search`.	2020-07-22 17:26:58 -07:00
Armin Braun	ebb6677815	Formalize and Streamline Buffer Sizes used by Repositories (#59771 ) (#60051 ) Due to complicated access checks (reads and writes execute in their own access context) on some repositories (GCS, Azure, HDFS), using a hard coded buffer size of 4k for restores was needlessly inefficient. By the same token, the use of stream copying with the default 8k buffer size for blob writes was inefficient as well. We also had dedicated, undocumented buffer size settings for HDFS and FS repositories. For these two we would use a 100k buffer by default. We did not have such a setting for e.g. GCS though, which would only use an 8k read buffer which is needlessly small for reading from a raw `URLConnection`. This commit adds an undocumented setting that sets the default buffer size to `128k` for all repositories. It removes wasteful allocation of such a large buffer for small writes and reads in case of HDFS and FS repositories (i.e. still using the smaller buffer to write metadata) but uses a large buffer for doing restores and uploading segment blobs. This should speed up Azure and GCS restores and snapshots in a non-trivial way as well as save some memory when reading small blobs on FS and HDFS repositories.	2020-07-22 21:06:31 +02:00
Tim Brooks	ba01540d7e	Implement human readable indexing pressure stats (#60058 ) The indexing pressure stats do not currently have human readable variants. This commit adds human readable variants and updates the documentation.	2020-07-22 12:07:59 -06:00
Jay Modi	c8ef2e18f7	Thread safe clean up of LocalNodeModeListeners (#60007 ) This commit continues on the work in #59801 and makes other implementors of the LocalNodeMasterListener interface thread safe in that they will no longer allow the callbacks to run on different threads and possibly race each other. This also helps address other issues where these events could be queued to wait for execution while the service keeps moving forward thinking it is the master even when that is not the case. In order to accomplish this, the LocalNodeMasterListener no longer has the executorName() method to prevent future uses that could encounter this surprising behavior. Each use was inspected and if the class was also a ClusterStateListener, the implementation of LocalNodeMasterListener was removed in favor of a single listener that combined the logic. A single listener is used and there is currently no guarantee on execution order between ClusterStateListeners and LocalNodeMasterListeners, so a future change there could cause undesired consequences. For other classes, the implementations of the callbacks were inspected and if the operations were lightweight, the overridden executorName method was removed to use the default, which runs on the same thread. Backport of #59932	2020-07-22 08:02:18 -06:00
Luca Cavanna	702c997819	ParametrizedFieldMapper to run validators against default value (#60042 ) Sometimes there is the need to make a field required in the mappings, and validate that a value has been provided for it. This can be done through a validator when using ParametrizedFieldMapper, but validators need to run also when a value for a field has not been specified. Relates to #59332	2020-07-22 14:12:38 +02:00
Armin Braun	c06c9fb966	Fix BwC Snapshot INIT Path (#60006 ) There were two subtle bugs here from backporting #56911 to 7.x. 1. We passed `null` for the `shards` map which isn't nullable any longer when creating `SnapshotsInProgress.Entry`, fixed by just passing an empty map like the `null` handling did in the past. 2. The removal of a failed `INIT` state snapshot from the cluster state tried removing it from the finalization loop (the set of repository names that are currently finalizing). This will trip an assertion since the snapshot failed before its repository was put into the set. I made the logic ignore the set in case we remove a failed `INIT` state snapshot to restore the old logic to exactly as it was before the concurrent snapshots backport to be on the safe side here. Also, added tests that explicitly call the old code paths because as can be seen from initially missing this, the BwC tests will only run in the configuration new version master, old version nodes every so often and having a deterministic test for the old state machine seems the safest bet here. Closes #59986	2020-07-22 10:09:55 +02:00
Jake Landis	55216dabb4	[7.x] Per processor description for verbose simulate (#58207 ) (#60008 ) For ingest node processors a per processor description was recently added. This commit displays that description in the verbose output of the pipeline simulation. related #57906	2020-07-21 17:32:45 -05:00
Nik Everett	49f365ddfd	Fix bug in deep pipeline agg serialization (#59984 ) In #54716 I removed pipeline aggregators from the aggregation result tree and caused us to read them from the request. This saves a bunch of round trip bytes, which is neat. But there was a bug in the backwards compatibility logic. You see, we still have to give the pipeline aggregations to nodes older than 7.8 over the wire because that is how they know what pipelines to run. They have the pipelines in the request but they don't read them. They use the ones in the response tree. Anyway, we had a bug where we were never sending pipelines defined two levels down. So while you are upgrading the pipeline wouldn't run. Sometimes. If the data node of the "first" result was post-7.8 and the coordinating node was pre-7.8. This fixes the bug.	2020-07-21 16:03:15 -04:00
David Turner	dde568caf7	Fix scheduling of ClusterInfoService#refresh (#59880 ) Today the `InternalClusterInfoService` uses the `LocalNodeMasterListener` interface to start/stop its operations. Since the `onMaster` and `offMaster` methods are called on the `MANAGEMENT` threadpool, there's no guarantee that they run in the correct sequence, which could result in an elected master failing to regularly update the cluster info. Since this service is also a `ClusterStateListener` we may as well drop the usage of the `LocalNodeMasterListener` interface and simply update the status of the local node on the applier thread in `clusterChanged` to ensure consistency. Additionally, today the `InternalClusterInfoService` uses a simple flag to track whether the local node is the elected master or not. If the node stops being the master and then starts again within a few seconds then the scheduled updates from the old mastership might carry on running in addition to the ones for the new mastership. This commit addresses that by tracking the identity of the scheduled update job and creating a new job for each mastership.	2020-07-21 17:14:49 +01:00
Alan Woodward	a0ad1a196b	Wrap up building parametrized TypeParsers (#59977 ) The TypeParser implementations of all ParametrizedFieldMapper descendant classes are essentially the same - stateless, requiring the construction of a Builder object, and calling parse on it before returning it. We can make this easier (and less error-prone) to implement by wrapping the logic up into a final class, which takes a function to produce the Builder from a name and parser context.	2020-07-21 16:00:11 +01:00
Nik Everett	6f6076e208	Drop some params from IndexFieldData.Builder (backport of #59934 ) (#59972 ) We never used the `IndexSettings` parameter and we only used the `MappedFieldType` parameter to get the name of the field which we already know everywhere where we build the `IFD.Builder`. This allows us to drop a fair bit of ceremony from a couple of tests.	2020-07-21 10:28:59 -04:00
Luca Cavanna	5e17f00ecf	Tweak toXContent implementation of ParametrizedFieldMapper (#59968 ) ParametrizedFieldMapper overrides `toXContent` from `FieldMapper`, yet it could override `doXContentBody` and rely on the `toXContent` from the base class. Additionally, this allows to make `doXContentBody` final. Also, toXContent is still overridden only to make it final.	2020-07-21 16:01:51 +02:00
Przemyslaw Gomulka	19fe3e511f	Deprecate camel case date format backport(#59555 ) (#59948 ) Camel case date formats are deprecated and snake case should be used instead. backports #59555	2020-07-21 15:56:44 +02:00
Armin Braun	e37bfe8a5f	Stop Checking if Segment Data Blob Exists before Write (#59905 ) (#59971 ) With uuid named segment data blobs there is no reason to ensure no overwrites are happening for these blobs when writing. On the contrary, at least on Azure this check can conflict with the SDK's retrying and cause upload failures randomly.	2020-07-21 15:23:42 +02:00
Yannick Welsch	07784a0b16	CCR recoveries using wrong setting for chunk sizes (#59597 ) The default chunk size for CCR file-based recoveries was wrongly set to 40MB instead of 1MB.	2020-07-21 13:56:06 +02:00
Armin Braun	cefaa17c52	Simplify CheckSumBlobStoreFormat and make it more Reusable (#59888 ) (#59950 ) Refactored `CheckSumBlobStoreFormat` so it can more easily be reused in other functionality (i.e. upcoming repair logic). Simplified away constant `failIfAlreadyExists` parameter and removed the atomic write method and its tests. The atomic write method was only used in a single spot and that spot has now been adjusted to work the same way writing root level metadata works.	2020-07-21 11:20:56 +02:00
Armin Braun	5b92596fad	Cleanup and Optimize Multiple Serialization Spots (#59626 ) (#59936 ) Follow up to #59606 using some of the new infrastructure and making similar cleanups (and due to at times better handling of size hints and empty collections also optimizations in the stream utility methods this also means speedups) in various spots in the core codebase.	2020-07-21 10:06:56 +02:00
Julie Tibshirani	8647872a1e	Simplify structure for parsing points. (#59938 ) Previously we constructed a GeometryFormat object and delegated point parsing to it. This wasn't a good fit conceptually because each GeometryFormat instance didn't represent a distinct point format.	2020-07-20 17:11:43 -07:00
Nik Everett	b2ca19484a	Allocate slightly less per bucket (#59740 ) (#59873 ) This replaces the data structure that we use to resolve bucket ids in bucketing aggs that are inside other bucketing aggs. This replaces the "legoed together" data structure with a purpose built `LongLongHash` with semantics similar to `LongHash`, except that it has two `long`s as keys instead of one. The microbenchmarks show a fairly substantial performance gain on the hot path, around 30%. Rally's higher level benchmarks show anywhere from 0 to 7% speed improvements. Not as much as I'd hoped, but nothing to sneeze at. And, after all, we are allocating slightly less data per owningBucketOrd, which is always nice.	2020-07-20 10:43:11 -04:00
Stéphane Campinas	bcebdfe5b1	fix handling of alias filter in SearchService#canMatch (#59368 ) The check against the alias filter should be done after the request is rewritten. Close #59367	2020-07-20 16:25:15 +02:00
David Turner	b75207a09f	Remove sporadic min/max usage estimates from stats (#59755 ) Today `GET _nodes/stats/fs` includes `{least,most}_usage_estimate` fields for some nodes. These fields have rather strange semantics. They are only reported on the elected master and on nodes that have been the elected master since they were last restarted; when a node stops being the elected master these stats remain in place but we stop updating them so they may become arbitrarily stale. This means that these statistics are pretty meaningless and impossible to use correctly. Even if they were kept up to date they're never reported for data-only nodes anyway, despite the fact that data nodes are the ones where we care most about disk usage. The information needed to compute the path with the least/most available space is already provided in the rest of the stats output, so we can treat the inclusion of these stats as a bug and fix it by simply removing them in this commit. Since these stats were always optional and mostly omitted (for opaque reasons) this is not considered a breaking change.	2020-07-20 15:22:04 +01:00
Lee Hinman	8c7d414a3b	[7.x] Fix retrieving data stream stats for a DS with multiple backing indices (#59806 ) (#59810 ) Backports the following commits to 7.x: Fix retrieving data stream stats for a DS with multiple backing indices (#59806)	2020-07-17 16:56:07 -06:00
Nik Everett	514b2f3414	Clean up a few of vwh's rough edges (#59341 ) (#59807 ) This cleans up a few rough edges in the `variable_width_histogram`, mostly found by @wwang500: 1. Setting its tuning parameters in an unexpected order could cause the request to fail. 2. We checked that the maximum number of buckets was both less than 50000 and MAX_BUCKETS. This drops the 50000. 3. Fixes a divide by 0 that can occur if the `shard_size` is 1. 4. Fixes a divide by 0 that can occur if the `shard_size * 3` overflows a signed int. 5. Requires `shard_size * 3 / 4` to be at least `buckets`. If it is less than `buckets` we will very consistently return fewer buckets than requested. For the most part we expect folks to leave it at the default. If they change it, we expect it to be much bigger than `buckets`. 6. Allocate a smaller `mergeMap` when initially bucketing requests that don't use the entire `shard_size * 3 / 4`. It's just a waste. 7. Default `shard_size` to `10 * buckets` rather than `100`. It looks like that was our intention the whole time. And it feels like it'd keep the algorithm humming along more smoothly. 8. Default the `initial_buffer` to `min(10 * shard_size, 50000)` like we've documented it rather than `5000`. Like the point above, this feels like the right thing to do to keep the algorithm happy. Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>	2020-07-17 15:16:09 -04:00
Lee Hinman	f6b08a3115	[7.x] Allow simulating existing composable index template (#59733 ) (#59798 ) Backports the following commits to 7.x: Allow simulating existing composable index template (#59733)	2020-07-17 13:10:07 -06:00
Nik Everett	95e6e4a452	Small cleanup for IndexFieldData (#59724 ) (#59800 ) This drops `IndexComponent` from `IndexFieldData` because it wasn't doing anything other than forcing us to perform a bunch of ceremony to build them.	2020-07-17 13:38:15 -04:00
Tal Levy	c9ab7bb651	Fix bug in circuit-breaker check for geoshape grid aggregations (#57962 ) (#59741 ) There was a bug in the geoshape circuit-breaker check where the hash values array was being allocated before its new size was accounted for by the circuit breaker. Fixes #57847.	2020-07-17 09:26:00 -07:00
Christoph Büscher	f4ff5fe93b	Add `zero_terms_query` support to `match_phrase_prefix` (#58822 ) (#59784 ) Currently `match_phrase_prefix` doesn't support `zero_terms_query` like the other match-type queries. This change adds this support. Closes #58468	2020-07-17 17:23:23 +02:00
Benjamin Trent	b7f30fc929	[7.x] Adding new `require_alias` option to indexing requests (#58917 ) (#59769 ) * Adding new `require_alias` option to indexing requests (#58917) This commit adds the `require_alias` flag to requests that create new documents. This flag, when `true` prevents the request from automatically creating an index. Instead, the destination of the request MUST be an alias. When the flag is not set, or `false`, the behavior defaults to the `action.auto_create_index` settings. This is useful when an alias is required instead of a concrete index. closes https://github.com/elastic/elasticsearch/issues/55267	2020-07-17 10:24:58 -04:00
Alan Woodward	65f6fb8e94	Shortcut mapping update if the incoming mapping version is the same as the current mapping version (#59517 ) (#59772 ) Currently, when we apply a cluster state change to a shard on a non-master node, we check to see if the mappings need to be updated by comparing the decompressed serialized mappings from the update against the serialized version of the shard's existing mappings. However, we already have a much simpler way of checking this, by comparing mapping versions on the index metadata of the old and new states. This commit adds a shortcut to MapperService.updateMappings() that compares these mapping versions, and ignores the merge if they are equal.	2020-07-17 14:53:09 +01:00
Alan Woodward	b29d368b52	Convert DateFieldMapper to parametrized format (#59429 ) (#59759 ) This commit makes DateFieldMapper extend ParametrizedFieldMapper, declaring its parameters explicitly. As well as changes to DateFieldMapper itself, there are some changes to dynamic mapping code to ensure that dynamically detected date formats are passed through to new date mapper builders.	2020-07-17 12:46:18 +01:00
Przemko Robakowski	790fbbcd87	[7.x] Fix handling of final pipelines when destination is changed (#59522 ) (#59746 ) * Fix handling of final pipelines when destination is changed (#59522) This change fixes final pipelines if destination index is changed during pipeline run: -final pipelines can't change destination anymore, exception is thrown if they try to -if request/default pipeline changes destination final pipeline from old index won't be executed -if request/default pipeline changes destination and new index has final pipeline it will be executed -default pipeline from new index won't be executed Additionally TransportBulkAction.resolvePipelines was moved to IngestService as it's needed for resolving pipelines from new index. Tests were moved accordingly. Closes #57968	2020-07-17 11:13:48 +02:00
Tim Brooks	b6e6a8c090	Fix replication operation transient retry test (#58205 ) After the work to retry transient replication failures, the local and global checkpoint test metadata can be incremented on a different thread than the test thread. This appears to introduce an extremely rare scenario where this data is not visible for later test assertions. This commit fixes the issue by using synchronized maps.	2020-07-16 16:01:47 -06:00
Martijn van Groningen	0096238df1	Replaced _data_stream_timestamp meta field's 'path' option with 'enabled' option (#59727 ) Backport #59503 to 7.x and adjusted exception messages. Relates to #59076	2020-07-16 22:29:40 +02:00
Igor Motov	2408803fad	Adds hard_bounds to histogram aggregations (#59175 ) (#59656 ) Adds a hard_bounds parameter to explicitly limit the buckets that a histogram can generate. This is especially useful in case of open ended ranges that can produce a very large number of buckets.	2020-07-16 15:31:53 -04:00
Alan Woodward	10be10c99b	Migrate CompletionFieldMapper to parametrized format (#59691 ) This adds a number of new optional parameters to Parameter, including: * custom serialization (to handle analyzers) * deprecated parameter names * parameter validation * allowing default values to be based on the values of other parameters We preserve the previous serialization format of CompletionFieldMapper, always emitting most fields, in order to meet mapping checks in mixed version clusters, where the mapper service will check that mappings have been correctly parsed and updated by checking their serialized outputs.	2020-07-16 19:15:00 +01:00
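
The page arithmetic mentioned in commit 3bf4c01d8e (a 16k page size with a 15k `minTargetSize` needs one page, not two) can be sketched as below. This is a hypothetical Python illustration rather than the `BigArrays` code itself: the names `min_pages` and `paged_size` are assumptions made for this example, and it shows only the rounding up to the smallest whole number of pages.

```python
def min_pages(min_target_size: int, page_size: int) -> int:
    """Smallest number of whole pages that can hold min_target_size bytes.

    Hypothetical helper for illustration; not part of Elasticsearch.
    """
    if page_size <= 0:
        raise ValueError("page_size must be positive")
    if min_target_size < 0:
        raise ValueError("min_target_size must be non-negative")
    # Integer ceiling division: round up without going through floats.
    return (min_target_size + page_size - 1) // page_size


def paged_size(min_target_size: int, page_size: int) -> int:
    """Capacity in bytes after rounding up to whole pages."""
    return min_pages(min_target_size, page_size) * page_size


if __name__ == "__main__":
    page = 16 * 1024
    # The example from the commit message: 15k fits in a single 16k page.
    print(min_pages(15 * 1024, page), paged_size(15 * 1024, page))
```

Rounding up to the minimum page count keeps the gap between requested and allocated bytes below one page, whereas the over-sizing described in the commit message let that gap grow with the requested size.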

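The ordering problem described in commit 47922c9e4a (the replica limit was checked before the bytes of the current operation were counted) can be illustrated with a small sketch. It is a simplified, hypothetical model, not the Elasticsearch indexing pressure implementation: the class `ReplicaPressure` and its methods are names invented for this example.

```python
class ReplicaPressure:
    """Toy byte budget for replica operations (hypothetical, for illustration)."""

    def __init__(self, limit_bytes: int) -> None:
        self.limit_bytes = limit_bytes
        self.current_bytes = 0

    def try_acquire(self, operation_bytes: int) -> bool:
        # Account for the incoming operation first, then compare to the limit.
        # Comparing current_bytes alone would admit the first operation that
        # pushes the total over the limit.
        projected = self.current_bytes + operation_bytes
        if projected > self.limit_bytes:
            return False
        self.current_bytes = projected
        return True

    def release(self, operation_bytes: int) -> None:
        # Return the bytes of a completed operation to the budget.
        self.current_bytes = max(0, self.current_bytes - operation_bytes)


if __name__ == "__main__":
    pressure = ReplicaPressure(limit_bytes=100)
    print(pressure.try_acquire(60))  # accepted: 60 <= 100
    print(pressure.try_acquire(50))  # rejected: 60 + 50 > 100
```

In this model, a check against `current_bytes` alone would accept the second operation and leave 110 bytes in flight against a 100-byte limit, which is the behaviour the commit message describes fixing.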

5252 Commits