OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nhat Nguyen	4813728783	Remove leniency in reset engine from translog (#44711 ) Replaying operations from the local translog must never fail as those operations were processed successfully on the primary before and the mapping is up to update already. This change removes leniency during resetting engine from translog in IndexShard and InternalEngine.	2019-07-29 16:31:45 -04:00
Jack Conradson	1a21682ed0	Fix JodaCompatibleZonedDateTime casts in Painless (#44874 ) This is a temporary fix during the Joda to Java datetime transition. This will implicitly cast a JodaCompatibleZonedDateTime to a ZonedDateTime for both def and static types. This is necessary to insulate users from needing to know about JodaCompatibleZonedDateTime explicitly.	2019-07-29 12:05:26 -07:00
Igor Motov	b6cef227a5	Geo: fix geo query decomposition (#44924 ) The recent refactoring introduced an issue where queries where not going through the decomposition processing. Fixes #44891	2019-07-29 11:48:24 -04:00
Luca Cavanna	a3cc32da64	TaskListener#onFailure to accept Exception instead of Throwable (#44946 ) TaskListener accepts today Throwable in its onFailure method. Though looking at where it is called (TransportAction), it can never be notified of a Throwable. This commit changes the signature of TaskListener#onFailure so that it accepts an `Exception` rather than a `Throwable` as second argument.	2019-07-29 16:47:19 +02:00
Michał Perlak	245c9b7914	Optimize Min and Max BKD optimizations (#44315 ) MinAggregator - skip BKD optimization when no result found after 1024 lookups. MaxAggregator - skip unnecessary conversions.	2019-07-29 10:04:39 -04:00
Yannick Welsch	24873dd3e3	Do not block transport thread on startup (#44939 ) We currently block the transport thread on startup, which has caused test failures. I think this is some kind of deadlock situation. I don't think we should even block a transport thread, and there's also no need to do so. We can just reject requests as long we're not fully set up. Note that the HTTP layer is only started much later (after we've completed full start up of the transport layer), so that one should be completely unaffected by this. Closes #41745	2019-07-29 11:35:17 +02:00
Armin Braun	f5efafd4d6	Cleanup Deadcode o.e.indices (#44931 ) (#44938 ) * none of this is used anywhere	2019-07-29 10:38:35 +02:00
Igor Motov	cfc8d17bb4	Geo: refactor geo mapper and query builder (#44884 ) Refactors out the indexing and query generation logic out of the mapper and query builder into a separate unit-testable classes.	2019-07-26 16:48:31 -04:00
Yannick Welsch	1561ab5420	Guard open connection call in RemoteClusterConnection (#44921 ) Fixes an issue where a call to openConnection was not properly guarded, allowing an exception to bubble up to the uncaught exception handler, causing test failures. Closes #44912	2019-07-26 22:27:45 +02:00
Tanguy Leroux	e1b626b947	Ensure index is green in SimpleClusterStateIT.testIndicesOptions() (#44893 ) SimpleClusterStateIT testIndicesOptions failed in #44817 because it tries to close an index at the beginning of the test. With random index settings, it is possible that the index has a high number of shards (10) and replicas (1), which means that on CI this index can take time to be fully allocated. The close index request can fail in the case where replicas are still recovering operations. Thiscommit adds a simple ensureGreen() at the beginning of the test to be sure that all replicas are started before trying to close the index. closes #44817	2019-07-26 17:07:53 +02:00
Armin Braun	1340ff19bc	Fix Test Failure in ScalingThreadPoolTests (#44898 ) (#44901 ) * Due to #44894 some constellations log a deprecation warning here now * Fixed by checking for that	2019-07-26 17:05:50 +02:00
Tanguy Leroux	8848fcfb22	Ensure cluster is stable in ShrinkIndexIT.testShrinkThenSplitWithFailedNode (#44860 ) The test ShrinkIndexIT.testShrinkThenSplitWithFailedNode sometimes fails because the resize operation is not acknowledged (see #44736). This resize operation creates a new index "splitagain" and it results in a cluster state update (TransportResizeAction uses MetaDataCreateIndexService.createIndex() to create the resized index). This cluster state update is expected to be acknowledged by all nodes (see IndexCreationTask.onAllNodesAcked()) but this is not always true: the data node that was just stopped in the test before executing the resize operation might still be considered as a "faulty" node (and not yet removed from the cluster nodes) by the FollowersChecker. The cluster state is then acked on all nodes but one, and it results in a non acknowledged resize operation. This commit adds an ensureStableCluster() check after stopping the node in the test. The goal is to ensure that the data node has been correctly removed from the cluster and that all nodes are fully connected to each before moving forward with the resize operation. Closes #44736	2019-07-26 10:14:27 +02:00
Jason Tedor	6ea2b5dec0	Deprecate setting processors to more than available (#44889 ) Today the processors setting is permitted to be set to more than the number of processors available to the JVM. The processors setting directly sizes the number of threads in the various thread pools, with most of these sizes being a linear function in the number of processors. It doesn't make any sense to set processors very high as the overhead from context switching amongst all the threads will overwhelm, and changing the setting does not control how many physical CPU resources there are on which to schedule the additional threads. We have to draw a line somewhere and this commit deprecates setting processors to more than the number of available processors. This is the right place to draw the line given the linear growth as a function of processors in most of the thread pools, and that some are capped at the number of available processors already.	2019-07-26 17:06:44 +09:00
Ignacio Vera	821f6f893b	Upgrade to Lucene 8.2.0 release (#44859 ) (#44892 )	2019-07-26 08:14:59 +02:00
Nhat Nguyen	d128188c28	Return seq_no and primary_term in noop update (#44603 ) With this change, we will return primary_term and seq_no of the current document if an update is detected as a noop. We already return the version; hence we should also return seq_no and primary_term. Relates #42497	2019-07-25 19:16:56 -04:00
Yannick Welsch	bd8470e738	Asynchronously connect to remote clusters (#44825 ) Refactors RemoteClusterConnection so that it no longer blockingly connects to remote clusters. Relates to #40150	2019-07-25 22:59:59 +02:00
Yannick Welsch	0ce841915c	Add Clone Index API (#44267 ) Adds an API to clone an index. This is similar to the index split and shrink APIs, just with the difference that the number of primary shards is kept the same. In case where the filesystem provides hard-linking capabilities, this is a very cheap operation. Indexing cloning can be done by running `POST my_source_index/_clone/my_target_index` and it supports the same options as the split and shrink APIs. Closes #44128	2019-07-25 22:02:28 +02:00
Ryan Ernst	03dd22b56c	Add missing ZonedDateTime methods for joda compat layer (#44829 ) While joda no longer exists in the apis for 7.x, the compatibility layer still exists with helper methods mimicking the behavior of joda for ZonedDateTime objects returned for date fields in scripts. This layer was originally intended to be removed in 7.0, but is now likely to exist for the lifetime of 7.x. This commit adds missing methods from ChronoZonedDateTime to the compat class. These methods were not part of joda, but are needed to act like a real ZonedDateTime. relates #44411	2019-07-25 11:45:57 -07:00
Julie Tibshirani	acb7f599a3	Fix an NPE when requesting inner hits and _source is disabled. (#44836 ) This PR makes two changes to FetchSourceSubPhase when _source is disabled and we're in a nested context: * If no source filters are provided, return early to avoid an NPE. * If there are source filters, make sure to throw an exception. The behavior was chosen to match what currently happens in a non-nested context.	2019-07-25 10:38:00 -07:00
Nicholas Knize	48757da6e1	[GEO] Fix GeoShapeQueryBuilder to check for valid spatial relations Refactor left out the spatial strategy check in GeoShapeQueryBuilder.relation setter method. This commit adds that check back in.	2019-07-25 11:32:13 -05:00
Nick Knize	133f848e9f	[Geo] Refactor GeoShapeQueryBuilder to derive from AbstractGeometryQueryBuilder (#44780 ) Refactors GeoShapeQueryBuilder to derive from a new AbstractGeometryQueryBuilder that provides common parsing and build logic for spatial geometries. This will allow development of custom geometry queries by extending AbstractGeometryQueryBuilder preventing duplication of common spatial query logic.	2019-07-25 11:32:13 -05:00
Armin Braun	383d7b7713	Cleanup Dead Code in Index Creation (#44784 ) (#44822 ) * Cleanup Dead Code in Index Creation * This is all unused and the state of a create request is always `OPEN`	2019-07-25 10:50:04 +02:00
Yannick Welsch	e0d4544ef6	Close connection manager on current thread in RemoteClusterConnection (#44805 ) The problem is that RemoteClusterConnection closes the connection manager asynchronously, which races with the threadpool being shutdown at the end of the test. Closes #44339 Closes #44610	2019-07-25 09:34:41 +02:00
Igor Motov	f9943a3e53	Geo: deprecate ShapeBuilder in QueryBuilders (#44715 ) Removes unnecessary now timeline decompositions from shape builders and deprecates ShapeBuilders in QueryBuilder in favor of libs/geo shapes. Relates to #40908	2019-07-24 14:27:58 -04:00
David Turner	4cfd2fc6b2	Fix testFirstListElementsToCommaDelimitedStringReportsFirstElementsIfLong (#44785 ) This test can fail (super-rarely) if it generates a list of length 11 containing a duplicate, because the `.distinct()` reduces the list length to 10 and then it is not abbreviated any more. This change generalises the test to cover lists of any random length.	2019-07-24 16:10:41 +01:00
Tanguy Leroux	a8905ef142	[7.x] Add CloseIndexResponse to HLRC (#44349 ) (#44788 ) The CloseIndexResponse was improved in #39687; this commit exposes it in the HLRC. Backport of #44349 to 7.x.	2019-07-24 15:51:01 +02:00
Dimitris Athanasiou	5453188cef	[TEST] Mute SharedClusterSnapshotRestoreIT.testParallelRestoreOperationsFromSingleSnapshot This was supposed to be muted in #44675 and its backports but that PR accidentally muted another test. Relates #44671	2019-07-24 14:28:09 +03:00
Armin Braun	4a3218551c	Fix ConnectionManagerTests (#44769 ) (#44789 ) * In both fake connection validators we were potentially executing the listener twice. This lead to the situation that the locking via `connectionLock` that ensures that each listener is only executed once ever would fail and the lister would run twice (in which case the listeners for that node are already `null` and we get an NPE) * The fact that two different tests fail is due to the fact that we weren't safely shutting down the threadpool which meant the the task that trips the assertion (on the generic pool) would leak into the next test and fail it * Closes #44758	2019-07-24 13:12:57 +02:00
Jason Tedor	4c77d5e2c7	Remove stale permissions from untrusted policy (#44783 ) We have some old permissions lying around, granted to untrusted code from the days of yore when we supported Groovy and Javascript scripting. This commit removes these stale permissions.	2019-07-24 15:59:16 +09:00
Jason Tedor	659ebf6cfb	Notify systemd when Elasticsearch is ready (#44673 ) Today our systemd service defaults to a service type of simple. This means that systemd assumes Elasticsearch is ready as soon as the ExecStart (bin/elasticsearch) process is forked off. This means that the service appears ready long before it actually is, so before it is ready to receive requests. It also means that services that want to depend on Elasticsearch being ready to start can not as there is not a reliable mechanism to determine this. This commit changes the service type to notify. This requires that Elasticsearch sends a notification message via libsystemd sd_notify method. This commit does that by using JNA to invoke this native method. Additionally, we use this integration to also notify systemd when we are stopping.	2019-07-24 14:04:36 +09:00
Armin Braun	818103ff1e	Fix testRetentionLeasesClearedOnRestore (#44754 ) (#44766 ) * Fix this test randomly failing when running into async translog persistence edge case and failing to successfully close index * Also, slightly improve debug logging on close failure * Closes #44681	2019-07-23 21:29:07 +02:00
Igor Motov	9338fc8536	GEO: Switch to using GeoTestUtil to generate random geo shapes (#44635 ) Switches to more robust way of generating random test geometries by reusing lucene's GeoTestUtil. Removes duplicate random geometry generators by moving them to the test framework. Closes #37278	2019-07-23 14:30:41 -04:00
Armin Braun	e5bd3ad0e9	Remove some Dead Code in o.e.transport (#44653 ) (#44734 ) * None of this is used	2019-07-23 10:52:37 +02:00
David Turner	ee23968f05	Ignore unknown fields if overriding node metadata (#44689 ) The `elasticsearch-node override-version` command fails if it cannot read the existing node metadata file. However, it reads this file strictly and fails if there are any unknown fields, which means it will not be useful if we add another field in future. This commit adds leniency to this command, allowing it to ignore any unknown fields and proceed with the downgrade. A downgrade is already unsafe, and the user is already copiously warned about this, so being lenient in this case does not make things much worse.	2019-07-23 08:54:58 +01:00
Jason Tedor	6928a315c4	Check shard limit after applying index templates (#44619 ) Today when creating an index and checking cluster shard limits, we check the number of shards before applying index templates. At this point, we do not know the actual number of shards that will be used to create the index. In a case when the defaults are used and a template would override, we could be grossly underestimating the number of shards that would be created, and thus incorrectly applying the limits. This commit addresses this by checking the shard limits after applying index templates.	2019-07-23 16:50:42 +09:00
Ignacio Vera	05ec970723	Support BucketScript paths of type string and array. (#44694 ) (#44731 )	2019-07-23 09:05:47 +02:00
Ioannis Kakavas	3714cb63da	Allow parsing the value of java.version sysprop (#44017 ) We often start testing with early access versions of new Java versions and this have caused minor issues in our tests (i.e. #43141) because the version string that the JVM reports cannot be parsed as it ends with the string -ea. This commit changes how we parse and compare Java versions to allow correct parsing and comparison of the output of java.version system property that might include an additional alphanumeric part after the version numbers (see [JEP 223[(https://openjdk.java.net/jeps/223)). In short it handles a version number part, like before, but additionally a PRE part that matches ([a-zA-Z0-9]+). It also changes a number of tests that would attempt to parse java.specification.version in order to get the full version of Java. java.specification.version only contains the major version and is thus inappropriate when trying to compare against a version that might contain a minor, patch or an early access part. We know parse java.version that can be consistently parsed. Resolves #43141	2019-07-22 20:14:56 +03:00
Tanguy Leroux	bcb3563dcf	Remove AllocationService.reroute(ClusterState, String, boolean) (#44629 ) This commit removes the method AllocationService.reroute(ClusterState, String, boolean) in favor of AllocationService.reroute(ClusterState, String). Motivations are: there are already 3 other reroute methods in this class this method is always called with the debug parameter set to false almost all tests use the method reroute(ClusterState, String)	2019-07-22 17:12:21 +02:00
Evgenia Badiyanova	5273a548a4	Unmute PendingTasksBlocksIT tests	2019-07-22 10:59:21 -04:00
Armin Braun	6ceae5d586	Document Type of Collections Returned by StreamInput (#44686 ) (#44688 ) * As a result of #44665 the collections returned by the deserialization methods on `StreamInput` may be either mutable or immutable now, this PR adds documentation for that fact	2019-07-22 16:06:34 +02:00
Evgenia Badiyanova	8ee4c4d5ba	Mute some tests in PendingTasksBlocksIT Tracked in #44695.	2019-07-22 09:55:07 -04:00
David Turner	dcb3b2c18a	Fix testPendingTasksWithClusterNotRecoveredBlock In 7.x we cannot start a new master-eligible node before the cluster has formed since we first try and update minimum_master_nodes and this is blocked. This commit changes the test to start a data-only node so that no such adjustment is necessary. Relates #44685	2019-07-22 14:42:20 +01:00
Mayya Sharipova	972a49312c	Fix testQuotedQueryStringWithBoost test (#43385 ) Add more logging to indexRandom Seems that asynchronous indexing from indexRandom sometimes indexes the same document twice, which will mess up the expected score calculations. For example, indexing: { "index" : {"_id" : "1" } } {"important" :"phrase match", "less_important": "nothing important"} { "index" : {"_id" : "2" } } {"important" :"nothing important", "less_important" :"phrase match"} Produces the expected scores: 13.8 for doc1, and 1.38 for doc2 indexing: { "index" : {"_id" : "1" } } {"important" :"phrase match", "less_important": "nothing important"} { "index" : {"_id" : "2" } } {"important" :"nothing important", "less_important" :"phrase match"} { "index" : {"_id" : "3" } } {"important" :"phrase match", "less_important": "nothing important"} Produces scores: 9.4 for doc1, and 1.96 for doc2 which are found in the error logs. Relates to #43144	2019-07-22 08:44:31 -04:00
Przemyslaw Gomulka	a154f49b94	Fix stats in slow logs to be a escaped JSON backport(#44642 ) #44687 Fields in JSON logs should be an escaped JSON fields. It is a broken json value at the moment "stats": "["group1", "group2"]", -> "stats": "[\"group1\", \"group2\"]", This should later be refactored into a JSON array of strings (the same as types in 7.x)	2019-07-22 14:28:39 +02:00
David Turner	0ce3114779	Allow pending tasks before state recovery (#44685 ) Today we block access to the pending tasks API before the cluster has recovered its state. There's no real need to do so, and the master does meaningful work even before performing state recovery so it might sometimes be useful to allow access to this API. This commit changes this API to ignore all cluster blocks. Fixes #44652	2019-07-22 13:15:10 +01:00
Przemyslaw Gomulka	09e9c4cb59	Fix types field in JSON Search Slow Logs (#44641 ) The field has to be defined in log4j2.properties and should be an escaped JSON for now (it is a broken JSON at the moment). This should later be refactored into a JSON array of strings.	2019-07-22 12:02:20 +02:00
Przemyslaw Gomulka	fe20e217a4	Deprecation messages with the same key but different x-opaque-id are allowed backport(#44587 ) #44682 Deprecation logger was filtering log entries by key, that means that if two log messages with the same key are logged from different users, then the second log messages will be filtered. This change allows to log deprecation message with the same key by different users. relates #41354 backport #44587	2019-07-22 11:38:11 +02:00
Armin Braun	a6adcecd20	Fix Tring to Mutate Immutable Collections Fixes two spots where #44665 caused a previously mutable collection to now be read as an immutable one, leading to errors	2019-07-22 11:04:05 +02:00
Armin Braun	b9067ba1ba	Remove Needless Synchronization in FollowersChecker (#44631 ) (#44680 ) * It seems redundant to synchronize here and check that the map hasn't checked via the `isRunning` under the mutex * The map won't change if under the mutex that locks on all the updates to it * Without the mutex it's very unlikely to change inside the method call relative to the likelihood of changing until the generic pool where we check for `isRunning` again anyway -> just remove the synchronization (it's on the IO loop) and check since we do check the running state on the generic pool under the mutex anyway when we actually fail it	2019-07-22 10:57:30 +02:00
Jason Tedor	ff76b0af8b	Copy field names in stored fields context We have to copy the field names otherwise we either have a handle of a list that a caller might mutate or we might mutate when they aren't expecting it, or worse, a handle of a list that is not mutable (and we end up mutating the list). Relates #44665	2019-07-22 17:40:07 +09:00
Alpar Torok	b34ac66d96	Mute multiple tests on Windows (7.x) (#44676 ) * Mute failing test tracked in #44552 * mute EvilSecurityTests tracking in #44558 * Fix line endings in ESJsonLayoutTests * Mute failing ForecastIT test on windows Tracking in #44609 * mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613 * fix mute testDefaultRenormalization * Increase busyWait timeout windows is slow * Mute failure unconfigured node name * mute x-pack internal cluster test windows tracking #44610 * Mute JvmErgonomicsTests on windows Tracking #44669 * mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671 * Mute NodeTests on Windows Tracking #44256	2019-07-22 11:32:29 +03:00
Armin Braun	0e2e83f591	More Efficient Deserialization of Empty Collections in StreamInput (#44665 ) (#44674 ) * We only had the `size == 0` optimization in some but not all spots of deserializing collections in this class, fixed the remaining spots. * Also fixed the a similar spot when deserializing `ThreadContextStruct` that could now be simplified (it was apparently doing it's own version of this optimization for the first map it deserialized before ... but not for the second map -> made it not instantiate anything if both maps are empty since it's always the same object here anyway)	2019-07-22 09:31:12 +02:00
Armin Braun	0ac137a9a1	Optimize some StreamOutput Operations (#44660 ) (#44668 ) * Optimize some StreamOutput Operations * Writing numbers byte by byte adds a lot of unnecessary bounds checks to serialization * Serializing to a threadlocal `byte[]` instead and bulk writing gives about a 50% speedup on `long` and `vlong` (for large numbers) writes and 30% for `int`, `vint` on Linux on an i9 * Using a threadlocal of the maximum string buffer size we used to allocate before also removes allocations when writing strings in general since we now never have to allocate a `byte[]` for that * And don't have to GC one either resolving the TODO removed here	2019-07-22 07:09:32 +02:00
Tal Levy	1a9cfe9110	Removal Streamable (#44647 ) (#44655 ) This commit ends the grand adventure that was the refactoring effort to migrate all usages of Streamable to Writeable. Closes #34389.	2019-07-20 19:10:49 -07:00
Ryan Ernst	4c05d25ec7	Convert Transport Request/Response to Writeable (#44636 ) (#44654 ) This commit converts all remaining TransportRequest and TransportResponse classes to implement Writeable, and disallows Streamable implementations. relates #34389	2019-07-20 11:25:58 -07:00
Ryan Ernst	f4ee2e9e91	Convert direct implementations of Streamable to Writeable (#44605 ) (#44646 ) This commit converts Streamable to Writeable for direct implementations. relates #34389	2019-07-20 08:32:29 -07:00
Tal Levy	7c84636029	Remove StreamOutput #writeOptionalStreamable and #writeStreamableList (#44602 ) (#44643 ) remove usages of writeOptionalStreamable and writeStreambaleList relates #34389.	2019-07-19 15:55:53 -07:00
Ryan Ernst	f193d14764	Convert remaining Action Response/Request to writeable.reader (#44528 ) (#44607 ) This commit converts readFrom to ctor with StreamInput on the remaining ActionResponse and ActionRequest classes. relates #34389	2019-07-19 13:33:38 -07:00
Armin Braun	f028ab43ad	Don't Swallow Interrupt in TransportService#onRequestReceived (#44622 ) (#44627 ) * We shouldn't just swallow the interrupt here quietly and keep going on the IO thread * Currently interrupt continues here just the same way an invocation of `acceptIncomingRequests` woudl have made things continue * Relates #44610	2019-07-19 20:35:29 +02:00
Christoph Büscher	eafe54c81c	Fix AnalysisMode propagation in NamedAnalyzer (#44626 ) NamedAnalyzer should return the same AnalysisMode than any custom analyzer it wraps, otherwise AnalysisMode.ALL. This used to be only CustomAnalyzer in the past, but with the introduction of the ReloadableCustomAnalyzer this needs to be added as an option where the analysis mode gets propagated. Closes #44625	2019-07-19 18:18:43 +02:00
Nikita Glashenko	804476c35d	Remove support for old translog checkpoint formats (#44280 ) This commit removes support for the translog checkpoint format from versions before 6.0.0 since 7.x versions are incompatible with indices from these versions. Relates #44720 Fixes #44210	2019-07-19 16:01:47 +01:00
Przemyslaw Gomulka	597d2dfaf5	Add types field to slow logs in 7.x (#44592 ) By mistake in 7.x types field was removed from slow logs. Types are still present in that version, so this have to be present as a JSON field relates #41354 backport that was causing this #44178	2019-07-19 08:31:00 +02:00
Ryan Ernst	60785a9fa8	Convert several direct uses of Streamable to Writeable (#44586 ) (#44604 ) This commit converts several utility classes that implement Streamable to have StreamInput constructors. It also adds a default version of readFrom to Streamable so that overriding to throw UOE is not necessary. relates #34389	2019-07-18 21:25:44 -07:00
Julie Tibshirani	336364fefe	Convert more classes in 'server' to Writeable. (#44600 ) * Convert GetTask. Convert RemoteInfo. Convert GetFieldMappings. Convert ValidateQueryRequest. Convert MainResponse. Convert MultiGet. Convert Update. Add a missing call to parent constructors. Relates to #34389.	2019-07-18 18:45:10 -07:00
Ryan Ernst	13f46aa801	Convert index and persistent actions/response to writeable (#44582 ) (#44601 ) This commit converts several more classes from streamable to writeable in server, mostly within the o.e.index and o.e.persistent packages. relates #34389	2019-07-18 18:32:09 -07:00
Tal Levy	03f5084ac7	remove usages of #readOptionalStreamable, #readStreamableList. (#44578 ) (#44598 ) This commit removes references to Streamable from StreamInput. This is all a part of the effort to remove Streamable usage. relates #34389.	2019-07-18 16:19:02 -07:00
Ryan Ernst	af093a4095	Convert ShardOperationFailedException to Writeable (#44532 ) (#44580 ) This commit converts subclasses of ShardOperationFailedException to implement ctors with StreamInput instead of readFrom. It also simplifies IndicesShardStoresResponse.Failure to serialize its shardId after the super data. relates #34389	2019-07-18 13:29:19 -07:00
Armin Braun	3b5038b837	Implement Eventually Consistent Mock Repository for SnapshotResiliencyTests (#40893 ) (#44570 ) * Add eventually consistent mock repository for reproducing and testing AWS S3 blob store behavior * Relates #38941	2019-07-18 17:54:54 +02:00
Andrey Ershov	ef6ddd15c6	Revert "Snapshot tool: S3 orphaned files cleanup (#44551)" This reverts commit `09edeeb3`	2019-07-18 17:21:45 +02:00
Andrey Ershov	09edeeb38e	Snapshot tool: S3 orphaned files cleanup (#44551 ) A tool to work with snapshots. Co-authored by @original-brownbear. This commit adds snapshot tool and the single command cleanup, that cleans up orphaned files for S3. Snapshot tool lives in x-pack/snapshot-tool. (cherry picked from commit fc4aed44dd975d83229561090f957a95cc76b287)	2019-07-18 16:38:00 +02:00
David Turner	452f7f67a0	Defer reroute when starting shards (#44539 ) Today we reroute the cluster as part of the process of starting a shard, which runs at `URGENT` priority. In large clusters, rerouting may take some time to complete, and this means that a mere trickle of shard-started events can cause starvation for other, lower-priority, tasks that are pending on the master. However, it isn't really necessary to perform a reroute when starting a shard, as long as one occurs eventually. This commit removes the inline reroute from the process of starting a shard and replaces it with a deferred one that runs at `NORMAL` priority, avoiding starvation of higher-priority tasks. Backport of #44433 and #44543.	2019-07-18 14:10:40 +01:00
Alan Woodward	ec0a0a41db	Remove type parameter from ParserContext (#44478 ) ParserContext.getType() is never called, so we can remove it and tidy up the callers as well.	2019-07-18 11:07:46 +01:00
Luca Cavanna	a8a16e6b08	Associate sub-requests to their parent task in multi search API (#44492 ) Multi search accepts multiple search requests and runs them as independent requests, each one as part of their own search task. Today they don't get associated though with their parent multi search task, which would be useful to monitor which msearch a certain search was part of, if any, and also to cancel all of the sub-requests in case the parent msearch gets cancelled (though this will also require making the multi search task cancellable as a follow-up).	2019-07-18 11:58:30 +02:00
David Turner	7598e0186a	Harmonise indentation of cluster settings (#44540 ) Today the long list of `BUILT_IN_CLUSTER_SETTINGS` is indented differently between `master` and `7.x`. This sometimes makes backporting painful. This commit adjusts the indentation of earlier branches to match that in `master`.	2019-07-18 09:50:53 +01:00
Armin Braun	6565825a13	Avoid CharsRef Allocations in StreamInput (#44488 ) (#44519 ) * Many messages deserialized from a `StreamInput` only contain short strings, some use-cases of instantiating a `StreamInput` don't deserialize any strings * Don't allocate `CharsRef` for small strings to save some allocations (especially on the IO threads) * Lazily allocate a larger `CharsRef` if needed for larger strings like we did before and have it live as long as the `StreamInput` like before as well	2019-07-18 08:52:37 +02:00
Tal Levy	38d2ada84f	deprecate Supplier<Response> constructors in HandledTransportAction (#44456 ) (#44533 ) This commit deprecates all constructors of HandledTransportAction that take in a Supplier instead of a Writeable.Reader for response objects. in addition to the deprecation, the following modules were updated to leverage Writeable - modules:ingest-common - modules:lang-mustache relates #34389.	2019-07-17 22:47:09 -07:00
Tal Levy	075a3f0e99	remove usage of ActionType#(String) (#44459 ) (#44526 ) this commit removes usage of the deprecated constructor with a single argument and no Writeable.Reader. The purpose of this is to reduce the boilerplate necessary for properly implementing a new action, as well as reducing the chances of using the incorrect super constructor while classes are being migrated to Writeable relates #34389.	2019-07-17 20:28:11 -07:00
Nhat Nguyen	51180af91d	Make peer recovery send file chunks async (#44468 ) Relates #44040 Relates #36195	2019-07-17 22:25:43 -04:00
Nhat Nguyen	458f24c46a	Reenable accounting circuit breaker (#44495 ) We have a new Lucene 8.2 snapshot on master and 7.x; hence we can re-enable the accounting on these branches. Relates #30290	2019-07-17 22:25:43 -04:00
Julie Tibshirani	34c6067018	Convert several classes in 'server' to Writeable. (#44527 ) * Convert FieldCapabilities. Convert MultiTermVectors. Convert SyncedFlush. Convert SearchTemplateRequest. * Convert MultiSearchTemplateRequest. * Convert GrokProcessorGet. Remove a stray reference to SearchTemplateRequest#readFrom. Relates to #34389.	2019-07-17 19:04:21 -07:00
Ryan Ernst	2a2686e6e7	Convert remaining ActionTypes to writeable in xpack core (#44467 ) (#44525 ) This commit converts all remaining ActionType response classes to writeable in xpack core. It also converts a few from server which were used by xpack core. relates #34389	2019-07-17 18:01:45 -07:00
Ryan Ernst	17c4b2b839	Convert MasterNodeRequest to implement Writeable.Reader (#44452 ) (#44513 ) This commit converts all MasterNodeRequest subclasses to fullfill Writeable.Reader constructors. relates #34389	2019-07-17 18:01:29 -07:00
Paul Sanwald	7114fe786b	Fix incorrect calculation of how many buckets will result from a merge operation. (#44461 ) (#44515 )	2019-07-17 19:14:16 -04:00
Julie Tibshirani	8841779de8	Convert ClearScroll* to Writeable. (#44511 ) This PR converts `ClearScrollRequest` and `ClearScrollResponse` to `Writeable`. Relates to #34389.	2019-07-17 15:49:38 -07:00
Jason Tedor	39c5f98de7	Introduce test issue logging (#44477 ) Today we have an annotation for controlling logging levels in tests. This annotation serves two purposes, one is to control the logging level used in tests, when such control is needed to impact and assert the behavior of loggers in tests. The other use is when a test is failing and additional logging is needed. This commit separates these two concerns into separate annotations. The primary motivation for this is that we have a history of leaving behind the annotation for the purpose of investigating test failures long after the test failure is resolved. The accumulation of these stale logging annotations has led to excessive disk consumption. Having recently cleaned this up, we would like to avoid falling into this state again. To do this, we are adding a link to the test failure under investigation to the annotation when used for the purpose of investigating test failures. We will add tooling to inspect these annotations, in the same way that we have tooling on awaits fix annotations. This will enable us to report on the use of these annotations, and report when stale uses of the annotation exist.	2019-07-18 05:33:33 +09:00
Ryan Ernst	0755a13c9f	Convert AcknowledgedRequest to Writeable.Reader (#44412 ) (#44454 ) This commit adds constructors to AcknolwedgedRequest subclasses to implement Writeable.Reader, and ensures all future subclasses implement the same. relates #34389	2019-07-17 11:17:36 -07:00
Yannick Welsch	c8b66c549d	Ignore failures to set socket options on Mac (#44355 ) Brings some temporary relief for test failures until #41071 is addressed.	2019-07-17 18:51:25 +02:00
Yannick Welsch	f78e64e3e2	Terminate linearizability check early on large histories (#44444 ) Large histories can be problematic and have the linearizability checker occasionally run OOM. As it's very difficult to bound the size of the histories just right, this PR will let it instead run for 10 seconds on large histories and then abort. Closes #44429	2019-07-17 18:51:25 +02:00
Igor Motov	d3cb7bbc8f	Geo: fix GeoWKTShapeParserTests (#44448 ) Changes in #44187 introduced some optimization in the way shapes are generated. These changes were not captured in GeoWKTShapeParserTests. Relates #44187	2019-07-17 12:09:38 -04:00
Igor Motov	cd5a334864	Geo: extract dateline handling logic from ShapeBuilders (#44187 ) Extracts dateline decomposition logic from ShapeBuilder into a separate utility class that is used on the indexing side. The search side will be handled as part of another PR at this time we will remove the decomposition logic from ShapeBuilders as well. This PR also doesn't change any existing logic including bugs. Relates to #40908	2019-07-17 12:09:38 -04:00
Alan Woodward	b6a0f098e6	Don't use index_phrases on graph queries (#44340 ) Due to https://issues.apache.org/jira/browse/LUCENE-8916, when you try to use a synonym filter with the index_phrases option on a text field, you can end up with null values in a Phrase query, leading to weird exceptions further down the querying chain. As a workaround, this commit disables the index_phrases optimization for queries that produce token graphs. Fixes #43976	2019-07-17 16:46:00 +01:00
Yannick Welsch	ddd740162e	Do not use CancellableThreads for Zen1 (#44430 ) Zen 1 stops pinging threads in ZenDiscovery by calling Thread.interrupt(). This is incompatible with the CancellableThreads that only allow threads to be interrupted through cancellation. The use of CancellableThreads was introduced in #42844 and added to UnicastZenPing as part of the backport, as both Zen1 and Zen2 share the same SeedHostsResolver implementation. This commit effectively undoes the change in the backport while still allowing to share same implementation. Closes #44425	2019-07-17 17:32:47 +02:00
Zachary Tong	103ba976fd	Convert BucketScript to static parser (#44385 ) BucketScript was using the old-style parser and could easily be converted over to the newer static parser. Also adds a test for GapPolicy enum serialization	2019-07-17 10:22:42 -04:00
David Turner	377a6a47ac	Improve handshake failure messages (#44485 ) Today we report an exception on a handshake failure (e.g. cluster name mismatch) but the message does not include all the details of the mismatch. If the mismatch is something subtle like `my-cluster` instead of `my_cluster` then we cannot diagnose this from the message alone. This commit adds the details of the local cluster to the message, along with the details of the remote cluster, improving the utility of the exception message if reported in isolation.	2019-07-17 13:33:28 +01:00
Armin Braun	91673e373a	Fix Incorrect Uncompressed Error Handling in InboundMessage (#44317 ) (#44483 ) * Fix Incorrect Uncompressed Error Handling in InboundMessage * CompressorFactory.compressor does not throw uncompressed exception on uncompressed bytes, it merely returns `null` in this case if the bytes are at least XContent so the current catch and re-throw logic is dead code * Made it work again by throwing on a `null` return so we get a real error message instead of an NPE	2019-07-17 14:31:46 +02:00
Ignacio Vera	eb348d2593	Upgrade to lucene-8.2.0-snapshot-6413aae226 (#44480 )	2019-07-17 13:28:28 +02:00
Armin Braun	c8db0e9b7e	Remove blobExists Method from BlobContainer (#44472 ) (#44475 ) * We only use this method in one place in production code and can replace that with a read -> remove it to simplify the interface * Keep it as an implementation detail in the Azure repository	2019-07-17 11:56:02 +02:00
Tanguy Leroux	e423b7341a	Log non-acknowledged close index response in ReplicaToPrimaryPromotionIT Relates #44479	2019-07-17 10:32:44 +02:00
David Turner	dca8a918f3	Use applied cluster state in cluster health (#44426 ) In #44348 we changed the cluster health action so that it sometimes uses the cluster state directly from the master service rather than from the cluster applier. If the state is not recovered then this is inappropriate, because prior to state recovery the state available to the cluster applier contains no indices. This commit moves us back to using the state from the applier. Fixes #44416.	2019-07-17 08:36:13 +01:00
David Turner	0fd33b089f	Report shard state changes better (#44419 ) Today when the cluster health changes the `AllocationService` reports at most ten shards that were started or failed, and always ends its message with `...` suggesting that the list is truncated. This commit adjusts these messages to be clearer about whether the list is truncated or not. When debug logging is enabled the list is not truncated; if the list is truncated then its length is logged, and if it is not truncated then no `...` is included in the message.	2019-07-17 08:36:06 +01:00

1 2 3 4 5 ...

3464 Commits