OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lee Hinman	eebda6974d	Decouple NamedXContentRegistry from ElasticsearchException (#29253 ) * Decouple NamedXContentRegistry from ElasticsearchException This commit decouples `NamedXContentRegistry` from using either `ElasticsearchException`, `ParsingException`, or `UnknownNamedObjectException`. This will allow us to move NamedXContentRegistry to its own lib as part of the xcontent extraction work. Relates to #28504	2018-03-27 16:51:31 -06:00
Lee Hinman	7df66abaf5	[TEST] Fix issue with HttpInfo passed invalid parameter HttpInfo is passed the maxContentLength as a parameter, but this value should never be negative. This fixes the test to only pass a positive random value.	2018-03-27 14:20:06 -06:00
Lee Hinman	b4c78019b0	Remove all dependencies from XContentBuilder (#29225 ) * Remove all dependencies from XContentBuilder This commit removes all of the non-JDK dependencies from XContentBuilder, with the exception of `CollectionUtils.ensureNoSelfReferences`. It adds a third extension point around dealing with time-based fields and formatters to work around the Joda dependency. This decoupling allows us to be able to move XContentBuilder to a separate lib so it can be available for things like the high level rest client. Relates to #28504	2018-03-27 12:58:22 -06:00
Jim Ferenczi	3db6f1c9d5	Fix sporadic failure in CompositeValuesCollectorQueueTests This commit fixes a test bug that causes an NPE on empty segments. Closes #29269	2018-03-27 20:11:21 +02:00
Jim Ferenczi	2aaa057387	Propagate ignore_unmapped to inner_hits (#29261 ) In 5.2 `ignore_unmapped` was added to `inner_hits` in order to ignore invalid mapping. This value was automatically set to the value defined in the parent query (`nested`, `has_child`, `has_parent`) but the refactoring of the parent/child in 5.6 removed this behavior unintentionally. This commit restores this behavior but also makes sure that we always automatically enforce this value when the query builder is used directly (previously this was only done by the XContent deserialization). Closes #29071	2018-03-27 18:55:42 +02:00
Nhat Nguyen	dfc9e721d8	TEST: Increase timeout for testPrimaryReplicaResyncFailed The default timeout (eg. 10 seconds) may not be enough for CI to re-allocate shards after the partion is healed. This commit increases the timeout to 30 seconds and enables logging in order to have more detailed information in case this test failed again. Closes #29060	2018-03-27 12:18:09 -04:00
Nhat Nguyen	d1d3edf156	TEST: Use different translog dir for a new engine In #testPruneOnlyDeletesAtMostLocalCheckpoint, we create a new engine but mistakenly use the same translog directory of the existing engine. This prevents translog files from cleaning up when closing the engines. ERROR 0.12s J2 \| InternalEngineTests.testPruneOnlyDeletesAtMostLocalCheckpoint <<< FAILURES! > Throwable #1: java.io.IOException: could not remove the following files (in the order of attempts): > translog-primary-060/translog-2.tlog: java.io.IOException: access denied: This commit makes sure to use a separate directory for each engine in this tes.	2018-03-27 09:45:51 -04:00
Christoph Büscher	8d6832c5ee	Make SearchStats implement Writeable (#29258 ) Moves another class over from Streamable to Writeable. By this, also some constructors can be removed or made private.	2018-03-27 15:21:11 +02:00
Nhat Nguyen	0ac89a32cc	Do not optimize append-only if seen normal op with higher seqno (#28787 ) When processing an append-only operation, primary knows that operations can only conflict with another instance of the same operation. This is true as the id was freshly generated. However this property doesn't hold for replicas. As soon as an auto-generated ID was indexed into the primary, it can be exposed to a search and users can issue a follow up operation on it. In extremely rare cases, the follow up operation can be arrived and processed on a replica before the original append-only request. In this case we can't simply proceed with the append-only request and blindly add it to the index without consulting the version map. The following scenario can cause difference between primary and replica. 1. Primary indexes an auto-gen-id doc. (id=X, v=1, s#=20) 2. A refresh cycle happens on primary 3. The new doc is picked up and modified - say by a delete by query request - Primary gets a delete doc (id=X, v=2, s#=30) 4. Delete doc is processed first on the replica (id=X, v=2, s#=30) 5. Indexing operation arrives on the replica, since it's an auto-gen-id request and the retry marker is lower, we put it into lucene without any check. Replica has a doc the primary doesn't have. To deal with a potential conflict between an append-only operation and a normal operation on replicas, we need to rely on sequence numbers. This commit maintains the max seqno of non-append-only operations on replica then only apply optimization for an append-only operation only if its seq# is higher than the seq# of all non-append-only.	2018-03-26 16:56:12 -04:00
Nhat Nguyen	87957603c0	Prune only gc deletes below local checkpoint (#28790 ) Once a document is deleted and Lucene is refreshed, we will not be able to look up the `version/seq#` associated with that delete in Lucene. As conflicting operations can still be indexed, we need another mechanism to remember these deletes. Therefore deletes should still be stored in the Version Map, even after Lucene is refreshed. Obviously, we can't remember all deletes forever so a trimming mechanism is needed. Currently, we remember deletes for at least 1 minute (the default GC deletes cycle) and clean them periodically. This is, at the moment, the best we can do on the primary for user facing APIs but this arbitrary time limit is problematic for replicas. Furthermore, we can't rely on the primary and replicas doing the trimming in a synchronized manner, and failing to do so results in the replica and primary making different decisions. The following scenario can cause inconsistency between primary and replica. 1. Primary index doc (index, id=1, v2) 2. Network packet issue causes index operation to back off and wait 3. Primary deletes doc (delete, id=1, v3) 4. Replica processes delete (delete, id=1, v3) 5. 1+ minute passes (GC deletes runs replica) 6. Indexing op is finally sent to the replica which no processes it because it forgot about the delete. We can reply on sequence-numbers to prevent this issue. If we prune only deletes whose seqno at most the local checkpoint, a replica will correctly remember what it needs. The correctness is explained as follows: Suppose o1 and o2 are two operations on the same document with seq#(o1) < seq#(o2), and o2 arrives before o1 on the replica. o2 is processed normally since it arrives first; when o1 arrives it should be discarded: 1. If seq#(o1) <= LCP, then it will be not be added to Lucene, as it was already previously added. 2. If seq#(o1) > LCP, then it depends on the nature of o2: - If o2 is a delete then its seq# is recorded in the VersionMap, since seq#(o2) > seq#(o1) > LCP, so a lookup can find it and determine that o1 is stale. - If o2 is an indexing then its seq# is either in Lucene (if refreshed) or the VersionMap (if not refreshed yet), so a real-time lookup can find it and determine that o1 is stale. In this PR, we prefer to deploy a single trimming strategy, which satisfies both requirements, on primary and replicas because: - It's simpler - no need to distinguish if an engine is running at primary mode or replica mode or being promoted. - If a replica subsequently is promoted, user experience is fully maintained as that replica remembers deletes for the last GC cycle. However, the version map may consume less memory if we deploy two different trimming strategies for primary and replicas.	2018-03-26 13:42:08 -04:00
Boaz Leskes	bca264699a	remove testUnassignedShardAndEmptyNodesInRoutingTable testUnassignedShardAndEmptyNodesInRoutingTable and that test is as old as time and does a very bogus thing. it is an IT test which extracts the GatewayAllocator from the node and tells it to allocated unassigned shards, while giving it a conjured cluster state with no nodes in it (it uses the DiscoveryNodes.EMPTY_NODES. This is never a cluster state we want to reroute on (we always have at least master node in it). I'm going to just delete the test as I don't think it adds much value. Closes #21463	2018-03-26 17:10:57 +02:00
Boaz Leskes	f5d4550e93	Fold EngineDiskUtils into Store, for better lock semantics (#29156 ) #28245 has introduced the utility class`EngineDiskUtils` with a set of methods to prepare/change translog and lucene commit points. That util class bundled everything that's needed to create and empty shard, bootstrap a shard from a lucene index that was just restored etc. In order to safely do these manipulations, the util methods acquired the IndexWriter's lock. That would sometime fail due to concurrent shard store fetching or other short activities that require the files not to be changed while they read from them. Since there is no way to wait on the index writer lock, the `Store` class has other locks to make sure that once we try to acquire the IW lock, it will succeed. To side step this waiting problem, this PR folds `EngineDiskUtils` into `Store`. Sadly this comes with a price - the store class doesn't and shouldn't know about the translog. As such the logic is slightly less tight and callers have to do the translog manipulations on their own.	2018-03-26 14:08:03 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Nicholas Knize	fede633563	Add Z value support to geo_shape This enhancement adds Z value support (source only) to geo_shape fields. If vertices are provided with a third dimension, the third dimension is ignored for indexing but returned as part of source. Like beofre, any values greater than the 3rd dimension are ignored. closes #23747	2018-03-23 08:50:55 -05:00
Nhat Nguyen	794de63232	Remove type casts in logging in server component (#28807 ) This commit removes type-casts in logging in the server component (other components will be done later). This also adds a parameterized message test which would catch breaking-changes related to lambdas in Log4J.	2018-03-23 07:35:50 -04:00
Yu	4a8099c696	Change BroadcastResponse from ToXContentFragment to ToXContentObject (#28878 ) While working on #27799, we find that it might make sense to change BroadcastResponse from ToXContentFragment to ToXContentObject, seeing that it's rather a complete XContent object and also the other Responses are normally ToXContentObject. By doing this, we can also move the XContent build logic of BroadcastResponse's subclasses, from Rest Layer to the concrete classes themselves. Relates to #3889	2018-03-23 10:53:37 +01:00
Milan Chovatiya	8328b9c5cd	REST : Split `RestUpgradeAction` into two actions (#29124 ) Closes #29062	2018-03-23 10:37:31 +01:00
Nhat Nguyen	14157c8705	Harden periodically check to avoid endless flush loop (#29125 ) In #28350, we fixed an endless flushing loop which may happen on replicas by tightening the relation between the flush action and the periodically flush condition. 1. The periodically flush condition is enabled only if it is disabled after a flush. 2. If the periodically flush condition is enabled then a flush will actually happen regardless of Lucene state. (1) and (2) guarantee that a flushing loop will be terminated. Sadly, the condition 1 can be violated in edge cases as we used two different algorithms to evaluate the current and future uncommitted translog size. - We use method `uncommittedSizeInBytes` to calculate current uncommitted size. It is the sum of translogs whose generation at least the minGen (determined by a given seqno). We pick a continuous range of translogs since the minGen to evaluate the current uncommitted size. - We use method `sizeOfGensAboveSeqNoInBytes` to calculate the future uncommitted size. It is the sum of translogs whose maxSeqNo at least the given seqNo. Here we don't pick a range but select translog one by one. Suppose we have 3 translogs `gen1={#1,#2}, gen2={}, gen3={#3} and seqno=#1`, `uncommittedSizeInBytes` is the sum of gen1, gen2, and gen3 while `sizeOfGensAboveSeqNoInBytes` is the sum of gen1 and gen3. Gen2 is excluded because its maxSeqno is still -1. This commit removes both `sizeOfGensAboveSeqNoInBytes` and `uncommittedSizeInBytes` methods, then enforces an engine to use only `sizeInBytesByMinGen` method to evaluate the periodically flush condition. Closes #29097 Relates ##28350	2018-03-22 14:31:15 -04:00
Jim Ferenczi	c93c7f3121	Remove deprecated options for query_string (#29203 ) This commit removes some parameters deprecated in 6.x (or 5.x): `use_dismax`, `split_on_whitespace`, `all_fields` and `lowercase_expanded_terms`. Closes #25551	2018-03-22 18:37:08 +01:00
Yu	24c8d8f5ef	REST high-level client: add force merge API (#28896 ) Relates to #27205	2018-03-22 17:17:16 +01:00
Lee Hinman	7d1de890b8	Decouple more classes from XContentBuilder and make builder strict (#29197 ) This commit decouples `BytesRef`, `Releaseable`, and `TimeValue` from XContentBuilder, and paves the way for doupling `ByteSizeValue` as well. It moves much of the Lucene and Joda encoding into a new SPI extension that is loaded by XContentBuilder to know how to encode these values. Part of doing this also allows us to make JSON encoding strict, as we no longer allow just any old object to be passed (in the past it was possible to get json that was `"field": "java.lang.Object@d8355a8"` if no one was careful about what was passed in). Relates to #28504	2018-03-22 08:18:55 -06:00
Christoph Büscher	d6d3fb3c73	Use EnumMap in ClusterBlocks (#29112 ) By using EnumMap instead of an ImmutableLevelHolder array we can avoid the using enum ordinals to index into the array.	2018-03-22 11:14:24 +01:00
Tanguy Leroux	edf27a599e	Add new setting to disable persistent tasks allocations (#29137 ) This commit adds a new setting `cluster.persistent_tasks.allocation.enable` that can be used to enable or disable the allocation of persistent tasks. The setting accepts the values `all` (default) or `none`. When set to none, the persistent tasks that are created (or that must be reassigned) won't be assigned to a node but will reside in the cluster state with a no "executor node" and a reason describing why it is not assigned: ``` "assignment" : { "executor_node" : null, "explanation" : "persistent task [foo/bar] cannot be assigned [no persistent task assignments are allowed due to cluster settings]" } ```	2018-03-22 09:18:07 +01:00
Nhat Nguyen	7d44d75774	Adjust PreSyncedFlushResponse bwc versions We discussed and agreed to include the synced-flush change in 6.3.0+ but not in 5.6.9. We will re-evaluate the urgency and importance of the issue then decide which versions that the change should be included.	2018-03-21 16:50:35 -04:00
markharwood	93ff973afc	Tests - fix incorrect test assumption that zero-doc buckets will be returned by the adjacency matrix aggregation. Closes #29159 (#29167 )	2018-03-21 10:42:14 +00:00
Jason Tedor	2f6c77337e	Remove 6.1.5 version constant The assumption here is that we will no longer be making a release from the 6.1 branch. Since we assume that all versions on this branch are actually released, we do not want to leave behind any versions that would require a snapshot build. We do have a test that verifies that all released versions are present here, so if another release is performed from the 6.1 branch, that test will fail and we will know to add the version constant at that time.	2018-03-21 06:28:17 -04:00
Adrien Grand	8f9d2ee4e2	Reject updates to the `_default_` mapping. (#29165 ) This will reject mapping updates to the `_default_` mapping with 7.x indices and still emit a deprecation warning with 6.x indices. Relates #15613 Supersedes #28248	2018-03-21 10:44:11 +01:00
Nhat Nguyen	f938c4267e	Fix BWC issue for PreSyncedFlushResponse I misunderstood how the bwc versions works. If we backport to 5.x, we need to backport to all supported 6.*. This commit corrects the BWC versions for PreSyncedFlushResponse. Relates #29103	2018-03-20 13:56:15 -04:00
Lee Hinman	b4af451ec5	Remove BytesArray and BytesReference usage from XContentFactory (#29151 ) * Remove BytesArray and BytesReference usage from XContentFactory This removes the usage of `BytesArray` and `BytesReference` from `XContentFactory`. Instead, a regular `byte[]` should be passed. To assist with this a helper has been added to `XContentHelper` that will preserve the offset and length from the underlying BytesReference. This is part of ongoing work to separate the XContent parts from ES so they can be factored into their own jar. Relates to #28504	2018-03-20 11:52:26 -06:00
Lee Hinman	4bd217c94f	Add pluggable XContentBuilder writers and human readable writers (#29120 ) * Add pluggable XContentBuilder writers and human readable writers This adds the ability to use SPI to plug in writers for XContentBuilder. By implementing the XContentBuilderProvider class we can allow Elasticsearch to plug in different ways to encode types to JSON. Important caveat for this, we should always try to have the class implement `ToXContentFragment` first, however, in the case of classes from our dependencies (think Joda classes or Lucene classes) we need a way to specify writers for these classes. This also makes the human-readable field writers generic and pluggable, so that we no longer need to tie XContentBuilder to things like `TimeValue` and `ByteSizeValue`. Contained as part of this moves all the TimeValue human readable fields to the new `humanReadableField` method. A future commit will move the `ByteSizeValue` calls over to this method. Relates to #28504	2018-03-20 11:39:24 -06:00
Christoph Büscher	701625b065	Add unreleased version 6.2.4 (#29171 )	2018-03-20 18:38:06 +01:00
Christoph Büscher	5a97fe75da	Add unreleased version 6.1.5 (#29168 )	2018-03-20 18:31:59 +01:00
Luca Cavanna	ff09c82319	REST high-level client: add clear cache API (#28866 ) * REST high-level client: add clear cache API Relates to #27205 Also Closes #26947 (rest-spec were outdated)	2018-03-20 10:39:36 +01:00
Lee Hinman	687577a516	Fix javadoc warning in Strings for missing parameter description Fixes a parameter in `Strings` that had a javadoc annotation but was missing the description, causing warnings in the build.	2018-03-19 12:28:15 -06:00
Lee Hinman	3025295f7e	Decouple Text and Geopoint from XContentBuilder (#29119 ) This removes the `Text` and `Geopoint` special handling from `XContentBuilder`. Instead, these classes now implement `ToXContentFragment` and render themselves accordingly. This allows us to further decouple XContentBuilder from Elasticsearch-specific classes so it can be factored into a standalone lib at a later time. Relates to #28504	2018-03-19 08:54:10 -06:00
Nik Everett	bf05c600c4	REST: Include suppressed exceptions on failures (#29115 ) This modifies xcontent serialization of Exceptions to contain suppressed exceptions. If there are any suppressed exceptions they are included in the exception response by default. The reasoning here is that they are fairly rare but when they exist they almost always add extra useful information. Take, for example, the response when you specify two broken ingest pipelines: ``` { "error" : { "root_cause" : ...snip... "type" : "parse_exception", "reason" : "[field] required property is missing", "header" : { "processor_type" : "set", "property_name" : "field" }, "suppressed" : [ { "type" : "parse_exception", "reason" : "[field] required property is missing", "header" : { "processor_type" : "convert", "property_name" : "field" } } ] }, "status" : 400 } ``` Moreover, when suppressed exceptions come from 500 level errors should give us more useful debugging information. Closes #23392	2018-03-19 10:52:50 -04:00
Tanguy Leroux	0f93b7abdf	Fix compilation errors in ML integration tests After elastic/elasticsearch#29109, the `needsReassignment` method has been moved to the PersistentTasksClusterService. This commit fixes some compilation in tests I introduced.	2018-03-19 09:46:53 +01:00
Tanguy Leroux	b57bd695f2	Small code cleanups and refactorings in persistent tasks (#29109 ) This commit consists of small code cleanups and refactorings in the persistent tasks framework. Most changes are in PersistentTasksClusterService where some methods have been renamed or merged together, documentation has been added, unused code removed in order to improve readability of the code.	2018-03-19 09:26:17 +01:00
Nhat Nguyen	f1029aaad5	getMinGenerationForSeqNo should acquire read lock (#29126 ) The method Translog#getMinGenerationForSeqNo does not modify the current translog but only access, it therefore should acquire the readLock instead of writeLock.	2018-03-17 17:43:20 -04:00
Nhat Nguyen	c9749180a1	Backport - Do not renew sync-id PR to 5.6 and 6.3 Relates ##29103	2018-03-17 11:38:22 -04:00
Jason Tedor	2e93a9158f	Align thread pool info to thread pool configuration (#29123 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to the configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size This commit changes the display of thread pool info to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-03-16 22:47:06 -04:00
Nhat Nguyen	22ad52a288	TEST: Adjust translog size assumption in new engine A new engine now can have more than one empty translog since #28676. This cause #testShouldPeriodicallyFlush failed because in the test we asssume an engine should have one empty translog. This commit takes into account the extra translog size of a new engine.	2018-03-16 21:50:31 -04:00
olcbean	47211c00e9	REST: Clear Indices Cache API simplify param parsing (#29111 ) Simplify the parsing of the params in Clear Indices Cache API, as a follow up to the removing of the deprecated parameter names.	2018-03-16 16:50:34 -04:00
Jason Tedor	4d62640bf1	Fix typo in ExceptionSerializationTests This commit fixes a little typo in ExceptionSerializationTests.java replacing "weas" by "was".	2018-03-16 15:52:39 -04:00
Jason Tedor	1f1a4d17b4	Remove BWC layer for rejected execution exception The serialization changes for rejected execution exceptions has been backported to 6.x with the intention to appear in all versions since 6.3.0. Therefore, this BWC layer is no longer needed in master since master would never speak to a node that does not speak the same serialization.	2018-03-16 14:40:17 -04:00
Jason Tedor	6bf742dd1b	Fix EsAbortPolicy to conform to API (#29075 ) The rejected execution handler API says that rejectedExecution(Runnable, ThreadPoolExecutor) throws a RejectedExecutionException if the task must be rejected due to capacity on the executor. We do throw something that smells like a RejectedExecutionException (it is named EsRejectedExecutionException) yet we violate the API because EsRejectedExecutionException is not a RejectedExecutionException. This has caused problems before where we try to catch RejectedExecution when invoking rejectedExecution but this causes EsRejectedExecutionException to go uncaught. This commit addresses this by modifying EsRejectedExecutionException to extend RejectedExecutionException.	2018-03-16 14:34:36 -04:00
David Turner	158bb23887	Remove usages of obsolete settings (#29087 ) The settings `indices.recovery.concurrent_streams` and `indices.recovery.concurrent_small_file_streams` were removed in `f5e4cd4616`. This commit removes their last traces from the codebase.	2018-03-16 15:35:40 +00:00
Nhat Nguyen	2c1ef3d4c6	Do not renew sync-id if all shards are sealed (#29103 ) Today the synced-flush always issues a new sync-id even though all shards haven't been changed since the last seal. This causes active shards to have different a sync-id from offline shards even though all were sealed and no writes since then. This commit adjusts not to renew sync-id if all active shards are sealed with the same sync-id. Closes #27838	2018-03-16 11:16:30 -04:00
Adrien Grand	0755ff425f	Clarify requirements of strict date formats. (#29090 ) Closes #29014	2018-03-16 14:39:36 +01:00
Alan Woodward	a2d5cf6514	Compilation fix for #29067	2018-03-16 13:33:25 +00:00
Alan Woodward	986e518170	Store offsets in index prefix fields when stored in the parent field (#29067 ) The index prefix field is normally indexed as docs-only, given that it cannot be used in phrases. However, in the case that the parent field has been indexed with offsets, or has term-vector offsets, we should also store this in the index prefix field for highlighting. Note that this commit does not implement highlighting on prefix fields, but rather ensures that future work can implement this without a backwards-break in index data. Closes #28994	2018-03-16 11:39:46 +00:00
Tanguy Leroux	f14146982f	Use removeTask instead of finishTask in PersistentTasksClusterService (#29055 ) The method `PersistentTasksClusterService.finishTask()` has been modified since it was added and does not use any `removeOncompletion` flag anymore. Its behavior is now similar to `removeTask()` and can be replaced by this one. When a non existing task is removed, the cluster state update task will fail and its `source` will still indicate `finish persistent task`/`remove persistent task`.	2018-03-16 10:20:56 +01:00
Yogesh Gaikwad	a685784cea	CLI: Close subcommands in MultiCommand (#28954 ) * CLI Command: MultiCommand must close subcommands to release resources properly - Changes are done to override the close method and call close on subcommands using IOUtils#close - Unit Test Closes #28953	2018-03-16 09:59:23 +11:00
Nhat Nguyen	c75790e7c0	TEST: write ops should execute under shard permit (#28966 ) Currently ESIndexLevelReplicationTestCase executes write operations without acquiring index shard permit. This may prevent the primary term on replica from being updated or cause a race between resync and indexing on primary. This commit ensures that write operations are always executed under shard permit like the production code.	2018-03-15 14:42:15 -04:00
Mayya Sharipova	8cb3d18eac	Revert "Improve error message for installing plugin (#28298 )" This reverts commit `0cc1ffdf20` The reason is that Windows test are failing, because of the incorrect path for the plugin	2018-03-15 10:47:50 -07:00
Adrien Grand	404e776a45	Validate regular expressions in dynamic templates. (#29013 ) Today you would only get these errors at index time. Relates #24749	2018-03-15 16:43:56 +01:00
Christoph Büscher	312ccc05d5	[Tests] Fix GetResultTests and DocumentFieldTests failures (#29083 ) Changes made in #28972 seems to have changed some assumptions about how SMILE and CBOR write byte[] values and how this is tested. This changes the generation of the randomized DocumentField values back to BytesArray while expecting the JSON and YAML deserialisation to produce Base64 encoded strings and SMILE and CBOR to parse back BytesArray instances. Closes #29080	2018-03-15 16:42:26 +01:00
Adrien Grand	18d848f218	Reenable LiveVersionMapTests.testRamBytesUsed on Java 9. (#29063 ) I also had to make the test more lenient. This is due to the fact that Lucene's RamUsageTester was changed in order not to reflect `java.*` classes and the way that it estimates ram usage of maps is by assuming it has similar memory usage to an `Object[]` array that stores all keys and values. The implementation in `LiveVersionMap` tries to be slightly more realistic by taking the load factor and linked lists into account, so it usually gives a higher estimate which happens to be closer to reality. Closes #22548	2018-03-15 16:39:02 +01:00
Christoph Büscher	85933161d4	Mute failing GetResultTests and DocumentFieldTests	2018-03-15 11:49:45 +01:00
Mayya Sharipova	0cc1ffdf20	Improve error message for installing plugin (#28298 ) Provide more actionable error message when installing an offline plugin in the plugins directory, and the `plugins` directory for the node contains plugin distribution. Closes #27401	2018-03-14 16:19:04 -07:00
Lee Hinman	8425257593	[TEST] Fix issue parsing response out of order When parsing GetResponse it was possible that the equality check failed because items in the map were in a different order (in the `.equals` implementation).	2018-03-14 16:34:40 -06:00
Christoph Büscher	ae912cbde4	[Docs] Fix Java Api index administration usage (#28260 ) The Java API documentation for index administration currenty is wrong because the PutMappingRequestBuilder#setSource(Object... source) an CreateIndexRequestBuilder#addMapping(String type, Object... source) methods delegate to methods that check that the input arguments are valid key/value pairs. This changes the docs so the java api code examples are included from documentation integration tests so we detect compile and runtime issues earlier. Closes #28131	2018-03-14 22:02:06 +01:00
olcbean	3d81497f25	REST: Clear Indices Cache API remove deprecated url params (#29068 ) By the time the master branch is released the deprecated url parameters in the `/_cache/clear` API will have been deprecated for a couple of minor releases. Since master will be the next major release we are fine with removing these parameters.	2018-03-14 16:37:50 -04:00
Boaz Leskes	bf65cb4914	Untangle Engine Constructor logic (#28245 ) Currently we have a fairly complicated logic in the engine constructor logic to deal with all the various ways we want to mutate the lucene index and translog we're opening. We can: 1) Create an empty index 2) Use the lucene but create a new translog 3) Use both 4) Force a new history uuid in all cases. This leads complicated code flows which makes it harder and harder to make sure we cover all the corner cases. This PR tries to take another approach. Constructing an InternalEngine always opens things as they are and all needed modifications are done by static methods directly on the directory, one at a time.	2018-03-14 20:59:47 +01:00
Lee Hinman	8e8fdc4f0e	Decouple XContentBuilder from BytesReference (#28972 ) * Decouple XContentBuilder from BytesReference This commit removes all mentions of `BytesReference` from `XContentBuilder`. This is needed so that we can completely decouple the XContent code and move it into its own dependency. While this change appears large, it is due to two main changes, moving `.bytes()` and `.string()` out of XContentBuilder itself into static methods `BytesReference.bytes` and `Strings.toString` respectively. The rest of the change is code reacting to these changes (the majority of it in tests). Relates to #28504	2018-03-14 13:47:57 -06:00
David Kyle	cb9d10f971	Protect against NPE in RestNodesAction (#29059 ) * Protect against NPE in RestNodesAction	2018-03-14 15:47:18 +00:00
David Roberts	5bf92ca3b3	Enforce that java.io.tmpdir exists on startup (#28217 ) If the default java.io.tmpdir is used then the startup script creates it, but if a custom java.io.tmpdir is used then the user must ensure it exists before running Elasticsearch. If they forget then it can cause errors that are hard to understand, so this change adds an explicit check early in the bootstrap and reports a clear error if java.io.tmpdir is not an accessible directory.	2018-03-14 15:43:53 +00:00
Jason Tedor	24d10adaab	Main response should not have status 503 when okay (#29045 ) The REST status 503 means "I can not handle the request that you sent me." However today we respond to a main request with a 503 when there are certain cluster blocks despite still responding with an actual main response. This is broken, we should respond with a 200 status. This commit removes this silliness.	2018-03-14 06:36:37 -04:00
Jason Tedor	647d0a1e95	Do not swallow fail to convert exceptions (#29043 ) When converting the source for an indexing request to JSON, the conversion can throw an I/O exception which we swallow and proceed with logging to the slow log. The cause of the I/O exception is lost. This commit changes this behavior and chooses to drop the entry from the slow logs and instead lets an exception percolate up to the indexing operation listener loop. Here, the exception will be caught and logged at the warn level.	2018-03-13 23:42:16 -04:00
Jason Tedor	46fcd07153	Add total hits to the search slow log (#29034 ) This commit adds the total hits to the search slow log.	2018-03-13 20:40:47 -04:00
Jason Tedor	4dc3adad51	Archive unknown or invalid settings on updates (#28888 ) Today we can end up in a situation where the cluster state contains unknown or invalid settings. This can happen easily during a rolling upgrade. For example, consider two nodes that are on a version that considers the setting foo.bar to be known and valid. Assume one of these nodes is restarted on a higher version that considers foo.bar to now be either unknown or invalid, and then the second node is restarted too. Now, both nodes will be on a version that consider foo.bar to be unknown or invalid yet this setting will still be contained in the cluster state. This means that if a cluster settings update is applied and we validate the settings update with the existing settings then validation will fail. In such a state, the offending setting can not even be removed. This commit helps out with this situation by archiving any settings that are unknown or invalid at the time that a settings update is applied. This allows the setting update to go through, and the archived settings can be removed at a later time.	2018-03-13 17:32:18 -04:00
Jason Tedor	c8e71327ab	Log template creation and deletion (#29027 ) These can be seen at the debug level via cluster state update logging but really they should be more visible like index creation and deletion. This commit adds info-level logging for template puts and deletes.	2018-03-13 16:31:19 -04:00
Jason Tedor	697b9f8b82	Remove interning from prefix logger (#29031 ) This interning is completely unnecessary because we look up the marker by the prefix (value, not identity) anyway. This means that regardless of the identity of the prefix, we end up with the same marker. That is all that we really care about here.	2018-03-13 16:30:13 -04:00
olcbean	edc57f6f34	REST: deprecate `field_data` in Clear Cache API (#28943 ) We call it `fielddata` everywhere else in the code and API so we may as well be consistent.	2018-03-13 15:16:27 -04:00
Jason Tedor	5904d936fa	Copy Lucene IOUtils (#29012 ) As we have factored Elasticsearch into smaller libraries, we have ended up in a situation that some of the dependencies of Elasticsearch are not available to code that depends on these smaller libraries but not server Elasticsearch. This is a good thing, this was one of the goals of separating Elasticsearch into smaller libraries, to shed some of the dependencies from other components of the system. However, this now means that simple utility methods from Lucene that we rely on are no longer available everywhere. This commit copies IOUtils (with some small formatting changes for our codebase) into the fold so that other components of the system can rely on these methods where they no longer depend on Lucene.	2018-03-13 12:49:33 -04:00
Jason Tedor	6088af5887	Fix comment regarding removal of requiresKeystore The requiresKeystore flag was removed from PluginInfo in 6.3.0. This commit fixes a pair of code comments that incorrectly refer to this version as 7.0.0.	2018-03-12 14:20:02 -04:00
Jason Tedor	b8e165a994	Fix BWC versions on plugin info This commit fixes the BWC versions on the plugin info serialization which was changed to remove the requiresKeystore flag.	2018-03-12 13:05:48 -04:00
Jason Tedor	6331bcaf76	Create keystore on package install (#28928 ) This commit removes the ability to specify that a plugin requires the keystore and instead creates the keystore on package installation or when Elasticsearch is started for the first time. The reason that we opt to create the keystore on package installation is to ensure that the keystore has the correct permissions (the package installation scripts run as root as opposed to Elasticsearch running as the elasticsearch user) and to enable removing the keystore on package removal if the keystore is not modified.	2018-03-12 12:48:00 -04:00
Mika⠙	a7b53fd3b7	Add check when trying to reroute a shard to a non-data discovery node (#28886 ) While trying to reroute a shard to or from a non-data node (a node with ``node.data=false``), I encountered a null pointer exception. Though an exception is to be expected, the NPE was occurring because ``allocation.routingNodes()`` would not contain any non-data nodes, so when you attempt to do ``allocation.routingNodes.node(non-data-node)`` it would not find it, and thus error. This occurred regardless of whether I was rerouting to or from a non-data node. This PR adds a check (as well as a test for these use cases) to return a legible, useful exception if the discovery node you are rerouting to or from is not a data node.	2018-03-12 16:48:51 +01:00
Jason Tedor	b1b469e30f	Avoid class cast exception from index writer (#28989 ) When an index writer encounters a tragic exception, it could be a Throwable and not an Exception. Yet we blindly cast the tragic exception to an Exception which can encounter a ClassCastException. This commit addresses this by checking if the tragic exception is an Exception and otherwise wrapping the Throwable in a RuntimeException if it is not. We choose to wrap the Throwable instead of passing it around because passing it around leads to changing a lot of places where we handle Exception to handle Throwable instead. In general, we have tried to avoid handling Throwable and instead let those bubble up to the uncaught exception handler.	2018-03-12 08:42:02 -04:00
Yannick Welsch	be7f5dde24	Disallow logger methods with Object parameter (#28969 ) Log4j2 provides a wide range of logging methods. Our code typically only uses a subset of them. In particular, uses of the methods trace\|debug\|info\|warn\|error\|fatal(Object) or trace\|debug\|info\|warn\|error\|fatal(Object, Throwable) have all been wrong, leading to not properly logging the provided message. To prevent these issues in the future, the corresponding Logger methods have been blacklisted.	2018-03-12 03:05:24 -07:00
Jim Ferenczi	7afe5ad943	Restore tiebreaker for cross fields query (#28935 ) This commit restores the handling of tiebreaker for multi_match cross fields query. This functionality was lost during a refactoring of the multi_match query (#25115). Fixes #28933	2018-03-12 09:58:20 +01:00
Ryan Ernst	4216fc9f64	Plugins: Allow modules to spawn controllers (#28968 ) This commit makes the controller spawner also look under modules. It also fixes a bug in module security policy loading where the module is a meta plugin.	2018-03-11 09:01:27 -07:00
Nhat Nguyen	4f644d04a3	TEST: Use non-zero number for #testCompareUnits In `ByteSizeValueTests#testCompareUnits`, we expect non-zero for the variable `number` however `randomNonNegativeLong` can return zero. CI: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.2+oracle-java10-periodic/147/console	2018-03-10 22:56:22 -05:00
Jason Tedor	4ba80a7952	Maybe die before failing engine (#28973 ) Today we check for a few cases where we should maybe die before failing the engine (e.g., when a merge fails). However, there are still other cases where a fatal error can be hidden from us (for example, a failed index writer commit). This commit modifies the mechanism for failing the engine to always check for a fatal error before failing the engine.	2018-03-10 07:41:51 -05:00
Jason Tedor	950c4363bf	Remove special handling for _all in nodes info Today when requesting _all we return all nodes regardless of what other node qualifiers are in the request. This is contrary to how the remainder of the API behaves which acts as additive and subtractive based on the qualifiers and their ordering. It is also contrary to how the wildcard * behaves. This commit removes the special handling for _all so that it behaves identical to the wildcard *. Relates #28971	2018-03-09 18:23:16 -05:00
Lee Hinman	97b513e925	Remove Booleans use from XContent and ToXContent (#28768 ) * Remove Booleans use from XContent and ToXContent This removes the use of the `common.Boolean` class from two of the XContent classes, so they can be decoupled from the ES code as much as possible. Related to #28754, #28504	2018-03-09 14:58:54 -07:00
Nhat Nguyen	4973887a10	Make primary-replica resync failures less lenient (#28534 ) Today, failures from the primary-replica resync are ignored as the best effort to not mark shards as stale during the cluster restart. However this can be problematic if replicas failed to execute resync operations but just fine in the subsequent write operations. When this happens, replica will miss some operations from the new primary. There are some implications if the local checkpoint on replica can't advance because of the missing operations. 1. The global checkpoint won't advance - this causes both primary and replicas keep many index commits 2. Engine on replica won't flush periodically because uncommitted stats is calculated based on the local checkpoint 3. Replica can use a large number of bitsets to keep track operations seqno However we can prevent this issue but still reserve the best-effort by failing replicas which fail to execute resync operations but not mark them as stale. We have prepared to the required infrastructure in #28049 and #28054 for this change. Relates #24841	2018-03-09 09:55:45 -08:00
Martijn van Groningen	b32e999960	Use different pipeline id in test. (pipelines do not get removed between tests extending from ESIntegTestCase)	2018-03-09 14:29:42 +01:00
David Turner	033a83b98b	Use String.join() to describe a list of tasks (#28941 ) This change replaces the use of string concatenation with a call to String.join(). String concatenation might be quadratic, unless the compiler can optimise it away, whereas String.join() is more reliably linear. There can sometimes be a large number of pending ClusterState update tasks and #28920 includes a report that this operation sometimes takes a long time.	2018-03-09 09:42:44 +00:00
Martijn van Groningen	41519da45a	Fixed incorrect test try-catch statement	2018-03-09 09:38:16 +01:00
Ryan Ernst	62293ec1c9	Plugins: Consolidate plugin and module loading code (#28815 ) At one point, modules and plugins were very different. But effectively now they are the same, just from different directories. This commit unifies the loading methods so they are simply two different directories. Note that the main codepath to load plugin bundles had duplication (was not calling getPluginBundles) since previous refactorings to add meta plugins. Note this change also rewords the primary exception message when a plugin descriptor is missing, as the wording asking if the plugin was built before 2.0 isn't really applicable anymore (it is highly unlikely someone tries to install a 1.x plugin on any modern version).	2018-03-08 22:49:27 -08:00
Lee Hinman	46a79127ed	Remove FastStringReader in favor of vanilla StringReader (#28944 ) This allows us to remove another dependency in the decoupling of the XContent code. Rather than move this class over or decouple it, it can simply be removed. Relates tangentially to #28504	2018-03-08 17:17:36 -07:00
Lee Hinman	d6d7ee7320	Remove FastCharArrayReader and FastCharArrayWriter (#28951 ) These classes are used only in two places, and can be replaced by the `CharArrayReader` and `CharArrayWriter`. The JDK can also perform lock biasing and elision as well as escape analysis to optimize away non-contended locks, rendering their lock-free implementations unnecessary.	2018-03-08 17:05:11 -07:00
Tal Levy	7784c1bff9	Continue registering pipelines after one pipeline parse failure. (#28752 ) Ingest has been failing to apply existing pipelines from cluster-state into the in-memory representation that are no longer valid. One example of this is a pipeline with a script processor. If a cluster starts up with scripting disabled, these pipelines will not be loaded. Even though GETing a pipeline worked, indexing operations claimed that this pipeline did not exist. This is because one gets information from cluster-state and the other is from an in-memory data-structure. Now, two things happen 1. suppress the exceptions until after other successful pipelines are loaded 2. replace failed pipelines with a placeholder pipeline If the pipeline execution service encounters the stubbed pipeline, it is known that something went wrong at the time of pipeline creation and an exception was thrown to the user at some point at start-up. closes #28269.	2018-03-08 15:22:59 -08:00
Lee Hinman	17fc07a193	Switch XContentBuilder from BytesStreamOutput to ByteArrayOutputStream (#28945 ) This switches the underlying byte output representation used by default in `XContentBuilder` from `BytesStreamOutput` to a `ByteArrayOutputStream` (an `OutputStream` can still be specified manually) This is groundwork to allow us to decouple `XContent*` from the rest of the ES core code so that it may be factored into a separate jar. Since `BytesStreamOutput` was not using the recycling instance of `BigArrays`, this should not affect the circuit breaking capabilities elsewhere in the system. Relates to #28504	2018-03-08 15:45:51 -07:00
Lee Hinman	697f3f1a3b	Factor UnknownNamedObjectException into its own class (#28931 ) * Factor UnknownNamedObjectException into its own class This moves the inner class `UnknownNamedObjectException` from `NamedXContentRegistry` into a top-level class. This is so that `NamedXContentRegistry` doesn't have to depend on StreamInput and StreamOutput. Relates to #28504	2018-03-08 15:32:41 -07:00
Lee Hinman	ec92796ed8	Remove now-unused createParser that uses BytesReference (#28926 ) This removes `BytesReference` use from XContent and all subclasses. Relates to #28504	2018-03-08 09:10:21 -07:00
Jim Ferenczi	bc8b3fc71c	Revert "Rescore collapsed documents (#28521 )" This reverts commit `f057fc294a`. The rescorer does not resort the collapsed values inside the top docs during rescoring. For this reason the Lucene rescorer is not compatible with collapsing. Relates #27243	2018-03-08 11:20:29 +01:00
Lee Hinman	9d4f09db68	[TEST] AwaitsFix QueryRescorerIT.testRescoreAfterCollapse See: https://github.com/elastic/elasticsearch/issues/28932	2018-03-07 15:32:29 -07:00
Lee Hinman	818920a281	Decouple XContentType from StreamInput/Output (#28927 ) This removes the readFrom and writeTo methods from XContentType, instead using the more generic `readEnum` and `writeEnum` methods. Luckily they are both encoded exactly the same way, so there is no compatibility layer needed for backwards compatibility. Relates to #28504	2018-03-07 14:50:30 -07:00
Lee Hinman	2d1d6503a4	Remove BytesRef usage from XContentParser and its subclasses (#28792 ) * Remove BytesRef usage from XContentParser and its subclasses This removes all the BytesRef usage from XContentParser in favor of directly returning a CharBuffer (this was originally what was returned, it was just immediately wraped in a BytesRef). Relates to #28504 * Rename method after Ryan's feedback	2018-03-07 10:09:56 -07:00
Lee Hinman	e7d1e12675	Wrap stream passed to createParser in try-with-resources (#28897 ) * Wrap stream passed to createParser in try-with-resources This wraps the stream (`.streamInput()`) that is passed to many of the `createParser` instances in the enclosing (or a new) try-with-resources block. This ensures the `BytesReference.streamInput()` is closed. Relates to #28504 * Use try-with-resources instead of closing in a finally block	2018-03-04 16:48:03 -07:00
Sergey Galkin	f057fc294a	Rescore collapsed documents (#28521 ) This change adds the ability to rescore collapsed documents.	2018-03-04 13:39:50 -08:00
Jim Ferenczi	c26bd6046b	Fix (simple)_query_string to ignore removed terms (#28871 ) This change ensures that we ignore terms removed from the analysis rather than returning a match_no_docs query for the part that contain the stop word. For instance a query like "the AND fox" should ignore "the" if it is considered as a stop word instead of adding a match_no_docs query. This change also fixes the analysis of prefix terms that start with a stop word (e.g. `the*`). In such case if `analyze_wildcard` is true and `the` is considered as a stop word this part of the query is rewritten into a match_no_docs query. Since it's a prefix query this change forces the prefix query on `the` even if it is removed from the analysis. Fixes #28855 Fixes #28856	2018-03-04 13:25:46 -08:00
Simon Willnauer	5a1d9f33a0	Try if tombstone is eligable for pruning before locking on it's key (#28767 ) Pruning tombstones is quite expensive since we have to walk though all deletes in the live version map and acquire a lock on every value even though it's impossible to prune it. This change does a pre-check if a delete is old enough and if not it skips acquireing the lock.	2018-03-04 11:31:13 -08:00
Mayya Sharipova	f53d159aa1	Limit analyzed text for highlighting (improvements) (#28808 ) Increase the default limit of `index.highlight.max_analyzed_offset` to 1M instead of previous 10K. Enhance an error message when offset increased to include field name, index name and doc_id. Relates to https://github.com/elastic/kibana/issues/16764	2018-03-02 08:09:05 -08:00
olcbean	472acf7833	[DOCS] fix put_mapping snippet (#28814 ) Add a java snippet to be run in an integration test in order to guarantee that the snippet is correct Closes #28778	2018-03-01 10:59:52 +01:00
Luca Cavanna	184a8718d8	REST high-level client: add flush API (#28852 ) Relates to #27205	2018-03-01 10:56:03 +01:00
Yu	95dea2408d	Add Refresh API for RestHighLevelClient (#27799 ) Relates to #27205	2018-02-28 11:49:14 +01:00
lzh3636	929dba8667	Fix log message when virtual lock not possible (#28829 ) When virtual lock is not possible because JNA is unavailable, we log a warning message. Yet, this log message refers to mlockall rather than virtual lock, presumably because of a copy/paste error. This commit fixes this issue.	2018-02-26 06:55:32 -08:00
Nhat Nguyen	f1a94de9e7	Replace log4j.Supplier by jdk’s in non-logging usage (#28812 ) This commit replaces `org.apache.logging.log4j.util.Supplier` by `java.util.function.Supplier` in non-logging code. These usages are neither incorrect nor wrong but rather than accidental. I think our intention was to use the JDK's Supplier in these places.	2018-02-24 14:28:32 -05:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Luca Cavanna	cd3d9c9f80	[TEST] share code between streamable/writeable/xcontent base test classes (#28785 ) Today we have two test base classes that have a lot in common when it comes to testing wire and xcontent serialization: `AbstractSerializingTestCase` and `AbstractXContentStreamableTestCase`. There are subtle differences though between the two, in the way they work, what can be overridden and features that they support (e.g. insertion of random fields). This commit introduces a new base class called `AbstractWireTestCase` which holds all of the serialization test code in common between `Streamable` and `Writeable`. It has two minimal subclasses called `AbstractWireSerializingTestCase` and `AbstractStreamableTestCase` which are specialized for `Writeable` and `Streamable`. This commit also introduces a new test class called `AbstractXContentTestCase` for all of the xContent testing, which holds a testFromXContent method for parsing and rendering to xContent. This one can be delegated to from the existing `AbstractStreamableXContentTestCase` and `AbstractSerializingTestCase` so that we avoid code duplicate as much as possible and all these base classes offer the same functionalities in the same way. Having this last base class decoupled from the serialization testing may also help with the REST high-level client testing, as there are some classes where it's hard to implement equals/hashcode and this makes it possible to override `assertEqualInstances` for custom equality comparisons (also this base class doesn't require implementing equals/hashcode as it doesn't test such methods.	2018-02-23 10:48:48 +01:00
Nicholas Knize	3728c50d85	[GEO] Fix points_only indexing failure for GeoShapeFieldMapper This commit fixes a bug that was introduced in PR #27415 for 6.1 and 7.0 where a change to support MULTIPOINT shapes mucked up indexing of standalone points.	2018-02-22 20:57:09 -06:00
Lee Hinman	5bb79558e7	Decouple XContentGenerator and JsonXContentGenerator from BytesReference (#28772 ) This removes the link these two classes have with BytesReference, in favor of an `InputStream` approach. Relates to #28504	2018-02-22 14:22:37 -07:00
jxy	497b3d7a20	Fix node ID reported by ThrottlingAllocationDecider (#28779 ) Previously the message reported when `node_concurrent_outgoing_recoveries` resulted in a `THROTTLE` decision included the reporting node's ID rather than that of the primary. This commit fixes that. Fixes #28777.	2018-02-22 19:23:15 +00:00
Luca Cavanna	1df711c5b7	Remove AcknowledgedRestListener in favour of RestToXContentListener (#28724 ) This commit makes AcknowledgedResponse implement ToXContentObject, so that the response knows how to print its own content out to XContent, which allows us to remove AcknowledgedRestListener.	2018-02-22 09:13:30 +01:00
Jason Tedor	3dfb4b8b18	Skip some plugins service tests on Windows These tests need to be skipped. They cause plugins to be loaded which causes a child classloader to be opened. We do not want to add the permissions to be able to close a classloader solely for these tests, and the JARs created in the test can not be deleted on Windows until the classloader is closed. Since this will not happen before test teardown, the test will fail on Windows. So, we skip these tests.	2018-02-21 15:22:27 -05:00
Luca Cavanna	8b4a298874	Migrate some *ResponseTests to AbstractStreamableXContentTestCase (#28749 ) This allows us to save a bit of code, but also adds more coverage as it tests serialization which was missing in some of the existing tests. Also it requires implementing equals/hashcode and we get the corresponding tests for them for free from the base test class.	2018-02-21 20:04:12 +01:00
Lee Hinman	d7eae4b90f	Pass InputStream when creating XContent parser (#28754 ) * Pass InputStream when creating XContent parser Rather than passing the raw `BytesReference` in when creating the xcontent parser, this passes the StreamInput (which is an InputStream), this allows us to decouple XContent from BytesReference. This also removes the use of `commons.Booleans` so it doesn't require more external commons classes. Related to #28504 * Undo boolean removal * Enhance deprecation javadoc	2018-02-21 11:03:25 -07:00
Yu	7d8fb69d50	version set in ingest pipeline (#27573 ) Add support version and version_type in ingest pipelines Add support for setting document version and version type in set processor of an ingest pipeline.	2018-02-21 09:34:51 +01:00
Michael Basnight	eaa6b41b03	Add 5.6.9 snapshot version	2018-02-20 12:16:02 -06:00
Jim Ferenczi	5991e977d2	Add unreleased v6.2.3 version	2018-02-20 17:49:41 +01:00
Lee Hinman	d4fddfa2a0	Remove log4j dependency from elasticsearch-core (#28705 ) * Remove log4j dependency from elasticsearch-core This removes the log4j dependency from our elasticsearch-core project. It was originally necessary only for our jar classpath checking. It is now replaced by a `Consumer<String>` so that the es-core dependency doesn't have external dependencies. The parts of #28191 which were moved in conjunction (like `ESLoggerFactory` and `Loggers`) have been moved back where appropriate, since they are not required in the core jar. This is tangentially related to #28504 * Add javadocs for `output` parameter * Change @code to @link	2018-02-20 09:15:54 -07:00
Simon Willnauer	b00870600b	Never block on key in `LiveVersionMap#pruneTombstones` (#28736 ) Pruning tombstones is best effort and should not block if a key is currently locked. This can cause a deadlock in rare situations if we switch of append only optimization while heavily updating the same key in the engine while the LiveVersionMap is locked. This is very rare since this code patch only executed every 15 seconds by default since that is the interval we try to prune the deletes in the version map. Closes #28714	2018-02-20 16:35:05 +01:00
Luca Cavanna	8bbb3c9ffa	REST high-level client: add support for Rollover Index API (#28698 ) Relates to #27205	2018-02-20 15:58:58 +01:00
Jason Tedor	94594f19ab	Fix handling of mandatory meta plugins This commit fixes an issue with setting plugin.mandatory to include a meta-plugin. The issue here is that the names that we collect are the underlying plugins, not the meta-plugin. We should not use the underlying plugins instead using the names of non-meta plugins and the names of meta-plugins. This commit addresses this. The strategy here is that when we look at the installed plugins on the filesystem, we keep track of which ones are meta-plugins and carry this information up to where check which plugins are installed against the mandatory plugins. Relates #28710	2018-02-20 08:57:04 -05:00
Nhat Nguyen	0c2871c4d2	Replace CAS loop by updateAndGet to improve readability Relates #28737	2018-02-20 08:03:24 -05:00
Simon Willnauer	779bc6fd5c	Simplify Engine.Searcher creation (#28728 ) Today we have several levels of indirection to acquire an Engine.Searcher. We first acquire a the reference manager for the scope then acquire an IndexSearcher and then create a searcher for the engine based on that. This change simplifies the creation into a single method call instead of 3 different ones.	2018-02-20 09:35:49 +01:00
Simon Willnauer	13a8ba4740	[TEST] Fix flaky IndexServiceTests#testRefreshActuallyWorks	2018-02-20 09:34:20 +01:00
Jason Tedor	105dcb544c	Enable selecting adaptive selection stats The node stats API enables filtlering the top-level stats for only desired top-level stats. Yet, this was never enabled for adaptive replica selection stats. This commit enables this. We also add setting these stats on the request builder, and fix an inconsistent name in a setter. Relates #28721	2018-02-19 16:56:36 -05:00
Nhat Nguyen	ff2164c4f9	Revisit deletion policy after release the last snapshot (#28627 ) We currently revisit the index deletion policy whenever the global checkpoint has advanced enough. We should also revisit the deletion policy after releasing the last snapshot of a snapshotting commit. With this change, the old index commits will be cleaned up as soon as possible. Follow-up of #28140 https://github.com/elastic/elasticsearch/pull/28140#discussion_r162458207	2018-02-19 11:39:15 -05:00
Simon Willnauer	8325786b33	Remove unused method	2018-02-19 12:24:28 +01:00
Simon Willnauer	56edb5eb3a	Track deletes only in the tombstone map instead of maintaining as copy (#27868 ) Today we maintain a copy of every delete in the live version maps. This is unnecessary and might add quite some overhead if maps grow large. This change moves out the deletes tracking into the tombstone map only and relies on the cleaning of tombstones when deletes are collected.	2018-02-19 12:23:38 +01:00
Yannick Welsch	c0026648f0	Fix AdaptiveSelectionStats serialization bug (#28718 ) The AdaptiveSelectionStats object serializes the clientOutgoingConnections map that's concurrently updated in SearchTransportService. Serializing the map consists of first writing the size of the map and then serializing the entries. If the number of entries changes while the map is being serialized, the size and number of entries go out of sync. The deserialization routine expects those to be in sync though. Closes #28713	2018-02-19 10:16:29 +01:00
Nhat Nguyen	df07943522	TEST: Fix InternalEngine#testAcquireIndexCommit The acquireIndexCommit was separated into acquireSafeIndexCommit and acquireLastIndexCommit, however the test was not updated accordingly. Relates #28271	2018-02-17 09:49:14 -05:00
Nhat Nguyen	84fd39f5bb	Separate acquiring safe commit and last commit (#28271 ) Previously we introduced a new parameter to `acquireIndexCommit` to allow acquire either a safe commit or a last commit. However with the new parameters, callers can provide a nonsense combination - flush first but acquire the safe commit. This commit separates acquireIndexCommit method into two different methods to avoid that problem. Moreover, this change should also improve the readability. Relates #28038	2018-02-16 21:25:58 -05:00
Nhat Nguyen	2f011295ec	Backported the translog files age stats to v6.3.0 Relates #28613	2018-02-16 12:07:08 -05:00
Lee Hinman	0dd79028c9	Remove deprecated createParser methods (#28697 ) * Remove deprecated createParser methods This removes the final instances of the callers of `XContent.createParser` and `XContentHelper.createParser` that did not pass in the `DeprecationHandler`. It also removes the now-unused deprecated methods and fully removes any mention of Log4j or LoggingDeprecationHandler from the XContent code. Relates to #28504 * Add comments in JsonXContentGenerator	2018-02-16 08:26:30 -07:00
Justin Wyer	5aeb479ffd	Add translog file age to Translog Stats (#28613 ) Expose the age of translog files in the translog stats. This is useful to reason about your translog retention policy. Closes #28189	2018-02-16 16:23:33 +01:00
Jason Tedor	57a56d8e64	Fix test concurrent remote connection updates This test has a race condition. The action listener used to listen for connections has a guard against being executed twice. However, this listener can be executed twice. After on success is invoked the test starts to tear down. At this point, the threads the test forked will terminate and the remote cluster connection will be closed. However, a thread forked to the management thread pool by the remote cluster connection can still be executing and try to continue connecting. This thread will be cancelled when the remote cluster connection is closed and this leads to the action listener being invoked again. To address this, we explicitly check that the reason that on failure was invoked was cancellation, and we assert that the listener was already previously invoked. Interestingly, this issue has always been present yet a recent change (#28667) exposed errors that occur on tasks submitted to the thread pool and were silently being lost. Relates #28695	2018-02-16 07:30:15 -05:00
Andy Bristol	70b279dbbc	[TEST] AwaitsFix testTriggerUpdatesConcurrently	2018-02-15 16:34:24 -08:00
Lee Hinman	d90a440bf7	Add XContentHelper shim for move to passing in deprecation handler (#28684 ) In order to allow us to gradually move to passing the deprecation handler is, we need a shim that contains both the non-passed and passed version. Relates to #28504	2018-02-15 11:01:01 -07:00
Jason Tedor	3e846ab251	Handle throws on tasks submitted to thread pools When we submit a task to a thread pool for asynchronous execution, we are returned a future. Since we submitted to go asynchronous, these futures are not inspected for failure (we would have to block a thread to do that). While we have on failure handlers for exceptions that are thrown during execution, we do not handle throwables that are not exceptions and these end up silently lost. This commit adds a check after the runnable returns that inspects the status of the future. If an unhandled throwable occurred during execution, this throwable is propogated out where it will land in the uncaught exception handler. Relates #28667	2018-02-15 11:59:12 -05:00
olcbean	02fc16f10e	Add Cluster Put Settings API to the high level REST client (#28633 ) Relates to #27205	2018-02-15 17:21:45 +01:00
Jason Tedor	671e7e2f00	Lift error finding utility to exceptions helpers We have code used in the networking layer to search for errors buried in other exceptions. This code will be useful in other locations so with this commit we move it to our exceptions helpers. Relates #28691	2018-02-15 09:48:52 -05:00
Boaz Leskes	beb55d148a	Simplify the Translog constructor by always expecting an existing translog (#28676 ) Currently the Translog constructor is capable both of opening an existing translog and creating a new one (deleting existing files). This PR separates these two into separate code paths. The constructors opens files and a dedicated static methods creates an empty translog.	2018-02-15 09:24:09 +01:00
Ke Li	fc406c9a5a	Upgrade t-digest to 3.2 (#28295 ) (#28305 )	2018-02-15 08:23:20 +00:00
Jason Tedor	cd54c96d56	Add comment explaining lazy declared versions A recent change moved computing declared versions from using reflection which occurred repeatedly to a lazily-initialized holder so that declared versions are computed exactly once. This commit adds a comment explaining the motivation for this change.	2018-02-14 23:15:59 -05:00
Nhat Nguyen	452bfc0d83	Backported synced-flush PR to v5.6.8 and v6.2.2 Relates #28464	2018-02-14 14:48:29 -05:00
Lee Hinman	b59b1cf59d	Move more XContent.createParser calls to non-deprecated version (#28672 ) * Move more XContent.createParser calls to non-deprecated version Part 2 This moves more of the callers to pass in the DeprecationHandler. Relates to #28504 * Use parser's deprecation handler where appropriate * Use logging handler in test that uses deprecated field on purpose	2018-02-14 11:24:48 -07:00
Lee Hinman	7c1f5f5054	Move more XContent.createParser calls to non-deprecated version (#28670 ) * Move more XContent.createParser calls to non-deprecated version This moves more of the callers to pass in the DeprecationHandler. Relates to #28504 * Use parser's deprecation handler where available	2018-02-14 09:01:40 -07:00
Tal Levy	6c7d12c34c	[TEST] bump timeout in testFetchShardsSkipUnavailable to 5s in response to #28668.	2018-02-13 13:09:43 -08:00
Lee Hinman	7c201a64b5	[TEST] Synchronize searcher list in IndexShardTests It's possible to check the list size, then attempt to remove a searcher and throw an IndexOutOfBoundsException due to multiple threads. Resolves #27651	2018-02-13 12:16:04 -07:00
Scott Somerville	a138e0e225	Compute declared versions in a static block This method is called often enough (when computing minimum compatibility versions) that the reflection and sort can be seen while profiling. This commit addresses this issue by computing the declared versions exactly once. Relates #28661	2018-02-13 13:24:19 -05:00
Jim Ferenczi	3b9f530839	Inc store reference before refresh (#28656 ) If a tragic even happens while we are refreshing a searcher/reader the engine can open new files on a store that is already closed For instance the following CI job failed because a merge was concurrently called on a failing shard: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+oracle-java10-periodic/84 This change increments the ref count of the store during a refresh in order to postpone the closing after a tragic event.	2018-02-13 15:38:02 +01:00
Jim Ferenczi	813d8e1f7e	Fix meta plugin installation that contains plugins with dependencies When installing a meta plugin we check the dependency of each sub plugin during the installation. Though if the extended plugin is part of the meta plugin the installation fails because we only check for plugins that are already installed. This change is a workaround that extracts all plugins (even those that are not fully installed yet) when the dependency check is made during the installation. Note that this is how the plugin installation worked before https://github.com/elastic/elasticsearch/pull/28581.	2018-02-13 11:43:34 +01:00
Ryan Ernst	ea381969be	Plugins: Separate plugin semantic validation from properties format validation (#28581 ) This commit moves the semantic validation (like which version a plugin was built for or which java version it is compatible with) from reading a plugin descriptor, leaving the checks on the format of the descriptor intact. relates #28540	2018-02-12 21:30:11 -08:00
Robin Neatherway	282974215c	MetaDataIndexAliasesService wrong get type (#28614 ) A get of the wrong type would always have returned null so these indices would have been inserted into the map repeatedly.	2018-02-12 15:55:17 -08:00
Robin Neatherway	3174b2cbfa	NXYSignificanceHeuristic equality update (#28616 ) * NXYSignificanceHeuristic.java: implementation of equality would have failed with a ClassCastException when comparing to another type. Replaced with the Eclipse generated form.	2018-02-12 15:51:57 -08:00
Robin Neatherway	68b7a5c281	Fix DeadlockAnalyzer printer (#28615 ) Remove `if` block that was always true.	2018-02-12 15:28:56 -08:00
Nhat Nguyen	9eb9ce3843	Require translogUUID when reading global checkpoint (#28587 ) Today we use the persisted global checkpoint to calculate the starting seqno in peer-recovery. However we do not check whether the translog actually belongs to the existing Lucene index when reading the global checkpoint. In some rare cases if the translog does not match the Lucene index, that recovering replica won't be able to complete its recovery. This can happen as follows. 1. Replica executes a file-based recovery 2. Index files are copied to replica but crashed before finishing the recovery 3. Replica starts recovery again with seq-based as the copied commit is safe 4. Replica fails to open engine because translog and Lucene index are not matched 5. Replica won't be able to recover from primary This commit enforces the translogUUID requirement when reading the global checkpoint directly from the checkpoint file. Relates #28435	2018-02-12 13:23:32 -05:00
Lee Hinman	6538542603	Switch to hardcoding Smile as the state format (#28610 ) This commit changes the state format that was previously passed in to `MetaDataStateFormat` to always use Smile. This doesn't actually change the format, since we have used Smile for writing the format since at least 5.0. This removes the automatic detection of the state format when reading state, since any state that could be processed in 6.x and 7.x would already have been written in Smile format. This is work towards removing the deprecated methods in the XContent code where we do automatic content-type detection. Relates to #28504	2018-02-12 08:07:01 -07:00
Jim Ferenczi	e6a8528554	Force depth_first mode execution for terms aggregation under a nested context (#28421 ) This commit forces the depth_first mode for `terms` aggregation that contain a sub-aggregation that need to access the score of the document in a nested context (the `terms` aggregation is a child of a `nested` aggregation). The score of children documents is not accessible in breadth_first mode because the `terms` aggregation cannot access the nested context. Close #28394	2018-02-12 13:38:11 +01:00
Jim Ferenczi	7dc00ef1f5	Search option terminate_after does not handle post_filters and aggregations correctly (#28459 ) * Search option terminate_after does not handle post_filters and aggregations correctly This change fixes the handling of the `terminate_after` option when post_filters (or min_score) are used. `post_filter` should be applied before `terminate_after` in order to terminate the query when enough document are accepted by the post_filters. This commit also changes the type of exception thrown by `terminate_after` in order to ensure that multi collectors (aggregations) do not try to continue the collection when enough documents have been collected. Closes #28411	2018-02-12 13:36:33 +01:00
Ke Li	55448b2630	[Tests] Remove unnecessary condition check (#28559 ) The condition value in question is true, regardless of the randomBoolean() value. This change simplifies this removing the condition blocks.	2018-02-12 11:33:19 +01:00
Boaz Leskes	4aece92b2c	IndexShardOperationPermits: shouldn't use new Throwable to capture stack traces (#28598 ) The is a follow up to #28567 changing the method used to capture stack traces, as requested during the review. Instead of creating a throwable, we explicitly capture the stack trace of the current thread. This should Make Jason Happy Again ™️ .	2018-02-12 10:33:13 +01:00
Michael Basnight	e0bea70070	Generalize BWC logic (#28505 ) Generalizing BWC building so that there is less code to modify for a release. This ensures we do not need to think about what major or minor version is in the gradle code. It follows the general rules of the elastic release structure. For more information on the rules, see the VersionCollection's javadoc. This also removes the additional bwc snapshots that will never be released, such as 6.0.2, which were being built and tested against every time we ran bwc tests. Additionally, it creates 4 new projects that correspond to the different types of snapshots that may exist for a given version. Its possible to now run those individual tasks to work out bwc logic whereas previously it was impossible and the entire suite of bwc tests had to be run to work out any logic changes in the build tools' bwc project. Please note that if the project does not make sense for the version that is current, that an error will be thrown from that individual project if an attempt is made to run it. This should allow for automating the version bumps as well, since it removes all the hardcoded version logic from the configs.	2018-02-09 14:55:10 -06:00
Lee Hinman	5263b8cc7e	Remove all instances of the deprecated `ParseField.match` method (#28586 ) This removes all the server references to the deprecated `ParseField.match` method in favor of the method that passes in the deprecation logger. Relates to #28504	2018-02-09 09:19:24 -07:00
Martijn van Groningen	766b9d600e	Fixed a bug that prevents pipelines to load that use stored scripts after a restart. The bug was caused because the ScriptService had no reference to a ClusterState instance, because it received the ClusterState after the PipelineStore. This only is the case after a restart. A bad side effect is that during a restart, any pipeline to be loaded after the pipeline that uses a stored script, was never loaded, which caused many pipeline to be missing in bulk / index request api calls.	2018-02-09 17:14:00 +01:00
Yannick Welsch	5735e088f9	Fsync directory after cleanup (#28604 ) After copying over the Lucene segments during peer recovery, we call cleanupAndVerify which removes all other files in the directory and which then calls getMetadata to check if the resulting files are a proper index. There are two issues with this: - the directory is not fsynced after the deletions, so that the call to getMetadata, which lists files in the directory, can get a stale view, possibly seeing a deleted corruption marker (which leads to the exception seen in #28435) - failing to delete a corruption marker should result in a hard failure, as the shard is otherwise unusable.	2018-02-09 17:06:36 +01:00
Igor Motov	da1a10fa92	Add generic array support to AbstractObjectParser (#28552 ) Adds a generic declareFieldArray that can process arrays of arbitrary elements.	2018-02-08 19:46:12 -05:00
Nhat Nguyen	dbf9fb31e4	Do not ignore shard not-available exceptions in replication (#28571 ) The shard not-available exceptions are currently ignored in the replication as the best effort avoids failing not-yet-ready shards. However these exceptions can also happen from fully active shards. If this is the case, we may have skipped important failures from replicas. Since #28049, only fully initialized shards are received write requests. This restriction allows us to handle all exceptions in the replication. There is a side-effect with this change. If a replica retries its peer recovery second time after being tracked in the replication group, it can receive replication requests even though it's not-yet-ready. That shard may be failed and allocated to another node even though it has a good lucene index on that node. This PR does not change the way we report replication errors to users, hence the shard not-available exceptions won't be reported as before. Relates #28049 Relates #28534	2018-02-08 18:05:27 -05:00
Boaz Leskes	ba59cf1262	Capture stack traces while issuing IndexShard operations permits to easy debugging (#28567 ) Today we acquire a permit from the shard to coordinate between indexing operations, recoveries and other state transitions. When we leak an permit it's practically impossible to find who the culprit is. This PR add stack traces capturing for each permit so we can identify which part of the code is responsible for acquiring the unreleased permit. This code is only active when assertions are active. The output is something like: ``` java.lang.AssertionError: shard [test][1] on node [node_s0] has pending operations: --> java.lang.RuntimeException: something helpful 2 at org.elasticsearch.index.shard.IndexShardOperationPermits.acquire(IndexShardOperationPermits.java:223) at org.elasticsearch.index.shard.IndexShard.<init>(IndexShard.java:322) at org.elasticsearch.index.IndexService.createShard(IndexService.java:382) at org.elasticsearch.indices.IndicesService.createShard(IndicesService.java:514) at org.elasticsearch.indices.IndicesService.createShard(IndicesService.java:143) at org.elasticsearch.indices.cluster.IndicesClusterStateService.createShard(IndicesClusterStateService.java:552) at org.elasticsearch.indices.cluster.IndicesClusterStateService.createOrUpdateShards(IndicesClusterStateService.java:529) at org.elasticsearch.indices.cluster.IndicesClusterStateService.applyClusterState(IndicesClusterStateService.java:231) at org.elasticsearch.cluster.service.ClusterApplierService.lambda$callClusterStateAppliers$6(ClusterApplierService.java:498) at java.base/java.lang.Iterable.forEach(Iterable.java:75) at org.elasticsearch.cluster.service.ClusterApplierService.callClusterStateAppliers(ClusterApplierService.java:495) at org.elasticsearch.cluster.service.ClusterApplierService.applyChanges(ClusterApplierService.java:482) at org.elasticsearch.cluster.service.ClusterApplierService.runTask(ClusterApplierService.java:432) at org.elasticsearch.cluster.service.ClusterApplierService$UpdateTask.run(ClusterApplierService.java:161) at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:566) at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedEsThreadPoolExecutor.java:244) at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:207) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1167) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:641) at java.base/java.lang.Thread.run(Thread.java:844) --> java.lang.RuntimeException: something helpful at org.elasticsearch.index.shard.IndexShardOperationPermits.acquire(IndexShardOperationPermits.java:223) at org.elasticsearch.index.shard.IndexShard.<init>(IndexShard.java:311) at org.elasticsearch.index.IndexService.createShard(IndexService.java:382) at org.elasticsearch.indices.IndicesService.createShard(IndicesService.java:514) at org.elasticsearch.indices.IndicesService.createShard(IndicesService.java:143) at org.elasticsearch.indices.cluster.IndicesClusterStateService.createShard(IndicesClusterStateService.java:552) at org.elasticsearch.indices.cluster.IndicesClusterStateService.createOrUpdateShards(IndicesClusterStateService.java:529) at org.elasticsearch.indices.cluster.IndicesClusterStateService.applyClusterState(IndicesClusterStateService.java:231) at org.elasticsearch.cluster.service.ClusterApplierService.lambda$callClusterStateAppliers$6(ClusterApplierService.java:498) at java.base/java.lang.Iterable.forEach(Iterable.java:75) at org.elasticsearch.cluster.service.ClusterApplierService.callClusterStateAppliers(ClusterApplierService.java:495) at org.elasticsearch.cluster.service.ClusterApplierService.applyChanges(ClusterApplierService.java:482) at org.elasticsearch.cluster.service.ClusterApplierService.runTask(ClusterApplierService.java:432) at org.elasticsearch.cluster.service.ClusterApplierService$UpdateTask.run(ClusterApplierService.java:161) at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:566) at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedEsThreadPoolExecutor.java:244) at org.elasticsearch.common.util.concurrent.PrioritizedEsThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedEsThreadPoolExecutor.java:207) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1167) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:641) at java.base/java.lang.Thread.run(Thread.java:844) ```	2018-02-08 22:59:02 +01:00
Nhat Nguyen	5b8870f193	Only log warning when actually failing shards (#28558 ) Currently the master node logs a warning message whenever it receives a failed shard request. However, this can be noisy because - Multiple failed shard requests can be issued for a single shard - Failed shard requests can be still issued for an already failed shard This commit moves the log-warn to AllocationService in which the failing shard action actually happens. This is another prerequisite step in order to not ignore the shard not-available exceptions in the replication. Relates #28534	2018-02-08 15:37:52 -05:00
Jason Tedor	5badacf391	Fix race condition in queue size test The queue size test has a race condition. Namely the offering thread can run so quickly completing all of its offering iterations before the queue size thread ever has a chance to run a single size poll iteration. This means that the size will never actually be polled and the test can spuriously fail. What we really want to do here, since this test is checking for a race condition between polling the size of the queue and offers to the queue, we want to execute each iteration in lockstep giving the threads multiple changes for the race between polling the size and offers to occur. This commit addresses this by running the two threads in lockstep for multiple iterations so that they have multiple chances to race. Relates #28584	2018-02-08 14:23:24 -05:00
Igor Motov	80c8c3b114	Add 6.2.2 version constant	2018-02-08 13:43:34 -05:00
Lee Hinman	2e4c834a13	Switch to non-deprecated ParseField.match method for o.e.search (#28526 ) * Switch to non-deprecated ParseField.match method for o.e.search This replaces more of the `ParseField.match` calls with the same call using a deprecation handler. It encapsulates all of the instances in the `org.elastsicsearch.search` package. Relates to #28504 * Address Nik's comments	2018-02-08 11:17:57 -07:00
Lee Hinman	b64bf51000	Replace more deprecated ParseField.match calls with non-deprecated call (#28525 ) * Replace more deprecated ParseField.match calls with non-deprecated call This replaces more of the `ParseField.match` calls with the same call using a deprecation handler. Relates to #28504 * Address Nik's comments	2018-02-08 09:45:28 -07:00
Ryan Ernst	a55eda626f	Plugins: Store elasticsearch and java versions in PluginInfo (#28556 ) Plugin descriptors currently contain an elasticsearch version, which the plugin was built against, and a java version, which the plugin was built with. These versions are read and validated, but not stored. This commit keeps them in PluginInfo so they can be used later. While seeing the elasticsearch version is less interesting (since it is enforced to match that of the running elasticsearc node), the java version is interesting since we only validate the format, not the actual version. This also makes PluginInfo have full parity with the plugin properties file.	2018-02-08 08:31:39 -08:00
Jason Tedor	86fd48e5f5	Fix size blocking queue to not lie about its weight Today when offering an item to a size blocking queue that is at capacity, we first increment the size of the queue and then check if the capacity is exceeded or not. If the capacity is indeed exceeded, we do not add the item to the queue and immediately decrement the size of the queue. However, this incremented size is exposed externally even though the offered item was never added to the queue (this is effectively a race on the size of the queue). This can lead to misleading statistics such as the size of a queue backing a thread pool. This commit fixes this issue so that such a size is never exposed. To do this, we replace the hidden CAS loop that increments the size of the queue with a CAS loop that only increments the size of the queue if we are going to be successful in adding the item to the queue. Relates #28557	2018-02-08 06:54:39 -05:00
Jason Tedor	666c4f9414	Index shard should roll generation via the engine Today when a replica shard detects a new primary shard (via a primary term transition), we roll the translog generation. However, the mechanism that we are using here is by reaching through the engine to the translog directly. By poking all the way through rather than asking the engine to manage the roll for us we miss: - taking a read lock in the engine while the roll is occurring - trimming unreferenced readers This commit addresses this by asking the engine to roll the translog generation for us. Relates #28537	2018-02-07 14:57:50 -05:00
Martijn van Groningen	2023c98bea	Added more parameter to PersistentTaskPlugin#getPersistentTasksExecutor(...)	2018-02-07 17:43:35 +01:00
Christoph Büscher	c0886cf7c6	[Tests] Relax assertion in SuggestStatsIT (#28544 ) The test expects suggest times in milliseconds that are strictly positive. Internally they are measured in nanos, it is possible that on really fast execution this is rounded to 0L, so this should also be an accepted value. Closes #28543	2018-02-07 17:26:08 +01:00
Christoph Büscher	305b87b4b7	Make internal Rounding fields final (#28532 ) The fields in the internal rounding classes can be made final with very minor adjustments to how they are read from a StreamInput.	2018-02-07 09:33:21 +01:00
Jason Tedor	c2fcf15d9d	Fix the ability to remove old plugin We now read the plugin descriptor when removing an old plugin. This is to check if we are removing a plugin that is extended by another plugin. However, when reading the descriptor we enforce that it is of the same version that we are. This is not the case when a user has upgraded Elasticsearch and is now trying to remove an old plugin. This commit fixes this by skipping the version enforcement when reading the plugin descriptor only when removing a plugin. Relates #28540	2018-02-06 17:38:26 -05:00
Lee Hinman	6b4ea4e6fb	Add 6.2.1 version constant	2018-02-06 12:13:24 -07:00
Yannick Welsch	e6f873c620	Remove feature parsing for GetIndicesAction (#28535 ) Removes dead code. Follow-up of #24723	2018-02-06 18:00:14 +01:00
Yannick Welsch	c8df446000	No refresh on shard activation needed (#28013 ) A shard is fully baked when it moves to POST_RECOVERY. There is no need to do an extra refresh on shard activation again as the shard has already been refreshed when it moved to POST_RECOVERY.	2018-02-06 17:29:22 +01:00
Yannick Welsch	d43f0b5f26	Improve failure message when restoring an index that already exists in the cluster (#28498 ) Makes the message more actionable and removes the focus on the fact that the index is open.	2018-02-06 14:24:52 +01:00
Lee Hinman	eebff4d2b3	Use non deprecated xcontenthelper (#28503 ) * Move to non-deprecated XContentHelper.createParser(...) This moves away from one of the now-deprecated XContentHelper.createParser methods in favor of specifying the deprecation logger at parser creation time. Relates to #28449 Note that this doesn't move all the `createParser` calls because some of them use the already-deprecated method that doesn't specify the XContentType. * Remove the deprecated (and now non-needed) createParser method	2018-02-05 16:18:18 -07:00
Nik Everett	5003ef18ac	Scripts: Fix security for deprecation warning (#28485 ) If you call `getDates()` on a long or date type field add a deprecation warning to the response and log something to the deprecation logger. This mostly worked just fine but if the deprecation logger happens to roll then the roll will be performed with the script's permissions rather than the permissions of the server. And scripts don't have permissions to, say, open files. So the rolling failed. This fixes that by wrapping the call the deprecation logger in `doPriviledged`. This is a strange `doPrivileged` call because it doens't check Elasticsearch's `SpecialPermission`. `SpecialPermission` is a permission that no-script code has and that scripts never have. Usually all `doPrivileged` calls check `SpecialPermission` to make sure that they are not accidentally acting on behalf of a script. But in this case we are intentionally acting on behalf of a script. Closes #28408	2018-02-03 14:56:08 -05:00
Nhat Nguyen	de6d31ebc2	Backport fail shard w/o marking as stale PR to v6.3 Relates #28054	2018-02-03 12:07:39 -05:00
Nhat Nguyen	965efa51cc	Allows failing shards without marking as stale (#28054 ) Currently when failing a shard we also mark it as stale (eg. remove its allocationId from from the InSync set). However in some cases, we need to be able to fail shards but keep them InSync set. This commit adds such capacity. This is a preparatory change to make the primary-replica resync less lenient. Relates #24841	2018-02-03 09:41:53 -05:00
Nhat Nguyen	875bbfe699	Backported the harden synced-flush PR to v6.3.0 Relates #28464	2018-02-02 14:31:37 -05:00
David Turner	ab8f5ea54c	Forbid trappy methods from java.time (#28476 ) ava.time has the functionality needed to deal with timezones with varying offsets correctly, but it also has a bunch of methods that silently let you forget about the hard cases, which raises the risk that we'll quietly do the wrong thing at some point in the future. This change adds the trappy methods to the list of forbidden methods to try and help stop this from happening. It also fixes the only use of these methods in the codebase so far: IngestDocument#deepCopy() used ZonedDateTime.of() which may alter the offset of the given time in cases where the offset is ambiguous.	2018-02-02 18:24:02 +00:00
Lee Hinman	3ddea8d8d2	Start switching to non-deprecated ParseField.match method (#28488 ) This commit switches all the modules and server test code to use the non-deprecated `ParseField.match` method, passing in the parser's deprecation handler or the logging deprecation handler when a parser is not available (like in tests). Relates to #28449	2018-02-02 10:10:13 -07:00
Nhat Nguyen	5f2121960e	Synced-flush should not seal index of out of sync replicas (#28464 ) Today the correctness of synced-flush is guaranteed by ensuring that there is no ongoing indexing operations on the primary. Unfortunately, a replica might fall out of sync with the primary even the condition is met. Moreover, if synced-flush mistakenly issues a sync_id for an out of sync replica, then that replica would not be able to recover from the primary. ES prevents that peer-recovery because it detects that both indexes from primary and replica were sealed with the same sync_id but have a different content. This commit modifies the synced-flush to not issue sync_id for out of sync replicas. This change will report the divergence issue earlier to users and also prevent replicas from getting into the "unrecoverable" state. Relates #10032	2018-02-02 11:20:38 -05:00

... 2 3 4 5 6 ...

502 Commits