OpenSearch

Commit Graph

Author	SHA1	Message	Date
Colin Goodheart-Smithe	41abccf6c5	Adds rewrite phase to aggregations (#25495 ) * Adds rewrite phase to aggregations This change adds aggregations to the rewrite performed by the `SearchSourceBuilder`. This means that `AggregationBuilder`s are able to implement a `rewrite()` method where they can return a new `AggregationBuilder` which is functionally the same but in a more primitive form. This is exactly analogous to the rewrite done by the `QueryBuilder`s. The first aggregation to implement the rewrite are the filter and filters aggregations so they can rewrite the filters they contain. Closes #17676 * Removes rewrite from PipelineAggregationBuilder Rewrite is based on shard level information. Since pipeline aggregation are run in the reduce phase it doesn’t make sense to rewrite them on the shards. In fact eventually we shouldn’t be transporting them to the shards at all and should be retaining them on the coordinating node for execution in the reduce phase * Addresses review comments * addresses more review comments * Fixed imports	2017-07-04 16:47:48 +01:00
Simon Willnauer	1c4ef0d214	Upgrade randomizedrunner to 2.5.2 (#25533 ) An issue causing confusing error messages during test execution has been fixed randomizedtesting/randomizedtesting#250	2017-07-04 16:48:11 +02:00
Jun Ohtani	6894ef6057	[Analysis] Support normalizer in request param (#24767 ) * [Analysis] Support normalizer in request param Support normalizer param Support custom normalizer with char_filter/filter param Closes #23347	2017-07-04 19:16:56 +09:00
Christoph Büscher	5200665295	Remove deprecated IdsQueryBuilder constructor (#25529 ) The constructor using `types` has been deprecated for a while now (starting with ES 5.1.). It can be removed in the next mayor version. Since types are optional they should be added with the #types() setter.	2017-07-04 11:59:48 +02:00
Colin Goodheart-Smithe	43efcffcc2	Adds check for negative search request size (#25397 ) * Adds check for negative search request size This change adds a check to `SearchSourceBuilder` to throw and exception if the size set on it is set to a negative value. Closes #22530 * fix error in reindex * update re-index tests * Addresses review comment * Fixed tests * Added random negative size test * Fixes test	2017-07-04 10:51:38 +01:00
Christoph Büscher	f576c987ce	Remove QueryParseContext (#25486 ) QueryParseContext is currently only used as a wrapper for an XContentParser, so this change removes it entirely and changes the appropriate APIs that use it so far to only accept a parser instead.	2017-07-03 17:30:40 +02:00
Tanguy Leroux	0e2cfc66bb	[Test] Use a common testing class for all XContent filtering tests (#25491 ) We have two ways to filter XContent: - The first method is to parse the XContent as a map and use XContentMapValues.filter(). This method filters the content of the map using an automaton. It is used for source filtering, both at search and indexing time. It performs well but can generate a lot of objects and garbage collections when large XContent are filtered. It also returns empty objects (see `f2710c16eb`) when all the sub fields have been filtered out and handle dots in field names as if they were sub fields. - The second method is to parse the XContent and copy the XContentParser structure to a XContentBuilder initialized with includes/excludes filters. This method uses the Jackson streaming filter feature. It is used by the Response Filtering ('filter_path') feature. It does not generate a lot of objects, and does not return empty objects and also does not handle dots in field names explicitely. Both methods have similar goals but different tests. This commit changes the current XContentBuilder test class so that it becomes a more generic testing class and we can now ensure that filtering methods generate the same results. It also removes some tests from the XContentMapValuesTests class that should be in XContentParserTests.	2017-07-03 14:45:26 +02:00
markharwood	a9ea742a85	Tests fix - Significant terms/text aggs (#25499 ) The significance aggs return Lucene index-level statistics that when merged are assumed to be from different shards. The Aggregator unit tests assume segments can be treated as shards and thus break the significance stats and introduce double-counting of background doc frequencies. This change addresses this problem by ensuring test indexes have only one shard. Closes #25429	2017-07-03 09:52:23 +01:00
Simon Willnauer	1205610023	[TEST] Expect nodes getting disconnected quickly If all nodes get disconnected before we can send the request we might try to reconnect and that will fail with an ISE instead of the a transport exception. Closes #25301	2017-07-02 22:12:35 +02:00
Boaz Leskes	a4fae1540e	testPrimaryFailureIncreasesTerm should use assertBusy to wait for yellow ensureYellow ensures at least yellow. Also, since we only have 1 replica, we don't need to index for it to know about the primary term promotion Closes #25287	2017-07-02 21:19:51 +02:00
Simon Willnauer	5a7c8bb04e	Cleanup network / transport related settings (#25489 ) This commit makes the use of the global network settings explicit instead of implicit within NetworkService. It cleans up several places where we fall back to the global settings while we should have used tcp or http ones. In addition this change also removes unnecessary settings classes	2017-07-02 10:16:50 +02:00
Yannick Welsch	bb23d3b2c5	Remove allocation id from replica replication response (#25488 ) The replica replication response object has an extra allocationId field that contains the allocation id of the replica on which the request was executed. As we are sending the allocation id with the actual replica replication request, and check when executing the replica replication action that the allocation id of the replica shard is what we expect, there is no need to communicate back the allocation id as part of the response object.	2017-07-01 11:36:45 +02:00
Jason Tedor	c70c440050	Adjust status on bad allocation explain requests When a user requests a cluster allocation explain in a situation where it does not make sense (for example, there are no unassigned shards), we should consider this a bad request instead of a server error. Yet, today by throwing an illegal state exception, these are treated as server errors. This commit adjusts these so that they throw illegal argument exceptions and are treated as bad requests. Relates #25503	2017-06-30 17:50:20 -04:00
Drew Raines	6deb18c0de	Preliminary support for ARM This commit adds preliminary support for 64-bit ARM architectures. Relates #25318	2017-06-30 14:22:20 -04:00
Jason Tedor	dd93ef3f24	Add additional test for sequence-number recovery This commit adds a test for a scenario where a replica receives an extra document that the promoted replica does not receive, misses the primary/replica re-sync, and the recovers from the newly-promoted primary. Relates #25493	2017-06-30 10:59:03 -04:00
Martijn van Groningen	c8da7f84a2	WrapperQueryBuilder should also rewrite the parsed query. Failing to do so can cause other errors later on during query execution. For example if `WrapperQueryBuilder` wraps a `GeoShapeQueryBuilder` that fetches the shape from an index then it will skip the shape fetching and fail later with the error that no shapes have been fetched.	2017-06-30 13:48:18 +02:00
Yannick Welsch	1fee1045b9	Remove dead code and stale Javadoc	2017-06-30 12:25:56 +02:00
Jason Tedor	d219a85b33	Use LRU set to reduce repeat deprecation messages This commit adds an LRU set to used to determine if a keyed deprecation message should be written to the deprecation logs, or only added to the response headers on the thread context. Relates #25474	2017-06-29 16:36:43 -04:00
Tim Brooks	cac2eec7d2	Add NioTransport threads to thread name checks (#25477 ) We have various assertions that check we never block on transport threads. This commit adds the thread names for the NioTransport to these assertions. With this change I had to fix two places where we were calling blocking methods from the transport threads.	2017-06-29 15:16:07 -05:00
Christoph Büscher	c32c21e875	Add shortcut for AbstractQueryBuilder.parseInnerQueryBuilder to QueryShardContext	2017-06-29 21:45:02 +02:00
Christoph Büscher	99aa04b79c	Fix Java 9 compilation issue My IDE ate a cast that seems required to make Java 9 happy.	2017-06-29 20:57:22 +02:00
Christoph Büscher	927111c91d	Remove QueryParseContext from parsing QueryBuilders (#25448 ) Currently QueryParseContext is only a thin wrapper around an XContentParser that adds little functionality of its own. I provides helpers for long deprecated field names which can be removed and two helper methods that can be made static and moved to other classes. This is a first step in helping to remove QueryParseContext entirely.	2017-06-29 17:10:20 +02:00
Lee Hinman	22ff76da0c	Promote replica on the highest version node (#25277 ) * Promote replica on the highest version node This changes the replica selection to prefer to return replicas on the highest version when choosing a replacement to promote when the primary shard fails. Consider this situation: - A replica on a 5.6 node - Another replica on a 6.0 node - The primary on a 6.0 node The primary shard is sending sequence numbers to the replica on the 6.0 node and skipping sending them for the 5.6 node. Now assume that the primary shard fails and (prior to this change) the replica on 5.6 node gets promoted to primary, it now has no knowledge of sequence numbers and the replica on the 6.0 node will be expecting sequence numbers but will never receive them. Relates to #10708 * Switch from map of node to version to retrieving the version from the node * Remove uneeded null check * You can pretend you're a functional language Java, but you're not fooling me. * Randomize node versions * Add test with random cluster state with multiple versions that fails shards * Re-add comment and remove extra import * Remove unneeded stuff, randomly start replicas a few more times * Move test into FailedNodeRoutingTests * Make assertions actually test replica version promotion * Rewrite test, taking Yannick's feedback into account	2017-06-29 08:56:34 -06:00
Martijn van Groningen	a2b4080fba	use diamond operator	2017-06-29 13:43:39 +02:00
Christoph Büscher	aa2038f9d7	Use DocumentField#toXContent and parsing in SearchHit (#25469 ) As a small follow-up to #25361, we can use DocumentFields toXContent/fromXContent in SearchHit now.	2017-06-29 13:32:13 +02:00
olcbean	3518e313b8	Unify the result interfaces from get and search in Java client (#25361 ) As GetField and SearchHitField have the same members, they have been unified into DocumentField. Closes #16440	2017-06-29 11:35:28 +02:00
Jason Tedor	da59c178e2	Emit settings deprecation logging at most once When a setting is deprecated, if that setting is used repeatedly we currently emit a deprecation warning every time the setting is used. In cases like hitting settings endpoints over and over against a node with a lot of deprecated settings, this can lead to excessive deprecation warnings which can crush a node. This commit ensures that a given setting only sees deprecation logging at most once. Relates #25457	2017-06-28 22:18:46 -04:00
Ali Beyad	b18bfd6062	Output all empty snapshot info fields if in verbose mode (#25455 ) In #24477, a less verbose option was added to retrieve snapshot info via GET /_snapshot/{repo}/{snapshots}. The point of adding this less verbose option was so that if the repository is a cloud based one, and there are many snapshots for which the snapshot info needed to be retrieved, then each snapshot would require reading a separate snapshot metadata file to pull out the necessary information. This can be costly (performance and cost) on cloud based repositories, so a less verbose option was added that only retrieves very basic information about each snapshot that is all available in the index-N blob - requiring only one read! In order to display this less verbose snapshot info appropriately, logic was added to not display those fields which could not be populated. However, this broke integrators (e.g. ECE) that required these fields to be present, even if empty. This commit is to return these fields in the response, even if empty, if the verbose option is set.	2017-06-28 17:37:56 -05:00
Jay Modi	64d11b8831	Fix race condition in RemoteClusterConnection node supplier (#25432 ) This commit fixes a race condition in the node supplier used by the RemoteClusterConnection. The node supplier stores an iterator over a set backed by a ConcurrentHashMap, but the get operation of the supplier uses multiple methods of the iterator and is suceptible to a race between the calls to hasNext() and next(). The test in this commit fails under the old implementation with a NoSuchElementException. This commit adds a wrapper object over a set and a iterator, with all methods being synchronized to avoid races. Modifications to the set result in the iterator being set to null and the next retrieval creates a new iterator.	2017-06-28 15:50:24 -06:00
Jay Modi	b2901f536e	Do not search locally if remote index pattern resolves to no indices (#25436 ) This commit changes how we determine if there were any remote indices that a search should have been executed against. Previously, we used the list of remote shard iterators but if the remote index pattern resolved to no indices there would be no remote shard iterators even though the request specified remote indices. The map of remote cluster names to the original indices is used instead so that we can determine if there were remote indices even when there are no remote shard iterators. Closes #25426	2017-06-28 12:41:37 -06:00
Andreas Gebhardt	a156ccd80e	Expand `/_cat/nodes` to return information about hard drive (#21775 ) Expand `/_cat/nodes` with already present information about available disk space `diskAvail` (alias: `d`, `disk`) by: * `diskTotal` (alias `dt`): total disk space * `diskUsed` (alias `du`): used disk space (`diskTotal - diskAvail`) * `diskUsedPercent` (alias `dup`): used disk space percentage Note: The available disk space is the number of bytes available to the node's Java virtual machine. The size might be smaller than the real one. That means the used disk space (percentage) is larger. Closes #21679	2017-06-28 18:20:20 +02:00
Tim Brooks	5f8be0e090	Introduce NioTransport into framework for testing (#24262 ) This commit introduces a nio based tcp transport into framework for testing. Currently Elasticsearch uses a simple blocking tcp transport for testing purposes (MockTcpTransport). This diverges from production where our current transport (netty) is non-blocking. The point of this commit is to introduce a testing variant that more closely matches the behavior of production instances.	2017-06-28 10:51:20 -05:00
Chris Earle	f2eeceb10d	_nodes/stats should not fail due to concurrent AlreadyClosedException (#25016 ) This catches `AlreadyClosedException` during `stats` calls to avoid failing a `_nodes/stats` request because of the ignorable, concurrent index closure.	2017-06-28 10:08:45 -04:00
Yannick Welsch	5a4a47332c	Use a single method to update shard state This commit refactors index shard to provide a single method for updating the shard state on an incoming cluster state update. Relates #25431	2017-06-28 09:48:47 -04:00
Jason Tedor	ebdae09df3	Do not swallow exception when relocating When relocating a shard before changing the state to relocated, we verify that a relocation is a still taking place. Yet, this can throw an exception if the relocation is in fact no longer valid. Sadly, we were swallowing the exception in this situation. This commit allows such an exception to bubble up after safely releasing resources.	2017-06-28 08:42:13 -04:00
Jason Tedor	be906628d5	Remove implicit 32-bit support We previously tried to maintain (while not formally supporting) 32-bit support, although we never tested this anywhere in CI. Since we do not formally support this, and 32-bit usage is very low, we have elected to no longer maintain 32-bit support. This commit removes any implication of 32-bit support. Relates #25435	2017-06-28 08:24:33 -04:00
Yannick Welsch	5d1e67c882	Disallow multiple concurrent recovery attempts for same target shard (#25428 ) The primary shard uses the GlobalCheckPointTracker to track local checkpoint information of recovering and started replicas in order to calculate the global checkpoint. As the tracker is updated through recoveries as well, it is easier to reason about the tracker if we can ensure that there are no concurrent recovery attempts for the same target shard (which can happen in case of network disconnects).	2017-06-28 10:41:16 +02:00
Yannick Welsch	8ae61c0fc4	Update global checkpoint when increasing primary term on replica (#25422 ) When a replica shard increases its primary term under the mandate of a new primary, it should also update its global checkpoint; this gives us the guarantee that its global checkpoint is at least as high as the new primary and gives a starting point for the primary/replica resync. Relates to #25355, #10708	2017-06-28 10:38:22 +02:00
Daniel Mitterdorfer	dd6751d3e9	Add backwards compatibility indices for 5.4.3	2017-06-28 10:00:01 +02:00
Daniel Mitterdorfer	75ceb7d63b	Add version 5.4.3 after release	2017-06-28 09:59:54 +02:00
Jason Tedor	8afeeed051	Add missing newline at end of SetsTests.java This commit adds a missing newline to the end of SetsTests.java after the closing curly brace.	2017-06-27 17:28:41 -04:00
Jason Tedor	f6a693e1bc	Rename handoff primary context transport handler This commit renames this handler from "hand_off" to "handoff" since "handoff" is an actual word in the English language.	2017-06-27 15:08:58 -04:00
Tal Levy	cbcf6a4f55	correct expected thrown exception in mappingMetaData to ElasticsearchParseException (#25410 )	2017-06-27 08:55:24 -07:00
Jason Tedor	9b3768204b	Add Javadocs and tests for set difference methods This commit adds Javadocs and tests for some set difference utility methods in core.	2017-06-27 11:29:35 -04:00
Christoph Büscher	c55dc23270	Tests: Add parsing test for AggregationsTests (#25396 ) We already have these tests in InternalAggregationTestCase to check random insertions into the response xContent so that we don't fail on future changes in the response format. This change adds the same to AggregationsTests and runs on a whole aggregations tree. Unfortunately we need to exclude many places in the xContent from random insertion, but I added a long comment trying to explaine those.	2017-06-27 17:02:15 +02:00
Daniel Mitterdorfer	0405ef5892	Mute SignificantTermsAggregatorTests#testSignificance() Relates #25429	2017-06-27 15:58:22 +02:00
Daniel Mitterdorfer	54907ba352	Mute FullRollingRestartIT#testFullRollingRestart() Relates #25420	2017-06-27 10:41:48 +02:00
Daniel Mitterdorfer	ef9d099ffd	Mute IndexShardTests#testRelocatedShardCanNotBeRevivedConcurrently	2017-06-27 10:25:40 +02:00
Jason Tedor	f27aba34bf	Mark shutdown non-master nodes test as awaits fix This commit marks a failing test as awaits fix. The test is failing due to a primary shard not knowing its own local checkpoint in the global checkpoint tracker after recovery. If such a shard becomes primary after promotion, and is then subsequently relocated, it can lead to a violation of an assertion that when the primary context is transferred the knowledge of all in-sync local checkpoints is consistent with the global checkpoint on the relocation target. Relates #25415	2017-06-26 22:48:04 -04:00
Jason Tedor	dfd241e0a6	Remove default path settings This commit removes the default path settings for data and logs. With this change, we now ship the packages with these settings set in the elasticsearch.yml configuration file rather than going through the default.path.data and default.path.logs dance that we went through in the past. Relates #25408	2017-06-26 21:43:20 -04:00
Jason Tedor	cca18a2c35	Make plugin loading stricter Today we load plugins reflectively, looking for constructors that conform to specific signatures. This commit tightens the reflective operations here, not allowing plugins to have ambiguous constructors. Relates #25405	2017-06-26 21:42:53 -04:00
Jason Tedor	5a9fc8aa2a	Remove path.conf setting This commit removes path.conf as a valid setting and replaces it with a command-line flag for specifying a non-default path for configuration. Relates #25392	2017-06-26 15:18:29 -04:00
Jason Tedor	e9e7007a51	Remove LongTuple This commit removes an abstraction that was introduced when introducing the primary context. As this abstraction is used in exactly one place, we simply make that abstraction local to its usage so that we do not accumulate yet another general abstraction with exactly one usage. Relates #25402	2017-06-26 14:46:06 -04:00
Jason Tedor	56d3a5e6d8	Fix primary context sealing test This commit updates some assertions in the primary context sealing test after the restriction on updating allocation IDs from master and updating global checkpoint on replica while sealed were removed.	2017-06-26 14:17:33 -04:00
Jason Tedor	c6a03bc549	Introduce primary context (#25122 ) * Introduce primary context The target of a primary relocation is not aware of the state of the replication group. In particular, it is not tracking in-sync and initializing shards and their checkpoints. This means that after the target shard is started, its knowledge of the replication group could differ from that of the relocation source. In particular, this differing view can lead to it computing a global checkpoint that moves backwards after it becomes aware of the state of the entire replication group. This commit addresses this issue by transferring a primary context during relocation handoff. * Fix test * Add assertion messages * Javadocs * Barrier between marking a shard in sync and relocating * Fix misplaced call * Paranoia * Better latch countdown * Catch any exception * Fix comment * Fix wait for cluster state relocation test * Update knowledge via upate local checkpoint API * toString * Visibility * Refactor permit * Push down * Imports * Docs * Fix compilation * Remove assertion * Fix compilation * Remove context wrapper * Move PrimaryContext to new package * Piping for cluster state version This commit adds piping for the cluster state version to the global checkpoint tracker. We do not use it yet. * Remove unused import * Implement versioning in tracker * Fix test * Unneeded public * Imports * Promote on our own * Add tests * Import * Newline * Update comment * Serialization * Assertion message * Update stale comment * Remove newline * Less verbose * Remove redundant assertion * Tracking -> in-sync * Assertions * Just say no Friends do not let friends block the cluster state update thread on network operations. * Extra newline * Add allocation ID to assertion * Rename method * Another rename * Introduce sealing * Sealing tests * One more assertion * Fix imports * Safer sealing * Remove check * Remove another sealed check	2017-06-26 14:09:15 -04:00
Igor Motov	2a4fb950df	Tests: Fix array out of bounds exception in TemplateUpgradeServiceIT	2017-06-26 09:14:05 -04:00
Martijn van Groningen	a34f5fa812	Move more token filters to analysis-common module The following token filters were moved: stemmer, stemmer_override, kstem, dictionary_decompounder, hyphenation_decompounder, reverse, elision and truncate. Relates to #23658	2017-06-26 09:02:16 +02:00
Ryan Ernst	1583f81047	Test: Allow merging mock secure settings (#25387 ) While real secure settings (ie an ES keystore) cannot be merged together, mocked secure settings can and need to be sometimes merged. This commit adds a merge method to allow tests to merge together multiple instances of secure settings.	2017-06-25 10:19:51 -07:00
Simon Willnauer	4e4a104f4a	Remove remaining `index.mapper.single_type` setting usage from tests (#25388 ) This change removes the remaining explicitly specified `index.mapper.single_type` settings from tests in order to allow the removal of the setting. This is the already approved part of #25375 broken out to simplfiy reviews on	2017-06-25 12:25:41 +02:00
Jason Tedor	43c190339a	Remove dead logger prefix code When Log4j 2 was introduced, we removed support for the system property es.logger.prefix. Yet, some code was left behind. This commit removes that dead code. Relates #25377	2017-06-24 08:16:59 -04:00
Igor Motov	79a8336559	Tests: Improve stability and logging of TemplateUpgradeServiceIT tests (#25386 ) Relates to #25382	2017-06-23 17:31:21 -04:00
markharwood	973530f953	Added unit test coverage for SignificantTerms (#24904 ) Added unit test coverage for GlobalOrdinalsSignificantTermsAggregator, GlobalOrdinalsSignificantTermsAggregator.WithHash, SignificantLongTermsAggregator and SignificantStringTermsAggregator. Removed integration test. Relates #22278	2017-06-23 15:34:38 +01:00
Boaz Leskes	9ff1698aa7	testCreateShrinkIndex: removed left over debugging log line that violated linting	2017-06-23 12:14:39 +02:00
Boaz Leskes	0ebc49e8c6	testCreateShrinkIndex should make sure to use the right source stats when testing shrunk target	2017-06-23 11:05:59 +02:00
Tanguy Leroux	6a792d6d82	[Test] Add unit test for XContentParserUtilsTests.parseStoredFieldsValue (#25288 )	2017-06-23 10:54:26 +02:00
Simon Willnauer	4ae426a552	Remove remaining `index.mapping.single_type=false` (#25369 ) This change cleans up remaining tests to not use index.mapping.single_type=false but instead where applicable use a single type or markt the index as created with a pre 6.x version. Yet, there is still on leftover in the client tests that needs special attention. See `org.elasticsearch.client.SearchIT` Relates to #24961	2017-06-23 10:26:06 +02:00
Martijn van Groningen	9c511bc447	test: Replace OldIndexBackwardsCompatibilityIT#testOldClusterStates with a full cluster restart qa test OldIndexBackwardsCompatibilityIT#testOldClusterStates tested whether global and index metadata could be read from data directory, this can also be tested in full cluster qa test that checks cluster state via api. Relates to #24939	2017-06-23 09:54:05 +02:00
Boaz Leskes	d20cd6afcb	ESIndexLevelReplicationTestCase.ReplicationAction#execute should send exceptions to it's listener rather than bubble them up This is how TRA works as well.	2017-06-22 23:37:08 +02:00
Boaz Leskes	fb8c767737	testRecoveryAfterPrimaryPromotion shouldn't flush the replica with extra operations We don't yet have lucene rollbacks, so we can't bake those in	2017-06-22 23:24:43 +02:00
Simon Willnauer	59b625121b	Ensure `InternalEngineTests.testConcurrentWritesAndCommits` doesn't pile up commits (#25367 ) `InternalEngineTests.testConcurrentWritesAndCommits` can be very heavy on disks if threads are slow and the main thread keeps on pulling commit points holding on to many many segments. This commit adds some quadratic backoff to not pile up too many commits and to make sure indexing threads can make progress. This also now doesn't do busy waiting but waits on a latch with a timeout. Closes #25110	2017-06-22 21:50:11 +02:00
Simon Willnauer	a077fa9b07	[TEST] Add debug logging if an unexpected exception is thrown	2017-06-22 21:19:39 +02:00
Igor Motov	e6e5ae6202	TemplateUpgraders should be called during rolling restart (#25263 ) In #24379 we added ability to upgrade templates on full cluster startup. This PR invokes the same update procedure also when a new node first joins the cluster allowing to update templates on a rolling cluster restart as well. Closes #24680	2017-06-22 14:55:28 -04:00
Jason Tedor	8dcb1f5c7c	Initialize max unsafe auto ID timestamp on shrink When shrinking an index we initialize its max unsafe auto ID timestamp to the maximum of the max unsafe auto ID timestamps on the source shards. Relates #25356	2017-06-22 11:14:25 -04:00
Boaz Leskes	d963882053	Enable a long translog retention policy by default (#25294 ) #25147 added the translog deletion policy but didn't enable it by default. This PR enables a default retention of 512MB (same maximum size of the current translog) and an age of 12 hours (i.e., after 12 hours all translog files will be deleted). This increases to chance to have an ops based recovery, even if the primary flushed or the replica was offline for a few hours. In order to see which parts of the translog are committed into lucene the translog stats are extended to include information about uncommitted operations. Views now include all translog ops and guarantee, as before, that those will not go away. Snapshotting a view allows to filter out generations that are not relevant based on a specific sequence number. Relates to #10708	2017-06-22 17:08:14 +02:00
Simon Willnauer	29e80eea40	Remove `index.mapping.single_type=false` from core/tests (#25331 ) This change cleans up core tests to not use `index.mapping.single_type=false` but instead where applicable use a single type or markt the index as created with a pre 6.x version. Relates to #24961	2017-06-22 16:48:16 +02:00
Jason Tedor	97a2c4523d	Get short path name for native controllers Due to limitations with CreateProcessW on Windows (ultimately used by ProcessBuilder) with respect to maximum path lengths, we need to get the short path name for any native controllers before trying to start them in case the absolute path exceeds the maximum path length. This commit uses JNA to invoke the necessary Windows API for this to start the native controller using the short path. To be precise about the limitation here, the MSDN docs for CreateProcessW say for the command line parameter: >The command line to be executed. The maximum length of this string is >32,768 characters, including the Unicode terminating null character. If >lpApplicationName is NULL, the module name portionof lpCommandLine is >limited to MAX_PATH characters. This is exactly how the Windows implementation of Process in the JDK invokes CreateProcessW: with the executable name (lpApplicationName) set to NULL. Relates #25344	2017-06-22 07:59:58 -04:00
Yannick Welsch	e41eae9f05	Live primary-replica resync (no rollback) (#24841 ) Adds a replication task that streams all operations from the primary's global checkpoint to all replicas.	2017-06-22 13:35:34 +02:00
Adrien Grand	44e9c0b947	Upgrade to lucene-7.0.0-snapshot-ad2cb77. (#25349 ) Most notable changes: - better update concurrency: LUCENE-7868 - TopDocs.totalHits is now a long: LUCENE-7872 - QueryBuilder does not remove the boolean query around multi-term synonyms: LUCENE-7878 - removal of Fields: LUCENE-7500 For the `TopDocs.totalHits` change, this PR relies on the fact that the encoding of vInts and vLongs are compatible: you can write and read with any of them as long as the value can be represented by a positive int.	2017-06-22 12:35:33 +02:00
Jason Tedor	cc67d027de	Initialize sequence numbers on a shrunken index Bringing together shards in a shrunken index means that we need to address the start of history for the shrunken index. The problem here is that sequence numbers before the maximum of the maximum sequence numbers on the source shards can collide in the target shards in the shrunken index. To address this, we set the maximum sequence number and the local checkpoint on the target shards to this maximum of the maximum sequence numbers. This enables correct document-level semantics for documents indexed before the shrink, and history on the shrunken index will effectively start from here. Relates #25321	2017-06-21 13:40:45 -04:00
Nik Everett	4bbb7e828b	Port most snapshot/restore static bwc tests to qa:full-cluster-restart (#25296 ) Ports all of RepositoryUpgradabilityIT to qa:full-cluster-restart and ports as much of RestoreBackwardsCompatIT as possible into qa:full-cluster-restart.	2017-06-21 13:26:03 -04:00
Nik Everett	bec1a49a54	Javadoc: ThreadPool doesn't reject while shutdown (#23678 ) It caught me offguard yesterday that our executors won't always reject when the ThreadPool is shutdown.	2017-06-21 12:21:48 -04:00
Tanguy Leroux	49ebd65548	Add backward compatibility indices for 5.4.2	2017-06-21 10:42:26 +02:00
Tanguy Leroux	8274cd67ab	Add version v5.4.2 after release	2017-06-21 10:23:32 +02:00
Alexander Reelsen	68423989da	IndexMetaData: Add internal format index setting (#25292 ) This setting is supposed to ease index upgrades as it allows you to check for a new setting called `index.internal.version` which can be used to check before upgrading indices.	2017-06-21 09:30:46 +02:00
Simon Willnauer	86a544de3b	Ensure we never read from a closed MockSecureSettings object (#25322 ) If secure settings are closed after the node has been constructed no key-store access is permitted. We should also try to be as close as possible to the real behavior if we mock secure settings. This change also adds the same behavior as bootstrap has to InternalTestCluster to ensure we fail if we try to read from secure settings after the node has been constructed.	2017-06-21 08:14:38 +02:00
Simon Willnauer	406a15e7a9	Fix settings serialization to not serialize secure settings or not take the total size into account (#25323 )	2017-06-21 08:13:56 +02:00
Jason Tedor	1f14d042f6	Initialize primary term for shrunk indices Today when an index is shrunk, the primary terms for its shards start from one. Yet, this is a problem as the index will already contain assigned sequence numbers across primary terms. To ensure document-level sequence number semantics, the primary terms of the target shards must start from the maximum of all the shards in the source index. This commit causes this to be the case. Relates #25307	2017-06-20 15:12:39 -04:00
Guillaume Le Floch	93e29d290f	Tests: Refactor NodeTests settings (#25309 ) This pull request aims to use the method baseSettings already present in the class.	2017-06-20 15:17:52 +02:00
Jun Ohtani	62d1969595	Parse synonyms with the same analysis chain (#8049 ) * [Analysis] Parse synonyms with the same analysis chain Synonym Token Filter / Synonym Graph Filter tokenize synonyms with whatever tokenizer and token filters appear before it in the chain. Close #7199	2017-06-20 21:50:33 +09:00
Nik Everett	3261586cac	Tweak reindex cancel logic and add many debug logs (#25256 ) I'm still trying to hunt down rare failures in the cancelation tests for reindex and friends. Here is the latest: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=ubuntu/876/console It doesn't show much, other than that one of the tasks didn't kill itself when asked to cancel. So I'm going a bit crazy with debug logging so that the next time this comes up I can trace exactly what happened. Additionally, this tweaks the logic around how rethrottles were performed around cancel. Previously we set the `requestsPerSecond` to `0` when we cancelled the task. That was the "old way" to set them to inifity which was the intent. This switches that from `0` to `Float.MAX_VALUE` which is the "new way" to set the `requestsPerSecond` to infinity. I don't know that this is much better, but it feels better.	2017-06-19 18:46:42 -04:00
Jay Modi	0d6c47fe14	Keystore CLI should use the AddFileKeyStoreCommand for files (#25298 ) This commit fixes a typo in the KeyStoreCli class. The add-file command was incorrectly set to use the AddStringKeyStoreCommand instead of the AddFileKeyStoreCommand.	2017-06-19 12:43:26 -06:00
Yannick Welsch	1a20760d79	Simplify IndexShard indexing and deletion methods (#25249 ) Indexing or deleting documents through the IndexShard interface is quite complex and error-prone. It requires multiple calls, e.g. first prepareIndexOnPrimary, then do some checks if mapping updates have occurred, then do the actual indexing using index(...) etc. Currently each consumer of the interface (local recovery, peer recovery, replication) has additional custom checks built around it to deal with mapping updates, some of which are even inconsistent. This commit aims at reducing the complexity by exposing a simpler interface on IndexShard. There are no more prepare*** methods and the mapping complexity is also hidden, but still giving callers a possibility to implement custom logic to deal with mapping updates.	2017-06-19 20:11:54 +02:00
David Kyle	d1be2ecfdb	Initialise empty lists in BaseTaskResponse constructor (#25290 ) * Initialise empty lists in BaseTaskResponse constructor * Remove little used default constructor which leaves uninitialised members	2017-06-19 16:37:21 +01:00
Luca Cavanna	d9ec2a23c5	Remove (deprecated) support for '+' in index expressions (#25274 ) Relates to #24515	2017-06-19 15:19:17 +02:00
Tanguy Leroux	e4f4886d40	[Test] Extend parsing checks for DocWriteResponses (#25257 ) This commit changes the parsing logic of DocWriteResponse, ReplicationResponse and GetResult so that it skips any unknown additional fields (for forward compatibility reasons). This affects the IndexResponse, UpdateResponse,DeleteResponse and GetResponse objects.	2017-06-19 13:19:09 +02:00
Martijn van Groningen	bcaa413b0b	test: Port the remaining old indices search tests to full cluster restart qa module Also tweaked the qa module's gradle file to actually run bwc tests against all index compat versions. Relates to #24939	2017-06-19 12:27:24 +02:00
Simon Willnauer	dc02b32650	Simplify connection closing and cleanups in TcpTransport (#25250 ) Today we maintain a map of open connections in order to close them when a low level channel gets closed or handles a failure. We also spawn a thread due to some tricky concurrency issues especially with respect to netty since they listener might be called on a transport / boss thread. Executions on those threads must not be blocking since otherwise we will likely deadlock the event processing which adds to the complexity of the concurrency model in this class. This change associates the connection with the close callback that every channel invokes once it's closed which allows us to remove the connections map. A relaxed non-blocking concurrency model in the connection close listener allows cleaning up connected nodes without blocking on any lock.	2017-06-19 09:19:45 +02:00
Boaz Leskes	7291aba8ae	enable debug logging for testMasterFailoverDuringIndexingWithMappingChanges	2017-06-18 22:40:13 +02:00
Jason Tedor	4c28e781dd	Fix failing delete index test This test is failing because delete /{index} requests no longer support index matching an alias. This commit removes testing such requests again aliases. Closes #25284	2017-06-18 15:32:43 -04:00
Christoph Büscher	3f9f713b44	Add AwaitsFix on IndicesRequestIT due to #25284	2017-06-18 18:56:41 +02:00
Christoph Büscher	e99ced06cc	[Tests] Check that parsing aggregations works in a forward compatible way (#25219 ) This change adds tests for the aggregation parsing that try to simulate that we can parse existing aggregations in a forward compatible way in the future, ignoring potential newly added fields or substructures to the xContent response.	2017-06-17 13:06:31 +02:00
Ali Beyad	0c697348f4	Adds AwaitsFix on snapshot test failing due to #25281	2017-06-16 16:57:01 -04:00
Simon Willnauer	f18b0d293c	Move TransportStats accounting into TcpTransport (#25251 ) Today TcpTransport is the de-facto base-class for transport implementations. The need for all the callbacks we have in TransportServiceAdaptor are not necessary anymore since we can simply have the logic inside the base class itself. This change moves the stats metrics directly into TcpTransport removing the need for low level bytes send / received callbacks.	2017-06-16 22:34:11 +02:00
Nik Everett	ecc87f613f	Move pre-configured "keyword" tokenizer to the analysis-common module (#24863 ) Moves the keyword tokenizer to the analysis-common module. The keyword tokenizer is special because it is used by CustomNormalizerProvider so I pulled it out into its own PR. To get the move to work I've reworked the lookup from static to one using the AnalysisRegistry. This seems safe enough. Part of #23658.	2017-06-16 11:48:15 -04:00
Luca Cavanna	b5cea6980b	Delete index API to work only against concrete indices (#25268 ) With #23997 we have introduced a new internal index option that allows to resolve index expressions only against concrete indices while ignoring aliases. Such index option was applied to IndicesAliasesRequest, so that the index part of alias actions would only be resolved against concrete indices. Same is done in this commit with delete index request. Deleting aliases has always been confusing as some users expect it to only remove the alias from the index (which has its own specific API). Even worse, in case of filtered aliases, deleting an alias may leave users with the expectation that only the documents that match the filter are deleted, which was never the case. To address all this confusion, delete index api works now only against concrete indices. WIldcard expressions will be only resolved against concrete index, as if aliases didn't exist. If one tries to delete against an alias, an IndexNotFoundException will be thrown regardless of whether the alias exists or not, as a concrete index with such a name doesn't exist. Closes #2318	2017-06-16 17:46:01 +02:00
Boaz Leskes	9ddea539f5	Introduce translog size and age based retention policies (#25147 ) This PR extends the TranslogDeletionPolicy to allow keeping the translog files longer than what is needed for recovery from lucene. Specifically, we allow specifying the total size of the files and their maximum age (i.e., keep up to 512MB but no longer than 12 hours). This will allow making ops based recoveries more common. Note that the default size and age still set to 0, maintaining current behavior. This is needed as the other components in the system are not yet ready for a longer translog retention. I will adapt those in follow up PRs. Relates to #10708	2017-06-16 09:09:51 +02:00
Ali Beyad	350125ed2a	Improves snapshot logging and snapshoth deletion error handling (#25264 ) This commit does two things: 1. Adds logging at the DEBUG level for when the index-N blob is updated. 2. When attempting to delete a snapshot, if the snapshot was not found in the repository data, an exception is now thrown instead of silently ignoring the lack of presence of the snapshot in the repository data.	2017-06-15 19:43:19 -04:00
Christoph Büscher	d3442f7d0c	Add unit test for PathHierarchyTokenizerFactory (#24984 )	2017-06-15 19:18:33 +02:00
Guillaume Le Floch	a9014dfcc5	Deprecate tribe service This commit deprecates the tribe service so that deprecation log messages are delivered if a tribe node is configured. Relates #24598	2017-06-15 12:41:05 -04:00
Martijn van Groningen	428e70758a	Moved more token filters to analysis-common module. The following token filters were moved: `edge_ngram`, `ngram`, `uppercase`, `lowercase`, `length`, `flatten_graph` and `unique`. Relates to #23658	2017-06-15 18:28:31 +02:00
Jim Ferenczi	2a78b0a19f	[Test] Make sure that SearchAfterSortedDocQueryTests uses a single threaded searcher	2017-06-15 18:13:38 +02:00
markharwood	7a3155368c	Test fix - removed superfluous assertion (#25247 ) Closes #25245	2017-06-15 16:29:25 +01:00
Martijn van Groningen	fe02829aac	test: Ported more OldIndexBackwardsCompatibilityIT tests to full cluster restart qa tests. (#25173 ) Relates to #24939	2017-06-15 14:48:06 +02:00
Adrien Grand	1b90c46a53	Allow reader wrappers to have different live docs but the same cache key. Relates to #19856	2017-06-15 13:51:46 +02:00
Boaz Leskes	648b4717a4	move assertBusy to use CheckException (#25246 ) We use assertBusy in many places where the underlying code throw exceptions. Currently we need to wrap those exceptions in a RuntimeException which is ugly.	2017-06-15 13:24:07 +02:00
Tanguy Leroux	27f1206999	Use SPI in High Level Rest Client to load XContent parsers (#25098 ) This commit adds a NamedXContentProvider interface that can be implemented by plugins or modules using Java's SPI feature in order to provide additional NamedXContent parsers to external applications like the Java High Level Rest Client.	2017-06-15 12:50:02 +02:00
Adrien Grand	5a6fa62844	Speed up PK lookups at index time. (#19856 ) At index time Elasticsearch needs to look up the version associated with the `_id` of the document that is being indexed, which is often the bottleneck for indexing. While reviewing the output of the `jfr` telemetry from a Rally benchmark, I saw that significant time was spent in `ConcurrentHashMap#get` and `ThreadLocal#get`. The reason is that we cache lookup objects per thread and segment, and for every indexed document, we first need to look up the cache associated with this segment (`ConcurrentHashMap#get`) and then get a state that is local to the current thread (`ThreadLocal#get`). So if you are indexing N documents per second and have S segments, both these methods will be called N*S times per second. This commit changes version lookup to use a cache per index reader rather than per segment. While this makes cache entries live for less long, we now only need to do one call to `ConcurrentHashMap#get` and `ThreadLocal#get` per indexed document.	2017-06-15 10:17:42 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Ryan Ernst	caf7792db1	Scripting: Rename SearchScript.needsScores to needs_score (#25235 ) This commit renames the needsScores method so as to make it automatically generatable, based on the name of the `_score` variable which is available in search scripts. It also adds documentation to ScriptContext to explain the naming and signature of such methods.	2017-06-14 22:01:19 -07:00
Jim Ferenczi	68deda6d03	FastVectorHighlighter should not cache the field query globally (#25197 ) This commit removes the global caching of the field query and replaces it with a caching per field. Each field can use a different `highlight_query` and the rewriting of some queries (prefix, automaton, ...) depends on the targeted field so the query used for highlighting must be unique per field. There might be a small performance penalty when highlighting multiple fields since the query needs to be rewritten once per highlighted field with this change. Fixes #25171	2017-06-15 00:33:01 +02:00
Lee Hinman	4a30e23365	Remove QUERY_AND_FETCH BWC for pre-5.3.0 nodes (#25223 ) * Remove QUERY_AND_FETCH BWC for pre-5.3.0 nodes This was a BWC layer where we expicitly set the `search_type` to "query_and_fetch" when a single node is queried on pre-5.3 nodes. Since 6.0 no longer needs to be compatible with 5.3 nodes, this can be removed. * Fix indentation * Remove unused QUERY_FETCH_ACTION_NAME constant	2017-06-14 15:42:29 -06:00
Zachary Tong	52719b2118	Add more missing AggregationBuilder getters (#25198 ) * Add more missing AggregationBuilder getters - getMetadata for all aggs - various getters on TermsAggBuilder (without "get" prefix to maintain convention) - Also makes InternalSum's ctor public, to follow suit of other metrics (min/max/avg/etc)	2017-06-14 14:31:01 -04:00
Lee Hinman	aa3134c093	Refactor TransportShardBulkAction.executeUpdateRequest and add tests This splits `executeUpdateRequest` into separate parts and adds some unit tests for the behavior in it. The actual behavior has not been changed.	2017-06-14 09:27:58 -06:00
Adrien Grand	cadd31b3a8	Make sure range queries are correctly profiled. (#25108 ) We introduced a new API for ranges in order to be able to decide whether points or doc values would be more appropriate to execute a query, but since `ProfileWeight` does not implement this API, the optimization is disabled when profiling is enabled.	2017-06-14 16:31:16 +02:00
Martijn van Groningen	e333955557	Remove PrefixAnalyzer, because it is no longer used.	2017-06-14 08:59:10 +02:00
Ryan Ernst	9ec1fc7b02	Internal: Remove Strings.cleanPath (#25209 ) This commit removes the cleanPath method, in favor of using java's Path.normalize().	2017-06-13 21:09:45 -07:00
Simon Willnauer	bc7ec68e76	Add Cross Cluster Search support for scroll searches (#25094 ) To complete the cross cluster search capabilities for all search types and function this change adds cross cluster search support for scroll searches.	2017-06-13 17:22:49 +02:00
Sergey Galkin	1c95cbc4e8	Rollover max docs should only count primaries (#24977 ) max_doc condition for index rollover should use document count only from primary shards Fixes #24217	2017-06-13 14:30:46 +02:00
Simon Willnauer	01d7c217f6	Add remote cluster infrastructure to fetch discovery nodes. (#25123 ) In order to add scroll support for cross cluster search we need to resolve the nodes encoded in the scroll ID to send requests to the corresponding nodes. This change adds the low level connection infrastructure that also ensures that connections are re-established if the cluster is disconnected due to a network failure or restarts. Relates to #25094	2017-06-13 14:23:56 +02:00
Simon Willnauer	186c16ea41	Ensure pending transport handlers are invoked for all channel failures (#25150 ) Today if a channel gets closed due to a disconnect we notify the response handler that the connection is closed and the node is disconnected. Unfortunately this is not a complete solution since it only works for published connections. Connections that are unpublished ie. for discovery can indefinitely hang since we never invoke their handers when we get a failure while a user is waiting for the response. This change adds connection tracking to TcpTransport that ensures we are notifying the corresponding connection if there is a failure on a channel.	2017-06-13 09:37:05 +02:00
Lee Hinman	ee1113c902	Tweak AggregatorBase.addRequestCircuitBreakerBytes This modifies a method Mark added to the AggregatorBase that allows aggregations to add additional memory tracking for datastructures used during execution. If an aggregation would like to reclaim circuit breaker reserved bytes by adding a negative number, `addWithoutBreaking` should be used instead of `addEstimateBytesAndMaybeBreak`. Resolves #24511	2017-06-12 12:55:50 -06:00
Jason Tedor	bb66f3b76b	Explicitly reject duplicate data paths Duplicate data paths already fail to work because we would attempt to take out a node lock on the directory a second time which will fail after the first lock attempt succeeds. However, how this failure manifests is not apparent at all and is quite difficult to debug. Instead, we should explicitly reject duplicate data paths to make the failure cause more obvious. Relates #25178	2017-06-12 12:55:19 -04:00
Jason Tedor	982900eabf	Do not swallow node lock failed exception When attempting to obtain the node lock, if an exception is thrown it is not logged. This makes debugging difficult. This commit causes such an exception to be logged. Relates #25176	2017-06-12 11:42:45 -04:00
markharwood	518cda6637	Aggregations bug: Significant_text fails on arrays of text. (#25030 ) * Aggregations bug: Significant_text fails on arrays of text. The set of previously-seen tokens in a doc was allocated per-JSON-field string value rather than once per JSON document meaning the number of docs containing a term could be over-counted leading to exceptions from the checks in significance heuristics. Added unit test for this scenario Closes #25029	2017-06-12 14:02:54 +01:00
Jim Ferenczi	7ab3d5d04a	Speed up sorted scroll when the index sort matches the search sort (#25138 ) Sorted scroll search can use early termination when the index sort matches the scroll search sort. The optimization can be done after the first query (which still needs to collect all documents) by applying a query that only matches documents that are greater than the last doc retrieved in the previous request. Since the index is sorted, retrieving the list of documents that are greater than the last doc only requires a binary search on each segment. This change introduces this new query called `SortedSearchAfterDocQuery` and apply it when possible. Scrolls with this optimization will search all documents on the first request and then will early terminate each segment after $size doc for any subsequent requests. Relates #6720	2017-06-12 09:33:30 +02:00
Boaz Leskes	f34136eda4	TranslogTests.testWithRandomException ignored a possible simulated OOM when trimming files	2017-06-12 08:32:55 +02:00
Boaz Leskes	cfb5f6a5a6	Adapt TranslogTests.testWithRandomException to checkpoint syncing on trim #25005 changed the translog dynamic to fsync the checkpoint before trimming a file. This changed the dynamics of potential failure modes which requires a change to testWithRandomException - it's now possible that we had an exception but the translog was trimmed. Closes #25133	2017-06-11 23:17:10 +02:00
Jason Tedor	dcf57f296e	Fix get mappings HEAD requests Get mappings HEAD requests incorrectly return a content-length header of 0. This commit addresses this by removing the special handling for get mappings HEAD requests, and just relying on the general mechanism that exists for handling HEAD requests in the REST layer. Relates #23192	2017-06-11 14:58:56 -04:00
Boaz Leskes	9b8754e4c2	TranslogTests#commit didn't allow for a concurrent closing of a view The view closing will trim unneeded files but there is a small window where they may still be around.	2017-06-11 19:09:01 +02:00
Jason Tedor	7182577904	Fix handling of exceptions thrown on HEAD requests Today when an exception is thrown handling a HEAD request, the body is swallowed before the channel has a chance to see it. Yet, the channel is where we compute the content length that would be returned as a header in the response. This is a violation of the HTTP specification. This commit addresses the issue. To address this issue, we remove the special handling in bytes rest response for HEAD requests when an exception is thrown. Instead, we let the upstream channel handle the special case, as we already do today for the non-exceptional case. Relates #25172	2017-06-10 23:44:18 -04:00
Jason Tedor	5108fa7529	Remove unneeded weak reference from prefix logger We have a custom logger implementation known as a prefix logger that is used to write every message by the logger with a given prefix. This is useful for node-level, index-level, and shard-level messages where we want to log the node name, index name, and shard ID, respectively, if possible. The mechanism that we employ is that of a marker. Log4j has a built-in facility for managing these markers, but its effectively a memory leak because these markers are held in a map and can never be released. This is problematic for us since indices and shards do not necessarily have infinite life spans and so on a node where there are many indices being creted and destroyed, this infinite lifespan can be a problem indeed. To solve this, we use our own cache of markers. This is necessary to prevent too many instances of the marker for the same prefix from being created (just think of all the shard-level components that exist in the system), and to workaround the effective leak in Log4j. These markers are stored as weak references in a weak hash map. It is these weak references that are unneeded. When a key is removed from a weak hash map, the corresponding entry is placed on a reference queue that is eventually cleared. This commit simplifies prefix logger by removing this unnecessary weak reference wrapper. Relates #22460	2017-06-10 13:20:45 -04:00
Chris Earle	af7b479e12	"shard started" should show index and shard ID (#25157 ) When the cluster state is updated with Shard Started entries, it simply adds "shard-started" as the source of the change. This adds the index name and shard ID so that we can see who/what is spamming the changes when the index creation step has already left the cluster state.	2017-06-09 14:52:42 -04:00
Boaz Leskes	b8fef3309c	await fix testWithRandomException	2017-06-09 20:31:39 +02:00
Jason Tedor	8a45c3105f	Change BWC versions on create index response This commit changes the BWC versions on the create index response now that the index name in the response is supported since 5.6.0. Relates #25139	2017-06-09 13:52:08 -04:00
Sergey Novikov	7c8657df0e	Return the index name on a create index response This commit modifies the create index response so that it includes the index name. Relates #25139	2017-06-09 13:47:47 -04:00
Koen De Groote	64888f6f01	Correctly format arrays in output There are a few places where arrays are output in messages yet the output would merely use the default toString implementation rather than actually putting the content of the array in the message. This commit fixes the issue. Relates #24340	2017-06-09 11:45:07 -04:00
Christoph Büscher	823cbb437b	[Test] Extending parsing checks for SearchResponse (#25148 ) This change extends the tests and parsing of SearchResponse to make sure we can skip additional fields the parser doesn't know for forward compatibility reasons.	2017-06-09 17:33:44 +02:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Martijn van Groningen	c7ae27d57f	nested: In case of a single type the _id field should be added to the nested document instead of _uid field. When `index.mapping.single_type` is `true` the `_uid` field is not used and instead `_id` field is used. Prior to this change nested documents would in this case still use the `_uid` field to mark to what root document they belong to. In case of deleting documents this could lead to only the root Lucene document to be deleted and not the nested Lucene documents. This broke the docid block ordering the block join relies on in order to work correctly and thus causing the `nested` query, `nested` aggregation, nested sorting and nested inner hits to either fail or yield incorrect results. This bug only manifests in 6.0.0-ALPHA2 release and snaphots (5.5.0-SNAPSHOT, 5.6.0-SNAPSHOT, 6.0.0-SNAPSHOT).	2017-06-09 14:57:11 +02:00
Adrien Grand	87d19b21c7	`type` and `id` are lost upon serialization of `Translog.Delete`. (#24586 ) This was introduced in #24460: the constructor of `Translog.Delete` that takes a `StreamInput` does not set the type and id. To make it a bit more robust, I made fields final so that forgetting to set them would make the compiler complain.	2017-06-09 14:56:23 +02:00
Sergey Galkin	dc5aa993e0	Fix NPE in token_count datatype with null value (#25046 ) Fixes an issue with the handling of null values for the token_count data type. Closes #24928	2017-06-09 14:13:05 +02:00
Jim Ferenczi	8250aa4267	Remove the postings highlighter and make unified the default highlighter choice (#25028 ) This change removes the `postings` highlighter. This highlighter has been removed from Lucene master (7.x) because it behaves exactly like the `unified` highlighter when index_options is set to `offsets`: https://issues.apache.org/jira/browse/LUCENE-7815 It also makes the `unified` highlighter the default choice for highlighting a field (if `type` is not provided). The strategy used internally by this highlighter remain the same as before, it checks `term_vectors` first, then `postings` and ultimately it re-analyzes the text. Ultimately it rewrites the docs so that the options that the `unified` highlighter cannot handle are clearly marked as such. There are few features that the `unified` highlighter is not able to handle which is why the other highlighters (`plain` and `fvh`) are still available. I'll open separate issues for these features and we'll deprecate the `fvh` and `plain` highlighters when full support for these features have been added to the `unified`.	2017-06-09 14:09:57 +02:00
Christoph Büscher	eca4f24b16	[Test] Adding test for parsing SearchShardFailure leniently (#25144 ) This change extends the tests and parsing of SearchShardFailure to make sure we can skip fields the parser doesn't know for forward compatibility reasons.	2017-06-09 12:46:09 +02:00
Christoph Büscher	79057b1c61	[Test] Extending checks for Suggestion parsing (#25132 ) When parsing responses we should be ignoring any new unknown fields or inner objects in most cases to be forward compatible with changes in core on the client side. This change adds test for this for Suggestions and its various subclasses to check if we are able to ignore new fields and objects in the xContent.	2017-06-09 10:11:08 +02:00
Tal Levy	340909582f	remove Ingest's Internal Template Service (#25085 ) Ingest was using it's own wrapper around TemplateScripts and the ScriptService. This commit removes that abstraction	2017-06-08 15:24:03 -07:00
Lee Hinman	119f8ed9f0	Correctly enable _all for older 5.x indices When we disabled `_all` by default for indices created in 6.0, we missed adding a layer that would handle the situation where `_all` was not enabled in 5.x and then the cluster was updated to 6.0, this means that when the cluster was updated the `_all` field would be disabled for 5.x indices and field values would not be added to the `_all` field. This adds a compatibility layer for 5.x indices where we treat the default enabled value for the `_all` field to be `true` if unset on 5.x indices. Resolves #25068	2017-06-08 14:37:44 -06:00
Jason Tedor	1708f1773b	Mark Log4j API dependency as non-optional The Log4j dependency is separated into two artifacts, the API and the core implementation. This is to enable replacing Log4j on the backend through the SLF4J bridge with another logging implementation. For this reason, the dependencies are marked as optional. This causes confusion amongst users as to use the bridge, the API should be non-optional since it is needed for the bridge to function correctly. While they could pull it into their application directly, it would be clearer if we simply marked this depdendency as non-optional. Note that this does not mean that users have to use Log4j for logging in their application, so we are not marking core as required, it only clarifies what they need to be able to plug in a different logging implementation. Relates #25136	2017-06-08 16:09:34 -04:00
Lee Hinman	050b7cd0f9	Include empty mappings in GET /{index}/_mappings requests (#25118 ) Previously this would output: ``` GET /test-1/_mappings { } ``` And after this change: ``` GET /test-1/_mappings { "test-1": { "mappings": {} } } ``` To bring parity back to the REST output after #24723. Relates to #25090	2017-06-08 10:57:04 -06:00
Lee Hinman	5b2ab96364	Return index name and empty map for /{index}/_alias with no aliases Previously in #24723 we changed the `_alias` API to not go through the `RestGetIndicesAction` endpoint, instead creating a `RestGetAliasesAction` that did the same thing. This changes the formatting so that it matches the old formatting of the endpoint, before: ``` GET /test-1/_alias { } ``` And after this change: ``` GET /test-1/_alias { "test-1": { "aliases": {} } } ``` This is related to #25090	2017-06-08 10:03:03 -06:00
Eli Skeggs	ee0e921643	Fix typo in GeoUtils#isValidLongitude (#25121 ) GeoUtils#isValidLongitude is inconsistent with GeoUtils#isValidLatitude. Neither technically need the isInfinite() check because they then compare against min and max values.	2017-06-08 17:23:22 +02:00
Christoph Büscher	a0afa917ac	[Tests] Check QueryProfileShardResult parser robustness for new fields (#25130 ) When parsing resonses we should be ignoring any new unknown fields or inner objects in most cases to be forward compatible with changes in core on the client side. This change adds test for this for QueryProfileShardResult and nested substructures and changes the parsing code where necessary to be able to ignore new fields and objects in the xContent.	2017-06-08 16:40:00 +02:00
Nik Everett	4a8c09c5f1	Make randomVersionBetween work with unreleased versions (#25042 ) Test: randomVersionBetween works with unreleased Modifies randomVersionBetween so that it works with unreleased versions. This should make switching a version from unreleased to released much simpler.	2017-06-08 10:19:06 -04:00
Yannick Welsch	cd57395c98	Use correct primary term for replicating NOOPs (#25128 ) NOOPs should be, same as for indexing operations, written on the replica using the original operation term instead of the current term of the replica.	2017-06-08 14:20:26 +02:00
Martijn van Groningen	326fa33d4e	fielddata: Binary script doc values should make a deep copy of the BytesRef before populating it in the values array. Added common base class for ScriptDocValues.Strings and ScriptDocValues.BytesRefs now that these classes are very similar. Also cleaned up the BinaryDVFieldDataTests: * Use junit assertions instead of hamcrest * Use BytesRef directly instead of byte[] Closes #24785	2017-06-08 13:20:35 +02:00
Jim Ferenczi	eeac4b9721	Fix Fast Vector Highlighter NPE on match phrase prefix (#25116 ) The FVH fails with an NPE when a match phrase prefix is rewritten in an empty phrase query. This change makes sure that the multi match query rewrites to a MatchNoDocsQuery (instead of an empty phrase query) when there is a single term and that term does not expand to any term in the index. Fixes #25088	2017-06-08 12:27:11 +02:00
Jim Ferenczi	36a5cf8f35	Automatically early terminate search query based on index sorting (#24864 ) This commit refactors the query phase in order to be able to automatically detect queries that can be early terminated. If the index sort matches the query sort, the top docs collection is early terminated on each segment and the computing of the total number of hits that match the query is delegated to a simple TotalHitCountCollector. This change also adds a new parameter to the search request called `track_total_hits`. It indicates if the total number of hits that match the query should be tracked. If false, queries sorted by the index sort will not try to compute this information and and will limit the collection to the first N documents per segment. Aggregations are not impacted and will continue to see every document even when the index sort matches the query sort and `track_total_hits` is false. Relates #6720	2017-06-08 12:10:46 +02:00
Jim Ferenczi	21a57c1494	Always use DisjunctionMaxQuery to build cross fields disjunction (#25115 ) This commit modifies query_string, simple_query_string and multi_match queries to always use a DisjunctionMaxQuery when a disjunction over multiple fields is built. The tiebreaker is set to 1 in order to behave like the boolean query in terms of scoring. The removal of the coord factor in Lucene 7 made this change mandatory to correctly handle minimum_should_match. Closes #23966	2017-06-08 11:18:17 +02:00
Simon Willnauer	d6d416cacc	Break out clear scroll logic from TransportClearScrollAction (#25125 ) This change extracts the main logic from `TransportClearScrollAction` into a new class `ClearScrollController` and adds a corresponding unit test. Relates to #25094	2017-06-08 11:13:08 +02:00
Simon Willnauer	bdc3a16fa4	Fix naminig in GroupedActionListener GroupedActionListener still had some members named from it's specialization before it was factored out in a general purpose class.	2017-06-08 10:21:15 +02:00
Adrien Grand	a8ea2f0df4	Leverage scorerSupplier when applicable. (#25109 ) The `scorerSupplier` API allows to give a hint to queries in order to let them know that they will be consumed in a random-access fashion. We should use this for aggregations, function_score and matched queries.	2017-06-08 10:19:38 +02:00
Boaz Leskes	087f182481	Translog file recovery should not rely on lucene commits (#25005 ) When we open a translog, we rely on the `translog.ckp` file to tell us what the maximum generation file should be and on the information stored in the last lucene commit to know the first file we need to recover. This requires coordination and is currently subject to a race condition: if a node dies after a lucene commit is made but before we remove the translog generations that were unneeded by it, the next time we open the translog we will ignore those files and never delete them (I have added tests for this). This PR changes the approach to have the translog store both of those numbers in the `translog.ckp`. This means it's more self contained and easier to control. This change also decouples the translog recovery logic from the specific commit we're opening. This prepares the ground to fully utilize the deletion policy introduced in #24950 and store more translog data that's needed for Lucene, keep multiple lucene commits around and be free to recover from any of them.	2017-06-08 09:21:28 +02:00
Simon Willnauer	ce24331d1f	Add helper methods to TransportActionProxy to identify proxy actions and requests (#25124 ) Downstream users of out network intercept infrastructure need this information which is hidden due to member and class visibility.	2017-06-08 09:07:22 +02:00
Jack Conradson	d187fa78fd	Generate Painless Factory for Creating Script Instances (#25120 )	2017-06-07 16:06:11 -07:00
Christoph Büscher	9e741cd13d	Tests: Add ability to generate random new fields for xContent parsing test (#23437 ) For the response parsing we want to be lenient when it comes to parsing new xContent fields. In order to ensure this in our testing, this change adds a utility method to XContentTestUtils that takes xContent bytes representation as input and recursively a random field on each object level. Sometimes we also want to exclude a whole subtree from this treatment (e.g. skipping "_source"), other times an element (e.g. "fields", "highlight" in SearchHit) can have arbitraryly named objects. Those cases can be specified as exceptions.	2017-06-07 21:01:20 +02:00
Jim Ferenczi	68f1d4df5a	bump the Lucene version for Version 5.5 and 5.6 after the upgrade to Lucene 6.6.0	2017-06-07 19:32:13 +02:00
Ryan Ernst	2057bbc6c5	Scripting: Remove unnecessary intermediate script compilation methods on QueryShardContext (#25093 ) This commit removes wrapper methods on QueryShardContext used to compile scripts. Instead, the script service is made accessible in the context, and calls to compile can be made directly. This will ease transition to each of those location becoming their own context, since they would no longer be able to expect the same script class type.	2017-06-07 08:24:18 -07:00
Yannick Welsch	26ec89173b	Remove TranslogRecoveryPerformer (#24858 ) Splits TranslogRecoveryPerformer into three parts: - the translog operation to engine operation converter - the operation perfomer (that indexes the operation into the engine) - the translog statistics (for which there is already RecoveryState.Translog) This makes it possible for peer recovery to use the same IndexShard interface as bulk shard requests (i.e. Engine operations instead of Translog operations). It also pushes the "fail on bad mapping" logic outside of IndexShard. Future pull requests could unify the BulkShard and peer recovery path even more.	2017-06-07 17:11:27 +02:00
Jim Ferenczi	c8bf7ecaed	Higlighters: Fix MultiPhrasePrefixQuery rewriting (#25103 ) The unified highlighter rewrites MultiPhrasePrefixQuery to SpanNearQuer even when there is a single term in the phrase. Though SpanNearQuery throws an exception when the number of clauses is less than 2. This change returns a simple PrefixQuery when there is a single term and builds the SpanNearQuery otherwise. Relates #25088	2017-06-07 16:14:28 +02:00
Tim Brooks	233c63fc63	Add version 5.6 to versions (#25084 ) * Add version 5.6 to versions * Fix test * Remove 5.4.2 constant	2017-06-07 09:59:27 -04:00
Boaz Leskes	8e15186293	Update `IndexShard#refreshMetric` via a `ReferenceManager.RefreshListener` (#25083 ) The PR takes a different approach to solve #24806 than currently implemented via #25052. The `refreshMetric` that IndexShard maintains is updated using the refresh listeners infrastructure in lucene. This means that we truly count all refreshes that lucene makes and not have to worry about each individual caller (like `IndexShard@refresh` and `Engine#get()`)	2017-06-07 10:54:10 +02:00
Martijn van Groningen	db8aa8e94e	Changed inner_hits to work with the new join field type and at the same time maintaining support for the `_parent` meta field type/ Relates to #20257	2017-06-07 10:52:49 +02:00
Yu	14913fdc37	keep _parent field while updating child type mapping (#24407 ) parent/child: Allow updating mapping without specifying `_parent` field on each update. Prior to this change when a mapping has a `_parent` field then any update (also updates that didn't modify the `_parent` field) to the mapping involved specifying the `_parent` field again. With this change specifying the `_parent` field on each mapping update is no longer required. Closes #23381	2017-06-07 10:51:21 +02:00
Jason Tedor	2f5f27fafa	Remove unnecessary callback interface We have a callback interface that is not needed because it is effectively the same as java.util.function.Consumer. This commit removes it. Relates #25089	2017-06-06 20:50:03 -04:00
Tim Brooks	feca0a9f33	Bumping version to v6.0.0-alpha3 (#25077 )	2017-06-06 15:47:23 -05:00
Jason Tedor	1a681a928d	Modify cluster state callback in recovery land We use a callback in recovery land during primary relocation to ensure the relocation target is on at least the same version as the relocation source. This callback is typed as a Callback<Long> which is an unnecessary custom type (we can use Consumer<T> or the appropriate primitive callbacks). Here, we can use LongConsumer. Relates #25081	2017-06-06 16:29:10 -04:00
Jason Tedor	e03c4938c5	GET aliases should 404 if aliases are missing Previously the HEAD and GET aliases endpoints were misaigned in behavior. The HEAD verb would 404 if any aliases are missing while the GET verb would not if any aliases existed. When HEAD was aligned with GET, this broke the previous usage of HEAD to serve as an existence check for aliases. It is the behavior of GET that is problematic here though, if any alias is missing the request should 404. This commit addresses this by modifying the behavior of GET to behave in this way. This fixes the behavior for HEAD to also 404 when aliases are missing. Relates #25043	2017-06-06 14:37:29 -04:00
Jim Ferenczi	7e60cf3e54	Move parent_id query to the parent-join module (#25072 ) This change moves the parent_id query to the parent-join module and handles the case when only the parent-join field can be declared on an index (index with single type on). If single type is off it uses the legacy parent join field mapper and switch to the new one otherwise (default in 6). Relates #20257	2017-06-06 19:35:14 +02:00
Ryan Ernst	7ec39acd4b	Settings: Fix setting groups to include secure settings (#25076 ) This commit fixes the group methdos of Settings to properly include grouped secure settings. Previously the secure settings were included but without the group prefix being removed. closes #25069	2017-06-06 10:13:10 -07:00
Yu	40a13345d7	Add refresh stats tracking for realtime get (#25052 ) Passes a `LongConsumer` into the `Engine` during GETs which the engine calls if it refreshed to perform the get. Closes #24806	2017-06-06 12:39:02 -04:00
olcbean	0d5f3958e7	Expand index expressions against indices only when managing aliases (#23997 ) The index parameter in the update-aliases, put-alias, and delete-alias APIs no longer accepts alias names. Instead, it accepts only index names (or wildcards which will expand to matching indices). Closes #23960	2017-06-06 11:01:38 +02:00
Ryan Ernst	ac82824d80	Settings: Fix secure settings by prefix (#25064 ) This commit fixes a bug in retrieving a sub Settings object for a given prefix with secure settings. Before this commit the returned Settings would be filtered by the prefix, but the found setting names would not have the prefix removed.	2017-06-06 00:11:33 -07:00
Lee Hinman	b6a2b8d682	Track EWMA[1] of task execution time in search threadpool executor This is the first step towards adaptive replica selection (#24915). This PR tracks the execution time, also known as the "service time" of a task in the threadpool. The `QueueResizingEsThreadPoolExecutor` then stores a moving average of these task times which can be retrieved from the executor. Currently there is no functionality using the EWMA yet (other than tests), this is only a bite-sized building block so that it's easier to review. [1]: EWMA = Exponentially Weighted Moving Average	2017-06-05 10:09:41 -06:00
Ali Beyad	f2a23e3459	Removes an invalid assert in resizing big arrays which does not always hold (resizing can result in a smaller size than the current size, while the assert attempted to verify the new size is always greater than the current).	2017-06-05 11:49:06 -04:00
Alex Benusovich	5463294ec4	Fixed NPEs caused by requests without content. (#23497 ) REST handlers that require a body will throw an an ElasticsearchParseException "request body required". REST handlers that require a body OR source param will throw an ElasticsearchParseException "request body or source param required". Replaced asserts in BulkRequest parsing code with a more descriptive IllegalArgumentException if the line contains an empty object. Updated bulk REST test to verify an empty action line is rejected properly. Updated BulkRequestTests with randomized testing for an empty action line. Used try-with-resouces for XContentParser in AbstractBulkByQueryRestHandler.	2017-06-05 09:08:14 -06:00
Nik Everett	73307a2144	Plugins can register pre-configured char filters (#25000 ) Fixes the plumbing so plugins can register char filters and moves the `html_strip` char filter into analysis-common. Relates to #23658	2017-06-05 09:25:15 -04:00
Ryan Ernst	e22a68295c	Tests: Make secure settings available from settings builder for tests (#25037 ) This commit exposes the secure settings in Settings.Builder, so that the current secure settings can be retrieved and added to when creating settings for tests. This is necessary since secure settings can only be added once to a builder, so chains of methods using settings builders must reuse the already set mock secure settings.	2017-06-03 16:55:34 -07:00
Sergey Novikov	57b4002357	Include duplicate jar when jarhell check fails When the jarhell check fails due to a duplicate jar on the classpath, the exception message includes the full classpath but not the duplicated jar. For a long classpath, this can make it difficult to find the jar that is duplicated. This commit changes the exception message to include the duplicated jar. Relates #24953	2017-06-02 18:22:01 -04:00
Lee Hinman	a32d1b91fa	Remove comma-separated feature parsing for GetIndicesAction This removes the parsing of things like `GET /idx/_aliases,_mappings`, instead, a user must choose between retriving all index metadata with `GET /idx`, or only a specific form such as `GET /idx/_settings`. Relates to (and is a prerequisite of) #24437	2017-06-02 14:43:38 -06:00
Ryan Ernst	0d8216d5af	Scripting: Convert CompiledTemplate to a ScriptContext (#25032 ) This commit creates TemplateScript and associated classes so that templates no longer need a special ScriptService.compileTemplate method. The execute() method is equivalent to the old run() method. relates #20426	2017-06-02 13:41:26 -07:00
Ali Beyad	e024c67561	Checks the circuit breaker before allocating bytes for a new big array (#25010 ) Previously, when allocating bytes for a BigArray, the array was created (or attempted to be created) and only then would the array be checked for the amount of RAM used to see if the circuit breaker should trip. This is problematic because for very large arrays, if creating or resizing the array, it is possible to attempt to create/resize and get an OOM error before the circuit breaker trips, because the allocation happens before checking with the circuit breaker. This commit ensures that the circuit breaker is checked before all big array allocations (note, this does not effect the array allocations that are less than 16kb which use the [Type]ArrayWrapper classes found in BigArrays.java). If such an allocation or resizing would cause the circuit breaker to trip, then the breaker trips before attempting to allocate and potentially running into an OOM error from the JVM. Closes #24790	2017-06-02 15:16:22 -04:00

... 2 3 4 5 6 ...

8634 Commits