Previously, collisions in headers between old and new contexts could be silently ignored, allowing the original context's headers to "win". This commit fixes this by requiring that the header keys are disjoint.
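A minimal sketch of such a disjointness check; the class and method names are illustrative, not the actual implementation:

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

final class HeaderMerger {

    // Merge headers from two contexts, failing loudly on collisions instead
    // of silently letting the original context's headers win.
    static Map<String, String> merge(Map<String, String> original, Map<String, String> added) {
        Set<String> overlap = new HashSet<>(original.keySet());
        overlap.retainAll(added.keySet());
        if (overlap.isEmpty() == false) {
            throw new IllegalArgumentException("header keys must be disjoint, duplicates: " + overlap);
        }
        Map<String, String> merged = new HashMap<>(original);
        merged.putAll(added);
        return merged;
    }
}
```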
The `_only_nodes` preference was used as a replacement for `_primary`, which was removed. Sadly, it's not the same, as we also check that it makes sense - i.e., that the given node has a shard copy. Since the test uses indices with more than one shard, the primaries may be spread across multiple nodes. Using one node (as the test currently does) will fail for some primaries; using all of them will probably end up hitting all nodes.
This commit removes the `_only_nodes` usage in favor of a simple search.
Relates to #26791
Cache final result instead of result of advanceExact.
Fix SortedNumericDoubleValues tests so that the MEDIAN mode is also covered
Replace deprecated random string generation method
Today we return a `String[]` that requires copying values for every
access. Yet, we already store the setting as a list, so we can return the
unmodifiable list directly. This makes list / array access in settings
a much cheaper operation, especially if lists are large.
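A small illustrative sketch (not the actual Settings code) of the difference between the two approaches:

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

final class ListSettingAccess {

    private final List<String> values = Arrays.asList("a", "b", "c");

    // Old style: every call copies the backing list into a fresh array.
    String[] getAsArray() {
        return values.toArray(new String[0]);
    }

    // New style: hand out an unmodifiable view of the list we already hold,
    // so reads are cheap even when the list is large.
    List<String> getAsList() {
        return Collections.unmodifiableList(values);
    }
}
```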
The shard preferences `_primary`, `_replica` and their variants were useful
for asynchronous replication. However, with the current implementation, they
are no longer useful and should be removed.
Closes #26335
Support for search_after and geo distance sorting is broken when the optimized LatLonDocValuesField.distanceSort is used.
This commit fixes the parsing of the search_after value for this case.
* Add additional low-level logging handler
We have the trace handler which is useful for recording sent messages
but there are times where it would be useful to have more low-level
logging about the events occurring on a channel. This commit adds a
logging handler that can be enabled by setting a certain log level
(org.elasticsearch.transport.netty4.ESLoggingHandler) to trace that
provides trace logging on low-level channel events and includes some
information about the request/response read/write events on the channel
as well.
* Remove imports
* License header
* Remove redundant
* Add test
* More assertions
Today we represent each value of a list setting with its own dedicated key
that ends with the index of the value in the list. Aside from the obvious
weirdness, this has several issues, especially if lists are massive, since it
causes massive runtime penalties when validating settings. For example, a list of 100k
words will literally cause a create index call to time out and, in turn, a massive
slowdown on all subsequent validation runs.
With this change we use a simple string list to represent the list. This change
also forbids adding a setting that ends with a `.0`, which was internally used to
detect a list setting. Once this has been rolled out for an entire major
version, all the internal `.0` handling can be removed since all settings will be
converted.
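A rough sketch, with made-up keys, of the difference between the two representations:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class ListSettingRepresentation {

    // Old style: each element lives under its own key, e.g.
    //   foo.bar.0 = one, foo.bar.1 = two, foo.bar.2 = three
    // Rebuilding the list means probing keys until an index is missing.
    static List<String> fromIndexedKeys(Map<String, String> flat, String key) {
        List<String> values = new ArrayList<>();
        for (int i = 0; ; i++) {
            String value = flat.get(key + "." + i);
            if (value == null) {
                return values;
            }
            values.add(value);
        }
    }

    // New style: the whole list is stored under a single key, so there is
    // nothing to reassemble and nothing that scales with the list length
    // during key validation.
    static List<String> fromListValue(Map<String, List<String>> settings, String key) {
        return settings.get(key);
    }
}
```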
Relates to #26723
Add fuzzy_transpositions parameter to multi_match and query_string queries.
Add fuzzy_transpositions, fuzzy_prefix_length and fuzzy_max_expansions
parameters to simple_query_string query.
The single shard optimization that we have in our search API changes the type of response returned under the query transport action name based on the shard search request: if the request goes to one shard, we do query and fetch at the same time, hence the response will be different. The proxying layer used in cross cluster search was not aware of this distinction, which caused serialization issues every time a cross cluster search request went to a single shard through a gateway node that had to forward the shard request to a data node. The coordinating node would then expect a QueryFetchSearchResult while the gateway would return a QuerySearchResult.
Closes #26833
This is a follow up to #26764. That commit set SO_LINGER to 0 in order
to fix a scenario where we were running out of resources during CI. We
are primarily interested in setting this to 0 when stopping the
transport. Allowing TIME_WAIT is standard for other failure scenarios
during normal operation.
Unfortunately this commit set SO_LINGER to 0 every time we close
NodeChannels. NodeChannels can be closed in case of an exception or
other failures (such as parsing a response). We want to only disable
linger when actually shutting down.
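A hedged sketch of the intended behavior, shown with a plain `java.net.Socket` rather than the Netty channels the transport actually uses:

```java
import java.io.IOException;
import java.net.Socket;

final class ChannelCloser {

    // Only disable lingering (SO_LINGER = 0, i.e. close with a reset and skip
    // TIME_WAIT) when the whole transport is shutting down. Regular failure
    // paths keep the default close semantics.
    static void close(Socket socket, boolean shuttingDown) throws IOException {
        if (shuttingDown) {
            socket.setSoLinger(true, 0);
        }
        socket.close();
    }
}
```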
Since `#getAsMap` exposes the internal representation, we are trying to remove it
step by step. This commit cleans up some xcontent writing as well as
usage in tests.
This change adds cgroup memory usage/limit to the OS stats section of
the node stats on Linux. This information is useful because in Docker
containers the standard node stats report the host memory limit, not
taking account of extra restrictions that may have been applied to the
container.
The original idea was to store these values as Long, truncating any values
outside the range of long. However, this meant that in the relatively common
case of no limit being applied, users would not see the same value in the OS
stats as they see by querying Linux directly. So instead the values are stored
as String. This change places a burden on consumers of the strings to
convert the strings to numbers and decide what to do about extremely large
values, but there will be very few consumers and they would need to have a
policy for dealing with "no limit" in any case.
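A sketch of how a consumer might convert the reported string, including the "no limit" case; the method and the sentinel choice here are illustrative:

```java
import java.math.BigInteger;

final class CgroupMemory {

    private static final BigInteger LONG_MAX = BigInteger.valueOf(Long.MAX_VALUE);

    // Convert the reported string to a usable number of bytes, treating any
    // value beyond Long.MAX_VALUE (what cgroups typically report when no
    // limit is applied) as "unlimited".
    static long limitInBytes(String reported) {
        BigInteger value = new BigInteger(reported);
        if (value.compareTo(LONG_MAX) > 0) {
            return -1; // caller-defined marker for "no limit"
        }
        return value.longValueExact();
    }
}
```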
We use group settings historically instead of using a prefix setting, which is more restrictive and type safe. The majority of the use cases need to access a key-value map based on the _leaf node_ of the setting, i.e. the setting `index.tag.*` might be used to tag an index with `index.tag.test=42` and `index.tag.staging=12`, which would then be turned into a `{"test": 42, "staging": 12}` map. The group settings would always use `Settings#getAsMap`, which loses type information and uses the internal representation of the settings. Using prefix settings now allows accessing such a map in a type-safe and native way.
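An illustrative sketch (not the real Settings API) of extracting such a leaf-keyed map in a type-safe way:

```java
import java.util.HashMap;
import java.util.Map;

final class TagSettings {

    // Turn flat keys such as "index.tag.test" -> "42" and "index.tag.staging" -> "12"
    // into a typed map {"test": 42, "staging": 12}, keyed by the leaf node.
    static Map<String, Integer> tags(Map<String, String> flatSettings) {
        String prefix = "index.tag.";
        Map<String, Integer> tags = new HashMap<>();
        for (Map.Entry<String, String> entry : flatSettings.entrySet()) {
            if (entry.getKey().startsWith(prefix)) {
                String leaf = entry.getKey().substring(prefix.length());
                tags.put(leaf, Integer.valueOf(entry.getValue()));
            }
        }
        return tags;
    }
}
```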
Elasticsearch doesn't allow having an index alias named with the same name as an existing index. We currently have logic that tries to prevent that in the `MetaData.Builder#build()` method. Sadly that logic is flawed. Depending on iteration order, we may allow the above to happen (if we encounter the alias before the index).
This commit fixes the above and improves the error message while at it.
Note that we have a lot of protections in place before we end up relying on the metadata builder (validating this when we process APIs). It takes quite an abuse of the cluster to get that far.
Early termination with index sorting always returns the best top N hits in the response but sets the flag `terminated_early`
in the response. This can be confusing because we use the same flag for `terminate_after`, which, on the contrary, returns partial results.
This change removes the flag when results are not partial (early termination due to index sorting) and keeps it only when `terminate_after` is used.
Closes #26408
Numeric fields no longer support the `index_options` parameter. This change rejects the parameter
in numeric field types after it was deprecated in 6.0.
Closes #21475
The log message here is incorrect; a failure here is occurring on the
post-operation global checkpoint sync, not the background sync. This
commit fixes the log message.
We were accidentally defaulting it to the scroll size.
Untwists some of the tricks that we play with parsing
so that the size is no longer scrambled.
Closes #26761
The routing entry is used by external components to check whether the shard is ready to perform as primary. Most notably, the peer recovery source handler delays recoveries until the shard routing entry says the shard is ready.
When a shard is promoted to primary, we currently update the shard's routing entry before we finish all the work relating to the promotion. This can cause recoveries to fail later on because the `GlobalCheckpointTracker` isn't set (yet) to primary mode.
This commit fixes this issue by updating the routing entry last.
The previous test was too strict and enforced that the target object was a
parent. It has been relaxed so that fields that belong to the same nested
object can copy to each other.
The commit also improves error handling in case of multi-fields. The current
validation works but may throw confusing error messages since it assumes that
only object fields may introduce dots in field names while multi fields may
too.
Closes #26763
Other tokenizers like the standard tokenizer allow overriding the default
maximum token length of 255 using the `max_token_length` parameter. This change
enables using this parameter also with the whitespace tokenizer. The range that
is currently allowed is from 0 to StandardTokenizer.MAX_TOKEN_LENGTH_LIMIT,
which is 1024 * 1024 = 1048576 characters.
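A simplified sketch of what the `max_token_length` cap means for whitespace tokenization; the real implementation is Lucene's tokenizer, and splitting over-long tokens at the limit is the assumed behavior here:

```java
import java.util.ArrayList;
import java.util.List;

final class WhitespaceTokenizerSketch {

    // Split the input on whitespace, then break any token longer than
    // maxTokenLength into chunks of at most maxTokenLength characters.
    static List<String> tokenize(String input, int maxTokenLength) {
        List<String> tokens = new ArrayList<>();
        for (String token : input.split("\\s+")) {
            if (token.isEmpty()) {
                continue;
            }
            for (int start = 0; start < token.length(); start += maxTokenLength) {
                tokens.add(token.substring(start, Math.min(token.length(), start + maxTokenLength)));
            }
        }
        return tokens;
    }
}
```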
Closes #26643
This change adds a fromXContent method to Settings that allows reading
the xcontent that is produced by toXContent. It also replaces the entire settings
loader infrastructure and removes the structured map representation. Future PRs will
also tackle the `getAsMap` method that exposes the internal representation of settings, for
better encapsulation.
This commit fixes issues with the global checkpoint sync test. The test
was off in initializing the maximum sequence number on the primary
shard, and off in setting the primary's local knowledge of the global
checkpoint on the replica.
Both the TopDocsCollector and the LeafCollector were being kept around at the aggregator level. In case the nested aggregator did a post collection, this could cause pushing down docids to top hits child aggregators that had already moved on to the next LeafCollector (causing assertions to trip and incorrect results).
By keeping track of the LeafCollector in a separate map at the leaf bucket level, this problem simply can no longer happen, as the placeholder LeafCollector is no longer shared.
Also, LeafCollector instances for TopDocsCollectors are no longer pre-created when the beginning of a new segment is evaluated. There is no guarantee that TopHitsAggregator encounters a document for a particular bucket, and there has to be logic to create LeafCollector instances that have not been seen before.
Closes #26738
Adds the wait_for_active_shards parameter to the index open command. Similar to the index creation command, the index open command will now, by default, wait until the primaries have been allocated.
Closes #20937
It is the exciting return of the global checkpoint background
sync. Long, long ago, in a snapshot version far, far away, we had, and only
had, a global checkpoint background sync. This sync would fire
periodically and send the global checkpoint from the primary shard to
the replicas so that they could update their local knowledge of the
global checkpoint. Later in time, as we sped ahead towards finalizing
the initial version of sequence IDs, we realized that we need the global
checkpoint updates to be inline. This means that on a replication
operation, the primary shard would piggyback the global checkpoint with
the replication operation to the replicas. The replicas would update
their local knowledge of the global checkpoint and reply with their
local checkpoint. However, this could allow the global checkpoint on the
primary to advance again and the replicas would fall behind in their
local knowledge of the global checkpoint. If another replication
operation never fired, then the replicas would be permanently behind. To
account for this, we added one more sync that would fire when the
primary shard fell idle. However, this has problems:
- the shard idle timer defaults to five minutes, a long time to wait
for the replicas to learn of the new global checkpoint
- if a replica missed the sync, there was no follow-up sync to catch
them up
- there is an inherent race condition where the primary shard could
fall idle mid-operation (after having sent the replication request to
the replicas); in this case, there would never be a background sync
after the operation completes
- tying the global checkpoint sync to the idle timer was never natural
To fix this, we add two additional changes for the global checkpoint to
be synced to the replicas. The first is that we add a post-operation
sync that only fires if there are no operations in flight and there is a
lagging replica. This gives us a chance to sync the global checkpoint to
the replicas immediately after an operation so that they are always kept
up to date. The second is that we add back a global checkpoint
background sync that fires on a timer. This timer fires every thirty
seconds, and is not configurable (for simplicity). This background sync
is smarter than what we had previously in the sense that it only sends a
sync if the global checkpoint on at least one replica is lagging that of
the primary. When the timer fires, we can compare the global checkpoint
on the primary to its knowledge of the global checkpoint on the replicas
and only send a sync if there is a shard behind.
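A rough sketch of the lag check that decides whether the timer-driven sync actually sends anything; the names here are illustrative, not the actual tracker API:

```java
import java.util.Map;

final class GlobalCheckpointSyncDecision {

    // Fire the background sync only if at least one replica's knowledge of
    // the global checkpoint is behind the primary's.
    static boolean shouldSync(long primaryGlobalCheckpoint, Map<String, Long> replicaKnowledge) {
        for (long knownCheckpoint : replicaKnowledge.values()) {
            if (knownCheckpoint < primaryGlobalCheckpoint) {
                return true;
            }
        }
        return false;
    }
}
```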
Relates #26591
When using a bulk processor, the thread context was not preserved for the flush runnable which is
executed in another thread in the thread pool. This change wraps the flush runnable in a context
preserving runnable so that the headers and transients from the creation time of the bulk processor
are available during the execution of the flush.
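A hedged sketch of the wrapping idea, with a small stand-in for the thread context rather than Elasticsearch's actual ThreadContext:

```java
import java.util.HashMap;
import java.util.Map;

final class ContextPreservingRunnable implements Runnable {

    // Tiny stand-in for a thread context that holds request headers.
    static final ThreadLocal<Map<String, String>> CONTEXT =
            ThreadLocal.withInitial(HashMap::new);

    private final Runnable delegate;
    private final Map<String, String> capturedHeaders;

    // Capture the headers when the runnable is created, i.e. at bulk
    // processor construction time.
    ContextPreservingRunnable(Runnable delegate) {
        this.delegate = delegate;
        this.capturedHeaders = new HashMap<>(CONTEXT.get());
    }

    @Override
    public void run() {
        Map<String, String> original = CONTEXT.get();
        CONTEXT.set(capturedHeaders);
        try {
            delegate.run(); // the flush now sees the creation-time headers
        } finally {
            CONTEXT.set(original);
        }
    }
}
```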
Closes #26596
The `fielddata` field and the use of the `_name` field in the short syntax of the range
query have been deprecated in 5.0 and can be removed.
The same goes for the deprecated `score_mode` field in HasParentQueryBuilder,
the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query,
the deprecated query name in the short version of the `regexp` query, and several
deprecated alternative field names in other query builders.
The `type` field has been deprecated in 5.0 and can be removed. It has been
replaced by using the MatchPhraseQueryBuilder or the
MatchPhrasePrefixQueryBuilder. The `slop` field has also been deprecated and can
be removed, the phrase and phrase prefix query builders still provide this
parameter.
- Removes mutual dependency between GatewayMetaState and TransportNodesListGatewayMetaState
- Deguices MetaDataIndexUpgradeService
- Deguices GatewayMetaState
- Makes Gateway the master-level component that is only responsible for coordinating the state recovery
The nested aggregator now buffers all bucket ords per parent document and
emits all bucket ords for a parent document's nested documents once. This way
the nested documents' DocIdSetIterator gets used once per bucket
instead of wrapping the nested aggregator inside a multi bucket aggregator,
which was the solution up to now. This allows sorting by buckets
under a nested bucket.
Closes #16838
This assertion is wrong because the global checkpoint on a promoted
primary can be lagging behind the replicas until it catches up through
resyncs, ongoing indexing operations and removing the old primary from
the in-sync set.
TemplateUpgradeService might get stuck repeatedly upgrading templates after an upgrade to 5.6.0. This is caused by the shuffling of the mappings definition in the template during template serialization. This commit makes the template serialization consistent.
Closes #26673
Restoring a shard from snapshot throws the primary back in time, violating assumptions and bringing the validity of global checkpoints into question. To avoid problems, we should make sure that a shard that was restored will never be the source of an ops based recovery to a shard that existed before the restore. To this end we have introduced the notion of `history_uuid` in #26577 and required that both source and target have the same history to allow ops based recoveries. This PR makes sure that a shard gets a new uuid after restore.
As suggested by @ywelsch , I derived the creation of a `history_uuid` from the `RecoverySource` of the shard. Store recovery will only generate a uuid if it doesn't already exist (we can make this stricter when we don't need to deal with 5.x indices). Peer recovery follows the same logic (note that this is different than the approach in #26557, I went this way as it means that shards always have a history uuid after being recovered on a 6.x node and will also mean that a rolling restart is enough for old indices to step over to the new seq no model). Local shards and snapshot force the generation of a new translog uuid.
Relates #10708
Closes #26544
This commit moves the pre-6.0 node checkpoint constant from
SequenceNumbersService to SequenceNumbers so it can chill with the other
sequence number-related constants.
Relates #26690
Request bodies that only consist of a String value can lead to endless loops in the
parser of several rest requests like e.g. `_count`. Up to 5.2 this seems to have been caught
in the logic guessing the content type of the request, but since then it causes the node to
block. This change introduces checks for receiving a valid xContent object before starting the
parsing in RestActions#parseTopLevelQueryBuilder().
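A sketch of the kind of check described above, using Jackson directly rather than the xcontent abstraction the actual code goes through:

```java
import java.io.IOException;

import com.fasterxml.jackson.core.JsonFactory;
import com.fasterxml.jackson.core.JsonParser;
import com.fasterxml.jackson.core.JsonToken;

final class TopLevelQueryCheck {

    // Reject bodies such as a bare "some string" before handing the parser
    // to the query parsing code, which expects a JSON object.
    static void ensureObjectBody(String body) throws IOException {
        try (JsonParser parser = new JsonFactory().createParser(body)) {
            if (parser.nextToken() != JsonToken.START_OBJECT) {
                throw new IllegalArgumentException("request body must be a valid JSON object");
            }
        }
    }
}
```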
Closes #26083
* Add a version constant for 5.6.2 so that the 5.6.1 constant
represents the 5.6.1 release and the 5.6.2 constant represents
the unreleased 5.6 branch.
Today we can't validate the array length in `InputStreamStreamInput` since
we can't rely on `InputStream.available`. Yet, in some situations we know
the size of the stream and can apply additional validation.
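A hedged sketch of the sort of validation that becomes possible when the stream size is known up front; the method name and exception choices are illustrative:

```java
import java.io.EOFException;
import java.io.IOException;

final class ArrayLengthValidation {

    // If we know how many bytes remain in the stream, a declared array length
    // that cannot possibly fit is corrupt and can be rejected before we try
    // to allocate or read it.
    static void ensureCanReadBytes(int declaredLength, long remainingBytes) throws IOException {
        if (declaredLength < 0) {
            throw new IOException("negative array length: " + declaredLength);
        }
        if (declaredLength > remainingBytes) {
            throw new EOFException(
                    "tried to read " + declaredLength + " bytes but only " + remainingBytes + " remain");
        }
    }
}
```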
After recovery completes from a primary, we now update the local
knowledge on the primary of the global checkpoint on the recovery
target. However if this occurs concurrently with a relocation, an
assertion could trip that we are no longer in primary mode. As this
local knowledge should only be tracked when we are in primary mode,
updating this local knowledge should be done under a permit. This commit
causes that to be the case.
Relates #26666
When checking that the global checkpoint on the primary is consistent
with the local checkpoints of the in-sync shards, we have to filter
pre-6.0 nodes from the check or the invariant will trivially trip. This
commit filters these nodes out when checking this invariant.
Relates #26666
This commit adds local tracking of the global checkpoints on all shard
copies when a global checkpoint tracker is operating in primary
mode. With this, we relay the global checkpoint on a shard copy back to
the primary shard during replication operations. This serves as another
step towards adding a background sync of the global checkpoint to the
shard copies.
Relates #26666
This commit adds validation to the resolving of indexes in the wildcard
expression resolver. It no longer throws a 404 Not Found when resolving
invalid indices. It throws a 400 instead, as it is an invalid
index. This was the behavior of 5.x.
The new ops based recovery, introduced as part of #10708, is based on the assumption that all operations below the global checkpoint known to the replica do not need to be synced with the primary. This is based on the guarantee that all ops below it are available on the primary and that they are equal. Under normal operations this guarantee holds. Sadly, it can be violated when a primary is restored from an old snapshot. At that point the restored primary can miss operations below the replica's global checkpoint, or even worse, may have totally different operations at the same spot. This PR introduces the notion of a history uuid to be able to capture the difference with the restored primary (in a follow up PR).
The history uuid is generated by a primary when it is first created and is synced to the replicas which are recovered via a file based recovery. The PR adds a requirement to ops based recovery to make sure that the history uuid of the source and the target are equal. Under normal operations, all shard copies will stay with that history uuid for the rest of the index lifetime and thus this is a noop. However, it gives us a place to guarantee we fall back to file based syncing in special events like a restore from snapshot (to be done as a follow up) and when someone calls the truncate translog command, which can go wrong when combined with primary recovery (this is done in this PR).
We considered in the past to use the translog uuid for this function (i.e., sync it across copies) and thus avoid adding an extra identifier. This idea was rejected as it removes the ability to verify that a specific translog really belongs to a specific lucene index. We also feel that having a history uuid will serve us well in the future.
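A simplified sketch of the new requirement; the decision lives in the recovery code, and the extra precondition parameter here stands in for everything else ops based recovery already requires:

```java
import java.util.Objects;

final class RecoveryModeDecision {

    enum Mode { OPS_BASED, FILE_BASED }

    // Ops based recovery is only allowed when source and target share the
    // same history uuid; anything else (e.g. a primary restored from an old
    // snapshot) falls back to file based syncing.
    static Mode choose(String sourceHistoryUUID, String targetHistoryUUID, boolean opsBasedOtherwisePossible) {
        if (opsBasedOtherwisePossible && Objects.equals(sourceHistoryUUID, targetHistoryUUID)) {
            return Mode.OPS_BASED;
        }
        return Mode.FILE_BASED;
    }
}
```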
Removing several occurrences of this typo in the docs and javadocs; it seems to be
a common mistake. Corrections turn up once in a while in PRs, so it's better to correct
some of this in one sweep.
This commit refactors the bootstrap checks into a single result object
that encapsulates whether or not the check passed, and a failure message
if the check failed. This simplifies the checks, and enables the messages
to more easily be based on the state used to discern whether or not the
check passed.
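A rough sketch of what such a result object can look like; treat this as an approximation of the shape described above, not the exact class:

```java
final class BootstrapCheckResult {

    private final String failureMessage; // null means the check passed

    private BootstrapCheckResult(String failureMessage) {
        this.failureMessage = failureMessage;
    }

    static BootstrapCheckResult success() {
        return new BootstrapCheckResult(null);
    }

    static BootstrapCheckResult failure(String message) {
        return new BootstrapCheckResult(message);
    }

    boolean isSuccess() {
        return failureMessage == null;
    }

    String getMessage() {
        if (isSuccess()) {
            throw new IllegalStateException("the check passed, there is no failure message");
        }
        return failureMessage;
    }
}
```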
Relates #26637
This exposes the node settings and the persistent part of the cluster state to the
bootstrap checks to allow plugins to enforce certain preconditions based on the
recovered state.
After backporting the script_field soft limit to the 6.x branches, this test can
now also run in a mixed cluster.
Relates to #26598
This commit pushes the allocation ID down through to the global
checkpoint tracker at construction rather than when activated as a
primary.
Relates #26630
Today we have all non-plugin mappers in core. I'd like to start moving those
that neither map to json datatypes nor are very frequently used (like `date` or
`ip` are) to a module.
This commit creates a new module called `mappers-extra` and moves the
`scaled_float` and `token_count` mappers to it. I'd like to eventually move
`range` fields there but it's more complicated due to their intimate
relationship with range queries.
Relates #10368
Requesting too many script_fields in a search request can be costly
because of script execution. This change introduces a soft limit on the number
of script fields that are allowed per request. The setting can be
changed per index using the `index.max_script_fields` setting.
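A minimal sketch of enforcing such a soft limit; the surrounding method and error message are illustrative:

```java
import java.util.List;

final class ScriptFieldsLimit {

    // Reject search requests that ask for more script fields than the
    // per-index soft limit (index.max_script_fields) allows.
    static void checkScriptFields(List<String> scriptFields, int maxScriptFields) {
        if (scriptFields.size() > maxScriptFields) {
            throw new IllegalArgumentException("Trying to retrieve too many script_fields. Must be less than or equal to: ["
                    + maxScriptFields + "] but was [" + scriptFields.size()
                    + "]. This limit can be set by changing the [index.max_script_fields] index level setting.");
        }
    }
}
```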
Relates to #26390
This PR removes the vInt that precedes every value, which was used to know how long
they are. Instead the query takes an enum that tells how to compute the length
of values: for fixed-length data (ip addresses, double, float) the length is a
constant while longs and integers use a variable-length representation that
allows the length to be computed from the encoded values.
Also the encoding of ints/longs was made a bit more efficient in order not to
waste 3 bits in the header. As a consequence, values between -8 and 7 can now
be encoded on 1 byte and values between -2048 and 2047 can now be encoded on 2
bytes or less.
Closes #26443
If the query coordinating node is also a data node that holds all the
shards for a search request, we can end up recursing through the can
match phase (because we send a local request and on response in the
listener move to the next shard and do this again, without ever having
returned from previous shards). This recursion can lead to stack
overflow for even a reasonable number of indices (daily indices over a
sixty days with five shards per day is enough to trigger the stack
overflow). Moreover, all this execution would be happening on a network
thread (the thread that initially received the query). With this commit,
we allow search phases to override max concurrent requests. This allows
the can match phase to avoid recursing through the shards towards a
stack overflow.
Relates #26484
Requesting too many docvalue_fields in a search request can potentially be costly
because it might incur a per-field per-document seek. This change introduces a
soft limit on the number of fields that can be retrieved. The setting can be
changed per index using the `index.max_docvalue_fields_search` setting.
Relates to #26390