This change adds a fromXContent method to Settings that allows reading
the xcontent that is produced by toXContent. It also replaces the entire settings
loader infrastructure and removes the structured map representation. Future PRs will
also tackle `getAsMap`, which exposes the internal representation of settings, for
better encapsulation.
This commit fixes issues with the global checkpoint sync test. The test
was off when initializing the maximum sequence number on the primary
shard, and off when setting the primary's local knowledge of the global
checkpoint on the replica.
Both TopDocsCollector and LeafCollector were being kept around at the aggregator level. If the nested aggregator performed a post collection, this could push docids down to top hits child aggregators that had already moved on to the next LeafCollector (causing assertions to trip and incorrect results).
By keeping track of the LeafCollector in a separate map at the leaf bucket level this problem can no longer happen, as the placeholder LeafCollector is no longer shared.
Also, LeafCollector instances for TopDocsCollectors are no longer pre-created when a new segment is evaluated. There is no guarantee that TopHitsAggregator encounters a document for a particular bucket, so there has to be logic to create LeafCollector instances that have not been seen before.
Closes #26738
Adds the wait_for_active_shards parameter to the index open command. Similar to the index creation command, the index open command will now, by default, wait until the primaries have been allocated.
Closes #20937
It is the exciting return of the global checkpoint background
sync. Long, long ago, in a snapshot version far, far away, we had and only
had a global checkpoint background sync. This sync would fire
periodically and send the global checkpoint from the primary shard to
the replicas so that they could update their local knowledge of the
global checkpoint. Later in time, as we sped ahead towards finalizing
the initial version of sequence IDs, we realized that we need the global
checkpoint updates to be inline. This means that on a replication
operation, the primary shard would piggyback the global checkpoint with
the replication operation to the replicas. The replicas would update
their local knowledge of the global checkpoint and reply with their
local checkpoint. However, this could allow the global checkpoint on the
primary to advance again and the replicas would fall behind in their
local knowledge of the global checkpoint. If another replication
operation never fired, then the replicas would be permanently behind. To
account for this, we added one more sync that would fire when the
primary shard fell idle. However, this has problems:
- the shard idle timer defaults to five minutes, a long time to wait
for the replicas to learn of the new global checkpoint
- if a replica missed the sync, there was no follow-up sync to catch
them up
- there is an inherent race condition where the primary shard could
fall idle mid-operation (after having sent the replication request to
the replicas); in this case, there would never be a background sync
after the operation completes
- tying the global checkpoint sync to the idle timer was never natural
To fix this, we add two additional changes for the global checkpoint to
be synced to the replicas. The first is that we add a post-operation
sync that only fires if there are no operations in flight and there is a
lagging replica. This gives us a chance to sync the global checkpoint to
the replicas immediately after an operation so that they are always kept
up to date. The second is that we add back a global checkpoint
background sync that fires on a timer. This timer fires every thirty
seconds, and is not configurable (for simplicity). This background sync
is smarter than what we had previously in the sense that it only sends a
sync if the global checkpoint on at least one replica is lagging that of
the primary. When the timer fires, we can compare the global checkpoint
on the primary to its knowledge of the global checkpoint on the replicas
and only send a sync if there is a shard behind.
Relates #26591
When using a bulk processor, the thread context was not preserved for the flush runnable which is
executed in another thread in the thread pool. This change wraps the flush runnable in a context
preserving runnable so that the headers and transients from the creation time of the bulk processor
are available during the execution of the flush.
Closes #26596
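A rough sketch of the idea behind this change, assuming the ES 6.x `ThreadContext` API: the flush task is wrapped so it runs with the headers and transients captured when the bulk processor was created, even though it executes on another thread. The header name and value below are only examples.

```java
import org.elasticsearch.common.settings.Settings;
import org.elasticsearch.common.util.concurrent.ThreadContext;

public class ContextPreservingFlushSketch {
    public static void main(String[] args) throws Exception {
        ThreadContext threadContext = new ThreadContext(Settings.EMPTY);
        threadContext.putHeader("X-Opaque-Id", "bulk-owner"); // header present at creation time

        // Without wrapping, this task would see an empty context on the flush thread.
        Runnable flush = () ->
                System.out.println("flush sees header: " + threadContext.getHeader("X-Opaque-Id"));

        // preserveContext captures the current headers/transients and restores them
        // around the wrapped runnable when it eventually runs on another thread.
        Runnable preserved = threadContext.preserveContext(flush);

        Thread flushThread = new Thread(preserved);
        flushThread.start();
        flushThread.join();
    }
}
```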
The `fielddata` field and the use of the `_name` field in the short syntax of the range
query have been deprecated in 5.0 and can be removed.
The same goes for the deprecated `score_mode` field in HasParentQueryBuilder,
the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query,
the deprecated query name in the short version of the `regexp` query, and several
deprecated alternative field names in other query builders.
The `type` field has been deprecated in 5.0 and can be removed. It has been
replaced by using the MatchPhraseQueryBuilder or the
MatchPhrasePrefixQueryBuilder. The `slop` field has also been deprecated and can
be removed, since the phrase and phrase prefix query builders still provide this
parameter.
- Removes mutual dependency between GatewayMetaState and TransportNodesListGatewayMetaState
- Deguices MetaDataIndexUpgradeService
- Deguices GatewayMetaState
- Makes Gateway the master-level component that is only responsible for coordinating the state recovery
The nested aggregator now buffers all bucket ords per parent document and
emits all bucket ords for a parent document's nested document once. This way
the nested documents' DocIdSetIterator gets used once per bucket
instead of wrapping the nested aggregator inside a multi bucket aggregator,
which was the solution up to now. This allows sorting by buckets
under a nested bucket.
Closes #16838
This assertion is wrong because the global checkpoint on a promoted
primary can be lagging the replicas until it catches up through
resyncs, ongoing indexing operations, and removing the old primary from
the in-sync set.
TemplateUpgradeService might get stuck repeatedly upgrading templates after an upgrade to 5.6.0. This is caused by the shuffling of mapping definitions in the template during template serialization. This commit makes the template serialization consistent.
Closes #26673
Restoring a shard from a snapshot throws the primary back in time, violating assumptions and bringing the validity of global checkpoints into question. To avoid problems, we should make sure that a shard that was restored will never be the source of an ops based recovery to a shard that existed before the restore. To this end we have introduced the notion of `history_uuid` in #26577 and required that both source and target have the same history to allow ops based recoveries. This PR makes sure that a shard gets a new uuid after restore.
As suggested by @ywelsch , I derived the creation of a `history_uuid` from the `RecoverySource` of the shard. Store recovery will only generate a uuid if it doesn't already exist (we can make this stricter when we don't need to deal with 5.x indices). Peer recovery follows the same logic (note that this is different than the approach in #26557, I went this way as it means that shards always have a history uuid after being recovered on a 6.x node and will also mean that a rolling restart is enough for old indices to step over to the new seq no model). Local shards and snapshot force the generation of a new translog uuid.
Relates #10708
Closes #26544
This commit moves the pre-6.0 node checkpoint constant from
SequenceNumbersService to SequenceNumbers so it can chill with the other
sequence number-related constants.
Relates #26690
Request bodies that consist only of a String value can lead to endless loops in the
parser of several rest requests, e.g. `_count`. Up to 5.2 this seems to have been caught
in the logic guessing the content type of the request, but since then it causes the node to
block. This change introduces checks for receiving a valid xContent object before starting the
parsing in RestActions#parseTopLevelQueryBuilder().
Closes #26083
* Add a version constant for 5.6.2 so that the 5.6.1 constant
represents the 5.6.1 release and the 5.6.2 constant represents
the unreleased 5.6 branch.
Today we can't validate the array length in `InputStreamStreamInput` since
we can't rely on `InputStream.available`, yet in some situations we know
the size of the stream and can apply additional validation.
After recovery completes from a primary, we now update the local
knowledge on the primary of the global checkpoint on the recovery
target. However if this occurs concurrently with a relocation, an
assertion could trip that we are no longer in primary mode. As this
local knowledge should only be tracked when we are in primary mode,
updating this local knowledge should be done under a permit. This commit
causes that to be the case.
Relates #26666
When checking that the global checkpoint on the primary is consistent
with the local checkpoints of the in-sync shards, we have to filter
pre-6.0 nodes from the check or the invariant will trivially trip. This
commit filters these nodes out when checking this invariant.
Relates #26666
This commit adds local tracking of the global checkpoints on all shard
copies when a global checkpoint tracker is operating in primary
mode. With this, we relay the global checkpoint on a shard copy back to
the primary shard during replication operations. This serves as another
step towards adding a background sync of the global checkpoint to the
shard copies.
Relates #26666
This commit adds validation to the resolving of indexes in the wildcard
expression resolver. It no longer throws a 404 Not Found when resolving
invalid indices. It throws a 400 instead, as it is an invalid
index. This was the behavior of 5.x.
The new ops based recovery, introduced as part of #10708, is based on the assumption that all operations below the global checkpoint known to the replica do not need to be synced with the primary. This is based on the guarantee that all ops below it are available on the primary and that they are equal. Under normal operations this guarantee holds. Sadly, it can be violated when a primary is restored from an old snapshot. At that point the restored primary can miss operations below the replica's global checkpoint, or even worse may have totally different operations at the same spot. This PR introduces the notion of a history uuid to be able to capture the difference with the restored primary (in a follow up PR).
The History UUID is generated by a primary when it is first created and is synced to the replicas which are recovered via a file based recovery. The PR adds a requirement to ops based recovery to make sure that the history uuid of the source and the target are equal. Under normal operations, all shard copies will stay with that history uuid for the rest of the index lifetime and thus this is a noop. However, it gives us a place to guarantee we fall back to file based syncing in special events like a restore from snapshot (to be done as a follow up) and when someone calls the truncate translog command which can go wrong when combined with primary recovery (this is done in this PR).
In the past we considered using the translog uuid for this purpose (i.e., syncing it across copies) and thus avoiding an extra identifier. This idea was rejected as it removes the ability to verify that a specific translog really belongs to a specific lucene index. We also feel that having a history uuid will serve us well in the future.
Removing several occurrences of this typo in the docs and javadocs, seems to be
a common mistake. Corrections turn up once in a while in PRs, better to correct
some of this in one sweep.
This commit refactors the bootstrap checks into a single result object
that encapsulates whether or not the check passed, and a failure message
if the check failed. This simplifies the checks, and enables the messages
to more easily be based on the state used to discern whether or not the
check passed.
Relates #26637
This exposes the node settings and the persistent part of the cluster state to the
bootstrap checks to allow plugins to enforce certain preconditions based on the
recovered state.
After backporting the script_field soft limit to the 6.x branches, this test can
now also run in a mixed cluster.
Relates to #26598
This commit pushes the allocation ID down through to the global
checkpoint tracker at construction rather than when activated as a
primary.
Relates #26630
Today we have all non-plugin mappers in core. I'd like to start moving those
that neither map to json datatypes nor are very frequently used like `date` or
`ip` to a module.
This commit creates a new module called `mappers-extra` and moves the
`scaled_float` and `token_count` mappers to it. I'd like to eventually move
`range` fields there but it's more complicated due to their intimate
relationship with range queries.
Relates #10368
Requesting too many script_fields in a search request can be costly
because of script execution. This change introduces a soft limit on the number
of script fields that are allowed per request. The setting can be
changed per index using the `index.max_script_fields` setting.
Relates to #26390
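A hedged sketch of how the new soft limit might be raised for a single index; the client call is illustrative and the index name and value are made up.

```java
import org.elasticsearch.client.Client;
import org.elasticsearch.common.settings.Settings;

public class MaxScriptFieldsSketch {
    // Raise the script_fields soft limit for one index; 64 is an arbitrary example value.
    static void raiseScriptFieldLimit(Client client) {
        client.admin().indices().prepareUpdateSettings("my-index")
                .setSettings(Settings.builder().put("index.max_script_fields", 64))
                .get();
    }
}
```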
This PR removes the vInt that precedes every value in order to know how long
they are. Instead the query takes an enum that tells how to compute the length
of values: for fixed-length data (ip addresses, double, float) the length is a
constant while longs and integers use a variable-length representation that
allows the length to be computed from the encoded values.
Also the encoding of ints/longs was made a bit more efficient in order not to
waste 3 bits in the header. As a consequence, values between -8 and 7 can now
be encoded on 1 byte and values between -2048 and 2047 can now be encoded on 2
bytes or less.
Closes #26443
If the query coordinating node is also a data node that holds all the
shards for a search request, we can end up recursing through the can
match phase (because we send a local request and on response in the
listener move to the next shard and do this again, without ever having
returned from previous shards). This recursion can lead to stack
overflow for even a reasonable number of indices (daily indices over
sixty days with five shards per day is enough to trigger the stack
overflow). Moreover, all this execution would be happening on a network
thread (the thread that initially received the query). With this commit,
we allow search phases to override max concurrent requests. This allows
the can match phase to avoid recursing through the shards towards a
stack overflow.
Relates #26484
Requesting too many docvalue_fields in a search request can potentially be costly
because it might incur a per-field per-document seek. This change introduces a
soft limit on the number of fields that can be retrieved. The setting can be
changed per index using the `index.max_docvalue_fields_search` setting.
Relates to #26390
Today we don't have a pluggable way to validate whether the cluster state
is compatible with the node that joins. We already apply some checks for index
compatibility that prevent nodes from joining a cluster with indices they don't support,
but for plugins this isn't possible. This change adds a cluster state validator that
allows plugins to prevent a join if the cluster state is incompatible.
Remove "index.mapper.dynamic" setting for 6.0 (and after) indices, but
still keep working for 5.x (and before) indices. Remove two index
dynamic disable test cases as the disability of index.mapper.dynamic is
already removed for current version. Add a new test class for version
test.
This test case was leftover from the static bwc tests. There was still
one use for checking we do not load old indices, but this PR moves the
legacy code needed for that directly into the test. I also opened a
follow up issue to completely remove the unsupported test: #26583.
When determining if a build is a snapshot build, we look for a field in
the JAR manifest. However, when running tests, we are not running with a
compiled core Elasticsearch JAR, we are running with the compiled core
classes on the classpath. We have a fallback for this, we always assume
such a situation is a snapshot build. However, when running builds with
-Dbuild.snapshot=false, this is not the case. As such, we need to
fallback to the value of build.snapshot. However, there are cases where
we are not running with a compiled core Elasticsearch JAR (e.g., when
the transport client is embedded in a web container) so we should only
do this fallback if we are in tests. To verify we are in tests, we check
if randomized runner is on the classpath.
Relates #26554
RangeQueryBuilder needs to perform too many `instanceof` checks to figure out
whether it is dealing with `date` or `range` fields and what it should do with the
shape relation, time zone and date format.
This commit adds those 3 parameters to the `rangeQuery` factory method so that
those instanceof checks are not necessary anymore.
* Limit the number of expanded fields in query_string and simple_query_string
This limits the number of automatically expanded fields for the "all fields"
mode (`"default_field": "*"`) for the `query_string` and `simple_query_string`
queries to 1024 fields.
Resolves #25105
* Add blurb about limit to the docs
* Throw a better error message for empty field names
When a document is parsed with a `""` for a field name, we currently throw a
confusing error about `.` being present in the field. This changes the error
message to be clearer about what's causing the problem.
Resolves #23348
* Fix exception message in test
To protect against poisonous situations, ES will only try to allocate a shard 5 times (by default). After 5 consecutive failures, ES will stop assigning the shard and wait for an operator to fix the problem. Once the problem is fixed, the operator is expected to call `_reroute` with a `retry_failed` flag to force retrying of those shards. Currently that retry flag is only used for a single allocation run. However, if not all shards can be allocated at once (due to throttling) the operator has to keep on calling the API until all shards are assigned, which is cumbersome. This PR changes the behavior of the flag to reset the failed allocations counter, thus allowing shards to be assigned again.
This test should not rely on strict ordering for same score suggestions.
The Lucene completion suggester uses the doc id in case of a tie and documents are indexed randomly.
This commit removes a norelease from the codebase now that there is a CI
job that fails on the norelease pattern being present. Instead, a new
issue has been opened to track this one.
Relates #26544
The completion suggester has a `shard_size` option that sets the size of the suggestions to retrieve per shard but it is ignored
by the builder. This commit restores the handling of this option and fixes a test that can randomly fail without it.
This change exposes the duplicate removal option added in Lucene for the completion suggester
with a new option called `skip_duplicates` (defaults to false).
This commit also adapts the custom suggest collector to handle deduplication when multiple contexts match the input.
Closes #23364
This change fixes a regression introduced in 6 that removes the skipping of the rescore phase
when a sort other than _score is used.
We now fail the request when a sort is provided in conjunction with rescore instead of just skipping the rescore phase.
This commit also adds an assert that checks that the topdocs are sorted by _score after the rescoring.
It is the responsibility of the rescorer to make sure that topdocs are sorted after rescore, so we
just check that this condition is met in the rescore phase.
The three SortBuilders that can have inner NestedSortBuilders currently don't
rewrite any of the filters contained in them. This change adds a rewrite method
to NestedSortBuilder and changes rewriting in FieldSortBuilder,
ScriptSortBuilder and GeoDistanceSortBuilder to make sure inner nested sorts get
rewritten if they need to.
Improve testing around the ScriptSortBuilder#build method, adding checks for
correct transfers of the sort mode and nested sorts.
Also changing the behaviour around the nested_path, nested_filter vs. nested
parameter in a similar way as in #26490 and deprecating the setters/getters for
the old syntax.
Closes #17286
Security manager policy files contain grants for specific codebases,
where a codebase is a jar file. We use a system property containing the
name of the jar file to resolve the jar file location when parsing the
policy file. However, this means the version of the jars must be
modified when versions of dependencies change. This is particularly
messy for elasticsearch, where we now have a dependency on the rest
client, and need to support both a snapshot version for testing and non
snapshot for release.
This commit adds an alias for the elasticsearch rest client without a
version to be used in policy files. That allows the policy files to not care whether
the rest client is a snapshot or release.
Resolves #26332, where too many tasks occurred while adjustment was happening, the
measurements were reset to 0, and then an assert failed due to tasks executing
in 0 nanoseconds.
When a cache entry expires, it remains in the cache (both the segment
that it belongs to, and the LRU list) until an eviction occurs. The
problem here is that the compute if absent implementation relies on
there not being an association to a key that we are trying to put
because it internally uses put if absent on the underlying segment. If
we try to put an association for a key that has expired but not been
evicted, then compute if absent will return as if there is nothing in
the cache for the given key, yet no call to compute if absent will
succeed in putting a new association for the key. To remedy this, we
modify the internal get method for the cache to let the caller take
action if the entry they are retrieving is expired. This allows the
compute if absent method to take the action of evicting the entry from
the cache, thus allowing the put if absent method used by compute if
absent to succeed for one of the callers trying to compute if absent a
new association.
Relates #26516
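A simplified, self-contained sketch of the idea (not the actual ES `Cache` code): the internal get notifies the caller when the entry it found has expired, so that compute-if-absent can evict the stale mapping before retrying its put-if-absent. The TTL and types below are illustrative only.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Consumer;
import java.util.function.Function;

public class ExpiringCache<K, V> {
    private static final long TTL_NANOS = 1_000_000L; // 1ms, purely for illustration

    private static final class Entry<V> {
        final V value;
        final long writeTime = System.nanoTime();
        Entry(V value) { this.value = value; }
        boolean isExpired() { return System.nanoTime() - writeTime > TTL_NANOS; }
    }

    private final Map<K, Entry<V>> segment = new ConcurrentHashMap<>();

    // The fix described above: expose expired entries to the caller instead of
    // silently returning null while the stale mapping still occupies the key.
    private V get(K key, Consumer<Entry<V>> onExpiration) {
        Entry<V> entry = segment.get(key);
        if (entry == null) {
            return null;
        }
        if (entry.isExpired()) {
            onExpiration.accept(entry);
            return null;
        }
        return entry.value;
    }

    V computeIfAbsent(K key, Function<K, V> loader) {
        // Evict an expired entry eagerly so the subsequent putIfAbsent can succeed.
        V existing = get(key, expired -> segment.remove(key, expired));
        if (existing != null) {
            return existing;
        }
        Entry<V> computed = new Entry<>(loader.apply(key));
        Entry<V> race = segment.putIfAbsent(key, computed);
        return race == null ? computed.value : race.value;
    }

    public static void main(String[] args) throws InterruptedException {
        ExpiringCache<String, String> cache = new ExpiringCache<>();
        cache.computeIfAbsent("key", k -> "first");
        Thread.sleep(5); // let the entry expire
        System.out.println(cache.computeIfAbsent("key", k -> "recomputed")); // prints "recomputed"
    }
}
```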
This change adds a dynamic cluster setting named `search.max_keep_alive`.
It is used as an upper limit for scroll expiry time in scroll queries and defaults to 1 hour.
This change also ensures that the existing setting `search.default_keep_alive` is always smaller than `search.max_keep_alive`.
Relates #11511
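A hedged sketch of tightening the new ceiling cluster-wide; the client call is illustrative and the 30m value is only an example.

```java
import org.elasticsearch.client.Client;
import org.elasticsearch.common.settings.Settings;

public class MaxKeepAliveSketch {
    // Cap scroll keep-alive at 30 minutes; scroll requests asking for more are rejected.
    static void capScrollKeepAlive(Client client) {
        client.admin().cluster().prepareUpdateSettings()
                .setPersistentSettings(Settings.builder().put("search.max_keep_alive", "30m"))
                .get();
    }
}
```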
* check style
* add skip for bwc
* iter
* Add a maximum throttle wait time of 1h for reindex
* review
* remove empty line
The `_type` and `types` version of the current `type` parameter have been
deprecated since 5.0. We can remove support for them in 7.0 and also in 6.x and
6.0.
We currently have a weird relationship between Transport,
TransportService, and TransportServiceAdaptor. At some point I think
that we would like to collapse these all into one concept as we only
support TCP transports.
This commit moves in that direction by eliminating the adaptor and just
passing the transport service to the transport.
Improve testing around the GeoDistanceSortBuilder#build method, adding checks for correct
transfers of the sort order, mode, nested sorts and points validation and coercion.
Also changing the behaviour around the nested_path, nested_filter vs. nested parameter in
a similar way as in #26490 and deprecating the setters/getters for the old syntax.
Relates to #17286
Currently we allow both "old" and "new" way of setting nested sorts on the
FieldSortBuilder at the same time. This should throw an error, instead the user
should choose one of the two possible options.
Also adding testing for the now deprecated nestedPath/nestedFilter parameters,
including checks that they emit warnings on parsing and that the new
NestedSortBuilder overwrites the deprecated parameters when building the
SortField.
Relates to #17286
Adding a check to QueryStringQueryBuilderTests that checks the override
behaviour of `quote_analyzer`, also adding documentation explaining the use of
this parameter in `query_string` query.
Closes #25417
We are getting the default Object#toString implementation here, we need
more than this. This commit instead formats the primary response to JSON
so we can see into its soul.
The new NestedSortBuilder currently is only tested via its use in the other
SortBuilder implementations it can be used in. This adds its own simple unit
test class that at first checks our usual fromXContent parsing, serialization
and hashCode/equals checks. It also adds tests for cases where NestedSortBuilder
is nested in itself and reuses the code for creating randomized instances in the
other SortBuilder tests.
In addition to the tests, this changes the `path` parameter in NestedSortBuilder
to be mandatory and removes the `read` method since it is not really needed.
The current script service has a script compilation limit for a one
minute window. This is set to a small default value of 15. Instead of
increasing that default value, this commit introduces a new setting
that allows configuring a rate per time unit, so that the script service can deal with bursts better.
The new setting is named `script.max_compilations_rate`,
requires a nonnegative number and a positive time value.
The default is `75/5m`, which is equivalent to the existing 15 per minute.
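A small sketch of the new format, assuming it is supplied as a node setting; the values simply restate the default described above.

```java
import org.elasticsearch.common.settings.Settings;

public class CompilationRateSketch {
    public static void main(String[] args) {
        // "75/5m" = up to 75 compilations per five-minute window, the same average
        // as the previous 15-per-minute limit but more tolerant of bursts.
        Settings nodeSettings = Settings.builder()
                .put("script.max_compilations_rate", "75/5m")
                .build();
        System.out.println(nodeSettings.get("script.max_compilations_rate"));
    }
}
```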
In some cases a request can already be aborted and retried. This means
the condition that aborting a request should only happen when an item
has not been processed yet is too strict. This commit allows for a
double abort. If we attempt to abort an operation that was previously
processed but not aborted, we treat that as a hard failure.
Relates #26434
Adds support for bulk items to be aborted before they are processed by the TransportShardBulkAction.
This can be used by an ActionFilter to reject a subset of the items in a bulk action without rejecting the whole action (or all the items for a shard).
Currently we don't have much unit testing for the SortField that is created when
calling the SortBuilders' `build` method. Most of this is covered by integration tests
somewhere but it would be good to have some basic checks in FieldSortBuilderTest
as well.
This adds testing for the sort order, mode, missing values and checks that `nested`
gets set in the XFieldComparatorSource when `nestedPath` and `nestedFilter` are
set on the builder.
Relates to #17286
* Implement adaptive replica selection
This implements the selection algorithm described in the C3 paper for
determining which copy of the data a query should be routed to.
By using the service time EWMA, response time EWMA, and queue size EWMA we
calculate the score of a node by piggybacking these metrics with each search
request.
Since Elasticsearch lacks the "broadcast to every copy" behavior that Cassandra
has (as mentioned in the C3 paper) to update metrics after a node has been
highly weighted, this implementation adjusts a node's response stats using the
average of its own and the "best" node's metrics. This is so that a long GC
or other activity that may cause a node's rank to increase dramatically does not
permanently keep a node from having requests routed to it, instead it will
eventually lower its score back to the realm where it is a potential candidate
for new queries.
This feature is off by default and can be turned on with the dynamic setting
`cluster.routing.use_adaptive_replica_selection`.
Relates to #24915, however instead of `b=3` I used `b=4` (after benchmarking)
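A hedged, simplified sketch of a C3-style score along the lines described above; the names, units and queue compensation term are illustrative and do not reproduce the exact implementation, and the exponent `b` is left as a parameter (the paper uses 3, the text above mentions 4).

```java
public class ReplicaRankSketch {

    static double rank(double responseTimeEwmaNanos,
                       double serviceTimeEwmaNanos,
                       double queueSizeEwma,
                       int outstandingRequests,
                       int searchClients,
                       int b) {
        // Compensate the observed queue size for requests this client already has in flight.
        double estimatedQueue = 1 + (outstandingRequests * searchClients) + queueSizeEwma;
        // Higher rank == worse candidate: latency plus a penalty that grows with queue depth.
        return responseTimeEwmaNanos - serviceTimeEwmaNanos
                + Math.pow(estimatedQueue, b) * serviceTimeEwmaNanos;
    }

    public static void main(String[] args) {
        double idleNode = rank(2_000_000, 500_000, 1, 0, 3, 4);
        double busyNode = rank(2_000_000, 500_000, 8, 4, 3, 4);
        System.out.println("idle=" + idleNode + " busy=" + busyNode); // busy ranks much higher
    }
}
```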
* Randomly use adaptive replica selection for internal test cluster
* Use an action name *prefix* for retrieving pending requests
* Add unit test for replica selection
* don't use adaptive replica selection in SearchPreferenceIT
* Track client connections in a SearchTransportService instead of TransportService
* Bind `entry` pieces in local variables
* Add javadoc link to C3 paper and javadocs for stat adjustments
* Bind entry's key and value to local variables
* Remove unneeded actionNamePrefix parameter
* Use conns.longValue() instead of cached Long
* Add comments about removing entries from the map
* Pull out bindings for `entry` in IndexShardRoutingTable
* Use .compareTo instead of manually comparing
* add assert for connections not being null and gte to 1
* Copy map for pending search connections instead of "live" map
* Increase the number of pending search requests used for calculating rank when chosen
When a node gets chosen, this increases the number of search counts for the
winning node so that it will not be as likely to be chosen again for
non-concurrent search requests.
* Remove unused HashMap import
* Rename rank -> rankShardsAndUpdateStats
* Rename rankedActiveInitializingShardsIt -> activeInitializingShardsRankedIt
* Instead of precalculating winning node, use "winning" shard from ranked list
* Sort null ranked nodes before nodes that have a rank
Multi-level Nested Sort with Filters
Allow multiple levels of nested sorting where each level can have its own filter.
Backward compatible with previous single-level nested sort.
* Moves deferring code into its own subclass
This change moves the code that deals with deferring collection to a subclass of BucketAggregator called DeferringBucketAggregator. This means that the code in AggregatorBase is simplified and also means that the code for deferring collection is in one place and easier to maintain.
* Makes SingleBucketAggregator an interface
This is so aggregators that extend BucketsAggregator directly and those that extend DeferringBucketAggregator can be a single bucket aggregator
* review comments
* More review comments
When creating the keystore explicitly (from executing
elasticsearch-keystore create) or implicitly (for plugins that require
the keystore to be created on install) on an Elasticsearch package
installation, we are running as the root user. This leaves
/etc/elasticsearch/elasticsearch.keystore having the wrong ownership
(root:root) so that the elasticsearch user can not read the keystore on
startup. This commit adds setgid to /etc/elasticsearch on package
installation so that files created in this directory (as happens when
creating the keystore) end up with the correct ownership
(root:elasticsearch). Additionally, we set the permissions on the
keystore to be 660 so that the elasticsearch user via its group can read
this file on startup.
Relates #26412
* Remove the _all metadata field
This change removes the `_all` metadata field. This field is deprecated in 6
and cannot be activated for indices created in 6 so it can be safely removed in
the next major version (e.g. 7).
The `locale` field of `date` fields accepts almost any string and unknown
locales are simply ignored, which is trappy. We should fail on unknown languages
or countries.
This commit also makes `-` an accepted separator in addition to `_` since `-`
is the recommended separator (https://tools.ietf.org/html/rfc5646#section-2.1).
`_` is probably still worth supporting since it is the separator used by
`Locale#toString()`.
In order to know when the script compilation limit has kicked in,
this commit adds a counter to the script stats to expose that
information.
So far the only way to find out about this was to check the logs
or check out responses of individual requests.
At current, we do not feel there is enough of a reason to shade the low
level rest client. It caused problems with commons logging and IDE's
during the brief time it was used. We did not know exactly how many
users will need this, and decided that leaving shading out until we
gather more information is best. Users can still shade the jar
themselves. For information and feedback, see issue #26366.
Closes #26328
This reverts commit 3a20922046.
This reverts commit 2c271f0f22.
This reverts commit 9d10dbea39.
This reverts commit e816ef89a2.
This commit removes the streams test for access after closing the bytes
stream. Output streams being closed mean they can no longer be written
to, but other methods to retrieve side state of the stream can still
make sense, such as bytes() in this case.
relates #12620
This allows plugins to plug rescore implementations into
Elasticsearch. While this is a fairly expert thing to do I've
done my best to point folks to the QueryRescorer as one that at
least documents the tradeoffs that it makes. I've attempted to
limit the API surface area by removing `SearchContext` from the
exposed interface, instead exposing just the IndexSearcher and
`QueryShardContext`. I also tried to make some of the class names
more consistent and do some general cleanup while I was there.
I entertained the notion of moving the `QueryRescorer` to a module.
After all, it'd be a wonderful test to prove that you can plug
rescore implementation into Elasticsearch if the only built in
rescore implementation is in the module. But I decided against it
because the new module would require a client jar and it'd require
moving some more things around. I think if we really want to do
it, we should do it as a followup.
I did, on the other hand, create an "example" rescore plugin which
should both be a nice example for anyone wanting to plug in their
own rescore implementation and serves as a good integration test
to make sure that you can indeed plug one in.
Closes #26208
This change rewrites phrase queries built on a field indexed without positions
to a match_no_docs query when the `lenient` option is set to true.
This change affects all full text queries.
There is a group of five settings relating to raw tcp configurations
(no_delay, buffer sizes, etc) that we have for the http transport. These
currently live in the netty module. As they are unrelated to netty
specifically, this commit moves these settings to the
`HttpTransportSettings` class in core.
This commit removes the keystore creation on elasticsearch startup, and
instead adds a plugin property which indicates the plugin needs the
keystore to exist. It does still make sure the keystore.seed exists on
ES startup, but through an "upgrade" method that is called when the keystore is
loaded in Bootstrap.
closes #26309
This commit renames the TransportResyncReplicationAction name to be an internal action as this is
not an action that should be invoked by a user, but is instead internal to the operation of the
system.
* Check bucket metric ages point to a multi bucket agg
This adds a validation step to the BucketMetricsPipelineAggregationBuilder which ensures that the first aggregation in the `buckets_path` is a multi-bucket aggregation. It does this using a new `MultiBucketAggregationBuilder` marker interface.
The change also moves the validation of pipeline aggregations to the `AggregatorFactories.build()` method so the validation can inspect sibling `AggregatorBuilder` objects rather than `AggregatorFactory` objects. Further it removes validation from `AggregatorFactory`, since it was never implemented there and since aggregators only depend on their own internal state and not on other aggregators; they should ideally be validated at setter time, but in rare cases where this is not possible the validation should be done in the `AggregationBuilder.build()` step.
Closes #25775
Move validate stage to happen during AggregatorFactories.Builder.build
Also removes validate method from normal aggs since it was never used.
* review comment fix
* Accept an array of field names and boosts in the index.query.default_field setting
This commit allows to define an array of field names and boosts for the index setting `index.query.default_field`.
The format is equivalent to the `fields` options of the full text search queries (e.g. field_name^boost).
This commit also makes this setting dynamically updatable.
Fixes #25946
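For illustration, the per-request counterpart of the new setting format (`field_name^boost`), using the existing query builder API; the field names are examples.

```java
import org.elasticsearch.index.query.QueryBuilders;
import org.elasticsearch.index.query.QueryStringQueryBuilder;

public class DefaultFieldBoostSketch {
    public static void main(String[] args) {
        QueryStringQueryBuilder query = QueryBuilders.queryStringQuery("quick brown fox")
                .field("title", 3.0f)   // same meaning as "title^3" in the index setting
                .field("body");
        System.out.println(query);
    }
}
```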
* More XContent migrations
* Removes ToXContentToBytes
* Adds toString to classes that used to extend ToXContentToBytes
* use XContentHelper
* more review comments
* prettify tostring output
The test verifies that search on the primary works by executing a search with preference _primary. If the primary is relocating, however, it
does not take the primary relocation target into account. The test only makes sense, however, if balancing is not happening yet, i.e., the
cluster is not green.
The javadoc tool on JDK 9 has issues with the combination of anonymous classes and varargs parameters.
This commit simply refactors a few anonymous classes to private inner classes.
This PR begins the long journey to deprecating Streamable.
The idea here is to add additional method signatures that
support Writeable.Reader, so that objects extending TransportMessage can be migrated to
implement Writeable and not Streamable.
One example conversion is done in this PR: SimulatePipelineRequest.
This commit extracts the inner query in the ESToParentBlockJoinQuery for highlighting.
This query has been added in 5.4 and breaks plain highlighting on nested queries.
Highlighters that use postings or term vectors are not affected because they can't highlight nested documents correctly.
Fixes #26230
This commit makes the security code aware of the Java 9 FilePermission changes (see #21534) and allows us to remove the `jdk.io.permissionsUseCanonicalPath` system property.
Gives allocation commands from the cluster reroute API
the ability to provide messages to be logged once the
cluster state change has been committed.
The purpose of this change is to create a record in the
logs when allocation commands which could potentially
be destructive are applied. The allocate_empty_primary
and allocate_stale_primary commands are the only ones
that currently provide log messages.
Closes #22821
* Deprecate global_ordinals_hash and global_ordinals_low_cardinality
This change deprecates the `global_ordinals_hash` and `global_ordinals_low_cardinality` and
makes the `global_ordinals` execution hint choose internally if global ords should be remapped or use the segment ord directly.
These hints are too sensitive and expert to be exposed and we should be able to take the right decision internally based on the agg tree.
Currently the `precision` parameter must be a precision level
in the range of [1,12]. In #5042 it was suggested also supporting
distance units like "1km" to automatically approcimate the needed
precision level. This change adds this support to the Rest API by
making use of GeoUtils#geoHashLevelsForPrecision.
Plain integer values without a unit are still treated as precision
levels like before. Distance values that are too small to be represented
by a precision level of 12 (values approx. less than 0.056m) are
rejected.
Closes #5042
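A hedged sketch of the mapping from distance to geohash level, assuming the `GeoUtils` helper named above accepts a distance string (it can otherwise be given the distance in meters).

```java
import org.elasticsearch.common.geo.GeoUtils;

public class GeoHashPrecisionSketch {
    public static void main(String[] args) {
        // Coarser distances map to lower geohash levels, finer distances to higher ones.
        int kmLevel = GeoUtils.geoHashLevelsForPrecision("1km");
        int meterLevel = GeoUtils.geoHashLevelsForPrecision("10m");
        System.out.println("1km -> level " + kmLevel + ", 10m -> level " + meterLevel);
    }
}
```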
The `from` search parameter cannot really be used in scrolled searches. This
commit adds a check for this case to the SearchRequest#validate() method so we
can report it as an error rather than silently ignoring it.
Closes #9373
This change is a continuation of #25726 that aligns field expansions for the simple_query_string with the query_string and multi_match query.
The main changes are:
* For exact field name, the new behavior is to rewrite to a matchnodocs query when the field name is not found in the mapping.
* For partial field names (with * suffix), the expansion is done only on keyword, text, date, ip and number field types. Other field types are simply ignored.
* For all fields (*), the expansion is done on accepted field types only (see above) and metadata fields are also filtered.
The use_all_fields option is deprecated in this change and can be replaced by setting `*` in the fields parameter.
This commit also changes how text fields are analyzed. Previously the default search analyzer (or the provided analyzer) was used to analyze every text part,
ignoring the analyzer set on the field in the mapping. With this change, the field analyzer is used instead unless an analyzer has been forced in the parameter of the query.
Finally now that all full text queries can handle the special "*" expansion (`all_fields` mode), the `index.query.default_field` is now set to `*` for indices created in 6.
This commit converts script query to use a new FilterScript context. The
new context returns a boolean, so the error that would have previously
happened at runtime if a non boolean was returned would now happen at
script compilation. Also, the leniency of supporting returning a number
and 0 mapping to false, non-zero to true is gone, but it was never
documented. With the new context compilation will now also fail if
special variables are used at compilation time, instead of runtime, eg
ctx.
Right now we use a custom future for the CloseFuture associated with a
channel. This is because we need special unwrapping logic to ensure that
exceptions from a future failure are a certain type (opposed to an
UncategorizedException). However, the current version is limiting
because we can only attach one listener.
This commit changes the CloseFuture to extend the
PlainListenableActionFuture. This change allows us to attach multiple
listeners.
When slices is set as auto, there's an additional network call
needed for the reindex tasks to know how to rethrottle. Sometimes
the rethrottle action happens before the reindex task is fully
initialized, so in the test we wait for the task to be ready.
This commit also adds some safeguards to ensure that
cancel and rethrottle operations are handled correctly
Closes #26192
If we do not have permissions to write the keystore, an unclear access
denied exception is thrown. This commit catches this exception so that
we can decorate it with a friendlier error message.
Relates #26284
We use `:` for cross-cluster search (eg `cluster:index`), therefore, we should
not allow the ambiguity when allowing cluster or index names.
Relates to #23892
We already added the functionality to create a new keystore on startup
in #26126 but apparently missed persisting the keystore. This change adds
persistence and adds a test for the bootstrap loading.
Today a `ClusterState.Custom` can be fetched by a transport client and
leaks to the user even if the classes are private etc since the serialized
bytes can be reconstructed. This change adds an option to customs to mark
them as private such that our clusterstate action will never leak it.
The AwaitsFix issue has been closed, as deleting an index and recreating it with the same name will give the
shard a fresh folder to be written to (based on the index uuid).
Due to the weird way of structuring the serialization code in AcknowledgedRequest, many request types forgot to properly serialize the request timeout, for example "index deletion", "index rollover", "index shrink", "putting pipeline", and other requests. This means that if those requests were not directly sent to the master node, the acknowledgement timeout information would be lost (and the default used instead).
Some requests also don't properly expose the timeout mechanism in the REST layer, such as put / delete stored script. This commit fixes all that.
This commit adds a keystore.seed setting that is automatically
generated when the ES keystore is created. This setting may be used by
plugins as a secure, random value. This commit also auto creates the
keystore upon startup to ensure the new setting is always available.
For the document field equals and hash code tests, we try to mutate the
document field to intentionally produce a document field not equal to
our provided one. We do this by randomly choosing a document field that
has either
- a randomly chosen field name and the same field value as the provided
document field
- a randomly chosen field value and the same field name as the
provided document field
If we are unlucky, it can be that the document field chosen by this
method can be equal to the provided document field. In this case, our
test will fail because the mutation really should be not equal. In this
case, we should simply try the other mutation. Note that the random document
field produced by the second method can be equal to the provided
document because it has the same field name and we can get unlucky with
our randomly chosen field values. It is not the case that the random
document field produced by the first method can be equal to the provided
document field; this is because the current implementation guarantees
that the field name length will be different guaranteeing that we have a
different field name. Nevertheless, we fix the issue here by checking
that our random choice gives us a non-equal document field, and assert
that if we got unlucky the other one will work for us.
In a few places we need to lazy initialize static deprecation
loggers. This is needed to avoid touching logging before logging is
configured, but deprecation loggers that are used in foundational
classes like settings and parsers would be initialized before logging is
configured. Previously we used a lazy set once pattern which is fine,
but there's a simpler approach: the holder pattern.
Relates #26218
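A minimal sketch of the holder pattern mentioned above: the logger is only constructed when the holder class is first initialized, i.e. on first use, which happens after logging has been configured. The surrounding class name and the DeprecationLogger construction are illustrative.

```java
import org.apache.logging.log4j.LogManager;
import org.elasticsearch.common.logging.DeprecationLogger;

public class SomeFoundationalClass {

    private static final class DeprecationLoggerHolder {
        // Initialized lazily by the JVM the first time the holder class is touched.
        static final DeprecationLogger INSTANCE =
                new DeprecationLogger(LogManager.getLogger(SomeFoundationalClass.class));
    }

    static void deprecatedFeatureUsed() {
        // First call triggers initialization of the holder class and thus the logger.
        DeprecationLoggerHolder.INSTANCE.deprecated("this feature is deprecated");
    }
}
```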
This commit changes the keystore cli add commands to prompt for
creating the keystore if it does not exist. This will make it easier on
users starting out, not having to run a separate command for creation.
An array of values is required because there is no default (or
reasonable way to set a default). But validation for values
only happens if it is actually set. If the values param is omitted
entirely then the agg builder will NPE.
This change adds several random test infrastructure improvements that caused
issues in ongoing development but are generally useful. For instance, it is impossible
to restart a node with a secure setting source since we close it after the node is started.
This change makes it cloneable such that we can reuse it for a restart.
Currently the `percentiles` aggregation allows specifying both possible methods
in the query DSL, but only the latter one is used. This changes it to rejecting
such requests with an error. Setting the method multiple times via the java API
still works (and the last one wins).
Closes #26095
This simply removes the default identity hashcode and equals methods in InternalAggregation, which were only temporarily put there while we implemented the methods in the subclasses.
The node setting `cluster.indices.tombstones.size` was not registered with the settings infrastructure, making it impossible for it to be set by a user.
Closes #26191
For CLI tools, we configure logging without reading the
log4j2.properties file. This is because any log statements in a CLI tool
should dump to the console, while reading from the log4j2.properties file
would cause them to dump wherever the log configuration there indicates
(e.g., possibly a remote machine). To do this, we added some code to the
base implementation of all CLI tools to configure logging without a
config file. This code is also executed when Elasticsearch starts up. In
the past this was fine yet we previously added detection to
Elasticsearch to find cases where we use logging before it is
configured. Because of configuring logging without a config, this means
we only catch uses of logging before the logging without config is
performed. To correct this, we enable a CLI tool to skip enabling
logging without a config and then in the Elasticsearch CLI we indeed
utilize this to skip configuring logging without a config.
Relates #26209
The flood warning checks the wrong threshold, namely the high
watermark. This would impact any node for which the disk usage is above
the high watermark and below the flood stage watermark. This commit
fixes this so that it compares to the flood threshold.
Relates #26204
* Rewrite range queries with open bounds to exists query
This change rewrites range query with open bounds to an exists query that should be faster to execute.
Fixes #22640
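A small illustration of the equivalence behind the rewrite, using the existing query builders: a range query with neither bound set matches any document that has a value for the field, which is what an exists query expresses directly (and executes faster).

```java
import org.elasticsearch.index.query.QueryBuilder;
import org.elasticsearch.index.query.QueryBuilders;

public class OpenRangeRewriteSketch {
    public static void main(String[] args) {
        QueryBuilder openRange = QueryBuilders.rangeQuery("timestamp"); // no gte/lte set
        QueryBuilder equivalent = QueryBuilders.existsQuery("timestamp");
        System.out.println(openRange);
        System.out.println(equivalent);
    }
}
```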
`epoch_millis` and `epoch_second` date formats truncate float values, as numbers or as strings.
The `coerce` parameter is not defined for `date` field type and this is not changing.
See PR #26119
Closes #14641
The following token filters were moved: arabic_stem, brazilian_stem, czech_stem, dutch_stem, french_stem, german_stem and russian_stem.
Relates to #23658
In reindex APIs, when using the `slices` parameter to choose the number of slices, adds the option to specify `slices` as "auto" which will choose a reasonable number of slices. It uses the number of shards in the source index, up to a ceiling. If there is more than one source index, it uses the smallest number of shards among them.
This gives users an easy way to use slicing in these APIs without having to make decisions about how to configure it, as it provides a good-enough configuration for them out of the box. This may become the default behavior for these APIs in the future.
By default we only serialize analyzers if the index analyzer is not the
`default` analyzer or if the `search_analyzer` is different from the index
`analyzer`. This raises issues with the `_all` field when the
`index.analysis.analyzer.default_search` is set, since it automatically makes
the `search_analyzer` different from the index `analyzer`. Then there are
exceptions since we expect the `_all` configuration to be empty on 6.0 indices.
Closes #26136
Today we have a `null` invariant on all `ClusterState.Custom`. This makes
several code paths complicated and requires complex state handling in some cases.
This change allows registering a custom supplier that is used to initialize the
initial clusterstate with these transient customs.
This is a safer default since sorting by sub aggregations prevents these
aggregations from being deferred. `global_ordinals_hash` will at least
make sure that we do not use memory for buckets that are not collected.
Closes #24359
Two tests were still using the static indices:
* IndexFolderUpgraderTests#testUpgradeRealIndex()
* InternalEngineTests#testUpgradeOldIndex()
I removed these tests too, because these tests functionally overlap
with the full-cluster-restart qa tests.
Relates to #24939
This occasionally fails now because if `top` is `-Infinity` (which we sometimes
test for in randomization), the value might not get changed for the
equals/hashCode tests.
Closes #26107
* Adds ToXContentFragment
This interface is meant for objects that implement `ToXContent` but are not complete objects. It is basically the opposite of `ToXContentObject`. It means that it will be easier to track the migration of classes over to the fragment/not fragment ToXContent model as it will be clear which classes are not migrated. When no classes directly implement `ToXContent` we can make `ToXContent` package private to be sure that all new classes must implement `ToXContentObject` or `ToXContentFragment`.
* review comments
* more review comments
* javadocs
* iter
* Adds tests
* iter
* adds toString test for aggs
* improves tests following review comments
* iter
* iter
* validate half float values
* test upper bound for numeric mapper
* test for upper bound for float, double and half_float
* more tests on NaN and Infinity for NumberFieldMapper
* fix checkstyle errors
* minor renaming
* comments for disabled test
* tests for byte/short/integer/long removed and will be added in separate PR
* remove unused import
* Fix scaledfloat out of range validation message
* 1) delayed autoboxing in numbertype.parse(...)
2) no redundant checks in half_float validation
3) tests with negative values for half_float/float/double
* Add support for auto_generate_synonyms_phrase_query in match_query, multi_match_query, query_string and simple_query_string
This change adds a new parameter called auto_generate_synonyms_phrase_query (defaults to true).
This option can be used in conjunction with synonym_graph token filter to generate phrase queries
when multi terms synonyms are encountered.
For example, a synonym like "ny, new york" would produce the following boolean query when "ny city" is parsed:
((ny OR "new york") AND city)
Note how the multi terms synonym "new york" produces a phrase query.
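A hedged sketch of the example above from the Java side, assuming the match query builder exposes a setter mirroring the REST parameter name introduced by this change.

```java
import org.elasticsearch.index.query.MatchQueryBuilder;
import org.elasticsearch.index.query.Operator;
import org.elasticsearch.index.query.QueryBuilders;

public class SynonymPhraseQuerySketch {
    public static void main(String[] args) {
        // With a synonym_graph filter mapping "ny, new york", parsing "ny city"
        // yields ((ny OR "new york") AND city): the multi-term synonym becomes a phrase query.
        MatchQueryBuilder query = QueryBuilders.matchQuery("body", "ny city")
                .operator(Operator.AND)
                .autoGenerateSynonymsPhraseQuery(true); // the default per this change
        System.out.println(query);
    }
}
```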
We introduced a hack in #25885 to respect the cluster alias if available on the `_index` field. This is important if aggregations or other field data related operations are executed. Yet, we added a small hack that duplicated an implementation detail from the `_index` field data builder to make this work. This change adds a necessary but simple API change that allows us to remove the hack and only have a single implementation.
The goal of this similarity is to help users who would like to keep the
functionality of the `tf-idf` similarity that we want to remove, or to allow
for specific use-cases (disabling idf, disabling tf, disabling length norm,
etc.) to not have to build a custom plugin and familiarize with the low-level
Lucene API.