OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	504462f617	Reverting change to Script in core, can go in separately	2017-11-21 14:09:26 +01:00
Christoph Büscher	bea8451b2f	Merge branch 'master' into feature/rank-eval	2017-09-15 11:44:51 +02:00
Boaz Leskes	ffc9999567	fix StartRecoveryRequestTests.testSerialization	2017-09-14 23:20:55 +03:00
Boaz Leskes	1ca0b5e9e4	Introduce a History UUID as a requirement for ops based recovery (#26577 ) The new ops based recovery, introduce as part of #10708, is based on the assumption that all operations below the global checkpoint known to the replica do not need to be synced with the primary. This is based on the guarantee that all ops below it are available on primary and they are equal. Under normal operations this guarantee holds. Sadly, it can be violated when a primary is restored from an old snapshot. At the point the restore primary can miss operations below the replica's global checkpoint, or even worse may have total different operations at the same spot. This PR introduces the notion of a history uuid to be able to capture the difference with the restored primary (in a follow up PR). The History UUID is generated by a primary when it is first created and is synced to the replicas which are recovered via a file based recovery. The PR adds a requirement to ops based recovery to make sure that the history uuid of the source and the target are equal. Under normal operations, all shard copies will stay with that history uuid for the rest of the index lifetime and thus this is a noop. However, it gives us a place to guarantee we fall back to file base syncing in special events like a restore from snapshot (to be done as a follow up) and when someone calls the truncate translog command which can go wrong when combined with primary recovery (this is done in this PR). We considered in the past to use the translog uuid for this function (i.e., sync it across copies) and thus avoid adding an extra identifier. This idea was rejected as it removes the ability to verify that a specific translog really belongs to a specific lucene index. We also feel that having a history uuid will serve us well in the future.	2017-09-14 21:25:02 +03:00
Christoph Büscher	c7c6443b10	[Docs] "The the" is a great band, but ... (#26644 ) Removing several occurrences of this typo in the docs and javadocs, seems to be a common mistake. Corrections turn up once in a while in PRs, better to correct some of this in one sweep.	2017-09-14 15:08:20 +02:00
Jason Tedor	ca6bce75da	Refactor bootstrap check results and error messages This commit refactors the bootstrap checks into a single result object that encapsulates whether or not the check passed, and a failure message if the check failed. This simpifies the checks, and enables the messages to more easily be based on the state used to discern whether or not the check passed. Relates #26637	2017-09-13 21:30:27 -04:00
Simon Willnauer	b4de2a6f28	Add BootstrapContext to expose settings and recovered state to bootstrap checks (#26628 ) This exposes the node settings and the persistent part of the cluster state to the bootstrap checks to allow plugins to enforce certain preconditions based on the recovered state.	2017-09-13 22:14:17 +02:00
Christoph Büscher	2eaf7534f3	[Tests] Removing skipping tests in search rest tests After backporting the script_field soft limit to the 6.x branches, this test can now also run in a mixed cluster. Relates to #26598 enter the commit message for your changes. Lines starting	2017-09-13 18:21:15 +02:00
Jason Tedor	7be5ee5f28	Initialize checkpoint tracker with allocation ID This commit pushes the allocation ID down through to the global checkpoint tracker at construction rather than when activated as a primary. Relates #26630	2017-09-13 12:15:15 -04:00
Adrien Grand	93da7720ff	Move non-core mappers to a module. (#26549 ) Today we have all non-plugin mappers in core. I'd like to start moving those that neither map to json datatypes nor are very frequently used like `date` or `ip` to a module. This commit creates a new module called `mappers-extra` and moves the `scaled_float` and `token_count` mappers to it. I'd like to eventually move `range` fields there but it's more complicated due to their intimate relationship with range queries. Relates #10368	2017-09-13 17:58:53 +02:00
Christoph Büscher	027c555c9b	Add soft limit on allowed number of script fields in request (#26598 ) Requesting to many script_fields in a search request can be costly because of script execution. This change introduces a soft limit on the number of script fields that are allowed per request. The setting can be changed per index using the index.max_script_fields setting. Relates to #26390	2017-09-13 17:22:16 +02:00
Adrien Grand	64770b3fbd	Remove MapperService#dynamic. (#26603 ) We ignore it as of 6.0 and forbid it as of 7.0.	2017-09-13 17:00:52 +02:00
Adrien Grand	454cfc2cea	More efficient encoding of range fields. (#26470 ) This PR removes the vInt that precedes every value in order to know how long they are. Instead the query takes an enum that tells how to compute the length of values: for fixed-length data (ip addresses, double, float) the length is a constant while longs and integers use a variable-length representation that allows the length to be computed from the encoded values. Also the encoding of ints/longs was made a bit more efficient in order not to waste 3 bits in the header. As a consequence, values between -8 and 7 can now be encoded on 1 byte and values between -2048 and 2047 can now be encoded on 2 bytes or less. Closes #26443	2017-09-13 15:26:33 +02:00
Ivan Brusic	9e05b3260b	Add boolean similarity to built in similarity types (#26613 )	2017-09-13 13:58:30 +02:00
Jason Tedor	b3e7e85cf1	Let search phases override max concurrent requests If the query coordinating node is also a data node that holds all the shards for a search request, we can end up recursing through the can match phase (because we send a local request and on response in the listener move to the next shard and do this again, without ever having returned from previous shards). This recursion can lead to stack overflow for even a reasonable number of indices (daily indices over a sixty days with five shards per day is enough to trigger the stack overflow). Moreover, all this execution would be happening on a network thread (the thread that initially received the query). With this commit, we allow search phases to override max concurrent requests. This allows the can match phase to avoid recursing through the shards towards a stack overflow. Relates #26484	2017-09-13 06:16:27 -04:00
Christoph Büscher	e00db235bc	Add a soft limit for the number of requested doc-value fields (#26574 ) Requesting to many docvalue_fields in a search request can potentially be costly because it might incur a per-field per-document seek. This change introduces a soft limit on the number of fields that can be retrieved. The setting can be changed per index using the `index.max_docvalue_fields_search` setting. Relates to #26390	2017-09-13 11:57:06 +02:00
Adrien Grand	04b24c7780	Fix Lucene version of 5.6.1.	2017-09-12 17:54:50 +02:00
Michael Basnight	0e57a416f1	Handle the 5.6.0 release	2017-09-12 09:48:09 -05:00
Simon Willnauer	42f3129d7b	Allow plugins to validate cluster-state on join (#26595 ) Today we don't have a pluggable way to validate if the cluster state is compatible with the node that joins. We already apply some checks for index compatibility that prevents nodes to join a cluster with indices it doesn't support but for plugins this isn't possible. This change adds a cluster state validator that allows plugins to prevent a join if the cluster-state is incompatible.	2017-09-12 15:32:33 +02:00
Yu	3d4e28aee1	Remove index mapper dynamic settings (#25734 ) Remove "index.mapper.dynamic" setting for 6.0 (and after) indices, but still keep working for 5.x (and before) indices. Remove two index dynamic disable test cases as the disability of index.mapper.dynamic is already removed for current version. Add a new test class for version test.	2017-09-12 14:29:10 +02:00
Ryan Ernst	5c35bff1c3	Test: Remove leftover static bwc test case (#26584 ) This test case was leftover from the static bwc tests. There was still one use for checking we do not load old indices, but this PR moves the legacy code needed for that directly into the test. I also opened a follow up issue to completely remove the unsupported test: #26583.	2017-09-11 15:38:30 -07:00
Jason Tedor	b2e4bfa0a7	Snapshot fallback should consider build.snapshot When determining if a build is a snapshot build, we look for a field in the JAR manifest. However, when running tests, we are not running with a compiled core Elasticsearch JAR, we are running with the compiled core classes on the classpath. We have a fallback for this, we always assume such a situation is a snapshot build. However, when running builds with -Dbuild.snapshot=false, this is not the case. As such, we need to fallback to the value of build.snapshot. However, there are cases where we are not running with a compiled core Elasticsearch JAR (e.g., when the transport client is embedded in a web container) so we should only do this fallback if we are in tests. To verify we are in tests, we check if randomized runner is on the classpath. Relates #26554	2017-09-11 07:42:11 -04:00
Jim Ferenczi	c62b0192d0	#26496 : Set the correct bwc version after backport to 6.x	2017-09-11 13:09:44 +02:00
Adrien Grand	1adee8b5a8	Fix the MapperFieldType.rangeQuery API. (#26552 ) RangeQueryBuilder needs to perform too many `instanceof` checks in order to check for `date` or `range` fields in order to know what it should do with the shape relation, time zone and date format. This commit adds those 3 parameters to the `rangeQuery` factory method so that those instanceof checks are not necessary anymore.	2017-09-11 11:02:05 +02:00
Adrien Grand	2bc3eeccde	Deduplicate `_field_names`. (#26550 ) This is a minor optimization that should save some utf8 conversions and indexing.	2017-09-11 10:57:08 +02:00
Md.Abdulla-Al-Sun	d00d18a36d	[Docs] Fix typo in javadocs (#26556 )	2017-09-09 22:25:31 +02:00
Lee Hinman	2702918780	Limit the number of expanded fields it query_string and simple_query_string (#26541 ) * Limit the number of expanded fields it query_string and simple_query_string This limits the number of automatically expanded fields for the "all fields" mode (`"default_field": ""`) for the `query_string` and `simple_query_string` queries to 1024 fields. Resolves #25105 Add blurb about limit to the docs	2017-09-08 13:37:55 -06:00
Lee Hinman	dd90cf1bbb	Throw a better error message for empty field names (#26543 ) * Throw a better error message for empty field names When a document is parsed with a `""` for a field name, we currently throw a confusing error about `.` being present in the field. This changes the error message to be clearer about what's causing the problem. Resolves #23348 * Fix exception message in test	2017-09-08 13:30:17 -06:00
Lee Hinman	4e43aac0f8	Expand "NO" decision message in NodeVersionAllocationDecider (#26542 ) This explains the `NO` Decision a little more. Resolves #10403	2017-09-08 09:18:34 -06:00
Antonio Matarrese	155db7326a	_reroute's retry_failed flag should reset failure counter (#25888 ) To protect against poisonous situations, ES will only try to allocate a shard 5 times (by default). After 5 consecutive failures, ES will stop assigning the shard and wait for an operator to fix the problem. Once the problem is fixed, the operator is expected to call `_reroute` with a `retry_failed` flag to force retrying of those shards. Currently that retry flag is only used for a single allocation run. However, if not all shards can be allocated at once (due to throttling) the operator has to keep on calling the API until all shards are assigned which is cumbersome. This PR changes the behavior of the flag to reset the failed allocations counter and this allowing shards to be assigned again.	2017-09-08 12:18:52 +02:00
Jim Ferenczi	3435c9f4e2	#26496 : Fix sporadic failure of ContextCompletionSuggestSearchIT#testGeoBoosting This test should not rely on strict ordering for same score suggestions. The Lucene completion suggester uses the doc id in case of a tie and documents are indexed randomly.	2017-09-08 11:30:40 +02:00
Jason Tedor	e3b0cc9867	Remove norelease regarding destroying history This commit removes a norelease from the codebase now that there is a CI job that fails on the norelease pattern being present. Instead, a new issue has been opened to track this one. Relates #26544	2017-09-07 21:57:08 -04:00
Jim Ferenczi	e684c5e0a5	#26496 : handle `shard_size` correctly in the completion suggester and tests. The completion suggester has a `shard_size` option that sets the size of the suggestions to retrieve per shard but it is ignored by the builder. This commit restores the handling of this option and fixes a test that can randomly fail without it.	2017-09-07 18:22:28 +02:00
Lee Hinman	cff904bf97	Enable adaptive replica selection by default (#26522 ) Relates to #24915	2017-09-07 09:25:05 -06:00
Jim Ferenczi	d68d8c9cef	Expose duplicate removal in the completion suggester (#26496 ) This change exposes the duplicate removal option added in Lucene for the completion suggester with a new option called `skip_duplicates` (defaults to false). This commit also adapts the custom suggest collector to handle deduplication when multiple contexts match the input. Closes #23364	2017-09-07 17:11:01 +02:00
Jim Ferenczi	abe83c4fac	Fail query when a sort is provided in conjunction with rescorers (#26510 ) This change fixes a regression introduced in 6 that removes the skipping of the rescore phase when a sort other than _score is used. We now fail the request when a sort is provided in conjunction with rescore instead of just skipping the rescore phase This commit also adds an assert that checks if the topdocs are sorted by _score after the rescoring. This is the responsibility of the rescorer to make sure that topdocs are sorted after rescore so we just check that this condition is met in the rescore phase.	2017-09-07 14:17:37 +02:00
Christoph Büscher	ba02485541	Make sure SortBuilders rewrite inner nested sorts (#26532 ) The three SortBuilders that can have inner NestedSortBuilders currently don't rewrite any of the filters contained in them. This change adds a rewrite method to NestedSortBuilder and changes rewriting in FieldSortBuilder, ScriptSortBuilder and GeoDistanceSortBuilder to make sure inner nested sorts get rewritten if they need to.	2017-09-07 14:04:50 +02:00
Christoph Büscher	47ffa17efb	Extend testing of build method in ScriptSortBuilder (#26520 ) Improve testing around the ScriptSortBuilder#build method, adding checks for correct transfers of the sort mode and nested sorts. Also changing the behaviour around the nested_path, nested_filter vs. nested parameter in a similar way as in #26490 and deprecating the setters/getters for the old syntax. Closes #17286	2017-09-07 10:37:50 +02:00
Ryan Ernst	c9964d17bf	Internal: Add versionless alias for rest client codebase in policy files (#26521 ) Security manager policy files contains grants for specific codebases, where a codebase is a jar file. We use a system property containing the name of the jar file to resolve the jar file location when parsing the policy file. However, this means the version of the jars must be modified when versions of dependencies change. This is particularly messy for elasticsearch, where we now have a dependency on the rest client, and need to support both a snapshot version for testing and non snapshot for release. This commit adds an alias for the elasticsearch rest client without a version to be used in policy files. That allows the policy files to not care whether the rest client is a snapshot or release.	2017-09-06 18:57:10 -07:00
Lee Hinman	fe02350e73	With too many incoming tasks, reset measurements to 1ns instead of 0ns Resoves #26332 where too many tasks occurred while adjustment was happening, the measurements were reset to 0, and then an assert failed due to tasks executing in 0 nanoseconds	2017-09-06 15:34:51 -06:00
Jason Tedor	9c795bd838	Fix cache compute if absent for expired entries When a cache entry expires, it remains in the cache (both the segment that it belongs to, and the LRU list) until an eviction occurs. The problem here is that the compute if absent implementation relies on there not being an association to a key that we are trying to put because it internally uses put if absent on the underlying segment. If we try to put an association for a key that has expired but not been evicted, then compute if absent will return as if there is nothing in the cache for the given key, yet no call to compute if absent will succeed in putting a new association for the key. To remedy this, we modify the internal get method for the cache to let the caller take action if the entry they are retrieving is expired. This allows the compute if absent method to take the action of evicting the entry from the cache, thus allowing the put if absent method used by compute if absent to succeed for one of the callers trying to compute if absent a new association. Relates #26516	2017-09-06 13:44:20 -04:00
Jim Ferenczi	0c799eedc5	Add upper limit for scroll expiry (#26448 ) This change adds a dynamic cluster setting named `search.max_keep_alive`. It is used as an upper limit for scroll expiry time in scroll queries and defaults to 1 hour. This change also ensures that the existing setting `search.default_keep_alive` is always smaller than `search.max_keep_alive`. Relates #11511 * check style * add skip for bwc * iter * Add a maxium throttle wait time of 1h for reindex * review * remove empty line	2017-09-06 10:06:48 +02:00
Christoph Büscher	1b49bf3079	Remove deprecated parameters from `ids_query` (#26508 ) The `_type` and `types` version of the current `type` parameter have been deprecated since 5.0. We can remove support for them in 7.0 and also in 6.x and 6.0.	2017-09-05 18:12:31 +02:00
Tim Brooks	c1a20f7e48	Merge tsa with ts (#26369 ) We currently have a weird relationship between Transport, TransportService, and TransportServiceAdaptor. At some point I think that we would like to collapse these all into one concept as we only support TCP transports. This commit moves in that direction by eliminating the adaptor and just passing the transport service to the transport.	2017-09-05 09:15:56 -06:00
Christoph Büscher	760bd6c568	Extend testing of build method in GeoDistanceSortBuilder (#26498 ) Improve testing around the GeoDistanceSortBuilder#build method, adding checks for correct transfers of the sort order, mode, nested sorts and points validation and coercion. Also changing the behaviour around the nested_path, nested_filter vs. nested parameter in a similar way as in #26490 and deprecating the setters/getters for the old syntax. Relates to #17286	2017-09-05 14:38:10 +02:00
Martijn van Groningen	78e9c96d7f	Added a limit to from + size in top_hits and inner hits. Relates to #11511	2017-09-05 08:44:45 +02:00
Christoph Büscher	8f0369296f	Prohibit using `nested_filter`, `nested_path` and new `nested` Option at the same time in FieldSortBuilder (#26490 ) Currently we allow both "old" and "new" way of setting nested sorts on the FieldSortBuilder at the same time. This should throw an error, instead the user should choose one of the two possible options. Also adding testing for the now deprecated nestedPath/nestedFilter parameters, inlcuding checks that they emmit warnings on parsing and that the new NestetedSortBuilder overwrites the deprecated parameters when building the SortField. Relates to #17286	2017-09-04 17:19:52 +02:00
Boaz Leskes	2fd4af82e4	Move `UNASSIGNED_SEQ_NO` and `NO_OPS_PERFORMED` to SequenceNumbers (#26494 ) Where they better belong.	2017-09-04 16:31:00 +02:00
Alexander Reelsen	3706a16baf	Docs: Update broken link to flake ids in uuid generators	2017-09-04 10:48:50 +02:00
Christoph Büscher	f8fc0f3ebe	[Tests] Check that quoteAnalyzer overrides analyzer in `query_string` query (#26473 ) Adding a check to QueryStringQueryBuilderTests that checks the override behaviour of `quote_analyzer`, also adding documentation explaining the use of this parameter in `query_string` query. Closes #25417	2017-09-02 11:53:02 +02:00

1 2 3 4 5 ...

8913 Commits