OpenSearch

Commit Graph

Author	SHA1	Message	Date
Lee Hinman	c3da66d021	Implement adaptive replica selection (#26128 ) * Implement adaptive replica selection This implements the selection algorithm described in the C3 paper for determining which copy of the data a query should be routed to. By using the service time EWMA, response time EWMA, and queue size EWMA we calculate the score of a node by piggybacking these metrics with each search request. Since Elasticsearch lacks the "broadcast to every copy" behavior that Cassandra has (as mentioned in the C3 paper) to update metrics after a node has been highly weighted, this implementation adjusts a node's response stats using the average of the its own and the "best" node's metrics. This is so that a long GC or other activity that may cause a node's rank to increase dramatically does not permanently keep a node from having requests routed to it, instead it will eventually lower its score back to the realm where it is a potential candidate for new queries. This feature is off by default and can be turned on with the dynamic setting `cluster.routing.use_adaptive_replica_selection`. Relates to #24915, however instead of `b=3` I used `b=4` (after benchmarking) * Randomly use adaptive replica selection for internal test cluster * Use an action name prefix for retrieving pending requests * Add unit test for replica selection * don't use adaptive replica selection in SearchPreferenceIT * Track client connections in a SearchTransportService instead of TransportService * Bind `entry` pieces in local variables * Add javadoc link to C3 paper and javadocs for stat adjustments * Bind entry's key and value to local variables * Remove unneeded actionNamePrefix parameter * Use conns.longValue() instead of cached Long * Add comments about removing entries from the map * Pull out bindings for `entry` in IndexShardRoutingTable * Use .compareTo instead of manually comparing * add assert for connections not being null and gte to 1 * Copy map for pending search connections instead of "live" map * Increase the number of pending search requests used for calculating rank when chosen When a node gets chosen, this increases the number of search counts for the winning node so that it will not be as likely to be chosen again for non-concurrent search requests. * Remove unused HashMap import * Rename rank -> rankShardsAndUpdateStats * Rename rankedActiveInitializingShardsIt -> activeInitializingShardsRankedIt * Instead of precalculating winning node, use "winning" shard from ranked list * Sort null ranked nodes before nodes that have a rank	2017-08-30 20:55:11 -06:00
Tal Levy	ed151d829d	Migrate Search requests to use Writeable reading strategies (#26428 ) Migrates many SearchRequest objects to use Writeable conventions and rejects usage of `readFrom` in these new classes.	2017-08-30 11:00:33 -07:00
Martijn van Groningen	ea3fa768f9	Changed version from 7.0.0-alpha1 to 6.1.0 in the nested sorting serialization check.	2017-08-30 19:56:10 +02:00
Matt Weber	140395c83f	Multi-level Nested Sort with Filters (#26395 ) Multi-level Nested Sort with Filters Allow multiple levels of nested sorting where each level can have it's own filter. Backward compatible with previous single-level nested sort.	2017-08-30 18:52:56 +02:00
Martijn van Groningen	c821dce3fe	Revert "Multi-level Nested Sort with Filters" This reverts commit `6377afa6c3`.	2017-08-30 14:53:25 +02:00
Martijn van Groningen	410c6c281a	Revert "Temporarily set bwc version for new nested sorting to 7.0.0-alpha1 until the change has been backported to 6.x branch." This reverts commit `472a5dd56b`.	2017-08-30 14:53:10 +02:00
Martijn van Groningen	472a5dd56b	Temporarily set bwc version for new nested sorting to 7.0.0-alpha1 until the change has been backported to 6.x branch.	2017-08-30 14:30:20 +02:00
Martijn van Groningen	6377afa6c3	Multi-level Nested Sort with Filters Allow multple levels of nested sorting where each level can have it's own filter. Backward compatible with previous single-level nested sort.	2017-08-30 14:30:20 +02:00
Colin Goodheart-Smithe	ce1d85d7d0	Moves deferring code into its own subclass (#26421 ) * Moves deferring code into its own subclass This change moves the code that deals with deferring collection to a subclass of BucketAggregator called DeferringBucketAggregator. This means that the code in AggregatorBase is simplified and also means that the code for deferring colleciton is in one place and easier to maintain. * Makes SIngleBucketAggregator an interface This is so aggregators that extend BucketsAggregator directly and those that extend DeferringBucketAggregator can be a single bucket aggregator * review comments * More review comments	2017-08-30 11:15:40 +01:00
Adrien Grand	34a6c7af26	Consolidate locale parsing. (#26400 ) Mappings and ingest have different locale parsing code.	2017-08-30 10:58:33 +02:00
Sergey Galkin	c075323522	Refactor create index service to be unit testable This commit refactors MetaDataCreateIndexService so that it is unit testable. Relates #25961	2017-08-29 16:55:44 -04:00
Jason Tedor	7a035f5f84	setgid on /etc/elasticearch on package install When creating the keystore explicitly (from executing elasticsearch-keystore create) or implicitly (for plugins that require the keystore to be created on install) on an Elasticsearch package installation, we are running as the root user. This leaves /etc/elasticsearch/elasticsearch.keystore having the wrong ownership (root:root) so that the elasticsearch user can not read the keystore on startup. This commit adds setgid to /etc/elasticsearch on package installation so that when executing this directory (as we would when creating the keystore), we will end up with the correct ownership (root:elasticsearch). Additionally, we set the permissions on the keystore to be 660 so that the elasticsearch user via its group can read this file on startup. Relates #26412	2017-08-28 20:47:42 -04:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
Stuart Neivandt	f842ff1ae1	Simple verification of the format of the language tag used in DateProcessor. (#25513 ) Closes #26186	2017-08-28 10:59:00 +02:00
Adrien Grand	d692ccf261	Reject IPv6-mapped IPv4 addresses when using the CIDR notation. (#26254 ) It introduces ambiguity as to whether the prefix length should be interpreted as a v4 prefix length or a v6 prefix length. See https://issues.apache.org/jira/browse/LUCENE-7920. Closes #26078	2017-08-28 10:04:05 +02:00
Adrien Grand	262ea9534f	Make locale parsing less lenient. (#26361 ) The `locale` field of `date` fields accepts almost any string and unknown locales are simply ignored, which is trappy. We should fail on unknown languages or countries. This commit also makes `-` an accepted separator in addition to `_` since `-` is the recommended separator (https://tools.ietf.org/html/rfc5646#section-2.1). `_` is probably still worth supporting since it is the separator used by `Locale#toString()`.	2017-08-28 09:59:25 +02:00
Adrien Grand	36e22bc30f	Remove 5.x backcompat from synonym filters.	2017-08-28 09:56:01 +02:00
Adrien Grand	eb782492be	Remove support for lenient booleans. Closes #22298	2017-08-28 09:56:01 +02:00
Alexander Reelsen	bdf2c3c691	Script Stats: Add compilation limit counter to stats (#26387 ) In order to know, when the script compilation limit has kicked in, this commit adds a counter in the script stats to expose that information. So far the only way to find out about this was to check the logs or check out responses of individual requests.	2017-08-28 09:51:49 +02:00
Adrien Grand	6eac3ee8ba	Avoid hardcoded error message that depends on the current version in tests. (#26391 ) It makes it painful to bump the current version.	2017-08-28 09:11:31 +02:00
Michael Basnight	cfd14cd2b8	Revert shading for the low level rest client (#26367 ) At current, we do not feel there is enough of a reason to shade the low level rest client. It caused problems with commons logging and IDE's during the brief time it was used. We did not know exactly how many users will need this, and decided that leaving shading out until we gather more information is best. Users can still shade the jar themselves. For information and feeback, see issue #26366. Closes #26328 This reverts commit `3a20922046`. This reverts commit `2c271f0f22`. This reverts commit `9d10dbea39`. This reverts commit `e816ef89a2`.	2017-08-25 14:13:12 -05:00
Ryan Ernst	3655f3f2a3	Test: Remove irrelevant access after close test for stream (#26392 ) This commit removes the streams test for access after closing the bytes stream. Output streams being closed mean they can no longer be written to, but other methods to retrieve side state of the stream can still make sense, such as bytes() in this case. relates #12620	2017-08-25 11:30:37 -07:00
Nik Everett	b3edd11aa0	Allow plugins to plug rescore implementations (#26368 ) This allows plugins to plug rescore implementations into Elasticsearch. While this is a fairly expert thing to do I've done my best to point folks to the QueryRescorer as one that at least documents the tradeoffs that it makes. I've attempted to limit the API surface area by removing `SearchContext` from the exposed interface, instead exposing just the IndexSearcher and `QueryShardContext`. I also tried to make some of the class names more consistent and do some general cleanup while I was there. I entertained the notion of moving the `QueryRescorer` to module. After all, it'd be a wonderful test to prove that you can plug rescore implementation into Elasticsearch if the only built in rescore implementation is in the module. But I decided against it because the new module would require a client jar and it'd require moving some more things around. I think if we really want to do it, we should do it as a followup. I did, on the other hand, create an "example" rescore plugin which should both be a nice example for anyone wanting to plug in their own rescore implementation and servers as a good integration test to make sure that you can indeed plug one in. Closes #26208	2017-08-25 13:46:57 -04:00
Jim Ferenczi	74cd32942a	Handle leniency for phrase query on a field indexed without positions (#26388 ) This change rewrite phrase query built on a field indexed without positions to match_no_docs query when the `lenient` option is set to true. This change affects all full text queries.	2017-08-25 16:41:01 +02:00
Yannick Welsch	0390c76f0a	Remove reinitShadowPrimary (#26349 ) With shadow replicas gone, there is no need to have this method anymore.	2017-08-25 10:37:51 +09:30
Tim Brooks	0551d2ff68	Move generic http settings out of netty module (#26310 ) There is a group of five settings relating to raw tcp configurations (no_delay, buffer sizes, etc) that we have for the http transport. These currently live in the netty module. As they are unrelated to netty specifically, this commit moves these settings to the `HttpTransportSettings` class in core.	2017-08-24 19:27:56 -05:00
Ryan Ernst	5202e7e93b	Settings: Move keystore creation to plugin installation (#26329 ) This commit removes the keystore creation on elasticsearch startup, and instead adds a plugin property which indicates the plugin needs the keystore to exist. It does still make sure the keystore.seed exists on ES startup, but through an "upgrade" method that loading the keystore in Bootstrap calls. closes #26309	2017-08-24 12:12:47 -07:00
Jay Modi	7fb716daab	Resync replication action should be internal (#26345 ) This commit renames the TransportResyncReplicationAction name to be an internal action as this is not an action that should be invoked by a user, but is instead internal to the operation of the system.	2017-08-24 11:04:30 -06:00
Colin Goodheart-Smithe	c8ca015c0b	Check bucket metric ages point to a multi bucket agg (#26215 ) * Check bucket metric ages point to a multi bucket agg This adds a validation step to the BucketMetricsPipelineAggregationBuilder which ensure that the first aggregation in the `buckets_path` is a multi-bucket aggregation. It does this using a new `MultiBucketAggregationBuilder` marker interface. The change also moves the validate of pipeline aggregations to the `AggregatorFactories.build()` method so the validate can inspect sibling `AggregatorBuilder` objects rather than `AggregatorFactory` objects. Further it removes the validate from `AggregatorFactory` since this was never implemented and since aggregators only depend on their own internal state and not on other aggregators they should be validated ideally at setter time but in rare case where this is not possible the validation should be done in the `AggregationBuilder.build()` step. Closes #25775 Move validate stage to happen during AggregatorFactories.Builder.build Also removes validate method from normal aggs since it was never used. * review comment fix	2017-08-24 12:05:03 +01:00
Jim Ferenczi	c1ba860b71	#26320 : Reset default setting after test	2017-08-23 16:05:52 +02:00
Jim Ferenczi	de1e4e0c15	Accept an array of field names and boosts in the index.query.default_field setting (#26320 ) * Accept an array of field names and boosts in the index.query.default_field setting This commit allows to define an array of field names and boosts for the index setting `index.query.default_field`. The format is equivalent to the `fields` options of the full text search queries (e.g. field_name^boost). This commit also makes this setting dynamically updatable. Fixes #25946	2017-08-23 15:39:54 +02:00
Colin Goodheart-Smithe	c3cc8262a7	Migrates more ToXContentClasses (#26321 ) * More XContent migrations * Removes ToXContentToBytes * Adds toString to classes that used to extend ToXContentToBytes * use XContentHelper * more review comments * prettify tostring output	2017-08-23 08:17:32 +01:00
Jim Ferenczi	8b8c06398e	remove Lucene class copies that are not needed anymore (#26325 )	2017-08-23 09:02:00 +02:00
Yannick Welsch	4b813adf52	[TEST] Account for relocating primary in SearchWhileCreatingIndexIT The test verifies that search on the primary works by executing a search with preference _primary. If the primary is relocating, however, it does not take the primary relocation target into account. The test only makes sense, however, if balancing is not happening yet, i.e., the cluster is not green.	2017-08-23 14:23:14 +09:30
Yannick Welsch	73dff6d21f	Add workaround for Javadoc generation issues on JDK 9 b181 The javadoc tool on JDK 9 has issues with the combination of anonymous classes and varargs parameters. This commit simply refactors a few anonymous classes to private inner classes.	2017-08-23 10:15:01 +09:30
Tal Levy	6ab4b6b0ac	revamp TransportRequest handlers to support Writeable (#26315 ) This PR begins the long journey to deprecating Streamable. The idea here is to add additional method signatures that support Writeable.Reader, so that the work to migrate objects TransportMessage to implement Writeable and not Streamable. One example conversion is done in this PR: SimulatePipelineRequest.	2017-08-22 15:47:05 -07:00
Jim Ferenczi	4756c9a884	Fix nested query highlighting (#26305 ) This commit extracts the inner query in the ESToParentBlockJoinQuery for highlighting. This query has been added in 5.4 and breaks plain highlighting on nested queries. Highlighters that use postings or term vectors are not affected because they can't highlight nested documents correctly. Fixes #26230	2017-08-22 11:36:45 +02:00
Yannick Welsch	3d8feff66e	Use Java 9 FilePermission model (#26302 ) This commit makes the security code aware of the Java 9 FilePermission changes (see #21534) and allows us to remove the `jdk.io.permissionsUseCanonicalPath` system property.	2017-08-22 11:22:00 +09:30
Andy Bristol	bdefcbdcd6	reroute API: log messages from commands (#25955 ) Gives allocation commands from the cluster reroute API the ability to provide messages to be logged once the cluster state change has been committed. The purpose of this change is to create a record in the logs when allocation commands which could potentially be destructive are applied. The allocate_empty_primary and allocate_stale_primary commands are the only ones that currently provide log messages. Closes #22821	2017-08-21 17:09:40 -07:00
Jim Ferenczi	a48616272f	#26173 : Removed global_ordinals_hash and global_ordinals_low_cardinality exeuction hint deprecated in 6.1	2017-08-21 20:44:34 +02:00
Jim Ferenczi	977dcfe789	Deprecate global_ordinals_hash and global_ordinals_low_cardinality (#26173 ) * Deprecate global_ordinals_hash and global_ordinals_low_cardinality This change deprecates the `global_ordinals_hash` and `global_ordinals_low_cardinality` and makes the `global_ordinals` execution hint choose internally if global ords should be remapped or use the segment ord directly. These hints are too sensitive and expert to be exposed and we should be able to take the right decision internally based on the agg tree.	2017-08-21 19:12:27 +02:00
Christoph Büscher	5dae277bb2	Support distance units in GeoHashGrid aggregation precision (#26291 ) Currently the `precision` parameter must be a precision level in the range of [1,12]. In #5042 it was suggested also supporting distance units like "1km" to automatically approcimate the needed precision level. This change adds this support to the Rest API by making use of GeoUtils#geoHashLevelsForPrecision. Plain integer values without a unit are still treated as precision levels like before. Distance values that are too small to be represented by a precision level of 12 (values approx. less than 0.056m) are rejected. Closes #5042	2017-08-21 17:29:28 +02:00
Christoph Büscher	4ff12c9a0b	Throw exception in scroll requests using `from` (#26235 ) The `from` search parameter cannot really be used in scrolled searches. This commit adds a check for this case to the SearchRequest#validate() method so we can reported it as an error rather than silently ignoring it. Closes #9373	2017-08-21 15:12:34 +02:00
Boaz Leskes	181e881a0f	enable testIssue8226 The linked issue has been long closed	2017-08-21 14:33:04 +02:00
Jim Ferenczi	8fd71a5d6d	#26145 Fix test expectation with MatchNoDocsQuery	2017-08-21 14:17:43 +02:00
Jim Ferenczi	4bce727165	Refactor simple_query_string to handle text part like multi_match and query_string (#26145 ) This change is a continuation of #25726 that aligns field expansions for the simple_query_string with the query_string and multi_match query. The main changes are: * For exact field name, the new behavior is to rewrite to a matchnodocs query when the field name is not found in the mapping. * For partial field names (with * suffix), the expansion is done only on keyword, text, date, ip and number field types. Other field types are simply ignored. * For all fields (), the expansion is done on accepted field types only (see above) and metadata fields are also filtered. The use_all_fields option is deprecated in this change and can be replaced by setting `` in the fields parameter. This commit also changes how text fields are analyzed. Previously the default search analyzer (or the provided analyzer) was used to analyze every text part , ignoring the analyzer set on the field in the mapping. With this change, the field analyzer is used instead unless an analyzer has been forced in the parameter of the query. Finally now that all full text queries can handle the special "" expansion (`all_fields` mode), the `index.query.default_field` is now set to `` for indices created in 6.	2017-08-21 13:12:27 +02:00
Sergey Galkin	9a3216dfee	Stricter validation for min/max values for whole numbers (#26137 )	2017-08-21 12:16:45 +02:00
Antonio Matarrese	93cc2d0372	Configurable distance limit with the AUTO fuzziness. (#25731 ) Make the distance thresholds configurable with the AUTO fuzziness.	2017-08-21 11:00:20 +02:00
Ryan Ernst	96b0d3e0cc	Script: Convert script query to a dedicated script context (#26003 ) This commit converts script query to use a new FilterScript context. The new context returns a boolean, so the error that would have previously happened at runtime if a non boolean was returned would now happen at script compilation. Also, the leniency of supporting returning a number and 0 mapping to false, non-zero to true is gone, but it was never documented. With the new context compilation will now also fail if special variables are used at compilation time, instead of runtime, eg ctx.	2017-08-18 15:18:35 -07:00
Tim Brooks	5d7a78fcdb	Use PlainListenableActionFuture for CloseFuture (#26242 ) Right now we use a custom future for the CloseFuture associated with a channel. This is because we need special unwrapping logic to ensure that exceptions from a future failure are a certain type (opposed to an UncategorizedException). However, the current version is limiting because we can only attach one listener. This commit changes the CloseFuture to extend the PlainListenableActionFuture. This change allows us to attach multiple listeners.	2017-08-18 13:38:38 -05:00

1 2 3 4 5 ...

8772 Commits