OpenSearch

Commit Graph

Author	SHA1	Message	Date
shaie	8fd3637891	Return correct term statistics when a field is not found in a shard (#21922 ) If you ask for the term vectors of an artificial document with term_statistics=true, but a shard does not have any terms of the doc's field(s), it returns the doc's term vectors values as the shard-level term statistics. This commit fixes that to return 0 for `ttf` and also field-level aggregated statistics. Closes #21906	2016-12-02 08:14:45 +01:00
Simon Willnauer	adf9bd90a4	Remove legacy BWC test infrastructure and tests (#21915 ) We don't use the test infra nor do we run the tests. They might all be entirely out of date. We also have a different BWC test infra in-place. This change removes all of the legacy infra.	2016-12-02 08:06:20 +01:00
makeyang	3f1d7be07a	Refactor shard limit allocation decider This commit simplifies the shard limit allocation decider, removing some duplicated code into a common method. Relates #21845	2016-12-01 21:27:02 -05:00
Ryan Ernst	a6ad89bee0	Mappings: Fix get mapping when no indexes exist to not fail in response generation (#21924 ) When there are no indexes, get mapping has a series of special cases. Two of those expect the response object already started, and the other two respond with an exception. Those two cases (types passed in but no indexes and vice versa) would fail in their error response generation because it did not expect an object to already be started in the json generator. This change moves the object start to where it is needed for the empty responses. closes #21916	2016-12-01 16:57:12 -08:00
Simon Willnauer	6522538033	Add validation for supported index version on node join, restore, upgrade & open index (#21830 ) Today we can easily join a cluster that holds an index we don't support since we currently allow rolling upgrades from 5.x to 6.x. Along the same lines we don't check if we can support an index based on the nodes in the cluster when we open, restore or metadata-upgrade and index. This commit adds additional safety that fails cluster state validation, open, restore and /or upgrade if there is an open index with an incompatible index version created in the cluster. Realtes to #21670	2016-12-01 15:40:35 +01:00
Simon Willnauer	155de53fe3	Add a connect timeout to the ConnectionProfile to allow per node connect timeouts (#21847 ) Timeouts are global today across all connections this commit allows to specify a connection timeout per node such that depending on the context connections can be established with different timeouts. Relates to #19719	2016-12-01 15:39:49 +01:00
Boaz Leskes	92fa9149f3	rename more before() methods that now conflict with ESTestCase	2016-12-01 13:40:27 +01:00
Simon Willnauer	dd5256c324	Reduce number of connections per node depending on the nodes role (#21849 ) We currently treat every node equally when we establish connections to a node. Yet, if we are not master eligible or can't hold any data there is no point in creating a dedicated connection for sending the cluster state or running remote recoveries respectively. The usage of STATE and RECOVERY connections on non-master and/or non-data nodes will result in an IllegalStateException.	2016-12-01 08:00:48 +01:00
Jim Ferenczi	fc9b63877e	Handle specialized term queries in MappedFieldType.extractTerm(TermQuery) (#21889 ) For some fields we have a specialized implementation of a TermQuery that is specific for the field. When these kind of fields are used in a wildcard query or a span term query it fails with an exception because they don't recognize the specialized form. The impacted fields are [_all] and [_type] and the impacted queries are [span_term] and [wilcard]. This change handles these forms and correctly extracts the term inside them for further use. Fixes #21882	2016-11-30 23:11:38 +01:00
Jason Tedor	92f05e796e	Remove traces during connect with handshake This commit removes two trace logging statements during connection with handshake as they are just clutter.	2016-11-30 15:29:33 -05:00
Jason Tedor	761325bf94	Throw exception on ping from another cluster When we receive a ping from another cluster, we should throw an exception so as to not leak the channel.	2016-11-30 15:28:56 -05:00
Jason Tedor	c90ba67abb	Do not reply to pings from another cluster Today when sending responses to discovery pings, we unconditionally reply. Instead, this commit modifies the response handler to not reply when the cluster names do not match. This addresses a race condition identified after reducing the timeout in UnicastZenPingTests#testSimplePings. In particular, we send pings in the following way: - if not connected to the node, connect to the node and after successful handshake, send a ping - if connected to the node, send a ping When the ping timeout is set low, a subsequent batch of pings can race against a connect/disconnect cycle from a prior batch of pings. In particular, consider the following scenario: - node A from cluster X - node B from cluster Y - pings are initiated from node A with node B in the hosts list - node A will try to connect and handshake with B - the connection will succeed, and the handshake will eventually fail due to mismatched cluster names - on a short timeout, a second batch of pings will fire, and on this batch node A will see that it is still connected to node B; thus, it will immediately fire a ping to node B and node B will dutifully respond Relates #21894	2016-11-30 15:09:42 -05:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	117944093e	Remove testing of 2.x indices in DecayFunctionScoreIT. Such old indices will not be supported in 6.0.	2016-11-30 17:16:13 +01:00
Jason Tedor	6c45695d52	Add version 5.1.1 This commit removes the version constant for 5.1.0 (due to an inadvertent release) and adds the version constant for 5.1.1. Relates #21890	2016-11-30 11:14:17 -05:00
Adrien Grand	f5ac27a20d	Fix TermsQueryBuilderTests expectations.	2016-11-30 17:07:53 +01:00
Adrien Grand	c5b9c98b99	Remove the `default` store type. (#21616 ) It used to be a hybrid store between `niofs` and `mmapfs`, which we removed when we switched to `fs` by default (which is `mmapfs` on 64-bits systems).	2016-11-30 15:33:26 +01:00
Adrien Grand	90ab477f19	The `terms` query should always map to a Lucene `TermsQuery`. (#21786 ) Currently, the `terms` query is just syctactic sugar for a `bool` query when used in a query context. This change proposes to always generate the same query in query and filter contexts, which is less confusing.	2016-11-30 15:29:09 +01:00
Luca Cavanna	5b8bdba12e	Remove subrequests method from CompositeIndicesRequest (#21873 )	2016-11-30 15:03:58 +01:00
Matt Weber	1e722c060b	Remove forked XRollingBuffer and XQueryBuilder. (#21866 ) Remove the forked versions now that we are on lucene-6.4.0-snapshot.	2016-11-30 13:45:54 +01:00
Adrien Grand	a3ef674992	Reduce memory pressure when sending large terms queries. (#21776 ) When users send large `terms` query to Elasticsearch, every value is stored in an object. This change does not reduce the amount of created objects, but makes sure these objects die young by optimizing the list storage in case all values are either non-null instances of Long objects or BytesRef objects, which seems to help the JVM significantly.	2016-11-30 13:35:56 +01:00
Adrien Grand	6231009a8f	Remove 2.x backward compatibility of mappings. (#21670 ) For the record, I also had to remove the geo-hash cell and geo-distance range queries to make the code compile. These queries already throw an exception in all cases with 5.x indices, so that does not hurt any more. I also had to rename all 2.x bwc indices from `index-${version}` to `unsupported-${version}` to make `OldIndexBackwardCompatibilityIT` happy.	2016-11-30 13:34:46 +01:00
Jason Tedor	072007c759	Speed up UnicastZenPingTests These tests using ping timeouts on the order of seconds, but this is unnecessary since all the sockets are within the same JVM it really should not take that long. Relates #21874	2016-11-29 23:27:25 -05:00
Jason Tedor	b6ba4ae34b	Add version 5.0.3 This commit adds version 5.0.3 and the BWC indices for version 5.0.2. Relates #21867	2016-11-29 18:34:55 -05:00
Jay Modi	404b42ee95	DiscoveryNode and TransportAddress should preserve host information In some cases, such as the creation of DiscoveryNode instances for unicast ping requests, the host information was not being populated properly and instead the address string was being used. Additionally, when serializing a DiscoveryNode and in turn a transport address, the host was not being set on the InetAddress when deserializing the object, so even if the address was created from a hostname, the address in the deserialized instance had no knowledge of the hostname that was originally used.	2016-11-29 16:18:08 -05:00
Luca Cavanna	6eaff9432d	SearchTemplateRequest to implement CompositeIndicesRequest (#21865 ) SearchTemplateRequest to implement CompositeIndicesRequest Given that SearchTemplateRequest effectively delegates to search when a search is being executed, it should implement the CompositeIndicesRequest interface. The subrequests method should return a single search request. When a search is not going to be executed, because we are in simulate mode, there are no inner requests, and there are no corresponding indices to that request either. Closes #21747	2016-11-29 20:52:43 +01:00
Boaz Leskes	be4074e13d	improve debug logging when node waits for initial cluster state And enabled debug logging in InternalTestClusterTests so we can see it.	2016-11-29 20:38:19 +01:00
Luca Cavanna	f253621feb	Remove deprecated query names: in, geo_bbox, mlt, fuzzy_match and match_fuzzy (#21852 ) These query names were all deprecated in 5.0.0: - in is removed in favour of terms - geo_bbox is removed in favour of geo_bounding_box - mlt is removed in favour of more_like_this - fuzzy_match and match_fuzzy are removed in favour of match	2016-11-29 19:07:01 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Nicholas Knize	af1ab68b64	Add RangeFieldMapper for numeric and date range types Lucene 6.2 added index and query support for numeric ranges. This commit adds a new RangeFieldMapper for indexing numeric (int, long, float, double) and date ranges and creating appropriate range and term queries. The design is similar to NumericFieldMapper in that it uses a RangeType enumerator for implementing the logic specific to each type. The following range types are supported by this field mapper: int_range, float_range, long_range, double_range, date_range. Lucene does not provide a DocValue field specific to RangeField types so the RangeFieldMapper implements a CustomRangeDocValuesField for handling doc value support. When executing a Range query over a Range field, the RangeQueryBuilder has been enhanced to accept a new relation parameter for defining the type of query as one of: WITHIN, CONTAINS, INTERSECTS. This provides support for finding all ranges that are related to a specific range in a desired way. As with other spatial queries, DISJOINT can be achieved as a MUST_NOT of an INTERSECTS query.	2016-11-29 10:10:14 -06:00
Simon Willnauer	f5ff69fabe	Remove connectToNodeLight and replace it with a connection profile (#21799 ) The Transport#connectToNodeLight concepts is confusing and not very flexible. neither really testable on a unittest level. This commit cleans up the code used to connect to nodes and simplifies transport implementations to share more code. This also allows to connect to nodes with custom profiles if needed, for instance future improvements can be added to connect to/from nodes that are non-data nodes without dedicated bulks and recovery connections.	2016-11-29 09:35:07 +01:00
Ali Beyad	a884573898	[TEST] fixes FilterAllocationDecider test for decision explanation when the initial recovery is LOCAL_SHARDS	2016-11-28 20:37:19 -05:00
Ali Beyad	07bd0a30f0	Improves allocation decider decision explanation messages (#21771 ) This commit improves the decision explanation messages, particularly for NO decisions, in the various AllocationDecider implementations by including the setting(s) in the explanation message that led to the decision. This commit also returns a THROTTLE decision instead of a NO decision when the concurrent rebalances limit has been reached in ConcurrentRebalanceAllocationDecider, because it more accurately reflects a temporary throttling that will turn into a YES decision once the number of concurrent rebalances lessens, as opposed to a more permanent NO decision (e.g. due to filtering).	2016-11-28 20:23:16 -05:00
Matt Weber	04e07bcdb6	Synonym Graph Support (LUCENE-6664) (#21517 ) Integrate the patch from LUCENE-6664 into elasticsearch and add support for handling a graph token stream in match/multi-match queries. This fixes longstanding bugs with multi-token synonyms returning incorrect results with proximity queries.	2016-11-28 09:25:49 -08:00
Jim Ferenczi	8affb7c845	Fix FiltersFunctionScoreQuery highlighting (#21827 ) This is a cleanup of the fix pushed in https://github.com/elastic/elasticsearch/pull/20400. FiltersFunctionScoreQuery sub query should be extracted in CustomQueryScorer.extract (and not in CustomQueryScorer.extractUnknownQuery). This does not fix any bug in this branch (it's just a cleanup) but the intent is first to clean up and then to backport in 2.x where there is a real bug. The bug is in 2.x only because the backport of https://github.com/elastic/elasticsearch/pull/20400 in 2.x mistakenly renamed the FiltersFunctionScoreQuery to FunctionScoreQuery. This leads to incorrect highlighting on FiltersFunctionScoreQuery in 2.x.	2016-11-28 17:56:24 +01:00
Nik Everett	145d0813b5	Log ScriptException's xcontent if file script compilation fails (#21767 ) When a file script fails to compile, rather than logging the exception that caused the failure this logs the xcontent of that exception. This is both shorter and has the script stack which is useful for figuring out why the compilation failed. Still logs the entire stacktrace at debug level just in case you need it. Relates to #21733	2016-11-28 11:36:06 -05:00
Ali Beyad	db7362da67	Fixes shard level snapshot metadata loading when index-N file is missing (#21813 ) In making changes for the 5.0 version of snapshots, a bug was introduced where if an index-N file could not be found for an individual shard, the backup was to iterate over all snap-.dat files in the shard folder to know which snapshots contain that shard's data, but in 5.0, reading the snap-.dat files as backup was incorrectly passing in the blob name for the snap-.dat file, thereby failing to load all index files for a given snapshot when the index-N file is missing. This condition should be rare as there is no reason an index-N file should be absent (unless it was deleted or there was corruption reading the file), but nevertheless, this situation can be encountered and this commit fixes the bug by reading the correct snap-.dat blob name in the shard data folder.	2016-11-28 10:46:33 -05:00
Simon Willnauer	b7292a6005	Remove TcpTransport#addressSupported since TransportAddress is now final TransportAddress used to be customizable per transport but this has been removed a while ago. Therefore we can remove all usage of this method as well. Relates to #20695	2016-11-28 16:06:59 +01:00
Jim Ferenczi	69f35aa07f	Fix cross_fields type on multi_match query with synonyms (#21638 ) * Fix cross_fields type on multi_match query with synonyms This change fixes the cross_fields type of the multi_match query when synonyms are involved. Since 2.x the Lucene query parser creates SynonymQuery for words that appear at the same position. For simple term query the CrossFieldsQueryBuilder expands the term to all requested fields and creates a BlendedTermQuery. This change adds the same mechanism for SynonymQuery which otherwise are not expanded to all requested fields. As a side note I wonder if we should not replace the BlendedTermQuery with the SynonymQuery. They have the same purpose and behave similarly. Fixes #21633 * Fallback to SynonymQuery for blended terms on a single field	2016-11-28 14:14:01 +01:00
Yannick Welsch	7e198f0e41	Detect nodes being blocked by GC-disrupted node (#21797 ) The disruption type LongGCDisruption simulates GCs on a node by suspending all the threads of that node. If the suspended threads are in a code section with shared JVM locks, however, it can prevent the other nodes from doing their thing. The class LongGCDisruption has a list of class names for which we know that this can occur. Whenever a test using the GC disruption type fails in mysterious ways, it becomes a long guessing game to find the offending class. This commit adds code to LongGCDisruption to automatically detect these situations, fail the test early and report the offending class and all relevant context.	2016-11-28 11:24:25 +01:00
Adrien Grand	243a788289	Fail to index fields with dots in field names when one of the intermediate objects is nested. (#21787 ) Closes #21726	2016-11-28 09:57:32 +01:00
Clinton Gormley	c1fa80d40f	Log failure to connect to node at info instead of debug (#21809 ) Closes #6468	2016-11-26 13:18:26 +01:00
Ali Beyad	efba64d60a	Removing unused AllocationExplanation class (#21805 ) This commit removes the unused AllocationExplanation class. The RoutingAllocation class only created an empty instance of it and never used it anywhere else. The allocation explanations will be encompassed in the various decision classes exposed via the cluster allocation explain API. Therefore, there is no reason to keep the AllocationExplanation class.	2016-11-25 12:18:23 -05:00
Luca Cavanna	720b165350	Search shards to print out aliases array together with alias filter (#21784 ) With #21738 we added an indices section to the search shards api, that will return the concrete indices hit by the request, and eventually the corresponding alias filter. The java API returns the AliasFilter object, which holds the filter itself and an array of aliases that pointed to the index in the original request. The REST layer doesn't print out the aliases array though. This commit adds the aliases array as well and tests for this.	2016-11-25 10:58:06 +01:00
Simon Willnauer	9809760eb0	Fix settings diff generation for affix, list and group settings (#21788 ) Group, List and Affix settings generate a bogus diff that turns the actual diff into a string containing a json structure for instance: ``` "action" : { "search" : { "remote" : { "" : "{\"my_remote_cluster\":\"[::1]:60378\"}" } } } ``` which make reading the setting impossible. This happens for instance if a group or affix setting is rendered via `_cluster/settings?include_defaults=true` This change fixes the issue as well as several minor issues with affix settings that where not accepted as valid setting today.	2016-11-24 21:53:04 +01:00
Simon Willnauer	72ef6fa0d7	Handle spaces in `action.auto_create_index` gracefully (#21790 ) Today if a comma-separated list is passed to action.auto_create_index leading and trailing whitespaces are not trimmed but since the values are index expressions whitespaces should be removed for convenience. Closes #21449	2016-11-24 21:43:58 +01:00
markharwood	aa60e5cc07	Aggregations - support for partitioning set of terms used in aggregations so that multiple requests can be done without trying to compute everything in one request. Closes #21487	2016-11-24 15:10:46 +00:00
Luca Cavanna	ac2aa56350	Cluster search shards improvements: expose ShardId, adjust visibility of some members (#21752 ) * ClusterSearchShardsGroup to return ShardId rather than the int shard id This allows more info to be retrieved, like the index uuid which is exposed through the ShardId object but was not available before * Make ClusterSearchShardsResponse empty constructor public This allows to receive such responses when sending ClusterSearchShardsRequests directly through TransportService (not using ClusterSearchShardsAction via Client), otherwise an empty response cannot be created unless the class that does it is in org.elasticsearch.action, admin.cluster.shards package * adjust visibility of ClusterSearchShards members	2016-11-24 09:46:57 +01:00
Luca Cavanna	d8c934a7fa	Use index uuid as key in the alias filter map rather than the index name (#21749 ) The index uuid is unique across multiple clusters, while the index name is not. Using the index uuid to look up filters in the alias filters map is better and will be needed for multi cluster search.	2016-11-24 09:43:42 +01:00
Luca Cavanna	6a16a60c7e	Remove unused assignedReplicasIncludingRelocating from ShardsIterator interface (#21687 )	2016-11-23 22:25:51 +01:00

1 2 3 4 5 ...

6940 Commits