OpenSearch

Commit Graph

Author	SHA1	Message	Date
javanna	4c726311e2	[TEST] introduce test plugin to inject random search ext elements in search request tests A few of our unit tests generate a random search request body and run tests against it. The source can optionally contain ext elements under the ext sections, which can be parsed by plugins. With this commit we introduce a plugin so that the tests don't use the one from FetchSubPhasePluginIT anymore. They rather generate multiple search ext elements. The plugin can parse and deal with all those. This extends the test coverage as we may have multiple elements with random names. Took the chance to introduce a common test base class for search requests, called AbstractSearchTestCase, given that the setup phase is the same for all three tests around search source. Then we can have the setup isolated to the base class and the subclasses relying on it. Closes #17685	2016-09-16 21:03:27 +02:00
Luca Cavanna	629e2b2aff	Throw error if query element doesn't end with END_OBJECT (#20528 ) * Throw error if query element doesn't end with END_OBJECT Followup to #20515 where we added validation that after we parse a query within a query element, we should not get a field name. Truth is that the only token allowed at that point is END_OBJECT, as our DSL allows only one single query within the query object: ``` { "query" : { "term" : { "field" : "value" } } } ``` We can then check that after parsing of the query we have an end_object that closes the query itself (which we already do). Following that we can check that the query object is immediately closed, as there are no other tokens that can be present in that position. Relates to #20515	2016-09-16 21:02:12 +02:00
Nik Everett	697adfb3c4	Fix up tasks integ test I'd made some mistakes that hadn't caused the test to fail but did slow it down and partially invalidate some of the assertions. This fixes those mistakes.	2016-09-16 12:18:41 -04:00
Jim Ferenczi	d0f4bc16ca	Fix FieldStats deserialization of `ip` field (#20522 ) * Fix FieldStats deserialization of `ip` field Add missing readBytes in `ip` field deserialization Add (de)serialization tests for all types This change also removes the ability to set FieldStats.minValue or FieldStats.maxValue to null. This is not required anymore since the stats are built on fields with values only. Fixes #20516	2016-09-16 16:15:46 +02:00
Nicholas Knize	01a6b7c408	[TEST] Refactor Geo test names to follow naming and style convention This commit closes a stale issue where GeoJson parsing tests used a combination of underscore and camel case. closes #8998	2016-09-16 09:13:38 -05:00
Simon Willnauer	3aabda6752	Ensure elasticsearch doesn't start with unsupported indices (#20514 ) If an index was created with pre 2.0 we should not treat it as supported even if all segments have been upgraded to a supported lucene version. Closes #20512	2016-09-16 16:11:53 +02:00
Simon Willnauer	f5daa165f1	Remove ability to plug-in TransportService (#20505 ) TransportService is such a central part of the core server, replacing its implementation is risky and can cause serious issues. This change removes the ability to plug in TransportService but allows registering a TransportInterceptor that enables plugins to intercept requests on both the sender and the receiver ends. This is a commonly used and overwritten functionality but encapsulates the custom code in a contained manner.	2016-09-16 09:47:53 +02:00
Boaz Leskes	577dcb3237	Add current cluster state version to zen pings and use them in master election (#20384 ) During a networking partition, cluster state updates (like mapping changes or shard assignments) are committed if a majority of the master nodes received the update correctly. This means that the current master has access to enough nodes in the cluster to continue to operate correctly. When the network partition heals, the isolated nodes catch up with the current state and get the changes they couldn't receive before. However, if a second partition happens while the cluster is still recovering from the previous one and the old master is put in the minority side, it may be that a new master is elected which did not yet catch up. If that happens, cluster state updates can be lost. This commit fixed 95% of this rare problem by adding the current cluster state version to `PingResponse` and use them when deciding which master to join (and thus casting the node's vote). Note: this doesn't fully mitigate the problem as a cluster state update which is issued concurrently with a network partition can be lost if the partition prevents the commit message (part of the two phased commit of cluster state updates) from reaching any single node in the majority side and the partition does allow for the master to acknowledge the change. We are working on a more comprehensive fix but that requires considerable work and is targeted at 6.0.	2016-09-15 23:39:11 +02:00
Simon Willnauer	1e2ef192c9	[TEST] Reduce the number of docs per indexRandom in FieldSortIT#testIssue6614	2016-09-15 22:56:26 +02:00
Areek Zillur	8c12b7c3b6	Merge pull request #20515 from areek/fix/query_validation Fix silently accepting malformed queries	2016-09-15 16:53:17 -04:00
Areek Zillur	bdad62a2f8	Fix silently accepting malformed queries Currently, we silently accept malformed query where more than one key is defined at the top-level for query object. If all the keys have a valid query body, only the last query is executed, besides throwing off parsing for additional suggest, aggregation or highlighting defined in the search request. This commit throws a parsing exception when we encounter a query with multiple keys. closes #20500	2016-09-15 16:32:28 -04:00
Boaz Leskes	2233d48235	add a reduced TCP_CONNECT_TIMEOUT setting to DiscoveryWithServiceDisruptionsIT The default of 30s causes some tests to timeout when running ensureGreen and similar. This is because network delays simulation blocks connect until either the connect timeout expires or the disruption configured time stops. We do not immediately connect when the disruption is stopped.	2016-09-15 21:50:24 +02:00
Areek Zillur	16ed2fb423	[TEST] ensure context filtering with valid utf-8 characters work with completion suggester	2016-09-15 14:38:52 -04:00
Luca Cavanna	37489c3274	Add clusterUUID to RestMainAction output (#20503 ) Add clusterUUID to RestMainAction output GET / now returns the clusterUUID as well as part of its output for monitoring purposes	2016-09-15 16:25:17 +02:00
gfyoung	b03c807368	Rename service.bat to elasticsearch-service.bat (#20496 ) Closes gh-17528.	2016-09-15 15:53:44 +02:00
Jason Tedor	7132fcd7ac	Give useful error message if log config is missing Today when starting Elasticsearch without a Log4j 2 configuration file, we end up throwing an array index out of bounds exception. This is because we are passing no configuration files to Log4j. Instead, we should throw a useful error message to the user. This commit modifies the Log4j configuration setup to throw a user exception if no Log4j configuration files are present in the config directory. Relates #20493	2016-09-15 07:44:05 -04:00
Boaz Leskes	a5f03b4bc5	testCanNotPublishWithoutMinMastNodes could time out if disruption hit just before cluster was fully formed The test uses a NetworkDelay that drops requests and slows down connecting. Next to that it disables node fault detection to make sure nodes are not removed before we check our publishing. Sadly that can lead to huge slow downs if the disruption hits while a node is still pinging (and tries to connect, which is slowed down). Instead we can start the disruption on the cluster state thread, making sure the result of fault detection won't be processed before we publish	2016-09-15 08:50:58 +02:00
Lee Hinman	d9f83a6d74	Change 'norelease' in MappedFieldType to 'TODO' This should not block any sort of release, it is only to be more protective towards fields not having any settings changed.	2016-09-14 14:37:02 -06:00
Lee Hinman	1e39a12fb0	Merge remote-tracking branch 'dakrone/remove-idx-shard-count-norelease'	2016-09-14 14:10:49 -06:00
Lee Hinman	e462edc6d6	Validate index settings differently when registering index template This was actually a byproduct of trying to remove a //norelease for index shard setting validation in MetaDataIndexService. This //norelease is now removed. Previously this check was only used by the template service, so we validated twice, once in the Settings infrastructure and once when actually creating the index. We now instead use the Settings infrastructure to validate the settings for shard count.	2016-09-14 13:25:34 -06:00
Simon Willnauer	17ddee7011	Remove TransportService#registerRequestHandler leniency (#20469 ) `TransportService#registerRequestHandler` allowed to register handlers more than once and issues an annoying warn log message when this happens. This change simply throws an exception to prevent registering the same handler more than once. This commit also removes the ability to remove request handlers. Relates to #20468	2016-09-14 20:32:29 +02:00
javanna	e7e27d9ffa	[TEST] remove unused currentTypes variable from SearchSourceBuilderTests	2016-09-14 19:33:23 +02:00
Simon Willnauer	bb6e7eeb7a	[TEST] Don't use transport client if we are blocking internal actions we might run into disconnects	2016-09-14 17:50:14 +02:00
Simon Willnauer	b35f7446ce	Remove unused imports	2016-09-14 17:44:45 +02:00
Simon Willnauer	d402ca0dd7	Remove poor-mans compression in InternalSearchHit and friends (#20472 ) We still use some crazy poor mans compression in InternalSearchHit that uses a thread local and an unordered map as a lookup table if requested. Stuff like this should be handled by compression on the transport layer rather than in-line in the serialization code. This code is complex enough.	2016-09-14 15:25:25 +02:00
Simon Willnauer	c1e84618a6	Only try to read new segments info if we really flushed the index (#20474 ) There is no reason to read the current segments info unless we flushed / committed the lucene index.	2016-09-14 15:23:17 +02:00
Boaz Leskes	74fc074e5e	fix styling	2016-09-14 10:52:10 +02:00
Simon Willnauer	a1cd6be777	Don't register SearchTransportService handlers more than once (#20468 ) This utility class is used in 3 places while we only need to register the handlers once per node. Otherwise we will see nasty `WARN` logs like: `registered two transport handlers for action indices:data/read/search[phase/fetch/id/scroll]...` This change will only register handlers inside the main TransportSearchAction.	2016-09-14 10:34:40 +02:00
Simon Willnauer	89640965d2	Unguice SearchModule (#20456 ) After this change SearchModule doesn't subclass AbstractModule anymore and all wiring happens in `Node.java`. As a side-effect several tests don't need a guice injector anymore.	2016-09-14 10:07:53 +02:00
Jason Tedor	7560101ec7	Complete Elasticsearch logger names This commit modifies the logger names within Elasticsearch to be the fully-qualified class name as opposed to removing the org.elasticsearch prefix and dropping the class name. This change separates the root logger from the Elasticsearch loggers (they were equated from the removal of the org.elasticsearch prefix) and enables log levels to be set at the class level (instead of the package level). Relates #20457	2016-09-13 22:46:54 -04:00
Jason Tedor	0eff7daf5b	Fix logging hierarchy configs Today when setting the logging level via the command-line or an API call, the expectation is that the logging level should trickle down the hierarchy to descendant loggers. However, this is not necessarily the case. For example, if loggers x and x.y are already configured then setting the logging level on x will not descend to x.y. This is because the logging config for x.y has already been forked from the logging config for x. Therefore, we must explicitly descend the hierarchy when setting the logging level and that is what this commit does. Relates #20463	2016-09-13 22:46:14 -04:00
Ali Beyad	4431720c3d	File-based discovery plugin (#20394 ) This commit introduces a new plugin for file-based unicast hosts discovery. This allows specifying the unicast hosts participating in discovery through a `unicast_hosts.txt` file located in the `config/discovery-file` directory. The plugin will use the hosts specified in this file as the set of hosts to ping during discovery. The format of the `unicast_hosts.txt` file is to have one host/port entry per line. The hosts file is read and parsed every time discovery makes ping requests, thus a new version of the file that is published to the config directory will automatically be picked up. Closes #20323	2016-09-13 20:52:39 -04:00
Nicholas Knize	87b06c75b0	[TEST] Fix geo_point backcompat tests This commit fixes the following geo_point bwc tests: * GeoDistanceIT to test deprecated GeoDistanceRangeQuery on legacy indexes only. * ExternalFieldMapperTests to correctly handle LatLonPoint type * GeoPointFieldMapperTests to correctly test stored geo_point fields	2016-09-13 16:27:55 -05:00
Nicholas Knize	821004d5cd	[TEST] Refactor LegacyGeohashMappingGeoPointTests to 2.x indices only These tests should only exist to ensure backcompat with 2.x indices.	2016-09-13 15:27:39 -05:00
Jim Ferenczi	1764ec56b3	Fixed naming inconsistency for fields/stored_fields in the APIs (#20166 ) This change replaces the fields parameter with stored_fields when it makes sense. This is dictated by the renaming we made in #18943 for the search API. The following endpoints have been changed to use `stored_fields` instead of `fields`: * get * mget * explain The documentation and the rest API spec have been updated to cope with the changes for the following APIs: * delete_by_query * get * mget * explain The `fields` parameter has been deprecated for the following APIs (it is replaced by _source filtering): * update: the fields are extracted from the _source directly. * bulk: the fields parameter is used but fields are extracted from the source directly so it is allowed to have non-stored fields. Some APIs still have the `fields` parameter for various reasons: * cat.fielddata: the fields parameter relates to the fielddata fields that should be printed. * indices.clear_cache: used to indicate which fielddata fields should be cleared. * indices.get_field_mapping: used to filter fields in the mapping. * indices.stats: get stats on fields (stored or not stored). * termvectors: fields are retrieved from the stored fields if possible and extracted from the _source otherwise. * mtermvectors: * nodes.stats: the fields parameter is used to concatenate completion_fields and fielddata_fields so it's not related to stored_fields at all. Fixes #20155	2016-09-13 20:54:41 +02:00
Jason Tedor	fbe27664a6	Fix prefix logging Today we add a prefix when logging within Elasticsearch. This prefix contains the node name, and index and shard-level components if appropriate. Due to some implementation details with Log4j 2, this does not work for integration tests; instead what we see is the node name for the last node to startup. The implementation detail here is that in Log4j 2 there is only one logger for a name, message factory pair, and the key derived from the message factory is the class name of the message factory. So, when the last node starts up and starts setting prefixes on its message factories, it will impact the loggers for the other nodes. Additionally, the prefixes are lost when logging an exception. This is due to another implementation detail in Log4j 2. Namely, since we log exceptions using a parameterized message, Log4j 2 decides that that means that we do not want to use the message factory that we have provided (the prefix message factory) and so logs the exception without the prefix. This commit fixes both of these issues. Relates #20429	2016-09-13 14:46:34 -04:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Nicholas Knize	ef926894f4	Cut over geo_point field and queries to new LatLonPoint type This commit cuts over geo_point fields to use Lucene's new point-based LatLonPoint type for indexes created in 5.0. Indexes created prior to 5.0 continue to use their respective encoding type. Below is a description of the changes made to support the new encoding type: * New indexes use a new LatLonPointFieldMapper which provides a parse method for the new type * The new LatLonPoint parse method removes support for lat_lon and geohash parameters * Backcompat testing for deprecated lat_lon and geohash parameters is added to all unit and integration tests * LatLonPointFieldMapper provides DocValues support (enabled by default) which uses Lucene's new LatLonDocValuesField type * New LatLonPoint field data classes are added for aggregation support (wraps LatLonPoint's Numeric Doc Values) * MultiFields use the geohash as the string value instead of the lat,lon string making it easier to perform geo string queries on the geohash instead of a lat,lon comma delimited string. Removed Features: * With the removal of geohash indexing, GeoHashCellQuery support is removed for all new indexes (still supported on existing indexes) * LatLonPoint does not support a Distance Range query because it is super inefficient. Instead, the geo_distance_range query should be accomplished using either the geo_distance aggregation, sorting by descending distance on a geo_distance query, or a boolean must not of the excluded distance (which is what the distance_range query did anyway). TODO: * fix/finish yaml changes for plugin and rest integration tests * update documentation	2016-09-13 12:17:36 -05:00
javanna	e0074ee9d4	[TEST] fix MultiMatchQueryIT random docs generation so that they don't interfere in score tests When generating random bogus documents, it could happen that they contain both the terms "the" and "ultimate", which would match the query "the ultimate" better than all the other non bogus documents, which would cause testCrossFieldMode to fail. "the" is a term that's relatively likely to be randomly generated given its length; we can simply increase the minimum length of randomly generated terms to 5, so that there are no collisions, as "the" cannot be generated anymore (nor can "ultimate" as the length doesn't go up to 8). Also made some assertions more accurate to check how many hits match a query rather than checking only that the first or second hits are there. Closes #18873	2016-09-13 18:25:53 +02:00
Nik Everett	afbd7cbeb8	Rework the basic IT for GETing running tasks This integ test relied on the false assumption that `MockTaskManagerListener#onTaskUnregistered` was called before the task was unregistered. It is in fact called after the task is unregistered. This mistake led the test to rarely miss the task it was looking for and fail. Found by https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=ubuntu/4/consoleText	2016-09-13 11:59:50 -04:00
Simon Willnauer	c84bc25500	Cleanup version constant for unsupported version in QuerySearchResult	2016-09-13 17:21:04 +02:00
Nik Everett	7888dbfb31	Add second test case for two fields in range query In this test one field is a number and the other is a date. Closes #20447	2016-09-13 09:26:29 -04:00
Britta Weber	444c4f1af8	remove workaround for highlighter bug with geo queries (#20418 ) This has been fixed in Lucene https://issues.apache.org/jira/browse/LUCENE-7293 This commit also adds the tests from #20412	2016-09-13 14:59:56 +02:00
Tanguy Leroux	6090c51fc5	Add quiet option to disable console logging (#20422 ) This commit adds a -q/--quiet option to Elasticsearch so that it does not log anything in the console and closes stdout & stderr streams. This is useful for SystemD to avoid duplicate logs in both journalctl and /var/log/elasticsearch/elasticsearch.log while still allowing the JVM to print error messages in stdout/stderr if needed. closes #17220	2016-09-13 14:08:24 +02:00
Jason Tedor	c7bfbe3e69	Add health status parameter to cat indices API This commit adds a health status parameter to the cat indices API for filtering on indices that match the specified status (green|yellow|red). Relates #20393	2016-09-13 07:57:18 -04:00
Michael Nitschinger	9ee6624fd1	Network: Allow to listen on virtual interfaces. Previously when trying to listen on virtual interfaces during bootstrap the application would stop working - the interface couldn't be found by the NetworkUtils class. The NetworkUtils utilize the underlying JDK NetworkInterface class which, when asked to lookup by name only takes physical interfaces into account, failing at virtual (or subinterfaces) ones (returning null). Note that when iterating over all interfaces, both physical and virtual ones are taken into account. This changeset asks for all known interfaces, iterates over them and matches on the given name as part of the loop, allowing it to catch both physical and virtual interfaces. As a result, elasticsearch can now also serve on virtual interfaces. A test case has been added which makes sure that all iterable interfaces can be found by their respective name. Note that this PR is a second iteration over the previously merged but later reverted #19537 because it causes tests to fail when interfaces are down. The test has been modified to take this into account now. Closes #17473 Closes #19568 Relates #19537	2016-09-13 13:40:09 +02:00
javanna	7894eba2b3	[TEST] add test for match query parsing error when providing an array of terms Match query throws parsing errors when an array of terms is provided, we should test that to make sure this behaviour doesn't change. Relates to #15741	2016-09-13 12:46:35 +02:00
Boaz Leskes	10dcfa3304	Fix concurrency issues between cancelling a relocation and marking shard as relocated (#20443 ) Once a primary is marked as relocated, we can not safely move it back to started (as we have no way of waiting on inflight operations that are performed on the target primary). If the master cancels the relocation in that state, we fail the primary. Sadly, there is a race condition between the `updateRoutingEntry` method (which is called when the relocation is cancelled by the master) and the `relocated` method. That race condition can leave the shard as marked "relocated" but have the routing entry not reflect the target relocation. This in turn causes NPEs in TransportReplicationAction: ``` java.util.Objects requireNonNull Objects.java 203 org.elasticsearch.action.support.replication.TransportReplicationAction$ConcreteShardRequest <init> TransportReplicationAction.java 982 ``` Sadly, once we end up in this state, we will never recover. This commit fixes that race condition by making sure `updateRoutingEntry` acquires the mutex when checking for the relocated status. While at it, I also tightened up the code and added lots of assertions/hard checks.	2016-09-13 12:44:40 +02:00
makeyang	1ae8d6123f	Add node name to decider trace logging (#20437 ) Adds the entire DiscoveryNode object to the trace log in AllocationDeciders. The allocation decider logging at TRACE level can sometimes be helpful to determine why a shard is not getting allocated on specific nodes. Currently, we only log the node id for these messages. It will be helpful to also include the node name (esp. when dealing with a lot of nodes in the cluster).	2016-09-13 11:17:39 +02:00
Lee Hinman	3439796df3	Merge branch 'pr/18683'	2016-09-12 16:24:09 -06:00


6332 Commits