OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	bb6e7eeb7a	[TEST] Don't use transport client if we are blocking internal actions we might run into disconnects	2016-09-14 17:50:14 +02:00
Simon Willnauer	b35f7446ce	Remove unused imports	2016-09-14 17:44:45 +02:00
Simon Willnauer	d402ca0dd7	Remove poor-mans compression in InternalSearchHit and friends (#20472 ) We still use some crazy poor mans compression in InternalSearchHit that uses a thread local and an unordered map as a lookup table if requested. Stuff like this should be handled by compression on the transport layer rather than in-line in the serialization code. This code is complex enough.	2016-09-14 15:25:25 +02:00
Simon Willnauer	c1e84618a6	Only try to read new segments info if we really flushed the index (#20474 ) There is no reason to read the current segments info unless we flushed / committed the lucene index.	2016-09-14 15:23:17 +02:00
Boaz Leskes	74fc074e5e	fix styling	2016-09-14 10:52:10 +02:00
Simon Willnauer	a1cd6be777	Don't register SearchTransportService handlers more than once (#20468 ) This utility class is used in 3 places while we only need to register the handlers once per node. Otherwise we will see nasty `WARN` logs like: `registered two transport handlers for action indices:data/read/search[phase/fetch/id/scroll]...` This change will only register handlers inside the main TransportSearchAction.	2016-09-14 10:34:40 +02:00
Simon Willnauer	89640965d2	Unguice SearchModule (#20456 ) After this change SearchModule doesn't subclass AbstractModule anymore and all wiring happens in `Node.java`. As a side-effect several tests don't need a guice injector anymore.	2016-09-14 10:07:53 +02:00
Jason Tedor	7560101ec7	Complete Elasticsearch logger names This commit modifies the logger names within Elasticsearch to be the fully-qualified class name as opposed removing the org.elasticsearch prefix and dropping the class name. This change separates the root logger from the Elasticsearch loggers (they were equated from the removal of the org.elasticsearch prefix) and enables log levels to be set at the class level (instead of the package level). Relates #20457	2016-09-13 22:46:54 -04:00
Jason Tedor	0eff7daf5b	Fix logging hierarchy configs Today when setting the logging level via the command-line or an API call, the expectation is that the logging level should trickle down the hiearchy to descendant loggers. However, this is not necessarily the case. For example, if loggers x and x.y are already configured then setting the logging level on x will not descend to x.y. This is because the logging config for x.y has already been forked from the logging config for x. Therefore, we must explicitly descend the hierarchy when setting the logging level and that is what this commit does. Relates #20463	2016-09-13 22:46:14 -04:00
Ali Beyad	4431720c3d	File-based discovery plugin (#20394 ) This commit introduces a new plugin for file-based unicast hosts discovery. This allows specifying the unicast hosts participating in discovery through a `unicast_hosts.txt` file located in the `config/discovery-file` directory. The plugin will use the hosts specified in this file as the set of hosts to ping during discovery. The format of the `unicast_hosts.txt` file is to have one host/port entry per line. The hosts file is read and parsed every time discovery makes ping requests, thus a new version of the file that is published to the config directory will automatically be picked up. Closes #20323	2016-09-13 20:52:39 -04:00
Nicholas Knize	87b06c75b0	[TEST] Fix geo_point backcompat tests This commit fixes the following geo_point bwc tests: * GeoDistanceIT to test deprecated GeoDistanceRangeQuery on legacy indexes only. * ExternalFieldMapperTests to correctly handle LatLonPoint type * GeoPointFieldMapperTests to correctly test stored geo_point fields	2016-09-13 16:27:55 -05:00
Nicholas Knize	821004d5cd	[TEST] Refactor LegacyGeohashMappingGeoPointTests to 2.x indices only These tests should only exist to ensure backcompat with 2.x indices.	2016-09-13 15:27:39 -05:00
Jim Ferenczi	1764ec56b3	Fixed naming inconsistency for fields/stored_fields in the APIs (#20166 ) This change replaces the fields parameter with stored_fields when it makes sense. This is dictated by the renaming we made in #18943 for the search API. The following list of endpoint has been changed to use `stored_fields` instead of `fields`: * get * mget * explain The documentation and the rest API spec has been updated to cope with the changes for the following APIs: * delete_by_query * get * mget * explain The `fields` parameter has been deprecated for the following APIs (it is replaced by _source filtering): * update: the fields are extracted from the _source directly. * bulk: the fields parameter is used but fields are extracted from the source directly so it is allowed to have non-stored fields. Some APIs still have the `fields` parameter for various reasons: * cat.fielddata: the fields paramaters relates to the fielddata fields that should be printed. * indices.clear_cache: used to indicate which fielddata fields should be cleared. * indices.get_field_mapping: used to filter fields in the mapping. * indices.stats: get stats on fields (stored or not stored). * termvectors: fields are retrieved from the stored fields if possible and extracted from the _source otherwise. * mtermvectors: * nodes.stats: the fields parameter is used to concatenate completion_fields and fielddata_fields so it's not related to stored_fields at all. Fixes #20155	2016-09-13 20:54:41 +02:00
Jason Tedor	fbe27664a6	Fix prefix logging Today we add a prefix when logging within Elasticsearch. This prefix contains the node name, and index and shard-level components if appropriate. Due to some implementation details with Log4j 2 , this does not work for integration tests; instead what we see is the node name for the last node to startup. The implementation detail here is that Log4j 2 there is only one logger for a name, message factory pair, and the key derived from the message factory is the class name of the message factory. So, when the last node starts up and starts setting prefixes on its message factories, it will impact the loggers for the other nodes. Additionally, the prefixes are lost when logging an exception. This is due to another implementation detail in Log4j 2. Namely, since we log exceptions using a parameterized message, Log4j 2 decides that that means that we do not want to use the message factory that we have provided (the prefix message factory) and so logs the exception without the prefix. This commit fixes both of these issues. Relates #20429	2016-09-13 14:46:34 -04:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Nicholas Knize	ef926894f4	Cut over geo_point field and queries to new LatLonPoint type This commit cuts over geo_point fields to use Lucene's new point-based LatLonPoint type for indexes created in 5.0. Indexes created prior to 5.0 continue to use their respective encoding type. Below is a description of the changes made to support the new encoding type: * New indexes use a new LatLonPointFieldMapper which provides a parse method for the new type * The new LatLonPoint parse method removes support for lat_lon and geohash parameters * Backcompat testing for deprecated lat_lon and geohash parameters is added to all unit and integration tests * LatLonPointFieldMapper provides DocValues support (enabled by default) which uses Lucene's new LatLonDocValuesField type * New LatLonPoint field data classes are added for aggregation support (wraps LatLonPoint's Numeric Doc Values) * MultiFields use the geohash as the string value instead of the lat,lon string making it easier to perform geo string queries on the geohash instead of a lat,lon comma delimited string. Removed Features: * With the removal of geohash indexing, GeoHashCellQuery support is removed for all new indexes (still supported on existing indexes) * LatLonPoint does not support a Distance Range query because it is super inefficient. Instead, the geo_distance_range query should be accomplished using either the geo_distance aggregation, sorting by descending distance on a geo_distance query, or a boolean must not of the excluded distance (which is what the distance_range query did anyway). TODO: * fix/finish yaml changes for plugin and rest integration tests * update documentation	2016-09-13 12:17:36 -05:00
javanna	e0074ee9d4	[TEST] fix MultiMatchQueryIT random docs generation so that they don't interfere in score tests When generating random bogus documents, it could happen that they contain both the terms "the" and "ultimate", which would match the query "the ultimate" better than all the other non bogus documents, which would cause testCrossFieldMode to fail. "the" is a term that's relatively likely to be randomly generated given its length; we can simply increase the minimum length of randomly generated terms to 5, so that there are no collisions, as "the" cannot be generated anymore (nor can "ultimate" as the lenght doesn't go up to 8). Also made some assertions more accurate to check how many hits match a query rather than checking only that the first or second hits are there. Closes #18873	2016-09-13 18:25:53 +02:00
Nik Everett	afbd7cbeb8	Rework the basic IT for GETing running tasks This integ test relied on the false assumption that `MockTaskManagerListener#onTaskUnregistered` was called before the task was unregistered. It is in fact called after the task is unregistered. This mistake led the test to rarely miss the task it was looking for and fail. Found by https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+5.x+multijob-unix-compatibility/os=ubuntu/4/consoleText	2016-09-13 11:59:50 -04:00
Simon Willnauer	c84bc25500	Cleanup version constant for unsupported version in QuerySearchResult	2016-09-13 17:21:04 +02:00
Nik Everett	7888dbfb31	Add second test case for two fields in range query In this test one field is a number and the other is a date. Closes #20447	2016-09-13 09:26:29 -04:00
Britta Weber	444c4f1af8	remove workaround for highlighter bug with geo queries (#20418 ) This has been fixed in Lucene https://issues.apache.org/jira/browse/LUCENE-7293 This commit also adds the tests from #20412	2016-09-13 14:59:56 +02:00
Tanguy Leroux	6090c51fc5	Add quiet option to disable console logging (#20422 ) This commit adds a -q/--quiet option to Elasticsearch so that it does not log anything in the console and closes stdout & stderr streams. This is useful for SystemD to avoid duplicate logs in both journalctl and /var/log/elasticsearch/elasticsearch.log while still allows the JVM to print error messages in stdout/stderr if needed. closes #17220	2016-09-13 14:08:24 +02:00
Jason Tedor	c7bfbe3e69	Add health status parameter to cat indices API This commit adds a health status parameter to the cat indices API for filtering on indices that match the specified status (green\|yellow\|red). Relates #20393	2016-09-13 07:57:18 -04:00
Michael Nitschinger	9ee6624fd1	Network: Allow to listen on virtual interfaces. Previously when trying to listen on virtual interfaces during bootstrap the application would stop working - the interface couldn't be found by the NetworkUtils class. The NetworkUtils utilize the underlying JDK NetworkInterface class which, when asked to lookup by name only takes physical interfaces into account, failing at virtual (or subinterfaces) ones (returning null). Note that when interating over all interfaces, both physical and virtual ones are taken into account. This changeset asks for all known interfaces, iterates over them and matches on the given name as part of the loop, allowing it to catch both physical and virtual interfaces. As a result, elasticsearch can now also serve on virtual interfaces. A test case has been added which makes sure that all iterable interfaces can be found by their respective name. Note that this PR is a second iteration over the previously merged but later reverted #19537 because it causes tests to fail when interfaces are down. The test has been modified to take this into account now. Closes #17473 Closes #19568 Relates #19537	2016-09-13 13:40:09 +02:00
javanna	7894eba2b3	[TEST] add test for match query parsing error when providing an array of terms Match query throws parsing errors when an array of terms is provided, we should test that to make sure this behaviour doesn't change. Relates to #15741	2016-09-13 12:46:35 +02:00
Boaz Leskes	10dcfa3304	Fix concurrency issues between cancelling a relocation and marking shard as relocated (#20443 ) Once a primary is marked as relocated, we can not safely move it back to started (as we have no way of waiting on inflight operations that are performed on the target primary). If the master cancels the relocation in that state, we fail the primary. Sadly, there is a racing condition between the `updateRoutingEntry` method (which is called when the relocation is cancelled by the master) and the `relocated` method. That racing condition can leave the shard as marked "relocated" but have the routing entry not reflect the target relocation. This in turns causes NPEs in TransportReplicationAction: ``` java.util.Objects requireNonNull Objects.java 203 org.elasticsearch.action.support.replication.TransportReplicationAction$ConcreteShardRequest <init> TransportReplicationAction.java 982 ``` Sadly, once we end up in this state, we will never recover. This commit fixes that race condition by making sure `updateRoutingEntry` acquires the mutex when checking for the relocated status. While at it, I also tightened up the code and added lots of assertions/hard checks.	2016-09-13 12:44:40 +02:00
makeyang	1ae8d6123f	Add node name to decider trace logging (#20437 ) Adds the entire DiscoveryNode object to the trace log in AllocationDeciders. The allocation decider logging at TRACE level can sometimes be helpful to determine why a shard is not getting allocated on specific nodes. Currently, we only log the node id for these messages. It will be helpful to also include the node name (esp. when dealing with a lot of nodes in the cluster).	2016-09-13 11:17:39 +02:00
Lee Hinman	3439796df3	Merge branch 'pr/18683'	2016-09-12 16:24:09 -06:00
Lee Hinman	44278db1bc	Merge pull request #20433 from dakrone/remove-cluster-name-folder-fallback No longer allow cluster name in data path	2016-09-12 17:01:49 -05:00
Lee Hinman	94625d74e4	No longer allow cluster name in data path In 5.x we allowed this with a deprecation warning. This removes the code added for that deprecation, requiring the cluster name to not be in the data path. Resolves #20391	2016-09-12 15:47:01 -06:00
Simon Willnauer	686994ae2d	Deguice SearchService and friends (#20423 ) This change removes the guice dependency handling for SearchService and several related classes like SearchTransportController and SearchPhaseController. The latter two now have package private constructors and dependencies like FetchPhase are now created by calling their constructors explicitly. This also cleans up several users of the DefaultSearchContext and centralized it's creation inside SearchService.	2016-09-12 22:42:55 +02:00
Boaz Leskes	7f92971f26	remove assumeX methods from IndexShardTests The cause early termination of tests, which means we don't clean up and close shards, but also don't cause a failure. This in turns makes TestRuleTemporaryFilesCleanup fail on windows (because it does try to clean up, but the files are referenced). Getting stuff like: ``` > C:\jenkins\workspace\es_core_master_windows-2012-r2\core\build\testrun\test\J3\temp\org.elasticsearch.index.shard.IndexShardTests_68B5E1103D78A58B-001\tempDir-006\indices\_na_\0\translog\translog-1.tlog: java.nio.file.AccessDeniedException: C:\jenkins\workspace\es_core_master_windows-2012-r2\core\build\testrun\test\J3\temp\org.elasticsearch.index.shard.IndexShardTests_68B5E1103D78A58B-001\tempDir-006\indices\_na_\0\translog\translog-1.tlog ```	2016-09-12 22:29:42 +02:00
Ali Beyad	b1e87aa13c	Split allocator decision making from decision application (#20347 ) Splits the PrimaryShardAllocator and ReplicaShardAllocator's decision making for a shard from the implementation of that decision on the routing table. This is a step toward making it easier to use the same logic for the cluster allocation explain APIs.	2016-09-12 16:21:39 -04:00
Luca Cavanna	119d198cc5	Merge pull request #20424 from javanna/enhancement/error_fetch_source_disabled Throw error when trying to fetch fields from source and source is disabled	2016-09-12 18:36:43 +02:00
Boaz Leskes	b08352047d	Introduce IndexShardTestCase (#20411 ) Introduce a base class for unit tests that are based on real `IndexShard`s. The base class takes care of all the little details needed to create and recover shards. This commit also moves `IndexShardTests` and `ESIndexLevelReplicationTestCase` to use the new base class. All tests in `IndexShardTests` that required a full node environment were moved to a new `IndexShardIT` suite.	2016-09-12 18:20:25 +02:00
Ali Beyad	f39f9b9760	Update discovery nodes after cluster state is published (#20409 ) Before, when there was a new cluster state to publish, zen discovery would first update the set of nodes to ping based on the new cluster state, then publish the new cluster state. This is problematic because if the cluster state failed to publish, then the set of nodes to ping should not have been updated. This commit fixes the issue by updating the set of nodes to ping for fault detection only after the new cluster state has been published.	2016-09-12 12:07:51 -04:00
javanna	2a1ed80262	With #20093 we fixed a NPE thrown when using _source include/exclude and source is disabled in the mappings. Fixing meant ignoring the _source parameter in the request as no fields can be extracted from it. We should rather throw a clear exception to clearly point out that we cannot extract fields from _source. Note that this happens only when explicitly trying to extract fields from source. When source is disabled and no _source parameter is specified, no errors will be thrown and no source will be returned. Closes #20408 Relates to #20093	2016-09-12 17:36:48 +02:00
Jim Ferenczi	82fd95fd24	Merge pull request #20400 from jimferenczi/function_score_highlight Fix highlighting of MultiTermQuery within a FunctionScoreQuery	2016-09-12 15:56:06 +02:00
Luca Cavanna	b1a2768d7d	Merge pull request #20188 from qwerty4030/fix/3839_multi_index_add_remove Fix IndexNotFoundException in multi index search request.	2016-09-12 14:42:56 +02:00
Jun Ohtani	770abd7af8	Merge pull request #20396 from johtani/fix/fail_loading_non_prebuilt_tokenfilter_in_analyze_api Can load non-PreBuiltTokenFilter in Analyze API	2016-09-10 09:35:23 +09:00
Luca Cavanna	4b00cc37a1	Merge pull request #20382 from javanna/enhancement/cleanup_parse_elements Cleanup sub fetch phase extension point	2016-09-09 22:47:15 +02:00
javanna	9a84cb99f4	remove writeBoolean from searchExtBuilders serialization in SearchSourceBuilder The list is not optional anymore, default is empty list	2016-09-09 21:24:18 +02:00
Tal Levy	dda32545bb	add ignore_missing option to relevant processors (#20194 )	2016-09-09 12:20:18 -07:00
javanna	17d48c1ff6	adjust SearchExtBuilder javadocs	2016-09-09 21:17:16 +02:00
javanna	90ab460fcc	move parsing of search ext sections to the coordinating node	2016-09-09 19:10:42 +02:00
Nicholas Knize	297fc8373d	[TEST] Fix offsets in BaseXContentTestCase.testBinaryValueWithOffsetLength The max value for randomIntBetween is inclusive, so we should use byte array length minus one to avoid an AIOB exception.	2016-09-09 11:47:37 -05:00
javanna	65c7f61ad9	decouple registration of SearchExtParsers from sub fetch phases Search section supports an ext section that is used to provide additional config needed from plugins. It is now tied to sub fetch phases because it is the only section that may need additional config, but there is no reason for the two to be tightly coupled. It is now possible to register a searchExtParser independently from a sub fetch phase. All a search ext parser does is parsing some ext section of a search request, whose parsed resulting object is stored in the search context for later retrieval.	2016-09-09 18:05:49 +02:00
javanna	455a2143f1	move SearchExtParser back to o.e.search package The parser is now needed only for sub fetch phases, but doesn't have to be strictly connected to them, it could be used for something else as well potentially	2016-09-09 18:05:49 +02:00
javanna	12eaeb3945	FetchSubPhase to support a single parser that extends SearchExtParser	2016-09-09 18:05:49 +02:00
javanna	f9530dfe8f	remove FetchSubPhaseContext in favour of generic fetch sub phase builder of type object The context was an object where the parsed info are stored. That is more of what we call the builder since after the search refactoring. No need for generics in FetchSubPhaseParser then. Also the previous setHitsExecutionNeeded wasn't useful, it can be removed as well, given that once there is a parsed ext section, it will become a builder that can be retrieved by the sub fetch phase. The sub fetch phase is responsible for doing nothing in case the builder is not set, meaning that the fetch sub phase is plugged in but the request didn't have the corresponding section.	2016-09-09 18:05:49 +02:00

1 2 3 4 5 ...

6310 Commits