OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	8b43e21521	Fix multi fields empty query (#33017 ) This change fixes empty query removal when all fields remove the search term in `simple_query_string`, `multi_match` and `query_string`. Closes #33009	2018-08-21 22:12:53 +02:00
Igor Motov	3973bb4028	Fix north pole overflow error in GeoHashUtils.bbox() (#32891 ) Fixes an overflow error in GeoHashUtils.bbox() calculation of a bounding box for geohashes with maximum precision located next to the north pole.	2018-08-21 14:59:37 -04:00
Jason Tedor	bdfcc326d7	Enable avoiding mmap bootstrap check (#32421 ) The maximum map count bootstrap check can be a hindrance to users that do not own the underlying platform on which they are executing Elasticsearch. This is because addressing it requires tuning the kernel and a platform provider might not allow this, especially on shared infrastructure. However, this bootstrap check is not needed if mmapfs is not in use. Today we do not have a way for the user to communicate that they are not going to use mmapfs. This commit therefore adds a setting that enables the user to disallow mmapfs. When mmapfs is disallowed, the maximum map count bootstrap check is not enforced. Additionally, we fall back to a different default index store and prevent the explicit use of mmapfs for an index.	2018-08-21 11:02:25 -04:00
Simon Willnauer	92076497e5	Use a dedicated ConnectionManager for RemoteClusterConnection (#32988 ) This change introduces a dedicated ConnectionManager for every RemoteClusterConnection such that there is no state shared with the TransportService internal ConnectionManager. All connections to a remote cluster are isolated from the TransportService but still use the TransportService and its internal properties like the Transport, tracing and internal listener actions on disconnects etc. This allows a remote cluster connection to have a different lifecycle than a local cluster connection. Also, local discovery code doesn't get notified if there is a disconnect from a remote cluster, and each connection can use its own dedicated connection profile, which allows a reduced set of connections per cluster without conflicting with the local cluster. Closes #31835	2018-08-21 12:43:25 +02:00
Armin Braun	200078734c	INGEST: Simplify IngestService (#33008 ) * INGEST: Simplify IngestService * Follow up to #32617 * Flatten redundant inner classes of `IngestService`	2018-08-21 10:13:32 +02:00
Armin Braun	8fc213f237	INGEST: Move all Pipeline State into IngestService (#32617 ) * INGEST: Move all Pipeline State into IngestService * Moves all pipeline state into the ingest service * Retains the existing pipeline store and pipeline execution service as inner classes to make the review easier, they should be flattened out in the next step * All tests for these classes were copied (and adapted) to the ingest service tests * This is a refactoring step to enable a clean implementation of a pipeline processor (See #32473)	2018-08-21 05:05:32 +02:00
Jason Tedor	ad0a965db9	Protect scheduler engine against throwing listeners (#32998 ) There are two problems with the scheduler engine today. Both relate to listeners that throw. The first problem is that any triggered listener that throws a plain old exception will cause no additional listeners to be triggered for the event, and will also cause the scheduler to never be invoked again. This leads to lost events and is bad. The second problem is that any triggered listener that throws an error of the fatal kind will not lead to that error being caught by the uncaught exception handler. This is because the triggered listener is executed as a future task under a scheduled thread pool executor. A throwable there gets caught by the JDK framework and set as the outcome on the future task. Since we never inspect these tasks for their outcomes, nor is there a good place to do this, we have to handle these errors ourselves. To do this, we catch them and dispatch them to the uncaught exception handler via a forked thread. This is similar to our handling in Netty.	2018-08-20 22:07:16 -04:00
Nhat Nguyen	40f1bb5e5e	Trim translog when safe commit advanced (#32967 ) Since #28140 when the global checkpoint is advanced, we try to move the safe commit forward, and clean up old index commits if possible. However, we forget to trim unreferenced translog. This change makes sure that we prune both old translog and index commits when the safe commit advanced. Relates #28140 Closes #32089	2018-08-20 15:13:19 -04:00
Nik Everett	462e91d362	Logging: Use settings when building daemon threads (#32751 ) Subclasses of `EsIntegTestCase` run multiple Elasticsearch nodes in the same JVM and when we log we look at the name of the thread to figure out the node name. This makes sure that all calls to `daemonThreadFactory` include the node name. Closes #32574 I'd like to follow this up with more drastic changes that make it impossible to do this incorrectly but that change is much larger than this and I'd like to get these log lines fixed up sooner rather than later.	2018-08-20 13:53:15 -04:00
Andrey Ershov	0749b18181	All Translog inner closes should happen after tragedy exception is set (#32674 ) We faced a nasty race condition. See #32526 InternalEngine.failOnTragic method has thrown AssertionError. If you look carefully at the if branches in this method, you will spot that this is only possible if either the Lucene IndexWriter has closed from inside or the Translog has closed from inside, but the tragedy exception is not set. For now, let us concentrate on the Translog class. We found out that there are two methods in Translog, namely rollGeneration and trimOperations, that close the Translog in case of an Exception without the tragedy exception being set. This commit fixes these 2 methods. To fix it, we pull tragedyException from TranslogWriter up to the Translog class, because in these 2 methods IndexWriter could be innocent, but still Translog needs to be closed. Also, tragedyException is wrapped with TragicExceptionHolder to reuse CAS/addSuppressed functionality in Translog and TranslogWriter. Also, to protect us in the future and make sure the close method is never called from inside Translog, a special assertion examining the stack trace is added. Since we're still targeting Java 8 for runtime, no StackWalker API is used in the implementation. In the stack-trace checking method, we consider as inner callers not only Translog methods but methods of Translog child classes as well. It does mean that Translog is meant to be extended, but it's needed to be able to test this method. Closes #32526	2018-08-20 19:22:10 +02:00
Tim Brooks	faa42de66d	Pass DiscoveryNode to initiateChannel (#32958 ) This is related to #32517. This commit passes the DiscoveryNode to the initiateChannel method for different Transport implementation. This will allow additional attributes (besides just the socket address) to be used when opening channels.	2018-08-20 08:54:55 -06:00
Jonathan Little	676091aafb	Protect ScriptedMetricIT test cases against failures on 0-doc shards (#32959 ) (#32968 ) Randomized test conditions that cause some shards to have no docs on them failed due to test asserts that relied on a lazy initialization side effect from the map script. After this fix: - Test cases with the relevant init script are protected - Test cases with the relevant combine or reduce scripts were already protected, because the combine and reduce scripts safely handle this case.	2018-08-20 08:55:43 +01:00
Alpar Torok	4b34b3f4aa	Set forbidden APIs target compatibility to compiler java version (#32935 ) Set forbidden apis target compatibility to compiler version Fix outstanding deprecation	2018-08-20 09:27:02 +03:00
Tim Brooks	de92d2ef1f	Move connection listener to ConnectionManager (#32956 ) This is a followup to #31886. After that commit the TransportConnectionListener had to be propagated to both the Transport and the ConnectionManager. This commit moves that listener to completely live in the ConnectionManager. The request and response related methods are moved to a TransportMessageListener. That listener continues to live in the Transport class.	2018-08-18 10:09:24 -06:00
Armin Braun	f82bb64feb	NETWORKING: Make RemoteClusterConn. Lazy Resolve DNS (#32764 ) * Lazy resolve DNS (i.e. `String` to `DiscoveryNode`) to not run into indefinitely caching lookup issues (provided the JVM dns cache is configured correctly as explained in https://www.elastic.co/guide/en/elasticsearch/reference/6.3/networkaddress-cache-ttl.html) * Changed `InetAddress` type to `String` for that higher up the stack * Passed down `Supplier<DiscoveryNode>` instead of outright `DiscoveryNode` from `RemoteClusterAware#buildRemoteClustersSeeds` on to lazy resolve DNS when the `DiscoveryNode` is actually used (could've also passed down the value of `clusterName = REMOTE_CLUSTERS_SEEDS.getNamespace(concreteSetting)` together with the `List<String>` of hosts, but this route seemed to introduce less duplication and resulted in a significantly smaller changeset). * Closes #28858	2018-08-18 08:46:44 +02:00
Nhat Nguyen	86ffce4bbc	TEST: Mute testRetentionPolicyChangeDuringRecovery Tracked at #32089	2018-08-17 14:12:45 -04:00
Igor Motov	da6b61e8ef	Make Geo Context Mapping Parsing More Strict (#32821 ) Currently, if geo context is represented by something other than geo_point or an object with lat and lon fields, the parsing of it as a geo context can result in ignoring the context altogether, returning confusing errors such as number_format_exception or trying to parse the number specifying as long-encoded hash code. It would also fail if the geo_point was stored. This commit makes the mapping parsing more strict and will fail during mapping update or index creation if the geo context doesn't point to a geo_point field. Supersedes #32412 Closes #32202	2018-08-17 08:13:16 -07:00
Jonathan Little	a08127c072	Scripted metric aggregations: add deprecation warning and system property to control legacy params (#31597 ) * Scripted metric aggregations: add deprecation warning and system property to control legacy params Scripted metric aggregation params._agg/_aggs are replaced by state/states context variables. By default the old params are still present, and a deprecation warning is emitted when Scripted Metric Aggregations are used. A new system property can be used to disable the legacy params. This functionality will be removed in a future revision. * Fix minor style issue and docs test failure * Disable deprecated params._agg/_aggs in tests and revise tests to use state/states instead * Add integration test covering deprecated scripted metrics aggs params._agg/_aggs access * Disable deprecated params._agg/_aggs in docs integration tests and revise stored scripts to use state/states instead * Revert unnecessary migrations doc change A relevant note should be added in the changes destined for 7.0; this PR is going to be backported to 6.x. * Replace deprecated _agg param bwc integration test with a couple of unit tests * Fix compatibility test after merge * Rename backwards compatibility system property per code review feedback * Tweak deprecation warning text per review feedback	2018-08-17 13:11:18 +01:00
Alexander Reelsen	0d92f377fd	Tests: Fix timezone conversion in DateTimeUnitTests This fix prevents trying to parse unknown timezone ids by converting the joda time zone via java.util.TimeZone to a java time based ZoneId. Closes #32927	2018-08-17 14:09:01 +02:00
Paul Sanwald	ca54aacbb5	Fix InternalAutoDateHistogram reproducible failure (#32723 ) Update test logic to correctly bucket intervals.	2018-08-17 07:03:25 -04:00
Andrey Ershov	2fa028cfa1	Remove assertion in testDocStats on deletedDocs counter (#32914 ) The testDocStats test is flaky: it sometimes fails on Jenkins and the failure is not reproducible locally. The reason for this failure is timing. If the number of deleted documents is greater than 33% of inserted documents, Lucene will schedule segments to merge if TieredMergePolicy is used (it's not the case for LogMergePolicy, but ES is only using TieredMergePolicy). If this merge is performed before stats are retrieved, we will get 0 for the "deleted" counter. So this counter could be either 0 or numOfDeletedDocs at this point, which makes the assertion too loose to be useful, so we decided to remove it altogether. Closes #32766	2018-08-17 12:36:45 +02:00
JB Nizet	dd5a5aab88	Fix allowed value for HighlighterBuilder encoder in javadocs (#32780 ) Relates to #32745	2018-08-17 10:59:26 +02:00
Julie Tibshirani	cbf160a4e6	For filters aggs, make sure that rewrites preserve other_bucket. (#32921 )	2018-08-16 17:36:58 -07:00
Jim Ferenczi	3dd1677cdc	[Test] Fix DuelScrollIT#testDuelIndexOrderQueryThenFetch This commit disables the automatic `refresh_interval` in order to ensure that index readers cannot differ between the normal and scroll search. This issue is related to the 7.5 Lucene upgrade which contains a change that makes single segment merge more likely to occur (max deletes percentage). Closes #32682	2018-08-16 15:33:17 +02:00
Jason Tedor	f8c7414ee8	Remove passphrase support from reload settings API (#32889 ) We do not support passphrases on the secure settings storage (the keystore). Yet, we added support for this in the API layer. This commit removes this support so that we are not limited in our future options, or have to make a breaking change.	2018-08-16 07:24:05 -04:00
Adrien Grand	e35be01901	AwaitFix AckIT. Relates #32767	2018-08-16 12:31:58 +02:00
Colin Goodheart-Smithe	d80457ee2a	Mutes test in DuelScrollIT Due to https://github.com/elastic/elasticsearch/issues/32682	2018-08-16 11:08:00 +01:00
Jay Modi	1a45b27d8b	Move CharArrays to core lib (#32851 ) This change cleans up some methods in the CharArrays class from x-pack, which includes the unification of char[] to utf8 and utf8 to char[] conversions that intentionally do not use strings. There was previously an implementation in x-pack and in the reloading of secure settings. The method from the reloading of secure settings was adopted as it handled more scenarios related to the backing byte and char buffers that were used to perform the conversions. The cleaned up class is moved into libs/core to allow it to be used by requests that will be migrated to the high level rest client. Relates #32332	2018-08-15 15:26:00 -06:00
Jason Tedor	364ccc36d6	Fix global checkpoint listeners test This commit fixes a global checkpoint listeners test wherein we were expecting an executor to have been used even if there were no listeners. This is silliness, so this commit adjusts the assertion to verify that the executor never fires if there are no listeners, and fires exactly once if there is one or more listeners.	2018-08-15 15:53:15 -04:00
Armin Braun	986c55b830	INGEST: Add Configuration Except. Data to Metadata (#32322 ) * closes #27728	2018-08-15 19:02:19 +02:00
Jason Tedor	068d03f56b	Introduce global checkpoint listeners (#32696 ) This commit introduces the ability for global checkpoint listeners to be registered at the shard level. These listeners are notified when the global checkpoint is updated, and also when the shard closes. To encapsulate these listeners, we introduce a shard-level component that handles synchronization of notification and modifications to the collection of listeners.	2018-08-15 12:04:24 -04:00
Tim Brooks	2464b68613	Move connection profile into connection manager (#32858 ) This is related to #31835. It moves the default connection profile into the ConnectionManager class. This will allow us to have different connection managers with different profiles.	2018-08-15 09:08:33 -06:00
Lee Hinman	48281ac5bc	Use generic AcknowledgedResponse instead of extended classes (#32859 ) This removes custom Response classes that extend `AcknowledgedResponse` and do nothing, these classes are not needed and we can directly use the non-abstract super-class instead. While this appears to be a large PR, no code has actually changed, only class names have been changed and entire classes removed.	2018-08-15 08:06:14 -06:00
Andy Bristol	a1cff86012	[test] mute IndexShardTests.testDocStats For #32766	2018-08-14 18:21:59 -07:00
Armin Braun	27e64e7251	MINOR: Remove `IndexTemplateFilter` (#32841 ) * This isn't used anywhere anymore ever since `00c123b59f8ba11eb260e6b70acf7be80bccc949` and `dc166c5dc6bcf4abb7f25c6f4143f07d8176333d`	2018-08-14 16:01:33 +02:00
Alexander Reelsen	87481a0e34	Core: Add java time version of rounding classes (#32641 ) This commit adds a java time version of the existing rounding classes, which features the same test suite and a small test class to check if serialization works as expected.	2018-08-14 13:52:55 +02:00
markharwood	e5ab09f708	Aggregations/HL Rest client fix: missing scores (#32774 ) Significance score doubles were being parsed as long. Existing tests did not catch this because SignificantLongTermsTests and SignificantStringTermsTests did not set the score. Fixed these and also added integration test. Thanks for the report/fix, Blakko Closes #32770	2018-08-14 11:14:47 +01:00
Armin Braun	124c1f1358	INGEST: Create Index Before Pipeline Execute (#32786 ) * INGEST: Create Index Before Pipeline Execute * Ensures that indices are created before the default pipeline setting is read to correctly handle the case of an index template containing a default pipeline (without the fix the first document does not get the pipeline applied as explained in #32758) * closes #32758	2018-08-14 11:27:08 +02:00
Yannick Welsch	a8bfa466b2	Fix NOOP bulk updates (#32819 ) #31821 introduced an unreleased bug where NOOP updates were incorrectly mutating the bulk shard request, inserting null item to be replicated, which would result in NullPointerExceptions when serializing the request to be shipped to the replicas. Closes #32808	2018-08-14 08:20:35 +02:00
Tim Brooks	10fddb62ee	Remove client connections from TcpTransport (#31886 ) This is related to #31835. This commit adds a connection manager that manages client connections to other nodes. This means that the TcpTransport no longer maintains a map of nodes that it is connected to.	2018-08-13 16:44:09 -06:00
Nhat Nguyen	8a003e1281	Increase logging testRetentionPolicyChangeDuringRecovery Relates #32089	2018-08-13 16:29:34 -04:00
Armin Braun	d412230cda	SCRIPTING: Support BucketAggScript return null (#32811 ) * As explained in #32790, `BucketAggregationScript` must support `null` as a return value * Closes #32790	2018-08-13 20:08:26 +02:00
Nhat Nguyen	cb2273b02a	Mute IndicesRequestIT#testBulk Tracked at #32808	2018-08-13 10:10:33 -04:00
Ryan Ernst	cb1d467124	Cat apis: Fix index creation time to use strict date format (#32510 ) With the move to java time, the default formatter used by toString on ZonedDateTime uses optional components for least significant portions of the date. This commit changes the cat indices api to use a strict date time format, which will always output milliseconds, even if they are zero. closes #32466	2018-08-10 13:15:00 -07:00
Christoph Büscher	22f7b03430	Fix test reproducibility in AbstractBuilderTestCase setup (#32403 ) Currently AbstractBuilderTestCase generates certain random values in its `beforeTest()` method annotated with @Before only the first time that a test method in the suite is run while initializing the serviceHolder that we use for the rest of the test. This changes the values of subsequent random values and has the effect that when running single methods from a test suite with "-Dtests.method=*", the random values it sees are different from when the same test method is run as part of the whole test suite. This makes it hard to use the reproduction lines logged on failure. This change runs the initialization of the serviceHolder and the randomization connected to it using the test runner's master seed, so reproduction by running just one method is possible again. Closes #32400	2018-08-10 15:13:44 +02:00
Alexander Reelsen	f236bb3ff6	Tests: Muted ScriptDocValuesDatesTests.testJodaTimeBwc Relates #32779	2018-08-10 14:38:23 +02:00
Boaz Leskes	f58ed21720	Refactor TransportShardBulkAction to better support retries (#31821 ) Processing bulk request goes item by item. Sometimes during processing, we need to stop execution and wait for a new mapping update to be processed by the node. This is currently achieved by throwing a `RetryOnPrimaryException`, which is caught higher up. When the exception is caught, we wait for the next cluster state to arrive and process the request again. Sadly this is a problem because all operations that were already done until the mapping change was required are applied again and get new sequence numbers. This in turn means that the previously issued sequence numbers are never replicated to the replicas. That causes the local checkpoint of those shards to be stuck and with it all the seq# based infrastructure. This commit refactors how we deal with retries with the goal of removing `RetryOnPrimaryException` and `RetryOnReplicaException` (not done yet). It does so by introducing a class `BulkPrimaryExecutionContext` that is used to capture the execution state and allows continuing from where the execution stopped. The class also formalizes the steps each item has to go through: 1) A translation phase for updates 2) Execution phase (always index/delete) 3) Waiting for a mapping update to come in, if needed 4) Requires a retry (for updates and cases where the mappings are still not available after the put mapping call returns) 5) A finalization phase which allows updates to the index/delete result to an update result.	2018-08-10 10:15:01 +02:00
Alexander Reelsen	798fb546cb	Core: Create java time based DateMathParser (#32131 ) This adds a java time based date math parser class, which will replace the joda date based one in the future. For now the class also returns the date in milliseconds since the epoch.	2018-08-10 09:38:18 +02:00
lipsill	be54ba39c4	Add expected mapping type to `MapperException` (#31564 ) Currently if a document cannot be indexed because it violates the defined mapping for the index, a MapperException is thrown. In some cases it is useful to expose the expected field type in the exception itself, so that the user can react based on the error message. This change adds the expected data type to the MapperException. Closes #31502	2018-08-09 23:10:51 +02:00
Nik Everett	294ab7ee96	Core: Remove some logging constructors (#32513 ) Remove a few of the logger constructors that aren't widely used or aren't used at all and deprecate a few more logger constructors in favor of log4j2's `LogManager`.	2018-08-09 16:11:48 -04:00
Nicholas Knize	e162127ff3	Upgrade to Lucene-7.5.0-snapshot-13b9e28f9d The main feature is the inclusion of bkd backed geo_shape with INTERSECT, DISJOINT, WITHIN bounding box and polygon query support.	2018-08-09 11:15:02 -05:00
Armin Braun	79375d35bb	Scripting: Replace Update Context (#32096 ) * SCRIPTING: Move Update Scripts to their own context * Added system property for backwards compatibility of change to `ctx.params`	2018-08-09 14:32:36 +02:00
Alexander Reelsen	823d40e19b	Core: Fix Java Time DateFormatter printers (#32592 ) A bug in the test suite prevented it from properly checking that all date formatters print the date the same way joda time does. This fixes the test and thus also a fair share of formats, which now use the strict parser for printing.	2018-08-09 10:01:40 +02:00
Lee Hinman	7af28c48c3	Switch WritePipelineResponse to AcknowledgedResponse (#32722 ) We previously discussed moving the classes extending `AcknowledgedResponse` to simply use `AcknowledgedResponse`, making the class non-abstract. This moves the first class to do this, removing `WritePipelineResponse` in the process. If we like the way this looks, I will switch the remaining classes over to using `AcknowledgedResponse`.	2018-08-08 16:21:58 -06:00
Suresh N S	7fdf898518	Whitelisting / from Circuit Breaker Exception (#32325 ) (#32666 ) When Circuit Breaker has tripped, certain diagnostic requests like "_cluster/health" succeed whereas a request to / fails with 503 Service Unavailable. This behavior is observed because of this commit `f32b700` where certain API paths are whitelisted from Circuit Breaking exception, but / is not whitelisted. Added / to circuit breaker whitelist so that it can be used for diagnostic purposes	2018-08-08 08:24:53 -06:00
Colin Goodheart-Smithe	781e6ad551	Fixes suggestion generics (#32706 ) * Fixes suggestion generics This solves a compile problem in Eclipse where Eclipse could not resolve the generics for the options field in `PhraseSuggestion.Entry`. But I think this is also a good change in general because `PhraseSuggestion.Entry` is now declaring the specific `Option` implementation it requires rather than `Suggest.Entry.Option` which is more general and could lead to weird bugs. `CompletionSuggestion.Entry` and `TermSuggestion.Entry` already declare the more specific class they use so I think this was an oversight in `PhraseSuggestion.Entry` * iter	2018-08-08 12:46:38 +01:00
Luca Cavanna	3e437438d5	Prevent cause from being null in ShardOperationFailedException (#32640 ) `ShardOperationFailedException` and corresponding implementors seem to suggest that the cause may be null, case that is also handled in a few places. Yet, it does not seem to be possible in practice for the cause to be null, hence we can clean that up and enforce the cause to be a non null value. This is best done by making `ShardOperationFailedException` an abstract class rather than an interface, which holds the basic member instance that all the subclasses have in common and can also enforce that cause, status and reason are non null.	2018-08-08 09:59:22 +02:00
Luca Cavanna	5c2ef5e869	Preserve index_uuid when creating QueryShardException (#32677 ) As part of #32608 we made sure that the fully qualified index name is taken from the query shard context whenever creating a new `QueryShardException`. That change introduced a regression as instead of setting the entire `Index` object to the exception, which holds index name and index uuid, we ended up setting only the index name (including cluster alias). With this commit we make sure that the index uuid does not get lost and we try to lower the chances that a similar bug makes it in another time. That's done by making `QueryShardContext` return the fully qualified `Index` (which also holds the uuid) rather than only the fully qualified index name.	2018-08-08 09:57:11 +02:00
Julie Tibshirani	d7183f8f3d	Make sure that field collapsing supports field aliases. (#32648 )	2018-08-07 16:20:09 -07:00
Andy Bristol	8bfb0f3f8d	serialize suggestion responses as named writeables (#30284 ) Suggestion responses were previously serialized as streamables which made writing suggesters in plugins with custom suggestion response types impossible. This commit makes them serialized as named writeables and provides a facility for registering a reader for suggestion responses when registering a suggester. This also makes Suggestion responses abstract, requiring a suggester implementation to provide its own types. Suggesters which do not need anything additional to what is defined in Suggest.Suggestion should provide a minimal subclass. The existing plugin suggester integration tests are removed and replaced with an equivalent implementation as an example plugin.	2018-08-07 13:31:00 -07:00
Jason Tedor	dcc816427e	Expose whether or not the global checkpoint updated (#32659 ) It will be useful for future efforts to know if the global checkpoint was updated. To this end, we need to expose whether or not the global checkpoint was updated when the state of the replication tracker updates. For this, we add to the tracker a callback that is invoked whenever the global checkpoint is updated. For primaries this will be invoked when the computed global checkpoint is updated based on state changes to the tracker. For replicas this will be invoked when the local knowledge of the global checkpoint is advanced from the primary.	2018-08-07 15:10:09 -04:00
Tim Brooks	3d5e9114e3	Reduce connections used by MockNioTransport (#32620 ) The MockNioTransport (similar to the MockTcpTransport) is used for integ tests. The MockTcpTransport has always only opened a single connection for all of its work. The MockNioTransport has always opened the default number of connections (13). This means that every test where two transports connect requires 26 connections. This is more than is necessary. This commit modifies the MockNioTransport to only require 3 connections.	2018-08-07 12:52:28 -06:00
Yannick Welsch	45066b5e89	Verify primary mode usage with assertions (#32667 ) Primary terms were introduced as part of the sequence-number effort (#10708) and added in ES 5.0. Subsequent work introduced the replication tracker which lets the primary own its replication group (#25692) to coordinate recovery and replication. The replication tracker explicitly exposes whether it is operating in primary mode or replica mode, independent of the ShardRouting object that's associated with a shard. During a primary relocation, for example, the primary mode is transferred between the primary relocation source and the primary relocation target. After transferring this so-called primary context, the old primary becomes a replication target and the new primary the replication source, reflected in the replication tracker on both nodes. With the most recent PR in this area (#32442), we finally have a clean transition between a shard that's operating as a primary and issuing sequence numbers and a shard that's serving as a replication target. The transition from one state to the other is enforced through the operation-permit system, where we block permit acquisition during such changes and perform the transition under this operation block, ensuring that there are no operations in progress while the transition is being performed. This finally allows us to turn the best-effort checks that were put in place to prevent shards from being used in the wrong way (i.e. primary as replica, or replica as primary) into hard assertions, making it easier to catch any bugs in this area.	2018-08-07 15:02:37 +02:00
Paul Sanwald	3ce984d746	mute test while I work on #32215	2018-08-07 08:56:00 -04:00
Andrey Ershov	6449d9bc14	Include translog path in error message when translog is corrupted (#32251 ) Currently, when TranslogCorruptedException is thrown, most of the time it does not contain information about the translog location on the file system. There is the translog recovery tool that accepts the translog path as an argument and users are constantly puzzled about where to get the path. This pull request adds "source" information to every TranslogCorruptedException thrown. The source could be local file, remote translog source (used for recovery), assertion (translog entry is constructed to perform some assertion) or translog constructed inside the test. Closes #24929	2018-08-07 13:03:43 +02:00
Parth Verma	6fe6247dc8	Ignore script fields when size is 0 (#31917 ) This change adds a check so that when parsing the search source, script fields are ignored when the requested search result size is 0. This helps with e.g. clients like Kibana that send a list of script fields that they may need for convenience, but they don't require any hits. Before this change, users sometimes ran into confusing behaviour, e.g. hitting the script compilation limit although no hits were requested. Closes #31824	2018-08-07 10:56:44 +02:00
Armin Braun	f57cb10d2c	Tests: Fix Typo Causing Flaky Settings Test (#32665 ) * We were comparing the wrong timeout value in the `randomValueOtherThan` call here, leading to no mutation happening for a certain seed * closes #32639	2018-08-07 10:30:45 +02:00
Jason Tedor	3fb0923182	Fix content type detection with leading whitespace (#32632 ) Today content type detection on an input stream works by peeking up to twenty bytes into the stream. If the stream is headed by more whitespace than twenty bytes, we might fail to detect the content type. We should be ignoring this whitespace before attempting to detect the content type. This commit does that by ignoring all leading whitespace in an input stream before attempting to guess the content type.	2018-08-06 18:07:46 -04:00
Yannick Welsch	014b2772db	[TEST] Fix testReplicaTermIncrementWithConcurrentPrimaryPromotion The assertion in the test was not broad enough. If the timing is very unlucky, the shard is already promoted to primary before the indexOnReplica even gets to execute. Closes #32645	2018-08-06 18:38:01 +02:00
Armin Braun	0a67cb4133	LOGGING: Upgrade to Log4J 2.11.1 (#32616 ) * LOGGING: Upgrade to Log4J 2.11.1 * Upgrade to `2.11.1` to fix memory leaks in slow logger when logging large requests * This was caused by a bug in Log4J https://issues.apache.org/jira/browse/LOG4J2-2269 and is fixed in `2.11.1` via https://git-wip-us.apache.org/repos/asf?p=logging-log4j2.git;h=9496c0c * Fixes #32537 * Fixes #27300	2018-08-06 14:56:21 +02:00
Luca Cavanna	826399f9fc	Cross-cluster search: preserve cluster alias in shard failures (#32608 ) When some remote clusters return shard failures as part of a cross-cluster search request, the cluster alias currently gets lost. As a result, if the shard failures are all caused by the same error, and against indices belonging to different clusters, but with the same index name, only one failure gets returned as part of the search response, meaning that failures are grouped by index name, ignoring the cluster alias. With this commit we make sure that `ShardSearchFailure` returns the cluster alias as part of the index name. Also, we set the fully qualified index name when creating a `QueryShardException`. That way shard failures are grouped by cluster:index. Such fixes should cover at least most of the cases where either 1) the shard target is set but we don't have the index in the cause (we were previously reading it only from the cause that did not have the cluster alias) 2) the shard target is missing but if the cause is a `QueryShardException` the cluster alias does not get lost. We also prevent NPE in case the failure cause is not set and test such scenario.	2018-08-06 11:48:50 +02:00
Yannick Welsch	3cf08326ab	Handle AlreadyClosedException when bumping primary term If the shard is already closed while bumping the primary term, this can result in an AlreadyClosedException being thrown. As we use asyncBlockOperations, the exception will be thrown on a thread from the generic thread pool and end up in the uncaught exception handler, failing our tests. Relates to #32442	2018-08-06 08:34:38 +02:00
Armin Braun	6fa7016bbf	SCRIPTING: Move Aggregation Scripts to their own context (#32068 ) * SCRIPTING: Move Aggregation Scripts to their own context	2018-08-04 10:37:07 +02:00
Lee Hinman	1e4751ec47	[TEST] Enhance failure message when bulk updates have failures	2018-08-03 15:27:10 -06:00
Alexander Reelsen	018e77cac6	Core: Move helper date formatters over to java time (#32504 ) Some classes use internal date formatters, which now can be moved over to java time using the DateFormatters class. The same applies for a few test cases.	2018-08-03 13:21:14 +02:00
Colin Goodheart-Smithe	d05f39de8b	[TEST] Unmutes SearchAsyncActionTests and adds debugging info This unmutes the `testFanOutAndCollect()` method and adds a check to make sure we aren't accidentally running something twice, causing a search phase to still be running after we have counted down the latch. Relates to #29242	2018-08-03 11:52:46 +01:00
Yannick Welsch	0d60e8a029	Fix race between replica reset and primary promotion (#32442 ) We've recently seen a number of test failures that tripped an assertion in IndexShard (see issues linked below), leading to the discovery of a race between resetting a replica when it learns about a higher term and when the same replica is promoted to primary. This commit fixes the race by distinguishing between a cluster state primary term (called pendingPrimaryTerm) and a shard-level operation term. The former is set during the cluster state update or when a replica learns about a new primary. The latter is only incremented under the operation block, which can happen in a delayed fashion. It also solves the issue where a replica that's still adjusting to the new term receives a cluster state update that promotes it to primary, which can happen in the situation of multiple nodes being shut down in short succession. In that case, the cluster state update thread would call `asyncBlockOperations` in `updateShardState`, which in turn would throw an exception as blocking permits is not allowed while an ongoing block is in place, subsequently failing the shard. This commit therefore extends the IndexShardOperationPermits to allow it to queue multiple blocks (which will all take precedence over operations acquiring permits). Finally, it also moves the primary activation of the replication tracker under the operation block, so that the actual transition to primary only happens under the operation block. Relates to #32431, #32304 and #32118	2018-08-03 09:33:08 +02:00
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
Zachary Tong	080b9f58ea	[TEST] Test for shard failures, add debug to testProfileMatchesRegular Unmuting the test and adding some more debug output. Was not able to reproduce the prior failure, but it seems possible that the failure (mismatched counts) could be caused by partial search results during the test. The assertions check for shard failures first, because if one of the two searches is partial the rest of the test will fail. Next, instead of just checking respective hit counts, we emit the difference in hits to help identify what went wrong. Closes #32492	2018-08-02 17:18:29 -04:00
Nhat Nguyen	2c35db8043	TEST: Avoid merges in testSeqNoAndCheckpoints Since LUCENE-8263, testSeqNoAndCheckpoints might trigger merges because of the updates and deletes in the test. Our merge scheduler will trigger a flush if there is no pending merge. Those extra flushes will change the last committed segmentInfos in the engine and fail the test. This commit uses LogMergePolicy for the engine in the test to avoid merges. Closes #32430	2018-08-02 13:46:23 -04:00
Armin Braun	be31cc642b	INGEST: Enable default pipelines (#32286 ) * INGEST: Enable default pipelines * Add `default_pipeline` index setting * `_none` is interpreted as no pipeline * closes #21101	2018-08-02 17:11:12 +02:00
Yannick Welsch	db6e8c736d	Remove cluster state initial customs (#32501 ) This infrastructure was introduced in #26144 and made obsolete in #30743	2018-08-02 15:49:59 +02:00
Julie Tibshirani	5efc2ec9f7	Clarify the error message when a pipeline agg is used in the 'order' parameter. (#32522 )	2018-08-01 12:02:07 -07:00
Ryan Ernst	478f6d6cf1	Scripting: Conditionally use java time api in scripting (#31441 ) This commit adds a boolean system property, `es.scripting.use_java_time`, which controls the concrete return type used by doc values within scripts. The return type of accessing doc values for a date field is changed to Object, essentially duck typing the type to allow co-existence during the transition from joda time to java time.	2018-08-01 08:58:49 -07:00
Nik Everett	e7ead17893	Core: Minor size reduction for AbstractComponent (#32509 ) This removes a constructor from `AbstractComponent` and `AbstractLifecycleComponent` that we weren't using and it switches the logger creation away from one of the `Settings` flavored methods which are no longer needed.	2018-08-01 09:17:48 -04:00
Nhat Nguyen	67d53e5093	Mute testFilterCacheStats Tracked at #32506	2018-07-31 12:45:30 -04:00
Nik Everett	22459576d7	Logging: Make node name consistent in logger (#31588 ) First, some background: we have 15 different methods to get a logger in Elasticsearch but they can be broken down into three broad categories based on what information is provided when building the logger. Just a class like: ``` private static final Logger logger = ESLoggerFactory.getLogger(ActionModule.class); ``` or: ``` protected final Logger logger = Loggers.getLogger(getClass()); ``` The class and settings: ``` this.logger = Loggers.getLogger(getClass(), settings); ``` Or more information like: ``` Loggers.getLogger("index.store.deletes", settings, shardId) ``` The goal of the "class and settings" variant is to attach the node name to the logger. Because we don't always have the settings available, we often use the "just a class" variant and get loggers without node names attached. There isn't any real consistency here. Some loggers get the node name because it is convenient and some do not. This change makes the node name available to all loggers all the time. Almost. There are some caveats around testing that I'll get to. But in production code the node name is available to all loggers. This means we can stop using the "class and settings" variants to fetch loggers, which was the real goal here, but a pleasant side effect is that the node name is now consistent on every log line and optional by editing the logging pattern. This is all powered by setting the node name statically on a logging formatter very early in initialization. Now to tests: tests can't set the node name statically because subclasses of `ESIntegTestCase` run many nodes in the same JVM, even in the same class loader. Also, lots of tests don't run with a real node so they don't have a node name at all. To support multiple nodes in the same JVM, tests suss out the node name from the thread name, which works surprisingly well and is easy to test in a nice way. For those threads that are not part of an `ESIntegTestCase` node we stick whatever useful information we can get from the thread name in the place of the node name. This allows us to keep the logger format consistent.	2018-07-31 10:54:24 -04:00
Sohaib Iftikhar	4fa92cbf49	Changed ReindexRequest to use Writeable.Reader (#32401 ) -- This is a pre-stage for adding the reindex API to the REST high-level-client -- Follows the pattern set in #26315	2018-07-31 10:11:17 -04:00
Paul Sanwald	6f93911955	Fix AutoIntervalDateHistogram.testReduce random failures (#32301 ) 1. Refactor the test to use the same roundings as the implementation. 2. Refactor the test verification logic to use `innerIntervals` when rounding.	2018-07-31 08:52:16 -04:00
Daniel Mitterdorfer	9703d06321	Mute QueryProfilerIT#testProfileMatchesRegular() Relates #32492	2018-07-31 13:29:21 +02:00
Luca Cavanna	a3b272966d	High-level client: fix clusterAlias parsing in SearchHit (#32465 ) When using cross-cluster search through the high-level REST client, the cluster alias from each search hit was not parsed correctly. It would be part of the index field initially, but overridden just a few lines later once setting the shard target (in case we have enough info to build it from the response). In any case, getClusterAlias returns `null` which is a bug. With this change we rather parse back clusterAliases from the index name, set its corresponding field and properly handle the two possible cases depending on whether we can or cannot build the shard target object.	2018-07-31 09:41:51 +02:00
David Turner	8b57e2e5ba	Fix calculation of orientation of polygons (#27967 ) The method for working out whether a polygon is clockwise or anticlockwise is mostly correct but doesn't work in some rare cases such as the included test case. This commit fixes that.	2018-07-31 08:25:21 +01:00
Tal Levy	1e0fcebfe1	update rollover to leverage write-alias semantics (#32216 ) Rollover should not swap aliases when `is_write_index` is set to `true`. Instead, both the new and old indices should have the rollover alias, with the newly created index as the new write index. Updates Rollover to leverage the ability to preserve aliases and swap which is the write index. Historically, Rollover would swap which index had the designated alias for writing documents against. This required users to keep a separate read-alias that enabled reading against both rolled over and newly created indices, while the write-alias was being re-assigned at every rollover. With the ability for aliases to designate a write index, Rollover can be a bit more flexible with its use of aliases. Updates include: - Rollover validates that the target alias has a write index (the index that is being rolled over). This means that the restriction that aliases only point to one index is no longer necessary. - Rollover explicitly (and atomically) swaps which index is the write-index by explicitly assigning the existing index to have `is_write_index: false` and have the newly created index have its rollover alias as `is_write_index: true`. This is only done when `is_write_index: true` on the write index. Default behavior of removing the alias from the rolled over index stays when `is_write_index` is not explicitly set. Relevant things that are staying the same: - Rollover is rejected if there exist any templates that match the newly-created index and configure the rollover-alias - I think this existed to prevent the situation where an alias pointed to two indices for a short while. Although this can technically be relaxed, the specific cases that are safe are really particular and difficult to reason about, so leaving the broad restriction sounds good	2018-07-30 14:32:55 -07:00
Armin Braun	cf7489899a	INGEST: Clean up Java8 Stream Usage (#32059 ) * GrokProcessor: Rationalize the loop over the map to save allocations and indirection * IngestDocument: Rationalize way we append to `List`	2018-07-30 21:25:30 +02:00
Ioannis Kakavas	c2e3bebab9	Ensure KeyStoreWrapper decryption exceptions are handled (#32464 ) * Ensure decryption related exceptions are handled This commit ensures that all possible Exceptions in KeyStoreWrapper#decrypt() are handled. More specifically, in the case that a wrong password is used for secure settings, calling readX on the DataInputStream that wraps the CipherInputStream can throw an IOException. It also adds a test for loading a KeyStoreWrapper with a wrong password. Resolves #32411	2018-07-30 22:15:59 +03:00
Boaz Leskes	0cae19c8d7	IndicesClusterStateService should replace an init. replica with an init. primary with the same aId (#32374 ) In rare cases it is possible that a node gets an instruction to replace a replica shard that's in `POST_RECOVERY` with a new initializing primary with the same allocation id. This can happen by batching cluster states that include the starting of the replica, with closing of the indices, opening it up again and allocating the primary shard to the node in question. The node should then clean its initializing replica and replace it with a new initializing primary. I'm not sure whether the test I added really adds enough value as existing tests found this. The main reason I added it is to allow for simpler reproduction and to double check I fixed it. I'm open to discuss if we should keep it. Closes #32308	2018-07-30 16:24:41 +03:00
Luca Cavanna	9a4d0069f6	REST high-level client: parse back _ignored meta field (#32362 ) `GetResult` and `SearchHit` have been adjusted to parse back the `_ignored` meta field whenever it gets printed out. Expanded the existing tests to make sure this is covered. Fixed also a small problem around highlighted fields in `SearchHitTests`.	2018-07-30 13:43:40 +02:00
Nhat Nguyen	5b1ad8099b	TEST: testDocStats should always use forceMerge (#32450 ) Due to the recent change in LUCENE-8263, we need to adjust the deletion ratio to between 10% and 33% to preserve the current behavior of the test. However, we may need another refinement if soft-deletes is enabled as the actual deletes are different because of delete tombstones. This commit prefers to always execute forceMerge instead of adjusting the deletion ratio so that this test can focus on testing docStats. Closes #32449	2018-07-28 07:41:30 -04:00
Nhat Nguyen	6e98615cc1	TEST: Avoid deletion in FlushIT Due to the recent change in LUCENE-8263, a merge can be triggered if the deletion ratio is higher than 33%. An in-progress merge can prevent a synced-flush from issuing. This commit avoids deletes by using different docIds. Closes #32436	2018-07-27 23:14:24 -04:00
Nhat Nguyen	139631c77d	AwaitsFix IndexShardTests#testDocStats Relates #32449	2018-07-27 20:48:23 -04:00

1077 Commits