OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jonathan Little	9d92a87ae6	Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979 )	2018-08-28 09:27:43 +01:00
Alpar Torok	2cc611604f	Run Third party audit with forbidden APIs CLI (part3/3) (#33052 ) The new implementation is functional equivalent with the old, ant based one. It parses task standard error to get the missing classes and violations in the same way. I considered re-using ForbiddenApisCliTask but Gradle makes it hard to build inheritance with tasks that have task actions , since the order of the task actions can't be controlled. This inheritance isn't dully desired either as the third party audit task is much more opinionated and we don't want to expose some of the configuration. We could probably extract a common base class without any task actions, but probably more trouble than it's worth. Closes #31715	2018-08-28 10:03:30 +03:00
Nhat Nguyen	014b3236dc	Ensure to generate identical NoOp for the same failure (#33141 ) We generate slightly different NoOps in InternalEngine and TransportShardBulkAction for the same failure. 1. InternalEngine uses Exception#getFailure to generate a message without the class name: newOp [NoOp{seqNo=1, primaryTerm=1, reason='Contexts are mandatory in context enabled completion field [suggest_context]'}]. 2. TransportShardBulkAction uses Exception#toString to generate a message with the class name: NoOp{seqNo=1, primaryTerm=1, reason='java.lang.IllegalArgumentException: Contexts are mandatory in context enabled completion field [suggest_context]'}. If a write operation fails while a replica is recovering, that replica will possibly receive two different NoOps: one from recovery and one from replication. These two different NoOps will trip TranslogWriter#assertNoSeqNumberConflict assertion. This commit ensures that we generate the same Noop for the same failure. Closes #32986	2018-08-27 15:59:42 -04:00
Luca Cavanna	ed0571e16c	ShardSearchFailure#readFrom to set index and shardId (#33161 ) As part of recent changes made to `ShardOperationFailedException` we introduced `index` and `shardId` members to the base class, but the subclasses are entirely responsible for the serialization of such fields. In the case of `ShardSearchFailure`, we have an additional `SearchShardTarget` instance member which also holds the index and the shardId, hence they get serialized as part of `SearchShardTarget` itself. When de-serializing a `ShardSearchFailure` though, we need to remember to also set the parent class `index` and `shardId` fields otherwise they get lost Relates to #32640	2018-08-27 20:31:27 +02:00
Jason Tedor	318df2a107	Adjust BWC version on mapping version The introduction of mapping version on index metadata has been backported to 6.x. This commit adjusts the BWC version around mapping version to account for this backport.	2018-08-27 13:17:15 -04:00
Jason Tedor	2aef7e0900	Introduce mapping version to index metadata (#33147 ) This commit introduces mapping version to index metadata. This value is monotonically increasing and is updated on mapping updates. This will be useful in cross-cluster replication so that we can request mapping updates from the leader only when there is a mapping update as opposed to the strategy we employ today which is to request a mapping update any time there is an index metadata update. As index metadata updates can occur for many reasons other than mapping updates, this leads to some unnecessary requests and work in cross-cluster replication.	2018-08-27 12:21:11 -04:00
Mikita Karaliou	f1f6d4ed33	Support only string `format` in date, root object & date range (#28117 ) Limit date `format` attribute to String values only. Closes #23650	2018-08-27 12:24:51 +02:00
Daniel Mitterdorfer	06c0055c0f	Have circuit breaker succeed on unknown mem usage With this commit we implement a workaround for https://bugs.openjdk.java.net/browse/JDK-8207200 which is a race condition in the JVM that results in `IllegalArgumentException` to be thrown in rare cases when we determine memory usage via `MemoryMXBean`. As we do not want to fail requests in those cases we always return zero memory usage. Relates #31767 Relates #33125	2018-08-27 07:09:27 +02:00
Jason Tedor	143cd9bbaa	Do not lose default mapper on metadata updates (#33153 ) When applying index metadata updates we run through the mappings updating them if needed. Today if there is not an update to the default mapper, we can lose the default mapping. This means that, for example, if we apply a settings update to an index we will lose the default mapper. This happens because we were not guarding updating the default mapping with a check that the default mapping was updated in the metadata update. When there is no update in the metadata update, we need to continue to preserve the previous default mapping. This commit achieves this by moving the updating of the default mapping under the same guard that we use for updating the default mapping source. We add a test that fails before putting the update under a guard and now passes after moving the update under the guard.	2018-08-26 15:57:52 -04:00
Jason Tedor	f8b07a0d84	Fix a mappings update test (#33146 ) This commit fixes a mappings update test. The test is broken in the sense that it passes, but for the wrong reason. The test here is testing that if we make a mapping update but do not commit that mapping update then the mapper service still maintains the previous document mapper. This was not the case long, long ago when a mapping update would update the in-memory state before the cluster state update was committed. This test was passing, but it was passing because the mapping update was never even updated. It was never even updated because it was encountering a null pointer exception. Of course the in-memory state is not going to be updated in that case, we are simply going to end up with a failed cluster state update. Fixing that leads to another issue which is that the mapping source does not even parse so again we would, of course, end up with the in-memory state not being modified. We fix these issues, assert that the result cluster state task completed successfully, and finally that the in-memory state was not updated since we never committed the resulting cluster state.	2018-08-26 09:36:17 -04:00
Simon Willnauer	3376922e8b	Add proxy support to RemoteClusterConnection (#33062 ) This adds support for connecting to a remote cluster through a tcp proxy. A remote cluster can configured with an additional `search.remote.$clustername.proxy` setting. This proxy will be used to connect to remote nodes for every node connection established. We still try to sniff the remote clsuter and connect to nodes directly through the proxy which has to support some kind of routing to these nodes. Yet, this routing mechanism requires the handshake request to include some kind of information where to route to which is not yet implemented. The effort to use the hostname and an optional node attribute for routing is tracked in #32517 Closes #31840	2018-08-25 20:41:32 +02:00
Nhat Nguyen	9dad82ece8	TEST: Skip assertSeqNos for closed shards (#33130 ) If a shard was closed, we return null for SeqNoStats. Therefore the assertion assertSeqNos will hit NPE when it verifies a closed shard. This commit skips closed shards in assertSeqNos and enables this assertion in AbstractDisruptionTestCase.	2018-08-24 21:02:13 -04:00
Nik Everett	a023e64801	Checkstyle! Catching your unused imports since 2001.	2018-08-24 14:13:13 -04:00
Jim Ferenczi	70030c18f1	[Test] Fix sporadic failure in MembershipActionTests Rewrite test that require Version.V_5 constants.	2018-08-24 18:40:04 +02:00
Mayya Sharipova	6f1ee76443	Revert "Do NOT allow termvectors on nested fields (#32728 )" This reverts commit `fdff8f3db0`.	2018-08-24 10:12:16 -04:00
Jim Ferenczi	f4e9729d64	Remove unsupported Version.V_5_* (#32937 ) This change removes the es 5x version constants and their usages.	2018-08-24 09:51:21 +02:00
Michael Basnight	8f16696fe1	Add versions 5.6.12 and 6.4.1	2018-08-23 15:49:14 -05:00
Mayya Sharipova	fdff8f3db0	Do NOT allow termvectors on nested fields (#32728 ) Requesting _termvectors on a nested field or any sub-fields of a nested field returns empty results. Closes #21625	2018-08-23 16:46:47 -04:00
Simon Willnauer	f3cfd4504f	Use `addIfAbsent` instead of checking if an element is contained Relates to #32988	2018-08-23 13:43:23 +02:00
Ignacio Vera	d7219c05a2	Search: Support of wildcard on docvalue_fields (#32980 ) * Search: Support of wildcard on docvalue_fields For consistency with stored_fields, docvalue_fields should support the use of wildcards. Documentation of doc values fields is updated accordingly. See also: #26390 Closes #26299	2018-08-23 10:04:00 +02:00
Jim Ferenczi	ffe895e16e	Change query field expansion (#33020 ) This commit changes the query field expansion for query parsers to not rely on an hardcoded list of field types. Instead we rely on the type of exception that is thrown by MappedFieldType#termQuery to include/exclude an expanded field. Supersedes #31655 Closes #31798	2018-08-23 09:52:48 +02:00
Armin Braun	46247ff1f9	INGEST: Cleanup Redundant Put Method (#33034 )	2018-08-23 07:43:36 +02:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to Nan rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.Nan` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with lucene which set `maxScore` to `Float.Nan` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
Jason Tedor	67bfb765ee	Refactor Netty4Utils#maybeDie (#33021 ) In our Netty layer we have had to take extra precautions against Netty catching throwables which prevents them from reaching the uncaught exception handler. This code has taken on additional uses in NIO layer and now in the scheduler engine because there are other components in stack traces that could catch throwables and suppress them from reaching the uncaught exception handler. This commit is a simple cleanup of the iterative evolution of this code to refactor all uses into a single method in ExceptionsHelper.	2018-08-22 10:18:07 -04:00
Simon Willnauer	ead198bf2e	Add settings updater for 2 affix settings (#33050 ) Today we can only have non-affix settings updated and consumed _together_. Yet, there are use-cases where two affix settings depend on each other which makes using the hard without consuming updates together. Unfortunately, there is not straight forward way to have N settings updated together in a type-safe way having 2 still serves a large portion of use-cases.	2018-08-22 14:13:27 +02:00
Nhat Nguyen	262d3c0783	Allow engine to recover from translog upto a seqno (#33032 ) This change allows an engine to recover from its local translog up to the given seqno. The extended API can be used in these use cases: When a replica starts following a new primary, it resets its index to the safe commit, then replays its local translog up to the current global checkpoint (see #32867). When a replica starts a peer-recovery, it can initialize the start_sequence_number to the persisted global checkpoint instead of the local checkpoint of the safe commit. A replica will then replay its local translog up to that global checkpoint before accepting remote translog from the primary. This change will increase the chance of operation-based recovery. I will make this in a follow-up. Relates #32867	2018-08-22 07:57:44 -04:00
Simon Willnauer	ffb1a5d5b7	Expose `max_concurrent_shard_requests` in `_msearch` (#33016 ) Today `_msearch` doesn't allow modifying the `max_concurrent_shard_requests` per sub search request. This change adds support for setting this parameter on all sub-search requests in an `_msearch`. Relates to #31877	2018-08-22 08:45:08 +02:00
Julie Tibshirani	67b5a83a9a	Ensure that _exists queries on keyword fields use norms when they're available. (#33006 )	2018-08-21 16:33:42 -07:00
Jim Ferenczi	767c69593c	Fix quoted _exists_ query (#33019 ) This change in the `query_string` query fixes the detection of the special `_exists_` field when it is used with a quoted term. Closes #28922	2018-08-21 22:15:09 +02:00
Jim Ferenczi	8b43e21521	Fix multi fields empty query (#33017 ) This change fixes empty query removal when all fields remove the search term in `simple_query_string`, `multi_match` and `query_string`. Closes #33009	2018-08-21 22:12:53 +02:00
Igor Motov	3973bb4028	Fix north pole overflow error in GeoHashUtils.bbox() (#32891 ) Fixes an overflow error in GeoHashUtils.bbox() calculation of a bounding box for geohashes with maximum precision located next to the north pole.	2018-08-21 14:59:37 -04:00
Jason Tedor	bdfcc326d7	Enable avoiding mmap bootstrap check (#32421 ) The maximum map count boostrap check can be a hindrance to users that do not own the underlying platform on which they are executing Elasticsearch. This is because addressing it requires tuning the kernel and a platform provider might now allow this, especially on shared infrastructure. However, this bootstrap check is not needed if mmapfs is not in use. Today we do not have a way for the user to communicate that they are not going to use mmapfs. This commit therefore adds a setting that enables the user to disallow mmapfs. When mmapfs is disallowed, the maximum map count bootstrap check is not enforced. Additionally, we fallback to a different default index store and prevent the explicit use of mmapfs for an index.	2018-08-21 11:02:25 -04:00
Simon Willnauer	92076497e5	Use a dedicated ConnectionManger for RemoteClusterConnection (#32988 ) This change introduces a dedicated ConnectionManager for every RemoteClusterConnection such that there is not state shared with the TransportService internal ConnectionManager. All connections to a remote cluster are isolated from the TransportService but still uses the TransportService and it's internal properties like the Transport, tracing and internal listener actions on disconnects etc. This allows a remote cluster connection to have a different lifecycle than a local cluster connection, also local discovery code doesn't get notified if there is a disconnect on from a remote cluster and each connection can use it's own dedicated connection profile which allows to have a reduced set of connections per cluster without conflicting with the local cluster. Closes #31835	2018-08-21 12:43:25 +02:00
Armin Braun	200078734c	INGEST: Simplify IngestService (#33008 ) * INGEST: Simplify IngestService * Follow up to #32617 * Flatten redundant inner classes of `IngestService`	2018-08-21 10:13:32 +02:00
Armin Braun	8fc213f237	INGEST: Move all Pipeline State into IngestService (#32617 ) * INGEST: Move all Pipeline State into IngestService * Moves all pipeline state into the ingest service * Retains the existing pipeline store and pipeline execution service as inner classes to make the review easier, they should be flattened out in the next step * All tests for these classes were copied (and adapted) to the ingest service tests * This is a refactoring step to enable a clean implementation of a pipeline processor (See #32473)	2018-08-21 05:05:32 +02:00
Jason Tedor	ad0a965db9	Protect scheduler engine against throwing listeners (#32998 ) There are two problems with the scheduler engine today. Both relate to listeners that throw. The first problem is that any triggered listener that throws a plain old exception will cause no additional listeners to be triggered for the event, and will also cause the scheduler to never be invoked again. This leads to lost events and is bad. The second problem is that any triggered listener that throws an error of the fatal kind will not lead to that error because caught by the uncaught exception handler. This is because the triggered listener is executed as a future task under a scheduled thread pool executor. A throwable there goes caught by the JDK framework and set as the outcome on the future task. Since we never inspect these tasks for their outcomes, nor is there a good place to do this, we have to handle these errors ourselves. To do this, we catch them and dispatch them to the uncaught exception handler via a forked thread. This is similar to our handling in Netty.	2018-08-20 22:07:16 -04:00
Nhat Nguyen	40f1bb5e5e	Trim translog when safe commit advanced (#32967 ) Since #28140 when the global checkpoint is advanced, we try to move the safe commit forward, and clean up old index commits if possible. However, we forget to trim unreferenced translog. This change makes sure that we prune both old translog and index commits when the safe commit advanced. Relates #28140 Closes #32089	2018-08-20 15:13:19 -04:00
Nik Everett	462e91d362	Logging: Use settings when building daemon threads (#32751 ) Subclasses of `EsIntegTestCase` run multiple Elasticsearch nodes in the same JVM and when we log we look at the name of the thread to figure out the node name. This makes sure that all calls to `daemonThreadFactory` include the node name. Closes #32574 I'd like to follow this up with more drastic changes that make it impossible to do this incorrectly but that change is much larger than this and I'd like to get these log lines fixed up sooner rather than later.	2018-08-20 13:53:15 -04:00
Andrey Ershov	0749b18181	All Translog inner closes should happen after tragedy exception is set (#32674 ) All Translog inner closes should happen after tragedy exception is set (#32674) We faced with the nasty race condition. See #32526 InternalEngine.failOnTragic method has thrown AssertionError. If you carefully look at if branches in this method, you will spot that its only possible, if either Lucene IndexWriterhas closed from inside or Translog, has closed from inside, but tragedy exception is not set. For now, let us concentrate on the Translog class. We found out that there are two methods in Translog - namely rollGeneration and trimOperations that are closing Translog in case of Exception without tragedy exception being set. This commit fixes these 2 methods. To fix it, we pull tragedyException from TranslogWriter up-to Translog class, because in these 2 methods IndexWriter could be innocent, but still Translog needs to be closed. Also, tragedyException is wrapped with TragicExceptionHolder to reuse CAS/addSuppresed functionality in Translog and TranslogWriter. Also to protect us in the future and make sure close method is never called from inside Translog special assertion examining stack trace is added. Since we're still targeting Java 8 for runtime - no StackWalker API is used in the implementation. In the stack-trace checking method, we're considering inner caller not only Translog methods but Translog child classes methods as well. It does mean that Translog is meant for extending it, but it's needed to be able to test this method. Closes #32526	2018-08-20 19:22:10 +02:00
Tim Brooks	faa42de66d	Pass DiscoveryNode to initiateChannel (#32958 ) This is related to #32517. This commit passes the DiscoveryNode to the initiateChannel method for different Transport implementation. This will allow additional attributes (besides just the socket address) to be used when opening channels.	2018-08-20 08:54:55 -06:00
Jonathan Little	676091aafb	Protect ScriptedMetricIT test cases against failures on 0-doc shards (#32959 ) (#32968 ) Randomized test conditions that cause some shards to have no docs on them failed due to test asserts that relied on a lazy initialization side effect from the map script. After this fix: - Test cases with the relevant init script are protected - Test cases with the relevant combine or reduce scripts were already protected, because the combine and reduce scripts safely handle this case.	2018-08-20 08:55:43 +01:00
Alpar Torok	4b34b3f4aa	Set forbidden APIs target compatibility to compiler java version (#32935 ) Set forbidden apis target compatibility to compiler version Fix outstanding deprecation	2018-08-20 09:27:02 +03:00
Tim Brooks	de92d2ef1f	Move connection listener to ConnectionManager (#32956 ) This is a followup to #31886. After that commit the TransportConnectionListener had to be propogated to both the Transport and the ConnectionManager. This commit moves that listener to completely live in the ConnectionManager. The request and response related methods are moved to a TransportMessageListener. That listener continues to live in the Transport class.	2018-08-18 10:09:24 -06:00
Armin Braun	f82bb64feb	NETWORKING: Make RemoteClusterConn. Lazy Resolve DNS (#32764 ) * Lazy resolve DNS (i.e. `String` to `DiscoveryNode`) to not run into indefinitely caching lookup issues (provided the JVM dns cache is configured correctly as explained in https://www.elastic.co/guide/en/elasticsearch/reference/6.3/networkaddress-cache-ttl.html) * Changed `InetAddress` type to `String` for that higher up the stack * Passed down `Supplier<DiscoveryNode>` instead of outright `DiscoveryNode` from `RemoteClusterAware#buildRemoteClustersSeeds` on to lazy resolve DNS when the `DiscoveryNode` is actually used (could've also passed down the value of `clusterName = REMOTE_CLUSTERS_SEEDS.getNamespace(concreteSetting)` together with the `List<String>` of hosts, but this route seemed to introduce less duplication and resulted in a significantly smaller changeset). * Closes #28858	2018-08-18 08:46:44 +02:00
Nhat Nguyen	86ffce4bbc	TEST: Mute testRetentionPolicyChangeDuringRecovery Tracked at #32089	2018-08-17 14:12:45 -04:00
Igor Motov	da6b61e8ef	Make Geo Context Mapping Parsing More Strict (#32821 ) Currently, if geo context is represented by something other than geo_point or an object with lat and lon fields, the parsing of it as a geo context can result in ignoring the context altogether, returning confusing errors such as number_format_exception or trying to parse the number specifying as long-encoded hash code. It would also fail if the geo_point was stored. This commit makes the mapping parsing more strict and will fail during mapping update or index creation if the geo context doesn't point to a geo_point field. Supersedes #32412 Closes #32202	2018-08-17 08:13:16 -07:00
Jonathan Little	a08127c072	Scripted metric aggregations: add deprecation warning and system property to control legacy params (#31597 ) * Scripted metric aggregations: add deprecation warning and system property to control legacy params Scripted metric aggregation params._agg/_aggs are replaced by state/states context variables. By default the old params are still present, and a deprecation warning is emitted when Scripted Metric Aggregations are used. A new system property can be used to disable the legacy params. This functionality will be removed in a future revision. * Fix minor style issue and docs test failure * Disable deprecated params._agg/_aggs in tests and revise tests to use state/states instead * Add integration test covering deprecated scripted metrics aggs params._agg/_aggs access * Disable deprecated params._agg/_aggs in docs integration tests and revise stored scripts to use state/states instead * Revert unnecessary migrations doc change A relevant note should be added in the changes destined for 7.0; this PR is going to be backported to 6.x. * Replace deprecated _agg param bwc integration test with a couple of unit tests * Fix compatibility test after merge * Rename backwards compatibility system property per code review feedback * Tweak deprecation warning text per review feedback	2018-08-17 13:11:18 +01:00
Alexander Reelsen	0d92f377fd	Tests: Fix timezone conversion in DateTimeUnitTests This fix prevernts trying to parse unknown timezone ids by converting the joda time zone via java.util.TimeZone to a java time based ZoneId. Closes #32927	2018-08-17 14:09:01 +02:00
Paul Sanwald	ca54aacbb5	Fix InternalAutoDateHistogram reproducible failure (#32723 ) Update test logic to correctly bucket intervals.	2018-08-17 07:03:25 -04:00
Andrey Ershov	2fa028cfa1	Remove assertion in testDocStats on deletedDocs counter (#32914 ) testDocStats test is flaky and sometimes it's failing on jenkins and failure is not reproducible locally. The reason for this failure is in timing. If the number of deleted documents is greater than 33% of inserted documents, Lucene will schedule segments to merge if TieredMergePolicy is used (it's not the case for LogMergePolicy, but ES is only using TieredMergePolicy). If this merge is performed before stats are retrieved - we will get 0 for "deleted" counter. So basically this counter could be either 0 or numOfDeletedDocs at this point, but this is the too loose assertion and we decided to remove it at all. Closes #32766	2018-08-17 12:36:45 +02:00

1 2 3 4 5 ...

1056 Commits