OpenSearch

Commit Graph

Author	SHA1	Message	Date
Paul Sanwald	3ce984d746	mute test while I work on #32215	2018-08-07 08:56:00 -04:00
Andrey Ershov	6449d9bc14	Include translog path in error message when translog is corrupted (#32251 ) Currently, when TranslogCorruptedException is thrown most of the times it does not contain information about the translog location on the file system. There is the translog recovery tool that accepts the translog path as an argument and users are constantly puzzled where to get the path. This pull request adds "source" information to every TranslogCorruptedException thrown. The source could be local file, remote translog source (used for recovery), assertion (translog entry is constructed to perform some assertion) or translog constructed inside the test. Closes #24929	2018-08-07 13:03:43 +02:00
Parth Verma	6fe6247dc8	Ignore script fields when size is 0 (#31917 ) This change adds a check so that when parsing the search source, script fields are ignored when the requested search result size is 0. This helps with e.g. clients like Kibana that sends a list of script fields that they may need for convenience, but they don't require any hits. Before this change, user sometimes ran into confusing behaviour, e.g. the script compilation limit to breaking although no hits were requested. Closes #31824	2018-08-07 10:56:44 +02:00
Armin Braun	f57cb10d2c	Tests: Fix Typo Causing Flaky Settings Test (#32665 ) * We were comparing the wrong timeout value in the `randomValueOtherThan` call here, leading to no mutation happening for a certain seed * closes #32639	2018-08-07 10:30:45 +02:00
Jason Tedor	3fb0923182	Fix content type detection with leading whitespace (#32632 ) Today content type detection on an input stream works by peeking up to twenty bytes into the stream. If the stream is headed by more whitespace than twenty bytes, we might fail to detect the content type. We should be ignoring this whitespace before attempting to detect the content type. This commit does that by ignoring all leading whitespace in an input stream before attempting to guess the content type.	2018-08-06 18:07:46 -04:00
Yannick Welsch	014b2772db	[TEST] Fix testReplicaTermIncrementWithConcurrentPrimaryPromotion The assertion in the test was not broad enough. If the timing is very unlucky, the shard is already promoted to primary before the indexOnReplica even gets to execute. Closes #32645	2018-08-06 18:38:01 +02:00
Armin Braun	0a67cb4133	LOGGING: Upgrade to Log4J 2.11.1 (#32616 ) * LOGGING: Upgrade to Log4J 2.11.1 * Upgrade to `2.11.1` to fix memory leaks in slow logger when logging large requests * This was caused by a bug in Log4J https://issues.apache.org/jira/browse/LOG4J2-2269 and is fixed in `2.11.1` via https://git-wip-us.apache.org/repos/asf?p=logging-log4j2.git;h=9496c0c * Fixes #32537 * Fixes #27300	2018-08-06 14:56:21 +02:00
Luca Cavanna	826399f9fc	Cross-cluster search: preserve cluster alias in shard failures (#32608 ) When some remote clusters return shard failures as part of a cross-cluster search request, the cluster alias currently gets lost. As a result, if the shard failures are all caused by the same error, and against indices belonging to different clusters, but with the same index name, only one failure gets returned as part of the search response, meaning that failures are grouped by index name, ignoring the cluster alias. With this commit we make sure that `ShardSearchFailure` returns the cluster alias as part of the index name. Also, we set the fully qualfied index name when creating a `QueryShardException`. That way shard failures are grouped by cluster:index. Such fixes should cover at least most of the cases where either 1) the shard target is set but we don't have the index in the cause (we were previously reading it only from the cause that did not have the cluster alias) 2) the shard target is missing but if the cause is a `QueryShardException` the cluster alias does not get lost. We also prevent NPE in case the failure cause is not set and test such scenario.	2018-08-06 11:48:50 +02:00
Yannick Welsch	3cf08326ab	Handle AlreadyClosedException when bumping primary term If the shard is already closed while bumping the primary term, this can result in an AlreadyClosedException to be thrown. As we use asyncBlockOperations, the exception will be thrown on a thread from the generic thread pool and end up in the uncaught exception handler, failing our tests. Relates to #32442	2018-08-06 08:34:38 +02:00
Armin Braun	6fa7016bbf	SCRIPTING: Move Aggregation Scripts to their own context (#32068 ) * SCRIPTING: Move Aggregation Scripts to their own context	2018-08-04 10:37:07 +02:00
Lee Hinman	1e4751ec47	[TEST] Enhance failure message when bulk updates have failures	2018-08-03 15:27:10 -06:00
Alexander Reelsen	018e77cac6	Core: Move helper date formatters over to java time (#32504 ) Some classes use internal date formatters, which now can be moved over to java time using the DateFormatters class. The same applies for a few test cases.	2018-08-03 13:21:14 +02:00
Colin Goodheart-Smithe	d05f39de8b	[TEST} unmutes SearchAsyncActionTests and adds debugging info This unmutes the testFanOutAndCollect()` method and add a check to make sure we aren't accidentally running something twice causing a search phase to still be running after we have counted down the latch Relates to #29242	2018-08-03 11:52:46 +01:00
Yannick Welsch	0d60e8a029	Fix race between replica reset and primary promotion (#32442 ) We've recently seen a number of test failures that tripped an assertion in IndexShard (see issues linked below), leading to the discovery of a race between resetting a replica when it learns about a higher term and when the same replica is promoted to primary. This commit fixes the race by distinguishing between a cluster state primary term (called pendingPrimaryTerm) and a shard-level operation term. The former is set during the cluster state update or when a replica learns about a new primary. The latter is only incremented under the operation block, which can happen in a delayed fashion. It also solves the issue where a replica that's still adjusting to the new term receives a cluster state update that promotes it to primary, which can happen in the situation of multiple nodes being shut down in short succession. In that case, the cluster state update thread would call `asyncBlockOperations` in `updateShardState`, which in turn would throw an exception as blocking permits is not allowed while an ongoing block is in place, subsequently failing the shard. This commit therefore extends the IndexShardOperationPermits to allow it to queue multiple blocks (which will all take precedence over operations acquiring permits). Finally, it also moves the primary activation of the replication tracker under the operation block, so that the actual transition to primary only happens under the operation block. Relates to #32431, #32304 and #32118	2018-08-03 09:33:08 +02:00
Shaunak Kashyap	0a83968650	Add cluster UUID to Cluster Stats API response (#32206 ) * Make cluster stats response contain cluster UUID * Updating constructor usage in Monitoring tests * Adding cluster_uuid field to Cluster Stats API reference doc * Adding rest api spec test for expecting cluster_uuid in cluster stats response * Adding missing newline * Indenting do section properly * Missed a spot! * Fixing the test cluster ID	2018-08-02 17:14:19 -07:00
Zachary Tong	080b9f58ea	[TEST] Test for shard failures, add debug to testProfileMatchesRegular Unmuting the test and adding some more debug output. Was not able to reproduce the prior failure, but it seems possible that the failure (mismatched counts) could be caused by partial search results during the test. The assertions check for shard failures first, because if one of the two searches is partial the rest of the test will fail. Next, instead of just checking respective hit counts, we emit the difference in hits to help identify what went wrong. Closes #32492	2018-08-02 17:18:29 -04:00
Nhat Nguyen	2c35db8043	TEST: Avoid merges in testSeqNoAndCheckpoints Since LUCENE-8263, testSeqNoAndCheckpoints might trigger merges because of the updates and deletes in the test. Our merge scheduler will trigger a flush if there is no pending merge. Those extra flushes will change the last committed segmentInfos in the engine and fail the test. This commit uses LogMergePolicy for the engine in the test to avoid merges. Closes #32430	2018-08-02 13:46:23 -04:00
Armin Braun	be31cc642b	INGEST: Enable default pipelines (#32286 ) * INGEST: Enable default pipelines * Add `default_pipeline` index setting * `_none` is interpreted as no pipeline * closes #21101	2018-08-02 17:11:12 +02:00
Yannick Welsch	db6e8c736d	Remove cluster state initial customs (#32501 ) This infrastructure was introduced in #26144 and made obsolete in #30743	2018-08-02 15:49:59 +02:00
Julie Tibshirani	5efc2ec9f7	Clarify the error message when a pipeline agg is used in the 'order' parameter. (#32522 )	2018-08-01 12:02:07 -07:00
Ryan Ernst	478f6d6cf1	Scripting: Conditionally use java time api in scripting (#31441 ) This commit adds a boolean system property, `es.scripting.use_java_time`, which controls the concrete return type used by doc values within scripts. The return type of accessing doc values for a date field is changed to Object, essentially duck typing the type to allow co-existence during the transition from joda time to java time.	2018-08-01 08:58:49 -07:00
Nik Everett	e7ead17893	Core: Minor size reduction for AbstractComponent (#32509 ) This removes a constructor from `AbstractComponent` and `AbstractLifecycleComponent` that we weren't using and it switches the logger creation away from one of the `Settings` flavored methods which are no longer needed.	2018-08-01 09:17:48 -04:00
Nhat Nguyen	67d53e5093	Mute testFilterCacheStats Tracked at #32506	2018-07-31 12:45:30 -04:00
Nik Everett	22459576d7	Logging: Make node name consistent in logger (#31588 ) First, some background: we have 15 different methods to get a logger in Elasticsearch but they can be broken down into three broad categories based on what information is provided when building the logger. Just a class like: ``` private static final Logger logger = ESLoggerFactory.getLogger(ActionModule.class); ``` or: ``` protected final Logger logger = Loggers.getLogger(getClass()); ``` The class and settings: ``` this.logger = Loggers.getLogger(getClass(), settings); ``` Or more information like: ``` Loggers.getLogger("index.store.deletes", settings, shardId) ``` The goal of the "class and settings" variant is to attach the node name to the logger. Because we don't always have the settings available, we often use the "just a class" variant and get loggers without node names attached. There isn't any real consistency here. Some loggers get the node name because it is convenient and some do not. This change makes the node name available to all loggers all the time. Almost. There are some caveats are testing that I'll get to. But in production code the node name is node available to all loggers. This means we can stop using the "class and settings" variants to fetch loggers which was the real goal here, but a pleasant side effect is that the ndoe name is now consitent on every log line and optional by editing the logging pattern. This is all powered by setting the node name statically on a logging formatter very early in initialization. Now to tests: tests can't set the node name statically because subclasses of `ESIntegTestCase` run many nodes in the same jvm, even in the same class loader. Also, lots of tests don't run with a real node so they don't have a node name at all. To support multiple nodes in the same JVM tests suss out the node name from the thread name which works surprisingly well and easy to test in a nice way. For those threads that are not part of an `ESIntegTestCase` node we stick whatever useful information we can get form the thread name in the place of the node name. This allows us to keep the logger format consistent.	2018-07-31 10:54:24 -04:00
Sohaib Iftikhar	4fa92cbf49	Changed ReindexRequest to use Writeable.Reader (#32401 ) -- This is a pre-stage for adding the reindex API to the REST high-level-client -- Follows the pattern set in #26315	2018-07-31 10:11:17 -04:00
Paul Sanwald	6f93911955	Fix AutoIntervalDateHistogram.testReduce random failures (#32301 ) 1. Refactor the test to use the same roundings as the implementation. 2. Refactor the test verification logic to use `innerIntervals` when rounding.	2018-07-31 08:52:16 -04:00
Daniel Mitterdorfer	9703d06321	Mute QueryProfilerIT#testProfileMatchesRegular() Relates #32492	2018-07-31 13:29:21 +02:00
Luca Cavanna	a3b272966d	High-level client: fix clusterAlias parsing in SearchHit (#32465 ) When using cross-cluster search through the high-level REST client, the cluster alias from each search hit was not parsed correctly. It would be part of the index field initially, but overridden just a few lines later once setting the shard target (in case we have enough info to build it from the response). In any case, getClusterAlias returns `null` which is a bug. With this change we rather parse back clusterAliases from the index name, set its corresponding field and properly handle the two possible cases depending on whether we can or cannot build the shard target object.	2018-07-31 09:41:51 +02:00
David Turner	8b57e2e5ba	Fix calculation of orientation of polygons (#27967 ) The method for working out whether a polygon is clockwise or anticlockwise is mostly correct but doesn't work in some rare cases such as the included test case. This commit fixes that.	2018-07-31 08:25:21 +01:00
Tal Levy	1e0fcebfe1	update rollover to leverage write-alias semantics (#32216 ) Rollover should not swap aliases when `is_write_index` is set to `true`. Instead, both the new and old indices should have the rollover alias, with the newly created index as the new write index Updates Rollover to leverage the ability to preserve aliases and swap which is the write index. Historically, Rollover would swap which index had the designated alias for writing documents against. This required users to keep a separate read-alias that enabled reading against both rolled over and newly created indices, whiles the write-alias was being re-assigned at every rollover. With the ability for aliases to designate a write index, Rollover can be a bit more flexible with its use of aliases. Updates include: - Rollover validates that the target alias has a write index (the index that is being rolled over). This means that the restriction that aliases only point to one index is no longer necessary. - Rollover explicitly (and atomically) swaps which index is the write-index by explicitly assigning the existing index to have `is_write_index: false` and have the newly created index have its rollover alias as `is_write_index: true`. This is only done when `is_write_index: true` on the write index. Default behavior of removing the alias from the rolled over index stays when `is_write_index` is not explicitly set Relevant things that are staying the same: - Rollover is rejected if there exist any templates that match the newly-created index and configure the rollover-alias - I think this existed to prevent the situation where an alias pointed to two indices for a short while. Although this can technically be relaxed, the specific cases that are safe are really particular and difficult to reason, so leaving the broad restriction sounds good	2018-07-30 14:32:55 -07:00
Armin Braun	cf7489899a	INGEST: Clean up Java8 Stream Usage (#32059 ) * GrokProcessor: Rationalize the loop over the map to save allocations and indirection * IngestDocument: Rationalize way we append to `List`	2018-07-30 21:25:30 +02:00
Ioannis Kakavas	c2e3bebab9	Ensure KeyStoreWrapper decryption exceptions are handled (#32464 ) * Ensure decryption related exceptions are handled This commit ensures that all possible Exceptions in KeyStoreWrapper#decrypt() are handled. More specifically, in the case that a wrong password is used for secure settings, calling readX on the DataInputStream that wraps the CipherInputStream can throw an IOException. It also adds a test for loading a KeyStoreWrapper with a wrong password. Resolves #32411	2018-07-30 22:15:59 +03:00
Boaz Leskes	0cae19c8d7	IndicesClusterStateService should replace an init. replica with an init. primary with the same aId (#32374 ) In rare cases it is possible that a nodes gets an instruction to replace a replica shard that's in `POST_RECOVERY` with a new initializing primary with the same allocation id. This can happen by batching cluster states that include the starting of the replica, with closing of the indices, opening it up again and allocating the primary shard to the node in question. The node should then clean it's initializing replica and replace it with a new initializing primary. I'm not sure whether the test I added really adds enough value as existing tests found this. The main reason I added is to allow for simpler reproduction and to double check I fixed it. I'm open to discuss if we should keep. Closes #32308	2018-07-30 16:24:41 +03:00
Luca Cavanna	9a4d0069f6	REST high-level client: parse back _ignored meta field (#32362 ) `GetResult` and `SearchHit` have been adjusted to parse back the `_ignored` meta field whenever it gets printed out. Expanded the existing tests to make sure this is covered. Fixed also a small problem around highlighted fields in `SearchHitTests`.	2018-07-30 13:43:40 +02:00
Nhat Nguyen	5b1ad8099b	TEST: testDocStats should always use forceMerge (#32450 ) Due to the recent change in LUCENE-8263, we need to adjust the deletion ration to between 10% to 33% to preserve the current behavior of the test. However, we may need another refinement if soft-deletes is enabled as the actual deletes are different because of delete tombstones. This commit prefers to always execute forceMerge instead of adjusting the deletion ratio so that this test can focus on testing docStats. Closes #32449	2018-07-28 07:41:30 -04:00
Nhat Nguyen	6e98615cc1	TEST: Avoid deletion in FlushIT Due to the recent change in LUCENE-8263, a merge can be triggered if the deletion ration is higher than 33%. An in-progress merge can prevent a synced-flush from issuing. This commit avoids deletes by using different docIds. Closes #32436	2018-07-27 23:14:24 -04:00
Nhat Nguyen	139631c77d	AwaitsFix IndexShardTests#testDocStats Relates #32449	2018-07-27 20:48:23 -04:00
javanna	dcb5d24639	[TEST] Mute failing FlushIT test See #32436	2018-07-27 17:10:29 +02:00
javanna	7aa5365497	[TEST] Mute failing InternalEngineTests#testSeqNoAndCheckpoints	2018-07-27 14:41:32 +02:00
Jim Ferenczi	5decb23687	bump lucene version after backport	2018-07-27 10:50:22 +02:00
Jim Ferenczi	53ff06e621	Upgrade to Lucene-7.5.0-snapshot-608f0277b0 (#32390 ) The main highlight is the removal of the reclaim_deletes_weight in the TieredMergePolicy. The es setting index.merge.policy.reclaim_deletes_weight is deprecated in this commit and the value is ignored. The new merge policy setting setDeletesPctAllowed should be added in a follow up.	2018-07-27 08:28:51 +02:00
Jim Ferenczi	860f92fcdd	Copy missing segment attributes in getSegmentInfo (#32396 ) The index sort and the attributes map of a segment are not copied on committed segments that are not loaded by the internal or external searcher.	2018-07-26 20:29:27 +02:00
Jim Ferenczi	8e5f281b27	AbstractQueryTestCase should run without type less often (#28936 ) This commit changes the randomization to always create an index with a type. It also adds a way to create a query shard context that maps to an index with no type registered in order to explicitely test cases where there is no type.	2018-07-26 20:29:05 +02:00
Armin Braun	57876bfeb9	INGEST: Fix Deprecation Warning in Script Proc. (#32407 ) * Using short script form normalized to a map that used 'inline' instead of 'source' so a short form processor definition like: ``` { "script": "ctx.foo= 'bar'" } ``` would always warn about the following deprecation: ``` #! Deprecation: Deprecated field [inline] used, expected [source] ```	2018-07-26 19:55:28 +02:00
Nhat Nguyen	0ed3458534	TEST: Avoid triggering merges in FlushIT In testSyncedFlushSkipOutOfSyncReplicas, we reindex the extra documents to all shards including the out-of-sync replica. However, reindexing to that replica can trigger merges (due to the new deletes) which cause the synced-flush failed. This test starts failing after we aggressively trigger merges segments with a large number of deletes in LUCENE-8263.	2018-07-26 12:38:36 -04:00
Christoph Büscher	35ae87125d	Remove some dead code (#31993 ) Removing some dead code or supressing warnings where apropriate. Most of the time the variable tested for null is dereferenced earlier or never used before.	2018-07-26 17:12:51 +02:00
Jason Tedor	eb675a1c4d	Introduce index store plugins (#32375 ) Today we allow plugins to add index store implementations yet we are not doing this in our new way of managing plugins as pull versus push. That is, today we still allow plugins to push index store providers via an on index module call where they can turn around and add an index store. Aside from being inconsistent with how we manage plugins today where we would look to pull such implementations from plugins at node creation time, it also means that we do not know at a top-level (for example, in the indices service) which index stores are available. This commit addresses this by adding a dedicated plugin type for index store plugins, removing the index module hook for adding index stores, and by aggregating these into the top-level of the indices service.	2018-07-26 08:05:49 -04:00
Adrien Grand	f7ba14d5f4	Make sure _forcemerge respects `max_num_segments`. (#32291 ) An upcoming [Lucene change](https://issues.apache.org/jira/browse/LUCENE-7976) will make TieredMergePolicy respect the maximum merged segment size all the time, meaning it will possibly not respect the `max_num_segments` parameter anymore if the shard is larger than the maximum segment size. This change makes sure that `max_num_segments` is respected for now in order to give us time to think about how to integrate this change, and also to delay it until 7.0 as this might be a big-enough change for us to wait for a new major version.	2018-07-26 09:02:58 +02:00
Dimitris Athanasiou	9efd1407d5	Add 6.5.0 version to master This commit adds the 6.5.0 version constant to the master branch.	2018-07-25 15:57:48 +01:00
Julie Tibshirani	54ba3ea286	Undo a debugging change that snuck in during the field aliases merge.	2018-07-24 13:19:37 -07:00

1 2 3 4 5 ...

964 Commits