OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nhat Nguyen	2a8381d3fa	Avoid sending duplicate remote failed shard requests (#31313 ) Today if a write replication request fails, we will send a shard-failed message to the master node to fail that replica. However, if there are many ongoing write replication requests and the master node is busy, we might overwhelm the cluster and the master node with many shard-failed requests. This commit tries to minimize the shard-failed requests in the above scenario by caching the ongoing shard-failed requests. This issue was discussed at https://discuss.elastic.co/t/half-dead-node-lead-to-cluster-hang/113658/25.	2018-06-18 15:05:34 -04:00
Igor Motov	d9a6d69a0d	Fix defaults in GeoShapeFieldMapper output (#31302 ) GeoShapeFieldMapper should show actual defaults instead of placeholder values when the mapping is requested with include_defaults=true. Closes #23206	2018-06-18 13:50:52 -04:00
Ryan Ernst	340313b048	RestAPI: Reject forcemerge requests with a body (#30792 ) This commit adds validation to forcemerge rest requests which contain a body. All parameters to force merge must be part of http params. closes #29584	2018-06-18 19:03:46 +02:00
Yannick Welsch	02a4ef38a7	Use system context for cluster state update tasks (#31241 ) This commit makes it so that cluster state update tasks always run under the system context, only restoring the original context when the listener that was provided with the task is called. A notable exception is the clusterStatePublished(...) callback which will still run under system context, because it's defined on the executor-level, and not the task level, and only called once for the combined batch of tasks and can therefore not be uniquely identified with a task / thread context. Relates #30603	2018-06-18 16:46:04 +02:00
Zachary Tong	1502812c1a	Percentile/Ranks should return null instead of NaN when empty (#30460 ) The other metric aggregations (min/max/etc) return `null` as their XContent value and string when nothing was computed (due to empty/missing fields). Percentiles and Percentile Ranks, however, return `NaN `which is inconsistent and confusing for the user. This fixes the inconsistency by making the aggs return `null`. This applies to both the numeric value and the "as string" value. Note: like the metric aggs, this does not change the value if fetched directly from the percentiles object, which will return as `NaN`/`"NaN"`. This only changes the XContent output. While this is a bugfix, it still breaks bwc in a minor way as the response changes from prior version. Closes #29066	2018-06-18 10:01:28 -04:00
Sohaib Iftikhar	c4f8df3ad6	REST high-level client: add validate query API (#31077 ) Adds the validate query API to the high level rest client.	2018-06-18 09:59:29 -04:00
Martijn van Groningen	47095357bc	Move language analyzers from server to analysis-common module. (#31300 ) The following analyzers were moved from server module to analysis-common module: `greek`, `hindi`, `hungarian`, `indonesian`, `irish`, `italian`, `latvian`, `lithuanian`, `norwegian`, `persian`, `portuguese`, `romanian`, `russian`, `sorani`, `spanish`, `swedish`, `turkish` and `thai`. Relates to #23658	2018-06-18 11:24:43 +02:00
Albert Zaharovits	3378240b29	Reload secure settings for plugins (#31383 ) Adds the ability to reread and decrypt the local node keystore. Commonly, the contents of the keystore, backing the `SecureSettings`, are not retrievable except during node initialization. This changes that by adding a new API which broadcasts a password to every node. The password is used to decrypt the local keystore and use it to populate a `Settings` object that is passes to all the plugins implementing the `ReloadablePlugin` interface. The plugin is then responsible to do whatever "reload" means in his case. When the `reload`handler returns, the keystore is closed and its contents are no longer retrievable. Password is never stored persistently on any node. Plugins that have been moded in this commit are: `repository-azure`, `repository-s3`, `repository-gcs` and `discovery-ec2`.	2018-06-18 09:42:11 +03:00
Julie Tibshirani	16fa6b270f	Remove some cases in FieldTypeLookupTests that are no longer relevant. (#31381 )	2018-06-17 21:42:42 -07:00
Simon Willnauer	3d5f113ada	Ensure we don't use a remote profile if cluster name matches (#31331 ) If we are running into a race condition between a node being configured to be a remote node for cross cluster search etc. and that node joining the cluster we might connect to that node with a remote profile. If that node now joins the cluster it connected to it as a CCS remote node we use the wrong profile and can't use bulk connections etc. anymore. This change uses the remote profile only if we connect to a node that has a different cluster name than the local cluster. This is not a perfect fix for this situation but is the safe option while potentially only loose a small optimization of using less connections per node which is small anyways since we only connect to a small set of nodes. Closes #29321	2018-06-17 13:32:53 +02:00
Albert Zaharovits	5b94afd309	[TEST] Double write alias fault (#30942 )	2018-06-17 12:17:28 +03:00
Vladimir Dolzhenko	babb16d90c	Support for remote path in reindex api - post backport fix Closes #22913	2018-06-15 22:24:47 +02:00
Vladimir Dolzhenko	dbc9d60260	Support for remote path in reindex api (#31290 ) Support for remote path in reindex api Closes #22913	2018-06-15 22:14:28 +02:00
Tim Brooks	a705e1a9e3	Add byte array pooling to nio http transport (#31349 ) This is related to #28898. This PR implements pooling of bytes arrays when reading from the wire in the http server transport. In order to do this, we must integrate with netty reference counting. That manner in which this PR implements this is making Pages in InboundChannelBuffer reference counted. When we accessing the underlying page to pass to netty, we retain the page. When netty releases its bytebuf, it releases the underlying pages we have passed to it.	2018-06-15 14:01:03 -06:00
Tal Levy	3b70e943eb	add is-write-index flag to aliases (#30942 ) This commit adds the is-write-index flag for aliases. It allows requests to set the flag, and responses to display the flag. It does not validate and/or affect any indexing/getting/updating behavior of Elasticsearch -- this will be done in a follow-up PR.	2018-06-15 08:45:29 -07:00
Tal Levy	eda4964f64	Add rollover-creation-date setting to rolled over index (#31144 ) This commit introduces a new property to IndexMetaData called RolloverInfo. This object contains a map containing the aliases that were used to rollover the related index, which conditions were met, and at what time the rollover took place. much like the `index.creation_date`, it captures the approximate time that the index was rolled over to a new one.	2018-06-15 08:44:29 -07:00
Christoph Büscher	fec7860edc	[Tests] Fix edge case in ScriptedMetricAggregatorTests (#31357 ) An expected exception is only thrown when there are documents in the index created in the test setup. Fixed the test by making sure there is at least one. Closes #31307	2018-06-15 17:12:42 +02:00
Nhat Nguyen	8453ca638d	Upgrade to Lucene-7.4.0-snapshot-518d303506 (#31360 )	2018-06-15 10:58:21 -04:00
Tanguy Leroux	992c7889ee	Uncouple persistent task state and status (#31031 ) This pull request removes the relationship between the state of persistent task (as stored in the cluster state) and the status of the task (as reported by the Task APIs and used in various places) that have been confusing for some time (#29608). In order to do that, a new PersistentTaskState interface is added. This interface represents the persisted state of a persistent task. The methods used to update the state of persistent tasks are renamed: updatePersistentStatus() becomes updatePersistentTaskState() and now takes a PersistentTaskState as a parameter. The Task.Status type as been changed to PersistentTaskState in all places were it make sense (in persistent task customs in cluster state and all other methods that deal with the state of an allocated persistent task).	2018-06-15 09:26:47 +02:00
Tim Brooks	fcf1e41e42	Extract common http logic to server (#31311 ) This is related to #28898. With the addition of the http nio transport, we now have two different modules that provide http transports. Currently most of the http logic lives at the module level. However, some of this logic can live in server. In particular, some of the setting of headers, cors, and pipelining. This commit begins this moving in that direction by introducing lower level abstraction (HttpChannel, HttpRequest, and HttpResonse) that is implemented by the modules. The higher level rest request and rest channel work can live entirely in server.	2018-06-14 15:10:02 -06:00
Yannick Welsch	8f886cd4be	Treat ack timeout more like a publish timeout (#31303 ) This commit changes the ack timeout mechanism so that its behavior is closer to the publish timeout, i.e., it only comes into play after committing a cluster state. This ensures for example that an index creation request with a low (ack) timeout value does not return before the cluster state that contains information about the newly created index is even committed.	2018-06-14 18:32:35 +02:00
Simon Willnauer	375d09c588	[TEST] Fix RemoteClusterClientTests#testEnsureWeReconnect Closes #29547	2018-06-14 16:21:35 +02:00
David Turner	4877cec3e8	More detailed tracing when writing metadata (#31319 ) Packaging tests are occasionally failing (#30295) because of very slow index template creation. It looks like the slow part is updating the on-disk cluster state, and this change will help to confirm this.	2018-06-14 13:41:25 +01:00
Luca Cavanna	ce245a7320	Remove RestGetAllAliasesAction (#31308 ) We currently have a specific REST action to retrieve all aliaes, which uses internally the get index API. This doesn't seem to be required anymore though as the existing RestGetAliaesAction could as well take the requests with no indices and aliases specified. This commit removes the RestGetAllAliasesAction in favour of using RestGetAliasesAction also for requests that don't specify indices nor aliases. Similar to #31129.	2018-06-14 11:21:16 +02:00
Tanguy Leroux	4d7447cb5e	Reenable Checkstyle's unused import rule (#31270 )	2018-06-14 09:52:46 +02:00
Tanguy Leroux	2d4c9ce08c	Remove remaining unused imports before merging #31270	2018-06-14 09:52:03 +02:00
Adrien Grand	af58dc56fe	Add 5.6.11 version constant.	2018-06-13 22:29:45 +02:00
Luca Cavanna	664903a70a	CCS: don't proxy requests for already connected node (#31273 ) Cross-cluster search selects a subset of nodes for each remote cluster and sends requests only to them, which will act as a proxy and properly redirect such requests to the target nodes that hold the relevant data. What happens today is that every time we send a request to a remote cluster, it will be sent to the next node in the proxy list (in round-robin fashion), regardless of whether the target node is already amongst the ones that we are connected to. In case for instance we need to send a shard search request to a data node that's also one of the selected proxy nodes, we may end up sending the request to it through one of the other proxy nodes. This commit optimizes this case to make sure that whenever we are already connected to a remote node, we will send a direct request rather than using the next proxy node. There is a side-effect to this, which is that round-robin will be a bit unbalanced as the data nodes that are also selected as proxies will receive more requests.	2018-06-13 20:37:12 +02:00
Igor Motov	018d3fc81f	Mute ScriptedMetricAggregatorTests testSelfReferencingAggStateAfterMap Tracked by #31307	2018-06-13 14:18:10 -04:00
Dimitris Athanasiou	73742a4be9	Add unreleased version 6.3.1	2018-06-13 17:59:43 +01:00
Jason Tedor	7199d5f0e6	Add notion of internal index settings (#31286 ) We have some use cases for an index setting to only be manageable by dedicated APIs rather than be updateable via the update settings API. This commit adds the notion of an internal index setting. Such settings can be set on create index requests, they can not be changed via the update settings API, yet they can be changed by action on behalf of or triggered by the user via dedicated APIs.	2018-06-13 10:16:46 -04:00
Luca Cavanna	24163d10b7	REST hl client: cluster health to default to cluster level (#31268 ) With #29331 we added support for the cluster health API to the high-level REST client. The transport client does not support the level parameter, and it always returns all the info needed for shards level rendering. We have maintained that behaviour when adding support for cluster health to the high-level REST client, to ease migration, but the correct thing to do is to default the high-level REST client to `cluster` level, which is the same default as when going through the Elasticsearch REST layer.	2018-06-13 15:06:13 +02:00
Boaz Leskes	8c9360b5a1	Log warnings when cluster state publication failed to some nodes (#31233 ) If the publishing of a cluster state to a node fails, we currently only log it as debug information and only on the master. This makes it hard to see the cause of (test) failures when logging is set to default levels. This PR adds a warn level log on the node receiving the cluster state when it fails to deserialise the cluster state and a warn level log on the master with a list of nodes for which publication failed.	2018-06-13 13:22:34 +02:00
David Turner	489db54e57	Ignore numeric shard count if waiting for ALL (#31265 ) Today, if GET /_cluster/health?wait_for_active_shards=all does not immediately succeed then it throws an exception due to an erroneous and unnecessary call to ActiveShardCount#enoughShardsActive(). This commit fixes this logic. Fixes #31151	2018-06-13 11:25:26 +01:00
Ryan Ernst	a65b18f19d	Core: Remove plain execute method on TransportAction (#30998 ) TransportAction has many variants of execute. One of those variants executes by returning a future, which is then often blocked on by calling get(). This commit removes this variant of execute, instead using a helper method for tests that want to block, or having tests pass in a PlainActionFuture directly as a listener. Co-authored-by: Simon Willnauer <simonw@apache.org>	2018-06-13 09:58:13 +02:00
Tanguy Leroux	1f6e874002	Update checkstyle to 8.10.1 (#31269 )	2018-06-13 09:22:17 +02:00
Martijn van Groningen	16d593b22f	Set analyzer version in PreBuiltAnalyzerProviderFactory (#31202 ) instead of lamda that creates the analyzer	2018-06-13 07:25:19 +02:00
Jason Tedor	a36543531b	Fix race in clear scroll (#31259 ) Here is the problem: if two threads are racing and one hits a failure freeing a context and the other succeeded, we can expose the value of the has failure marker to the succeeding thread before the failing thread has had a chance to set the failure marker. This is a problem if the failing thread counted down the expected number of operations, then be put to sleep by a gentle lullaby from the OS, and then the other thread could count down to zero. Since the failing thread did not get to set the failure marker, the succeeding thread would respond that the clear scroll succeeded and that makes that thread a liar. This commit addresses by first setting the failure marker before we potentially expose its value to another thread.	2018-06-12 10:17:41 -04:00
Van0SS	d5e8a5cd69	REST high-level client: add Cluster Health API (#29331 ) Relates to #27205	2018-06-12 13:34:06 +02:00
olcbean	7d7ead95b2	Add Get Aliases API to the high-level REST client (#28799 ) Given the weirdness of the response returned by the get alias API, we went for a client specific response, which allows us to hold the error message, exception and status returned as part of the response together with aliases. See #30536 . Relates to #27205	2018-06-12 10:26:17 +02:00
Martijn van Groningen	6030d4be1e	[INGEST] Interrupt the current thread if evaluation grok expressions take too long (#31024 ) This adds a thread interrupter that allows us to encapsulate calls to org.joni.Matcher#search() This method can hang forever if the regex expression is too complex. The thread interrupter in the background checks every 3 seconds whether there are threads execution the org.joni.Matcher#search() method for longer than 5 seconds and if so interrupts these threads. Joni has checks that that for every 30k iterations it checks if the current thread is interrupted and if so returns org.joni.Matcher#INTERRUPTED Closes #28731	2018-06-12 07:49:03 +02:00
Jason Tedor	1dbe554e5e	Suppress extras FS on caching directory tests This filesystem needs to be suppressed during these tests because it adds random files to the directory upon directory creation. That means that the size of these directories is off from what we expect them to be. Rather than loosening the assertion which could hide bugs on real directories, this commit suppresses this file system in this test suite.	2018-06-11 22:19:19 -04:00
Nhat Nguyen	dda56fc0fc	Move ESIndexLevelReplicationTestCase to test framework (#31243 ) Other components might benefit from the testing infra provided by ESIndexLevelReplicationTestCase. This commit moves it to the test framework.	2018-06-11 12:47:38 -04:00
Lee Hinman	c064b507df	Encapsulate Translog in Engine (#31220 ) This removes the abstract `getTranslog` method in `Engine`, instead leaving it to the abstract implementations of the other methods that use the translog. This allows future Engines not to have a Translog, as instead they must implement the methods that use the translog pieces to return necessary values.	2018-06-11 09:44:50 -06:00
Nhat Nguyen	99e04582de	HLRest: Add get index templates API (#31161 ) Relates #27205	2018-06-11 11:06:28 -04:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Yannick Welsch	f9e8afd357	[TEST] Fix testRecoveryAfterPrimaryPromotion This test was failing from time to time due to a ConcurrentModificationException, which was triggered due to the primary-replica resync running concurrently with shards being removed. Closes #30767	2018-06-11 11:09:45 +02:00
Yannick Welsch	4e9b554948	Don't swallow exceptions on replication (#31179 ) Swallowing these exceptions is dangerous as they can result in replicas going out-of-sync with the primary. Follow-up to #28571	2018-06-11 09:09:23 +02:00
Simon Willnauer	f825a530b8	Limit the number of concurrent requests per node (#31206 ) With `max_concurrent_shard_requests` we used to throttle / limit the number of concurrent shard requests a high level search request can execute per node. This had several problems since it limited the number on a global level based on the number of nodes. This change now throttles the number of concurrent requests per node while still allowing concurrency across multiple nodes. Closes #31192	2018-06-11 08:49:18 +02:00
rationull	85c26d682a	Call ensureNoSelfReferences() on _agg state variable after scripted metric agg script executions (#31044 ) Previously this was called for the combine script only. This change checks for self references for init, map, and reduce scripts as well, and adds unit test coverage for the init, map, and combine cases.	2018-06-11 08:39:05 +02:00

1 2 3 4 5 ...

785 Commits