OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-09 22:45:04 +00:00

Author	SHA1	Message	Date
Ali Beyad	cd52065871	[TEST] testAckedIndexing waits for all nodes to stabilize testAckedIndexing now waits for all nodes to stabilize in the cluster state through an assertBusy before final validation that all documents are found in tehir respective shards in the cluster. Before, what could happen is that the ensureGreen check passes but only after that is a ping failure from the network disruption processed by the master, thereby rendering the cluster RED again. This assertBusy waits up to 30 seconds for all nodes to have stabilized and all get document actions to succeed.	2017-01-18 13:51:25 -05:00
Michael McCandless	1d1bdd476c	Finish exposing FlattenGraphTokenFilter (#22667 )	2017-01-18 11:05:34 -05:00
Nik Everett	e71b26f480	Improve unit test coverage of aggs (#22668 ) Add tests for `GlobalAggregator`, `MaxAggregator`, and `InternalMax`. Relates to #22278	2017-01-18 10:33:45 -05:00
Simon Willnauer	24e2847af2	Streamline foreign stored context restore and allow to perserve response headers (#22677 ) Today we do not preserve response headers if they are present on a transport protocol response. While preserving these headers is not always desired, in the most cases we should pass on these headers to have consistent results for depreciation headers etc. yet, this hasn't been much of a problem since most of the deprecations are detected early ie. on the coordinating node such that this bug wasn't uncovered until #22647 This commit allow to optionally preserve headers when a context is restored and also streamlines the context restore since it leaked frequently into the callers thread context when the callers context wasn't restored again.	2017-01-18 16:17:54 +01:00
Ali Beyad	8a0a1140a9	[TEST] add logging to MockRepository to help debug index-N blob reading	2017-01-18 08:53:29 -05:00
Boaz Leskes	1227044ddd	Add a deprecation notice to shadow replicas (#22647 ) Relates to #22024 On top of documentation, the PR adds deprecation loggers and deals with the resulting warning headers. The yaml test is set exclude versions up to 6.0. This is need to make sure bwc tests pass until this is backported to 5.2.0 . Once that's done, I will change the yaml test version limits	2017-01-18 12:28:09 +01:00
Ke Li	797d105177	Remove unnecessary class cast	2017-01-18 11:09:09 +01:00
Simon Willnauer	19f9cb307a	Merge branch 'master' into feature/multi_cluster_search	2017-01-18 09:24:35 +01:00
Scott Somerville	372812da98	Allow an index to be partitioned with custom routing (#22274 ) This change makes it possible for custom routing values to go to a subset of shards rather than just a single shard. This enables the ability to utilize the spatial locality that custom routing can provide while mitigating the likelihood of ending up with an imbalanced cluster or suffering from a hot shard. This is ideal for large multi-tenant indices with custom routing that suffer from one or both of the following: - The big tenants cannot fit into a single shard or there is so many of them that they will likely end up on the same shard - Tenants often have a surge in write traffic and a single shard cannot process it fast enough Beyond that, this should also be useful for use cases where most queries are done under the context of a specific field (e.g. a category) since it gives a hint at how the data can be stored to minimize the number of shards to check per query. While a similar solution can be achieved with multiple concrete indices or aliases per value today, those approaches breakdown for high cardinality fields. A partitioned index enforces that mappings have routing required, that the partition size does not change when shrinking an index (the partitions will shrink proportionally), and rejects mappings that have parent/child relationships. Closes #21585	2017-01-18 08:51:23 +01:00
Igor Motov	500548fcda	Remove taskManager.registerChildTask Instead of forcing each task to register all nodes where its children are running, this commit runs cancellation on all nodes. The task cancellation operation doesn't run too frequently, so this optimization doesn't seem to be worth additional complexity of the interface.	2017-01-17 18:07:31 -05:00
Ali Beyad	ce811feba7	[TEST] testAckedIndexing waits for the cluster state to have propogated to all nodes in the cluster before checking the existance of documents on each node	2017-01-17 15:36:31 -05:00
Nik Everett	1169cd936e	Fix compilation in eclipse Eclipse needs a bit of extra special help with type parameters in `TransportReplicationActionTests` now.	2017-01-17 14:53:54 -05:00
Ali Beyad	554a5e3039	[TEST] add retries to MockRepository getRepositoryData to try to diagnose a NotXContentException being thrown	2017-01-17 12:17:29 -05:00
Simon Willnauer	69f1ffb1f8	fix exception message	2017-01-17 17:29:43 +01:00
Simon Willnauer	292e3a60d1	apply review comments	2017-01-17 17:20:52 +01:00
Ali Beyad	e2977889b8	Allow comma delimited array settings to have a space after each entry (#22591 ) Previously, certain settings that could take multiple comma delimited values would pick up incorrect values for all entries but the first if each comma separated value was followed by a whitespace character. For example, the multi-value "A,B,C" would be correctly parsed as ["A", "B", "C"] but the multi-value "A, B, C" would be incorrectly parsed as ["A", " B", " C"]. This commit allows a comma separated list to have whitespace characters after each entry. The specific settings that were affected by this are: cluster.routing.allocation.awareness.attributes index.routing.allocation.require.* index.routing.allocation.include.* index.routing.allocation.exclude.* cluster.routing.allocation.require.* cluster.routing.allocation.include.* cluster.routing.allocation.exclude.* http.cors.allow-methods http.cors.allow-headers For the allocation filtering related settings, this commit also provides validation of each specified entry if the filtering is done by _ip, _host_ip, or _publish_ip, to ensure that each entry is a valid IP address. Closes #22297	2017-01-17 08:51:04 -06:00
Tanguy Leroux	f5542ed47f	Simplify ElasticsearchException rendering as a XContent (#22611 ) This commit tries to simplify the way ElasticsearchException are rendered to xcontent. It adds some documentation and renames and merges some methods. Current behavior is preserved, the goal is to be more readable and centralize everything in the ElasticsearchException class.	2017-01-17 15:44:49 +01:00
Simon Willnauer	197cd7d7a9	Add test for the grouping error message if indices and cluster can't be disambiguated	2017-01-17 14:13:09 +01:00
Simon Willnauer	88f6ae55f5	Improve remote / local indices filtering by not modifying external state	2017-01-17 14:05:36 +01:00
Simon Willnauer	709cb9a39e	Merge branch 'master' into feature/multi_cluster_search	2017-01-17 12:34:36 +01:00
Simon Willnauer	1c5cc58373	apply review comments	2017-01-17 11:46:55 +01:00
Tim Brooks	16a76d9bc0	Remove blocking TCP clients and servers (#22639 ) This commit removes the option to use the blocking variants of the TCP transport server, TCP transport client, or http server.	2017-01-16 18:38:51 -06:00
Michael McCandless	ebd38e2a6a	Expose FlattenGraphTokenFilter (#22643 ) FlattenGraphTokenFilter is necessary for using graph-based token streams (e.g. the new SynonymGraphFilter) during indexing.	2017-01-16 16:53:32 -05:00
Boaz Leskes	d80e3eea6c	Replace EngineClosedException with AlreadyClosedExcpetion (#22631 ) `EngineClosedException` is a ES level exception that is used to indicate that the engine is closed when operation starts. It doesn't really add much value and we can use `AlreadyClosedException` from Lucene (which may already bubble if things go wrong during operations). Having two exception can just add confusion and lead to bugs, like wrong handling of `EngineClosedException` when dealing with document level failures. The latter was exposed by `IndexWithShadowReplicasIT`. This PR also removes the AwaitFix from the `IndexWithShadowReplicasIT` tests (which was what cause this to be discovered). While debugging the source of the issue I found some mismatches in document uid management in the tests. The term that was passed to the engine didn't correspond to the uid in the parsed doc - those are fixed as well.	2017-01-16 21:14:41 +01:00
Simon Willnauer	f30b1f82ee	Remove HttpServer and HttpServerAdapter in favor of a simple dispatch method (#22636 ) Today we have quite some abstractions that are essentially providing a simple dispatch method to the plugins defining a `HttpServerTransport`. This commit removes `HttpServer` and `HttpServerAdaptor` and introduces a simple `Dispatcher` functional interface that delegate to `RestController` by default. Relates to #18482	2017-01-16 21:06:08 +01:00
Boaz Leskes	f88ab76067	Revert "Add a deprecation notice to shadow replicas (#22025 )" This reverts commit 0da190234c87838df5d37f2375e901351e05e03d.	2017-01-16 16:15:41 +01:00
Boaz Leskes	b887681550	Revert "Don'y use `INDEX_SHARED_FS_ALLOW_RECOVERY_ON_ANY_NODE_SETTING` directly as it triggers (many) deprecation logging" This reverts commit e976aa09bbd98b631cb12b86e9dc787d45fdb969.	2017-01-16 16:15:32 +01:00
Boaz Leskes	e976aa09bb	Don'y use `INDEX_SHARED_FS_ALLOW_RECOVERY_ON_ANY_NODE_SETTING` directly as it triggers (many) deprecation logging #22025 deprecated this setting (pending it's removal) but it's frequent usage will spam the deprecation logs and also fails test. As temporary work around we should not use the setting object directly.	2017-01-16 16:11:59 +01:00
Boaz Leskes	0da190234c	Add a deprecation notice to shadow replicas (#22025 ) Also adds deprecation logging. See #22024	2017-01-16 15:40:05 +01:00
Christoph Büscher	59a48ffc41	ProfileResult and CollectorResult should print machine readable timing information (#22561 ) Currently both ProfileResult and CollectorResult print the time field in a human readable string format (e.g. "time": "55.20315000ms"). When trying to parse this back to a long value, for example to use in the planned high level java rest client, we can lose precision because of conversion and rounding issues. This change adds a new additional field (`time_in_nanos`) to the profile response to be able to get the original time value in nanoseconds back. The old `time` field is only printed when the `?`human=true` flag in the url is set. This follow the behaviour for all other stats-related apis. Also the format of the `time` field is slightly changed. Instead of always formatting the output as a 10-digit ms value, by using the `XContentBuilder#timeValueField()` method we now print the largest time unit present is used (e.g. "s", "ms", "micros").	2017-01-16 14:27:55 +01:00
Jason Tedor	e6dc74f2bf	Add replica ops with version conflict to translog An operation that completed successfully on a primary can result in a version conflict on a replica due to the asynchronous nature of operations. When a replica operation results in a version conflict, the operation is not added to the translog. This leads to gaps in the translog which is problematic as it can lead to situations where a replica shard can never advance its local checkpoint. As such operations are just normal course of business for a replica shard, these operations should be treated as if they completed successfully. This commit adds these operations to the translog. Relates #22626	2017-01-16 08:08:52 -05:00
javanna	8e3f1dd689	Replace custom Functional interface in ElasticsearchException with CheckedFunction	2017-01-16 13:57:58 +01:00
javanna	9a910d3c9d	Make RestChannelConsumer extend CheckedConsumer<RestChannel, Exception>	2017-01-16 13:57:58 +01:00
javanna	ab144c418e	replace ShardSearchRequest.FilterParser functional interface with CheckedFunction	2017-01-16 13:57:58 +01:00
javanna	bc22afcb2f	[TEST] replace SizeFunction with Function<Integer, Integer>	2017-01-16 13:57:58 +01:00
javanna	884302dcaa	Expose CheckedFunction	2017-01-16 13:57:58 +01:00
Jason Tedor	fc3280b3cf	Expose logs base path For certain situations, end-users need the base path for Elasticsearch logs. Exposing this as a property is better than hard-coding the path into the logging configuration file as otherwise the logging configuration file could easily diverge from the Elasticsearch configuration file. Additionally, Elasticsearch will only have permissions to write to the log directory configured in the Elasticsearch configuration file. This commit adds a property that exposes this base path. One use-case for this is configuring a rollover strategy to retain logs for a certain period of time. As such, we add an example of this to the documentation. Additionally, we expose the property es.logs.cluster_name as this is used as the name of the log files in the default configuration. Finally, we expose es.logs.node_name in cases where node.name is explicitly set in case users want to include the node name as part of the name of the log files. Relates #22625	2017-01-16 07:39:37 -05:00
Jason Tedor	9ae5410ea6	Do not configure a logger named level When logger.level is set, we end up configuring a logger named "level" because we look for all settings of the form "logger\..+" as configuring a logger. Yet, logger.level is special and is meant to only configure the default logging level. This commit causes is to avoid not configuring a logger named level. Relates #22624	2017-01-16 07:30:21 -05:00
Simon Willnauer	895124e67e	Merge branch 'master' into feature/multi_cluster_search	2017-01-16 13:20:45 +01:00
Alexander Reelsen	f6ee6e420b	Indexing: Add shard id to indexing operation listener (#22606 ) The IndexingOperationListener interface did not provide any information about the shard id when a document was indexed. This commit adds the shard id as the first parameter to all methods in the IndexingOperationListener.	2017-01-16 09:08:16 +01:00
Jason Tedor	526cf6182d	Cleanup handling of cgroup stats This commit is a simple cleanup of the code related to cgroup stats: - reduce visibility of a method - remove an unneeded logger guard - cleanup the formatting of comments	2017-01-15 12:18:16 -05:00
Simon Willnauer	5f0344a918	Pass ThreadContext to transport interceptors to allow header modification (#22618 ) TransportInterceptors are commonly used to enrich requests with headers etc. which requires access the the thread context. This is not always easily possible since threadpools are hard to access for instance if the interceptor is used on a transport client. This commit passes on the thread context to all the interceptors for further consumption. Closes #22585	2017-01-15 13:35:39 +01:00
Simon Willnauer	3f784a4424	Merge branch 'master' into feature/multi_cluster_search	2017-01-15 10:28:34 +01:00
Jason Tedor	bed719de0a	Log deleting indices at info level Deleting indices is an important event in a cluster and as such should be logged at the info level. This commit changes the logging level on index deletion to the info level. Relates #22627	2017-01-14 23:13:40 -05:00
Simon Willnauer	fde11649fb	harden tests	2017-01-13 23:59:59 +01:00
Jason Tedor	d67514606e	Fix out-of-date Javadocs on Security.java We have made the security manager non-optional, but the Javadocs for Security.java imply that it still is. This commit fixes this issue. Relates #16176	2017-01-13 17:20:45 -05:00
Simon Willnauer	63e4552c0d	Merge branch 'master' into feature/multi_cluster_search	2017-01-13 23:07:20 +01:00
Ali Beyad	0c7fc229b8	[TEST] No longer randomly block on the index-N files in the MockRepository, because the getRepositoryData() call depends on it, which is used in non-synchronized actions such as getting snapshot status.	2017-01-13 15:57:37 -05:00
Lee Hinman	cd236c4de4	Merge remote-tracking branch 'zareek/enhancement/use_shard_bulk_for_single_ops'	2017-01-13 10:09:18 -07:00
Simon Willnauer	4c1ee018f6	Remove setLocalNode from ClusterService and TransportService (#22608 ) ClusterService and TransportService expect the local discovery node to be set before they are started but this requires manual interaction and is error prone since to work absolutely correct they should share the same instance (same ephemeral ID). TransportService also has 2 modes of operation, mainly realted to transport client vs. internal to a node. This change removes the mode where we don't maintain a local node and uses a dummy local node in the transport client since we don't bind to any port in such a case. Local discovery node instances are now managed by the node itself and only suppliers and factories that allow creation only once are passed to TransportService and ClusterService.	2017-01-13 16:12:27 +01:00

1 2 3 4 5 ...

7474 Commits