OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-02-10 23:15:04 +00:00

Author	SHA1	Message	Date
Boaz Leskes	37c8c0fa03	Improve logging for batched cluster state updates (#19255 ) We've been slowly improving batch support in `ClusterService` so service won't need to implement this tricky logic themselves. These good changes are blessed but our logging infra didn't catch up and we now log things like: ``` [2016-07-04 21:51:22,318][DEBUG][cluster.service ] [node_sm0] processing [put-mapping [type1],put-mapping [type1]]: ``` Depending on the `source` string this can get quite ugly (mostly in the ZenDiscovery area). This PR adds some infra to improve logging, keeping the non-batched task the same. As result the above line looks like: ``` [2016-07-04 21:44:45,047][DEBUG][cluster.service ] [node_s0] processing [put-mapping[type0, type0, type0]]: execute ``` ZenDiscovery waiting on join moved from: ``` [2016-07-04 17:09:45,111][DEBUG][cluster.service ] [node_t0] processing [elected_as_master, [1] nodes joined),elected_as_master, [1] nodes joined)]: execute ``` To ``` [2016-07-04 22:03:30,142][DEBUG][cluster.service ] [node_t3] processing [elected_as_master ([3] nodes joined)[{node_t2}{R3hu3uoSQee0B6bkuw8pjw}{p9n28HDJQdiDMdh3tjxA5g}{127.0.0.1}{127.0.0.1:30107}, {node_t1}{ynYQfk7uR8qR5wKIysFlQg}{wa_OKuJHSl-Oyl9Gis-GXg}{127.0.0.1}{127.0.0.1:30106}, {node_t0}{pweq-2T4TlKPrEVAVW6bJw}{NPBSLXSTTguT1So0JsZY8g}{127.0.0.1}{127.0.0.1:30105}]]: execute ``` As a bonus, I removed all `zen-disco` prefixes to sources from that area.	2016-07-04 22:54:43 +02:00
Boaz Leskes	6861d3571e	Persistent Node Ids (#19140 ) Node IDs are currently randomly generated during node startup. That means they change every time the node is restarted. While this doesn't matter for ES proper, it makes it hard for external services to track nodes. Another, more minor, side effect is that indexing the output of, say, the node stats API results in creating new fields due to node ID being used as keys. The first approach I considered was to use the node's published address as the base for the id. We already [treat nodes with the same address as the same](https://github.com/elastic/elasticsearch/blob/master/core/src/main/java/org/elasticsearch/discovery/zen/NodeJoinController.java#L387) so this is a simple change (see [here](https://github.com/elastic/elasticsearch/compare/master...bleskes:node_persistent_id_based_on_address)). While this is simple and it works for probably most cases, it is not perfect. For example, if after a node restart, the node is not able to bind to the same port (because it's not yet freed by the OS), it will cause the node to still change identity. Also in environments where the host IP can change due to a host restart, identity will not be the same. Due to those limitation, I opted to go with a different approach where the node id will be persisted in the node's data folder. This has the upside of connecting the id to the nodes data. It also means that the host can be adapted in any way (replace network cards, attach storage to a new VM). I It does however also have downsides - we now run the risk of two nodes having the same id, if someone copies clones a data folder from one node to another. To mitigate this I changed the semantics of the protection against multiple nodes with the same address to be stricter - it will now reject the incoming join if a node exists with the same id but a different address. Note that if the existing node doesn't respond to pings (i.e., it's not alive) it will be removed and the new node will be accepted when it tries another join. Last, and most importantly, this change requires that all nodes persist data to disk. This is a change from current behavior where only data & master nodes store local files. This is the main reason for marking this PR as breaking. Other less important notes: - DummyTransportAddress is removed as we need a unique network address per node. Use `LocalTransportAddress.buildUnique()` instead. - I renamed `node.add_lid_to_custom_path` to `node.add_lock_id_to_custom_path` to avoid confusion with the node ID which is now part of the `NodeEnvironment` logic. - I removed the `version` paramater from `MetaDataStateFormat#write` , it wasn't really used and was just in the way :) - TribeNodes are special in the sense that they do start multiple sub-nodes (previously known as client nodes). Those sub-nodes do not store local files but derive their ID from the parent node id, so they are generated consistently.	2016-07-04 21:09:25 +02:00
Tanguy Leroux	ed444cd276	Fix CompletionTokenStream modifier redundancy	2016-07-04 16:29:26 +02:00
Tanguy Leroux	d72134b46e	Fix CompletionSuggestSearchIT and CompletionSuggestSearch2xIT	2016-07-04 15:58:27 +02:00
Tanguy Leroux	0e7faf1005	Enable Checkstyle RedundantModifier	2016-07-04 15:22:12 +02:00
Nik Everett	a728c4bb5c	Migrate global, filter, and filters aggs to NamedWriteable Once all of these are migrated we'll be able to remove aggregation's custom "streams" which function that same as NamedWriteable. It also allows us to make most of the fields on aggregations final which is rather nice. Also starts to migrate MultiBucketAggregation.Bucket to Writeable, allowing the buckets to have immutable parts.	2016-07-04 08:53:02 -04:00
Nik Everett	c02de9227c	Migrate remaining calc aggs to NamedWriteable Once all of these are migrated we'll be able to remove aggregation's custom "streams" which function that same as NamedWriteable. It also allows us to make most of the fields on aggregations final which is rather nice.	2016-07-04 08:46:00 -04:00
Jason Tedor	3343ceeae4	Do not catch throwable Today throughout the codebase, catch throwable is used with reckless abandon. This is dangerous because the throwable could be a fatal virtual machine error resulting from an internal error in the JVM, or an out of memory error or a stack overflow error that leaves the virtual machine in an unstable and unpredictable state. This commit removes catch throwable from the codebase and removes the temptation to use it by modifying listener APIs to receive instances of Exception instead of the top-level Throwable. Relates #19231	2016-07-04 08:41:06 -04:00
Boaz Leskes	86d2e88362	re-introduce: Inline reroute with process of node join/master election (#18938 )	2016-07-04 13:06:56 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Daniel Mitterdorfer	25881c265b	Improve error checking and reporting in CBSIT	2016-07-04 10:24:15 +02:00
Daniel Mitterdorfer	4131476db1	Propagate canTripCircuitBreaker for all broadcasted actions With this commit we also propagate the `canTripCircuitBreaker` setting for the main action in TransportBroadcastByNodeAction. Previously, we set it only on the additional action added by this handler.	2016-07-04 10:24:15 +02:00
Ryan Ernst	5cf7583bde	Internal: Remove unused methods and classes This change removes a handful of classes and methods that were simply unused. Some of the classes were intermediate abstract classes that added nothing to the base class they extended.	2016-07-02 11:33:16 -07:00
Yannick Welsch	3e199b1dff	Make testAllOperationsInvoked properly clean up after itself	2016-07-02 11:54:52 +02:00
Yannick Welsch	3221c9d970	Match exception message more exactly in test assertion	2016-07-02 10:07:57 +02:00
Yannick Welsch	b4064ce43f	Make primary relocation handoff non-blocking (#19013 ) Primary relocation and indexing concurrently can currently lead to a deadlock situation as indexing operations are blocked on a (bounded) thread pool during the hand-off phase between old and new primary. This change replaces blocking of indexing operations by putting operations that cannot be executed during relocation hand-off in a queue to be executed once relocation completes. Closes #18553.	2016-07-02 09:35:54 +02:00
Yannick Welsch	50b97ba5f5	Fix test assertion matching exception message Newer versions of the URL class in JDK 9 use a different exception message when throwing a MalformedURLException due to an unknown protocol.	2016-07-02 09:27:07 +02:00
Ryan Ernst	5a66c08ae9	Merge branch 'master' into ingest_plugin_api	2016-07-01 16:27:52 -07:00
Ryan Ernst	c7b9489be8	Merge pull request #19225 from rjernst/we_dont_need_generics Internal: Remove generics from LifecycleComponent	2016-07-01 16:25:34 -07:00
Ryan Ernst	822c995367	Internal: Remove generics from LifecycleComponent The only reason for LifecycleComponent taking a generic type was so that it could return that type on its start and stop methods. However, this chaining has no practical necessity. Instead, start and stop can be void, and a whole bunch of confusing generics disappear.	2016-07-01 16:17:42 -07:00
Ali Beyad	05998224d8	Adding repository index generational files Before, a repository would maintain an index file (named 'index') per repository, that contained the current snapshots in the repository. This file was not atomically written, so repositories had to depend on listing the blobs in the repository to determine what the current snapshots are, and only rely on the index file if the repository does not support the listBlobs operation. This could cause an incorrect view of the current snapshots in the repository if any prior snapshot delete operations failed to delete snapshot metadata files. This commit introduces the atomic writing of the index file, and because atomic writes are not guaranteed if the file already exists, we write to a generational index file (index-N, where N is the current generation). We also maintain an index-latest file that contains the current generation, for those repositories that cannot list blobs. Closes #19002 Relates #18156	2016-07-01 17:52:57 -04:00
Ryan Ernst	40e1fe7d9e	Remove unused ClusterService member of NodeService	2016-07-01 13:17:12 -07:00
Ryan Ernst	f9eed91f33	Fix test oops	2016-07-01 13:00:33 -07:00
Ryan Ernst	2f12d1cb45	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:58:11 -07:00
Ryan Ernst	62397c0c3d	Fix node service to still work when http is disabled	2016-07-01 12:55:41 -07:00
Ryan Ernst	6110dcc710	Fix unit tests for http server that no longer need node service	2016-07-01 12:45:43 -07:00
Ryan Ernst	f6491047a0	Fix line lengths for ingest config utils	2016-07-01 12:36:44 -07:00
Ryan Ernst	e5caadc4f3	Merge branch 'master' into ingest_plugin_api	2016-07-01 12:35:26 -07:00
Ryan Ernst	a200292211	Merge pull request #19218 from rjernst/http_server_without_node_service Internal: Remove cyclic dependency between HttpServer and NodeService	2016-07-01 12:32:52 -07:00
Nik Everett	de71a9abb3	Migrate value_count, percentiles, and percentile_ranks aggregations to NamedWriteable These are the first aggregations with multiple `InternalAggregation`s backing the same `AggregationBuilder`. This required a change in the register method's signature.	2016-07-01 14:48:08 -04:00
Ali Beyad	cb20776439	Includes the index UUID in the _cat/indices API and adds tests for the _cat/indices functionality. Closes #19204 Closes #19132	2016-07-01 14:45:55 -04:00
Ryan Ernst	a206c7a149	Remove unnecessary transport level bwc	2016-07-01 10:16:56 -07:00
Ryan Ernst	dbbfadeefa	Internal: Remove cyclic dependency between HttpServer and NodeService NodeService has an "service attributes" map, which is only set by HttpServer on start/stop. But the only thing it puts in this map is already available as part of the HttpServer info which is added to node info requests. This change removes the attributes map and removes the dependency in HttpServer on NodeService.	2016-07-01 09:50:54 -07:00
Ryan Ernst	76ba10bab6	Remove unnecessary optional injection of ScriptService into NodeService	2016-07-01 09:31:41 -07:00
Ryan Ernst	65c9b0b588	Merge branch 'master' into ingest_plugin_api	2016-07-01 09:26:17 -07:00
Tal Levy	01d7020ee3	Skip the execution of an empty pipeline (#19200 ) main optimization: `sourceToMap` is not called, therefore avoiding creation of Map of Maps Closes #19192.	2016-07-01 09:15:05 -07:00
Tanguy Leroux	5c7ca5cc0c	Fix checkstyle violations	2016-07-01 17:24:43 +02:00
Tanguy Leroux	0a293fad29	Remove some unused code	2016-07-01 17:01:39 +02:00
Tanguy Leroux	8c40b2b54e	Fix order of modifiers	2016-07-01 16:57:14 +02:00
Simon Willnauer	5c8164a561	Clean up BytesReference (#19196 ) BytesReference should be a really simple interface, yet it has a gazillion ways to achieve the same this. Methods like `#hasArray`, `#toBytesArray`, `#copyBytesArray` `#toBytesRef` `#bytes` are all really duplicates. This change simplifies the interface dramatically and makes implementations of it much simpler. All array access has been removed and is streamlined through a single `#toBytesRef` method. Utility methods to materialize a compact byte array has been added too for convenience.	2016-07-01 16:09:31 +02:00
Nik Everett	27e320d5ce	Migrate sum, min, and max aggs to NamedWriteable	2016-07-01 09:23:26 -04:00
Nik Everett	91b66e3cf4	Migration stats and extended stats to NamedWriteable Migrates the `stats` and `extended_stats` aggregations and pipeline aggregations from the special purpose aggregations streams to `NamedWriteable`. These are the first pipeline aggregations so this adds the infrastructure to support both streams and `NamedWriteable`s for pipeline aggregations.	2016-07-01 09:13:15 -04:00
javanna	598c36128e	Revert "Raised IOException on deleteBlob (#18815 )" This reverts commit d24cc65cad2f3152237df8b6c457a2d0a603f13a as it seems to be causing test failures.	2016-07-01 11:00:32 +02:00
gfyoung	d24cc65cad	Raised IOException on deleteBlob (#18815 ) Raise IOException on deleteBlob if the blob doesn't exist This commit raises an IOException on BlobContainer#deleteBlob if the blob does not exist, in conformance with the BlobContainer interface contract. Each implementation of BlobContainer now conforms to this contract (file system, S3, Azure, HDFS). This commit also contains blob container tests for each of the repository implementations. Closes #18530	2016-06-30 23:00:10 -04:00
Ryan Ernst	8275ab497b	Merge pull request #19170 from rjernst/rest_handler_client Changed rest handler interface to take NodeClient	2016-06-30 11:00:09 -07:00
Nik Everett	f5a269b029	Start migration away from aggregation streams We'll migrate to NamedWriteable so we can share code with the rest of the system. So we can work on this in multiple pull requests without breaking Elasticsearch in between the commits this change supports both old style `InternalAggregations.stream` serialization and `NamedWriteable` style serialization. As such it creates about a half dozen `// NORELEASE` comments that will have to be removed once the migration is complete. This also introduces a boolean `transportClient` flag to `SearchModule` which is used to skip inappropriate registrations for for the transport client while still registering the things it needs. In this case that means that the `InternalAggregation` subclasses are registered with the `NamedWriteableRegistry` but the `AggregationBuilder` subclasses are not. Finally, this moves aggregation registration from guice configuration time to `SearchModule` construction time. This will make it simpler to work with in the future as we further clean up Elasticsearch's extension points.	2016-06-30 12:57:34 -04:00
Boaz Leskes	09ca6d6ed2	Add a BridgePartition to be used by testAckedIndexing (#19172 ) We have long worked to capture different partitioning scenarios in our testing infra. This PR adds a new variant, inspired by the Jepsen blogs, which was forgotten far - namely a partition where one node can still see and be seen by all other nodes. It also updates the resiliency page to better reflect all the work that was done in this area.	2016-06-30 17:58:12 +02:00
Ryan Ernst	04a4bcdca0	Add comment explaining bytes reference edge case	2016-06-30 08:47:55 -07:00
Ryan Ernst	e079c83020	Fix test edge case for bytes reference	2016-06-30 08:45:54 -07:00
Ryan Ernst	c762e7aa15	Merge branch 'master' into rest_handler_client	2016-06-30 08:16:25 -07:00

1 2 3 4 5 ...

5649 Commits