During a recent merge from master, we lost the bridge from IndicesClusterStateService to the GlobalCheckpointService of primary shards, which notifies them of changes to the current set of active/initializing shards. This commit adds the bridge back (with unit tests). It also simplifies GlobalCheckpoint tracking to use a simpler model (which makes use of the fact that the global checkpoint sync is done periodically).
The old CheckpointIT integration test is moved to IndexLevelReplicationTests. I also added similar assertions to RelocationsIT, which surfaced a bug in the primary relocation logic and how it interacts with global checkpoint updates. The test is currently await-fixed and will be addressed in a follow-up issue.
Sequence number related data (maximum sequence number, local checkpoint,
and global checkpoint) gets stored in Lucene on each commit. The logical
place to store this data is on each Lucene commit's user commit data
structure (see IndexWriter#setCommitData and the new version
IndexWriter#setLiveCommitData). However, previously we did not store the
maximum sequence number in the commit data because the commit data got
copied over before the Lucene IndexWriter flushed the documents to segments
in the commit. This means that between the time that the commit data was
set on the IndexWriter and the time that the IndexWriter completes the commit,
documents with higher sequence numbers could have entered the commit.
Hence, we would use FieldStats on the _seq_no field in the documents to get
the maximum sequence number value, but this suffers from the drawback that if the
last sequence number in the commit corresponded to a document deletion,
that sequence number would not show up in FieldStats as there would be no
corresponding document in Lucene.
In Lucene 6.2, the commit data was changed to take an Iterable interface, so
that the commit data can be calculated and retrieved *after* all documents
have been flushed, while the commit data itself is being set on the Lucene commit.
This commit changes max_seq_no so it is stored in the commit data instead of
being calculated from FieldStats, taking advantage of this deferred calculation
by passing an Iterable whose iterator is only materialized at commit time.
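Roughly, the deferred pattern looks like the following sketch, built on Lucene's IndexWriter#setLiveCommitData; the SeqNoProvider collaborator and the commit-data key names are illustrative stand-ins, not the exact engine code:
```java
import java.io.IOException;
import java.util.HashMap;
import java.util.Map;
import org.apache.lucene.index.IndexWriter;

// Hypothetical collaborator standing in for the engine's sequence-number tracking.
interface SeqNoProvider {
    long getLocalCheckpoint();
    long getMaxSeqNo();
}

class SeqNoCommitter {
    void commitWithSeqNoData(IndexWriter writer, SeqNoProvider seqNos) throws IOException {
        writer.setLiveCommitData(() -> {
            // This iterator is materialized only when IndexWriter copies the
            // commit data, i.e., after all documents have been flushed, so
            // these values cover every operation the commit contains.
            final Map<String, String> commitData = new HashMap<>();
            commitData.put("local_checkpoint", Long.toString(seqNos.getLocalCheckpoint()));
            commitData.put("max_seq_no", Long.toString(seqNos.getMaxSeqNo()));
            return commitData.entrySet().iterator();
        });
        writer.commit(); // the Iterable above is consumed here, not at set time
    }
}
```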
* improvements to iterating over commit data (and better safety guarantees)
* Adds sequence number and checkpoint testing for document deletion
intertwined with document indexing.
* improve test code slightly
* Remove caching of max_seq_no in commit data iterator and inline logging
* Adds a test for concurrently indexing and committing segments
to Lucene, ensuring the sequence number related commit data
in each Lucene commit point matches the invariants of
localCheckpoint <= highest sequence number in commit <= maxSeqNo
* fix comments
* addresses code review
* adds clarification on checking commit data on recovery from translog
* remove unneeded method
Right now our unit tests in that area only simulate indexing single documents. As we go forward it should be easy
to add other actions, like delete & bulk indexing. This commit extracts the common parts of the current indexing
logic to a base class to make it easier to extend.
The Setting.timeValue() method uses TimeValue.toString(), which can produce fractional time values. These fractional time values cannot be parsed again by the settings framework.
This commit fixes a method that still used .toString() and replaces it with .getStringRep(). It also changes a second method so that it's no longer up to the caller to decide which stringification method to call.
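A minimal sketch of the round-trip problem (the exact rendered strings depend on how the TimeValue was constructed):
```java
import org.elasticsearch.common.unit.TimeValue;

public class TimeValueRepDemo {
    public static void main(String[] args) {
        TimeValue value = TimeValue.timeValueMillis(90_000);
        // toString() may render a fractional, human-friendly value that the
        // settings framework cannot parse back:
        System.out.println(value.toString());     // "1.5m"
        // getStringRep() preserves the exact duration and unit, so the value
        // round-trips through parsing:
        System.out.println(value.getStringRep()); // "90000ms"
    }
}
```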
Closes #20662
This commit fixes a failing cluster settings test, namely the logger
level update test. The test was incorrectly assuming the default log
level was info, but it could be non-info, for example, if
tests.es.logger.level is set to some non-info level.
Closes #20318
Today we allow system bootstrap checks to be ignored with a
setting. Yet, the system bootstrap checks are as vital to the health of
a production node as the non-system checks (e.g., the original bootstrap
check, the file descriptor check, is critical for reducing the chances
of data loss from the file descriptor limit being set too low). This
commit removes the ability to ignore system bootstrap checks.
Relates #20511
Error messages used to include the invalid ingest configuration field
name, even when it was null. Sometimes this does not make
sense.
e.g.
```[null] Only one of [file], [id], or [inline] may be configure```
vs.
```Only one of [file], [id], or [inline] may be configure```
The above message deals with three fields, so no single property is
responsible.
This change adds a hard limit to `index.number_of_shards` that prevents
indices from being created with more than 1024 shards. This is still
a huge limit and can only be changed via setting a system property.
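A hedged sketch of how such a bounded setting can be expressed, assuming an intSetting overload that takes both a minimum and a maximum; the es.index.max_number_of_shards property name and the default of 5 shards are illustrative:
```java
import org.elasticsearch.common.settings.Setting;
import org.elasticsearch.common.settings.Setting.Property;

public class ShardLimit {
    // Hard upper bound, overridable only through a system property (assumed name):
    static final int MAX_NUMBER_OF_SHARDS =
            Integer.parseInt(System.getProperty("es.index.max_number_of_shards", "1024"));

    // Values outside [1, MAX_NUMBER_OF_SHARDS] are rejected at setting time.
    static final Setting<Integer> INDEX_NUMBER_OF_SHARDS_SETTING =
            Setting.intSetting("index.number_of_shards", 5, 1, MAX_NUMBER_OF_SHARDS,
                    Property.IndexScope);
}
```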
* master: (1199 commits)
[DOCS] Remove non-valid link to mapping migration document
Revert "Default `include_in_all` for numeric-like types to false"
test: add a test with ipv6 address
docs: clarify that both ip4 and ip6 addresses are supported
Include complex settings in settings requests
Add production warning for pre-release builds
Clean up confusing error message on unhandled endpoint
[TEST] Increase logging level in testDelayShards()
change health from string to enum (#20661)
Provide error message when plugin id is missing
Document that sliced scroll works for reindex
Make reindex-from-remote ignore unknown fields
Remove NoopGatewayAllocator in favor of a more realistic mock (#20637)
Remove Marvel character reference from guide
Fix documentation for setting Java I/O temp dir
Update client benchmarks to log4j2
Changes the API of GatewayAllocator#applyStartedShards and (#20642)
Removes FailedRerouteAllocation and StartedRerouteAllocation
IndexRoutingTable.initializeEmpty shouldn't override supplied primary RecoverySource (#20638)
Smoke tester: Adjust to latest changes (#20611)
...
Today when getting settings via an API like the cluster settings API,
complex settings are excluded (e.g.,
discovery.zen.ping.unicast.hosts). This commit adds these settings to
the output of such APIs.
Relates #20622
It currently returns something like:
```
"No feature for name [_siohgjoidfhjfihfg]"
```
This is not the most understandable message; this change makes it a
little more readable.
Resolves #10946
Today when executing the install plugin command without a plugin id, we
end up throwing an NPE because the plugin id is null yet we just keep
going (ultimately we try to look up the null plugin id in a set, the
direct cause of the NPE). This commit modifies the install command so
that a missing plugin id is detected and help is provided to the user.
Relates #20660
reindex-from-remote should ignore unknown fields so it is mostly
future-compatible. This makes it ignore unknown fields by adding an
option to `ObjectParser` and `ConstructingObjectParser` that, if
enabled, causes them to ignore unknown fields.
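A minimal sketch of the new leniency flag; the response type, field name, and `Void` parse context are illustrative, not the actual reindex-from-remote parser:
```java
import org.elasticsearch.common.ParseField;
import org.elasticsearch.common.xcontent.ObjectParser;

public class LenientParsing {
    static class Response {
        String took;
        void setTook(String took) { this.took = took; }
    }

    // Passing true for the ignore-unknown-fields option makes the parser skip
    // keys with no declared handler instead of throwing an exception.
    static final ObjectParser<Response, Void> PARSER =
            new ObjectParser<>("remote_response", true, Response::new);

    static {
        PARSER.declareString(Response::setTook, new ParseField("took"));
    }
}
```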
Closes #20504
Many of our unit tests instantiate an `AllocationService`, which requires having a `GatewayAllocator`. Today almost all of our test use a class called `NoopGatewayAllocator` which does nothing, effectively leaving all shard assignments to the balanced allocator. This is sad as it means we test a system that behaves differently than our production logic in very basic things. For example, a started primary that is lost will be assigned to a node that didn't use to have it.
This PR removes `NoopGatewayAllocator` in favor of a new `TestGatewayAllocator` that inherits the standard `GatewayAllocator` and overrides shard information fetching to return information based on historical assignments the allocator has done. The only exception is `BalanceConfigurationTests`, which tests only the balancer; I opted not to have it work around the `GatewayAllocator` being in its way.
Changes the API of GatewayAllocator#applyStartedShards and
GatewayAllocator#applyFailedShards to take both a RoutingAllocation
and a list of shards to apply. This allows better mock allocators
to be created as being done in #20637.
Closes #20642
Removes the FailedRerouteAllocation class and StartedRerouteAllocation
class, as they were just wrappers for RerouteAllocation that stored
started and failed shards, but these started and failed shards can
be passed in directly to the methods that needed them, removing the
need for these wrapper classes and an extra level of indirection.
Closes #20626
When initializing a new index routing table, we make a decision about where the primary shards should be recovered from. This can be an empty folder for new indices, a set of specific allocation ids for old indices, or a snapshot. We currently allow callers of `IndexRoutingTable.initializeEmpty` to supply the source but also set it automatically if null is given. Sadly, the current logic reuses the supplied parameter to store the result of the automatic decision. This is flawed if some of the decisions should be *different* between the different index shards (as the first decision that is made sticks).
This commit fixes this but also simplifies the API to always make an automatic decision.
This was discovered while working on #20637 which strengthens the testing infra and caused this to bubble up. I put it as a separate commit to make sure it is not lost as part of a bigger test only PR.
Today we hold on to all possible tokenizers, tokenfilters etc. when we create
an index service on a node. This was mainly done to allow the `_analyze` API to
directly access all these primitives. We fixed this in #19827 and can now get rid of
the AnalysisService entirely and replace it with a simple map like class. This
ensures we don't create a gazillion long living objects that are entirely useless since
they are never used in most of the indices. Also those objects might consume a considerable
amount of memory since they might load stopwords or synonyms etc.
Closes #19828
When testing tribe nodes in an integration test, we should pass the classpath
plugins of the node down to the tribe client nodes. Without this the tribe client
nodes could be prevented from communicating with the tribes.
When an active shadow replica is reinitialized during primary promotion, the recovery stats that are used by the allocation decider settings `cluster.routing.allocation.node_concurrent_recoveries` and `cluster.routing.allocation.node_concurrent_incoming_recoveries` have to be updated.
If your native script needs to do some heavy computation on initialization,
the fact that we create a new one for every segment rather than for the whole
index could have a negative performance impact.
This commit changes the default behavior of `_flush` to block if other flushes are ongoing.
This also removes the use of `FlushNotAllowedException`; instead, when blocking is disabled,
the flush simply returns immediately and is skipped. Users who set this option should be aware
that the flush might or might not flush everything to disk, i.e., there is no transactional
behavior of any sort.
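On the Java API this corresponds to the waitIfOngoing flag on FlushRequest; a minimal sketch (the index name is illustrative):
```java
import org.elasticsearch.action.admin.indices.flush.FlushRequest;

public class FlushDemo {
    public static void main(String[] args) {
        // With this change, blocking on concurrent flushes is the default;
        // callers preferring the old skip-if-busy semantics must opt out:
        FlushRequest blocking = new FlushRequest("my-index"); // waits for ongoing flushes
        FlushRequest skipping = new FlushRequest("my-index").waitIfOngoing(false); // skipped if busy
    }
}
```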
Closes #20569
Translog#read is a left-over from realtime-get that allows reading
from an arbitrary location in the transaction log. This method is unused
and can be replaced with snapshots in tests.
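A hedged sketch of the snapshot-based replacement in tests; the method names follow the Translog API of this era and may differ slightly:
```java
import java.io.IOException;
import org.elasticsearch.index.translog.Translog;

class TranslogAssertions {
    // Instead of reading one operation at an arbitrary location, walk all
    // operations through a snapshot and inspect them in order.
    void assertTranslogOperations(Translog translog) throws IOException {
        Translog.Snapshot snapshot = translog.newSnapshot();
        Translog.Operation op;
        while ((op = snapshot.next()) != null) {
            // e.g., match sequence numbers, ids, or operation types here
        }
    }
}
```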
`index.routing.allocation.initial_recovery` is used with index shrinking to make sure the new index's primary is assigned to the node that holds a copy of each of the source index shards. Sadly, the introduction of `RecoverySource` caused a regression that limits the allocation of replicas of the new index.
Today when CLI tools are executed, logging statements can intentionally
or unintentionally be executed when logging is not configured. This
leads to status logger messages complaining that logging is not configured. This
commit reworks logging configuration for CLI tools so that logging is
always configured.
Relates #20575
This commit removes `ByteSizeValue`'s methods that are duplicated (ex: `mbFrac()` and `getMbFrac()`) in order to only keep the `getN` form.
It also renames `mb()` -> `getMb()`, `kb()` -> `getKb()` in order to be more coherent with the `ByteSizeUnit` method names.
Adds a cat API endpoint: /_cat/templates and its more specific version, /_cat/templates/{name}.
It looks something like:
```
$ curl "localhost:9200/_cat/templates?v"
name                  template     order version
sushi_california_roll *avocado*    1     1
pizza_hawaiian        *pineapples* 1
pizza_pepperoni       *pepperoni*  1
```
Filtering by name (only * globs are allowed) looks like:
```
$ curl "localhost:9200/_cat/templates/pizza*"
name            template     order version
pizza_hawaiian  *pineapples* 1
pizza_pepperoni *pepperoni*  1
```
Partially specified columns:
```
$ curl "localhost:9200/_cat/templates/pizza*?v=true&h=name,template"
name            template
pizza_hawaiian  *pineapples*
pizza_pepperoni *pepperoni*
```
The help text:
```
$ curl "localhost:9200/_cat/templates/pizza*?help"
name     | n | template name
template | t | template pattern string
order    | o | template application order number
version  | v | version
```
Closes #20467
This commit adds a new test TribeIT#testClusterStateNodes() to verify that the tribe node correctly reflects the nodes of the remote clusters it is connected to.
It also changes the existing tests so that they really use two remote clusters now.
The IndexResponse#toString method outputs an error caused by the shards object needing to be wrapped in another object. This is fixed by calling a different variant of Strings.toString(XContent), which accepts a second boolean argument that makes sure a new object is created before outputting ShardInfo. I didn't change ShardInfo#toString directly, as whether it needs a new object very much depends on where it is printed out. IndexResponse seemed a specific case: the rest of the info was not JSON, hence the shards object was the first one, but that is usually not the case.
With the unified release process across the elastic stack, download
links for all products are changing. This change updates docs referring
to the old download and packages urls.
Note that this change also updates the plugin installation command as
the url for downloads is being changed to be consistent with that for
packages (both plural).
The serial collector is not suitable for running with a server
application like Elasticsearch and can decimate performance and lead to
cluster instability. This commit adds a bootstrap check to prevent usage
of the serial collector when Elasticsearch is running in production
mode.
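A hedged sketch of what such a check can look like; the check()/errorMessage() shape mirrors the bootstrap-check contract of this era, and the HotSpot VM-option probe is an illustrative detection, not the exact production code:
```java
import com.sun.management.HotSpotDiagnosticMXBean;
import java.lang.management.ManagementFactory;

class SerialCollectorCheck {
    // Returns true when the check fails, i.e., the serial collector is in use.
    public boolean check() {
        HotSpotDiagnosticMXBean bean =
                ManagementFactory.getPlatformMXBean(HotSpotDiagnosticMXBean.class);
        return Boolean.parseBoolean(bean.getVMOption("UseSerialGC").getValue());
    }

    public String errorMessage() {
        return "JVM is using the serial collector but should not be; "
                + "remove -XX:+UseSerialGC from the JVM options";
    }
}
```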
Relates #20558
Today when acquiring a prefix logger for a logger info stream, we obtain
a new prefix logger per invocation. This can lead to contention on the
markers lock in the constructor of PrefixLogger. Usually this is not a
problem (because the vast majority of callers hold on to the logger they
obtain). Unfortunately, under heavy indexing with multiple threads, the
contention on the lock can be devastating. This commit modifies
LoggerInfoStream to hold on to the loggers it obtains to avoid
contending over the lock there.
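A hedged sketch of the caching approach; the logger factory stands in for PrefixLogger construction and the names are illustrative:
```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;
import org.apache.logging.log4j.Logger;

// Cache one logger per Lucene InfoStream component so repeated message()
// calls do not construct (and lock inside) a new prefix logger each time.
class CachingLoggerSource {
    private final Map<String, Logger> loggers = new ConcurrentHashMap<>();
    private final Function<String, Logger> loggerFactory;

    CachingLoggerSource(Function<String, Logger> loggerFactory) {
        this.loggerFactory = loggerFactory; // e.g., component -> new prefix logger
    }

    Logger getLogger(String component) {
        // computeIfAbsent creates each component's logger exactly once,
        // avoiding repeated contention on the lock in the logger constructor.
        return loggers.computeIfAbsent(component, loggerFactory);
    }
}
```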
Relates #20571
* Build: Remove old maven deploy support
This change removes the old maven deploy that we have in parallel to
maven-publish, and makes maven-publish fully work with publishing to
maven local. Using `gradle publishToMavenLocal` should be used to
publish to .m2.
Note that there is an unfortunate hack that means for
zip artifacts we must first create/publish a dummy pom file, and then
follow that with the real pom file. It would be nice to have the pom
file contains packaging=zip, but maven central then requires sources and
javadocs. But our zips are really just attached artifacts, so we already
set the packaging type to pom for our zip files. This change just works
around a limitation of the underlying maven publishing library which
silently skips attached artifacts when the packaging type is set to pom.
Relates #20164, closes #20375
* Remove unnecessary extra spacing
This change removes all guice interaction from Transport, HttpServerTransport,
HttpServer and TransportService. All these classes as well as their subclasses
or extended version configured via plugins are now created by using plain old
bloody java constructors. YAY!
Since #19975 we have been aggressively failing with AssertionError when we catch an ACE
inside the InternalEngine. We treat everything that is neither a tragic event on
the IndexWriter nor on the Translog as a bug and throw an AssertionError. Yet, if the
engine hits an IOException of some sort on refresh and the IW doesn't realize it, since
it's not fully under its control, we fail the engine, but neither IW nor Translog are marked
as failed by a tragic event while they are already closed.
This change takes the `failedEngine` exception into account; if it's set, we know
that the engine failed by some event other than a tragic one and can continue.
This change also uses the `ReferenceManager#RefreshListener` interface in the engine rather
than its concrete implementation.
Relates to #19975