OpenSearch

Commit Graph

Author	SHA1	Message	Date
Daniel Mitterdorfer	016e8760f0	Turn off real-mem breaker in single node tests With this commit we disable the real-memory circuit breaker in tests that inherit from `ESSingleNodeTestCase`. As this breaker is based on real memory usage over which we have no (full) control in tests and their purpose is also not to test the circuit breaker, we use the deterministic circuit breaker implementation that only accounts for explicitly reserved memory. Closes #32047 Relates #32071	2018-07-16 10:40:36 +02:00
Armin Braun	3679d00a74	Replace Ingest ScriptContext with Custom Interface (#32003 ) * Replace Ingest ScriptContext with Custom Interface * Make org.elasticsearch.ingest.common.ScriptProcessorTests#testScripting more precise * Don't mock script factory in ScriptProcessorTests * Adjust mock script plugin in IT for new API	2018-07-13 23:26:10 +02:00
Vladimir Dolzhenko	b1bf643e41	lazy snapshot repository initialization (#31606 ) lazy snapshot repository initialization	2018-07-13 20:05:49 +02:00
Colin Goodheart-Smithe	0edb096eb4	Adds a new auto-interval date histogram (#28993 ) * Adds a new auto-interval date histogram This change adds a new type of histogram aggregation called `auto_date_histogram` where you can specify the target number of buckets you require and it will find an appropriate interval for the returned buckets. The aggregation works by first collecting documents in buckets at second interval, when it has created more than the target number of buckets it merges these buckets into minute interval bucket and continues collecting until it reaches the target number of buckets again. It will keep merging buckets when it exceeds the target until either collection is finished or the highest interval (currently years) is reached. A similar process happens at reduce time. This aggregation intentionally does not support min_doc_count, offest and extended_bounds to keep the already complex logic from becoming more complex. The aggregation accepts sub-aggregations but will always operate in `breadth_first` mode deferring the computation of sub-aggregations until the final buckets from the shard are known. min_doc_count is effectively hard-coded to zero meaning that we will insert empty buckets where necessary. Closes #9572 * Adds documentation * Added sub aggregator test * Fixes failing docs test * Brings branch up to date with master changes * trying to get tests to pass again * Fixes multiBucketConsumer accounting * Collects more buckets than needed on shards This gives us more options at reduce time in terms of how we do the final merge of the buckeets to produce the final result * Revert "Collects more buckets than needed on shards" This reverts commit 993c782d117892af9a3c86a51921cdee630a3ac5. * Adds ability to merge within a rounding * Fixes nonn-timezone doc test failure * Fix time zone tests * iterates on tests * Adds test case and documentation changes Added some notes in the documentation about the intervals that can bbe returned. Also added a test case that utilises the merging of conseecutive buckets * Fixes performance bug The bug meant that getAppropriate rounding look a huge amount of time if the range of the data was large but also sparsely populated. In these situations the rounding would be very low so iterating through the rounding values from the min key to the max keey look a long time (~120 seconds in one test). The solution is to add a rough estimate first which chooses the rounding based just on the long values of the min and max keeys alone but selects the rounding one lower than the one it thinks is appropriate so the accurate method can choose the final rounding taking into account the fact that intervals are not always fixed length. Thee commit also adds more tests * Changes to only do complex reduction on final reduce * merge latest with master * correct tests and add a new test case for 10k buckets * refactor to perform bucket number check in innerBuild * correctly derive bucket setting, update tests to increase bucket threshold * fix checkstyle * address code review comments * add documentation for default buckets * fix typo	2018-07-13 13:08:35 -04:00
Daniel Mitterdorfer	f174f72fee	Circuit-break based on real memory usage With this commit we introduce a new circuit-breaking strategy to the parent circuit breaker. Contrary to the current implementation which only accounts for memory reserved via child circuit breakers, the new strategy measures real heap memory usage at the time of reservation. This allows us to be much more aggressive with the circuit breaker limit so we bump it to 95% by default. The new strategy is turned on by default and can be controlled with the new cluster setting `indices.breaker.total.userealmemory`. Note that we turn it off for all integration tests with an internal test cluster because it leads to spurious test failures which are of no value (we cannot fully control heap memory usage in tests). All REST tests, however, will make use of the real memory circuit breaker. Relates #31767	2018-07-13 10:08:28 +02:00
olcbean	334c255516	XContentTests : Insert random fields at random positions (#30867 ) Currently AbstractXContentTestCase#testFromXContent appends random fields, but in a fixed position. This PR shuffles all fields after the random fields have been appended, hence the random fields are actually added to random positions.	2018-07-12 19:10:51 +02:00
Nik Everett	38e09a1508	Switch test framework to new style requests (#31939 ) In #29623 we added `Request` object flavored requests to the low level REST client and in #30315 we deprecated the old `performRequest`s. This changes all calls in the `test/framework` project to use the new versions.	2018-07-11 10:04:17 -04:00
Armin Braun	b4087d69d2	Fix assertIngestDocument wrongfully passing (#31913 ) * Fix assertIngestDocument wrongfully passing * Previously docA being subset of docB passed because iteration was over docA's keys only * Scalars in nested fields were not compared in all cases * Assertion errors were hard to interpret (message wasn't correct since it only mentioned the class type) * In cases where two paths contained different types a ClassCastException was thrown instead of an AssertionError * Fixes #28492	2018-07-11 10:24:21 +02:00
Alexander Reelsen	1c32497c44	Date: Add DateFormatters class that uses java.time (#31856 ) A newly added class called DateFormatters now contains java.time based builders for dates, which also intends to be fully backwards compatible, when the name based date formatters are picked. Also a new class named CompoundDateTimeFormatter for being able to parse multiple different formats has been added. A duelling test class has been added that ensures the same dates when parsing java or joda time formatted dates for the name based dates. Note, that java.time and joda time are not fully backwards compatible, which also means that old formats will currently not work with this setup.	2018-07-10 09:28:28 +02:00
Yannick Welsch	cce7dc20ad	Smaller aesthetic fixes to InternalTestCluster (#31831 ) Allows cluster to auto-reconfigure faster by starting up nodes in parallel.	2018-07-06 11:42:09 +02:00
Nik Everett	1099060735	Test: Do not remove xpack templates when cleaning (#31642 ) At the end of every `ESRestTestCase` we clean the cluster which includes deleting all of the templates. If xpack is installed it'll automatically recreate a few templates every time they are removed. Which is slow. This change stops the cleanup from removing the xpack templates. It cuts the time to run the docs tests more than in half and it probably saves a bit more time on other tests as well.	2018-07-05 09:43:43 -04:00
Christoph Büscher	bd1c513422	Reduce more raw types warnings (#31780 ) Similar to #31523.	2018-07-05 15:38:06 +02:00
Simon Willnauer	3f2a241b7f	Detach Transport from TransportService (#31727 ) Today TransportService is tightly coupled with Transport since it requires an instance of TransportService in order to receive responses and send requests. This is mainly due to the Request and Response handlers being maintained in TransportService but also because of the lack of a proper callback interface. This change moves request handler registry and response handler registration into Transport and adds all necessary methods to `TransportConnectionListener` in order to remove the `TransportService` dependency from `Transport` Transport now accepts one or more `TransportConnectionListener` instances that are executed sequentially in a blocking fashion.	2018-07-04 11:32:35 +02:00
Yannick Welsch	2bb4f38371	Add writeBlob option to replace existing blob (#31729 ) Adds a new parameter to the BlobContainer#writeBlob methods to specify whether the existing file should be overridden or not. For some metadata files in the repository, we actually want to replace the current file. This is currently implemented through an explicit blob delete and then a fresh write. In case of using a cloud provider (S3, GCS, Azure), this results in 2 API requests instead of just 1. This change will therefore allow us to achieve the same functionality using less API requests.	2018-07-03 09:13:50 +02:00
Christoph Büscher	31aabe4bf9	Clean up double semicolon code typos (#31687 )	2018-07-02 15:14:44 +02:00
Konrad Beiske	2971dd56ca	Enable setting client path prefix to / (#30119 ) Some proxies require all requests to have paths starting with / since there are no relative paths at the HTTP connection level. Elasticsearch assumes paths are absolute. In order to run rest tests against a cluster behind such a proxy, set the system property tests.rest.client_path_prefix to /.	2018-07-01 13:42:03 -04:00
Tanguy Leroux	d8b3f332ef	Remove extra check for object existence in repository-gcs read object (#31661 )	2018-06-29 13:52:31 +02:00
Tanguy Leroux	0ef22db844	[Test] Clean up some repository-s3 tests (#31601 ) This commit removes some tests in the repository-s3 plugin that have not been executed for 2+ years but have been maintained for nothing. Most of the tests in AbstractAwsTestCase were obsolete or superseded by fixture based integration tests.	2018-06-29 13:21:29 +02:00
Ryan Ernst	f924835265	Core: Require all actions have a Task (#31627 ) The TaskManager and TaskAwareRequest could return null when registering a task according to their javadocs, but no implementations ever actually did that. This commit removes that wording from the javadocs and ensures null is no longer allowed.	2018-06-28 08:24:03 -07:00
Alpar Torok	8557bbab28	Upgrade gradle wrapper to 4.8 (#31525 ) * Move to Gradle 4.8 RC1 * Use latest version of plugin The current does not work with Gradle 4.8 RC1 * Switch to Gradle GA * Add and configure build compare plugin * add work-around for https://github.com/gradle/gradle/issues/5692 * work around https://github.com/gradle/gradle/issues/5696 * Make use of Gradle build compare with reference project * Make the manifest more compare friendly * Clear the manifest in compare friendly mode * Remove animalsniffer from buildscript classpath * Fix javadoc errors * Fix doc issues * reference Gradle issues in comments * Conditionally configure build compare * Fix some more doclint issues * fix typo in build script * Add sanity check to make sure the test task was replaced Relates to #31324. It seems like Gradle has an inconsistent behavior and the taks is not always replaced. * Include number of non conforming tasks in the exception. * No longer replace test task, create implicit instead Closes #31324. The issue has full context in comments. With this change the `test` task becomes nothing more than an alias for `utest`. Some of the stand alone tests that had a `test` task now have `integTest`, and a few of them that used to have `integTest` to run multiple tests now only have `check`. This will also help separarate unit/micro tests from integration tests. * Revert "No longer replace test task, create implicit instead" This reverts commit f1ebaf7d93e4a0a19e751109bf620477dc35023c. * Fix replacement of the test task Based on information from gradle/gradle#5730 replace the task taking into account the task providres. Closes #31324. * Only apply build comapare plugin if needed * Make sure test runs before integTest * Fix doclint aftter merge * PR review comments * Switch to Gradle 4.8.1 and remove workaround * PR review comments * Consolidate task ordering	2018-06-28 08:13:21 +03:00
Luca Cavanna	a35b5341c4	[TEST] call yaml client close method from test suite (#31591 ) We added a way to close the yaml test client with #31575. Such close method also needs to be called from the test suite though for the additional clients to be closed.	2018-06-27 08:23:53 +02:00
Luca Cavanna	823a9d34da	[TEST] Close additional clients created while running yaml tests (#31575 ) We recently introduced a mechanism that allows to specify a node selector as part of do sections (see #31471). When a node selector that is not the default one is configured, a new client will be initialized with the same properties as the default one, but with the specified node selector. This commit improves such mechanism but also closing the additional clients being created and adding equals/hashcode impl to the custom node selector as they are cached into a map.	2018-06-26 16:56:35 +02:00
Alpar Torok	08b8d11e30	Add support for switching distribution for all integration tests (#30874 ) * remove left-over comment * make sure of the property for plugins * skip installing modules if these exist in the distribution * Log the distrbution being ran * Don't allow running with integ-tests-zip passed externally * top level x-pack/qa can't run with oss distro * Add support for matching objects in lists Makes it possible to have a key that points to a list and assert that a certain object is present in the list. All keys have to be present and values have to match. The objects in the source list may have additional fields. example: ``` match: { 'nodes.$master.plugins': { name: ingest-attachment } } ``` * Update plugin and module tests to work with other distributions Some of the tests expected that the integration tests will always be ran with the `integ-test-zip` distribution so that there will be no other plugins loaded. With this change, we check for the presence of the plugin without assuming exclusivity. * Allow modules to run on other distros as well To match the behavior of tets.distributions * Add and use a new `contains` assertion Replaces the previus changes that caused `match` to do a partial match. * Implement PR review comments	2018-06-26 06:49:03 -07:00
Nik Everett	232c71b6bf	QA: Create xpack yaml features (#31403 ) This creates a YAML test "features" that indices if the cluster being tested has xpack installed (`xpack`) or if it does not have xpack installed (`no_xpack`). It uses those features to centralize skipping a few tests that fail if xpack is installed. The plan is to use this in a followup to skip docs tests that require xpack when xpack is not installed. We plan to use the declaration of required license level on the docs page to generate the required `skip`. Closes #30933.	2018-06-26 09:26:48 -04:00
Sohaib Iftikhar	ca4c857a90	Improve test times for tests using `RandomObjects::addFields` (#31556 ) Currently RandomObjects::addFields can potentially generate a large number of fields This commit decreases the chances that a new object or array is added as a new branch of an object, which lowers the probability of ending up with very big documents generated. It also reduces the number of documents generated for the SimulatePipelineResponseTests from 10 to 5 to reduce the testing time required for parsing.	2018-06-26 12:39:53 +02:00
Christoph Büscher	86ab3a2d1a	Reduce number of raw types warnings (#31523 ) A first attempt to reduce the number of raw type warnings, most of the time by using the unbounded wildcard.	2018-06-25 15:59:03 +02:00
Jonathan Little	8e4768890a	Migrate scripted metric aggregation scripts to ScriptContext design (#30111 ) * Migrate scripted metric aggregation scripts to ScriptContext design #29328 * Rename new script context container class and add clarifying comments to remaining references to params._agg(s) * Misc cleanup: make mock metric agg script inner classes static * Move _score to an accessor rather than an arg for scripted metric agg scripts This causes the score to be evaluated only when it's used. * Documentation changes for params._agg -> agg * Migration doc addition for scripted metric aggs _agg object change * Rename "agg" Scripted Metric Aggregation script context variable to "state" * Rename a private base class from ...Agg to ...State that I missed in my last commit * Clean up imports after merge	2018-06-25 12:01:33 +01:00
Vladimir Dolzhenko	f04c579203	IndexShard should not return null stats (#31528 ) IndexShard should not return null stats - empty stats or AlreadyCloseException if it's closed is better	2018-06-22 21:08:11 +02:00
Luca Cavanna	16e4e7a7cf	Node selector per client rather than per request (#31471 ) We have made node selectors configurable per request, but all of other language clients don't allow for that. A good reason not to do so, is that having a different node selector per request breaks round-robin. This commit makes NodeSelector configurable only at client initialization. It also improves the docs on this matter, important given that a single node selector can still affect round-robin.	2018-06-22 17:15:29 +02:00
Ryan Ernst	59e7c6411a	Core: Combine messageRecieved methods in TransportRequestHandler (#31519 ) TransportRequestHandler currently contains 2 messageReceived methods, one which takes a Task, and one that does not. The first just delegates to the second. This commit changes all existing implementors of TransportRequestHandler to implement the version which takes Task, thus allowing the class to be a functional interface, and eliminating the need to throw exceptions when a task needs to be ensured.	2018-06-22 07:36:03 -07:00
Yannick Welsch	f22f91c57a	Allow multiple unicast host providers (#31509 ) Introduces support for multiple host providers, which allows the settings based hosts resolver to be treated just as any other UnicastHostsProvider. Also introduces the notion of a HostsResolver so that plugins such as FileBasedDiscovery do not need to create their own thread pool for resolving hosts, making it easier to add new similar kind of plugins.	2018-06-22 15:31:23 +02:00
Yannick Welsch	da69ab28c7	Return transport addresses from UnicastHostsProvider (#31426 ) With #20695 we removed local transport and there is just TransportAddress now. The UnicastHostsProvider currently returns DiscoveryNode instances, where, during pinging, we're actually only making use of the TransportAddress to establish a first connection to the possible new node. To simplify the interface, we can just return a list of transport addresses instead, which means that it's not necessary anymore to create fake node objects in each plugin just to return the address information.	2018-06-21 16:00:26 +02:00
Tim Brooks	86423f9563	Ensure local addresses aren't null (#31440 ) Currently we set local addresses on the creation time of a NioChannel. However, this may return null as the local address may not have been set yet. An example is the local address has not been set on a client channel as the connection process is not yet complete. This PR modifies the getter to set the local field if it is currently null.	2018-06-20 19:50:14 -06:00
Ryan Ernst	00283a61e1	Remove unused generic type for client execute method (#31444 ) This commit removes the request builder generic type for AbstractClient as it was unused.	2018-06-20 16:26:26 -07:00
Tim Brooks	9ab1325953	Introduce http and tcp server channels (#31446 ) Historically in TcpTransport server channels were represented by the same channel interface as socket channels. This was necessary as TcpTransport was parameterized by the channel type. This commit introduces TcpServerChannel and HttpServerChannel classes. Additionally, it adds the implementations for the various transports. This allows server channels to have unique functionality and not implement the methods they do not support (such as send and getRemoteAddress). Additionally, with the introduction of HttpServerChannel this commit extracts some of the storing and closing channel work to the abstract http server transport.	2018-06-20 16:34:56 -06:00
Nhat Nguyen	db1b97fd85	Remove QueryCachingPolicy#ALWAYS_CACHE (#31451 ) The QueryCachingPolicy#ALWAYS_CACHE was deprecated in Lucene-7.4 and will be removed in Lucene-8.0. This change replaces it with QueryCachingPolicy. This also makes INDEX_QUERY_CACHE_EVERYTHING_SETTING visible in testing only.	2018-06-20 10:34:08 -04:00
Tim Brooks	529e704b11	Unify http channels and exception handling (#31379 ) This is a general cleanup of channels and exception handling in http. This commit introduces a CloseableChannel that is a superclass of TcpChannel and HttpChannel. This allows us to unify the closing logic between tcp and http transports. Additionally, the normal http channels are extracted to the abstract server transport. Finally, this commit (mostly) unifies the exception handling between nio and netty4 http server transports.	2018-06-19 11:50:03 -06:00
Ryan Ernst	e67aa96c81	Core: Combine Action and GenericAction (#31405 ) Since #30966, Action no longer has anything but a call to the GenericAction super constructor. This commit renames GenericAction into Action, thus eliminating the Action class. Additionally, this commit removes the Request generic parameter of the class, since it was unused.	2018-06-18 23:53:04 +02:00
Simon Willnauer	3d5f113ada	Ensure we don't use a remote profile if cluster name matches (#31331 ) If we are running into a race condition between a node being configured to be a remote node for cross cluster search etc. and that node joining the cluster we might connect to that node with a remote profile. If that node now joins the cluster it connected to it as a CCS remote node we use the wrong profile and can't use bulk connections etc. anymore. This change uses the remote profile only if we connect to a node that has a different cluster name than the local cluster. This is not a perfect fix for this situation but is the safe option while potentially only loose a small optimization of using less connections per node which is small anyways since we only connect to a small set of nodes. Closes #29321	2018-06-17 13:32:53 +02:00
Tal Levy	3b70e943eb	add is-write-index flag to aliases (#30942 ) This commit adds the is-write-index flag for aliases. It allows requests to set the flag, and responses to display the flag. It does not validate and/or affect any indexing/getting/updating behavior of Elasticsearch -- this will be done in a follow-up PR.	2018-06-15 08:45:29 -07:00
Nhat Nguyen	8453ca638d	Upgrade to Lucene-7.4.0-snapshot-518d303506 (#31360 )	2018-06-15 10:58:21 -04:00
Nik Everett	856936c286	REST Client: NodeSelector for node attributes (#31296 ) Add a `NodeSelector` so that users can filter the nodes that receive requests based on node attributes. I believe we'll need this to backport #30523 and we want it anyway. I also added a bash script to help with rebuilding the sniffer parsing test documents.	2018-06-15 08:04:54 -04:00
Nhat Nguyen	e5b7137508	TEST: getCapturedRequestsAndClear should be atomic (#31312 ) We might lose messages between getCapturedRequestsAndClear calls. This commit makes sure that both getCapturedRequestsAndClear and getCapturedRequestsByTargetNodeAndClear are atomic.	2018-06-14 21:32:07 -04:00
Tim Brooks	fcf1e41e42	Extract common http logic to server (#31311 ) This is related to #28898. With the addition of the http nio transport, we now have two different modules that provide http transports. Currently most of the http logic lives at the module level. However, some of this logic can live in server. In particular, some of the setting of headers, cors, and pipelining. This commit begins this moving in that direction by introducing lower level abstraction (HttpChannel, HttpRequest, and HttpResonse) that is implemented by the modules. The higher level rest request and rest channel work can live entirely in server.	2018-06-14 15:10:02 -06:00
Tanguy Leroux	bbfe1eccc7	[Tests] Mutualize fixtures code in BaseHttpFixture (#31210 ) Many fixtures have similar code for writing the pid & ports files or for handling HTTP requests. This commit adds an AbstractHttpFixture class in the test framework that can be extended for specific testing purposes.	2018-06-14 14:09:56 +02:00
Tanguy Leroux	4d7447cb5e	Reenable Checkstyle's unused import rule (#31270 )	2018-06-14 09:52:46 +02:00
Nik Everett	77bb93557e	Test: Remove broken yml test feature (#31255 ) The `requires_replica` yaml test feature hasn't worked for years. This is what happens if you try to use it: ``` > Throwable #1: java.lang.NullPointerException > at __randomizedtesting.SeedInfo.seed([E6602FB306244B12:6E341069A8D826EA]:0) > at org.elasticsearch.test.rest.yaml.Features.areAllSupported(Features.java:58) > at org.elasticsearch.test.rest.yaml.section.SkipSection.skip(SkipSection.java:144) > at org.elasticsearch.test.rest.yaml.ESClientYamlSuiteTestCase.test(ESClientYamlSuiteTestCase.java:321) ``` None of our tests use it.	2018-06-13 09:33:06 -04:00
Tanguy Leroux	8b4d80ad09	Fix AntFixture waiting condition (#31272 ) The AntFixture waiting condition is evaluated to false but it should be true.	2018-06-13 12:40:22 +02:00
Ryan Ernst	a65b18f19d	Core: Remove plain execute method on TransportAction (#30998 ) TransportAction has many variants of execute. One of those variants executes by returning a future, which is then often blocked on by calling get(). This commit removes this variant of execute, instead using a helper method for tests that want to block, or having tests pass in a PlainActionFuture directly as a listener. Co-authored-by: Simon Willnauer <simonw@apache.org>	2018-06-13 09:58:13 +02:00
Van0SS	d5e8a5cd69	REST high-level client: add Cluster Health API (#29331 ) Relates to #27205	2018-06-12 13:34:06 +02:00
olcbean	7d7ead95b2	Add Get Aliases API to the high-level REST client (#28799 ) Given the weirdness of the response returned by the get alias API, we went for a client specific response, which allows us to hold the error message, exception and status returned as part of the response together with aliases. See #30536 . Relates to #27205	2018-06-12 10:26:17 +02:00
Nik Everett	0d9b78834f	LLClient: Support host selection (#30523 ) Allows users of the Low Level REST client to specify which hosts a request should be run on. They implement the `NodeSelector` interface or reuse a built in selector like `NOT_MASTER_ONLY` to chose which nodes are valid. Using it looks like: ``` Request request = new Request("POST", "/foo/_search"); RequestOptions options = request.getOptions().toBuilder(); options.setNodeSelector(NodeSelector.NOT_MASTER_ONLY); request.setOptions(options); ... ``` This introduces a new `Node` object which contains a `HttpHost` and the metadata about the host. At this point that metadata is just `version` and `roles` but I plan to add node attributes in a followup. The canonical way to get this metadata is to use the `Sniffer` to pull the information from the Elasticsearch cluster. I've marked this as "breaking-java" because it breaks custom implementations of `HostsSniffer` by renaming the interface to `NodesSniffer` and by changing it from returning a `List<HttpHost>` to a `List<Node>`. It shouldn't break anyone else though. Because we expect to find it useful, this also implements `host_selector` support to `do` statements in the yaml tests. Using it looks a little like: ``` --- "example test": - skip: features: host_selector - do: host_selector: version: " - 7.0.0" # same syntax as skip apiname: something: true ``` The `do` section parses the `version` string into a host selector that uses the same version comparison logic as the `skip` section. When the `do` section is executed it passed the off to the `RestClient`, using the `ElasticsearchHostsSniffer` to sniff the required metadata. The idea is to use this in mixed version tests to target a specific version of Elasticsearch so we can be sure about the deprecation logging though we don't currently have any examples that need it. We do, however, have at least one open pull request that requires something like this to properly test it. Closes #21888	2018-06-11 17:07:27 -04:00
Nhat Nguyen	dda56fc0fc	Move ESIndexLevelReplicationTestCase to test framework (#31243 ) Other components might benefit from the testing infra provided by ESIndexLevelReplicationTestCase. This commit moves it to the test framework.	2018-06-11 12:47:38 -04:00
Lee Hinman	c064b507df	Encapsulate Translog in Engine (#31220 ) This removes the abstract `getTranslog` method in `Engine`, instead leaving it to the abstract implementations of the other methods that use the translog. This allows future Engines not to have a Translog, as instead they must implement the methods that use the translog pieces to return necessary values.	2018-06-11 09:44:50 -06:00
Tanguy Leroux	bf58660482	Remove all unused imports and fix CRLF (#31207 ) The X-Pack opening and the recent other refactorings left a lot of unused imports in the codebase. This commit removes them all.	2018-06-11 15:12:12 +02:00
Jason Tedor	65c107b47d	Fix unknown licenses (#31223 ) The goal of this commit is to address unknown licenses when producing the dependencies info report. We have two different checks that we run on licenses. The first check is whether or not we have stashed a copy of the license text for a dependency in the repository. The second is to map every dependency to a license type (e.g., BSD 3-clause). The problem here is that the way we were handling licenses in the second check differs from how we handle licenses in the first check. The first check works by finding a license file with the name of the artifact followed by the text -LICENSE.txt. Yet in some cases we allow mapping an artifact name to another name used to check for the license (e.g., we map lucene-.* to lucene, and opensaml-.* to shibboleth. The second check understood the first way of looking for a license file but not the second way. So in this commit we teach the second check about the mappings from artifact names to license names. We do this by copying the configuration from the dependencyLicenses task to the dependenciesInfo task and then reusing the code from the first check in the second check. There were some other challenges here though. For example, dependenciesInfo was checking too many dependencies. For now, we should only be checking direct dependencies and leaving transitive dependencies from another org.elasticsearch artifact to that artifact (we want to do this differently in a follow-up). We also want to disable dependenciesInfo for projects that we do not publish, users only care about licenses they might be exposed to if they use our assembled products. With all of the changes in this commit we have eliminated all unknown licenses. A follow-up will enforce that when we add a new dependency it does not get mapped to unknown, these will be forbidden in the future. Therefore, with this change and earlier changes are left having no unknown licenses and two custom licenses; custom here means it does not map to an SPDX license type. Those two licenses are xz and ldapsdk. A future change will not allow additional custom licenses unless they are explicitly whitelisted. This ensures that if a new dependency is added it is mapped to an SPDX license or mapped to custom because it does not have an SPDX license.	2018-06-09 07:28:41 -04:00
Lee Hinman	bdb0fb2555	Fully encapsulate LocalCheckpointTracker inside of the engine (#31213 ) * Fully encapsulate LocalCheckpointTracker inside of the engine This makes the Engine interface not expose the `LocalCheckpointTracker`, instead exposing the pieces needed (like retrieving the local checkpoint) as individual methods.	2018-06-08 17:19:41 -06:00
Julie Tibshirani	00b0e10063	Remove DocumentFieldMappers#simpleMatchToFullName. (#31041 ) * Remove DocumentFieldMappers#simpleMatchToFullName, as it is duplicative of MapperService#simpleMatchToIndexNames. * Rename MapperService#simpleMatchToIndexNames -> simpleMatchToFullName for consistency. * Simplify EsIntegTestCase#assertConcreteMappingsOnAll to accept concrete fields instead of wildcard patterns.	2018-06-08 13:53:35 -07:00
Jason Tedor	e481b860a1	Enable engine factory to be pluggable (#31183 ) This commit enables the engine factory to be pluggable based on index settings used when creating the index service for an index.	2018-06-07 17:01:06 -04:00
Tanguy Leroux	b5f05f676c	Remove BlobContainer.move() method (#31100 ) closes #30680	2018-06-07 10:48:31 +02:00
Tim Brooks	67e73b4df4	Combine accepting selector and socket selector (#31115 ) This is related to #27260. This commit combines the AcceptingSelector and SocketSelector classes into a single NioSelector. This change allows the same selector to handle both server and socket channels. This is valuable as we do not necessarily want a dedicated thread running for accepting channels. With this change, this commit removes the configuration for dedicated accepting selectors for the normal transport class. The accepting workload for new node connections is likely low, meaning that there is no need to dedicate a thread to this process.	2018-06-06 11:59:54 -06:00
Yannick Welsch	1dca00deb9	Remove extra checks from HdfsBlobContainer (#31126 ) This commit saves one network roundtrip when reading or deleting files from an HDFS repository.	2018-06-06 16:38:37 +02:00
Tanguy Leroux	9531b7bbcb	Add BlobContainer.writeBlobAtomic() (#30902 ) This commit adds a new writeBlobAtomic() method to the BlobContainer interface that can be implemented by repository implementations which support atomic writes operations. When the BlobContainer implementation does not provide a specific implementation of writeBlobAtomic(), then the writeBlob() method is used. Related to #30680	2018-06-05 13:00:43 +02:00
Christoph Büscher	3f87c79500	Change ObjectParser exception (#31030 ) ObjectParser should throw XContentParseExceptions, not IAE. A dedicated parsing exception can includes the place where the error occurred. Closes #30605	2018-06-04 20:20:37 +02:00
Jason Tedor	be55da18c2	Enable customizing REST tests blacklist (#31074 ) This commit enables adding additional REST tests to the blacklist for builds that already define tests.rest.blacklist.	2018-06-04 13:35:49 -04:00
Boaz Leskes	a7ceefe93f	Make Persistent Tasks implementations version and feature aware (#31045 ) With #31020 we introduced the ability for transport clients to indicate what features they support in order to make sure we don't serialize object to them they don't support. This PR adapts the serialization logic of persistent tasks to be aware of those features and not serialize tasks that aren't supported. Also, a version check is added for the future where we may add new tasks implementations and need to be able to indicate they shouldn't be serialized both to nodes and clients. As the implementation relies on the interface of `PersistentTaskParams`, these are no longer optional. That's acceptable as all current implementation have them and we plan to make `PersistentTaskParams` more central in the future. Relates to #30731	2018-06-03 21:51:08 +02:00
Boaz Leskes	65d3f0efca	Adapt transport tests for the extra byte introduced in #31020 We now serialize a feature array, which takes an extra byte when empty.	2018-06-02 13:04:40 +02:00
Jason Tedor	4522b57e07	Introduce client feature tracking (#31020 ) This commit introduces the ability for a client to communicate to the server features that it can support and for these features to be used in influencing the decisions that the server makes when communicating with the client. To this end we carry the features from the client to the underlying stream as we carry the version of the client today. This enables us to enhance the logic where we make protocol decisions on the basis of the version on the stream to also make protocol decisions on the basis of the features on the stream. With such functionality, the client can communicate to the server if it is a transport client, or if it has, for example, X-Pack installed. This enables us to support rolling upgrades from the OSS distribution to the default distribution without breaking client connectivity as we can now elect to serialize customs in the cluster state depending on whether or not the client reports to us using the feature capabilities that it can under these customs. This means that we would avoid sending a client pieces of the cluster state that it can not understand. However, we want to take care and always send the full cluster state during node-to-node communication as otherwise we would end up with different understanding of what is in the cluster state across nodes depending on which features they reported to have. This is why when deciding whether or not to write out a custom we always send the custom if the client is not a transport client and otherwise do not send the custom if the client is transport client that does not report to have the feature required by the custom. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2018-06-01 11:45:35 -04:00
Jim Ferenczi	0791f93dbd	Add an option to split keyword field on whitespace at query time (#30691 ) This change adds an option named `split_queries_on_whitespace` to the `keyword` field type. When set to true full text queries (`match`, `multi_match`, `query_string`, ...) that target the field will split the input on whitespace to build the query terms. Defaults to `false`. Closes #30393	2018-06-01 09:47:03 +02:00
Nik Everett	b225f5e5c6	HLRest: Allow caller to set per request options (#30490 ) This modifies the high level rest client to allow calling code to customize per request options for the bulk API. You do the actual customization by passing a `RequestOptions` object to the API call which is set on the `Request` that is generated by the high level client. It also makes the `RequestOptions` a thing in the low level rest client. For now that just means you use it to customize the headers and the `httpAsyncResponseConsumerFactory` and we'll add node selectors and per request timeouts in a follow up. I only implemented this on the bulk API because it is the first one in the list alphabetically and I wanted to keep the change small enough to review. I'll convert the remaining APIs in a followup.	2018-05-31 13:59:52 -04:00
Ryan Ernst	46e8d97813	Core: Remove RequestBuilder from Action (#30966 ) This commit removes the RequestBuilder generic type from Action. It was needed to be used by the newRequest method, which in turn was used by client.prepareExecute. Both of these methods are now removed, along with the existing users of prepareExecute constructing the appropriate builder directly.	2018-05-31 16:15:00 +02:00
Martijn van Groningen	544822c78b	Moved keyword tokenizer to analysis-common module (#30642 ) Relates to #23658	2018-05-29 19:22:28 +02:00
Vladimir Dolzhenko	81eb8ba0f0	Include size of snapshot in snapshot metadata (#29602 ) Include size of snapshot in snapshot metadata Adds difference of number of files (and file sizes) between prev and current snapshot. Total number/size reflects total number/size of files in snapshot. Closes #18543	2018-05-25 21:04:50 +02:00
Martijn van Groningen	ae2f021f1c	Move score script context from SearchScript to its own class (#30816 )	2018-05-25 07:17:50 +02:00
Tim Brooks	e8b70273c1	Remove Throwable usage from transport modules (#30845 ) Currently nio and netty modules use the CompletableFuture class for managing listeners. This is unfortunate as that class accepts Throwable. This commit adds a class CompletableContext that wraps the CompletableFuture but does not accept Throwable. This allows the modification of netty and nio logic to no longer handle Throwable.	2018-05-24 17:33:29 -06:00
David Turner	ff0b6c795a	Decouple ClusterStateTaskListener & ClusterApplier (#30809 ) Today, the `ClusterApplier` and `MasterService` both use the `ClusterStateTaskListener` interface to notify their callers when asynchronous activities have completed. However, this is not wholly appropriate: none of the callers into the `ClusterApplier` care about the `ClusterState` arguments that they receive. This change introduces a dedicated ClusterApplyListener interface for callers into the `ClusterApplier`, to distinguish these listeners from the real `ClusterStateTaskListener`s that are waiting for responses from the `MasterService`.	2018-05-24 09:05:09 +01:00
Tim Brooks	d7040ad7b4	Reintroduce mandatory http pipelining support (#30820 ) This commit reintroduces `31251c9` and `63a5799`. These commits introduced a memory leak and were reverted. This commit brings those commits back and fixes the memory leak by removing unnecessary retain method calls.	2018-05-23 14:38:52 -06:00
Colin Goodheart-Smithe	4fd0a3e492	Revert "Make http pipelining support mandatory (#30695 )" (#30813 ) This reverts commit `31251c9` introduced in #30695. We suspect this commit is causing the OOME's reported in #30811 and we will use this PR to test this assertion.	2018-05-23 10:54:46 -06:00
Yannick Welsch	30b004f582	Use original settings on full-cluster restart (#30780 ) When doing a node restart using the test framework, the restarted node does not only use the settings provided to the original node, but also additional settings provided by plugin extensions, which does not correspond to the settings that a node would have on a true restart.	2018-05-23 09:02:01 +02:00
Tim Brooks	63a5799526	Remove http pipelining from integration test case (#30788 ) This is related to #29500. We are removing the ability to disable http pipelining. This PR removes the references to disabling pipelining in the integration test case.	2018-05-22 17:18:05 -06:00
Luca Cavanna	a17d6cab98	Replace Request#setHeaders with addHeader (#30588 ) Adding headers rather than setting them all at once seems more user-friendly and we already do it in a similar way for parameters (see Request#addParameter).	2018-05-22 20:32:30 +02:00
Nhat Nguyen	1918a30237	Upgrade to Lucene-7.4.0-snapshot-cc2ee23050 (#30778 ) The new snapshot includes LUCENE-8324 which fixes missing checkpoint after a fully deletes segment is dropped on flush. This snapshot should resolves failed tests in the CorruptedFileIT suite. Closes #30741 Closes #30577	2018-05-22 13:11:48 -04:00
Tim Brooks	31251c9a6d	Make http pipelining support mandatory (#30695 ) This is related to #29500 and #28898. This commit removes the abilitiy to disable http pipelining. After this commit, any elasticsearch node will support pipelined requests from a client. Additionally, it extracts some of the http pipelining work to the server module. This extracted work is used to implement pipelining for the nio plugin.	2018-05-22 09:29:31 -06:00
Tim Brooks	abf8c56a37	Remove logging from elasticsearch-nio jar (#30761 ) This is related to #27260. The elasticsearch-nio jar is supposed to be a library opposed to a framework. Currently it internally logs certain exceptions. This commit modifies it to not rely on logging. Instead exception handlers are passed by the applications that use the jar.	2018-05-21 20:18:12 -06:00
Nhat Nguyen	67d8fc222d	Upgrade to Lucene-7.4.0-snapshot-59f2b7aec2 (#30726 ) This snapshot resolves issues related to ShrinkIndexIT.	2018-05-18 18:21:39 -04:00
Ryan Ernst	b3f3a4312b	Plugins: Remove meta plugins (#30670 ) Meta plugins existed only for a short time, in order to enable breaking up x-pack into multiple plugins. However, now that x-pack is no longer installed as a plugin, the need for them has disappeared. This commit removes the meta plugins infrastructure.	2018-05-18 10:56:08 -07:00
Adrien Grand	28d4685d72	Mitigate date histogram slowdowns with non-fixed timezones. (#30534 ) Date histograms on non-fixed timezones such as `Europe/Paris` proved much slower than histograms on fixed timezones in #28727. This change mitigates the issue by using a fixed time zone instead when shard data doesn't cross a transition so that all timestamps share the same fixed offset. This should be a common case with daily indices. NOTE: Rewriting the aggregation doesn't work since the timezone is then also used on the coordinating node to create empty buckets, which might be out of the range of data that exists on the shard. NOTE: In order to be able to get a shard context in the tests, I reused code from the base query test case by creating a new parent test case for both queries and aggregations: `AbstractBuilderTestCase`. Mitigates #28727	2018-05-16 17:06:52 +02:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594 ) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
Van0SS	4478f10a2a	Rest High Level client: Add List Tasks (#29546 ) This change adds a `listTasks` method to the high level java ClusterClient which allows listing running tasks through the task management API. Related to #27205	2018-05-16 13:31:37 +02:00
Nik Everett	9b47e0508b	Fix compilation of test framework tests We accidentally broke the compilation of the test frameworks tests. All better now.	2018-05-15 22:56:41 -04:00
Jason Tedor	25c823da09	Skip shard deprecation messages in REST tests (#30630 ) A 6.x node can send a deprecation message that the default number of shards will change from five to one in 7.0.0. In a mixed cluster, whether or not a create index request sees five or one shard and produces a deprecation message depends on the version of the master node. This means that during BWC tests a test can see this deprecation message depending on the version of the master node. In 6.x when we introduced this deprecation message we assumed that whereever we see this deprecation message is expected. However, in a mixed cluster test we need a similar mechanism but it would only apply if the version of the master node is earlier than 7.0.0. This commit takes advantage of a recent change to expose the version of the master node to do sections of REST tests. With this in hand, we can skip asserting on the deprecation message if the version of the master node is before 7.0.0 and otherwise seeing that deprecation message would be completely unexpected.	2018-05-15 21:07:32 -04:00
Tim Brooks	99b9ab58e2	Add nio http server transport (#29587 ) This commit is related to #28898. It adds an nio driven http server transport. Currently it only supports basic http features. Cors, pipeling, and read timeouts will need to be added in future PRs.	2018-05-15 16:37:14 -06:00
Jason Tedor	abc06d5b79	Expose master version in REST test context (#30623 ) This commit exposes the master version to the REST test context. This will be needed in a follow-up where the master version will be used to determine whether or not a certain warning header is expected.	2018-05-15 17:26:43 -04:00
Nik Everett	869b639d14	QA: System property to override distribution (#30591 ) This configures all `qa` projects to use the distribution contained in the `tests.distribution` system property if it is set. The goal is to create a simple way to run tests against the default distribution which has x-pack basic features enabled while not forcing these tests on all contributors. You run these tests by doing something like: ``` ./gradlew -p qa -Dtests.distribution=zip check ``` or ``` ./gradlew -p qa -Dtests.distribution=zip bwcTest ``` x-pack basic shouldn't get in the way of any of these tests but nothing is ever perfect so this we have to disable a few when running with the zip distribution.	2018-05-15 17:16:16 -04:00
Julie Tibshirani	4f9dd37169	Add support for search templates to the high-level REST client. (#30473 )	2018-05-15 13:07:58 -07:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Martijn van Groningen	7b95470897	Moved tokenizers to analysis common module (#30538 ) The following tokenizers were moved: classic, edge_ngram, letter, lowercase, ngram, path_hierarchy, pattern, thai, uax_url_email and whitespace. Left keyword tokenizer factory in server module, because normalizers directly depend on it.This should be addressed on a follow up change. Relates to #23658	2018-05-14 07:55:01 +02:00
Yannick Welsch	fc870fdb4c	Use simpler write-once semantics for HDFS repository (#30439 ) There's no need for an extra `blobExists()` call when writing a blob to the HDFS service. The writeBlob implementation for the HDFS repository already uses the `CreateFlag.CREATE` option on the file creation, which ensures that the blob that's uploaded does not already exist. This saves one network roundtrip.	2018-05-11 09:50:37 +02:00
Jay Modi	f733de8e67	Security: fix TokenMetaData equals and hashcode (#30347 ) The TokenMetaData equals method compared byte arrays using `.equals` on the arrays themselves, which is the equivalent of an `==` check. This means that a seperate byte[] with the same contents would not be considered equivalent to the existing one, even though it should be. The method has been updated to use `Array#equals` and similarly the hashcode method has been updated to call `Arrays#hashCode` instead of calling hashcode on the array itself.	2018-05-10 13:12:11 -06:00
Jason Tedor	bf2365d13b	Remove BWC repository test (#30500 ) This commit removes a test that we can not restore from 1.x and 2.x repository files. This test is not needed, the version of Elasticsearch that this commit targets can not even read index files from those versions.	2018-05-09 23:24:54 -04:00
Nik Everett	f9dc86836d	Docs: Test examples that recreate lang analyzers (#29535 ) We have a pile of documentation describing how to rebuild the built in language analyzers and, previously, our documentation testing framework made sure that the examples successfully built an analyzer but they didn't assert that the analyzer built by the documentation matches the built in anlayzer. Unsuprisingly, some of the examples aren't quite right. This adds a mechanism that tests that the analyzers built by the docs. The mechanism is fairly simple and brutal but it seems to be working: build a hundred random unicode sequences and send them through the `_analyze` API with the rebuilt analyzer and then again through the built in analyzer. Then make sure both APIs return the same results. Each of these calls to `_anlayze` takes about 20ms on my laptop which seems fine.	2018-05-09 09:23:10 -04:00
Stéphane Campinas	2f8905839f	Correct wording in log message (#30336 )	2018-05-07 12:00:06 +02:00
Tanguy Leroux	1987d6261f	Do not fail snapshot when deleting a missing snapshotted file (#30332 ) When deleting or creating a snapshot for a given shard, elasticsearch usually starts by listing all the existing snapshotted files in the repository. Then it computes a diff and deletes the snapshotted files that are not needed anymore. During this deletion, an exception is thrown if the file to be deleted does not exist anymore. This behavior is challenging with cloud based repository implementations like S3 where a file that has been deleted can still appear in the bucket for few seconds/minutes (because the deletion can take some time to be fully replicated on S3). If the deleted file appears in the listing of files, then the following deletion will fail with a NoSuchFileException and the snapshot will be partially created/deleted. This pull request makes the deletion of these files a bit less strict, ie not failing if the file we want to delete does not exist anymore. It introduces a new BlobContainer.deleteIgnoringIfNotExists() method that can be used at some specific places where not failing when deleting a file is considered harmless. Closes #28322	2018-05-07 09:35:55 +02:00
Jim Ferenczi	dbd857341f	Upgrade to 7.4.0-snapshot-1ed95c097b (#30357 ) Upgrade to lucene-7.4.0-snapshot-1ed95c097b This version contains: * An Analyzer for Korean * An IntervalQuery and IntervalsSource that retrieve minimum intervals of positional queries. * A new API to retrieve matches (offsets and positions) of a query for a single document. * Support for soft deletes in the index writer. * A fixed shingle filter that handles index time synonyms. * Support for emoji sequence in ICUTokenizer (with an upgrade to icu 61.1)	2018-05-04 11:44:22 +02:00
Zachary Tong	3c2d2a7d4a	Fix NPE when CumulativeSum agg encounters null/empty bucket (#29641 ) Fix NPE when CumulativeSum agg encounters null/empty bucket If the cusum agg encounters a null value, it's because the value is missing (like the first value from a derivative agg), the path is not valid, or the bucket in the path was empty. Previously cusum would just explode on the null, but this changes it so we only increment the sum if the value is non-null and finite. This is safe because even if the cusum encounters all null or empty buckets, the cumulative sum is still zero (like how the sum agg returns zero even if all the docs were missing values) I went ahead and tweaked AggregatorTestCase to allow testing pipelines, so that I could delete the IT test and reimplement it as AggTests. Closes #27544	2018-05-02 12:22:55 -07:00
Ryan Ernst	fb0aa562a5	Network: Remove http.enabled setting (#29601 ) This commit removes the http.enabled setting. While all real nodes (started with bin/elasticsearch) will always have an http binding, there are many tests that rely on the quickness of not actually needing to bind to 2 ports. For this case, the MockHttpTransport.TestPlugin provides a dummy http transport implementation which is used by default in ESIntegTestCase. closes #12792	2018-05-02 11:42:05 -07:00
Ryan Ernst	f0e92676b1	Tests: Simplify VersionUtils released version splitting (#30322 ) This commit refactors VersionUtils.resolveReleasedVersions to be simpler, and in the process fixes the behavior to match that of VersionCollection.groovy. closes #30133	2018-05-02 09:29:35 -07:00
Adrien Grand	368ddc408f	Remove MapperService#types(). (#29617 ) This isn't be necessary with a single type per index.	2018-05-02 11:35:12 +02:00
Boaz Leskes	4a537ef03c	Bulk operation fail to replicate operations when a mapping update times out (#30244 ) Starting with the refactoring in https://github.com/elastic/elasticsearch/pull/22778 (released in 5.3) we may fail to properly replicate operation when a mapping update on master fails. If a bulk operations needs a mapping update half way, it will send a request to the master before continuing to index the operations. If that request times out or isn't acked (i.e., even one node in the cluster didn't process it within 30s), we end up throwing the exception and aborting the entire bulk. This is a problem because all operations that were processed so far are not replicated any more to the replicas. Although these operations were never "acked" to the user (we threw an error) it cause the local checkpoint on the replicas to lag (on 6.x) and the primary and replica to diverge. This PR does a couple of things: 1) Most importantly, treat any mapping update failure as a document level failure, meaning only the relevant indexing operation will fail. 2) Removes the mapping update callbacks from `IndexShard.applyIndexOperationOnPrimary` and similar methods for simpler execution. We don't use exceptions any more when a mapping update was successful. I think we need to do more work here (the fact that a single slow node can prevent those mappings updates from being acked and thus fail operations is bad), but I want to keep this as small as I can (it is already too big).	2018-05-01 08:15:02 +02:00
Nik Everett	50945051b6	HTML5ify Javadoc for core and test framework (#30234 ) `javadoc` will switch from detaulting to html4 to html5 in "a future release". We should get ahead of it so we're not surprised. Also, HTML5 is the future! Er, the present. Anyway, this follows up from #30220 to make the Javadoc for two of the four remaining projects HTML5 compatible.	2018-04-30 09:39:50 -04:00
Martijn van Groningen	621a1935b8	test: also assert deprecation warning after clusters have been closed.	2018-04-19 09:20:04 +02:00
Ryan Ernst	1cb3a9d9dc	Test: Guard deprecation check when 0 nodes created The internal test cluster can sometimes have 0 nodes. In this situation, the http.enabled flag will never be read, and thus no deprecation warning will be emitted. This commit guards the deprecation warning check in this case.	2018-04-18 21:18:56 -07:00
Ryan Ernst	98d776edaf	Networking: Deprecate http.enabled setting (#29591 ) This commit deprecates the http.enabled, in preparation for removing the feature in 7.0. relates #12792	2018-04-18 17:36:09 -07:00
Julie Tibshirani	c8209fa7b1	Fix the assertion message for an incorrect current version. (#29572 )	2018-04-17 19:27:02 -07:00
Nhat Nguyen	45c6c20467	Enforce translog access via engine (#29542 ) Today the translog of an engine is exposed and can be accessed directly. While this exposure offers much flexibility, it also causes these troubles: - Inconsistent behavior between translog method and engine method. For example, rolling a translog generation via an engine also trims unreferenced files, but translog's method does not. - An engine does not get notified when critical errors happen in translog as the access is direct. This change isolates translog of an engine and enforces all accesses to translog via the engine.	2018-04-17 08:03:41 -04:00
Jason Tedor	1dd0fd4874	Deprecate the index thread pool (#29540 ) The index thread pool is no longer needed as its primary use-case for single-document indexing requests has been relieved now that single-document indexing requests are converted to bulk indexing requests (with a single document payload).	2018-04-17 06:47:30 -04:00
olcbean	b3e3b80f1b	REST high-level client: add support for Indices Update Settings API [take 2] (#29327 ) Relates to #27205	2018-04-16 21:39:11 +02:00
Simon Willnauer	eab530ce11	Ensure flush happens on shard idle This adds 2 testcases that test if a shard goes idle pending (uncommitted) segments are committed and unreferenced files will be freed. Relates to #29482	2018-04-13 15:06:51 +02:00
Nhat Nguyen	f96e00badf	Add primary term to translog header (#29227 ) This change adds the current primary term to the header of the current translog file. Having a term in a translog header is a prerequisite step that allows us to trim translog operations given the max valid seq# for that term. This commit also updates tests to conform the primary term invariant which guarantees that all translog operations in a translog file have its terms at most the term stored in the translog header.	2018-04-12 13:57:59 -04:00
Lee Hinman	d72d3f996e	Add a helper method to get a random java.util.TimeZone (#29487 ) * Add a helper method to get a random java.util.TimeZone This adds a helper method to ESTestCase that returns a randomized `java.util.TimeZone`. This can be used when transitioning code from Joda to the JDK's time classes.	2018-04-12 11:56:42 -06:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
tomcallahan	2574064e66	Enable rest tests via IDEs (#29439 ) Currently rest-based tests do not work from the IDE, as the security manager is configured to permit certain network operations when using the snapshot jars compiled by gradle. We have an existing workaround that explicitly associates a codebase with the path from which the classes are loaded (in this case, the IDE build directory). This PR adds the rest client to this workaround list.	2018-04-10 09:08:58 -04:00
Lee Hinman	a07ba9e400	Move Streams.copy into elasticsearch-core and make a multi-release jar (#29322 ) * Move Streams.copy into elasticsearch-core and make a multi-release jar This moves the method `Streams.copy(InputStream in, OutputStream out)` into the `elasticsearch-core` project (inside the `o.e.core.internal.io` package). It also makes this class into a multi-release class where the Java 9 equivalent uses `InputStream#transferTo`. This is a followup from https://github.com/elastic/elasticsearch/pull/29300#discussion_r178147495	2018-04-06 11:07:20 -06:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Colin Goodheart-Smithe	55c8e80532	Fixes query_string query equals timezone check (#29406 ) * Fixes query_string query equals timezone check This change fixes a bug where two `QueryStringQueryBuilder`s were found to be equal if they had the same timezone set even if the query string in the builders were different Closes #29403 * Adds mutate function to QueryStringQueryBuilderTests * iter	2018-04-06 11:45:34 +01:00
Adrien Grand	569d0c0e89	Improve similarity integration. (#29187 ) This improves the way similarities are plugged in in order to: - reject the classic similarity on 7.x indices and emit a deprecation warning otherwise - reject unkwown parameters on 7.x indices and emit a deprecation warning otherwise Even though this breaks the plugin API, I'd like to backport to 7.x so that users can get deprecation warnings when they are doing something that will become unsupported in the future. Closes #23208 Closes #29035	2018-04-03 16:45:25 +02:00
Adrien Grand	3bdfc8f3fb	Upgrade to lucene-7.3.0-snapshot-98a6b3d. (#29298 ) Most notable changes include: - this release doesn't have the 7.2.1 version constant so I had to create one - spatial4j and jts were upgraded	2018-04-03 09:27:14 +02:00
Lee Hinman	6b2167f462	Begin moving XContent to a separate lib/artifact (#29300 ) * Begin moving XContent to a separate lib/artifact This commit moves a large portion of the XContent code from the `server` project to the `libs/xcontent` project. For the pieces that have been moved, some helpers have been duplicated to allow them to be decoupled from ES helper classes. In addition, `Booleans` and `CheckedFunction` have been moved to the `elasticsearch-core` project. This decoupling is a move so that we can eventually make things like the high-level REST client not rely on the entire ES jar, only the parts it needs. There are some pieces that are still not decoupled, in particular some of the XContent tests still remain in the server project, this is because they test a large portion of the pluggable xcontent pieces through `XContentElasticsearchException`. They may be decoupled in future work. Additionally, there may be more piecese that we want to move to the xcontent lib in the future that are not part of this PR, this is a starting point. Relates to #28504	2018-04-02 15:58:31 -06:00
Mayya Sharipova	e70cd35bda	Revert "REST high-level client: add support for Indices Update Settings API (#28892 )" (#29323 ) This reverts commit `b67b5b1bbd`.	2018-03-30 16:26:46 -07:00
Andy Bristol	b7e6fb9ac5	[test] remove Streamable serde assertions (#29307 ) Removes a set of assertions in the test framework that verified that Streamable objects could be serialized and deserialized across different versions. When this was discussed the consensus was that this approach has not caught many bugs in a long time and that serialization testing of objects was best left to their respective unit and integration tests. This commit also removes a transport interceptor that was used in ESIntegTestCase tests to make these assertions about objects coming in or off the wire.	2018-03-30 14:09:26 -07:00
olcbean	b67b5b1bbd	REST high-level client: add support for Indices Update Settings API (#28892 ) Relates to #27205	2018-03-30 10:53:29 +02:00
Jason Tedor	4ef3de40bc	Fix handling of bad requests (#29249 ) Today we have a few problems with how we handle bad requests: - handling requests with bad encoding - handling requests with invalid value for filter_path/pretty/human - handling requests with a garbage Content-Type header There are two problems: - in every case, we give an empty response to the client - in most cases, we leak the byte buffer backing the request! These problems are caused by a broader problem: poor handling preparing the request for handling, or the channel to write to when the response is ready. This commit addresses these issues by taking a unified approach to all of them that ensures that: - we respond to the client with the exception that blew us up - we do not leak the byte buffer backing the request	2018-03-28 16:25:01 -04:00
Simon Willnauer	13e19e7428	Allow _update and upsert to read from the transaction log (#29264 ) We historically removed reading from the transaction log to get consistent results from _GET calls. There was also the motivation that the read-modify-update principle we apply should not be hidden from the user. We still agree on the fact that we should not hide these aspects but the impact on updates is quite significant especially if the same documents is updated before it's written to disk and made serachable. This change adds back the ability to read from the transaction log but only for update calls. Calls to the _GET API will always do a refresh if necessary to return consistent results ie. if stored fields or DocValues Fields are requested. Closes #26802	2018-03-28 18:03:34 +02:00
Yannick Welsch	cacf759213	Remove RELOCATED index shard state (#29246 ) as this information is already covered by ReplicationTracker.primaryMode.	2018-03-28 12:25:46 +02:00
Nhat Nguyen	87957603c0	Prune only gc deletes below local checkpoint (#28790 ) Once a document is deleted and Lucene is refreshed, we will not be able to look up the `version/seq#` associated with that delete in Lucene. As conflicting operations can still be indexed, we need another mechanism to remember these deletes. Therefore deletes should still be stored in the Version Map, even after Lucene is refreshed. Obviously, we can't remember all deletes forever so a trimming mechanism is needed. Currently, we remember deletes for at least 1 minute (the default GC deletes cycle) and clean them periodically. This is, at the moment, the best we can do on the primary for user facing APIs but this arbitrary time limit is problematic for replicas. Furthermore, we can't rely on the primary and replicas doing the trimming in a synchronized manner, and failing to do so results in the replica and primary making different decisions. The following scenario can cause inconsistency between primary and replica. 1. Primary index doc (index, id=1, v2) 2. Network packet issue causes index operation to back off and wait 3. Primary deletes doc (delete, id=1, v3) 4. Replica processes delete (delete, id=1, v3) 5. 1+ minute passes (GC deletes runs replica) 6. Indexing op is finally sent to the replica which no processes it because it forgot about the delete. We can reply on sequence-numbers to prevent this issue. If we prune only deletes whose seqno at most the local checkpoint, a replica will correctly remember what it needs. The correctness is explained as follows: Suppose o1 and o2 are two operations on the same document with seq#(o1) < seq#(o2), and o2 arrives before o1 on the replica. o2 is processed normally since it arrives first; when o1 arrives it should be discarded: 1. If seq#(o1) <= LCP, then it will be not be added to Lucene, as it was already previously added. 2. If seq#(o1) > LCP, then it depends on the nature of o2: - If o2 is a delete then its seq# is recorded in the VersionMap, since seq#(o2) > seq#(o1) > LCP, so a lookup can find it and determine that o1 is stale. - If o2 is an indexing then its seq# is either in Lucene (if refreshed) or the VersionMap (if not refreshed yet), so a real-time lookup can find it and determine that o1 is stale. In this PR, we prefer to deploy a single trimming strategy, which satisfies both requirements, on primary and replicas because: - It's simpler - no need to distinguish if an engine is running at primary mode or replica mode or being promoted. - If a replica subsequently is promoted, user experience is fully maintained as that replica remembers deletes for the last GC cycle. However, the version map may consume less memory if we deploy two different trimming strategies for primary and replicas.	2018-03-26 13:42:08 -04:00
Boaz Leskes	f5d4550e93	Fold EngineDiskUtils into Store, for better lock semantics (#29156 ) #28245 has introduced the utility class`EngineDiskUtils` with a set of methods to prepare/change translog and lucene commit points. That util class bundled everything that's needed to create and empty shard, bootstrap a shard from a lucene index that was just restored etc. In order to safely do these manipulations, the util methods acquired the IndexWriter's lock. That would sometime fail due to concurrent shard store fetching or other short activities that require the files not to be changed while they read from them. Since there is no way to wait on the index writer lock, the `Store` class has other locks to make sure that once we try to acquire the IW lock, it will succeed. To side step this waiting problem, this PR folds `EngineDiskUtils` into `Store`. Sadly this comes with a price - the store class doesn't and shouldn't know about the translog. As such the logic is slightly less tight and callers have to do the translog manipulations on their own.	2018-03-26 14:08:03 +02:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Lee Hinman	b4af451ec5	Remove BytesArray and BytesReference usage from XContentFactory (#29151 ) * Remove BytesArray and BytesReference usage from XContentFactory This removes the usage of `BytesArray` and `BytesReference` from `XContentFactory`. Instead, a regular `byte[]` should be passed. To assist with this a helper has been added to `XContentHelper` that will preserve the offset and length from the underlying BytesReference. This is part of ongoing work to separate the XContent parts from ES so they can be factored into their own jar. Relates to #28504	2018-03-20 11:52:26 -06:00
Nik Everett	a813492fe3	Tests: Make $_path support dots in paths (#28917 ) `$_path` is used by documentation tests to ignore a value from a response, for example: ``` [source,js] ---- { "count": 1, "datafeeds": [ { "datafeed_id": "datafeed-total-requests", "state": "started", "node": { ... "attributes": { "ml.machine_memory": "17179869184", "ml.max_open_jobs": "20", "ml.enabled": "true" } }, "assignment_explanation": "" } ] } ---- // TESTRESPONSE[s/"17179869184"/$body.$_path/] ``` That example shows `17179869184` in the compiled docs but when it runs the tests generated by that doc it ignores `17179869184` and asserts instead that there is a value in that field. This is required because we can't predict things like "how many milliseconds will this take?" and "how much memory will this take?". Before this change it was impossible to use `$_path` when any component of the path contained a `.`. This fixes the `$_path` evaluator to properly escape `.`. Closes #28770	2018-03-19 14:17:09 -04:00
Christoph Büscher	312ccc05d5	[Tests] Fix GetResultTests and DocumentFieldTests failures (#29083 ) Changes made in #28972 seems to have changed some assumptions about how SMILE and CBOR write byte[] values and how this is tested. This changes the generation of the randomized DocumentField values back to BytesArray while expecting the JSON and YAML deserialisation to produce Base64 encoded strings and SMILE and CBOR to parse back BytesArray instances. Closes #29080	2018-03-15 16:42:26 +01:00
Boaz Leskes	bf65cb4914	Untangle Engine Constructor logic (#28245 ) Currently we have a fairly complicated logic in the engine constructor logic to deal with all the various ways we want to mutate the lucene index and translog we're opening. We can: 1) Create an empty index 2) Use the lucene but create a new translog 3) Use both 4) Force a new history uuid in all cases. This leads complicated code flows which makes it harder and harder to make sure we cover all the corner cases. This PR tries to take another approach. Constructing an InternalEngine always opens things as they are and all needed modifications are done by static methods directly on the directory, one at a time.	2018-03-14 20:59:47 +01:00
Lee Hinman	8e8fdc4f0e	Decouple XContentBuilder from BytesReference (#28972 ) * Decouple XContentBuilder from BytesReference This commit removes all mentions of `BytesReference` from `XContentBuilder`. This is needed so that we can completely decouple the XContent code and move it into its own dependency. While this change appears large, it is due to two main changes, moving `.bytes()` and `.string()` out of XContentBuilder itself into static methods `BytesReference.bytes` and `Strings.toString` respectively. The rest of the change is code reacting to these changes (the majority of it in tests). Relates to #28504	2018-03-14 13:47:57 -06:00
Jason Tedor	5904d936fa	Copy Lucene IOUtils (#29012 ) As we have factored Elasticsearch into smaller libraries, we have ended up in a situation that some of the dependencies of Elasticsearch are not available to code that depends on these smaller libraries but not server Elasticsearch. This is a good thing, this was one of the goals of separating Elasticsearch into smaller libraries, to shed some of the dependencies from other components of the system. However, this now means that simple utility methods from Lucene that we rely on are no longer available everywhere. This commit copies IOUtils (with some small formatting changes for our codebase) into the fold so that other components of the system can rely on these methods where they no longer depend on Lucene.	2018-03-13 12:49:33 -04:00
Jason Tedor	8b6fbe2c11	Add test for dying with dignity (#28987 ) I have long wanted an actual test that dying with dignity works. It is tricky because if dying with dignity works, it means the test JVM dies which is usually an abnormal condition. And anyway, how does one force a fatal error to be thrown. I was motivated to investigate this again by the fact that I missed a backport to one branch leading to an issue where Elasticsearch would not successfully die with dignity. And now we have a solution: we install a plugin that throws an out of memory error when it receives a request. We hack the standalone test infrastructure to prevent this from failing the test. To do this, we bypass the security manager and remove the PID file for the node; this tricks the test infrastructure into thinking that it does not need to stop the node. We also bypass seccomp so that we can fork jps to make sure that Elasticsearch really died. And to be extra paranoid, we parse the logs of the dead Elasticsearch process to make sure it died with dignity. Never forget.	2018-03-12 23:20:07 -04:00
Luca Cavanna	184a8718d8	REST high-level client: add flush API (#28852 ) Relates to #27205	2018-03-01 10:56:03 +01:00
Luca Cavanna	cd3d9c9f80	[TEST] share code between streamable/writeable/xcontent base test classes (#28785 ) Today we have two test base classes that have a lot in common when it comes to testing wire and xcontent serialization: `AbstractSerializingTestCase` and `AbstractXContentStreamableTestCase`. There are subtle differences though between the two, in the way they work, what can be overridden and features that they support (e.g. insertion of random fields). This commit introduces a new base class called `AbstractWireTestCase` which holds all of the serialization test code in common between `Streamable` and `Writeable`. It has two minimal subclasses called `AbstractWireSerializingTestCase` and `AbstractStreamableTestCase` which are specialized for `Writeable` and `Streamable`. This commit also introduces a new test class called `AbstractXContentTestCase` for all of the xContent testing, which holds a testFromXContent method for parsing and rendering to xContent. This one can be delegated to from the existing `AbstractStreamableXContentTestCase` and `AbstractSerializingTestCase` so that we avoid code duplicate as much as possible and all these base classes offer the same functionalities in the same way. Having this last base class decoupled from the serialization testing may also help with the REST high-level client testing, as there are some classes where it's hard to implement equals/hashcode and this makes it possible to override `assertEqualInstances` for custom equality comparisons (also this base class doesn't require implementing equals/hashcode as it doesn't test such methods.	2018-02-23 10:48:48 +01:00
Tim Brooks	5a8ec9b762	Selectors operate on channel contexts (#28468 ) This commit is related to #27260. Currently there is a weird relationship between channel contexts and nio channels. The selectors use the context for read and writing. But the selector operates directly on the nio channel for registering, closing, and connecting. This commit works on improving this relationship. The selector operates directly on the context which wraps the low level java.nio.channels. The NioChannel class is simply an API that is used to interact with the channel (sending messages from outside the selector event loop, scheduling a close, adding listeners, etc). The context is only used internally by the channel to implement these apis and by the selector to perform these operations.	2018-02-22 09:44:52 -07:00
Luca Cavanna	8b4a298874	Migrate some *ResponseTests to AbstractStreamableXContentTestCase (#28749 ) This allows us to save a bit of code, but also adds more coverage as it tests serialization which was missing in some of the existing tests. Also it requires implementing equals/hashcode and we get the corresponding tests for them for free from the base test class.	2018-02-21 20:04:12 +01:00
Lee Hinman	d7eae4b90f	Pass InputStream when creating XContent parser (#28754 ) * Pass InputStream when creating XContent parser Rather than passing the raw `BytesReference` in when creating the xcontent parser, this passes the StreamInput (which is an InputStream), this allows us to decouple XContent from BytesReference. This also removes the use of `commons.Booleans` so it doesn't require more external commons classes. Related to #28504 * Undo boolean removal * Enhance deprecation javadoc	2018-02-21 11:03:25 -07:00
Yu	7d8fb69d50	version set in ingest pipeline (#27573 ) Add support version and version_type in ingest pipelines Add support for setting document version and version type in set processor of an ingest pipeline.	2018-02-21 09:34:51 +01:00

1 2 3 4 5 ...

1607 Commits