OpenSearch

Commit Graph

Author	SHA1	Message	Date
Mark Vieira	b14b803676	Mute failing tests on Windows (#50825 ) (#52010 )	2020-02-06 10:47:14 -08:00
Yannick Welsch	39b4710081	Add Docker-based tests for proxy connection mode (#51658 ) Adds more tests for the new "proxy" remote cluster connection mode, using a Docker-based setup, as well as testing SNI-based routing using HAProxy. - Checks that the new proxy mode can work in situations where the publish host of the nodes in the remote cluster are not routable. - Checks that the new proxy mode can work with clusters where nodes are hidden behind HAProxy. - Checks that the new proxy mode can work with clusters where nodes are hidden behind HAProxy, using SNI to identify the nodes/cluster behind HAProxy. Relates #49067	2020-02-03 09:45:38 +01:00
Ryan Ernst	21224caeaf	Remove comparison to true for booleans (#51723 ) While we use `== false` as a more visible form of boolean negation (instead of `!`), the true case is implied and the true value does not need to be explicitly checked. This commit converts cases that have slipped into the code checking for `== true`.	2020-01-31 16:35:43 -08:00
William Brafford	1a9e7b2d69	Use an options loop in Elasticsearch startup script (#51547 ) (#51621 ) * Use loop to parse options rather than grep * Add test for --help flag with encrypted keystore	2020-01-31 10:24:04 -05:00
Mark Vieira	8d2370bf00	Always use bundled JDK for external cluster nodes when BWC testing (#51505 ) (#51701 )	2020-01-30 14:35:43 -08:00
Nhat Nguyen	69ef9b05cd	Increase shard inactive time to 1h in upgrade tests (#51651 ) testRecovery relies on the fact that shards are not flushed on inactive. Our CI recently was too slow. It took more than 20 minutes to complete the full cluster restart suite. This slowness caused some shards of testRecovery to be flushed on inactive. This commit increases the inactive time to 1h to reduce this noise. Closes #51640	2020-01-30 14:48:56 -05:00
William Brafford	9efa5be60e	Password-protected Keystore Feature Branch PR (#51123 ) (#51510 ) * Reload secure settings with password (#43197) If a password is not set, we assume an empty string to be compatible with previous behavior. Only allow the reload to be broadcast to other nodes if TLS is enabled for the transport layer. * Add passphrase support to elasticsearch-keystore (#38498) This change adds support for keystore passphrases to all subcommands of the elasticsearch-keystore cli tool and adds a subcommand for changing the passphrase of an existing keystore. The work to read the passphrase in Elasticsearch when loading will be addressed in a different PR. Subcommands of elasticsearch-keystore can handle (open and create) passphrase protected keystores. When reading a keystore, a user is prompted for a passphrase only if the keystore is passphrase protected. When creating a keystore, a user is allowed (default behavior) to create one with an empty passphrase. Passphrase can be set to be empty when changing/setting it for an existing keystore. Relates to: #32691 Supersedes: #37472 * Restore behavior for force parameter (#44847) Turns out that the behavior of `-f` for the add and add-file sub commands where it would also forcibly create the keystore if it didn't exist, was by design - although undocumented. This change restores that behavior auto-creating a keystore that is not password protected if the force flag is used. The force OptionSpec is moved to the BaseKeyStoreCommand as we will presumably want to maintain the same behavior in any other command that takes a force option. * Handle pwd protected keystores in all CLI tools (#45289) This change ensures that `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` can handle a password protected elasticsearch.keystore. For setup passwords the user would be prompted to add the elasticsearch keystore password upon running the tool.
There is no option to pass the password as a parameter as we assume the user is present in order to enter the desired passwords for the built-in users. For saml-metadata, we prompt for the keystore password at all times even though we'd only need to read something from the keystore when there is a signing or encryption configuration. * Modify docs for setup passwords and saml metadata cli (#45797) Adds a sentence in the documentation of `elasticsearch-setup-passwords` and `elasticsearch-saml-metadata` to describe that users would be prompted for the keystore's password when running these CLI tools, when the keystore is password protected. Co-Authored-By: Lisa Cawley <lcawley@elastic.co> * Elasticsearch keystore passphrase for startup scripts (#44775) This commit allows a user to provide a keystore password on Elasticsearch startup, but only prompts when the keystore exists and is encrypted. The entrypoint in Java code is standard input. When the Bootstrap class is checking for secure keystore settings, it checks whether or not the keystore is encrypted. If so, we read one line from standard input and use this as the password. For simplicity's sake, we allow a maximum passphrase length of 128 characters. (This is an arbitrary limit and could be increased or eliminated. It is also enforced in the keystore tools, so that a user can't create a password that's too long to enter at startup.) In order to provide a password on standard input, we have to account for four different ways of starting Elasticsearch: the bash startup script, the Windows batch startup script, systemd startup, and docker startup. We use wrapper scripts to reduce systemd and docker to the bash case: in both cases, a wrapper script can read a passphrase from the filesystem and pass it to the bash script. In order to simplify testing the need for a passphrase, I have added a has-passwd command to the keystore tool. 
This command can run silently, and exit with status 0 when the keystore has a password. It exits with status 1 if the keystore doesn't exist or exists and is unencrypted. A good deal of the code-change in this commit has to do with refactoring packaging tests to cleanly use the same tests for both the "archive" and the "package" cases. This required not only moving tests around, but also adding some convenience methods for an abstraction layer over distribution-specific commands. * Adjust docs for password protected keystore (#45054) This commit adds relevant parts in the elasticsearch-keystore sub-commands reference docs and in the reload secure settings API doc. * Fix failing Keystore Passphrase test for feature branch (#50154) One problem with the passphrase-from-file tests, as written, is that they would leave a SystemD environment variable set when they failed, and this setting would cause elasticsearch startup to fail for other tests as well. By using a try-finally, I hope that these tests will fail more gracefully. It appears that our Fedora and Ubuntu environments may be configured to store journald information under /var rather than under /run, so that it will persist between boots. Our destructive tests that read from the journal need to account for this in order to avoid trying to limit the output we check in tests. * Run keystore management tests on docker distros (#50610) * Add Docker handling to PackagingTestCase Keystore tests need to be able to run in the Docker case. We can do this by using a DockerShell instead of a plain Shell when Docker is running. * Improve ES startup check for docker Previously we were checking truncated output for the packaged JDK as an indication that Elasticsearch had started. With new preliminary password checks, we might get a false positive from ES keystore commands, so we have to check specifically that the Elasticsearch class from the Bootstrap package is what's running. 
* Test password-protected keystore with Docker (#50803) This commit adds two tests for the case where we mount a password-protected keystore into a Docker container and provide a password via a Docker environment variable. We also fix a logging bug where we were logging the identifier for an array of strings rather than the contents of that array. * Add documentation for keystore startup prompting (#50821) When a keystore is password-protected, Elasticsearch will prompt at startup. This commit adds documentation for this prompt for the archive, systemd, and Docker cases. Co-authored-by: Lisa Cawley <lcawley@elastic.co> * Warn when unable to upgrade keystore on debian (#51011) For Red Hat RPM upgrades, we warn if we can't upgrade the keystore. This commit brings the same logic to the code for Debian packages. See the posttrans file that gets executed for RPMs. * Restore handling of string input Adds tests that were mistakenly removed. One of these tests proved we were not handling the stdin (-x) option correctly when no input was added. This commit restores the original approach of reading stdin one char at a time until there is no more (-1, \r, \n) instead of using readline() that might return null * Apply spotless reformatting * Use '--since' flag to get recent journal messages When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail.
It seems to me that we might be able to use journald's "--since" flag to retrieve only log messages from the last run, and that this might be less likely to fail due to race conditions in file deletion. Unfortunately, it looks as if the "--since" flag has a granularity of one-second. I've added a two-second sleep to make sure that there's a sufficient gap between the test that will read from journald and the test before it. * Use new journald wrapper pattern * Update version added in secure settings request Co-authored-by: Lisa Cawley <lcawley@elastic.co> Co-authored-by: Ioannis Kakavas <ikakavas@protonmail.com>	2020-01-28 05:32:32 -05:00
Ryan Ernst	6ee1baf2ed	Migrate cron eval bats test to java (#50940 ) (#51007 ) This commit migrates the simple test of the cron eval tool from bats to java packaging tests. relates #46005	2020-01-27 10:49:01 -08:00
William Brafford	2425a1a890	Use '--cursor-after' flag to get recent journal messages (#51366 ) (#51445 ) When we get Elasticsearch logs from journald, we want to fetch only log messages from the last run. There are two reasons for this. First, if there are many logs, we might get a string that's too large for our utility methods. Second, when we're looking for a specific message or error, we almost certainly want to look only at messages from the last execution. Previously, we've been trying to do this by clearing out the physical files under the journald process. But there seems to be some contention over these directories: if journald writes a log file in between when our deletion command deletes the file and when it deletes the log directory, the deletion will fail. Instead, we can use the cursor capability of journald to retrieve journal entries that occur only after a certain cursor. This avoids any effort to interfere with the underlying file operations of journald.	2020-01-24 16:23:18 -05:00
Rory Hunter	e6f778474e	Follow symlinks in Docker entrypoint (#51101 ) Backport of #50927. Closes #49653. When using _FILE environment variables to supply values to Elasticsearch, follow symlinks when checking that file permissions are secure.	2020-01-18 10:39:49 +00:00
Armin Braun	c794803e3d	Fix X-Pack Indices Breaking Repository BwC Tests (#51120 ) (#51125 ) If some internal `.watcher` or so index gets created during these tests then the shard counts on snapshot restores and creates won't match up with expectations. Fixed by only creating the snapshot for the test index Closes #50819	2020-01-16 21:52:57 +01:00
Armin Braun	dd09dc7af6	Fix MultiVersionRepository BwC Tests (#51006 ) * Fix MultiVersionRepository BwC Tests The HLRC doesn't like what it's getting back from some older 6.x versions for the restore status so I moved that request to the low level client. Closes #50819	2020-01-15 15:19:08 +01:00
Ryan Ernst	4bdab0e985	Fix windows chown to work with single file (#51004 ) The chown utility for packaging tests works on windows when the given path is a directory, but would fail if the path was a single file. This commit fixes it to handle both cases. relates #50825	2020-01-14 14:40:04 -08:00
Nhat Nguyen	fb32a55dd5	Deprecate synced flush (#50835 ) A normal flush has the same effect as a synced flush on Elasticsearch 7.6 or later. It's deprecated in 7.6 and will be removed in 8.0. Relates #50776	2020-01-13 19:54:38 -05:00
Ryan Ernst	86fb06a108	Migrate certgen packaging test from bats (#50880 ) This commit moves the packaging tests for elasticsearch-certgen to java from bats. Although certgen is deprecated, the tests are moved rather than just deleted, and the tests themselves should be easily adaptable to certutil. One note is that the test is simplified to use a single node, rather than the two node test from bats, which was problematic given how the newer distro tests only operate with a single distribution. relates #46005	2020-01-13 13:56:30 -08:00
Ryan Ernst	2dc23bd968	Add protection in windows for slow file lock releasing (#50884 ) This commit adds retries for windows cleanup after tests, which may fail due to file locks not being immediately released after a windows process exits. closes #50825	2020-01-13 10:39:01 -08:00
Nhat Nguyen	05f97d5e1b	Revert "Deprecate synced flush (#50835 )" This reverts commit `1a32d7142a`.	2020-01-13 11:41:03 -05:00
Nhat Nguyen	1a32d7142a	Deprecate synced flush (#50835 ) A normal flush has the same effect as a synced flush on Elasticsearch 7.6 or later. It's deprecated in 7.6 and will be removed in 8.0. Relates #50776	2020-01-13 10:58:29 -05:00
Armin Braun	609b015e3c	Prevent Old Version Clusters From Corrupting Snapshot Repositories (#50853 ) (#50913 ) Follow up to #50692 that starts writing a `min_version` field to the `RepositoryData` so that pre-7.6 ES versions can not read it (and potentially corrupt it if they attempt to modify the repo contents) after the repository moved to the new metadata format.	2020-01-13 15:02:53 +01:00
Mark Vieira	2ab3e49dec	Skip test suite entirely for non-applicable distribution types (#50824 )	2020-01-09 15:44:08 -08:00
Armin Braun	f70e8f6ab5	Fix Snapshot Repository Corruption in Downgrade Scenarios (#50692 ) (#50797 ) * Fix Snapshot Repository Corruption in Downgrade Scenarios (#50692) This PR introduces test infrastructure for downgrading a cluster while interacting with a given repository. It fixes the fact that repository metadata in the new format could be written while there's still older snapshots in the repository that require the old-format metadata to be restorable.	2020-01-09 21:21:13 +01:00
Nhat Nguyen	b71490b06b	Deprecate indices without soft-deletes (#50502 ) (#50634 ) Soft-deletes will be enabled for all indices in 8.0. Hence, we should deprecate new indices without soft-deletes in 7.x. Backport of #50502	2020-01-06 08:44:30 -05:00
Nhat Nguyen	33204c2055	Use peer recovery retention leases for indices without soft-deletes (#50351 ) Today, the replica allocator uses peer recovery retention leases to select the best-matched copies when allocating replicas of indices with soft-deletes. We can employ this mechanism for indices without soft-deletes because the retaining sequence number of a PRRL is the persisted global checkpoint (plus one) of that copy. If the primary and replica have the same retaining sequence number, then we should be able to perform a noop recovery. The reason is that we must be retaining translog up to the local checkpoint of the safe commit, which is at most the global checkpoint of either copy. The only limitation is that we might not cancel ongoing file-based recoveries with PRRLs for noop recoveries. We can't make the translog retention policy comply with PRRLs. We also have this problem with soft-deletes if a PRRL is about to expire. Relates #45136 Relates #46959	2019-12-23 22:04:07 -05:00
Nhat Nguyen	1dc98ad617	Ensure global checkpoint was advanced and synced We need to make sure that the global checkpoints and peer recovery retention leases were advanced to the max_seq_no and synced; otherwise, we can risk expiring some peer recovery retention leases because of the file-based recovery threshold. Relates #49448	2019-12-23 21:10:30 -05:00
Yannick Welsch	c37c53a7f7	Fix testAutoExpandIndicesDuringRollingUpgrade (#50427 ) Fixes the muted test "testAutoExpandIndicesDuringRollingUpgrade". We can't wait in the test for the index to be green, as we have put a filter exclusion into place that prevents all shards from being allocated after a node rejoins. Instead we check whether the correct auto-expansion has taken place. Closes #50426	2019-12-20 17:15:49 +01:00
Yannick Welsch	1c7bfebe01	Mute testAutoExpandIndicesDuringRollingUpgrade Relates #50426	2019-12-20 12:15:43 +01:00
Yannick Welsch	a3837786ec	Fix testAutoExpandIndicesDuringRollingUpgrade (#50361 ) Follow-up to #50361 that fixes the test that does not work against older ES versions	2019-12-20 11:50:03 +01:00
Yannick Welsch	4f805deb0c	Only auto-expand replicas with allocation filtering when all nodes upgraded (#50361 ) Follow-up to #48974 that ensures that replicas are only auto-expanded according to allocation filtering rules once all nodes are upgraded to a version that supports this. Helps with orchestrating cluster upgrades.	2019-12-20 11:50:00 +01:00
Yannick Welsch	5f37f1f401	Revert "Only auto-expand replicas with allocation filtering when all nodes upgraded (#50361 )" This reverts commit `df4fe73b84`.	2019-12-20 11:07:30 +01:00
Yannick Welsch	df4fe73b84	Only auto-expand replicas with allocation filtering when all nodes upgraded (#50361 ) Follow-up to #48974 that ensures that replicas are only auto-expanded according to allocation filtering rules once all nodes are upgraded to a version that supports this. Helps with orchestrating cluster upgrades.	2019-12-20 10:22:44 +01:00
Tim Brooks	cb73fb0f9b	Backport remote proxy mode stats and naming (#50402 ) * Update remote cluster stats to support simple mode (#49961) Remote cluster stats API currently only returns useful information if the strategy in use is the SNIFF mode. This PR modifies the API to provide relevant information if the user is in the SIMPLE mode. This information is the configured addresses, max socket connections, and open socket connections. * Send hostname in SNI header in simple remote mode (#50247) Currently an intermediate proxy must route connections to the appropriate remote cluster when using simple mode. This commit offers an additional mechanism for the proxy to route the connections by including the hostname in the TLS SNI header. * Rename the remote connection mode simple to proxy (#50291) This commit renames the simple connection mode to the proxy connection mode for remote cluster connections. In order to do this, the mode specific settings which we namespaced by their mode (ex: sniff.seed and proxy.addresses) have been reverted. * Modify proxy mode to support a single address (#50391) Currently, the remote proxy connection mode uses a list setting for the proxy address. This commit modifies this so that the setting is proxy_address and only supports a single remote proxy address.	2019-12-19 18:02:48 -07:00
Nik Everett	878852352d	Handle renaming the README (#50404 ) (#50406 ) We renamed README.textile to README.asciidoc but a bunch of tests and the package build itself still pointed at the old name. This switches them to the new name.	2019-12-19 18:55:38 -05:00
Rory Hunter	2bd3a05892	Refactor environment variable processing for Docker (#50221 ) Backport of #49612. The current Docker entrypoint script picks up environment variables and translates them into -E command line arguments. However, since any tool executed via `docker exec` doesn't run the entrypoint, it results in a poorer user experience. Therefore, refactor the env var handling so that the -E options are generated in `elasticsearch-env`. These have to be appended to any existing command arguments, since some CLI tools have subcommands and -E arguments must come after the subcommand. Also extract the support for `_FILE` env vars into a separate script, so that it can be called from more than one place (the behaviour is idempotent). Finally, add noop -E handling to CronEvalTool for parity, and support `-E` in MultiCommand before subcommands.	2019-12-16 15:39:28 +00:00
Nhat Nguyen	c732d9923d	Fix doc type in FullClusterRestartIT "_doc" is not accepted in 6.x version.	2019-12-15 21:54:57 -05:00
Nhat Nguyen	4d22e3cd15	Skip bwc versions without retention leases	2019-12-15 13:18:13 -05:00
Nhat Nguyen	df46848fb0	Migrate peer recovery from translog to retention lease (#49448 ) Since 7.4, we switch from translog to Lucene as the source of history for peer recoveries. However, we reduce the likelihood of operation-based recoveries when performing a full cluster restart from pre-7.4 because existing copies do not have PRRL. To remedy this issue, we fall back to using translog in peer recoveries if the recovering replica does not have a peer recovery retention lease, and the replication group hasn't fully migrated to PRRL. Relates #45136	2019-12-15 10:24:39 -05:00
Ryan Ernst	54467b5d8b	Simplify running tools in packaging tests (#49665 ) (#50110 ) Running tools requires a shell. This should be the shell setup by the base packaging tests, but currently tests must pass in their own shell. This commit begins to make running tools easier by eliminating the shell argument, instead keeping the shell as part of the Installation (which can eventually be passed through from the test itself on installation). The variable names for each tool are also simplified.	2019-12-11 16:59:21 -08:00
William Brafford	ba9526ec4f	Refactor utility code in qa:os: tests (#49945 ) (#50000 ) This refactor bridges some gaps between a long-running feature branch (#49268) and the master branch. First of all, this PR gives our PackagingTestCase class some methods to start and stop Elasticsearch that will switch on packaging type and delegate to the appropriate utility class for deb/RPM packages, archive installations, and Docker. These methods should be very useful as we continue to group tests by function rather than by package or platform type. Second, the password-protected keystore tests have a particular need to read the output of Elasticsearch startup commands. In order to make this easier to do, some commands now return Shell.Result objects so that tests can check over output to the shell. To that end, there's also an assertElasticsearchFailure method that will handle checking for startup failures for the various distribution types. There is an update to the Powershell startup script for archives that asynchronously redirects the output of the Powershell process to files that we can read for errors. Finally, we use the ES_STARTUP_SLEEP_TIME environment variable to make sure that our startup commands wait long enough before exiting for errors to make it to the standard output and error streams.	2019-12-10 15:00:05 -05:00
Ryan Ernst	401c75d8b5	Dump wildfly log on start failure (#49892 ) When testing wildfly with Elasticsearch, we currently dump the wildfly log if the test fails. However, when starting wildfly we may fail to find the port number wildfly started on, and fail with no output. This change dumps the wildfly log when failing to find the http or management ports. relates #49374	2019-12-06 15:55:01 -08:00
Rory Hunter	1bc3e69fa3	Migrate some of the Docker tests from old repository (#49792 ) Backport of #49079. Reimplement a number of the tests from elastic/elasticsearch-docker. There is also one Docker image fix here, which is that two of the provided config files had different file permissions to the rest. I've fixed this with another RUN chmod while building the image, and adjusted the corresponding packaging test.	2019-12-04 08:57:58 +00:00
Christoph Büscher	cdbcab3841	Remove Awaitsfix that seems outdated (#49764 ) This test seems to be fixed on 7.x with https://github.com/elastic/elasticsearch/pull/43871 so the muting annotation can most likely be removed.	2019-12-03 10:24:55 +01:00
Jim Ferenczi	d6445fae4b	Add a cluster setting to disallow loading fielddata on _id field (#49166 ) This change adds a dynamic cluster setting named `indices.id_field_data.enabled`. When set to `false` any attempt to load the fielddata for the `_id` field will fail with an exception. The default value in this change is set to `false` in order to prevent fielddata usage on this field for future versions but it will be set to `true` when backporting to 7x. When the setting is set to true (manually or by default in 7x) the loading will also issue a deprecation warning since we want to disallow fielddata entirely when https://github.com/elastic/elasticsearch/issues/26472 is implemented. Closes #43599	2019-11-28 09:35:28 +01:00
Ryan Ernst	b236076f88	Fix java 8 compile for Files.readString usage	2019-11-27 16:16:56 -08:00
Ryan Ernst	7802b60a5a	Migrate setup passwords packaging test from bats (#49337 ) This commit moves the packaging tests for elasticsearch-setup-passwords to java from bats. The change also enables future tests to enable security in Elasticsearch and automatically have waitForElasticsearch work correctly, at least to the same extent it worked in bats, by waiting on the ES port instead of health check. relates #46005	2019-11-27 16:03:08 -08:00
Dimitrios Liappis	4b6915ea41	Clarify gid used by docker image process and bind-mount method (#49632 ) Fix reference about the uid:gid that Elasticsearch runs as inside the Docker container and add a packaging test to ensure that bind mounting a data dir with a random uid and gid:0 works as expected. Backport of #49529 Closes #47929	2019-11-27 13:42:54 +02:00
Tim Brooks	e965a6f2df	Fix remote settings upgrade test (#49609 ) This commit fixes #49587. Due to a settings change, the broken test was asserting on the incorrect setting. This commit fixes that issue and adds additional assertions to ensure that all settings are working properly.	2019-11-26 16:37:27 -07:00
Hendrik Muhs	41daf284f5	mute FullClusterRestartSettingsUpgradeIT	2019-11-26 13:28:35 +01:00
Tim Brooks	416178c7c8	Enable simple remote connection strategy (#49561 ) This commit back ports three commits related to enabling the simple connection strategy. Allow simple connection strategy to be configured (#49066) Currently the simple connection strategy only exists in the code. It cannot be configured. This commit moves in the direction of allowing it to be configured. It introduces settings for the addresses and socket count. Additionally it introduces new settings for the sniff strategy so that the more generic number of connections and seed node settings can be deprecated. The simple settings are not yet registered as the registration is dependent on follow-up work to validate the settings. Ensure at least 1 seed configured in remote test (#49389) This fixes #49384. Currently when we select a random subset of seed nodes from a list, it is possible for 0 seeds to be selected. This test depends on at least 1 seed being selected. Add the simple strategy to cluster settings (#49414) This is related to #49067. This commit adds the simple connection strategy settings and strategy mode setting to the cluster settings registry. With these changes, the simple connection mode can be used. Additionally, it adds validation to ensure that settings cannot be misconfigured.	2019-11-25 16:53:07 -07:00
Rory Hunter	4fae2bb3b1	Don't close stderr under `--quiet` (#49431 ) Backport of #47208. Closes #46900. When running ES with `--quiet`, if ES then exits abnormally, a user has to go hunting in the logs for the error. Instead, never close System.err, and print more information to it if ES encounters a fatal error e.g. config validation, or some fatal runtime exception. This is useful when running under e.g. systemd, since the error will go into the journal. Note that stderr is still closed in daemon (`-d`) mode.	2019-11-22 14:58:17 +00:00
Jay Modi	eed4cd25eb	ThreadPool and ThreadContext are not closeable (#43249 ) (#49273 ) This commit changes the ThreadContext to just use a regular ThreadLocal over the lucene CloseableThreadLocal. The CloseableThreadLocal solves issues with ThreadLocals that are no longer needed during runtime but in the case of the ThreadContext, we need it for the runtime of the node and it is typically not closed until the node closes, so we miss out on the benefits that this class provides. Additionally by removing the close logic, we simplify code in other places that deal with exceptions and tracking to see if it happens when the node is closing. Closes #42577	2019-11-19 13:15:16 -07:00
Rory Hunter	e84e21174b	Support `_FILE` suffixed env vars in Docker entrypoint (#49182 ) Backport of #47573. Closes #43603. Allow environment variables to be passed to ES in a Docker container via a file, by setting an environment variable with the `_FILE` suffix that points to the file with the intended value of the env var.	2019-11-18 08:22:35 +00:00
markharwood	c3745b03ee	Search optimisation - add canMatch early aborts for queries on "_index" field (#49158 ) Make queries on the “_index” field fast-fail if the target shard is an index that doesn’t match the query expression. Part of the “canMatch” phase optimisations. Closes #48473	2019-11-15 16:50:32 +00:00
Dimitris Athanasiou	a000d868a0	[7.x] Also expect RELOCATING in RecoveryIT.testRecoveryClosedIndex (#49090 ) (#49154 ) In addition to the fix in #48506, it seems we could also get the `RELOCATING` state. This fixes the failure in https://gradle-enterprise.elastic.co/s/svjmmvqk32cii/tests/ua7icdau7nz6y-2cwvhj3qg5qou?openStackTraces=WzBd	2019-11-15 15:56:53 +02:00
Rory Hunter	c46a0e8708	Apply 2-space indent to all gradle scripts (#49071 ) Backport of #48849. Update `.editorconfig` to make the Java settings the default for all files, and then apply a 2-space indent to all `*.gradle` files. Then reformat all the files.	2019-11-14 11:01:23 +00:00
Alpar Torok	2906702901	Always increase timeout for packaging tests (#48945 ) The previous approach did not work because the system property is passed to Gradle but not to the tests JVM. We shouldn't really pass this to the tests as we wouldn't want to have differences. This timeout being different might not be bad, but having a way to differentiate could lead to others and it's best avoided.	2019-11-12 10:08:53 +02:00
Alpar Torok	5c6f8bab8d	Create a thread dump if ES fails to start in packaging tests (#48932 ) Relates to #15385. Hoping to get some insight on CI failures.	2019-11-12 10:08:52 +02:00
Rory Hunter	24f7d4e83b	Add Docker packaging tests on 7.x (#48857 ) Backport of #46599 and #47640. Add packaging tests for Docker. * Introduce packaging tests for Docker (#46599) Closes #37617. Add packaging tests for our Docker images, similar to what we have for RPMs or Debian packages. This works by running a container and probing it e.g. via `docker exec`. Test can also be run in Vagrant, by exporting the Docker images to disk and loading them again in VMs. Docker is installed via `Vagrantfile` in a selection of boxes. * Only define Docker pkg tests if Docker is available (#47640) Closes #47639, and unmutes tests that were muted in b958467. The Docker packaging tests were being defined irrespective of whether Docker was actually available in the current environment. Instead, implement exclude lists so that in environments where Docker is not available, no Docker packaging tests are defined. For CI hosts, the build checks `.ci/dockerOnLinuxExclusions`. The Vagrant VMs can defined the extension property `shouldTestDocker` property to opt-in to packaging tests. As part of this, define a seperate utility class for checking Docker, and call that instead of defining checks in-line in BuildPlugin.groovy	2019-11-05 15:17:59 +00:00
Mark Vieira	6ab4645f4e	[7.x] Introduce type-safe and consistent pattern for handling build globals (#48818 ) This commit introduces a consistent and type-safe manner for handling global build parameters throughout our build logic. Primarily this replaces the existing usages of extra properties with static accessors. It also introduces an explicit API for initialization and mutation of any such parameters, as well as better error handling for uninitialized or eager access of parameter values. Closes #42042	2019-11-01 11:33:11 -07:00
Ioannis Kakavas	99aedc844d	Copy http headers to ThreadContext strictly (#45945 ) (#48675 ) The previous behavior while copying HTTP headers to the ThreadContext would allow multiple HTTP headers with the same name, handling only the first occurrence and disregarding the rest of the values. This can be confusing when dealing with multiple headers, as it is not obvious which value is read and which ones are silently dropped. According to RFC-7230, a client must not send multiple header fields with the same field name in an HTTP message, unless the entire field value for this header is defined as a comma-separated list or this specific header is a well-known exception. This commit changes the behavior to be more compliant with the aforementioned RFC by requiring the classes that implement ActionPlugin to declare whether a header can be multi-valued when registering this header to be copied over to the ThreadContext in ActionPlugin#getRestHeaders. If the header is allowed to be multi-valued, then all such headers are read from the HTTP request and their values are concatenated in a comma-separated string. If the header is not allowed to be multi-valued, and the HTTP request contains multiple such headers with different values, the request is rejected with a 400 status.	2019-10-31 23:05:12 +02:00
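The header-copying rule described in the commit above can be sketched as follows. This is a minimal Python illustration, not the Elasticsearch implementation (which is Java): `copy_headers`, `BadRequest`, and the `registered` mapping are hypothetical names, and header-name case-insensitivity is ignored for brevity.

```python
from collections import defaultdict


class BadRequest(Exception):
    """Hypothetical stand-in for rejecting a request with HTTP 400."""
    status = 400


def copy_headers(request_headers, registered):
    """Copy registered headers from a request into a context dict.

    request_headers: list of (name, value) pairs; names may repeat.
    registered: dict mapping header name -> True if it may be multi-valued.
    """
    seen = defaultdict(list)
    for name, value in request_headers:
        if name in registered:  # unregistered headers are not copied
            seen[name].append(value)

    context = {}
    for name, values in seen.items():
        if registered[name]:
            # Multi-valued: keep every occurrence, joined with commas.
            context[name] = ",".join(values)
        elif len(set(values)) > 1:
            # Single-valued header sent with different values: reject.
            raise BadRequest(f"multiple values for single-valued header [{name}]")
        else:
            context[name] = values[0]
    return context
```

Rejecting instead of silently keeping the first value makes the ambiguity visible to the client, which is the point the commit message argues for.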
Yannick Welsch	7892ea8aff	Fix testRecoveryClosedIndex (#48506 ) Fixes test failure: https://gradle-enterprise.elastic.co/s/lplb52nlth4re/tests/ftbzinkix5rme-2cwvhj3qg5qou	2019-10-31 16:24:40 +01:00
Mark Vieira	ef9275567a	Reduce packaging test log output and introduce ready request interval (#48324 )	2019-10-29 14:46:33 -07:00
Alpar Torok	51c1b48b24	Increase timeouts for packaging tests (#48528 ) This is in preparation to move to nested virtualization which is much slower than the bare metal setup we use right now, but parallelizes better resulting in a net win.	2019-10-28 06:48:11 +02:00
Alpar Torok	d235576452	Add a packagingTask for every os project (#48400 ) We no longer run the sample tests in CI, so it's safe to create a task for every project. This will make it easier to set them up in a matrix-like fashion.	2019-10-28 06:48:06 +02:00
Jim Ferenczi	dc5c31d67a	Add a deprecation warning regarding allocation awareness in search request (#48351 ) This is a follow up of https://github.com/elastic/elasticsearch/issues/43453 where we added a system property to disallow allocation awareness in search requests. Since search requests will no longer check the allocation awareness attributes for routing in the next major version, this change adds a deprecation warning on any setup that uses these attributes. Relates #43453	2019-10-24 09:25:50 +02:00
Alpar Torok	13df6beb53	Fix packaging tests on debian 10 (#48138 ) * Fix packaging tests on debian 10 - lintian no longer has `--fail-on-warnings` - Systemd has a build number / version after its actual version ``` alpar@alpar-ci-debian-10-1015-1517:~$ systemd --version systemd 241 (241) +PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ +LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD -IDN2 +IDN -PCRE2 default-hierarchy=hybrid ```	2019-10-17 14:29:33 +03:00
Alpar Torok	bc85b22c1f	Complete testclusters backport (#47623 ) * Use versions specific distribution folders so we don't need to clean up (#46539) * Retry deleting distro dir on windows When restarting the cluster we clean up old distribution files that might still be in use by the OS. Windows closes resources of dead processes async, so we do a couple of retries to get around it. Closes #46014 * Avoid having to delete the distro folder. * Remove the use of ClusterFormationTasks from RestTestTask (#47022) This PR removes a use-case of the ClusterFormationTasks and converts a project that flew under the radar so far. There's probably more clean-up possible here, but for now the goal is to be able to remove that code after `RunTask` is also updated. * Migrate some 7.x only projects	2019-10-07 11:43:57 +03:00
Alpar Torok	2b16d7bcf8	Backport testclusters all (#47565 ) * Bwc testclusters all (#46265) Convert all bwc projects to testclusters * Fix bwc versions config * WIP fix rolling upgrade * Fix bwc tests on old versions * Fix rolling upgrade	2019-10-04 16:12:53 +03:00
Alpar Torok	65c473bd4b	Fix windows packaging tests (#47554 ) On Windows, it happens that the process we called terminates but some other process it created still has the same output streams, and thus the files, open, so we can't clean it up. This PR makes the cleanup a best effort.	2019-10-04 14:02:57 +03:00
Alpar Torok	67bf3a4f51	Fix default distro bwc tests	2019-10-04 09:44:17 +03:00
Alpar Torok	97a0b7dcbc	Make All OS tests run on GCP instances (#46924 ) This PR makes the necessary adaptations to the tests and adds a PowerShell script to invoke the OS tests on GCP instances connected as CI workers. Also noticed that logs were not being produced by the tests and that these were not using log4j, so fixed that too. One of the difficulties in working on these tests was that the tests just stalled with no indication of where the problem was. To ease debugging, after Process Explorer suggested that the tests are running some commands, we now have multiple timeouts: one for the tests (which will generate a thread dump) and one for individual commands (which bails with the command being run and the output and error so far) to make it easier to see what went wrong. The tests were blocking because apparently the pipes to the sub-process were not closing, thus the threads were blocking on them and we were blocking indefinitely on the join. I'm not sure why this doesn't happen in vagrant, but we now properly deal with it.	2019-10-04 08:46:52 +03:00
Nhat Nguyen	44fdf2020a	Always flush in FullClusterRestartIT#testRecovery (#47465 ) The pattern in the latest failure is similar to the source fixed in #46956 but relates to synced-flush. If peer recovery happens after indexing, and indexing flushes some shard at the end, then a synced flush in the test will not roll or commit translog. Closes #46712	2019-10-02 18:03:22 -04:00
Yannick Welsch	99d2fe295d	Use optype CREATE for single auto-id index requests (#47353 ) Changes auto-id index requests to use optype CREATE, making it compliant with our docs. This will also make these auto-id index requests compatible with the new "create-doc" index privilege (which is based on the optype). The default optype is changed to create, just as it is already documented.	2019-10-02 14:16:52 +02:00
Yannick Welsch	7b2613db55	Allow optype CREATE for append-only indexing operations (#47169 ) Bulk requests currently do not allow adding "create" actions with auto-generated IDs. This commit allows using the optype CREATE for append-only indexing operations. This is mainly the user facing aspect of it.	2019-10-02 14:16:52 +02:00
Alpar Torok	a032f9b2d5	Backport testclusters fix bwc (#47363 ) * Add support for bwc for testclusters and convert full cluster restart (#45374) * Testclusters fix bwc (#46740) Additions to make testclusters work with later versions of ES * Do common node config on bwc tests Before this PR we only ever ran `ElasticsearchCluster.start` once, and the common node config was never done. This becomes apparent in upgrading from `6.x` to `7.x` as the new config is missing, preventing the cluster from starting. * Do common node config on bwc tests Before this PR we only ever ran `ElasticsearchCluster.start` once, and the common node config was never done. This becomes apparent in upgrading from `6.x` to `7.x` as the new config is missing, preventing the cluster from starting. * Fix logic to pick up snapshot from 6.x * Make sure ports are cleared * Fix test * Don't clear all the config as we rely on it * Fix removal of keys	2019-10-02 14:37:00 +03:00
Ryan Ernst	bd5f64848e	Clarify missing java error message (#46160 ) Since the bundled jdk was added to Elasticsearch, there are now 2 ways java can be missing. Either JAVA_HOME is set but does not exist, or the bundled jdk does not exist. This commit improves the error messages in those two cases, and also ensures our tests cover both cases.	2019-10-01 22:10:19 -07:00
Jason Tedor	7eed3e5585	Remove use of Setting#getRaw in deprecation tests (#47262 ) This commit removes a use of Setting#getRaw from the deprecation header tests. The use of Setting#getRaw is not needed here, the x-content infrastructure will take care of emitting the appropriate values here, and so the caller does not need to convert these to string representations of the settings values.	2019-09-30 06:35:59 -04:00
Ryan Ernst	b270f70be1	Wait for elasticsearch exit in packaging tests (#47242 ) The archives stopElasticsearch utility method sends SIGTERM to the elasticsearch process, but does not wait for it to exit. That can cause subsequent tests to sometimes fail. This commit adds wait logic to both linux and windows for the stopElasticsearch method. closes #44501	2019-09-30 12:09:26 +03:00
Ryan Ernst	33ebf5f5dc	Fix archive tests to not create elasticsearch home (#47239 ) The test for java home with special characters on linux would create a temporary java home under /home/elasticsearch. But our packaging assertions expect that to not exist. Unfortunately this would fail much later when the checks were actually done in bats tests. This commit fixes the linux test to match the behavior of windows, which links the entire java directory, and now does it into a /tmp dir. closes #45903	2019-09-30 12:09:26 +03:00
Rory Hunter	53a4d2176f	Convert most awaitBusy calls to assertBusy (#45794 ) (#47112 ) Backport of #45794 to 7.x. Convert most `awaitBusy` calls to `assertBusy`, and use asserts where possible. Follows on from #28548 by @liketic. There were a small number of places where it didn't make sense to me to call `assertBusy`, so I kept the existing calls but renamed the method to `waitUntil`. This was partly to better reflect its usage, and partly so that anyone trying to add a new call to awaitBusy wouldn't be able to find it. I also didn't change the usage in `TransportStopRollupAction` as the comments state that the local awaitBusy method is a temporary copy-and-paste. Other changes: * Rework `waitForDocs` to scale its timeout. Instead of calling `assertBusy` in a loop, work out a reasonable overall timeout and await just once. * Some tests failed after switching to `assertBusy` and had to be fixed. * Correct the expected templates in AbstractUpgradeTestCase. The ES Security team confirmed that they don't use templates any more, so remove this from the expected templates. Also rewrite how the setup code checks for templates, in order to give more information. * Remove an expected ML template from XPackRestTestConstants The ML team advised that the ML tests shouldn't be waiting for any `.ml-notifications` templates, since such checks should happen in the production code instead. Also rework the template checking code in `XPackRestTestHelper` to give more helpful failure messages. * Fix issue in `DataFrameSurvivesUpgradeIT` when upgrading from < 7.4	2019-09-29 12:21:46 +01:00
David Roberts	e943e27954	Spawn controller processes from a different directory on macOS (#47013 ) This is the Java side of https://github.com/elastic/ml-cpp/pull/593 with a fallback so that ml-cpp bundles with either the new or old directory structure work for the time being. A few days after merging the C++ changes a followup to this change will be made that removes the fallback.	2019-09-27 14:02:40 +01:00
Alpar Torok	813b130e08	Exclude the demo folder from the JDK (#47161 ) The folder contains jars with source code that fail the lintian test on debian (based) distributions.	2019-09-27 10:35:34 +03:00
Yannick Welsch	db63e78b68	Mute DebMetadataTests.test05CheckLintian Relates #46903	2019-09-25 09:29:14 +02:00
Julie Tibshirani	9124c94a6c	Add support for aliases in queries on _index. (#46944 ) Previously, queries on the _index field were not able to specify index aliases. This was a regression in functionality compared to the 'indices' query that was deprecated and removed in 6.0. Now queries on _index can specify an alias, which is resolved to the concrete index names when we check whether an index matches. To match a remote shard target, the pattern needs to be of the form 'cluster:index' to match the fully-qualified index name. Index aliases can be specified in the following query types: term, terms, prefix, and wildcard.	2019-09-23 13:21:37 -07:00
Nhat Nguyen	e8515d1d13	Force flush in FullClusterRestartIT#testRecovery (#46956 ) If peer recovery happens after indexing, and indexing flushes some shard at the end, then the explicit flush in the test will be a noop. Then replicas will have some uncommitted translog, which is transferred in peer recovery, although all of these operations are in the commit already. If that replica becomes primary (after we restarted the cluster), it will have translog to replay and the test will fail. Another issue in this test is that synced_flush is not a replication action, so the global checkpoint on replicas might not be up to date. We need to either wait for the global checkpoint to be synced or call a replication action to sync it. Closes #46712	2019-09-22 19:04:01 -04:00
Luca Cavanna	de47ea2cf4	Add SearchRestCancellationIT This test verifies automatic cancellation of search requests on connection close. It was previously not present in 7.x as the http client was subject to a bug which made testing cancellation of requests impossible. Now that the bug is fixed upstream, we can also backport this test	2019-09-12 09:45:29 +02:00
Jim Ferenczi	f2a6c88f83	Add a system property to ignore awareness attributes (#46375 ) This is a follow up of #19191 for 7.x. This change adds a system property called "es.routing.search_ignore_awareness_attributes" that when set to true will effectively ignore allocation awareness attributes when routing search and get requests. This is now the default in 8.x so this commit adds a way to opt-in to this new behavior in a minor version of 7.x. Relates #45735	2019-09-06 09:29:27 +02:00
Jason Tedor	92866f977a	Clarify error message on keystore write permissions (#46321 ) When the Elasticsearch process does not have write permissions to upgrade the Elasticsearch keystore, we bail with an error message that indicates there is a filesystem permissions problem. This commit clarifies that error message by pointing out the directory where write permissions are required, or that the user can also run the elasticsearch-keystore upgrade command manually before starting the Elasticsearch process. In this case, the upgrade would not be needed at runtime, so the permissions would not be needed then.	2019-09-05 15:11:54 -04:00
Nhat Nguyen	f91dd903cb	Strengthen testUpdate in rolling upgrade We hit a bug where we can't partially update documents created in a mixed cluster between 5.x and 6.x. Although this bug does not affect 7.0 or later, we should have a good test that catches this issue. Relates #46198	2019-09-03 18:40:57 -04:00
William Brafford	ff7fd9b9e2	Pass COMPUTERNAME env var to elasticsearch.bat (#45763 ) * Pass COMPUTERNAME env var to elasticsearch.bat When we run bin/elasticsearch with bash, we get a $HOSTNAME builtin that contains the hostname of the machine the script is running on. When there's no provided nodename, Elasticsearch uses the HOSTNAME to create a nodename. On Windows, Powershell provides a $COMPUTERNAME variable for the same purpose. CMD.EXE provides the same thing, except it's called %COMPUTERNAME%. bin/elasticsearch.bat sets $HOSTNAME to the value of $COMPUTERNAME. However, when testclusters invokes bin/elasticsearch.bat, the COMPUTERNAME variable doesn't get passed in, leaving HOSTNAME null and breaking an integration test on Windows. This commit sets COMPUTERNAME in the environment so that our tests get the value that Elasticsearch would have when bin/elasticsearch.bat is invoked from the shell. * Add null check to protect in non-Windows case What good is it a developer to gain the whole Windows if they forfeit their Unix? The value that fixes things on Windows is null on Linux/Darwin, so let's null-check it. * Override system hostnames for testclusters Rather than relying on variable system behavior, let's just override HOSTNAME and COMPUTERNAME and test for correct values in the integration test that was originally failing. * Rename constants for clarity Since we are setting HOSTNAME and COMPUTERNAME regardless of whether the tests are running on Windows or Linux, we shouldn't imply that constants are only used in one case or the other.	2019-08-26 11:44:31 -04:00
Jason Tedor	599bf2d68b	Deprecate the pidfile setting (#45938 ) This commit deprecates the pidfile setting in favor of node.pidfile.	2019-08-23 21:31:35 -04:00
Ryan Ernst	cff09bea00	Improve distro tests base class (#45761 ) (#45803 ) This commit moves many features of individual distro tests into the base class so that other test cases can utilize them. It also standardizes the pattern for tests adding assumptions for the particular distributions to test.	2019-08-21 14:18:26 -07:00
William Brafford	2b549e7342	CLI tools: write errors to stderr instead of stdout (#45586 ) Most of our CLI tools use the Terminal class, which previously did not provide methods for writing to standard error. When all output goes to standard out, there are two basic problems. First, errors and warnings are "swallowed" in pipelines, making it hard for a user to know when something's gone wrong. Second, errors and warnings are intermingled with legitimate output, making it difficult to pass the results of interactive scripts to other tools. This commit adds a second set of print commands to Terminal for printing to standard error, with errorPrint corresponding to print and errorPrintln corresponding to println. This leaves it to developers to decide which output should go where. It also adjusts existing commands to send errors and warnings to stderr. Usage is printed to standard output when it's correctly requested (e.g., bin/elasticsearch-keystore --help) but goes to standard error when a command is invoked incorrectly (e.g. bin/elasticsearch-keystore list-with-a-typo | sort).	2019-08-21 14:46:07 -04:00
Ryan Ernst	18fb63209b	Separate distro tests to be per distribution (#45565 ) The java based distribution tests currently have a single Tests class which encapsulates all of the tests for a particular distribution. The test task in gradle then depends on all distributions being built, and each individual tests class looks for the particular distribution it is trying to test. This means that reproducing a single test failure triggers all the distributions to be built, even though only one is needed for the test. This commit reworks the java distribution tests to pass in a particular distribution to be tested, and changes the base test classes to be actual test classes which have assumptions around which distributions they operate on. For example, the archives tests will be skipped when run with an rpm distribution, and vice versa for the package tests. This makes reproduction much more granular. It also allows better splitting up of tests around a particular use case. For example, all tests for systemd behavior can be in one test class, and run independently of all tests against rpm/deb distributions.	2019-08-20 13:12:15 -07:00
Jason Tedor	ec4182590f	Use bundled JDK in Sys V init (#45593 ) This commit addresses an issue when trying to use Elasticsearch on systems with Sys V init where the bundled JDK was not being used. Instead, we were still inadvertently trying to fall back on the path. This commit removes that fallback as that is against our intentions for 7.x where we only support the bundled JDK or an explicit JDK via JAVA_HOME.	2019-08-15 16:15:17 -04:00
Ryan Ernst	4af6d86c01	Rename vagrant project to os (#45509 ) (#45530 ) The system level tests for our distributions have historically been run in vagrant, and thus the name of the gradle project has been "vagrant". However, as we move to running these tests in other environments (eg GCP) the name vagrant no longer makes sense. This commit renames the project to "os" (short for operating system), since these tests ensure all of our distributions run correctly on our supported operating systems.	2019-08-14 10:30:41 -07:00
Ryan Ernst	80a3aeaef1	Split off upgrade and plugins bats tests (#45360 ) (#45508 ) The bats tests currently require many additional artifacts to be built. In addition to the current distributions, they need all the plugins to be installed, as well as a randomly chosen bwc distribution. This commit splits these two cases into their own bats task, so the dependencies do not slow down other tasks like distroTests which do not need them.	2019-08-13 12:45:29 -07:00
Ryan Ernst	884f26a1dc	Make distro test plugin apply to the top level project (#45406 ) (#45481 ) The distro test plugin was originally designed to be applied within each subproject, per operating system we run in a VM with vagrant. However, for efficiency, and also ease of having a single task to run in CI when launching within individual OS VMs, having the "destructive" tasks in a single place is more convenient. This commit reworks the distro test plugin to be applied to the qa/vagrant project, which now creates only the wrapper tasks in each of the subprojects for each vagrant VM.	2019-08-13 08:19:18 -07:00
Ryan Ernst	97efb6a403	Convert vagrant tests to per platform projects (#45064 ) The vagrant based tests currently reside in a single project, creating dozens of tasks to manage starting and stopping the vagrant VM along with running java and bats tests within each image. This all-in-one pattern makes parallelizing packaging tests difficult. This commit rewrites the vagrant testing infrastructure to be independent of the actual test runners, thus allowing each platform to be handled in a separate subproject. Additionally, the java and bats tests are changed to be run through a "destructive" gradle task, which is run inside the VM. The combination of these will allow parallelization both locally (through running several VMs at once) as well as running the destructive tasks in CI machines dedicated to each platform (thus removing the need for vagrant in CI).	2019-08-12 16:01:53 -07:00
Armin Braun	a9e1402189	Remove Settings from BaseRestRequest Constructor (#45418 ) (#45429 ) * Resolving the todo, cleaning up the unused `settings` parameter * Cleaning up some other minor dead code in affected classes	2019-08-12 05:14:45 +02:00
Alpar Torok	634a070430	Restrict which tasks can use testclusters (#45198 ) * Restrict which tasks can use testclusters This PR fixes a problem in the interaction between test-clusters and the build cache. Before this, any task could have used a cluster without tracking it as input. With this change a new interface is introduced to track the tasks that can use clusters, and we consider the cluster as input for all of them.	2019-08-09 13:38:01 +03:00
Jason Tedor	5b1b146099	Normalize environment paths (#45179 ) This commit applies a normalization process to environment paths, both in how they are stored internally and in their settings values. This normalization is done via two means: - we make the paths absolute - we remove redundant name elements from the path (what Java calls "normalization") This change ensures that when we compare and refer to these paths within the system, we are using a common ground. For example, prior to the change if the data path was relative, we would not compare it correctly to paths from disk usage. This is because the paths in disk usage were being made absolute.	2019-08-06 06:04:30 -04:00
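The two normalization steps named in the commit above (make the path absolute, then remove redundant name elements) can be illustrated with a short Python sketch. The `normalize` helper and the example paths are hypothetical; the sketch uses `posixpath` so the result is the same on every platform, and like Java's `Path.normalize` it works purely on the path text without touching the filesystem.

```python
import posixpath


def normalize(path, cwd):
    """Make `path` absolute relative to `cwd`, then drop '.' and '..' elements."""
    # posixpath.join keeps `path` unchanged if it is already absolute.
    absolute = posixpath.join(cwd, path)
    # normpath collapses redundant separators, '.' and '..' lexically.
    return posixpath.normpath(absolute)


# A relative data path and the absolute path reported elsewhere now compare equal.
relative = normalize("data/../data/./nodes", cwd="/srv/es")
reported = normalize("/srv/es/data/nodes", cwd="/srv/es")
```

Comparing `relative` and `reported` as strings only works once both have gone through the same normalization, which is the "common ground" the commit message refers to.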
David Turner	9ff320d967	Use index for peer recovery instead of translog (#45137 ) Today we recover a replica by copying operations from the primary's translog. However we also retain some historical operations in the index itself, as long as soft-deletes are enabled. This commit adjusts peer recovery to use the operations in the index for recovery rather than those in the translog, and ensures that the replication group retains enough history for use in peer recovery by means of retention leases. Reverts #38904 and #42211 Relates #41536 Backport of #45136 to 7.x.	2019-08-02 15:00:43 +01:00
Przemyslaw Gomulka	fe20e217a4	Deprecation messages with the same key but different x-opaque-id are allowed backport(#44587 ) #44682 The deprecation logger was filtering log entries by key, which means that if two log messages with the same key are logged by different users, the second log message will be filtered out. This change allows deprecation messages with the same key to be logged by different users. relates #41354 backport #44587	2019-07-22 11:38:11 +02:00
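The deduplication change described in the commit above can be sketched in a few lines of Python. This is an illustration of the idea only: the `DeprecationLog` class and its method names are hypothetical and do not mirror the Elasticsearch classes.

```python
class DeprecationLog:
    """Toy deprecation logger that filters repeats per (x-opaque-id, key)."""

    def __init__(self):
        self._seen = set()
        self.emitted = []

    def deprecated(self, key, message, x_opaque_id=""):
        # Deduplicate on the pair, not on the key alone, so two users
        # (distinguished by x-opaque-id) each get the warning once.
        dedup_key = (x_opaque_id, key)
        if dedup_key in self._seen:
            return False
        self._seen.add(dedup_key)
        self.emitted.append((x_opaque_id, message))
        return True
```

With key-only filtering, the second user's call would have been dropped; keying on the pair keeps one entry per user while still suppressing repeats from the same user.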
Alpar Torok	b34ac66d96	Mute multiple tests on Windows (7.x) (#44676 ) * Mute failing test tracked in #44552 * mute EvilSecurityTests tracking in #44558 * Fix line endings in ESJsonLayoutTests * Mute failing ForecastIT test on windows Tracking in #44609 * mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613 * fix mute testDefaultRenormalization * Increase busyWait timeout windows is slow * Mute failure unconfigured node name * mute x-pack internal cluster test windows tracking #44610 * Mute JvmErgonomicsTests on windows Tracking #44669 * mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671 * Mute NodeTests on Windows Tracking #44256	2019-07-22 11:32:29 +03:00
Ryan Ernst	226a753e93	Restore setting up temp dir for windows service (#44541 ) (#44661 ) In https://github.com/elastic/elasticsearch/pull/41913 setting up the temp dir for ES was moved from the env script to individual cli scripts. However, moving it to the windows service cli was missed. This commit restores setting up the temp dir for the windows service control script.	2019-07-21 13:54:46 -07:00
Dimitrios Liappis	651b7a001f	Support log file rolling in jvm presence packaging tests (#44548 ) Make JVM presence related packaging tests more resilient, by also checking for entries in rolled over log files. Closes #41436 Backport of #44423	2019-07-18 18:39:55 +03:00
Ryan Ernst	7e06888bae	Convert testclusters to use distro download plugin (#44253 ) (#44362 ) Test clusters currently has its own set of logic for dealing with finding different versions of Elasticsearch, downloading them, and extracting them. This commit converts testclusters to use the DistributionDownloadPlugin.	2019-07-15 17:53:05 -07:00
Przemyslaw Gomulka	e23ecc5838	JSON logging refactoring and X-Opaque-ID support backport(#41354 ) (#44178 ) This is a refactor of the current JSON logging to make it more open to extension and to support custom ES log messages used in DeprecationLogger, IndexingSlowLog, and SearchSlowLog. We want to include x-opaque-id in deprecation logs. The easiest way to have this as an additional JSON field instead of part of the message is to create a custom DeprecatedMessage (extends ESLogMessage). These messages are regular log4j messages with a text, but also carry a map of fields which can then populate the log pattern. The logic for this lives in ESJsonLayout and ESMessageFieldConverter. A similar approach can be used to refactor IndexingSlowLog and SearchSlowLog JSON logs to contain fields previously only present as an escaped JSON string in a message field. closes #41350 backport #41354	2019-07-12 16:53:27 +02:00
Henning Andersen	859709cc94	Closed index noop recovery during upgrade (#44072 ) Test that closed indices do noop recovery during rolling upgrade.	2019-07-09 11:46:42 +02:00
Przemyslaw Gomulka	553f783e73	Fix DieWithDignity test when waiting on jps backport(#43861 ) (#43871 ) The test often hangs on executing the jps command; we don't need to wait for this command to finish. closes #43413	2019-07-03 20:39:48 +02:00
Alpar Torok	23f739b513	Testclusters: Convert additional projects (#43625 ) * Testclusters: Convert additional projects Found some more that were not using testclusters from elasticsearch-ci/1 * Allow IOException too * Make the client more resilient	2019-06-28 16:44:42 +03:00
Alpar Torok	a38f509284	Testclusters: convert left-overs from checkPart1 (#43370 ) * Testclusters: convert left-overs from checkPart1	2019-06-25 19:14:45 +03:00
Alpar Torok	2beb193311	Disable testing conventions on Windows (#43532 ) Tests are disabled on Windows. Conventions also need to be disabled.	2019-06-25 18:49:08 +03:00
Przemyslaw Gomulka	67a67ac3ce	Fix DieWithDignity logs assertion backport(#43543 ) #43562 This test is likely to kill the server in the middle of writing logs. This means that we can end up with logs with partially written json log lines and standard json parsers would fail on this. This fix is to use regular expressions on json logs.(just like the previous approach on plain text logs) closes #43413	2019-06-25 14:35:24 +02:00
Andrei Stefan	84d3de1fee	Mute DieWithDignityIT test. (#43415 )	2019-06-20 15:10:21 +03:00
Mark Vieira	e44b8b1e2e	[Backport] Remove dependency substitutions 7.x (#42866 ) * Remove unnecessary usage of Gradle dependency substitution rules (#42773) (cherry picked from commit 12d583dbf6f7d44f00aa365e34fc7e937c3c61f7)	2019-06-04 13:50:23 -07:00
Przemyslaw Gomulka	d5061a151a	Remove suppressions for "unchecked" for hamcrest varargs methods Backport(41528) #42749 In hamcrest 2.1 warnings for unchecked varargs were fixed by hamcrest using @SafeVarargs for those matchers where this warning occurred. This PR aims to remove these annotations where Matchers.contains, Matchers.containsInAnyOrder or Matchers.hasItems was used. backport #41528	2019-05-31 13:58:49 +02:00
Jason Tedor	371cb9a8ce	Remove Log4j 1.2 API as a dependency (#42702 ) We had this as a dependency for legacy dependencies that still needed the Log4j 1.2 API. This appears to no longer be necessary, so this commit removes this artifact as a dependency. To remove this dependency, we had to fix a few places where we were accidentally relying on Log4j 1.2 instead of Log4j 2 (easy to do, since both APIs were on the compile-time classpath). Finally, we can remove our custom Netty logger factory. This was needed when we were on Log4j 1.2 and handled logging in our own unique way. When we migrated to Log4j 2 we could have dropped this dependency. However, even then Netty would still pick up Log4j 1.2 since it was on the classpath, thus the advantage to removing this as a dependency now.	2019-05-30 16:08:07 -04:00
Nhat Nguyen	ab832c4f17	Use doc instead of _doc in FullClusterRestartIT ES does not accept doc type starting with underscore until 6.2.0. We have to use "doc" instead of "_doc" in FullClusterRestartIT if we are upgrading from a 6.2.0- cluster. Closes #42581	2019-05-27 21:35:56 -04:00
Nhat Nguyen	4123ade2b6	Add test ensure we can execute update requests in mixed cluster Relates #42596	2019-05-27 18:27:50 -04:00
Ignacio Vera	5d3e381648	mute test testClosedIndices (#42582 )	2019-05-27 12:05:17 +02:00
Nhat Nguyen	5d2fcc53e4	Unmute FullClusterRestartIT#testClosedIndices Fixed in #39566 Closes #39576	2019-05-26 11:20:04 -04:00
Nhat Nguyen	84df48ccb3	Recovery with syncId should verify seqno infos (#41265 ) This change verifies and aborts recovery if source and target have the same syncId but different sequenceId. This commit also adds an upgrade test to ensure that we always utilize syncId.	2019-05-21 22:44:17 -04:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906 ) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to a confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (i.e. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and java clients.	2019-05-20 12:07:29 -04:00
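The parameter rules stated in the commit above (exactly one of `interval`, `calendar_interval`, or `fixed_interval`, with the old `interval` deprecated) can be sketched as a small validation function. This is a Python illustration under stated assumptions: `resolve_interval` and the warning text are hypothetical, and the sketch deliberately does not parse interval units, since the commit message does not list them.

```python
import warnings

INTERVAL_PARAMS = ("interval", "calendar_interval", "fixed_interval")


def resolve_interval(params):
    """Return (parameter_name, value) for the single interval parameter given."""
    given = [name for name in INTERVAL_PARAMS if name in params]
    if len(given) != 1:
        # The three parameters are mutually exclusive, and one is required.
        raise ValueError(f"expected exactly one of {INTERVAL_PARAMS}, got {given}")
    name = given[0]
    if name == "interval":
        # The legacy dual-purpose parameter still works but warns.
        warnings.warn(
            "[interval] is deprecated, use [calendar_interval] or [fixed_interval]",
            DeprecationWarning,
        )
    return name, params[name]
```

Making the caller name the interval kind removes the guesswork the commit describes, where `1d` and `2d` were silently interpreted under different rules.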
Ryan Ernst	69824ed908	Cleanup plugin bin directories (#41907 ) This commit adds deletion of the bin directory to postrm cleanup. While the package's bin files are cleaned up by the package manager, plugins may have created subdirectories under bin. We already cleanup plugins, but not the extra bin dirs their installation created. closes #18109	2019-05-10 11:00:41 -07:00
Ryan Ernst	6343ec3d3e	Update lintian overrides (#41561 ) (#41953 ) The deb package has been updated several times in the past to contain overrides in order to pass lintian inspection. However, there have never been any tests to ensure we do not fallback to failure. This commit updates the overrides file given things that have changed since 2.x like adding ML and bundling the jdk. closes #17185	2019-05-08 12:09:02 -07:00
Jason Tedor	d7fd51a84e	Provide names for all artifact repositories (#41857 ) This commit adds a name for each Maven and Ivy repository used in the build.	2019-05-07 06:35:28 -04:00
Jason Tedor	ff317d0d7f	Set metadata sources for Ivy repositories (#41818 ) We have faked some Ivy repositories on a few artifact locations. Today when Gradle attempts to resolve these artifacts, it follows its default strategy to search for Gradle metadata, then Maven POM files, then Ivy descriptors, and finally will fallback to looking directly for the artifact. This wastes times on remote network calls that will 404 anyway since these metadata resources will not exist for these fake Ivy repositories. This commit overrides the Gradle strategy to look directly for artifacts.	2019-05-05 11:16:52 -04:00
Jason Tedor	241c4ef97a	Use https for artifact locations This commit switches to using https for some artifact locations.	2019-05-03 16:15:48 -04:00
Jason Tedor	03c959f188	Upgrade keystore on package install (#41755 ) When Elasticsearch is run from a package installation, the running process does not have permissions to write to the keystore. This is because of the root:root ownership of /etc/elasticsearch. This is why we create the keystore if it does not exist during package installation. If the keystore needs to be upgraded, that is currently done by the running Elasticsearch process. Yet, as just mentioned, the Elasticsearch process would not have permissions to do that during runtime. Instead, this needs to be done during package upgrade. This commit adds an upgrade command to the keystore CLI for this purpose, and that is invoked during package upgrade if the keystore already exists. This ensures that we are always on the latest keystore format before the Elasticsearch process is invoked, and therefore no upgrade would be needed then. While this bug has always existed, we have not heard of reports of it in practice. Yet, this bug becomes a lot more likely with a recent change to the format of the keystore to remove the distinction between file and string entries.	2019-05-03 10:34:30 -04:00
Jason Tedor	7f3ab4524f	Bump 7.x branch to version 7.2.0 This commit adds the 7.2.0 version constant to the 7.x branch, and bumps BWC logic accordingly.	2019-05-01 13:38:57 -04:00
Jim Ferenczi	a8c3c5cd78	Restore rolling upgrade test for the _all field (#41462 ) This commit adapts the rolling upgrade test introduced in #37808 to ignore runs that upgrade from a version on or after 7.0. Closes #41453	2019-04-27 18:37:15 +02:00
Ryan Ernst	7e3875d781	Upgrade hamcrest to 2.1 (#41464 ) hamcrest has some improvements in newer versions, like FileMatchers that make assertions regarding file exists cleaner. This commit upgrades to the latest version of hamcrest so we can start using new and improved matchers.	2019-04-24 23:40:03 -07:00
Ryan Ernst	cc867e8a5a	Remove /var/run/elasticsearch from packages (#41102 ) The pid dir for both systemd and init.d is already managed by those respective systems (tmpfiles.d and the init script, respectively). Since the /var/run dir is often mounted as tmpfs, it does not make sense to have the elasticsearch pid dir added by the package installation. This commit removes that empty dir from deb and rpm.	2019-04-22 09:57:51 -07:00
Mark Vieira	1287c7d91f	[Backport] Replace usages RandomizedTestingTask with built-in Gradle Test (#40978 ) (#40993 ) * Replace usages RandomizedTestingTask with built-in Gradle Test (#40978) This commit replaces the existing RandomizedTestingTask and supporting code with Gradle's built-in JUnit support via the Test task type. Additionally, the previous workaround to disable all tasks named "test" and create new unit testing tasks named "unitTest" has been removed such that the "test" task now runs unit tests as per the normal Gradle Java plugin conventions. (cherry picked from commit 323f312bbc829a63056a79ebe45adced5099f6e6) * Fix forking JVM runner * Don't bump shadow plugin version	2019-04-09 11:52:50 -07:00
Nhat Nguyen	a9416708d0	Ensure relocation occur in testRelocationWithConcurrentIndexing (#40801 ) If the relocation is throttled, the subsequent search request on the target node (i.e., with preference _only_nodes=target_node) will fail because some shards have not moved to that node yet. With this change, we will wait for the relocation to happen by busily checking the routing table of the testing index on the target node. Closes #34950	2019-04-08 22:47:20 -04:00
Mark Vieira	2569fb60de	Avoid sharing source directories as it breaks intellij (#40877 ) * Avoid sharing source directories as it breaks intellij * Subprojects share main project output classes directory * Fix jar hell * Fix sql security with ssl integ tests * Relax dependency ordering rule so we don't explode on cycles	2019-04-08 17:26:46 +03:00
Alpar Torok	21b99a3aeb	Remove unneeded cluster config from test (#40856 ) This configuration doesn't influence the logger test. Should be removed to avoid confusion	2019-04-08 17:26:46 +03:00
Ioannis Kakavas	aea4e6596f	Source additional files correctly in elasticsearch-cli (#40890 ) Since we only source additional sources from the same dir as our cli scripts, resolve the path relevant to $ES_HOME	2019-04-05 17:42:34 -04:00
Przemyslaw Gomulka	65d25186d3	Bat scripts to work with JAVA_HOME with parentheses Backports (#40832 ) backports : Bat scripts to work with JAVA_HOME with parentheses (#39712) Link to SYSTEM_JAVA_HOME on windows (#40806)	2019-04-04 18:42:05 +02:00
Przemyslaw Gomulka	a6faf85f26	Migrate systemd packaging tests from bats to java backport(#39954 ) (#40763 ) Migrating systemd bats tests from bats to java dsl. This also covers partially the sysv, but more must be added relates #32143 backport #39954	2019-04-04 08:41:28 +02:00
Yannick Welsch	6ae7d593ea	Avoid background sync on relocated primary (#40800 ) There were some test failures caused by the background retention lease sync running on a relocated primary. This commit fixes the situation that triggered the assertion and reactivates the failing test. Closes #40731	2019-04-03 20:28:48 +02:00
Jason Tedor	df65e46d10	Deprecate versions of Java prior to Java 11 (#40756 ) This commit deprecates versions of Java prior to Java 11. This commit will cause a warning to be printed to standard error when any command line tool is invoked, or when Elasticsearch is started. Additionally, we log a deprecation message when Elasticsearch is started.	2019-04-03 06:39:40 -04:00
Dimitrios Liappis	a3f53cfd05	[TEST] Mute RecoveryWithConcurrentIndexing test (#40735 ) Due to https://github.com/elastic/elasticsearch/issues/40731 Backport of https://github.com/elastic/elasticsearch/pull/40733	2019-04-02 16:23:22 +03:00
Luca Cavanna	f2ca45c210	Add integration tests to verify CCS output (#40038 ) We recently introduced the option to minimize network roundtrips when executing cross-cluster search requests. All the changes made around that are separately unit tested, and there are some yaml tests that exercise the new code-path which involves multiple coordination steps. This commit adds new integration tests that compare the output given by CCS when running the same queries using the two different execution modes available. Relates to #32125	2019-03-29 17:01:14 +01:00
Ryan Ernst	aa24669e95	Resolve JAVA_HOME at windows service install time (#39714 ) On windows, JAVA_HOME is currently resolved when the windows service is started. However, this is contrary to what our documentation states. This commit moves resolution to service install. This has the side effect of making java existence checking optional in elasticsearch-env.bat, since the rest of the service commands do not require java. closes #30720	2019-03-26 19:48:02 -07:00
Nhat Nguyen	606411bce8	Mute testRelocationWithConcurrentIndexing Tracked at #34950	2019-03-20 12:46:36 -04:00
Nhat Nguyen	d58864745c	Dump recovery if fail to get doc count with preference (#40168 ) With this change, we will dump the recovery state if we fail to get doc count for a given index with a preference in rolling upgrade tests. We should have more information to look into why the provided preference is not valid. I also unmuted `testRelocationWithConcurrentIndexing` in this change. Relates #34950	2019-03-19 13:09:04 -04:00
Henning Andersen	dde41cc2dd	Node repurpose tool (#39403 ) When a node is repurposed to master/no-data or no-master/no-data, v7.x will not start (see #37748 and #37347). The `elasticsearch repurpose` tool can fix this by cleaning up the problematic data.	2019-03-19 11:52:02 +01:00
Andrey Ershov	d35d9d1886	cd ES_HOME in elasticsearch-env (#39937 ) This commit adds cd $ES_HOME to elasticsearch-env and removes it from elasticsearch. This way, both elasticsearch and elasticsearch-cli are executed with the working directory set to $ES_HOME. The need for the fix arose from the following bug: 1. Explicitly set path.data to relative to ES_HOME path in elasticsearch.yml. 2. Run elasticsearch from any directory. Elasticsearch is able to correctly start. 3. Stop elasticsearch. 4. Run elasticsearch-node unsafe-bootstrap, not from ES_HOME directory. It will fail with an exception. This commit fixes the issue and adds a new test. Also tests >=100 are renamed because alphabetic order does not work for them. (cherry picked from commit 2ffc29306ff7366efc598e7b4dd2ce528895cd3a with fixes by #40083 and #40118)	2019-03-18 16:26:27 +01:00
Jason Tedor	86d1d03c37	Remove cluster state size (#40109 ) This commit removes the cluster state size field from the cluster state response, and drops the backwards compatibility layer added in 6.7.0 to continue to support this field. As calculation of this field was expensive and had dubious value, we have elected to remove this field.	2019-03-15 17:16:25 -04:00
Ryan Ernst	8f09c77777	Add no-jdk distributions (#39882 ) This commit adds a variant for every official distribution that omits the bundled jdk. The "no-jdk" naming is conveyed through the package classifier, alongside the platform. Package tests are also added for each new distribution.	2019-03-15 00:55:57 -07:00
Jason Tedor	9181668edf	Stop returning cluster state size by default (#40016 ) Computing the compressed size of the cluster state on every invocation of cluster:monitor/state action is expensive, and the value of this field is dubious anyway. Therefore we want to remove computing this field. As a first step, we stop computing and return this field by default. To avoid breaking users, we will give them a system property to use to tide them over until the next major release when we will actually remove this field. This comes with a deprecation warning too, and the backport to the appropriate minor will also include a note in the migration guide. There will be a follow-up to remove this field in the next major version.	2019-03-14 08:57:55 -04:00
Yannick Welsch	4f941c6963	Do not swallow exceptions in TimedRunnable (#39856 ) Executors of type fixed_auto_queue_size (i.e. search / search_throttled) wrap runnables into TimedRunnable, which is an AbstractRunnable. This is dangerous as it might silently swallow exceptions, and possibly miss calling a response listener. While this has not triggered any failures in the tests I have run so far, it might help uncover future problems. Follow-up to #36137	2019-03-11 19:03:12 +01:00
Ryan Ernst	465343f12a	Bundle java in distributions (#38013 ) * Bundle java in distributions Setting up a jdk is currently a required external step when installing elasticsearch. This is particularly problematic for the rpm/deb packages as installing a jdk in the same package installation command does not guarantee any order, so must be done in separate steps. Additionally, JAVA_HOME must be set and often causes problems in selecting a correct jdk when, for example, the system java is an older unsupported version. This commit bundles platform specific openjdks into each distribution. In addition to eliminating the issues above, it also presents future possible improvements like using jlink to build jdk images only containing modules that elasticsearch uses. closes #31845	2019-03-08 11:04:18 -08:00
Przemyslaw Gomulka	c23ff0803e	Fix JsonLoggerTests.testJsonInStacktraceMessageIsSplitted backport#39830 #39834 a test was splitting a json log message with a wrong line separator. Using System.lineSeparator consistently prevents this problem closes #39829 backport#39830	2019-03-08 16:34:17 +01:00
Alpar Torok	0f89427eb6	Back port build changes from same version bwc tests (#39744 ) * Back port build changes from #39102 This back-ports how versions are determined and bwc tests are set up from #39102 without enabling the bwc from current version tests so it's easier/possible to backmerge future build changes. It's expected that the tests are lacking many of the required fixes in this version to enable them.	2019-03-07 17:25:09 +02:00
Nhat Nguyen	15ab0e23e0	Unmute testHistoryUUIDIsGenerated Fixed in #32270 Relates to #31291	2019-03-04 10:21:23 -05:00
David Turner	f5fb93afdf	Mute FullClusterRestartIT#testClosedIndices (#39600 ) Relates #39576	2019-03-02 12:26:45 +00:00
Tanguy Leroux	e005eeb0b3	Backport support for replicating closed indices to 7.x (#39506 )(#39499 ) Backport support for replicating closed indices (#39499) Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. 
Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888	2019-03-01 14:48:26 +01:00
Ryan Ernst	6e7b643775	Deprecate fallback to java on PATH (#37990 ) Finding java on the path is sometimes confusing for users and unexpected, as well as leading to a different java being used than a user expects. This commit adds warning messages when starting elasticsearch (or any tools like the plugin cli) and using java found on the PATH instead of via JAVA_HOME.	2019-02-20 17:07:11 -08:00
Christoph Büscher	9f6c77fad4	Fix FullClusterRestartIT#testSnapshotRestore (#38795 ) This test failed on 7.1 when running full cluster restart tests against pre-7.0 clusters (e.g. 6.6 clusters). This fixes the expected type in the templates after the cluster restart.	2019-02-15 20:12:26 +01:00
Julie Tibshirani	e769cb4efd	Perform precise check for types warnings in cluster restart tests. (#37944 ) Instead of using `WarningsHandler.PERMISSIVE`, we only match warnings that are due to types removal. This PR also renames `allowTypeRemovalWarnings` to `allowTypesRemovalWarnings`. Relates to #37920.	2019-02-13 11:28:58 -08:00
Przemyslaw Gomulka	7404882105	Fix line separators in JSON logging tests backport#38771 #38834 The hardcoded '\n' in string will not work in Windows where there is a different line separator. A System.lineSeparator should be used to make it work on all platforms closes #38705 backport #38771	2019-02-13 13:34:33 +01:00
Alpar Torok	bd4ca4c702	Rename integTest to bwcTestSample for bwc test projects (#38433 ) * Rename integTest to bwcTestSample for bwc test projects This change renames the `integTest` task to `bwcTestSample` for projects testing bwc to make it possible to run all the bwc tests that check would run without running on bwc tests. This change makes it possible to add a new PR check on backports to make sure these don't break BWC tests in master. * Rename task as per PR	2019-02-11 15:05:16 +02:00
Alpar Torok	7e10592940	fix verify version constants	2019-02-11 12:23:48 +02:00
Martijn van Groningen	b284fede0b	Make qa/full-cluster-restart tests pass. By fixing a helper method and (#38604 ) muting a test. Relates to #38603	2019-02-08 14:14:23 +01:00
Tim Vernum	1008f1c68e	Only "include_type_name" if running on >= 7 (#38594 ) In cluster restart tests, we need to "include_type_name" if the cluster includes a pre-7 version, but the test is running against a 7+ version	2019-02-08 18:06:15 +11:00
Jason Tedor	fdf6b3f23f	Add 7.1 version constant to 7.x branch (#38513 ) This commit adds the 7.1 version constant to the 7.x branch. Co-authored-by: Andy Bristol <andy.bristol@elastic.co> Co-authored-by: Tim Brooks <tim@uncontended.net> Co-authored-by: Christoph Büscher <cbuescher@posteo.de> Co-authored-by: Luca Cavanna <javanna@users.noreply.github.com> Co-authored-by: markharwood <markharwood@gmail.com> Co-authored-by: Ioannis Kakavas <ioannis@elastic.co> Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co> Co-authored-by: David Roberts <dave.roberts@elastic.co> Co-authored-by: Jason Tedor <jason@tedor.me> Co-authored-by: Alpar Torok <torokalpar@gmail.com> Co-authored-by: David Turner <david.turner@elastic.co> Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Tim Vernum <tim@adjective.org> Co-authored-by: Albert Zaharovits <albert.zaharovits@gmail.com>	2019-02-07 16:32:27 -05:00
Luca Cavanna	a7046e001c	Remove support for maxRetryTimeout from low-level REST client (#38085 ) We have had various reports of problems caused by the maxRetryTimeout setting in the low-level REST client. Such setting was initially added in the attempts to not have requests go through retries if the request already took longer than the provided timeout. The implementation was problematic though as such timeout would also expire in the first request attempt (see #31834), would leave the request executing after expiration causing memory leaks (see #33342), and would not take into account the http client internal queuing (see #25951). Given all these issues, it seems that this custom timeout mechanism gives little benefits while causing a lot of harm. We should rather rely on connect and socket timeout exposed by the underlying http client and accept that a request can overall take longer than the configured timeout, which is the case even with a single retry anyways. This commit removes the `maxRetryTimeout` setting and all of its usages.	2019-02-06 08:43:47 +01:00
Tim Vernum	517aa95984	Fix exit code in certutil packaging test (#38393 ) The exit code is different on windows, and we don't really care about which code it is, we just need to check that it's not 0 (success)	2019-02-06 12:05:44 +11:00
Henning Andersen	20c66c5a05	Bubble-up exceptions from scheduler (#38317 ) Instead of logging warnings we now rethrow exceptions thrown inside scheduled/submitted tasks. This will still log them as warnings in production but has the added benefit that if they are thrown during unit/integration test runs, the test will be flagged as an error. This is a continuation of #38014 Fixed NPE that caused CCR tests (IndexFollowingIT and likely others) to fail. schedule could bubble rejected exception to uncaught exception handler when not using SAME executor if thread pool is terminated. Now ignore rejected exception silently if executor is shutdown.	2019-02-05 21:48:24 +01:00
Mayya Sharipova	e4fa32470b	Types removal fix FullClusterRestartIT warnings (#38445 ) Backport PR #38389 for 6.7 produces warnings for rollover test. This fixes FullClusterRestartIT warning expectations for rollover request Relates to #38389	2019-02-05 14:15:43 -05:00
markharwood	578fd14257	Types removal - fix FullClusterRestartIT warning expectations (#38310 ) Relax test warning message checking to pre-empt PR 38022 landing in 6.7 with new warning messages. The relaxed test now just assumes any warning message starting with “[types removal]” is tolerated rather than the precise phrasing used in the 6.7 branch.	2019-02-04 20:09:07 +00:00
Nhat Nguyen	75abb5b8a6	Adapt LLRest warning exception in FullClusterRestartIT (#38253 ) We now throw a WarningFailureException instead of ResponseException if there's any warning in a response. This change leads to the failures of testSnapshotRestore in the BWC builds for the last two days. Relates #37247	2019-02-02 12:09:14 -05:00
Julie Tibshirani	c2e9d13ebd	Default include_type_name to false in the yml test harness. (#38058 ) This PR removes the temporary change we made to the yml test harness in #37285 to automatically set `include_type_name` to `true` in index creation requests if it's not already specified. This is possible now that the vast majority of index creation requests were updated to be typeless in #37611. A few additional tests also needed updating here. Additionally, this PR updates the test harness to set `include_type_name` to `false` in index creation requests when communicating with 6.x nodes. This mirrors the logic added in #37611 to allow for typeless document write requests in test set-up code. With this update in place, we can remove many references to `include_type_name: false` from the yml tests.	2019-02-01 11:44:13 -08:00
Luca Cavanna	e18cac3659	Add finalReduce flag to SearchRequest (#38104 ) With #37000 we made sure that final reduction is automatically disabled whenever a localClusterAlias is provided with a SearchRequest. While working on #37838, we found a scenario where we do need to set a localClusterAlias yet we would like to perform a final reduction in the remote cluster: when searching on a single remote cluster. Relates to #32125 This commit adds support for a separate finalReduce flag to SearchRequest and makes use of it in TransportSearchAction in case we are searching against a single remote cluster. This also makes sure that num_reduce_phases is correct when searching against a single remote cluster: it makes little sense to return `num_reduce_phases` set to `2`, which looks especially weird in case the search was performed against a single remote shard. We should perform one reduction phase only in this case and `num_reduce_phases` should reflect that. * line length	2019-02-01 12:11:42 +01:00
Tanguy Leroux	0e6a7c20a1	Fix FullClusterRestartIT.testHistoryUUIDIsAdded (#38098 ) This test failed once because the index wasn't fully ready (ie, engine opened). This commit changes the test so that it waits for the index to be green before checking the history UUID. Closes #34452	2019-02-01 11:16:54 +01:00
Henning Andersen	68ed72b923	Handle scheduler exceptions (#38014 ) Scheduler.schedule(...) would previously assume that caller handles exception by calling get() on the returned ScheduledFuture. schedule() now returns a ScheduledCancellable that no longer gives access to the exception. Instead, any exception thrown out of a scheduled Runnable is logged as a warning. This is a continuation of #28667, #36137 and also fixes #37708.	2019-01-31 17:51:45 +01:00
Luca Cavanna	622fb7883b	Introduce ability to minimize round-trips in CCS (#37828 ) With #37566 we have introduced the ability to merge multiple search responses into one. That makes it possible to expose a new way of executing cross-cluster search requests, that makes CCS much faster whenever there is network latency between the CCS coordinating node and the remote clusters. The coordinating node can now send a single search request to each remote cluster, which gets reduced by each one of them. from + size results are requested to each cluster, and the reduce phase in each cluster is non final (meaning that buckets are not pruned and pipeline aggs are not executed). The CCS coordinating node performs an additional, final reduction, which produces one search response out of the multiple responses received from the different clusters. This new execution path will be activated by default for any CCS request unless a scroll is provided or inner hits are requested as part of field collapsing. The search API accepts now a new parameter called ccs_minimize_roundtrips that allows to opt-out of the default behaviour. Relates to #32125	2019-01-31 15:12:14 +01:00
David Turner	81c443c9de	Deprecate minimum_master_nodes (#37868 ) Today we pass `discovery.zen.minimum_master_nodes` to nodes started up in tests, but for 7.x nodes this setting is not required as it has no effect. This commit removes this setting so that nodes are started with more realistic configurations, and deprecates it.	2019-01-30 20:09:15 +00:00
David Roberts	2f7776c8b7	Switch default time format for ingest from Joda to Java for v7 (#37934 ) Date formats with and without the "8" prefix are now all treated as Java time formats, so that ingest does the same as mappings in this respect.	2019-01-30 16:26:28 +00:00
Adrien Grand	c8af0f4bfa	Use mappings to format doc-value fields by default. (#30831 ) Doc-value fields now return a value that is based on the mappings rather than the script implementation by default. This deprecates the special `use_field_mapping` docvalue format which was added in #29639 only to ease the transition to 7.x and it is not necessary anymore in 7.0.	2019-01-30 10:31:51 +01:00
Jim Ferenczi	5dcc805dc9	Restore a noop _all metadata field for 6x indices (#37808 ) This commit restores a noop version of the AllFieldMapper that is instantiated only for indices created in 6x. We need this metadata field mapper to be present in this version in order to allow the upgrade of indices that explicitly disable _all (enabled: false). The mapping of these indices contains a reference to the _all field that we cannot remove in 7 so we'll need to keep this metadata mapper in 7x. Since indices created in 6x will not be compatible with 8, we'll remove this noop mapper in the next major version. Closes #37429	2019-01-30 08:45:50 +01:00
Tim Vernum	99129d7786	Fix exit code for Security CLI tools (#37956 ) The certgen, certutil and saml-metadata tools did not correctly return their exit code to the calling shell. These commands now explicitly exit with the code that was returned from the main(args, terminal) method.	2019-01-30 17:51:11 +11:00
markharwood	b889221f75	Types removal - deprecate include_type_name with index templates (#37484 ) Added deprecation warnings for use of include_type_name in put/get index templates. HLRC changes: GetIndexTemplateRequest has a new client-side class which is a copy of server's GetIndexTemplateResponse but modified to be typeless. PutIndexTemplateRequest has a new client-side counterpart which doesn't use types in the mappings Relates to #35190	2019-01-29 20:52:41 +00:00
Ryan Ernst	8e5f9c4b14	Add OS/architecture classifier to distributions (#37881 ) This commit adds classifiers to the distributions indicating the OS (for archives) and platform. The current OSes are for windows, darwin (ie macos) and linux. This change will allow future OS/architecture specific changes to the distributions. Note the docs using distribution links have been updated, but will be reworked in a followup to make OS specific instructions for the archives.	2019-01-29 11:18:30 -08:00
Przemyslaw Gomulka	891320f5ac	Elasticsearch support to JSON logging (#36833 ) In order to support JSON log format, a custom pattern layout was used and its configuration is enclosed in ESJsonLayout. Users are free to use their own patterns, but if smooth Beats integration is needed, they should use ESJsonLayout. EvilLoggerTests are left intact to make sure user's custom log patterns work fine. To populate additional fields node.id and cluster.uuid which are not available at start time, a cluster state update will have to be received and the values passed to log4j pattern converter. A ClusterStateObserver.Listener is used to receive only one cluster state update. Once the update is received, the nodeId and clusterUUid are set in a static field in a NodeAndClusterIdConverter. The following fields are expected in JSON log lines: type, timestamp, level, component, cluster.name, node.name, node.id, cluster.uuid, message, stacktrace. See ESJsonLayout.java for more details and field descriptions. The Docker log4j2 configuration is now almost the same as the one used for the ES binary. The only difference is that docker is using console appenders, whereas ES is using file appenders. relates: #32850	2019-01-29 07:20:09 +01:00
Martijn van Groningen	e401ab1724	Handle deprecation warnings in a permissive manner. Closes #37920	2019-01-28 15:02:50 +01:00
Luca Cavanna	519423b8f3	Mute failing full-cluster-restart tests Relates to #37920	2019-01-28 14:07:19 +01:00
Martijn van Groningen	445db97867	each full cluster restart round should use its own repository, otherwise snapshots from e.g. 6.5.5 to current and 6.6.0 to current full cluster restart round collides.	2019-01-28 13:56:02 +01:00
Ryan Ernst	acc3cae40c	Remove "reinstall" packaging tests (#37851 ) The packaging tests currently have a test which installs elasticsearch, removes it, modifies ownership of /etc/elasticsearch, and reinstalls. It then checks that the /etc/elasticsearch directory has ownership that the package expects. But the recursive change touches files not owned by the package. In the past this worked because we did a recursive ownership change within the package postinst. However, that was recently removed, and thus this test no longer makes sense.	2019-01-25 08:07:49 -08:00
Alexander Reelsen	9e350d027e	Add BWC compatible processing to ingest date processors (#37407 ) The ingest date processor is currently only able to parse joda formats. However, it does not use the existing Elasticsearch classes but accesses joda directly. This means that our existing BWC layer does not notify the user about deprecated formats. This commit switches to the existing Elasticsearch Joda methods to acquire a date format, which includes the BWC check and the ability to parse java 8 dates. Date parsing in ingest also has an extra feature: when a date format without a year is used, the fallback year is the current year, not 1970 as usual. This is currently not properly supported in the DateFormatter class. As this is the only case for this feature and java time can take care of this using the toZonedDateTime() method, a workaround just for the joda time parser has been created, which can be removed again soon from 7.0.	2019-01-25 13:50:19 +01:00
Andrey Ershov	4974684003	Add tool elasticsearch-node unsafe-bootstrap (#37696 ) The elasticsearch-node tool helps to restore a cluster if half or more of the master-eligible nodes are lost. Of course, all bets are off regarding data consistency. There are two parts to the tool: unsafe-bootstrap, to be used when there is still at least one master-eligible node alive, and detach-cluster, for when there are no master-eligible nodes left. This commit implements the first part. Docs for the tool will be added separately as a part of #37812.	2019-01-24 19:25:55 +01:00
Alpar Torok	37768b7eac	Testing conventions now checks for tests in main (#37321 ) * Testing conventions now checks for tests in main This is the last outstanding feature of the old NamingConventionsTask, so time to remove it. * PR review	2019-01-24 17:30:50 +02:00
Alexander Reelsen	daa2ec8a60	Switch mapping/aggregations over to java time (#36363 ) This commit moves the aggregation and mapping code from joda time to java time. This includes field mappers, root object mappers, aggregations with date histograms, query builders and a lot of changes within tests. The cut-over to java time is a requirement so that we can support nanoseconds properly in a future field mapper. Relates #27330	2019-01-23 10:40:05 +01:00
Boaz Leskes	52ba407931	Expose sequence number and primary terms in search responses (#37639 ) Users may require the sequence number and primary term to perform optimistic concurrency control operations. Currently, you can get the sequence number via the `docvalue_fields` API, but the primary term is not accessible because it is maintained by the `SeqNoFieldMapper` and the infrastructure can't find it. This commit adds a dedicated sub fetch phase, connected to a new `seq_no_primary_term` parameter, that returns both numbers.	2019-01-23 09:01:58 +01:00
Alpar Torok	3f2723366e	Mute failing test Tracking #37708	2019-01-22 17:16:40 +02:00
Ryan Ernst	9a34b20233	Simplify integ test distribution types (#37618 ) The integ tests currently use the raw zip project name as the distribution type. This commit simplifies this specification to be "default" or "oss". Whether zip or tar is used should be an internal implementation detail of the integ test setup, which can (in the future) be platform specific.	2019-01-21 12:37:17 -08:00
Yannick Welsch	6d64a2a901	Propagate Errors in executors to uncaught exception handler (#36137 ) This is a continuation of #28667 and aims to convert all executors to propagate errors to the uncaught exception handler. Notable missing ones were the direct executor and the scheduler. This commit also makes it the responsibility of the executor, not the runnable, to ensure this property. A big part of this commit also consists of vastly improving the test coverage in this area.	2019-01-17 17:46:35 +01:00

1462 Commits