OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alpar Torok	36d018c909	Convert RunTask to use testclusers, remove ClusterFormationTasks (#47572 ) * Convert RunTask to use testclusers, remove ClusterFormationTasks This PR adds a new RunTask and a way for it to start a testclusters cluster out of band and block on it to replace the old RunTask that used ClusterFormationTasks. With this we can now remove ClusterFormationTasks.	2019-10-08 14:43:29 +03:00
Alpar Torok	bc85b22c1f	Complete testclusters backport (#47623 ) * Use versions specific distribution folders so we don't need to clean up (#46539) * Retry deleting distro dir on windows When retarting the cluster we clean up old distribution files that might still be in use by the OS. Windows closes resources of ded processes async, so we do a couple of retries to get arround it. Closes #46014 * Avoid having to delete the distro folder. * Remove the use of ClusterFormationTasks form RestTestTask (#47022) This PR removes a use-case of the ClusterFormationTasks and converts a project that flew under the radar so far. There's probably more clean-up possible here, but for now the goal is to be able to remove that code after `RunTask` is also updated. * Migrate some 7.x only projects	2019-10-07 11:43:57 +03:00
Jason Tedor	43f588a29e	Fix compilation in JVM options parser code Compilation was accidentally broken here when a backport used code from JDK 9, which is not supported in 7.x. This commit addresses this by using JDK 8 compatiable APIs.	2019-10-04 19:29:20 -04:00
Jason Tedor	8a7e5b0847	Move ES_TMPDIR substitution into jvm options parser (#47189 ) This commit moves the ES_TMPDIR substitution that we do for JVM options into the JVM options parser itself. This solves a problem where the fact that the we do not make the substitution before ergonomics parsing can lead to the JVM that we start for computing the ergonomic values failing to start. Additionally, moving this substitution here enables us to simplify the shell scripts since we do not need to implement this there, and twice for Bash and Windows.	2019-10-04 19:12:28 -04:00
Alpar Torok	97a0b7dcbc	Make All OS tests run on GCP instances (#46924 ) This PR makes the necesary adaptations to the tests and adds a power shell script to invoke the OS tests on GCP instances connected as CI workers. Also noticed that logs were not being produced by the tests and that theses were not using log4j so fixed that too. One of the difficulties in working on theses tests was that the tests just stalled with no indication where the problem is. To ease with the debugging, after process explorer suggested that the tests are running some commands, we now have multiple timeouts: one for the tests ( which will generate a thread dump ) and one for individual commands ( that bails with the command being ran and output and error so far ) to make it easier to see what went wrong. The tests were blocking because apparently the pipes to the sub-process were not closing, thus the threads were blocking on them and we were blocking indefinitely on the join. I'm not sure why this doesn't happen in vagrant, but we now properly deal with it.	2019-10-04 08:46:52 +03:00
Ryan Ernst	bd5f64848e	Clarify missing java error message (#46160 ) Since the bundled jdk was added to Elasticsearch, there are now 2 ways java can be missing. Either JAVA_HOME is set but does not exist, or the bundled jdk does not exist. This commit improves the error messages in those two cases, and also ensures our tests cover both cases.	2019-10-01 22:10:19 -07:00
Alpar Torok	d229e15b8d	Add workaround for building docker on debian 8 (#47106 ) Looks like there's a workaround with aufs used in debian 8. Adding `tsflags=nodocs` works around this issue and results in smaller image files also. Closes #47097 and elastic/infra#14780	2019-09-30 09:35:45 +03:00
David Roberts	e943e27954	Spawn controller processes from a different directory on macOS (#47013 ) This is the Java side of https://github.com/elastic/ml-cpp/pull/593 with a fallback so that ml-cpp bundles with either the new or old directory structure work for the time being. A few days after merging the C++ changes a followup to this change will be made that removes the fallback.	2019-09-27 14:02:40 +01:00
Alpar Torok	813b130e08	Exclude the demo folder form the JDK (#47161 ) The folder contains jars with source code that fail the lintian test on debian (based) distributions.	2019-09-27 10:35:34 +03:00
Alpar Torok	944421627d	Work around error building deb on Windows (#47011 ) Relates to #47007 . the `gradle-ospackage-plugin` plugin doesn't properly support symlink on windows. This PR changes the way we configure tasks to prevent building these packages as part of a windows check.	2019-09-27 09:04:29 +03:00
Henning Andersen	f06aa0c6c0	Fix G1 GC default IHOP (#46169 ) G1 GC were setup to use an `InitiatingHeapOccupancyPercent` of 75. This could leave used memory at a very high level for an extended duration, triggering the real memory circuit breaker even at low activity levels. The value is a threshold for old generation usage relative to total heap size and thus it should leave room for the new generation. Default in G1 is to allow up to 60 percent for new generation and this could mean that the threshold was effectively at 135% heap usage. GC would still kick in of course and eventually enough mixed collections would take place such that adaptive adjustment of IHOP kicks in. The JVM has adaptive setting of the IHOP, but this does not kick in until it has sampled a few collections. A newly started, relatively quiet server with primarily new generation activity could thus experience heap above 95% frequently for a duration. The changes here are two-fold: 1. Use 30% default for IHOP (the JVM default of 45 could still mean 105% heap usage threshold and did not fully ensure not to hit the circuit breaker with low activity) 2. Set G1ReservePercent=25. This is used by the adaptive IHOP mechanism, meaning old/mixed GC should kick in no later than at 75% heap. This ensures IHOP stays compatible with the real memory circuit breaker also after being adjusted by adaptive IHOP.	2019-09-23 13:35:31 +02:00
Alpar Torok	5fd7505efc	Testfixtures allow a single service only (#46780 ) This PR adds some restrictions around testfixtures to make sure the same service ( as defiend in docker-compose.yml ) is not shared between multiple projects. Sharing would break running with --parallel. Projects can still share fixtures as long as each has it;s own service within. This is still useful to share some of the setup and configuration code of the fixture. Project now also have to specify a service name when calling useCluster to refer to a specific service. If this is not the case all services will be claimed and the fixture can't be shared. For this reason fixtures have to explicitly specify if they are using themselves ( fixture and tests in the same project ).	2019-09-23 14:13:49 +03:00
Mark Vieira	507d879b70	Disable bwc distribution caching in master branch (#46686 ) This commit disables caching of BWC snapshot distributions in the "trunk" (aka master) branch. Since the previous major release branches move quickly we rarely get cache hits for these tasks, and the artifacts themselves are very large. This means the overhead here is high and savings basically zero. We conditionally disable task output caching in this scenario in CI to avoid excessive build cache overhead as well as causing too much turn in the cache itself which would lead to lots of cache entry evictions.	2019-09-18 11:52:17 -07:00
Jason Tedor	cd71d4a83b	Use AdoptOpenJDK as the bundled JDK (#46470 ) (#46785 ) This commit teaches the build how to bundle AdoptOpenJDK with our artifacts, and switches to AdoptOpenJDK as the bundled JDK. We keep the functionality to also bundle Oracle OpenJDK distributions.	2019-09-17 13:40:35 -07:00
Mark Vieira	9774377959	Log actual checkout hash instead of generic refspec (#46637 )	2019-09-13 16:07:39 -07:00
Zachary Tong	cf8a4171e1	Rename `data-science` plugin to `analytics` (#46133 ) Rename `data-science` plugin to `analytics`. Also removes enabled flag. Backport of #46092	2019-08-29 12:45:39 -04:00
Tim Brooks	70507e1041	Move netty numDirectArenas to jvm.options (#46104 ) We currently configure io.netty.allocator.numDirectArenas to be 0 in the jvm erconomics class. This is a config that we always want to set, so it makes sense to move it to jvm.options.	2019-08-28 19:30:55 -06:00
Zachary Tong	943a016bb2	Add Cumulative Cardinality agg (and Data Science plugin) (#45990 ) This adds a pipeline aggregation that calculates the cumulative cardinality of a field. It does this by iteratively merging in the HLL sketch from consecutive buckets and emitting the cardinality up to that point. This is useful for things like finding the total "new" users that have visited a website (as opposed to "repeat" visitors). This is a Basic+ aggregation and adds a new Data Science plugin to house it and future advanced analytics/data science aggregations.	2019-08-26 16:19:55 -04:00
Jason Tedor	243f054b0b	Remove redundant Java check from Sys V init (#45793 ) In the Sys V init scripts, we check for Java. This is not needed, since the same check happens in elasticsearch-env when starting up. Having this duplicate check has bitten us in the past, where we made a change to the logic in elasticsearch-env, but missed updating it here. Since there is no need for this duplicate check, we remove it from the Sys V init scripts.	2019-08-22 22:21:53 -04:00
William Brafford	2b549e7342	CLI tools: write errors to stderr instead of stdout (#45586 ) Most of our CLI tools use the Terminal class, which previously did not provide methods for writing to standard output. When all output goes to standard out, there are two basic problems. First, errors and warnings are "swallowed" in pipelines, making it hard for a user to know when something's gone wrong. Second, errors and warnings are intermingled with legitimate output, making it difficult to pass the results of interactive scripts to other tools. This commit adds a second set of print commands to Terminal for printing to standard error, with errorPrint corresponding to print and errorPrintln corresponding to println. This leaves it to developers to decide which output should go where. It also adjusts existing commands to send errors and warnings to stderr. Usage is printed to standard output when it's correctly requested (e.g., bin/elasticsearch-keystore --help) but goes to standard error when a command is invoked incorrectly (e.g. bin/elasticsearch-keystore list-with-a-typo \| sort).	2019-08-21 14:46:07 -04:00
Alpar Torok	c6b30b8883	Add input and outut tracking of built bwc versions (#45694 ) * Add input and outut tracking of built bwc versions This PR adds tracking of the bwc versions git has as input and all the expected files as output. The effect is that `gradlew` is not called at all when the git has doesn't change and the version was allready built. Previusly gradlew would be called for the bwc version and it would have to configure the project and go trough up to date checks to figure out that nothing changed. This helps when working on bwc tests locally needing to run the test multiple times. This should also help in CI not re-build bwc versions across different runs. * Enable caching of bwc builds	2019-08-20 10:05:33 +03:00
Mark Vieira	529946aa15	Rename system property to change bwc checkout behavior (#45574 )	2019-08-16 08:54:04 -07:00
Jason Tedor	ec4182590f	Use bundled JDK in Sys V init (#45593 ) This commit addresses an issue when trying to using Elasticsearch on systems with Sys V init and the bundled JDK was not being used. Instead, we were still inadvertently trying to fallback on the path. This commit removes that fallback as that is against our intentions for 7.x where we only support the bundled JDK or an explicit JDK via JAVA_HOME.	2019-08-15 16:15:17 -04:00
Ryan Ernst	1794718e8e	Make git revision loading lazy (#45358 ) This commit makes the gitRevision property a lazy loaded value by returning an Object implementing toString(). The Dockerfile template is also changed to use groovy templates instead of the mavenfilter hack, so converting to String will not happen until runtime.	2019-08-08 17:08:07 -07:00
Tim Brooks	af908efa41	Disable netty direct buffer pooling by default (#44837 ) Elasticsearch does not grant Netty reflection access to get Unsafe. The only mechanism that currently exists to free direct buffers in a timely manner is to use Unsafe. This leads to the occasional scenario, under heavy network load, that direct byte buffers can slowly build up without being freed. This commit disables Netty direct buffer pooling and moves to a strategy of using a single thread-local direct buffer for interfacing with sockets. This will reduce the memory usage from networking. Elasticsearch currently derives very little value from direct buffer usage (TLS, compression, Lucene, Elasticsearch handling, etc all use heap bytes). So this seems like the correct trade-off until that changes.	2019-08-08 15:10:31 -06:00
Alpar Torok	0ea00e4861	Change how we pick bwc versions to check out (#45189 ) Prior to this PR we always checked out the latest bwc branches and had an external mechanism to store the bwc versions used for every CI run so we could both reproduce those builds and run additional tests using the same combination. This adds complexities in setting up and maintaining CI and makes it difficult to set up multi jobs. This change replaces that mechanism with a time based approach that looks at the commit date of the current revision and picks the newest on the bwc branch that's still older than that. It also makes sure there are no merge commits in this interval. This new behavior will is ment to be enabled in CI only, for everything except PR checks that will still use last available bwc revision.	2019-08-07 16:44:38 +03:00
Jason Tedor	9a142ff25c	Introduce formal node ML role (#45174 ) This commit builds on the ability for plugins to introduce new roles to add a formal node ML role.	2019-08-06 13:00:05 -04:00
Mark Vieira	bb7f46da62	Avoid building docker images when running precommit task (#45211 )	2019-08-06 09:01:06 -07:00
Jason Tedor	5b1b146099	Normalize environment paths (#45179 ) This commit applies a normalization process to environment paths, both in how they are stored internally, also their settings values. This normalization is done via two means: - we make the paths absolute - we remove redundant name elements from the path (what Java calls "normalization") This change ensures that when we compare and refer to these paths within the system, we are using a common ground. For example, prior to the change if the data path was relative, we would not compare it correctly to paths from disk usage. This is because the paths in disk usage were being made absolute.	2019-08-06 06:04:30 -04:00
Jason Tedor	872ae4d6c2	Add OCI annotations and adjust existing annotations (#45167 ) The org.label-schema labels on Docker images have been superseded by pre-defined OCI annotations. However, there is still a lot of tooling in use that relies on the org.label-schema, so we do not want to drop them. This commit adds values for the org.opencontainers.image pre-defined annotation keys. Additionally, we correct an issue with the label used to represent the license, to use the org.label-schema.license label. While this label was never accepted into the org.label-schema specfication (because this specification was superseded, it's not that it was explicitly rejected) there are containers out there using this label. In particular, our base image is and so we need to override otherwise we inherit, and end up mis-reporting the license.	2019-08-04 13:52:13 -04:00
Jason Tedor	659ebf6cfb	Notify systemd when Elasticsearch is ready (#44673 ) Today our systemd service defaults to a service type of simple. This means that systemd assumes Elasticsearch is ready as soon as the ExecStart (bin/elasticsearch) process is forked off. This means that the service appears ready long before it actually is, so before it is ready to receive requests. It also means that services that want to depend on Elasticsearch being ready to start can not as there is not a reliable mechanism to determine this. This commit changes the service type to notify. This requires that Elasticsearch sends a notification message via libsystemd sd_notify method. This commit does that by using JNA to invoke this native method. Additionally, we use this integration to also notify systemd when we are stopping.	2019-07-24 14:04:36 +09:00
Ioannis Kakavas	4dd9238cc0	Mute testPooledMemoryChoiceOnNotSmallHeap	2019-07-23 13:16:22 +03:00
Jason Tedor	f5b2fd2f1a	Fix imports in JvmErgonomicsTests.java	2019-07-22 22:33:41 +09:00
Jason Tedor	5c0ebe7b5f	Reenable JvmErgonomicsTests on Windows	2019-07-22 22:26:41 +09:00
Przemyslaw Gomulka	09e9c4cb59	Fix types field in JSON Search Slow Logs (#44641 ) The field has to be defined in log4j2.properties and should be an escaped JSON for now (it is a broken JSON at the moment). This should later be refactored into a JSON array of strings.	2019-07-22 12:02:20 +02:00
Alpar Torok	b34ac66d96	Mute multiple tests on Windows (7.x) (#44676 ) * Mute failing test tracked in #44552 * mute EvilSecurityTests tracking in #44558 * Fix line endings in ESJsonLayoutTests * Mute failing ForecastIT test on windows Tracking in #44609 * mute BasicRenormalizationIT.testDefaultRenormalization tracked in #44613 * fix mute testDefaultRenormalization * Increase busyWait timeout windows is slow * Mute failure unconfigured node name * mute x-pack internal cluster test windows tracking #44610 * Mute JvmErgonomicsTests on windows Tracking #44669 * mute SharedClusterSnapshotRestoreIT testParallelRestoreOperationsFromSingleSnapshot Tracking #44671 * Mute NodeTests on Windows Tracking #44256	2019-07-22 11:32:29 +03:00
Ryan Ernst	226a753e93	Restore setting up temp dir for windows service (#44541 ) (#44661 ) In https://github.com/elastic/elasticsearch/pull/41913 setting up the temp dir for ES was moved from the env script to individual cli scripts. However, moving it to the windows service cli was missed. This commit restores setting up the temp dir for the windows service control script.	2019-07-21 13:54:46 -07:00
Jason Tedor	cdd06d40d2	Do not checksum all bytes at once in plugin install (#44649 ) Today when checksumming a plugin zip during plugin install, we read all of the bytes of the zip into memory at once. When trying to run the plugin installer on a small heap (say, 64 MiB), this can lead to the plugin installer running out of memory when checksumming large plugins. This commit addresses this by reading the plugin bytes in 8 KiB chunks, thus using a constant amount of memory independent of the size of the plugin.	2019-07-21 07:24:23 +09:00
Jason Tedor	1f7fc1b497	Add default CLI JVM options (#44545 ) This commit adds some default CLI JVM options to control the heap size and the garbage collector used for the CLI tools. We do this because otherwise the JVM will default to large initial and max heap sizes based on the RAM visible to the JVM (which could be all the physical RAM on the machine if not run in a container-aware JVM). This commit therefore sets the initial heap size to 4m, the max heap size to 64m, the garbage collector to the serial collector, and leaves this user-configurable by honoring ES_JAVA_OPTS last.	2019-07-20 09:30:13 +09:00
Przemyslaw Gomulka	e23ecc5838	JSON logging refactoring and X-Opaque-ID support backport(#41354 ) (#44178 ) This is a refactor to current JSON logging to make it more open for extensions and support for custom ES log messages used inDeprecationLogger IndexingSlowLog , SearchSLowLog We want to include x-opaque-id in deprecation logs. The easiest way to have this as an additional JSON field instead of part of the message is to create a custom DeprecatedMessage (extends ESLogMEssage) These messages are regular log4j messages with a text, but also carry a map of fields which can then populate the log pattern. The logic for this lives in ESJsonLayout and ESMessageFieldConverter. Similar approach can be used to refactor IndexingSlowLog and SearchSlowLog JSON logs to contain fields previously only present as escaped JSON string in a message field. closes #41350 backport #41354	2019-07-12 16:53:27 +02:00
Ioannis Kakavas	475752be75	Make plugin verification FIPS 140 compliant (#44266 ) This change makes the process of verifying the signature of official plugins FIPS 140 compliant by defaulting to use the BouncyCastle FIPS provider and adding a dependency to bcpg-fips that implement parts of openPGP in a FIPS compliant manner. In already FIPS 140 enabled environments that use the BouncyCastle FIPS provider, the bcfips dependency is redundant but doesn't cause an issue as it will be added only in the classpath of the cli-tools This is a backport of #44224	2019-07-12 14:34:15 +03:00
Alpar Torok	7ba18732f7	Run some REST tests against a cluster running in docker containers (#39515 ) * Run REST tests against a cluster running on docker Closes #38053	2019-07-11 15:28:33 +03:00
Alpar Torok	d1a4d8866d	Add missing dependencies so we can build in parallel (#43672 )	2019-06-28 16:41:18 +03:00
Chris Koehnke	173338ad37	Fix dockerfile for non-local builds (#43591 ) Use the `source_elasticsearch` variable to conditionally get the command needed for release builds for the [dockerfiles repository][0]. Fixes https://github.com/elastic/elasticsearch/issues/43590 [0]: https://github.com/elastic/dockerfiles	2019-06-25 14:03:48 -04:00
Ryan Ernst	eb01208672	Fix the bundled jdk flag to be passed through windows startup (#43502 ) This commit fixes a typo in elasticsearch.bat that prevented the windows distribution from knowing whether it is using the bundled jdk.	2019-06-23 23:26:13 -07:00
Vincent Boulaye	209a493b27	convert EmptyDirTask.groovy to .java (#34672 )	2019-06-13 12:21:23 +03:00
Jay Modi	f150443d9a	Default distro run creates elastic-admin user (#43004 ) When using gradle run by itself, this uses the default distro with a basic license and enables security. There is a setup command to create a elastic-admin user but only when the license is a trial license. Now that security is available with the basic license, we should always run this command when using the default distribution.	2019-06-10 11:49:52 -06:00
Alpar Torok	9def454ea9	Clean up configuration when docker isn't available (#42745 ) We initially added `requireDocker` for a way for tasks to say that they absolutely must have it, like the build docker image tasks. Projects using the test fixtures plugin are not in this both, as the intent with these is that they will be skipped if docker and docker-compose is not available. Before this change we were lenient, the docker image build would succeed but produce nothing. The implementation was also confusing as it was not immediately obvious this was the case due to all the indirection in the code. The reason we have this leniency is that when we added the docker image build, docker was a fairly new requirement for us, and we didn't have it deployed in CI widely enough nor had CI configured to prefer workers with docker when possible. We are in a much better position now. The other reason was other stack teams running `./gradlew assemble` in their respective CI and the possibility of breaking them if docker is not installed. We have been advocating for building specific distros for some time now and I will also send out an additional notice The PR also removes the use of `requireDocker` from tests that actually use test fixtures and are ok without it, and fixes a bug in test fixtures that would cause incorrect configuration and allow some tasks to run when docker was not available and they shouldn't have. Closes #42680 and #42829 see also #42719	2019-06-10 13:44:15 +03:00
Mark Vieira	84eab4eba1	Omit JDK sources archive from bundled JDK (#42821 ) (cherry picked from commit 71d1454fe5ecc222801731a5f0e0e1053dc8997e)	2019-06-05 10:09:25 -07:00
Przemyslaw Gomulka	cfdb1b771e	Enable console audit logs for docker backport#42671 #42887 Enable audit logs in docker by creating console appenders for audit loggers. also rename field @timestamp to timestamp and add field type with value audit The docker build contains now two log4j configuration for oss or default versions. The build now allows override the default configuration. Also changed the format of a timestamp from ISO8601 to include time zone as per this discussion #36833 (comment) closes #42666 backport#42671	2019-06-05 17:15:37 +02:00

1 2 3 4 5 ...

772 Commits