OpenSearch

Commit Graph

Author	SHA1	Message	Date
Zachary Tong	99e313695f	Reuse CompensatedSum object in agg collect loops (#49548 ) The new CompensatedSum is a nice DRY refactor, but had the unanticipated side effect of creating a lot of object allocation in the aggregation hot collection loop: one object per visited document, per aggregator. In some places it created two per-doc-per-agg (weighted avg, geo centroids, etc) since there were multiple compensations being maintained. This PR moves the object creation out of the hot loop so that it is now created once per segment, and resets the internal state each time through the loop	2019-11-25 16:46:48 -05:00
James Rodewig	2fd58bb845	[DOCS] Add missing "_type" to delimited payload token filter docs	2019-11-25 16:16:05 -05:00
Lisa Cawley	26beb486c7	[DOCS] Fixes security links (#49563 )	2019-11-25 13:02:26 -08:00
James Rodewig	c40449ac22	[DOCS] Reformat delimited payload token filter docs (#49380 ) * Adds a title abbreviation * Relocates the older name deprecation warning * Updates the description and adds a Lucene link * Adds a note to explain payloads and how to store them * Adds analyze and custom analyzer snippets * Adds a 'Return stored payloads' example	2019-11-25 15:40:05 -05:00
James Rodewig	99476db2d0	[DOCS] Remove individual task retrieval from cat/tasks API (#49550 )	2019-11-25 10:32:39 -05:00
Benjamin Trent	688c78c589	[ML] Stop timing stats failure propagation (#49495 ) (#49501 )	2019-11-25 10:09:30 -05:00
Kelly Campbell	df5afa797e	[DOCS] Correct GET path in cat tasks API docs (#49494 ) Previously, the request example included `GET _cat/_tasks`. However, the resource should be `tasks`, not `_tasks`.	2019-11-25 09:37:59 -05:00
David Roberts	62811c2272	[ML] Add default categorization analyzer definition to ML info (#49545 ) The categorization job wizard in the ML UI will use this information when showing the effect of the chosen categorization analyzer on a sample of input.	2019-11-25 13:39:16 +00:00
Dimitris Athanasiou	d21df9eba9	[ML][DOCS] Anomaly detection job retention days settings do not require restart (#49546 )	2019-11-25 14:19:10 +01:00
Dimitris Athanasiou	aca38f6882	[7.x][ML] DFA jobs should accept excluding an unsupported field (#49535 ) (#49544 ) Before this change excluding an unsupported field resulted in an error message that explained the excluded field could not be detected as if it doesn't exist. This error message is confusing. This commit changes this so that there is no error in this scenario. When excluding a field that does exist but has automatically been excluded from the analysis there is no harm (unlike excluding a missing field which could be a typo). Backport of #49535	2019-11-25 15:13:00 +02:00
Daniel Mitterdorfer	8c374014ae	Add note about Gradle wrapper on Windows (#49528 ) With this commit we add a clarifying note in the contribution guidelines that our examples show the usage on Unix and also explain how to invoke the Gradle wrapper script on Windows. Closes #49521	2019-11-25 13:41:26 +01:00
Armin Braun	af0f97d50a	Fix SLMSnapshotBlockingIntegTests.testSnapshotInProgress (#49533 ) (#49542 ) This test must check for state `SUCCESS` as well. `SUCCESS` in `SnapshotsInProgress` means "all data nodes finished snapshotting successfully but master must still finalize the snapshot in the repo". `SUCCESS` does not mean that the snapshot is actually fully finished in this object. You can easily reproduce the scenario in #49303 that has an in-progress snapshot in `SUCCESS` state by waiting 20s before running the busy assert loop on the snapshot status so that all steps but the blocked finalization can finish. Closes #49303	2019-11-25 13:31:45 +01:00
Armin Braun	2502ff39a0	Enhance SnapshotResiliencyTests (#49514 ) (#49541 ) A few enhancements to `SnapshotResiliencyTests`: 1. Test running requests from random nodes in more spots to enhance coverage (this is particularly motivated by #49060 where the additional number of cluster state updates makes it more interesting to fully cover all kinds of network failures) 2. Fix issue with restarting only master node in one test (doing so breaks the test at an incredibly low frequency, that becomes not so low in #49060 with the additional cluster state updates between request and response) 3. Improved cluster formation checks (now properly checks the term as well when forming cluster) + makes sure all nodes are connected to all other nodes (previously the data nodes would at times not be connected to other data nodes, which was shaken out now by adding the `client()` method) 4. Make sure the cluster left behind by the test makes sense by running the repo cleanup action on it (this also increases coverage of the repository cleanup action obviously and adds the basis of making it part of more resiliency tests)	2019-11-25 13:31:28 +01:00
Dimitris Athanasiou	c149c64dc4	[7.x][ML] Apply source query on data frame analytics memory estimation (#49517 ) (#49532 ) Closes #49454 Backport of #49517	2019-11-25 12:51:57 +02:00
Armin Braun	a5fa86ed97	Improve Stability of Mock APIs (#49518 ) (#49524 ) This commit ensures that even for requests that are known to be empty body we at least attempt to read one byte from the request body input stream. This is done to work around the behavior in `sun.net.httpserver.ServerImpl.Dispatcher#handleEvent` that will close a TCP/HTTP connection that does not have the `eof` flag (see `sun.net.httpserver.LeftOverInputStream#isEOF`) set on its input stream. As far as I can tell the only way to set this flag is to do a read when there's no more bytes buffered. This fixes the numerous connection closing issues because the `ServerImpl` stops closing connections that it thinks weren't fully drained. Also, I removed a now redundant drain loop in the Azure handler as well as removed the connection closing in the error handler's drain action (this shouldn't have an effect but makes things more predictable/easier to reason about IMO). I would suggest merging this and closing related issue after verifying that this fixes things on CI. The way to locally reproduce the issues we're seeing in tests is to make the retry timings more aggressive in e.g. the azure tests and move them to single digit values. This makes the retries happen quickly enough that they run into the async connection closing of allegedly non-eof connections by `ServerImpl` and produces the exact kinds of failures we're seeing currently. Relates #49401, #49429	2019-11-25 10:28:55 +01:00
Hendrik Muhs	5256756879	[Transform] add debug log for configuration index (#49484 ) add debug log for transform creation and disallow partial results for retrieval	2019-11-25 09:49:17 +01:00
Nhat Nguyen	8260cba629	Increase timeout while checking for no snapshotted commit (#49461 ) If some replica is performing a file-based recovery, then the check assertNoSnapshottedIndexCommit would fail. We should increase the timeout for this check so that we can wait until all recoveries are done or aborted. Closes #49403	2019-11-24 15:12:34 -05:00
Jared Tan	1d2bfd1af6	Include id to the error msg when it's too long (#49433 )	2019-11-24 13:08:26 -05:00
Mark Vieira	777f6d5da6	Fix extraction of notarized Elasticsearch release distribution (#49511 ) This commit introduces a workaround for an issue related to our recent notarization of distributions starting with the 6.8.5 release. An unintended side effect of notarization was that the file entries of the release tar all have a `./` prefix in the path. This causes a number of issues, not least of which is that our Gradle extract tasks end up copying an empty fileset to the destination directory. The workaround here is simply to remove the leading `./` path segment from each file when performing the extraction. For more details see this issue: https://github.com/elastic/elasticsearch/issues/49417	2019-11-22 17:19:47 -08:00
jesinity	c9eba17517	Fix HLRC parsing of CancelTasks response (#47017 ) Adds support for proper cancel tasks parsing. Closes #45414	2019-11-22 16:56:27 -06:00
debadair	2ec047db04	[DOCS] Rename auditing topic. Closes #49012 (#49013 ) * [DOCS] Rename auditing topic. Closes #49012 * Fixed file name, fixed settings link. * Add link to settings	2019-11-22 14:16:58 -08:00
James Rodewig	d06c71eb82	[DOCS] Fix edge n-gram tokenizer nav Adds a missing float tag to the edge n-gram tokenizer docs. This tag ensures the edge n-gram tokenizer docs display on the same page.	2019-11-22 15:54:07 -05:00
Dimitris Athanasiou	8eaee7cbdc	[7.x][ML] Explain data frame analytics API (#49455 ) (#49504 ) This commit replaces the _estimate_memory_usage API with a new API, the _explain API. The API consolidates information that is useful before creating a data frame analytics job. It includes: - memory estimation - field selection explanation Memory estimation is moved here from what was previously calculated in the _estimate_memory_usage API. Field selection is a new feature that explains to the user whether each available field was selected to be included or not in the analysis. In the case it was not included, it also explains the reason why. Backport of #49455	2019-11-22 22:06:10 +02:00
Jason Tedor	69f570ea5f	Adjust version on final pipeline serialization This commit adjusts the version final pipeline serialization after it was backported to the 7.5 branch.	2019-11-22 14:56:56 -05:00
Jay Modi	4fd5fb5297	Stop NodeTests from timing out in certain cases (#49202 ) (#49503 ) The NodeTests class contains tests that check behavior when shutting down a node. This involves starting a node, performing some operation, stopping the node, and then awaiting the close of the node. Part of closing a node is the termination of the node's ThreadPool. ThreadPool termination semantics can be deceiving. The ThreadPool#terminate method takes a timeout value and the first oddity is that the terminate method can take two times the timeout value before returning. Internally this method acts on the ExecutorService instances that are held by the ThreadPool. First, an orderly shutdown is attempted and pending tasks are allowed to execute while waiting for the timeout value. If any of the ExecutorService instances have not terminated, a call is made to attempt to stop all active tasks (usually using interrupts) and then waits for up to the timeout value a second time for the termination of the ExecutorService instances. This means that if we use a large value when waiting for a node to close, we may not attempt to interrupt any threads that are in a blocking call before the test times out. In order to avoid causing these tests to time out, this change reduces the timeout passed to Node#awaitClose to 10 seconds from 1 day. This will allow blocked threads to be interrupted before the test suite fails due to the timeout. Closes #44256 Closes #42350 Closes #44435	2019-11-22 12:41:52 -07:00
Jason Tedor	71bcfbf1e3	Replace required pipeline with final pipeline (#49470 ) This commit enhances the required pipeline functionality by changing it so that default/request pipelines can also be executed, but the required pipeline is always executed last. This gives users the flexibility to execute their own indexing pipelines, but also ensure that any required pipelines are also executed. Since such pipelines are executed last, we change the name of required pipelines to final pipelines.	2019-11-22 14:37:36 -05:00
Jay Modi	1431c2b408	Run build-tools test with Gradle jdk (#49459 ) (#49497 ) The test task is configured to use the runtime java version, but there are issues with the version of groovy used by gradle pre 6.0. In order to work around this, we use the Gradle JDK to execute the build-tools tests. Closes #49404 Closes #49253	2019-11-22 11:59:46 -07:00
Marios Trivyzas	0c4491964b	SQL: Fix issue with folding of CASE/IIF (#49449 ) Add extra checks to prevent ConstantFolding rule to try to fold the CASE/IIF functions early before the SimplifyCase rule gets applied. Fixes: #49387 (cherry picked from commit f35c9725350e35985d8dd3001870084e1784a5ca)	2019-11-22 18:29:49 +01:00
Henning Andersen	49bb5fb642	Netty4: switch to composite cumulator (#49478 ) The default merge cumulator used in netty transport leads to additional GC pressure and memory copying when a message that exceeds the chunk size is handled. This is especially a problem on G1 GC, since we get many "humongous" allocations and that can in theory cause real memory circuit breaker to break unnecessarily.	2019-11-22 18:14:10 +01:00
Lisa Cawley	ca895d3ad5	[DOCS] Merge rollup config details into API (#49412 )	2019-11-22 08:39:49 -08:00
Armin Braun	97c7ea60b9	Add Missing Nullable Assertions in SnapshotsService (#49465 ) (#49492 ) Just realized we were missing some annotations here which was somewhat confusing since other methods/parameters have the `Nullable` annotation wherever a `null` can be passed.	2019-11-22 17:27:27 +01:00
James Rodewig	562607d3f5	[DOCS] Reformat n-gram token filter docs (#49438 ) Reformats the edge n-gram and n-gram token filter docs. Changes include: * Adds title abbreviations * Updates the descriptions and adds Lucene links * Reformats parameter definitions * Adds analyze and custom analyzer snippets * Adds notes explaining differences between the edge n-gram and n-gram filters Additional changes: * Switches titles to use "n-gram" throughout. * Fixes a typo in the edge n-gram tokenizer docs * Adds an explicit anchor for the `index.max_ngram_diff` setting	2019-11-22 10:38:50 -05:00
Rory Hunter	4fae2bb3b1	Don't close stderr under `--quiet` (#49431 ) Backport of #47208. Closes #46900. When running ES with `--quiet`, if ES then exits abnormally, a user has to go hunting in the logs for the error. Instead, never close System.err, and print more information to it if ES encounters a fatal error e.g. config validation, or some fatal runtime exception. This is useful when running under e.g. systemd, since the error will go into the journal. Note that stderr is still closed in daemon (`-d`) mode.	2019-11-22 14:58:17 +00:00
Benjamin Trent	ed787d06e8	[7.x] [ML][Inference][HLRC] GET trained models (#49464 ) (#49488 ) * [ML][Inference][HLRC] GET trained models (#49464) * fixing for backport	2019-11-22 09:24:06 -05:00
Enrico Zimuel	12c2ca8895	Fix missing slash in specification for indices.put_mapping	2019-11-22 15:06:47 +01:00
István Zoltán Szabó	56d97dcb6c	[DOCS] Replaces deprecated ScriptService.ScriptType.INLINE with supported script in Java update docs. (#49424 )	2019-11-22 14:17:44 +01:00
Benjamin Trent	276b6c67f4	[ML][Inference] Fixing pre-processor value handling and size estimate (#49270 ) (#49489 ) * [ML][Inference] Fixing pre-processor value handling and size estimate * fixing npe	2019-11-22 08:14:33 -05:00
István Zoltán Szabó	35cc0e0948	[DOCS] Removes the default size definition of thread pool types (#49442 ) Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-11-22 11:20:11 +01:00
Florian Kelbert	d444c334d7	Modify example for pinned query (#49481 ) I do not see any reason to advertise phones of specific companies.	2019-11-22 11:03:04 +01:00
Jim Ferenczi	ed4eecc00e	Pre-sort shards based on the max/min value of the primary sort field (#49092 ) This change automatically pre-sorts search shards on search requests that use a primary sort based on the value of a field. When possible, the can_match phase will extract the min/max (depending on the provided sort order) values of each shard and use it to pre-sort the shards prior to running the subsequent phases. This feature can be useful to ensure that shards that contain recent data are executed first so that intermediate merges have more chance to contain contiguous data (think of date_histogram for instance) but it could also be used in a follow up to early terminate sorted top-hits queries that don't require the total hit count. The latter could significantly speed up the retrieval of the most/least recent documents from time-based indices. Relates #49091	2019-11-22 11:02:12 +01:00
István Zoltán Szabó	c13fce60a8	[DOCS] Removes data frame leftovers from transforms overview (#49434 )	2019-11-22 10:20:15 +01:00
Przemyslaw Gomulka	d42eac9cf3	[DOC] Modify the update example to change a document (#49228 ) (#49443 ) Example at the moment is not changing the existing document. Update request should at least modify the existing document.	2019-11-22 09:54:34 +01:00
Hendrik Muhs	1fbb248cb7	reenable warning checks in pivot tests (#49436 )	2019-11-22 08:50:10 +01:00
Martijn van Groningen	2243743450	Update geolite2 database in ingest geoip plugin. (#49308 ) Some tests were tweaked to deal with the updated database files.	2019-11-22 08:38:57 +01:00
Mark Vieira	c239a6a493	Fix build failure when attempting to export UBI Docker image (#49472 )	2019-11-21 17:44:21 -08:00
Tim Vernum	2e5f2dd1e1	Deprecate misconfigured SSL server config (#49280 ) This commit adds a deprecation warning when starting a node where either of the server contexts (xpack.security.transport.ssl and xpack.security.http.ssl) meet either of these conditions: 1. The server lacks a certificate/key pair (i.e. neither ssl.keystore.path nor ssl.certificate are configured) 2. The server has some ssl configuration, but ssl.enabled is not specified. This new validation does not care whether ssl.enabled is true or false (though other validation might), it simply makes it an error to configure server SSL without being explicit about whether to enable that configuration. Backport of: #45892	2019-11-22 12:14:55 +11:00
Benjamin Trent	a7477ad7c3	[7.x] [ML][Inference] compressing model definition and lazy parsing (#49269 ) (#49446 ) * [ML][Inference] compressing model definition and lazy parsing (#49269) * [ML][Inference] compressing model definition and lazy parsing * addressing PR comments * adding commons io * implementing simplified bounded stream * adjusting for type inclusion	2019-11-21 15:32:32 -05:00
Igor Motov	e8971ff367	Geo: Fix handling of circles in legacy geo_shape queries (#49410 ) Brings back support for circles in legacy geo_shape queries that was accidentally lost during query refactoring. Fixes #49296	2019-11-21 14:03:31 -05:00
Armin Braun	231d079bf8	Fix Azure Mock Issues (#49377 ) (#49381 ) Fixing a few small issues found in this code: 1. We weren't reading the request headers but the response headers when checking for blob existence in the mocked single upload path 2. Error code can never be `null` removed the dead code that resulted 3. In the logging wrapper we weren't checking for `Throwable` so any failing assertions in the http mock would not show up since they run on a thread managed by the mock http server	2019-11-21 19:57:50 +01:00
James Rodewig	0fa3b887b7	[DOCS] Document several missing thread pools (#48543 ) Adds documentation for the following thread pools: - fetch_shard_started - fetch_shard_store - flush - force_merge - management Closes #48524 Co-Authored-By: Jay Modi <jaymode@users.noreply.github.com>	2019-11-21 13:12:56 -05:00
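
Commit 99e313695f above describes moving a compensated-sum object out of the per-document collect loop and resetting its state on each iteration instead of allocating a new one. The following is a minimal sketch of that general pattern, not the actual `CompensatedSum` class or aggregator code. The class name `ReusableKahanSum`, its methods, and the `collect` helper are hypothetical names chosen for illustration, and plain Kahan summation is assumed as the compensation scheme.

```java
/**
 * Hypothetical illustration of a reusable Kahan (compensated) accumulator.
 * This is NOT the CompensatedSum class referenced by the commit; all names
 * here are assumptions made for the example.
 */
final class ReusableKahanSum {
    private double sum;
    private double compensation;

    /** Overwrites the internal state so the same object can be reused. */
    void reset(double sum, double compensation) {
        this.sum = sum;
        this.compensation = compensation;
    }

    /** Adds one value using Kahan summation. */
    void add(double value) {
        double corrected = value - compensation;
        double updated = sum + corrected;
        compensation = (updated - sum) - corrected;
        sum = updated;
    }

    double value() {
        return sum;
    }

    double compensation() {
        return compensation;
    }

    /**
     * Sums values per bucket. The accumulator is allocated once, before the
     * loop, and reset from the bucket's stored state for every document,
     * rather than being created once per document.
     */
    static double[] collect(int[] buckets, double[] values, int bucketCount) {
        double[] sums = new double[bucketCount];
        double[] compensations = new double[bucketCount];
        ReusableKahanSum kahan = new ReusableKahanSum(); // single allocation
        for (int doc = 0; doc < values.length; doc++) {
            int b = buckets[doc];
            kahan.reset(sums[b], compensations[b]); // load this bucket's state
            kahan.add(values[doc]);
            sums[b] = kahan.value();                // store the state back
            compensations[b] = kahan.compensation();
        }
        return sums;
    }

    public static void main(String[] args) {
        int[] buckets = {0, 1, 0, 1};
        double[] values = {1.0, 2.0, 3.0, 4.0};
        double[] sums = collect(buckets, values, 2);
        System.out.println(sums[0] + " " + sums[1]);
    }
}
```

In this sketch the per-bucket state lives in two primitive arrays, so the only object allocated is the single accumulator, regardless of how many documents are visited.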

49121 Commits