OpenSearch

Commit Graph

Author	SHA1	Message	Date
Simon Willnauer	b116221540	Ensure shard is refreshed once it's inactive (#27559 ) Once a shard goes inactive we want the shard to be refreshed if the refresh interval is default since we might hold on to unnecessary segments and in the inactive case we stopped indexing and can release old segments. Relates to #27500	2017-11-30 19:04:05 +01:00
Mayya Sharipova	c6b73239ae	Limit the number of tokens produced by _analyze (#27529 ) Add an index level setting `index.analyze.max_token_count` to control the number of generated tokens in the _analyze endpoint. Defaults to 10000. Throw an error if the number of generated tokens exceeds this limit. Closes #27038	2017-11-30 11:54:39 -05:00
David Turner	92a24de509	Add more logging to testCorruptedShards to help investigate sporadic failures	2017-11-30 16:34:23 +00:00
David Turner	1f89e9d94e	Reinstate AwaitsFix This reverts commit `29c5540323`.	2017-11-30 13:01:22 +00:00
olcbean	d25c9671de	Deprecate `jarowinkler` in favor of `jaro_winkler` (#27526 ) Jaro and Winkler are two people, so we should use the same naming convention as for Damerau–Levenshtein.	2017-11-30 12:49:34 +00:00
Tanguy Leroux	41f73e0acf	Fix version for include_global_state in Snapshot Status API It also adds a Rest test. Related #26853	2017-11-30 11:33:01 +01:00
kel	efac982e35	Include include_global_state in Snapshot status API (#26853 ) This commit adds a field include_global_state to snapshot status api response. For legacy snapshot, the field is not present. Closes #22423	2017-11-30 10:38:07 +01:00
Tanguy Leroux	192d1f03f8	Do not swallow exception in ChecksumBlobStoreFormat.writeAtomic() (#27597 ) The ChecksumBlobStoreFormat.writeAtomic() method writes a blob using a temporary name and then moves the blob to its final name. The move operation can fail and in this case the temporary blob is deleted. If this delete operation also fails, then the initial exception is lost. This commit ensures that when something goes wrong during the move operation the initial exception is kept and thrown, and if the delete operation also fails then this additional exception is added as a suppressed exception to the initial one.	2017-11-30 10:09:49 +01:00
Yannick Welsch	2738c783e5	Fix Gradle >=4.2 compatibility (#27591 ) Gradle 4.2 introduced a feature for safer handling of stale output files. Unfortunately, due to the way some of our tasks are written, this broke execution of our REST tests tasks. The reason for this is that the extract task (which extracts the ES distribution) would clean its output directory, and thereby also remove the empty cwd subdirectory which was created by the clean task. The reason why Gradle would remove the directory is that the earlier running clean task would programmatically create the empty cwd directory, but not make Gradle aware of this fact, which would result in Gradle believing that it could just safely clean the directory. This commit explicitly defines a task to create the cwd subdirectory, and marks the directory as output of the task, so that the subsequent extract task won't eagerly clean it. It thereby restores full compatibility of the build with Gradle 4.2 and above. This commit also removes the @Input annotation on the waitCondition closure of AntFixture, which conflicted with the extended input/output validation of Gradle 4.3.	2017-11-30 10:03:16 +01:00
Philipp Krenn	64ca0fe9bb	Update docs regarding SHA-512 checksums This commit updates the docs for the new SHA-512 checksums that are supported for official plugins. Relates #27524	2017-11-29 21:29:06 -05:00
Jason Tedor	6655689b15	Move DNS cache docs to system configuration docs When these docs were moved they should have been moved to the system configuration docs. This commit does that, and also fixes a missing heading that broke the docs build.	2017-11-29 19:57:26 -05:00
Jason Tedor	ff3c19ed13	Move DNS cache settings to important configuration This commit moves the DNS cache settings for the JVM to the important settings section of the docs. Relates #27592	2017-11-29 18:02:26 -05:00
Tim Brooks	b8557651aa	Add exception handling for write listeners (#27590 ) This potential issue was exposed when I saw this PR #27542. Essentially we currently execute the write listeners all over the place without consistently catching and handling exceptions. Some of these exceptions will be logged in different ways (including as low as `debug`). This commit adds a single location where these listeners are executed. If the listener throws an execption, the exception is caught and logged at the `warn` level.	2017-11-29 15:47:12 -07:00
Jason Tedor	55cb8ddd80	Do not set data paths on no local storage required Today when configuring the data paths for the environment, we set data paths to either the specified path.data or default to data relative to the Elasticsearch home. Yet if node.local_storage is false, data paths do not even make sense. In this case, we should reject if path.data is set, and instead of defaulting data paths to data relative to home, we should set this to empty paths. This commit does this. Relates #27587	2017-11-29 17:35:00 -05:00
David Turner	29c5540323	Remove AwaitsFix	2017-11-29 18:12:18 +00:00
Martijn van Groningen	dbf17152d1	docs: use `doc_value_fields` fields as alternative for nested inner hits _source fetching instead of stored fields as doc values are more likely to be enabled by default	2017-11-29 17:31:39 +01:00
Clinton Gormley	65e602c2be	Update index-modules.asciidoc Docs: Clarified `blocks.write` vs `blocks.read_only`	2017-11-29 13:05:12 +01:00
Christoph Büscher	0d11b9fe34	[Docs] Unify spelling of Elasticsearch (#27567 ) Removes occurences of "elasticsearch" or "ElasticSearch" in favour of "Elasticsearch" where appropriate.	2017-11-29 09:44:25 +01:00
Tanguy Leroux	547f006118	Remove XContentType auto detection in BlobStoreRepository (#27480 )	2017-11-29 09:39:49 +01:00
Jason Tedor	d0cd18169e	Remove stale awaits fix on azure master nodes test This awaits fix has been there forever and no one seems to know what to do with this test. I say let CI churn on it because it passed for me three out of three times. If there is something wrong with it, we will know quickly and can then address with the new information that we have.	2017-11-28 22:43:34 -05:00
Kanako Nakai	23f85fe6d4	Fix max number of threads bootstrap docs Previously the bootstrap check for max number of threads was increased from 2048 to 4096 yet the docs were never adjusted for this change. This commit addresses this so the docs are in-line with the limit enforced in the bootstrap check. Relates #27511	2017-11-28 22:19:04 -05:00
Yu	57da9dc57f	Clarify setting IntelliJ preferences This commit clarifies that the preferences menu in IntelliJ can differ by OS (IntelliJ -> Preferences on macOS and Settings on Linux/Windows). Relates #27575	2017-11-28 19:46:24 -05:00
Jack Conradson	2d927fabab	Painless: Fix errors allowing void to be assigned to def. (#27460 )	2017-11-28 13:44:52 -08:00
Jack Conradson	9e42b77f7e	Painless: Fix variable scoping issue in lambdas not including captured variables. (#27571 )	2017-11-28 13:30:13 -08:00
Simon Willnauer	4aa840698f	Ensure threadcontext is preserved when refresh listeners are invoked (#27565 ) today a refresh listener won't preserve the entire context ie. won't carry on response headers etc. from the caller side. This change adds support for stored contexts.	2017-11-28 21:32:16 +01:00
Simon Willnauer	184b7f06ee	Make Segment statistics aware of segments hold by internal readers (#27558 ) Today we only expose the external readers segments. Yet, from a statistics perspective both internal and external segments are relevant. This commit exposes the additional segments of the internal and external reader respectively.	2017-11-28 17:37:03 +01:00
Jason Tedor	cefb46d0fc	Throw UOE from compressible bytes stream reset A compressible bytes output stream is a stream output which supports a reset method. However, compressible bytes output streams are unusual in that the current implementation sometimes supports a reset (if the stream is not compressed) and sometimes does not support a rest (if the stream is compressed). This inconsistent behavior is puzzling and instead we should simply always throw an unsupported operation exception. Relates #27564	2017-11-28 11:29:47 -05:00
Jim Ferenczi	37653c9dca	[TEST] AggregationsIntegrationIT#testScroll can timeout This change sets the scroll timeout for this test to 1m instead of 500ms in order to avoid loosing the scroll on slow machines. Relates #26378	2017-11-28 16:18:54 +01:00
Adrien Grand	d01fcee645	Fix illegal cast of the "low cardinality" optimization of the `terms` aggregation. (#27543 ) The GlobalOrdinalsStringTermsAggregator.LowCardinality aggregator casts global values to `GlobalOrdinalMapping`, even though the implementation of global values is different when a `missing` value is configured. This commit adds a new API that gives access to the ordinal remapping in order to fix this problem.	2017-11-28 14:55:09 +01:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
Martijn van Groningen	cb1204774b	Include the _index, _type and _id to nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know to what documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
David Turner	a165d1df40	Minor improvements to docs for numeric types (#27553 ) * Caps * Fix awkward wording that took multiple passes to parse * Floating point _number_ * Something more descriptive about the `scaled_float` scaling factor.	2017-11-28 11:36:07 +00:00
David Turner	7ac361d86e	Update @AwaitsFix URL to point at an issue in the current repo	2017-11-28 09:35:46 +00:00
Nhat Nguyen	000f62c1d2	TEST: makes sure to corrupt referenced tlog files (#27546 ) Method TruncateTranslogIT#corruptTranslogFiles corrupts some random existing *.tlog files in a translog directory. However, this may not actually corrupt translog at all if it corrupts only tlog files which are not referenced by the Checkpoint (eg. their translog generations are smaller the Checkpoint). This commit makes sure that we corrupt some tlog files which are referenced by the Checkpoint. Closes #27538	2017-11-27 20:18:58 -05:00
Simon Willnauer	0eb87e5d57	[TEST] Fix broken test that still tried to acquire the shards to set it non-idle	2017-11-27 22:52:34 +01:00
Jason Tedor	d8c28044da	Forbid granting the all permission in production Running with the all permission java.security.AllPermission granted is equivalent to disabling the security manager. This commit adds a bootstrap check that forbids running with this permission granted. Relates #27548	2017-11-27 16:05:27 -05:00
Jason Tedor	379d51fcfa	Bubble exceptions when closing compressible streams Compressible bytes output stream swallows exceptions that occur when closing. This commit changes this behavior so that such exceptions bubble up. Relates #27542	2017-11-27 13:48:04 -05:00
Simon Willnauer	f23ed6188d	Skip shard refreshes if shard is `search idle` (#27500 ) Today we refresh automatically in the background by default very second. This default behavior has a significant impact on indexing performance if the refreshes are not needed. This change introduces a notion of a shard being `search idle` which a shard transitions to after (default) `30s` without any access to an external searcher. Once a shard is search idle all scheduled refreshes will be skipped unless there are any refresh listeners registered. If a search happens on a `serach idle` shard the search request _park_ on a refresh listener and will be executed once the next scheduled refresh occurs. This will also turn the shard into the `non-idle` state immediately. This behavior is only applied if there is no explicit refresh interval set.	2017-11-27 18:16:10 +01:00
Nhat Nguyen	8d6bfe53bb	Remove workaround in translog rest test (#27530 ) Relates #25623 and `a6db0ea908`	2017-11-27 09:41:30 -05:00
Martijn van Groningen	3f98b85489	inner_hits: Return an empty _source for nested inner hit when filtering on a field that doesn't exist. Before this change the search request would fail with an error indicating that it couldn't detect xcontent type based on the string: `null`	2017-11-27 10:51:24 +01:00
Martijn van Groningen	4ab638b71d	percolator: Avoid TooManyClauses exception if number of terms / ranges is exactly equal to 1024 The logic whether to use CoveringQuery was in two places which is why this bug snug in.	2017-11-27 08:55:11 +01:00
Nhat Nguyen	a4b4e14186	Dedup translog operations by reading in reverse (#27268 ) Currently, translog operations are read and processed one by one. This may be a problem as stale operations in translogs may suddenly reappear in recoveries. To make sure that stale operations won't be processed, we read the translog files in a reverse order (eg. from the most recent file to the oldest file) and only process an operation if its sequence number was not seen before. Relates to #10708	2017-11-26 16:44:30 -05:00
Jason Tedor	0519fa223c	Ensure logging is configured for CLI commands Any CLI commands that depend on core Elasticsearch might touch classes (directly or indirectly) that depends on logging. If they do this and logging is not configured, Log4j will dump status error messages to the console. As such, we need to ensure that any such CLI command configures logging (with a trivial configuration that dumps log messages to the console). Previously we did this in the base CLI command but with the refactoring of this class out of core Elasticsearch, we no longer configure logging there (since we did not want this class to depend on settings and logging). However, this meant for some CLI commands (like the plugin CLI) we were no longer configuring logging. This commit adds base classes between the low-level command and multi-command classes that ensure that logging is configured. Any CLI command that depends on core Elasticsearch should use this infrastructure to ensure logging is configured. There is one exception to this: Elasticsearch itself because it takes reponsibility into its own hands for configuring logging from Elasticsearch settings and log4j2.properties. We preserve this special status. Relates #27523	2017-11-25 11:40:08 -05:00
Simon Willnauer	a29dc20c26	Ensure `doc_stats` are changing even if refresh is disabled (#27505 ) Today if refresh is disabled the doc stats are not updated anymore. In a bulk index scenario this might cause confusion since even if we refresh internal readers etc. doc stats are never advancing. This change cuts over to the internal reader that is refreshed outside of the external readers refresh interval but always equally `fresh` or `fresher` which will cause less confusion.	2017-11-25 14:24:16 +01:00
Jason Tedor	0b6448726c	Fix classes that can exit In a previous change, we locked down the classes that can exit by specifying explicit classes rather than packages than can exit. Alas, there was a bug in the sense that the class that we exit from in the case of an uncaught exception is not ElasticsearchUncaughtExceptionHandler but rather an anonymous nested class of ElasticsearchUncaughtExceptionHandler. To address this, we replace this anonymous class with a bonafide nested class ElasticsearchUncaughtExceptionHandler$PrivilegedHaltAction. Note that if we try to get this class name we have a $ in the middle of the string which is a special regular expression character; as such, we have to escape it. Relates #27518	2017-11-24 19:00:18 -05:00
Nhat Nguyen	e0e1a92d36	Revert "Adjust CombinedDeletionPolicy for multiple commits (#27456 )" The commit looks harmless, unfortunately it can break the engine flush scheduler and the translog rolling. Both `uncommittedOperations` and `uncommittedSizeInBytes` are currently calculated based on the minimum required generation for recovery rather than the translog generation of the last index commit. This is not correct if other index commits are reserved for snapshotting even though we are keeping the last index commit only. This reverts commit `e95d18ec23`.	2017-11-24 15:19:50 -05:00
David Turner	00867e618d	Transpose expected and actual, and remove duplicate info from message. (#27515 ) Previously: ``` > Throwable #1: java.lang.AssertionError: Expected all shards successful but got successful [8] total [9] > Expected: <8> > but: was <9> ``` Now: ``` > Throwable #1: java.lang.AssertionError: Expected all shards successful > Expected: <9> > but: was <8> ```	2017-11-24 17:45:34 +00:00
lcawley	af971b3081	[DOCS] Fixed broken link in breaking changes	2017-11-24 09:16:14 -08:00
Nhat Nguyen	06d35f4f01	Backport wait_for_initialiazing_shards to cluster health API Relates #27489	2017-11-24 09:56:16 -05:00
Simon Willnauer	17e9940fc1	Carry over version map size to prevent excessive resizing (#27516 ) Today we create a new concurrent hash map everytime we refresh the internal reader. Under defaults this isn't much of a deal but once the refresh interval is set to `-1` these maps grow quite large and it can have a significant impact on indexing throughput. Under low memory situations this can cause up to 2x slowdown. This change carries over the map size as the initial capacity wich will be auto-adjusted once indexing stops. Closes #20498	2017-11-24 14:57:31 +01:00

1 2 3 4 5 ...

29241 Commits All Branches Search

29241 Commits

All Branches