OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tim Brooks	ad8a571677	Add read timeouts to http module (#27713 ) We currently do not have any server-side read timeouts implemented in elasticsearch. This commit adds a read timeout setting that defaults to 30 seconds. If after 30 seconds a read has not occurred, the channel will be closed. A timeout of value of 0 will disable the timeout.	2017-12-08 09:32:09 -07:00
Jason Tedor	ec5e540174	Fix routing with leading or trailing whitespace The problem here is that splitting was using a method that intentionally trims whitespace (the method is really meant to be used for splitting parameters where whitespace should be trimmed like list settings). However, for routing values whitespace should not be trimmed because we allow routing with leading and trailing spaces. This commit switches the parsing of these routing values to a method that does not trim whitespace. Relates #27712	2017-12-08 11:23:24 -05:00
Simon Willnauer	8f104cc08c	[TEST] Now actually wait for merges Relates to #27651	2017-12-08 12:35:02 +01:00
Simon Willnauer	952c859f52	Test out of order delivery of append only index and retry with an intermediate delete	2017-12-08 12:28:27 +01:00
Christoph Büscher	816878bd4d	[Tests] Add test for GeoShapeFieldType#setStrategyName (#27703 )	2017-12-08 10:11:57 +01:00
Nhat Nguyen	6efee323e0	Remove unused Commit classes (#27714 ) These classes are not used anywhere.	2017-12-07 21:42:11 -05:00
Lee Hinman	cca54b811d	[TEST] Wait for merging to complete before testing breaker It's possible that a merge may be ongoing when we check the breaker and segment stats' memory usage, this causes the test to fail. Instead, we should wait for merging to complete. Resolves #27651	2017-12-07 11:57:22 -07:00
olcbean	bcc33f391f	Add Open Index API to the high level REST client (#27574 ) Add _open to the high level REST client Relates to #27205	2017-12-07 18:16:03 +01:00
Christoph Büscher	b83e14858a	Correcting some minor typos in comments	2017-12-07 16:39:23 +01:00
Yannick Welsch	5a53798f83	Add unreleased v5.6.6 version	2017-12-07 14:59:57 +01:00
Robin Neatherway	057efea893	Correct two equality checks on incomparable types (#27688 )	2017-12-07 14:18:11 +01:00
Yannick Welsch	69dd667f5e	Add unreleased v6.0.2 version	2017-12-07 11:54:22 +01:00
Catalin Ursachi	f823cea79c	Added Create Index support to high-level REST client (#27351 ) Relates to #27205	2017-12-07 11:39:59 +01:00
Yannick Welsch	0b102f6372	[TEST] Fix testOpenWaitingForActiveShardsFailed This test periodically fails if the nodes that apply the cluster state fail to ack the change within 100ms. This commit changes the checks on the test so that it still checks that the open command has taken effect, but that the wait for active shards has actually failed.	2017-12-07 10:23:36 +01:00
Tim Brooks	2aa62daed4	Introduce resizable inbound byte buffer (#27551 ) This is related to #27563. In order to interface with java nio, we must have buffers that are compatible with ByteBuffer. This commit introduces a basic ByteBufferReference to easily allow transferring bytes off the wire to usage in the application. Additionally it introduces an InboundChannelBuffer. This is a buffer that can internally expand as more space is needed. It is designed to be integrated with a page recycler so that it can internally reuse pages. The final piece is moving all of the index work for writing bytes to a channel into the WriteOperation.	2017-12-06 11:02:25 -07:00
Boaz Leskes	e0e698bc26	testCorruptTranslogTruncation: add logging	2017-12-06 14:46:39 +01:00
Jim Ferenczi	caea6b70fa	Add a new cluster setting to limit the total number of buckets returned by a request (#27581 ) This commit adds a new dynamic cluster setting named `search.max_buckets` that can be used to limit the number of buckets created per shard or by the reduce phase. Each multi bucket aggregator can consume buckets during the final build of the aggregation at the shard level or during the reduce phase (final or not) in the coordinating node. When an aggregator consumes a bucket, a global count for the request is incremented and if this number is greater than the limit an exception is thrown (TooManyBuckets exception). This change adds the ability for multi bucket aggregator to "consume" buckets in the global limit, the default is 10,000. It's an opt-in consumer so each multi-bucket aggregator must explicitly call the consumer when a bucket is added in the response. Closes #27452 #26012	2017-12-06 09:15:28 +01:00
Simon Willnauer	70f8ea367b	Allow index settings to be reset by wildcards (#27671 ) Index settings didn't support reset by wildcard which also causes issues like #27537 where archived settings can't be reset. This change adds support for wildcards like `archived.*` to be used to reset setting to their defaults or remove them from an index. Closes #27537	2017-12-06 07:35:37 +01:00
javanna	234e09a105	Fix UpdateMappingIntegrationIT test failures The mappings can be submitted wrapped in a type object or not. They need to be returned in the same way as they were submitted. When applying field filters, we need to make sure that the format is preserved. MappingMetaData#getSourceAsMap removes the root level if it's the type object, which would make us overwrite the original mappings with filtered mappings but without the original root object. Closes #27678	2017-12-06 01:43:17 +01:00
Ryan Ernst	8139e3a1c7	Add validation of keystore setting names (#27626 ) This commit restricts settings added to the keystore to have a lowercase ascii name. The java Keystore javadocs state that case sensitivity of key alias names are implementation dependent. This ensures regardless of case sensitivity in a jvm implementation, the keys will be stored as we expect.	2017-12-05 14:30:36 -08:00
Nhat Nguyen	ed2caf2bad	Prevent constructing index template without patterns (#27662 ) Today, we prevent the system from storing a broken index template in the transport layer, however we don't prevent this in XContent. A broken index template can break the whole cluster state. This commit attempts to prevent the system from constructing an index template without a proper index patterns.	2017-12-05 15:38:33 -05:00
Luca Cavanna	f4fb4d3bf5	Add support for filtering mappings fields (#27603 ) Add support for filtering fields returned as part of mappings in get index, get mappings, get field mappings and field capabilities API. Plugins can plug in their own function, which receives the index as argument, and return a predicate which controls whether each field is included or not in the returned output.	2017-12-05 20:31:29 +01:00
Nicholas Knize	8bcf5393f2	[Geo] Add Well Known Text (WKT) Parsing Support to ShapeBuilders This commit adds WKT support to Geo ShapeBuilders. This supports the following format: POINT (30 10) LINESTRING (30 10, 10 30, 40 40) BBOX (-10, 10, 10, -10) POLYGON ((30 10, 40 40, 20 40, 10 20, 30 10)) POLYGON ((35 10, 45 45, 15 40, 10 20, 35 10), (20 30, 35 35, 30 20, 20 30)) MULTIPOINT ((10 40), (40 30), (20 20), (30 10)) MULTIPOINT (10 40, 40 30, 20 20, 30 10) MULTILINESTRING ((10 10, 20 20, 10 40),(40 40, 30 30, 40 20, 30 10)) MULTIPOLYGON (((30 20, 45 40, 10 40, 30 20)), ((15 5, 40 10, 10 20, 5 10, 15 5))) MULTIPOLYGON (((40 40, 20 45, 45 30, 40 40)), ((20 35, 10 30, 10 10, 30 5, 45 20, 20 35), (30 20, 20 15, 20 25, 30 20))) GEOMETRYCOLLECTION (POINT (30 10), MULTIPOINT ((10 40), (40 30), (20 20), (30 10))) closes #9120	2017-12-05 10:56:41 -06:00
David Turner	5b03e3b53d	Fix up tests now that GeoDistance.*.calculate works (#27541 ) This resolves a longstanding @AwaitsFix	2017-12-05 16:19:33 +00:00
Jason Tedor	42a4ad35da	Add node name to thread pool executor name This commit adds the node name to the names of thread pool executors so that the node name is visible in rejected execution exception messages. Relates #27663	2017-12-05 07:45:40 -05:00
Jason Tedor	144e1698cf	Do not pass executor shutdown to super constructor The main constructor for rejected execution exception its executor shutdown constructor parameter to the super constructor where it would be used as a formatting parameter. This is a mistake so this commit fixes this issue.	2017-12-05 07:11:14 -05:00
Jason Tedor	eb574425b7	Simplify rejected execution exception This exception type has several unnecessary constructor overrides so this commit removes them. Relates #27664	2017-12-05 06:58:34 -05:00
Martijn van Groningen	b447967104	removed redundant import	2017-12-05 11:01:25 +01:00
Martijn van Groningen	4d78e1a9ad	Added msearch api to high level client	2017-12-05 10:17:47 +01:00
Boaz Leskes	c203cff692	fix java9 compilation	2017-12-05 09:28:56 +01:00
Jim Ferenczi	8635f68ece	Fix term vectors generator with keyword and normalizer (bis) Fallback on the index analyzer if the normalizer is null. Closes #27320	2017-12-05 01:33:14 +01:00
Jason Tedor	963ed25cf5	Only fsync global checkpoint if needed In the global checkpoint sync action, we fsync the translog. However, the last synced global checkpoint might already be equal to the current global checkpoint in which case the fsyncing the translog is unnecessary as either the sync needed guard in the translog will skip the translog, or the translog needs an fsync for another reason that will be picked up elsewhere (e.g., at the end of a bulk request). Relates #27652	2017-12-04 18:10:04 -05:00
Jason Tedor	72800bb90b	Fix Lucene version for 6.2.0 constant in master This commit fixes the Lucene version constant in master for the 6.2.0 version. Relates #27658	2017-12-04 16:38:45 -05:00
Nhat Nguyen	4b558636f0	TEST: Rewrite GeoPointParsingTests#testEqualsHashCodeContract (#27634 ) The hashCode contract states that equal objects must have equal hash codes, however the unequal objects are not required to have unequal hashCodes. This commit rewrites GeoPointParsingTests#testEqualsHashCodeContract using#checkEqualsAndHashCode helper. Closes #27633	2017-12-04 16:34:34 -05:00
Jim Ferenczi	e0b1a6544d	Fix term vectors generator with keyword and normalizer (#27608 ) This change applies the normalizer defined on the field when building term vectors dynamically on a keyword field. Fixes #27320	2017-12-04 21:32:16 +01:00
Jim Ferenczi	da50fa4540	Fix highlighting on a keyword field that defines a normalizer (#27604 ) * Fix highlighting on a keyword field that defines a normalizer The `plain` and sometimes the `unified` highlighters need to re-analyze the content to highlight a field This change makes sure that we don't ignore the normalizer defined on the keyword field for this analysis.	2017-12-04 21:04:14 +01:00
Jason Tedor	17a2d574de	Obey translog durability in global checkpoint sync After write operations in some situations we fire a post-operation global checkpoint sync. The global checkpoint sync unconditionally fsyncs the translog and this can then look like an fsync per-request. This violates the translog durability settings on the index if this durability is set to async. This commit changes the global checkpoint sync to observe the translog durability. Relates #27641	2017-12-04 12:14:25 -05:00
Simon Willnauer	84ec472428	Include internal refreshes in refresh stats (#27615 ) Today we exclude internal refreshes in the refresh stats. Yet, it's very much confusing to not take these into account. This change includes internal refreshes into the stats until we have a dedicated stats for this.	2017-12-04 16:33:47 +01:00
Nhat Nguyen	e213fa033d	Tighten the CountedBitSet class This commit addresses the missed comments from https://github.com/elastic/elasticsearch/pull/27547.	2017-12-04 09:51:34 -05:00
Boaz Leskes	2900e3f345	adapt testWaitForPendingSeqNo to stricter operation recovery range Before we use to ship anything in the translog above a certain point. #27580 changed to have a strict upper bound.	2017-12-04 13:18:42 +01:00
Boaz Leskes	1a976ea7a4	Cherry pick tests and seqNo recovery hardning from #27580	2017-12-04 13:15:40 +01:00
Catalin Ursachi	b44ae25c27	Updated "string" datatype in docs & tests to "text" (#27629 )	2017-12-04 11:51:00 +01:00
Adrien Grand	6323bb0d97	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27619 ) This new snapshot mostly brings a change to TopFieldCollector which can now early terminate collection when trackTotalHits is `false`. As a follow-up, we should replace our usage of `EarlyTerminatingSortingCollector` with this new option.	2017-12-04 09:40:08 +01:00
Nhat Nguyen	49df50f662	Simplify MultiSnapshot#SeqNoset (#27547 ) Today, we maintain two sets in a SeqNoSet: ongoing sets and completed sets. We can remove the completed sets and use only the ongoing sets by releasing the internal bitset of a CountedBitSet when all its bits are set. This behaves like two sets but simpler. This commit also makes CountedBitSet as a drop-in replacement for BitSet. Relates #27268	2017-12-03 15:20:57 -05:00
Lee Hinman	623d3700f0	Add accounting circuit breaker and track segment memory usage (#27116 ) * Add accounting circuit breaker and track segment memory usage This commit adds a new circuit breaker "accounting" that is used for tracking the memory usage of non-request-tied memory users. It also adds tracking for the amount of Lucene segment memory used by a shard as a user of the new circuit breaker. The Lucene segment memory is updated when the shard refreshes, and removed when the shard relocates away from a node or is deleted. It should also be noted that all tracking for segment memory uses `addWithoutBreaking` so as not to fail the shard if a limit is reached. The `accounting` breaker has a default limit of 100% and will contribute to the parent breaker limit. Resolves #27044	2017-12-01 07:59:45 -07:00
David Turner	5060007d20	Fix sporadic failures in testCorruptedShards (#27613 ) Add assertBusy() to retry in case the shards are not yet all failed, and remove `@AwaitsFix`. Resolves #12416.	2017-12-01 13:33:23 +00:00
Luca Cavanna	3e8ca38fca	Deprecate the transport client in favour of the high-level REST client (#27085 )	2017-12-01 12:24:16 +01:00
David Turner	1e6bd99248	Reinstate AwaitsFix	2017-11-30 21:09:01 +00:00
Simon Willnauer	67cd1e9c5f	Reset LiveVersionMap on sync commit (#27534 ) Today we carry on the size of the live version map to ensure that we minimze rehashing. Yet, once we are idle or we can issue a sync-commit we can resize it to defaults to free up memory. Relates to #27516	2017-11-30 20:44:05 +01:00
Simon Willnauer	b116221540	Ensure shard is refreshed once it's inactive (#27559 ) Once a shard goes inactive we want the shard to be refreshed if the refresh interval is default since we might hold on to unnecessary segments and in the inactive case we stopped indexing and can release old segments. Relates to #27500	2017-11-30 19:04:05 +01:00
Mayya Sharipova	c6b73239ae	Limit the number of tokens produced by _analyze (#27529 ) Add an index level setting `index.analyze.max_token_count` to control the number of generated tokens in the _analyze endpoint. Defaults to 10000. Throw an error if the number of generated tokens exceeds this limit. Closes #27038	2017-11-30 11:54:39 -05:00
David Turner	92a24de509	Add more logging to testCorruptedShards to help investigate sporadic failures	2017-11-30 16:34:23 +00:00
David Turner	1f89e9d94e	Reinstate AwaitsFix This reverts commit `29c5540323`.	2017-11-30 13:01:22 +00:00
olcbean	d25c9671de	Deprecate `jarowinkler` in favor of `jaro_winkler` (#27526 ) Jaro and Winkler are two people, so we should use the same naming convention as for Damerau–Levenshtein.	2017-11-30 12:49:34 +00:00
Tanguy Leroux	41f73e0acf	Fix version for include_global_state in Snapshot Status API It also adds a Rest test. Related #26853	2017-11-30 11:33:01 +01:00
kel	efac982e35	Include include_global_state in Snapshot status API (#26853 ) This commit adds a field include_global_state to snapshot status api response. For legacy snapshot, the field is not present. Closes #22423	2017-11-30 10:38:07 +01:00
Tanguy Leroux	192d1f03f8	Do not swallow exception in ChecksumBlobStoreFormat.writeAtomic() (#27597 ) The ChecksumBlobStoreFormat.writeAtomic() method writes a blob using a temporary name and then moves the blob to its final name. The move operation can fail and in this case the temporary blob is deleted. If this delete operation also fails, then the initial exception is lost. This commit ensures that when something goes wrong during the move operation the initial exception is kept and thrown, and if the delete operation also fails then this additional exception is added as a suppressed exception to the initial one.	2017-11-30 10:09:49 +01:00
Jason Tedor	55cb8ddd80	Do not set data paths on no local storage required Today when configuring the data paths for the environment, we set data paths to either the specified path.data or default to data relative to the Elasticsearch home. Yet if node.local_storage is false, data paths do not even make sense. In this case, we should reject if path.data is set, and instead of defaulting data paths to data relative to home, we should set this to empty paths. This commit does this. Relates #27587	2017-11-29 17:35:00 -05:00
David Turner	29c5540323	Remove AwaitsFix	2017-11-29 18:12:18 +00:00
Tanguy Leroux	547f006118	Remove XContentType auto detection in BlobStoreRepository (#27480 )	2017-11-29 09:39:49 +01:00
Simon Willnauer	4aa840698f	Ensure threadcontext is preserved when refresh listeners are invoked (#27565 ) today a refresh listener won't preserve the entire context ie. won't carry on response headers etc. from the caller side. This change adds support for stored contexts.	2017-11-28 21:32:16 +01:00
Simon Willnauer	184b7f06ee	Make Segment statistics aware of segments hold by internal readers (#27558 ) Today we only expose the external readers segments. Yet, from a statistics perspective both internal and external segments are relevant. This commit exposes the additional segments of the internal and external reader respectively.	2017-11-28 17:37:03 +01:00
Jason Tedor	cefb46d0fc	Throw UOE from compressible bytes stream reset A compressible bytes output stream is a stream output which supports a reset method. However, compressible bytes output streams are unusual in that the current implementation sometimes supports a reset (if the stream is not compressed) and sometimes does not support a rest (if the stream is compressed). This inconsistent behavior is puzzling and instead we should simply always throw an unsupported operation exception. Relates #27564	2017-11-28 11:29:47 -05:00
Jim Ferenczi	37653c9dca	[TEST] AggregationsIntegrationIT#testScroll can timeout This change sets the scroll timeout for this test to 1m instead of 500ms in order to avoid loosing the scroll on slow machines. Relates #26378	2017-11-28 16:18:54 +01:00
Adrien Grand	d01fcee645	Fix illegal cast of the "low cardinality" optimization of the `terms` aggregation. (#27543 ) The GlobalOrdinalsStringTermsAggregator.LowCardinality aggregator casts global values to `GlobalOrdinalMapping`, even though the implementation of global values is different when a `missing` value is configured. This commit adds a new API that gives access to the ordinal remapping in order to fix this problem.	2017-11-28 14:55:09 +01:00
Adrien Grand	996990ad1f	Upgrade to lucene-7.2.0-snapshot-8c94404. (#27496 ) The main highlight of this new snapshot is that it introduces the opportunity for queries to opt out of caching. In case a query opts out of caching, not only will it never be cached, but also no compound query that wraps it will be cached.	2017-11-28 14:52:42 +01:00
Martijn van Groningen	cb1204774b	Include the _index, _type and _id to nested search hits in the top_hits and inner_hits response. Also include _type and _id for parent/child hits inside inner hits. In the case of top_hits aggregation the nested search hits are directly returned and are not grouped by a root or parent document, so it is important to include the _id and _index attributes in order to know to what documents these nested search hits belong to. Closes #27053	2017-11-28 14:05:29 +01:00
Nhat Nguyen	000f62c1d2	TEST: makes sure to corrupt referenced tlog files (#27546 ) Method TruncateTranslogIT#corruptTranslogFiles corrupts some random existing *.tlog files in a translog directory. However, this may not actually corrupt translog at all if it corrupts only tlog files which are not referenced by the Checkpoint (eg. their translog generations are smaller the Checkpoint). This commit makes sure that we corrupt some tlog files which are referenced by the Checkpoint. Closes #27538	2017-11-27 20:18:58 -05:00
Simon Willnauer	0eb87e5d57	[TEST] Fix broken test that still tried to acquire the shards to set it non-idle	2017-11-27 22:52:34 +01:00
Jason Tedor	d8c28044da	Forbid granting the all permission in production Running with the all permission java.security.AllPermission granted is equivalent to disabling the security manager. This commit adds a bootstrap check that forbids running with this permission granted. Relates #27548	2017-11-27 16:05:27 -05:00
Jason Tedor	379d51fcfa	Bubble exceptions when closing compressible streams Compressible bytes output stream swallows exceptions that occur when closing. This commit changes this behavior so that such exceptions bubble up. Relates #27542	2017-11-27 13:48:04 -05:00
Simon Willnauer	f23ed6188d	Skip shard refreshes if shard is `search idle` (#27500 ) Today we refresh automatically in the background by default very second. This default behavior has a significant impact on indexing performance if the refreshes are not needed. This change introduces a notion of a shard being `search idle` which a shard transitions to after (default) `30s` without any access to an external searcher. Once a shard is search idle all scheduled refreshes will be skipped unless there are any refresh listeners registered. If a search happens on a `serach idle` shard the search request _park_ on a refresh listener and will be executed once the next scheduled refresh occurs. This will also turn the shard into the `non-idle` state immediately. This behavior is only applied if there is no explicit refresh interval set.	2017-11-27 18:16:10 +01:00
Martijn van Groningen	3f98b85489	inner_hits: Return an empty _source for nested inner hit when filtering on a field that doesn't exist. Before this change the search request would fail with an error indicating that it couldn't detect xcontent type based on the string: `null`	2017-11-27 10:51:24 +01:00
Nhat Nguyen	a4b4e14186	Dedup translog operations by reading in reverse (#27268 ) Currently, translog operations are read and processed one by one. This may be a problem as stale operations in translogs may suddenly reappear in recoveries. To make sure that stale operations won't be processed, we read the translog files in a reverse order (eg. from the most recent file to the oldest file) and only process an operation if its sequence number was not seen before. Relates to #10708	2017-11-26 16:44:30 -05:00
Jason Tedor	0519fa223c	Ensure logging is configured for CLI commands Any CLI commands that depend on core Elasticsearch might touch classes (directly or indirectly) that depends on logging. If they do this and logging is not configured, Log4j will dump status error messages to the console. As such, we need to ensure that any such CLI command configures logging (with a trivial configuration that dumps log messages to the console). Previously we did this in the base CLI command but with the refactoring of this class out of core Elasticsearch, we no longer configure logging there (since we did not want this class to depend on settings and logging). However, this meant for some CLI commands (like the plugin CLI) we were no longer configuring logging. This commit adds base classes between the low-level command and multi-command classes that ensure that logging is configured. Any CLI command that depends on core Elasticsearch should use this infrastructure to ensure logging is configured. There is one exception to this: Elasticsearch itself because it takes reponsibility into its own hands for configuring logging from Elasticsearch settings and log4j2.properties. We preserve this special status. Relates #27523	2017-11-25 11:40:08 -05:00
Simon Willnauer	a29dc20c26	Ensure `doc_stats` are changing even if refresh is disabled (#27505 ) Today if refresh is disabled the doc stats are not updated anymore. In a bulk index scenario this might cause confusion since even if we refresh internal readers etc. doc stats are never advancing. This change cuts over to the internal reader that is refreshed outside of the external readers refresh interval but always equally `fresh` or `fresher` which will cause less confusion.	2017-11-25 14:24:16 +01:00
Jason Tedor	0b6448726c	Fix classes that can exit In a previous change, we locked down the classes that can exit by specifying explicit classes rather than packages than can exit. Alas, there was a bug in the sense that the class that we exit from in the case of an uncaught exception is not ElasticsearchUncaughtExceptionHandler but rather an anonymous nested class of ElasticsearchUncaughtExceptionHandler. To address this, we replace this anonymous class with a bonafide nested class ElasticsearchUncaughtExceptionHandler$PrivilegedHaltAction. Note that if we try to get this class name we have a $ in the middle of the string which is a special regular expression character; as such, we have to escape it. Relates #27518	2017-11-24 19:00:18 -05:00
Nhat Nguyen	e0e1a92d36	Revert "Adjust CombinedDeletionPolicy for multiple commits (#27456 )" The commit looks harmless, unfortunately it can break the engine flush scheduler and the translog rolling. Both `uncommittedOperations` and `uncommittedSizeInBytes` are currently calculated based on the minimum required generation for recovery rather than the translog generation of the last index commit. This is not correct if other index commits are reserved for snapshotting even though we are keeping the last index commit only. This reverts commit `e95d18ec23`.	2017-11-24 15:19:50 -05:00
Nhat Nguyen	06d35f4f01	Backport wait_for_initialiazing_shards to cluster health API Relates #27489	2017-11-24 09:56:16 -05:00
Simon Willnauer	17e9940fc1	Carry over version map size to prevent excessive resizing (#27516 ) Today we create a new concurrent hash map everytime we refresh the internal reader. Under defaults this isn't much of a deal but once the refresh interval is set to `-1` these maps grow quite large and it can have a significant impact on indexing throughput. Under low memory situations this can cause up to 2x slowdown. This change carries over the map size as the initial capacity wich will be auto-adjusted once indexing stops. Closes #20498	2017-11-24 14:57:31 +01:00
Jim Ferenczi	c6724abe74	Fix scroll query with a sort that is a prefix of the index sort (#27498 ) During a scroll, if the search sort matches the index sort we use the sort values of the last doc returned by the previous scroll to optimize the main query with a `SearchAfterSortedDocQuery`. This query can "jump" directly to the first document that sorts after the provided sort values. This optim is also applied if the search sort is a prefix of the index sort but this case throws an exception because we use the index sort (instead of the search sort) to validate the sort values of the last document. This change fixes this bug and adds a test for it.	2017-11-24 13:44:47 +01:00
Tanguy Leroux	5dc5580eac	Delete shard store files before restoring a snapshot (#27476 ) Pull request #20220 added a change where the store files that have the same name but are different from the ones in the snapshot are deleted first before the snapshot is restored. This logic was based on the `Store.RecoveryDiff.different` set of files which works by computing a diff between an existing store and a snapshot. This works well when the files on the filesystem form valid shard store, ie there's a `segments` file and store files are not corrupted. Otherwise, the existing store's snapshot metadata cannot be read (using Store#snapshotStoreMetadata()) and an exception is thrown (CorruptIndexException, IndexFormatTooOldException etc) which is later caught as the begining of the restore process (see RestoreContext#restore()) and is translated into an empty store metadata (Store.MetadataSnapshot.EMPTY). This will make the deletion of different files introduced in #20220 useless as the set of files will always be empty even when store files exist on the filesystem. And if some files are present within the store directory, then restoring a snapshot with files with same names will fail with a FileAlreadyExistException. This is part of the #26865 issue. There are various cases were some files could exist in the store directory before a snapshot is restored. One that Igor identified is a restore attempt that failed on a node and only first files were restored, then the shard is allocated again to the same node and the restore starts again (but fails because of existing files). Another one is when some files of a closed index are corrupted / deleted and the index is restored. This commit adds a test that uses the infrastructure provided by IndexShardTestCase in order to test that restoring a shard succeed even when files with same names exist on filesystem. Related to #26865	2017-11-24 13:15:34 +01:00
kel	4885acb048	Replace `delimited_payload_filter` by `delimited_payload` (#26625 ) The `delimited_payload_filter` is renamed to `delimited_payload`, the old name is deprecated and should be replaced by `delimited_payload`. Closes #21978	2017-11-24 13:03:19 +01:00
Alexander Kazakov	43a91f4184	Fix merging of _meta field (#27352 )	2017-11-24 09:44:39 +01:00
Nhat Nguyen	46b508d6c9	Add wait_for_no_initializing_shards to cluster health API (#27489 ) This adds a new option to the cluster health request allowing to wait until there is no initializing shards. Closes #25623	2017-11-23 15:09:58 -05:00
Simon Willnauer	93a988c557	[TEST] use routing partition size based on the max routing shards of the second split	2017-11-23 17:59:07 +01:00
Nhat Nguyen	e95d18ec23	Adjust CombinedDeletionPolicy for multiple commits (#27456 ) Today, we keep only the last index commit and use only it to calculate the minimum required translog generation. This may no longer be correct as we introduced a new deletion policy which keeps multiple index commits. This change adjusts the CombinedDeletionPolicy so that it can work correctly with a new index deletion policy. Relates to #10708, #27367	2017-11-23 11:34:50 -05:00
olcbean	fd564b10db	Deprecate `levenstein` in favor of `levenshtein` (#27409 ) Support both spellings thoughout 6.x, reporting the incorrect one as deprecated.	2017-11-23 12:53:47 +00:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
olcbean	05998f91d0	Validate `op_type` for `_create` (#27483 )	2017-11-23 07:19:47 +01:00
Nicholas Knize	4017049bac	Minor ShapeBuilder cleanup Remove some auto generated comments and make CoordinateNode ctors package private.	2017-11-22 23:08:15 -06:00
Martijn van Groningen	9fbbc46ba4	Added 6.2 version	2017-11-22 17:37:19 +01:00
Mayya Sharipova	57e4d10007	Limit the number of nested documents (#27405 ) Add an index level setting `index.mapping.nested_objects.limit` to control the number of nested json objects that can be in a single document across all fields. Defaults to 10000. Throw an error if the number of created nested documents exceed this limit during the parsing of a document. Closes #26962	2017-11-22 10:16:28 -05:00
Jason Tedor	4cffe8f3bd	Fix whitespace on docs for read/writeArray methods	2017-11-21 16:46:52 -05:00
Jason Tedor	1d5d246e32	Fix whitespace in Security.java	2017-11-21 16:46:10 -05:00
Jason Tedor	f5a79af578	Tighten which classes can exit Today we allow exiting solely by being in certain packages. This commit upgrades the securesm dependency to a new version that supports being explicit about which classes can exit. We utilize that here to only allow exiting from the uncaught exception handler and the base CLI command class. Relates #27482	2017-11-21 16:37:41 -05:00
Jim Ferenczi	3427062c3a	[Test] Fix AggregationsTests#testFromXContentWithRandomFields Exclude "key" field from random modifications in tests, the composite agg uses an array of object for bucket key and values are checked. Relates #26800	2017-11-21 19:28:27 +01:00
javanna	3eeccb7791	Update version check for CCS optional remote clusters also fixed the remote.info yaml test to clean up the registered remote cluster once the test is completed. Relates to #27182	2017-11-21 16:52:45 +01:00
Simon Willnauer	cc78b24867	Bump BWC version to 6.1.0 for #27469	2017-11-21 16:16:31 +01:00
Adrien Grand	6ac799074e	Fix dynamic mapping update generation. (#27467 ) When a field is not mapped, Elasticsearch tries to generate a mapping update from the parsed document. Some documents can introduce corner-cases, for instance in the event of a multi-valued field whose values would be mapped to different field types if they were supplied on their own, see for instance: ``` PUT index/doc/1 { "foo": ["2017-11-10T02:00:01.247Z","bar"] } ``` In that case, dynamic mappings want to map the first value as a `date` field and the second one as a `text` field. This currently throws an exception, which is expected, but the wrong one since it throws a `class_cast_exception` (which triggers a HTTP 5xx code) when it should throw an `illegal_argument_exception` (HTTP 4xx).	2017-11-21 15:31:18 +01:00

1 2 3 4 5 ...

9215 Commits