OpenSearch

Commit Graph

Author	SHA1	Message	Date
Nhat Nguyen	06d35f4f01	Backport wait_for_initialiazing_shards to cluster health API Relates #27489	2017-11-24 09:56:16 -05:00
Simon Willnauer	17e9940fc1	Carry over version map size to prevent excessive resizing (#27516 ) Today we create a new concurrent hash map everytime we refresh the internal reader. Under defaults this isn't much of a deal but once the refresh interval is set to `-1` these maps grow quite large and it can have a significant impact on indexing throughput. Under low memory situations this can cause up to 2x slowdown. This change carries over the map size as the initial capacity wich will be auto-adjusted once indexing stops. Closes #20498	2017-11-24 14:57:31 +01:00
Jim Ferenczi	c6724abe74	Fix scroll query with a sort that is a prefix of the index sort (#27498 ) During a scroll, if the search sort matches the index sort we use the sort values of the last doc returned by the previous scroll to optimize the main query with a `SearchAfterSortedDocQuery`. This query can "jump" directly to the first document that sorts after the provided sort values. This optim is also applied if the search sort is a prefix of the index sort but this case throws an exception because we use the index sort (instead of the search sort) to validate the sort values of the last document. This change fixes this bug and adds a test for it.	2017-11-24 13:44:47 +01:00
Tanguy Leroux	5dc5580eac	Delete shard store files before restoring a snapshot (#27476 ) Pull request #20220 added a change where the store files that have the same name but are different from the ones in the snapshot are deleted first before the snapshot is restored. This logic was based on the `Store.RecoveryDiff.different` set of files which works by computing a diff between an existing store and a snapshot. This works well when the files on the filesystem form valid shard store, ie there's a `segments` file and store files are not corrupted. Otherwise, the existing store's snapshot metadata cannot be read (using Store#snapshotStoreMetadata()) and an exception is thrown (CorruptIndexException, IndexFormatTooOldException etc) which is later caught as the begining of the restore process (see RestoreContext#restore()) and is translated into an empty store metadata (Store.MetadataSnapshot.EMPTY). This will make the deletion of different files introduced in #20220 useless as the set of files will always be empty even when store files exist on the filesystem. And if some files are present within the store directory, then restoring a snapshot with files with same names will fail with a FileAlreadyExistException. This is part of the #26865 issue. There are various cases were some files could exist in the store directory before a snapshot is restored. One that Igor identified is a restore attempt that failed on a node and only first files were restored, then the shard is allocated again to the same node and the restore starts again (but fails because of existing files). Another one is when some files of a closed index are corrupted / deleted and the index is restored. This commit adds a test that uses the infrastructure provided by IndexShardTestCase in order to test that restoring a shard succeed even when files with same names exist on filesystem. Related to #26865	2017-11-24 13:15:34 +01:00
kel	4885acb048	Replace `delimited_payload_filter` by `delimited_payload` (#26625 ) The `delimited_payload_filter` is renamed to `delimited_payload`, the old name is deprecated and should be replaced by `delimited_payload`. Closes #21978	2017-11-24 13:03:19 +01:00
David Turner	e8c2e5d6dd	CURRENT should not be a -SNAPSHOT version if build.snapshot is false (#27512 )	2017-11-24 11:19:41 +00:00
Alexander Kazakov	43a91f4184	Fix merging of _meta field (#27352 )	2017-11-24 09:44:39 +01:00
David Turner	8916935639	Remove unused method (#27508 )	2017-11-24 08:09:31 +00:00
Martijn van Groningen	f1ebf366bf	unmuted test, this has been fixed by #27397 Closes #27497	2017-11-24 08:53:00 +01:00
David Turner	89ba8996c6	Consolidate version numbering semantics (#27397 ) Fixes to the build system, particularly around BWC testing, and to make future version bumps less painful.	2017-11-23 20:21:53 +00:00
Nhat Nguyen	46b508d6c9	Add wait_for_no_initializing_shards to cluster health API (#27489 ) This adds a new option to the cluster health request allowing to wait until there is no initializing shards. Closes #25623	2017-11-23 15:09:58 -05:00
Simon Willnauer	93a988c557	[TEST] use routing partition size based on the max routing shards of the second split	2017-11-23 17:59:07 +01:00
Nhat Nguyen	e95d18ec23	Adjust CombinedDeletionPolicy for multiple commits (#27456 ) Today, we keep only the last index commit and use only it to calculate the minimum required translog generation. This may no longer be correct as we introduced a new deletion policy which keeps multiple index commits. This change adjusts the CombinedDeletionPolicy so that it can work correctly with a new index deletion policy. Relates to #10708, #27367	2017-11-23 11:34:50 -05:00
Clinton Gormley	d1b1d711df	Update composite-aggregation.asciidoc Fixed asciidoc typo	2017-11-23 15:05:14 +01:00
olcbean	fd564b10db	Deprecate `levenstein` in favor of `levenshtein` (#27409 ) Support both spellings thoughout 6.x, reporting the incorrect one as deprecated.	2017-11-23 12:53:47 +00:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routinng shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificually changing the number of buckets in the consistent hashign space document might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
olcbean	05998f91d0	Validate `op_type` for `_create` (#27483 )	2017-11-23 07:19:47 +01:00
Nicholas Knize	4017049bac	Minor ShapeBuilder cleanup Remove some auto generated comments and make CoordinateNode ctors package private.	2017-11-22 23:08:15 -06:00
Martijn van Groningen	ca9c476d88	muted test	2017-11-22 19:18:35 +01:00
Tim Brooks	ef34555b29	Decouple nio constructs from the tcp transport (#27484 ) This is related to #27260. Currently, basic nio constructs (nio channels, the channel factories, selector event handlers, etc) implement logic that is specific to the tcp transport. For example, NioChannel implements the TcpChannel interface. These nio constructs at some point will also need to support other protocols (ex: http). This commit separates the TcpTransport logic from the nio building blocks.	2017-11-22 11:39:31 -06:00
Martijn van Groningen	9fbbc46ba4	Added 6.2 version	2017-11-22 17:37:19 +01:00
Mayya Sharipova	57e4d10007	Limit the number of nested documents (#27405 ) Add an index level setting `index.mapping.nested_objects.limit` to control the number of nested json objects that can be in a single document across all fields. Defaults to 10000. Throw an error if the number of created nested documents exceed this limit during the parsing of a document. Closes #26962	2017-11-22 10:16:28 -05:00
Jason Tedor	4cffe8f3bd	Fix whitespace on docs for read/writeArray methods	2017-11-21 16:46:52 -05:00
Jason Tedor	1d5d246e32	Fix whitespace in Security.java	2017-11-21 16:46:10 -05:00
Jason Tedor	f5a79af578	Tighten which classes can exit Today we allow exiting solely by being in certain packages. This commit upgrades the securesm dependency to a new version that supports being explicit about which classes can exit. We utilize that here to only allow exiting from the uncaught exception handler and the base CLI command class. Relates #27482	2017-11-21 16:37:41 -05:00
Jim Ferenczi	3427062c3a	[Test] Fix AggregationsTests#testFromXContentWithRandomFields Exclude "key" field from random modifications in tests, the composite agg uses an array of object for bucket key and values are checked. Relates #26800	2017-11-21 19:28:27 +01:00
Takumasa Ochi	eed8d1aee5	[DOC] Fix mathematical representation on interval (range) (#27450 )	2017-11-21 17:06:26 +00:00
javanna	3eeccb7791	Update version check for CCS optional remote clusters also fixed the remote.info yaml test to clean up the registered remote cluster once the test is completed. Relates to #27182	2017-11-21 16:52:45 +01:00
Simon Willnauer	cc78b24867	Bump BWC version to 6.1.0 for #27469	2017-11-21 16:16:31 +01:00
Jim Ferenczi	90d2ead14a	Adapt rest test BWC version after backport Relates #26800	2017-11-21 15:45:02 +01:00
Adrien Grand	6ac799074e	Fix dynamic mapping update generation. (#27467 ) When a field is not mapped, Elasticsearch tries to generate a mapping update from the parsed document. Some documents can introduce corner-cases, for instance in the event of a multi-valued field whose values would be mapped to different field types if they were supplied on their own, see for instance: ``` PUT index/doc/1 { "foo": ["2017-11-10T02:00:01.247Z","bar"] } ``` In that case, dynamic mappings want to map the first value as a `date` field and the second one as a `text` field. This currently throws an exception, which is expected, but the wrong one since it throws a `class_cast_exception` (which triggers a HTTP 5xx code) when it should throw an `illegal_argument_exception` (HTTP 4xx).	2017-11-21 15:31:18 +01:00
Simon Willnauer	5a0b6d1977	Use the primary_term field to identify parent documents (#27469 ) This change stops indexing the `_primary_term` field for nested documents to allow fast retrieval of parent documents. Today we create a docvalues field for children to ensure we have a dense datastructure on disk. Yet, since we only use the primary term to tie-break on when we see the same seqID on indexing having a dense datastructure is less important. We can use this now to improve the nested docs performance and it's memory footprint. Relates to #24362	2017-11-21 15:14:03 +01:00
Jim Ferenczi	6319424e4a	Move composite aggregation to core (#27474 ) This change removes the module named aggs-composite and adds the `composite` aggs as a core aggregation. This allows other plugins to use this new aggregation and simplifies the integration in the HL rest client.	2017-11-21 13:31:01 +01:00
Simon Willnauer	8aba7c8bbe	Fix test BWC version after backport Relates to #27468	2017-11-21 12:31:04 +01:00
Simon Willnauer	ea35abca28	Protect shard splitting from illegal target shards (#27468 ) While we have an assertion that checks if the number of routing shards is a multiple of the number of shards we need a real hard exception that checks this way earlier. This change adds a check and test that is executed before we create the index. Relates to #26931	2017-11-21 12:09:45 +01:00
Luca Cavanna	29450de7b5	Cross Cluster Search: make remote clusters optional (#27182 ) Today Cross Cluster Search requires at least one node in each remote cluster to be up once the cross cluster search is run. Otherwise the whole search request fails despite some of the data (either local and/or remote) is available. This happens when performing the _search/shards calls to find out which remote shards the query has to be executed on. This scenario is different from shard failures that may happen later on when the query is actually executed, in case e.g. remote shards are missing, which is not going to fail the whole request but rather yield partial results, and the _shards section in the response will indicate that. This commit introduces a boolean setting per cluster called search.remote.$cluster_alias.skip_if_disconnected, set to false by default, which allows to skip certain clusters if they are down when trying to reach them through a cross cluster search requests. By default all clusters are mandatory. Scroll requests support such setting too when they are first initiated (first search request with scroll parameter), but subsequent scroll rounds (_search/scroll endpoint) will fail if some of the remote clusters went down meanwhile. The search API response contains now a new _clusters section, similar to the _shards section, that gets returned whenever one or more clusters were disconnected and got skipped: "_clusters" : { "total" : 3, "successful" : 2, "skipped" : 1 } Such section won't be part of the response if no clusters have been skipped. The per cluster skip_unavailable setting value has also been added to the output of the remote/info API.	2017-11-21 11:41:47 +01:00
Ulrich Reffle	dd0bb580b0	[Docs] Fix broken bulleted lists (#27470 )	2017-11-21 11:10:35 +01:00
Jason Tedor	190da14bfe	Move resync request serialization assertion This commit moves an assertion that some guard code that will eventually be dead code in the resync replication request read serialization is removed when the master branch is bumped to version 8.0.0.	2017-11-20 20:59:41 -05:00
Jason Tedor	28660be40a	Fix resync request serialization This commit addresses a subtle bug in the serialization routine for resync requests. The problem here is that Translog.Operation#readType is not compatible with the implementations of Translog.Operation#writeTo. Unfortunately, this issue prevents primary-replica from succeeding, issues which we will address in follow-ups. Relates #27418	2017-11-20 20:56:48 -05:00
Tim Brooks	4e04f95ab4	Fix issue where pages aren't released (#27459 ) This is related to #27422. Right now when we send a write to the netty transport, we attach a listener to the future. When you submit a write on the netty event loop and the event loop is shutdown, the onFailure method is called. Unfortunately, netty then tries to notify the listener which cannot be done without dispatching to the event loop. In this case, the dispatch fails and netty logs and error and does not tell us. This commit checks that netty is still not shutdown after sending a message. If netty is shutdown, we complete the listener.	2017-11-20 14:53:08 -07:00
Zachary Tong	196dbf3357	Add YAML REST tests for filters bucket agg (#27128 ) Related to #26220	2017-11-20 16:44:30 -05:00
Tim Brooks	f37eb1b403	Remove tcp profile from low level nio channel (#27441 ) This is related to #27260. Currently every nio channel has a profile field. Profile is a concept that only relates to the tcp transport. Http channels will not have profiles. This commit moves the profile from the nio channel to the read context. The context is the level that protocol specific features and logic should live.	2017-11-20 12:20:42 -07:00
Nicholas Knize	093218e052	[TEST] Fix `GeoShapeQueryTests#testPointsOnly` failure Changes unnecessary geoIntersection query to a matchAll query. closes #27454	2017-11-20 12:11:18 -06:00
Tim Brooks	0a8f48d592	Transition transport apis to use void listeners (#27440 ) Currently we use ActionListener<TcpChannel> for connect, close, and send message listeners in TcpTransport. However, all of the listeners have to capture a reference to a channel in the case of the exception api being called. This commit changes these listeners to be type <Void> as passing the channel to onResponse is not necessary. Additionally, this change makes it easier to integrate with low level transports (which use different implementations of TcpChannel).	2017-11-20 10:47:47 -07:00
Simon Willnauer	d02f45f694	AwaitsFix GeoShapeQueryTests#testPointsOnly #27454	2017-11-20 17:16:36 +01:00
Simon Willnauer	28e5cf933f	Bump test version after backport Relates to #27455	2017-11-20 16:54:59 +01:00
Simon Willnauer	720e96e288	Ensure nested documents have consistent version and seq_ids (#27455 ) Today we index dummy values for seq_ids and version on nested documents. This is on the one hand trappy since users can request these values via inner hits and on the other hand not necessarily good for compression since the dummy value will likely not compress well when seqIDs are lowish. This change ensures that we share the same field values for all documents in a nested block. This won't have any overhead, in-fact it might be more efficient since we even reduce the work needed slightly.	2017-11-20 16:50:08 +01:00
Dimitrios Liappis	d3e3bc8656	Tests: Add Fedora-27 to packaging tests Replace Fedora-25 with Fedora-27 and fix old references in `TESTING.asciidoc`. Relates #27434	2017-11-20 16:59:00 +02:00
Christoph Büscher	682a85b2c1	Delete some seemingly unused exceptions (#27439 )	2017-11-20 09:05:03 +01:00
Jim Ferenczi	d1093bd2fa	#26800 : Fix docs rendering	2017-11-20 08:41:02 +01:00

1 2 3 4 5 ...

29193 Commits All Branches Search

29193 Commits

All Branches