OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-28 02:48:38 +00:00

Author	SHA1	Message	Date
Martijn van Groningen	a056c5d469	aggs: Changed how top_hits initialises leaf collectors Both TopDocsCollector and LeafCollector were being kept around at the aggregator level. In case the nested aggregator would do a post collection then this could cause pushing down docids to top hits child aggregators that already moved to the next LeafCollector (causing assertions to trip and incorrect results). By keeping track of the LeafCollector in a separate map at the leaf bucket level this problem can simply not happen any more as the place holding LeafCollector is no longer shared. Also LeafCollector instances for TopDocsCollectors are no longer pre-created when the beginning of a new segment is evaluated. There is no guarantee that TopHitsAggregator encounters a document for a particular bucket and there has to be logic to create LeafCollector instances which have not been seen before. Closes #26738	2017-09-22 15:59:43 +02:00
Ryan Ernst	5b711c283d	Plugins: Add backcompat for sha1 checksums (#26748 ) With 6.0 rc1 we now publish sha512 checksums for official plugins. However, in order to ease the pain for plugin authors, this commit adds backcompat to still allow sha1 checksums. Also added tests for checksums. Closes #26746	2017-09-22 11:26:32 +02:00
Alexander Kazakov	ff737a880c	Add wait_for_active_shards parameter to index open command (#26682 ) Adds the wait_for_active_shards parameter to the index open command. Similar to the index creation command, the index open command will now, by default, wait until the primaries have been allocated. Closes #20937	2017-09-22 11:15:03 +02:00
Martijn van Groningen	109c6c2717	aggs: Do not delegate a null scorer to LeafBucketCollectors Closes #26611	2017-09-22 09:20:57 +02:00
Yannick Welsch	76e1b7437c	[TEST] Remove assertSeqNos from testAckedIndexing	2017-09-22 08:31:36 +02:00
Jason Tedor	e0db89bc35	Upgrade to Lucene 7.0.0 This commit upgrades to the GA release of Lucene 7! Relates #26744	2017-09-21 19:19:33 -04:00
Jason Tedor	954fb1c80d	Reenable BWC tests after global checkpoint sync This commit reenables the BWC tests after the introduction of the post-operation and background global checkpoint sync. Relates #26591	2017-09-21 15:44:36 -04:00
Jason Tedor	f35d1de502	Introduce global checkpoint background sync It is the exciting return of the global checkpoint background sync. Long, long ago, in snapshot version far, far away we had and only had a global checkpoint background sync. This sync would fire periodically and send the global checkpoint from the primary shard to the replicas so that they could update their local knowledge of the global checkpoint. Later in time, as we sped ahead towards finalizing the initial version of sequence IDs, we realized that we need the global checkpoint updates to be inline. This means that on a replication operation, the primary shard would piggy back the global checkpoint with the replication operation to the replicas. The replicas would update their local knowledge of the global checkpoint and reply with their local checkpoint. However, this could allow the global checkpoint on the primary to advance again and the replicas would fall behind in their local knowledge of the global checkpoint. If another replication operation never fired, then the replicas would be permanently behind. To account for this, we added one more sync that would fire when the primary shard fell idle. However, this has problems: - the shard idle timer defaults to five minutes, a long time to wait for the replicas to learn of the new global checkpoint - if a replica missed the sync, there was no follow-up sync to catch them up - there is an inherent race condition where the primary shard could fall idle mid-operation (after having sent the replication request to the replicas); in this case, there would never be a background sync after the operation completes - tying the global checkpoint sync to the idle timer was never natural To fix this, we add two additional changes for the global checkpoint to be synced to the replicas. The first is that we add a post-operation sync that only fires if there are no operations in flight and there is a lagging replica. This gives us a chance to sync the global checkpoint to the replicas immediately after an operation so that they are always kept up to date. The second is that we add back a global checkpoint background sync that fires on a timer. This timer fires every thirty seconds, and is not configurable (for simplicity). This background sync is smarter than what we had previously in the sense that it only sends a sync if the global checkpoint on at least one replica is lagging that of the primary. When the timer fires, we can compare the global checkpoint on the primary to its knowledge of the global checkpoint on the replicas and only send a sync if there is a shard behind. Relates #26591	2017-09-21 15:34:13 -04:00
James Baiera	c760eec054	Add permission checks before reading from HDFS stream (#26716 ) Add checks for special permissions before reading hdfs stream data. Also adds test from readonly repository fix. MiniHDFS will now start with an existing repository with a single snapshot contained within. Readonly Repository is created in tests and attempts to list the snapshots within this repo.	2017-09-21 11:55:07 -04:00
Martijn van Groningen	fda8f8b827	muted test	2017-09-21 17:24:18 +02:00
wasserman	67845134de	[Docs] Fixed typo of configuration (#25058 )	2017-09-21 16:49:00 +02:00
kel	601be4f83e	Add azure storage endpoint suffix #26432 (#26568 ) Allow specifying azure storage endpoint suffix for an azure client.	2017-09-20 22:26:19 -07:00
lcawley	06551a8549	[DOCS] Added index-shared4 and index-shared5.asciidoc	2017-09-20 10:54:26 -07:00
Jay Modi	c47f24d406	BulkProcessor flush runnable preserves the thread context from creation time (#26718 ) When using a bulk processor, the thread context was not preserved for the flush runnable which is executed in another thread in the thread pool. This change wraps the flush runnable in a context preserving runnable so that the headers and transients from the creation time of the bulk processor are available during the execution of the flush. Closes #26596	2017-09-20 10:19:42 -06:00
Simon Willnauer	b9c0d4447c	Catch exceptions and inform handler in RemoteClusterConnection#collectNodes (#26725 ) This adds a missing catch block to invoke the action listener instead of bubbling up the exception. Closes #26700	2017-09-20 17:53:12 +02:00
Tahmim Ahmed Shibli	34662c9e6d	[Docs] Fix name of character filter in example. (#26724 )	2017-09-20 17:08:43 +02:00
Christoph Büscher	86b00b84bc	Remove parse field deprecations in query builders (#26711 ) The `fielddata` field and the use of the `_name` field in the short syntax of the range query have been deprecated in 5.0 and can be removed. The same goes for the deprecated `score_mode` field in HasParentQueryBuilder, the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query, the deprecated query name in the short version of the `regexp` query, and several deprecated alternative field names in other query builders.	2017-09-20 16:22:21 +02:00
Christoph Büscher	3d67915ed5	#26720 : Set the correct bwc version after backport to 6.0	2017-09-20 16:14:11 +02:00
Christoph Büscher	22e200e79a	Remove deprecated type and slop field in MatchQueryBuilder (#26720 ) The `type` field has been deprecated in 5.0 and can be removed. It has been replaced by using the MatchPhraseQueryBuilder or the MatchPhrasePrefixQueryBuilder. The `slop` field has also been deprecated and can be removed, the phrase and phrase prefix query builders still provide this parameter.	2017-09-20 14:24:30 +02:00
Yannick Welsch	5f407062ad	Refactoring of Gateway*** classes (#26706 ) - Removes mutual dependency between GatewayMetaState and TransportNodesListGatewayMetaState - Deguices MetaDataIndexUpgradeService - Deguices GatewayMetaState - Makes Gateway the master-level component that is only responsible for coordinating the state recovery	2017-09-20 12:51:58 +02:00
Tanguy Leroux	b3819e7f30	Make RestHighLevelClient's Request class public (#26627 ) Request class is currently package protected, making it difficult for users to extend the RestHighLevelClient and to use its protected methods to execute requests. This commit makes the Request class public and changes a few methods of RestHighLevelClient to be protected.	2017-09-20 11:36:10 +02:00
Yannick Welsch	ff1e26276d	Deguice ActionFilter (#26691 ) Allows instantiating TransportAction instances without Guice.	2017-09-20 10:30:21 +02:00
Martijn van Groningen	61849a1150	aggs: Allow aggregation sorting via nested aggregation. The nested aggregator now buffers all bucket ords per parent document and emits all bucket ords for a parent document's nested document once. This way the nested documents' DocIdSetIterator gets used once per bucket instead of wrapping the nested aggregator inside a multi bucket aggregator, which was the solution up to now. This allows sorting by buckets under a nested bucket. Closes #16838	2017-09-20 07:44:53 +02:00
Ryan Ernst	a1c766c75c	Build: Set bwc builds to always set snapshot (#26704 ) This commit enforces bwc builds always generate snapshot versions, even when testing release versions in CI. closes #26702	2017-09-19 17:41:51 -07:00
Ryan Ernst	bebff47b5b	File Discovery: Remove fallback with zen discovery (#26667 ) When adding file based discovery, we added a fallback when the discovery type was set to zen (the default, so everyone got this warning). This commit removes the fallback for 6.0. Setting file discovery should now happen explicitly through the hosts_provider setting. closes #26661	2017-09-19 16:32:34 -07:00
Jason Tedor	581a873124	Remove assertion from checkpoint tracker invariants This assertion is wrong because the global checkpoint on a promoted primary can be lagging the replicas until it catches up through resyncs, ongoing indexing operations and removing the old primary from the in-sync set.	2017-09-19 17:52:41 -04:00
Igor Motov	5090260119	Upgrade API: fix excessive logging and unnecessary template updates (#26698 ) TemplateUpgradeService might get stuck in repeatedly upgrading templates after upgrade to 5.6.0. This is caused by shuffling mappings definition in the template during template serialization. This commit makes the template serialization consistent. Closes #26673	2017-09-19 16:32:17 -04:00
Boaz Leskes	04385a9ce9	Restoring from snapshot should force generation of a new history uuid (#26694 ) Restoring a shard from snapshot throws the primary back in time violating assumptions and bringing the validity of global checkpoints into question. To avoid problems, we should make sure that a shard that was restored will never be the source of an ops based recovery to a shard that existed before the restore. To this end we have introduced the notion of `history_uuid` in #26577 and required that both source and target will have the same history to allow ops based recoveries. This PR makes sure that a shard gets a new uuid after restore. As suggested by @ywelsch , I derived the creation of a `history_uuid` from the `RecoverySource` of the shard. Store recovery will only generate a uuid if it doesn't already exist (we can make this stricter when we don't need to deal with 5.x indices). Peer recovery follows the same logic (note that this is different than the approach in #26557, I went this way as it means that shards always have a history uuid after being recovered on a 6.x node and will also mean that a rolling restart is enough for old indices to step over to the new seq no model). Local shards and snapshot force the generation of a new translog uuid. Relates #10708 Closes #26544	2017-09-19 15:58:36 +02:00
Martijn van Groningen	332b4d12fa	test: Use a single primary shard so that the exception can be caught in the same way	2017-09-19 15:14:24 +02:00
Jason Tedor	256721018b	Move pre-6.0 node checkpoint to SequenceNumbers This commit moves the pre-6.0 node checkpoint constant from SequenceNumbersService to SequenceNumbers so it can chill with the other sequence number-related constants. Relates #26690	2017-09-19 06:27:56 -04:00
Armin Braun	2db3bccd37	Invalid JSON request body caused endless loop (#26680 ) Request bodies that consist only of a String value can lead to endless loops in the parser of several REST requests, e.g. `_count`. Up to 5.2 this seems to have been caught in the logic guessing the content type of the request, but since then it causes the node to block. This change introduces checks for receiving a valid xContent object before starting the parsing in RestActions#parseTopLevelQueryBuilder(). Closes #26083	2017-09-19 12:02:05 +02:00
Martijn van Groningen	6c46a67dd6	added comment	2017-09-19 11:02:15 +02:00
Martijn van Groningen	a3a6ce6220	fix line length violation	2017-09-19 11:02:15 +02:00
Martijn van Groningen	f782f618cc	Moved the check to fetch phase. This basically means that we throw a better error message instead of an AOBE and not adding more restrictions.	2017-09-19 11:02:15 +02:00
Martijn van Groningen	d05aee7eda	inner hits: Do not allow inner hits that use _source and have a non nested object field as parent Closes #25315	2017-09-19 11:02:15 +02:00
Jack Conradson	c3746b268c	Separate Painless Whitelist Loading from the Painless Definition (#26540 ) Adds several small whitelist data structures and a new Whitelist class to separate the idea of loading a whitelist from the actual Painless Definition class. This is the first step of many in allowing users to define custom whitelists per context. Also supports the idea of loading multiple whitelists from different sources for a single context.	2017-09-18 15:51:07 -07:00
Tal Levy	cc726cb3b6	convert more admin requests to writeable (#26566 )	2017-09-18 13:19:34 -07:00
Nik Everett	98f8bde389	Handle release of 5.6.1 * Add a version constant for 5.6.2 so that the 5.6.1 constant represents the 5.6.1 release and the 5.6.2 constant represents the unreleased 5.6 branch.	2017-09-18 15:41:09 -04:00
Simon Willnauer	9f97f9072a	Allow `InputStreamStreamInput` array size validation where applicable (#26692 ) Today we can't validate the array length in `InputStreamStreamInput` since we can't rely on `InputStream.available`, yet in some situations we know the size of the stream and can apply additional validation.	2017-09-18 17:52:36 +02:00
Jason Tedor	23093adcb9	Update global checkpoint with permit after recovery After recovery completes from a primary, we now update the local knowledge on the primary of the global checkpoint on the recovery target. However if this occurs concurrently with a relocation, an assertion could trip that we are no longer in primary mode. As this local knowledge should only be tracked when we are in primary mode, updating this local knowledge should be done under a permit. This commit causes that to be the case. Relates #26666	2017-09-18 07:48:08 -04:00
Jason Tedor	6f25163aef	Filter pre-6.0 nodes for checkpoint invariants When checking that the global checkpoint on the primary is consistent with the local checkpoints of the in-sync shards, we have to filter pre-6.0 nodes from the check or the invariant will trivially trip. This commit filters these nodes out when checking this invariant. Relates #26666	2017-09-18 06:51:22 -04:00
Jason Tedor	5dd476feb5	Skip bad request REST test on pre-6.0 This commit adds a skip for the bad request REST test on pre-6.0 nodes. Previously, a request for /_(.*) where $1 is not an existing endpoint would return a 404. This is because the request would be treated as a get index request for an index named _$1. However, an index can never start with "_" so logic was added to detect this and return a 400 instead as this should be treated as a bad request. During the mixed-cluster BWC tests, a node running pre-6.0 code will still return a 404 though. Therefore, this test needs to be skipped in such a mixed-cluster scenario.	2017-09-18 06:46:10 -04:00
Jason Tedor	52e80a9292	Reenable BWC tests after disabling for backport This commit reenables the BWC tests after they were disabled for backporting the change to track global checkpoints of shard copies on the primary. Relates #26666	2017-09-18 06:28:50 -04:00
Jason Tedor	c238b79cf4	Add global checkpoint tracking on the primary This commit adds local tracking of the global checkpoints on all shard copies when a global checkpoint tracker is operating in primary mode. With this, we relay the global checkpoint on a shard copy back to the primary shard during replication operations. This serves as another step towards adding a background sync of the global checkpoint to the shard copies. Relates #26666	2017-09-18 06:04:44 -04:00
Tanguy Leroux	c16c653c3e	[Test] Fix reference/cat/allocation/line_8 test failure In this test, 260b is replaced by the regexp \d+b but the test sometimes produces results like 1.1kb so this commit adapts the regexp to match values with decimals	2017-09-18 10:46:19 +02:00
Peter Dyson	1f9e0fd0dd	[Docs] improved description for fs.total.available_in_bytes (#26657 )	2017-09-18 16:56:19 +10:00
Jason Tedor	bdd9953aa4	Fix discovery-file plugin to use custom config path The discovery-file plugin was not config path aware, so it always picked up the default config path (from Elasticsearch home) rather than a custom config path. This commit fixes the discovery-file plugin to respect a custom config path. Relates #26662	2017-09-16 11:00:33 -04:00
Boaz Leskes	0814ea3200	fix testSniffNodes to use the new error message relates to #26564	2017-09-16 10:43:48 +03:00
Michael Basnight	296c239611	Add check for invalid index in WildcardExpressionResolver (#26409 ) This commit adds validation to the resolving of indexes in the wildcard expression resolver. It no longer throws a 404 Not Found when resolving invalid indices. It throws a 400 instead, as it is an invalid index. This was the behavior of 5.x.	2017-09-15 17:00:41 -05:00
Dimitrios Liappis	b789ce737b	Docs: Use single-node discovery.type for dev example For the single node, dev example, the `discovery.type=single-node`[1],[2] is a perfect fit and makes the example shorter and more self explanatory. Also expose the transport port, to help with dev use-cases using the transport client. [1] https://github.com/elastic/elasticsearch/pull/23595 [2] https://github.com/elastic/elasticsearch/pull/23598 Relates #26289	2017-09-15 16:14:47 +03:00
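
The two sync conditions described in commit f35d1de502 above (a post-operation sync and a thirty-second background sync, both sent only when a replica is lagging) can be illustrated with a minimal Java sketch. This is not the code from that commit: the class name, method names, and the map of replica checkpoints are hypothetical and chosen only for illustration; only the thirty-second interval and the two conditions come from the commit message.

```java
import java.util.Map;

/** Hypothetical sketch; none of these names are taken from the actual code base. */
final class GlobalCheckpointSyncSketch {

    /** Background timer interval stated in the commit message (not configurable). */
    static final long BACKGROUND_SYNC_INTERVAL_SECONDS = 30;

    /**
     * Returns true if the primary's knowledge of any replica's global
     * checkpoint is behind the primary's own global checkpoint.
     */
    static boolean hasLaggingReplica(long primaryGlobalCheckpoint,
                                     Map<String, Long> replicaGlobalCheckpoints) {
        for (long replicaCheckpoint : replicaGlobalCheckpoints.values()) {
            if (replicaCheckpoint < primaryGlobalCheckpoint) {
                return true;
            }
        }
        return false;
    }

    /** Post-operation sync: only when no operations are in flight and a replica lags. */
    static boolean shouldSyncAfterOperation(int operationsInFlight,
                                            long primaryGlobalCheckpoint,
                                            Map<String, Long> replicaGlobalCheckpoints) {
        return operationsInFlight == 0
            && hasLaggingReplica(primaryGlobalCheckpoint, replicaGlobalCheckpoints);
    }

    /** Background sync: when the timer fires, send a sync only if a replica lags. */
    static boolean shouldSyncOnTimer(long primaryGlobalCheckpoint,
                                     Map<String, Long> replicaGlobalCheckpoints) {
        return hasLaggingReplica(primaryGlobalCheckpoint, replicaGlobalCheckpoints);
    }

    public static void main(String[] args) {
        // Assumed example state: one replica is one operation behind the primary.
        Map<String, Long> replicas = Map.of("replica-0", 41L, "replica-1", 42L);
        System.out.println(shouldSyncAfterOperation(0, 42L, replicas));
        System.out.println(shouldSyncAfterOperation(1, 42L, replicas));
        System.out.println(shouldSyncOnTimer(42L, Map.of("replica-0", 42L)));
    }
}
```

Keeping both checks behind the same lagging-replica predicate mirrors the message's point that the new background sync stays silent when every replica is already up to date.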


28784 Commits