OpenSearch

Commit Graph

Author	SHA1	Message	Date
Luca Cavanna	13f9e922f3	REST client: hosts marked dead for the first time should not be immediately retried (#29230 ) This was the plan from day one but due to a silly bug nodes were immediately retried after they were marked as dead for the first time. From the second time on, the expected backoff was applied.	2018-03-27 16:15:44 +02:00
Nhat Nguyen	d1d3edf156	TEST: Use different translog dir for a new engine In #testPruneOnlyDeletesAtMostLocalCheckpoint, we create a new engine but mistakenly use the same translog directory of the existing engine. This prevents translog files from cleaning up when closing the engines. ERROR 0.12s J2 \| InternalEngineTests.testPruneOnlyDeletesAtMostLocalCheckpoint <<< FAILURES! > Throwable #1: java.io.IOException: could not remove the following files (in the order of attempts): > translog-primary-060/translog-2.tlog: java.io.IOException: access denied: This commit makes sure to use a separate directory for each engine in this tes.	2018-03-27 09:45:51 -04:00
Christoph Büscher	8d6832c5ee	Make SearchStats implement Writeable (#29258 ) Moves another class over from Streamable to Writeable. By this, also some constructors can be removed or made private.	2018-03-27 15:21:11 +02:00
Colin Goodheart-Smithe	9972710e9e	Makes brnach compile Commented out toSteps implem,entations and other bits needed to get the branch to compile	2018-03-27 12:14:38 +01:00
Andrew Banchich	d2baf4b191	[Docs] Spelling and grammar changes to reindex.asciidoc (#29232 )	2018-03-27 12:17:46 +02:00
Tal Levy	f429fc0b3e	begin making sense of types	2018-03-26 18:48:59 -07:00
Tal Levy	083c563cf6	meh	2018-03-26 18:12:38 -07:00
Tal Levy	57821cd55a	moar refactor for steps	2018-03-26 17:54:15 -07:00
Nhat Nguyen	0ac89a32cc	Do not optimize append-only if seen normal op with higher seqno (#28787 ) When processing an append-only operation, primary knows that operations can only conflict with another instance of the same operation. This is true as the id was freshly generated. However this property doesn't hold for replicas. As soon as an auto-generated ID was indexed into the primary, it can be exposed to a search and users can issue a follow up operation on it. In extremely rare cases, the follow up operation can be arrived and processed on a replica before the original append-only request. In this case we can't simply proceed with the append-only request and blindly add it to the index without consulting the version map. The following scenario can cause difference between primary and replica. 1. Primary indexes an auto-gen-id doc. (id=X, v=1, s#=20) 2. A refresh cycle happens on primary 3. The new doc is picked up and modified - say by a delete by query request - Primary gets a delete doc (id=X, v=2, s#=30) 4. Delete doc is processed first on the replica (id=X, v=2, s#=30) 5. Indexing operation arrives on the replica, since it's an auto-gen-id request and the retry marker is lower, we put it into lucene without any check. Replica has a doc the primary doesn't have. To deal with a potential conflict between an append-only operation and a normal operation on replicas, we need to rely on sequence numbers. This commit maintains the max seqno of non-append-only operations on replica then only apply optimization for an append-only operation only if its seq# is higher than the seq# of all non-append-only.	2018-03-26 16:56:12 -04:00
Andy Bristol	f3cd9a69a2	[test] packaging: renamed packaging configuration (elastic/x-pack-elasticsearch#4112 ) For elastic/elasticsearch#26741 Original commit: elastic/x-pack-elasticsearch@401e9bb0e4	2018-03-26 13:43:29 -07:00
Andy Bristol	7bf9091942	[test] packaging: gradle tasks for groovy tests (#29046 ) The vagrant test plugin adds tasks for the groovy packaging tests, which run after the bats packaging test tasks.Rename the 'bats' configuration to 'packaging' and remove the option to inherit archives from this configuration.	2018-03-26 13:43:09 -07:00
Nhat Nguyen	87957603c0	Prune only gc deletes below local checkpoint (#28790 ) Once a document is deleted and Lucene is refreshed, we will not be able to look up the `version/seq#` associated with that delete in Lucene. As conflicting operations can still be indexed, we need another mechanism to remember these deletes. Therefore deletes should still be stored in the Version Map, even after Lucene is refreshed. Obviously, we can't remember all deletes forever so a trimming mechanism is needed. Currently, we remember deletes for at least 1 minute (the default GC deletes cycle) and clean them periodically. This is, at the moment, the best we can do on the primary for user facing APIs but this arbitrary time limit is problematic for replicas. Furthermore, we can't rely on the primary and replicas doing the trimming in a synchronized manner, and failing to do so results in the replica and primary making different decisions. The following scenario can cause inconsistency between primary and replica. 1. Primary index doc (index, id=1, v2) 2. Network packet issue causes index operation to back off and wait 3. Primary deletes doc (delete, id=1, v3) 4. Replica processes delete (delete, id=1, v3) 5. 1+ minute passes (GC deletes runs replica) 6. Indexing op is finally sent to the replica which no processes it because it forgot about the delete. We can reply on sequence-numbers to prevent this issue. If we prune only deletes whose seqno at most the local checkpoint, a replica will correctly remember what it needs. The correctness is explained as follows: Suppose o1 and o2 are two operations on the same document with seq#(o1) < seq#(o2), and o2 arrives before o1 on the replica. o2 is processed normally since it arrives first; when o1 arrives it should be discarded: 1. If seq#(o1) <= LCP, then it will be not be added to Lucene, as it was already previously added. 2. If seq#(o1) > LCP, then it depends on the nature of o2: - If o2 is a delete then its seq# is recorded in the VersionMap, since seq#(o2) > seq#(o1) > LCP, so a lookup can find it and determine that o1 is stale. - If o2 is an indexing then its seq# is either in Lucene (if refreshed) or the VersionMap (if not refreshed yet), so a real-time lookup can find it and determine that o1 is stale. In this PR, we prefer to deploy a single trimming strategy, which satisfies both requirements, on primary and replicas because: - It's simpler - no need to distinguish if an engine is running at primary mode or replica mode or being promoted. - If a replica subsequently is promoted, user experience is fully maintained as that replica remembers deletes for the last GC cycle. However, the version map may consume less memory if we deploy two different trimming strategies for primary and replicas.	2018-03-26 13:42:08 -04:00
Dimitris Athanasiou	afb6a06f61	[ML] Model snapshot min_version is now present since 7.0.0 Original commit: elastic/x-pack-elasticsearch@39d193461d	2018-03-26 17:09:11 +01:00
Chris Earle	a600350d4c	[Monitoring] Remove 202 responses in favor of 200 responses (elastic/x-pack-elasticsearch#4213 ) This changes `_xpack/monitoring/_bulk` to fundamentally behave in the same way as `_bulk` and never return 202 when data is ignored (something `_bulk` cannot do). Instead, anyone interested will have to inspect the returned response for the ignored flag. Original commit: elastic/x-pack-elasticsearch@07254a006d	2018-03-26 11:36:04 -04:00
Boaz Leskes	bca264699a	remove testUnassignedShardAndEmptyNodesInRoutingTable testUnassignedShardAndEmptyNodesInRoutingTable and that test is as old as time and does a very bogus thing. it is an IT test which extracts the GatewayAllocator from the node and tells it to allocated unassigned shards, while giving it a conjured cluster state with no nodes in it (it uses the DiscoveryNodes.EMPTY_NODES. This is never a cluster state we want to reroute on (we always have at least master node in it). I'm going to just delete the test as I don't think it adds much value. Closes #21463	2018-03-26 17:10:57 +02:00
Tal Levy	d63cd8c9c3	step by step	2018-03-26 08:00:03 -07:00
Alexander Reelsen	67badaadb0	Docs: Fix secure settings link Original commit: elastic/x-pack-elasticsearch@f98a8dabc6	2018-03-26 15:32:27 +02:00
Jim Ferenczi	dd77d7fd0a	#28745 : remove extra option in the composite rest tests `allow_partial_search_results` is not needed for these tests.	2018-03-26 14:32:59 +02:00
Alexander Reelsen	c2764cef98	Docs: Fix deprecation notices and typo to build docs Original commit: elastic/x-pack-elasticsearch@6e5504efd9	2018-03-26 14:25:42 +02:00
Boaz Leskes	f5d4550e93	Fold EngineDiskUtils into Store, for better lock semantics (#29156 ) #28245 has introduced the utility class`EngineDiskUtils` with a set of methods to prepare/change translog and lucene commit points. That util class bundled everything that's needed to create and empty shard, bootstrap a shard from a lucene index that was just restored etc. In order to safely do these manipulations, the util methods acquired the IndexWriter's lock. That would sometime fail due to concurrent shard store fetching or other short activities that require the files not to be changed while they read from them. Since there is no way to wait on the index writer lock, the `Store` class has other locks to make sure that once we try to acquire the IW lock, it will succeed. To side step this waiting problem, this PR folds `EngineDiskUtils` into `Store`. Sadly this comes with a price - the store class doesn't and shouldn't know about the translog. As such the logic is slightly less tight and callers have to do the translog manipulations on their own.	2018-03-26 14:08:03 +02:00
Christoph Büscher	a9392f6d42	Add file permissions checks to precommit task This adds a check for source files that have the execute bit set to the precommit task.	2018-03-26 13:37:55 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Jim Ferenczi	3a75435980	Fix IndexerUtilsTests that relies on indexed fields This test creates doc values fields only but does not set the index options to none. This commit fixes this discrepancy by adding an indexed point field for all doc values field. relates elastic/x-pack-elasticsearch#4223 Original commit: elastic/x-pack-elasticsearch@8adab7c849	2018-03-26 13:37:18 +02:00
David Turner	8c8de0a774	Mute failing IndexerUtilsTests Awaiting a fix of elastic/x-pack-elasticsearch#4223 Original commit: elastic/x-pack-elasticsearch@d385099719	2018-03-26 10:57:34 +01:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Alexander Reelsen	6eeacf339c	Build: Use environment variables for credentials (elastic/x-pack-elasticsearch#4058 ) The credentials now get injected via environment variables, so that external services can pull those. As soon as the specified environment variables are set, the tests are run. No need to check for the @Network annotation This also introduces new secret store settings for the secure settings in order to be sure to not leak them in the configuration files, that get dumped. Relates elastic/x-pack-elasticsearch#3800 Original commit: elastic/x-pack-elasticsearch@a2cfb9cb86	2018-03-26 09:10:04 +02:00
Jason Tedor	e66072c09f	Enable security in packaging tests (elastic/x-pack-elasticsearch#4216 ) Now that security is not enabled by default for a trial license, the packaging tests are failing because they expect security to be enabled. This commit adds enabling security in all instances started during the packaging tests. Original commit: elastic/x-pack-elasticsearch@9838393ecb	2018-03-24 15:36:05 -04:00
Tim Sullivan	05a0d6273c	[Monitoring/Beats] Add new CPU fields, remove old CPU fields (elastic/x-pack-elasticsearch#3991 ) * [Monitoring/Beats] Add new CPU fields, remove old CPU fields * use long instead of double for cpu counters * time => time.ms Original commit: elastic/x-pack-elasticsearch@244b08a574	2018-03-23 16:19:40 -07:00
Dimitris Athanasiou	67c64a6dfd	[ML] Return error when process cause has been killed (elastic/x-pack-elasticsearch#4211 ) relates elastic/x-pack-elasticsearch#4210 Original commit: elastic/x-pack-elasticsearch@c5169328ee	2018-03-23 17:30:10 +00:00
Christoph Büscher	afe95a7738	[Docs] Add rank_eval size parameter k (#29218 ) The rank_eval documentation was missing an explanation of the parameter `k` that controls the number of top hits that are used in the ranking evaluation. Closes #29205	2018-03-23 18:04:32 +01:00
Nicholas Knize	d400a08788	[DOCS] Remove ignore_z_value parameter link Removes invalid ignore_z_value parameter link in geo-point.asciidoc.	2018-03-23 11:07:24 -05:00
Jean-Charles Legras	687fe860ac	Docs: Update docs/index_.asciidoc (#29172 ) Use `_doc` in the routing example instead of `tweet` to agree with the text and line up with the other examples.	2018-03-23 11:35:10 -04:00
Petr Novák	16bffc7394	Docs: Link C++ client lib elasticlient (#28949 ) elasticlient is simple library for simplified work with Elasticsearch in C++	2018-03-23 11:30:01 -04:00
Yannick Welsch	3b8a8867c4	[DOCS] Unregister repository instead of deleting it (#29206 ) Relates to #15426	2018-03-23 15:53:36 +01:00
Nik Everett	8c59e43ac7	Docs: HighLevelRestClient#multiSearch (#29144 ) Adds docs for `HighLevelRestClient#multiSearch`. Unlike the `multiGet` docs these are much more sparse because multi-search doesn't support setting many options on the `MultiSearchRequest` and instead just wraps a list of `SearchRequest`s. Closes #28389	2018-03-23 10:11:50 -04:00
Nicholas Knize	fede633563	Add Z value support to geo_shape This enhancement adds Z value support (source only) to geo_shape fields. If vertices are provided with a third dimension, the third dimension is ignored for indexing but returned as part of source. Like beofre, any values greater than the 3rd dimension are ignored. closes #23747	2018-03-23 08:50:55 -05:00
Dimitris Athanasiou	5f219bd70f	[ML][DOCS] Remove empty rules from docs Original commit: elastic/x-pack-elasticsearch@dee88e1161	2018-03-23 12:31:36 +00:00
Nhat Nguyen	794de63232	Remove type casts in logging in server component (#28807 ) This commit removes type-casts in logging in the server component (other components will be done later). This also adds a parameterized message test which would catch breaking-changes related to lambdas in Log4J.	2018-03-23 07:35:50 -04:00
Dimitris Athanasiou	c4ff5ad3ed	[ML] Do not serialize rules when empty (elastic/x-pack-elasticsearch#4203 ) Original commit: elastic/x-pack-elasticsearch@18d731cb35	2018-03-23 11:21:27 +00:00
Alexander Reelsen	f6d318a782	Watcher: Prevent question mark in HttpClient with empty params (elastic/x-pack-elasticsearch#4206 ) The HTTPClient in watcher always appended a question mark at the end of an URL, regardless if parameters were used or not. This commit adds a check to only pass valid parameters to the URI construction. Original commit: elastic/x-pack-elasticsearch@184f8f441c	2018-03-23 12:16:34 +01:00
Costin Leau	264c88f445	SQL: Introduce CSV and TSV tabular output (elastic/x-pack-elasticsearch#4190 ) When running SQL REST queries, a client can ask (through Accept header) for the data to be returned in CSV or TSV format in addition to plain text, json & co. Original commit: elastic/x-pack-elasticsearch@12d87b3033	2018-03-23 12:23:00 +02:00
Yu	4a8099c696	Change BroadcastResponse from ToXContentFragment to ToXContentObject (#28878 ) While working on #27799, we find that it might make sense to change BroadcastResponse from ToXContentFragment to ToXContentObject, seeing that it's rather a complete XContent object and also the other Responses are normally ToXContentObject. By doing this, we can also move the XContent build logic of BroadcastResponse's subclasses, from Rest Layer to the concrete classes themselves. Relates to #3889	2018-03-23 10:53:37 +01:00
javanna	d143d26bbd	Adapt to RecoveryResponse change upstream See https://github.com/elastic/elasticsearch/pull/28878 , RecoveryResponse doesn't accept the detailed boolean flag anymore in its constructor as it was unused. Original commit: elastic/x-pack-elasticsearch@d96df3448e	2018-03-23 10:48:12 +01:00
Milan Chovatiya	8328b9c5cd	REST : Split `RestUpgradeAction` into two actions (#29124 ) Closes #29062	2018-03-23 10:37:31 +01:00
Jason Tedor	111f0788a2	Add error file docs to important settings This commit adds the error file documentation to the important settings docs so that the page is actually visible.	2018-03-22 23:06:53 -04:00
Jason Tedor	a9677023da	Add note to low-level client docs for DNS caching (#29213 ) This commit adds a note to the low-level REST client docs regarding the possibility of being impacted by the JVM DNS cache policy under a default security manager policy.	2018-03-22 21:23:52 -04:00
Zachary Tong	8296dad5ec	[TEST] disable Upgrade YAML tests Tracking issue: elastic/x-pack-elasticsearch#4197 Original commit: elastic/x-pack-elasticsearch@cc2c7ad788	2018-03-22 18:39:27 +00:00
Nhat Nguyen	14157c8705	Harden periodically check to avoid endless flush loop (#29125 ) In #28350, we fixed an endless flushing loop which may happen on replicas by tightening the relation between the flush action and the periodically flush condition. 1. The periodically flush condition is enabled only if it is disabled after a flush. 2. If the periodically flush condition is enabled then a flush will actually happen regardless of Lucene state. (1) and (2) guarantee that a flushing loop will be terminated. Sadly, the condition 1 can be violated in edge cases as we used two different algorithms to evaluate the current and future uncommitted translog size. - We use method `uncommittedSizeInBytes` to calculate current uncommitted size. It is the sum of translogs whose generation at least the minGen (determined by a given seqno). We pick a continuous range of translogs since the minGen to evaluate the current uncommitted size. - We use method `sizeOfGensAboveSeqNoInBytes` to calculate the future uncommitted size. It is the sum of translogs whose maxSeqNo at least the given seqNo. Here we don't pick a range but select translog one by one. Suppose we have 3 translogs `gen1={#1,#2}, gen2={}, gen3={#3} and seqno=#1`, `uncommittedSizeInBytes` is the sum of gen1, gen2, and gen3 while `sizeOfGensAboveSeqNoInBytes` is the sum of gen1 and gen3. Gen2 is excluded because its maxSeqno is still -1. This commit removes both `sizeOfGensAboveSeqNoInBytes` and `uncommittedSizeInBytes` methods, then enforces an engine to use only `sizeInBytesByMinGen` method to evaluate the periodically flush condition. Closes #29097 Relates ##28350	2018-03-22 14:31:15 -04:00
Alexander Reelsen	23b4368fe4	Docs: Fix encrypt watcher sensitive data documentation (elastic/x-pack-elasticsearch#4198 ) The documentation mentions that the xpack.watcher.encrypt_sensitive_data setting needs to be set in the keystore. This is wrong however, it needs to be set in the standard elasticsearch yaml file. relates elastic/x-pack-elasticsearch#4195 Original commit: elastic/x-pack-elasticsearch@613d63da85	2018-03-22 18:57:31 +01:00
David Kyle	179090c840	[ML] Unclutter failed job assignment explanations (elastic/x-pack-elasticsearch#4179 ) Unclutter failed job assignment explanations Original commit: elastic/x-pack-elasticsearch@1c3deebaac	2018-03-22 17:45:57 +00:00

... 9 10 11 12 13 ...

38907 Commits All Branches Search

38907 Commits

All Branches