OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	b1cef5fdf8	Remove 2.0 prerelease version constants (#22004 ) * Remove 2.0 prerelease version constants This is a start to addressing #21887. This removes: * pre 2.0 snapshot format support * automatic units addition to cluster settings * bwc check for delete by query in pre 2.0 indexes	2016-12-08 21:48:35 -08:00
Areek Zillur	f766dfca82	Merge branch 'master' into enhancement/use_shard_bulk_for_single_ops	2016-12-09 00:44:04 -05:00
Igor Motov	7f79c99e9a	Add descriptions to bulk tasks Related to #21768	2016-12-08 21:59:52 -05:00
Lee Hinman	ef64d230e7	Merge remote-tracking branch 'dakrone/index-seq-id-and-primary-term'	2016-12-08 19:47:21 -07:00
Lee Hinman	ee22a477df	Add internal _primary_term doc values field, fix _seq_no indexing This adds the `_primary_term` field internally to the mappings. This field is populated with the current shard's primary term. It is intended to be used for collision resolution when two document copies have the same sequence id, therefore, doc_values for the field are stored but the field itself is not indexed. This also fixes the `_seq_no` field so that doc_values are retrievable (they were previously stored but irretrievable) and changes the `stats` implementation to more efficiently use the points API to retrieve the min/max instead of iterating on each doc_value value. Additionally, even though we intend to be able to search on the field, it was previously not searchable. This commit makes it searchable. There is no user-visible `_primary_term` field. Instead, the fields are updated by calling: ```java index.parsedDoc().updateSeqID(seqNum, primaryTerm); ``` This includes example methods in `Versions` and `Engine` for retrieving the sequence id values from the index (see `Engine.getSequenceID`) that are only used in unit tests. These will be extended/replaced by actual implementations once we make use of sequence numbers as a conflict resolution measure. Relates to #10708 Supersedes #21480 P.S. As a side effect of this commit, `SlowCompositeReaderWrapper` cannot be used for documents that contain `_seq_no` because it is a Point value and SCRW cannot wrap documents with points, so the tests have been updated to loop through the `LeafReaderContext`s now instead.	2016-12-08 19:47:03 -07:00
Jason Tedor	c9882dd1a0	Avoid NPE in NodeService#stats if HTTP is disabled This commit adds safety against an NPE if HTTP stats are requested but HTTP is disabled on a node. Relates #22060	2016-12-08 19:59:02 -05:00
Areek Zillur	4231aa4feb	Add bwc for index/delete requests from pre-6.0 nodes	2016-12-08 17:17:41 -05:00
Jason Tedor	4aae017891	Skip IP range query REST test prior to 5.1.2 This commit adds a skip for the IP range query REST test on versions prior to 5.1.2 due to an exclusive bug on the top end of the range.	2016-12-08 16:40:39 -05:00
Jason Tedor	f713106827	Bump version to 5.1.2 This commit bumps the version to 5.1.2. Relates #22057	2016-12-08 16:40:39 -05:00
Nik Everett	e9bb8d8b38	Don't allow yaml tests with `warnings` that don't skip `warnings` (#21989 ) If you write a yaml test with a `warnings` section in a `do` block that doesn't also have a corresponding `skip` section for `warnings` then client test runners that don't support `warnings` will fail. This causes the elasticsearch build to fail so we catch these errors earlier. Related to #21811	2016-12-08 13:17:31 -05:00
David Pilato	8b0df47381	readonly on azure repository must be taken into account While I was fixing a documentation issue (#22007), I looked at the code and discovered that we actually never read what the user entered as a `readonly` parameter when he creates an azure repository. So if someone sends: ``` PUT _snapshot/my_backup4 { "type": "azure", "settings": { "account": "my_account2", "location_mode": "primary_only", "readonly": true } } ``` The repository is not actually defined as `readonly`. It's caused by the fact we are always overwriting the `readonly` setting based on `location_mode`. If a user sets it to `primary_only`, `readonly` is forced to `false`. If a user sets it to `primary_then_secondary`, `readonly` is forced to `false`. If a user sets it to `secondary_only`, `readonly` is forced to `true`. Note that with this change, a user can force a `secondary_only` repository to `readonly: false` which will lead him to an error later on when we check the repository as per definition in Azure, a secondary repository is not writable. Another option could have been to detect this mismatch and throw an exception in that case. Not sure it is worth writing more code though. Closes #22053.	2016-12-08 18:54:00 +01:00
Ali Beyad	3da04293f3	Cannot force allocate primary to a node where the shard already exists (#22031 ) Before, it was possible that the SameShardAllocationDecider would allow force allocation of an unassigned primary to the same node on which an active replica is assigned. This could only happen with shadow replica indices, because when a shadow replica primary fails, the replica gets promoted to primary but in the INITIALIZED state, not in the STARTED state (because the engine has specific reinitialization that must take place in the case of shadow replicas). Therefore, if the now promoted primary that is initializing fails also, the primary will be in the unassigned state, because replica to primary promotion only happens when the failed shard was in the started state. The now unassigned primary shard will go through the allocation deciders, where the SameShardAllocationDecider would return a NO decision, but would still permit force allocation on the primary if all deciders returned NO. This commit implements canForceAllocatePrimary on the SameShardAllocationDecider, which ensures that a primary cannot be force allocated to the same node on which an active replica already exists.	2016-12-08 12:21:19 -05:00
Adrien Grand	8fe4bc1b74	Fix REST test for ip range aggregations. Relates to #22018	2016-12-08 18:00:56 +01:00
Nik Everett	0f7c20ae81	Build: NORELEASE is the same as norelease (#22006 ) Changes the build to recognize `NORELEASE` as well as `NOCOMMIT` to mean the same thing as `norelease` and `nocommit` respectively. This is useful because people have been using them that way but haven't realized that only the lowercase versions worked. This also explicitly forbids silly things like `NoReLeAsE` and `noCOMMIT`, failing the build and telling you to spell them properly.	2016-12-08 11:50:03 -05:00
David Pilato	18a3d6b4f3	S3/Azure snapshot repo documentation wrong for "read_only" We used to write that people should use `read_only` although it should be `readonly`. Closes #22007.	2016-12-08 16:57:50 +01:00
Adrien Grand	182e119699	IP range masks exclude the maximum address of the range. (#22018 ) Closes #22005	2016-12-08 15:58:32 +01:00
makeyang	ce0ad4e08e	add test case for parse include_in_all in multi fields	2016-12-08 19:44:40 +08:00
Ali Beyad	30bcb06606	When shard data is still being fetched from nodes in the cluster, the ReplicaShardAllocator, when in explain mode, would get the node decisions for all nodes in the cluster. The PrimaryShardAllocator neglected to do this and tried to use the shard fetch data in explain mode, which had not yet been fully fetched. This commit fixes this by ensuring the PrimaryShardAllocator gets node decisions in the same way the ReplicaShardAllocator does in explain mode, if shard data is still being fetched.	2016-12-07 22:21:09 -05:00
makeyang	46cdb411b5	modified code according to nik9000's comments.	2016-12-08 11:04:33 +08:00
Jared Carey	317866894e	Fix systemd override example in configuring docs When overriding a systemd configuration via a drop-in file, the [Service] header is required. This commit adds this to an example drop-in override in the configuring docs. Relates #22038	2016-12-07 19:41:59 -05:00
Ali Beyad	e6e7bab58c	Prepares allocator decision objects for use with the allocation explain API (#21691 ) This commit enhances the allocator decision result objects (namely, AllocateUnassignedDecision, MoveDecision, and RebalanceDecision) to enable them to be used directly by the cluster allocation explain API. In particular, this commit does the following: - Adds serialization and toXContent methods to the response objects, which will form the explain API responses. - Moves the calculation of the final explanation to the response object itself, removing it from the responsibility of the allocators. - Adds shard store information to the NodeAllocationResult, so that store information is available for each node, when explaining a shard allocation by the PrimaryShardAllocator or the ReplicaShardAllocator. - Removes RebalanceDecision in favor of using MoveDecision for both moving and rebalancing shards. - Removes NodeRebalanceResult in favor of using NodeAllocationResult. - Changes the notion of weight ranking to be relative to the current node, instead of an absolute weight that doesn't convey any added value to the API user and can be confusing. - Introduces a new enum AllocationDecision to convey the decision type, which enables conveying unassigned, moving, and rebalancing scenarios with more detail as opposed to just Decision.Type and AllocationStatus.	2016-12-07 17:37:51 -05:00
Guilherme	9fbfe540d5	add Lua client (#22028 ) Add entry for elasticsearch-lua (https://github.com/DhavalKapil/elasticsearch-lua)	2016-12-07 11:24:28 -07:00
Ali Beyad	05f64c550a	[TEST] fixes line length issue in BulkRequestModifierTests	2016-12-07 13:11:55 -05:00
Thibault Pierre	e494d6a94e	Fix wrong link (#22019 )	2016-12-07 17:58:46 +01:00
Ryan Ernst	f02a2b6546	Ingest: Moved ingest invocation into index/bulk actions (#22015 ) * Ingest: Moved ingest invocation into index/bulk actions Ingest was originally set up as a plugin, and in order to hook into the index and bulk actions, action filters were used. However, ingest was later moved into core, but the action filters were never removed. This change moves the execution of ingest into the index and bulk actions. * Address PR comments * Remove forwarder direct dependency on ClusterService	2016-12-07 08:43:26 -08:00
Colin Goodheart-Smithe	8006b105f3	Update order examples to use max instead of avg (#22032 ) The use of the avg aggregation for sorting the terms aggregation is not encouraged since it has unbounded error. This changes the examples to use the max aggregation which does not suffer the same issues	2016-12-07 16:00:24 +00:00
Christoph Büscher	7454a9647b	Add fromXContent to HighlightField This adds a fromXContent method and unit test to the HighlightField class so we can parse it as part of a search response. This is part of the preparation for parsing search responses on the client side.	2016-12-07 16:32:44 +01:00
David Pilato	8923b36780	Merge pull request #21956 from alexshadow007/aws_read_timeout Add setting to set read timeout for EC2 discovery and S3 repository plugins	2016-12-07 16:00:48 +01:00
Yannick Welsch	c87cc15d49	Add toString() for TransportReplicationAction.ConcreteShardRequest	2016-12-07 15:58:22 +01:00
Christoph Büscher	31a1c2e240	Remove redundant source setters from IndexRequestBuilder	2016-12-07 15:20:36 +01:00
Nikhil Patel	b5e3d351d9	Fix typos in threads docs This commit fixes a typo in the threads docs where the past tense form of a verb was used when current tense is needed. Relates #22016	2016-12-07 08:22:49 -05:00
Lucas Bremgartner	0086b99797	[Docs] Correct setting name in snapshot/restore documentation (#22023 ) There is no setting include_cluster_state for snapshot restore. The correct name for this setting is include_global_state.	2016-12-07 14:12:10 +01:00
Yannick Welsch	9630b1a6e7	Promote shadow replica to primary when initializing primary fails (#22021 ) Failing an initializing primary when shadow replicas are enabled for the index can leave the primary unassigned with replicas being active. Instead, a replica should be promoted to primary, which is fixed by this commit.	2016-12-07 13:59:43 +01:00
Yannick Welsch	13e1a6fd40	Trim in-sync allocations set only when it grows (#21976 ) This commit makes two changes to how the in-sync allocations set is updated: - the set is only trimmed when it grows. This prevents trimming too eagerly when the number of replicas was decreased while shards were unassigned. - the allocation id of an active primary that failed is only removed from the in-sync set if another replica gets promoted to primary. This prevents the situation where the only available shard copy in the cluster gets removed from the in-sync set. Closes #21719	2016-12-07 10:59:11 +01:00
Adrien Grand	c746854e03	Pre-built analysis factories do not implement MultiTermAware correctly. (#21981 ) We had tests for the regular factories, but not for the pre-built ones, that ship by default without requiring users to define them in the analysis settings.	2016-12-07 10:32:25 +01:00
Adrien Grand	33b8d7a19d	Expose `ip` fields as strings in scripts. (#21997 ) Currently we expose the internal representation that we use for ip addresses, which are the ipv6 bytes. However, this is not really usable, exposes internal implementation details and also does not work fine with other APIs that expect that the values can be `toString`'d. Closes #21977	2016-12-07 10:32:11 +01:00
Areek Zillur	c5b09adf47	Make index and delete operation execute as a single bulk item Performance testing by @danielmitterdorfer revealed single index/delete operations have similar performance (indexing throughput) to equivalent single item bulk request. This PR reduces the code paths to executing single write operations, by reusing the logic in (shard) bulk action for executing single operation as a single-item bulk request.	2016-12-07 00:43:04 -05:00
Nik Everett	ef83dbfbe6	Reindex: Better error message for pipeline in wrong place (#21985 ) `_update_by_query` supports specifying the `pipeline` to process the documents as a url parameter but `_reindex` doesn't. It doesn't because everything about the `_reindex` request that has to do with writing the documents is grouped under the `dest` object in the request body. This changes the response parameter from `request [_reindex] contains unrecognized parameter: [pipeline]` to `_reindex doesn't support [pipeline] as a query parmaeter. Specify it in the [dest] object instead.`	2016-12-06 14:55:46 -05:00
Boaz Leskes	4519bdfeb0	InternalTestCluster shouldn't auto heal an active disruption when a new one is set Instead people should explicitly clear the existing one so it's clear what's going on.	2016-12-06 19:58:11 +01:00
Alexander Kazakov	0a03a62ab6	Using ClientConfiguration.DEFAULT_SOCKET_TIMEOUT as default value for read timeout	2016-12-06 21:13:28 +03:00
shaie	6da44c8164	Fix _termvectors with preference to not hit NPE (#21959 ) When you submit a _termvectors request for an artificial document and specify the 'preference' parameter to send the request to a particular shard, the request sometimes hits NPE. Fix this case by ignoring the auto-generated artificial document ID and pick a shard per the preference parameter, or a random shard. This closes #21928	2016-12-06 17:29:09 +01:00
Jim Ferenczi	b42ca6bcc9	Include unindexed field in FieldStats response (#21821 ) * Include unindexed field in FieldStats response This change adds non-searchable fields to the FieldStats response. These fields do not have min/max information but they can be aggregatable. Fields that are only stored in _source (store:no, index:no, doc_values:no) will still be missing since they do not have any useful information to show. Indices and clients must be at least on V_5_2_0 to see this change.	2016-12-06 13:32:57 +01:00
Boaz Leskes	a7050b2d56	Remove `InternalTestCluster.startNode(s)Async` (#21846 ) Since the removal of local discovery in https://github.com/elastic/elasticsearch/pull/20960 we rely on minimum master nodes to be set in our test cluster. The setting is automatically managed by the cluster (by default) but current management doesn't work with concurrent single node async starting. On the other hand, with `MockZenPing` and the `discovery.initial_state_timeout` set to `0s` node starting and joining is very fast making async starting an unneeded complexity. Tests that still need async starting could, in theory, still do so themselves via background threads. Note that this change also removes the usage of `INITIAL_STATE_TIMEOUT_SETTINGS` as the starting of nodes is done concurrently (but building them is sequential)	2016-12-06 12:06:15 +01:00
Daniel Mitterdorfer	a02bc8ed1c	Document thread-safety for ingest processors With this commit we document that ingest processors need to be thread-safe. Previously this could be inferred from reading the source code but we got several user questions about this so it is stated explicitly in the Javadocs of Processor now.	2016-12-06 10:07:51 +01:00
Adrien Grand	26cbda41ea	AsciiFoldingFilter's multi-term component should never preserve the original token. (#21982 ) This ports the fix of https://issues.apache.org/jira/browse/LUCENE-7536 to Elasticsearch's ASCIIFoldingTokenFilterFactory.	2016-12-06 10:01:04 +01:00
Ryan Ernst	c8f241f284	Plugins: Remove response action filters (#21950 ) Action filters currently have the ability to filter both the request and response. But the response side was not actually used. This change removes support for filtering responses with action filters.	2016-12-05 16:14:04 -08:00
Alexander Kazakov	1491e2dec9	Remove default value for read_timeout setting Fix tests and docs	2016-12-05 21:29:17 +03:00
Nik Everett	2087234d74	Timeout improvements for rest client and reindex (#21741 ) Changes the default socket and connection timeouts for the rest client from 10 seconds to the more generous 30 seconds. Defaults reindex-from-remote to those timeouts and makes the timeouts configurable like so: ``` POST _reindex { "source": { "remote": { "host": "http://otherhost:9200", "socket_timeout": "1m", "connect_timeout": "10s" }, "index": "source", "query": { "match": { "test": "data" } } }, "dest": { "index": "dest" } } ``` Closes #21707	2016-12-05 10:54:51 -05:00
Jim Ferenczi	03a0a0aebb	Undeprecate GetResponse#getFields and GetResponse#getField These functions should not have been deprecated as they can be used to retrieve stored and doc-value field.	2016-12-05 15:31:53 +01:00
makeyang	318ce6ab16	fix bug: https://github.com/elastic/elasticsearch/issues/21710	2016-12-05 19:14:34 +08:00

26306 Commits All Branches Search
