OpenSearch

Commit Graph

Author	SHA1	Message	Date
Alan Woodward	852df128a5	Match phrase queries against non-indexed fields should throw an exception (#31060 ) When `lenient=false`, attempts to create match phrase queries with custom analyzers against non-text fields will throw an IllegalArgumentException. Also changes `MatchQueryBuilderTests` so that it avoids this scenario Fixes #31061	2018-06-04 19:12:45 +01:00
Julie Tibshirani	609de08126	In the internal highlighter APIs, use the field type as opposed to the mapper. (#31039 )	2018-06-04 11:12:03 -07:00
Julie Tibshirani	30a8f9d948	Make sure KeywordFieldMapper#clone preserves split_queries_on_whitespace. (#31049 )	2018-06-04 08:42:32 -07:00
Boaz Leskes	167b9b3656	Adapt bwc versions after backporting #31045 to 6.3	2018-06-04 15:13:36 +01:00
Christoph Büscher	11b11f6f4c	Share common readFrom/writeTo code in AcknowledgeResponse (#30983 ) The majority of Responses inheriting from AcknowledgeResponse implement the readFrom and writeTo serialization method in the same way. Moving this as a default into AcknowledgeResponse and letting the few exceptions that need a slightly different implementation handle this themselves saves a lot of duplication.	2018-06-04 15:10:02 +02:00
Boaz Leskes	ccb78c2fdf	Adapt bwc versions after backporting #31045 to 6.x	2018-06-04 13:33:34 +01:00
Daniel Mitterdorfer	146965f3ec	Mute MatchPhrase*QueryBuilderTests Relates #31061	2018-06-04 14:03:01 +02:00
Alan Woodward	0427339ab0	Index phrases (#30450 ) Specifying `index_phrases: true` on a text field mapping will add a subsidiary [field]._index_phrase field, indexing two-term shingles from the parent field. The parent analysis chain is re-used, wrapped with a FixedShingleFilter. At query time, if a phrase match query is executed, the mapping will redirect it to run against the subsidiary field. This should trade faster phrase querying for a larger index and longer indexing times. Relates to #27049	2018-06-04 08:50:35 +01:00
Jason Tedor	dc8a4fb460	Remove leftover debugging from PTCMDT This commit removes some leftover debugging statements.	2018-06-03 21:53:21 -04:00
Jason Tedor	5667b08aaa	Fix PTCMDT#testMinVersionSerialization This commit fixes an issue with PersistentTasksCustomMetaDataTests#testMinVersionSerialization. There were two problems here: - some versions do not have future compatible version (e.g., betas) - the feature logic was incorrect	2018-06-03 21:35:01 -04:00
Boaz Leskes	a7ceefe93f	Make Persistent Tasks implementations version and feature aware (#31045) With #31020 we introduced the ability for transport clients to indicate what features they support in order to make sure we don't serialize objects to them that they don't support. This PR adapts the serialization logic of persistent tasks to be aware of those features and not serialize tasks that aren't supported. Also, a version check is added for the future where we may add new task implementations and need to be able to indicate they shouldn't be serialized both to nodes and clients. As the implementation relies on the interface of `PersistentTaskParams`, these are no longer optional. That's acceptable as all current implementations have them and we plan to make `PersistentTaskParams` more central in the future. Relates to #30731	2018-06-03 21:51:08 +02:00
Jason Tedor	5bfe2ba469	Avoid randomization bug in FeatureAwareTests We compute a random version and later try to compute the version prior to that random version. If the random version is the earliest version in our list of versions then it, by definition, does not have a previous version. Yet we still try to find its previous version, and so the test fails. This commit adds a version check to the randomization so that we do not select the earliest version in our list.	2018-06-01 22:49:51 -04:00
Jason Tedor	3670a2ae05	Adjust BWC version on client features This commit adjusts the BWC version on client features in master to 6.3.0 after the functionality was backported to the 6.3 branch.	2018-06-01 19:15:31 -04:00
Tim Brooks	f8785dda9d	Add TRACE, CONNECT, and PATCH http methods (#31035 ) This is related to #31017. That issue identified that these three http methods were treated like GET requests. This commit adds them to RestRequest. This means that these methods will be handled properly and generate 405s.	2018-06-01 17:07:54 -06:00
Jason Tedor	2401150be7	Adjust BWC version on client features This commit adjusts the BWC version on client features in master to 6.4.0 after the functionality was backported to the 6.x branch.	2018-06-01 16:33:56 -04:00
Jason Tedor	4522b57e07	Introduce client feature tracking (#31020) This commit introduces the ability for a client to communicate to the server features that it can support and for these features to be used in influencing the decisions that the server makes when communicating with the client. To this end we carry the features from the client to the underlying stream as we carry the version of the client today. This enables us to enhance the logic where we make protocol decisions on the basis of the version on the stream to also make protocol decisions on the basis of the features on the stream. With such functionality, the client can communicate to the server if it is a transport client, or if it has, for example, X-Pack installed. This enables us to support rolling upgrades from the OSS distribution to the default distribution without breaking client connectivity as we can now elect to serialize customs in the cluster state depending on whether or not the client reports to us using the feature capabilities that it can understand these customs. This means that we would avoid sending a client pieces of the cluster state that it cannot understand. However, we want to take care and always send the full cluster state during node-to-node communication as otherwise we would end up with different understanding of what is in the cluster state across nodes depending on which features they reported to have. This is why when deciding whether or not to write out a custom we always send the custom if the client is not a transport client and otherwise do not send the custom if the client is a transport client that does not report to have the feature required by the custom. Co-authored-by: Yannick Welsch <yannick@welsch.lu>	2018-06-01 11:45:35 -04:00
Alan Woodward	b8fda588f4	Ensure that index_prefixes settings cannot be changed (#30967 )	2018-06-01 15:17:35 +01:00
Sohaib Iftikhar	11887fa54a	REST high-level client: add delete ingest pipeline API (#30865 ) Relates to #27205	2018-06-01 14:13:41 +02:00
Yannick Welsch	fb671adfd6	Fix interoperability with < 6.3 transport clients (#30971 ) With the default distribution changing in 6.3, clusters might now contain custom metadata that a pure OSS transport client cannot deserialize. As this can break transport clients when accessing the cluster state or reroute APIs, we've decided to exclude any custom metadata that the transport client might not be able to deserialize. This will ensure compatibility between a < 6.3 transport client and a 6.3 default distribution cluster. Note that this PR only covers interoperability with older clients, another follow-up PR will cover full interoperability for >= 6.3 transport clients where we will make it possible again to get the custom metadata from the cluster state. Relates to #30731	2018-06-01 10:02:57 +02:00
Jim Ferenczi	0791f93dbd	Add an option to split keyword field on whitespace at query time (#30691 ) This change adds an option named `split_queries_on_whitespace` to the `keyword` field type. When set to true full text queries (`match`, `multi_match`, `query_string`, ...) that target the field will split the input on whitespace to build the query terms. Defaults to `false`. Closes #30393	2018-06-01 09:47:03 +02:00
Christoph Büscher	cea3c28b5b	[Tests] Fix alias names in PutIndexTemplateRequestTests (#30960) The randomized alias names could contain unicode control characters that don't survive an xContent rendering and parsing roundtrip when using the YAML xContent type. This fix filters the randomized unicode string for control characters to avoid this particular problem. Closes #30911	2018-06-01 09:45:04 +02:00
Sohaib Iftikhar	80d20a9010	REST high-level client: add get ingest pipeline API (#30847 ) Relates to #27205	2018-06-01 08:55:43 +02:00
Luca Cavanna	70749e01c4	Cross Cluster Search: preserve remote status code (#30976) In case an error is returned when calling search_shards on a remote cluster, which will lead to throwing an exception in the coordinating node, we should make sure that the status code returned by the coordinating node is the same as the one returned by the remote cluster. Up until now a 500 - Internal Server Error was always returned. This commit changes this behaviour so that for instance if an index is not found, which causes a 404, a 404 is also returned by the coordinating node to the client. Closes #27461	2018-06-01 08:53:53 +02:00
Luca Cavanna	31351ab880	High-level client: list tasks failure to not lose nodeId (#31001) This commit reworks testing for `ListTasksResponse` so that random fields insertion can be tested and xcontent equivalence can be checked too. Proper exclusions need to be configured, and failures need to be tested separately. This helped find a small problem: whenever a node failure was returned, the nodeId was lost as it was never printed out as part of the exception toXContent.	2018-06-01 08:53:24 +02:00
Julie Tibshirani	cd0a375414	Remove unused query methods from MappedFieldType. (#30987 ) * Remove MappedFieldType#nullValueQuery, as it is now unused. * Remove MappedFieldType#queryStringTermQuery, as it is never overridden.	2018-05-31 12:47:52 -07:00
Tim Brooks	4f66b9a27c	Transport client: Don't validate node in handshake (#30737) This is related to #30141. Right now in the transport client we open a temporary node connection and take the node information. This node information is used to open a permanent connection that is used for the client. However, we continue to use the configured transport address. If the configured transport address is a load balancer, you might connect to a different node for the permanent connection. This causes the handshake validation to fail. This commit removes the handshake validation for the transport client when it is in simple node sample mode.	2018-05-31 13:14:28 -06:00
Michael Basnight	d826cb36c3	Remove version read/write logic in Verify Response (#30879 ) Since master will always communicate with a >=6.4 node, the logic for checking if the node is 6.4 and conditionally reading and writing based on that can be removed from master. This logic will stay in 6.x as it is the bridge to the cleaner response in master. This also unmutes the failing test due to this bwc change. Closes #30807	2018-05-31 12:10:01 -05:00
Ryan Ernst	46e8d97813	Core: Remove RequestBuilder from Action (#30966 ) This commit removes the RequestBuilder generic type from Action. It was needed to be used by the newRequest method, which in turn was used by client.prepareExecute. Both of these methods are now removed, along with the existing users of prepareExecute constructing the appropriate builder directly.	2018-05-31 16:15:00 +02:00
Jim Ferenczi	0f5e570184	Deprecates indexing and querying a context completion field without context (#30712 ) This change deprecates completion queries and documents without context that target a context enabled completion field. Querying without context degrades the search performance considerably (even when the number of indexed contexts is low). This commit targets master but the deprecation will take place in 6.x and the functionality will be removed in 7 in a follow up. Closes #29222	2018-05-31 16:09:48 +02:00
Tanguy Leroux	c41574376f	Make AllocatedPersistentTask.isCompleted() protected (#30949) This commit changes the isCompleted() method to be protected so that classes that extend AllocatedPersistentTask can use it. Related to #30858	2018-05-31 09:19:05 +02:00
Nhat Nguyen	b834254862	Mute FlushIT tests We have identified the source causing these tests to fail. This commit mutes them again until we have a proper fix. Relates #29392	2018-05-30 14:23:53 -04:00
Michael Basnight	b716b08197	Add Verify Repository High Level REST API (#30934 ) This commit adds Verify Repository, the associated docs and tests for the high level REST API client. A few small changes to the Verify Repository Response went into the commit as well. Relates #27205	2018-05-30 11:10:00 -05:00
Jim Ferenczi	532b91ffa6	Fix composite agg serialization error Fix serialization after backport Relates #29465	2018-05-30 14:22:48 +02:00
Christoph Büscher	1ea9f11b03	Change ScriptException status to 400 (bad request) (#30861 ) Currently failures to compile a script usually lead to a ScriptException, which inherits the 500 INTERNAL_SERVER_ERROR from ElasticsearchException if it does not contain another root cause. Instead, this should be a 400 Bad Request error. This PR changes this more generally for script compilation errors by changing ScriptException to return 400 (bad request) as status code. Closes #12315	2018-05-30 14:00:07 +02:00
Jim Ferenczi	f582418ada	Fix missing option serialization after backport Relates #29465	2018-05-30 12:55:31 +02:00
Luca Cavanna	3c21e46fa3	Cross Cluster Search: do not use dedicated masters as gateways (#30926 ) When we are connecting to a remote cluster we should never select dedicated master nodes as gateway nodes, or we will end up loading them with requests that should rather go to other type of nodes e.g. data nodes or coord_only nodes. This commit adds the selection based on the node role, to the existing selection based on version and potential node attributes. Closes #30687	2018-05-30 12:32:41 +02:00
olcbean	6341d101d2	Fix AliasMetaData parsing (#30866 ) AliasMetaData should be parsed more leniently so that the high-level REST client can support forward compatibility on it. This commit addresses this issue that was found as part of #28799 and adds dedicated XContent tests as well.	2018-05-30 12:31:24 +02:00
Yannick Welsch	ff8ce2c575	Fsync state file before exposing it (#30929) With multiple data paths, we write the state files for index metadata to all data paths. We only properly fsync on the first location, though. For other locations, we possibly expose the file before its contents are properly fsynced. This can lead to situations where, after a crash, and where the first data path is not available anymore, ES will see a partially-written state file, preventing the node from starting up.	2018-05-30 10:15:12 +02:00
Jim Ferenczi	e33d107f84	Add missing_bucket option in the composite agg (#29465) This change adds a new option to the composite aggregation named `missing_bucket`. This option can be set by source and dictates whether documents without a value for the source should be ignored. When set to true, documents without a value for a field emit an explicit `null` value which is then added in the composite bucket. The `missing` option that allows to set an explicit value (instead of `null`) is deprecated in this change and will be removed in a follow up (only in 7.x). This commit also changes how the big arrays are allocated: instead of reserving the provided `size` for all sources, they are created with a small initial size and they grow depending on the number of buckets created by the aggregation. Closes #29380	2018-05-30 09:48:40 +02:00
Alan Woodward	67905c85a5	Rename index_prefix to index_prefixes (#30932 ) This commit also adds index_prefixes tests to TextFieldMapperTests to ensure that cloning and wire-serialization work correctly	2018-05-30 08:32:31 +01:00
Tanguy Leroux	a0af0e7f1e	Rename methods in PersistentTasksService (#30837 ) This commit renames methods in the PersistentTasksService, to make obvious that the methods send requests in order to change the state of persistent tasks. Relates to #29608.	2018-05-30 09:20:14 +02:00
Julie Tibshirani	913778b37a	Update the version checks around range bucket keys, now that the change was backported.	2018-05-29 20:51:27 -07:00
Julie Tibshirani	a79c5bd3d6	Minor clean-up in InternalRange. (#30886 ) * Make sure all instance variables are final. * Make generateKey a private static method, instead of protected. * Rename formatter -> format for consistency. * Serialize bucket keys as strings as opposed to optional strings. * Pull the stream serialization logic for buckets into the Bucket class.	2018-05-29 18:16:02 -07:00
Tim Brooks	ad0dc580c5	Fix location of AbstractHttpServerTransport (#30888 ) Currently AbstractHttpServerTransport is in a netty4 module. This is the incorrect location. This commit moves it out of netty4 module. Additionally, it moves unit tests that test AbstractHttpServerTransport logic to server.	2018-05-29 13:14:23 -06:00
Martijn van Groningen	544822c78b	Moved keyword tokenizer to analysis-common module (#30642 ) Relates to #23658	2018-05-29 19:22:28 +02:00
Nhat Nguyen	363f1e84ca	Upgrade to Lucene-7.4-snapshot-1cbadda4d3 (#30928 ) This snapshot includes LUCENE-8328 which is needed to stabilize CCR builds.	2018-05-29 12:29:52 -04:00
Nhat Nguyen	9e9abc31b8	Fix IndexTemplateMetaData parsing from xContent (#30917) We failed to register "aliases" and "version" into the list of keywords in the IndexTemplateMetaData, and therefore failed to parse the following index template. ``` { "aliases": {"log": {}}, "index_patterns": ["pattern-1"] } ``` This commit registers those missing keywords.	2018-05-29 11:14:39 -04:00
Sohaib Iftikhar	3c918d799c	Deprecate accepting malformed requests in stored script API (#28939 ) The stored scripts API today accepts malformed requests instead of throwing an exception. This PR deprecates accepting malformed put stored script requests (requests not using the official script format). Relates to #27612	2018-05-29 15:45:53 +02:00
Christoph Büscher	c137ad0c39	Replace several try-finally statements (#30880 ) This change replaces some existing try-finally statements that close resources in their finally block with the slightly shorter and safer try-with-resources pattern.	2018-05-29 10:31:52 +02:00
Tanguy Leroux	6e480663d7	Remove AllocatedPersistentTask.getState() (#30858 ) This commit removes the method AllocatedPersistentTask.getState() that exposes the internal state of an AllocatedPersistentTask and replaces it with a new isCompleted() method. Related to #29608.	2018-05-29 09:26:02 +02:00
Albert Zaharovits	e888467d0a	[TEST] Fix minor random bug from #30794	2018-05-27 20:02:24 +03:00
Vladimir Dolzhenko	b55b079a90	Include size of snapshot in snapshot metadata #18543 , bwc clean up (#30890 )	2018-05-26 21:20:44 +02:00
Vladimir Dolzhenko	81eb8ba0f0	Include size of snapshot in snapshot metadata (#29602) Adds difference of number of files (and file sizes) between prev and current snapshot. Total number/size reflects total number/size of files in snapshot. Closes #18543	2018-05-25 21:04:50 +02:00
Michael Basnight	e08c7c2df4	Change BWC version for VerifyRepositoryResponse (#30796 ) The BWC version was previously at 7.0, because the 6.x backport had not yet landed. Now that it has landed, this commit replaces the BWC compat with the real version, 6.4.0. Relates #30762	2018-05-25 10:09:09 -05:00
Jason Tedor	d31e10a87d	Verify signatures on official plugins (#30800) We sign our official plugins yet this is not well-advertised and not at all consumed during plugin installation. For plugins that are installed over the intertubes, verifying that the downloaded artifact is signed by our signing key would establish both integrity and validity of the downloaded artifact. The chain of trust here is simple: our installable artifacts (archive and package distributions) are signed so that if a user trusts our packages via their signatures, and our plugin installer (which would be executing trusted code) verifies the downloaded plugin, then the user can trust the downloaded plugin too. This commit adds verification of official plugins downloaded during installation. We do not add verification for offline plugin installs; a user can download our signatures and verify the artifacts themselves. This commit also needs to solve a few interesting challenges. One of these is that we want the bouncy castle JARs on the classpath only for the plugin installer, but not for the runtime Elasticsearch. Additionally, we want these JARs to not be present for the JAR hell checks. To address this, we shift these JARs into a sub-directory of lib (lib/tools/plugin-cli) that is only loaded for the plugin installer, and in the plugin installer we filter any JARs in this directory from the JAR hell check.	2018-05-25 07:56:35 -04:00
Martijn van Groningen	ae2f021f1c	Move score script context from SearchScript to its own class (#30816 )	2018-05-25 07:17:50 +02:00
Michael Basnight	e1ffbeb824	Fix bad version check writing Repository nodes (#30846) The writeTo method of VerifyRepositoryResponse incorrectly used its local version to determine what it was receiving, rather than the sender's version. This fixes a bug that occasionally happened when a 6.4 master node sent data to a 7.0 client, causing the number of bytes to be improperly read. This also unmutes the test. Closes #30807	2018-05-24 19:21:57 -05:00
Tim Brooks	e8b70273c1	Remove Throwable usage from transport modules (#30845 ) Currently nio and netty modules use the CompletableFuture class for managing listeners. This is unfortunate as that class accepts Throwable. This commit adds a class CompletableContext that wraps the CompletableFuture but does not accept Throwable. This allows the modification of netty and nio logic to no longer handle Throwable.	2018-05-24 17:33:29 -06:00
Sohaib Iftikhar	5a97423b7a	REST high-level client: add put ingest pipeline API (#30793) Adds the put ingest pipeline API to the high level rest client.	2018-05-24 19:02:26 -04:00
Julie Tibshirani	f55b09bae4	Update the version checks around ip_range bucket keys, now that the change was backported.	2018-05-24 12:04:18 -07:00
Igor Motov	3622486889	Mute IndexMasterFailoverIT.testMasterFailoverDuringIndexingWithMappingChanges Tracked by #30844	2018-05-24 15:00:16 -04:00
Igor Motov	cf0e0606af	Use geohash cell instead of just a corner in geo_bounding_box (#30698 ) Treats geohashes as grid cells instead of just points when the geohashes are used to specify the edges in the geo_bounding_box query. For example, if a geohash is used to specify the top_left corner, the top left corner of the geohash cell will be used as the corner of the bounding box. Closes #25154	2018-05-24 14:46:15 -04:00
Jay Modi	b3a4acdf20	Limit user to single concurrent auth per realm (#30794) This commit reworks the way our realms perform caching in order to limit each principal to a single ongoing authentication per realm. In other words, this means that multiple requests made by the same user will not trigger more than one authentication attempt at a time if no entry has been stored in the cache. If an entry is present in our cache, there is no restriction on the number of concurrent authentications performed for this user. This change enables us to limit the load we place on an external system like an LDAP server and also preserve resources such as CPU on expensive operations such as BCrypt authentication. Closes #30355	2018-05-24 10:43:10 -06:00
Julie Tibshirani	638a719370	Ensure that ip_range aggregations always return bucket keys. (#30701 )	2018-05-24 08:55:14 -07:00
Simon Willnauer	8bbfdf1f45	Use remote client in TransportFieldCapsAction (#30838 ) We now have a remote cluster client exposed which can talk to a given remote cluster and manages reconnects etc. This makes code more readable than using the transport layer directly.	2018-05-24 17:02:47 +02:00
David Roberts	aafcd85f50	Move persistent task registrations to core (#30755) Persistent tasks were moved from X-Pack to core in #28455. However, registration of the named writables and named X-content was left in X-Pack. This change moves the registration of the named writables and named X-content into core. Additionally, the persistent task actions are no longer registered in the X-Pack client plugin, as they are already registered in ActionModule.	2018-05-24 09:17:17 +01:00
David Turner	ff0b6c795a	Decouple ClusterStateTaskListener & ClusterApplier (#30809 ) Today, the `ClusterApplier` and `MasterService` both use the `ClusterStateTaskListener` interface to notify their callers when asynchronous activities have completed. However, this is not wholly appropriate: none of the callers into the `ClusterApplier` care about the `ClusterState` arguments that they receive. This change introduces a dedicated ClusterApplyListener interface for callers into the `ClusterApplier`, to distinguish these listeners from the real `ClusterStateTaskListener`s that are waiting for responses from the `MasterService`.	2018-05-24 09:05:09 +01:00
Simon Willnauer	0bdfb5c5b5	Send client headers from TransportClient (#30803 ) This change adds a simple header to the transport client that is present on the servers thread context that ensures we can detect if a transport client talks to the server in a specific request. This change also adds a header for xpack to detect if the client has xpack installed.	2018-05-24 09:46:48 +02:00
Igor Motov	699153edc7	Fix GeoShapeQueryBuilder serialization after backport Aligns the routing value serialization version after backport of #30760	2018-05-23 18:45:19 -04:00
Tim Brooks	d7040ad7b4	Reintroduce mandatory http pipelining support (#30820 ) This commit reintroduces `31251c9` and `63a5799`. These commits introduced a memory leak and were reverted. This commit brings those commits back and fixes the memory leak by removing unnecessary retain method calls.	2018-05-23 14:38:52 -06:00
Igor Motov	4b6915976c	Add support for indexed shape routing in geo_shape query (#30760 ) Adds ability to specify the routing value for the indexed shape in the geo_shape query. Closes #7663	2018-05-23 15:15:19 -04:00
Colin Goodheart-Smithe	4fd0a3e492	Revert "Make http pipelining support mandatory (#30695 )" (#30813 ) This reverts commit `31251c9` introduced in #30695. We suspect this commit is causing the OOME's reported in #30811 and we will use this PR to test this assertion.	2018-05-23 10:54:46 -06:00
Yannick Welsch	cfa2b069f3	Use correct cluster state version for node fault detection (#30810 ) Since its introduction in ES 1.4, node fault detection has been using the wrong cluster state version to send as part of the ping request, by using always the constant -1 (ClusterState.UNKNOWN_VERSION). This can, in an unfortunate series of events, lead to a situation where a previous stale master can regain its authority and revert the cluster to an older state. This commit makes NodesFaultDetection use the correct current cluster state for sending ping requests, avoiding the situation where a stale master possibly forces a newer master to step down and rejoin the stale one.	2018-05-23 18:35:25 +02:00
Adrien Grand	405eb7a751	Change serialization version of doc-value fields. Relates #29639	2018-05-23 18:34:05 +02:00
Yannick Welsch	28d690d1cf	[TEST] Don't expect acks when isolating nodes With #30672, acking expects all nodes to successfully apply the cluster state. The testElectMasterWithLatestVersion test was checking for an ack while isolating one node in the test. Relates to #30672	2018-05-23 16:35:22 +02:00
Adrien Grand	a19df4ab3b	Add a `format` option to `docvalue_fields`. (#29639 ) This commit adds the ability to configure how a docvalue field should be formatted, so that it would be possible eg. to return a date field formatted as the number of milliseconds since Epoch. Closes #27740	2018-05-23 14:39:04 +02:00
Colin Goodheart-Smithe	4aa345e6dc	Fixes UpdateSettingsRequestStreamableTests mutate bug The mutate function in UpdateSettingsRequestStreamableTests did not guarantee that the masterNodeTimeout and timeout values are definitely changed, and occasionally the randomTimeValue() method would select the same time value as the original request, which caused a failure.	2018-05-23 13:31:43 +01:00
Yannick Welsch	8145a820c2	Only allow x-pack metadata if all nodes are ready (#30743 ) Enables a rolling restart from the OSS distribution to the x-pack based distribution by preventing x-pack code from installing custom metadata into the cluster state until all nodes are capable of deserializing this metadata.	2018-05-23 11:41:23 +02:00
Yannick Welsch	30b004f582	Use original settings on full-cluster restart (#30780 ) When doing a node restart using the test framework, the restarted node does not only use the settings provided to the original node, but also additional settings provided by plugin extensions, which does not correspond to the settings that a node would have on a true restart.	2018-05-23 09:02:01 +02:00
Yannick Welsch	cceaa9a0f1	Only ack cluster state updates successfully applied on all nodes (#30672 ) The cluster state acking mechanism currently incorrectly acks cluster state updates that have not successfully been applied on all nodes. In a situation, for example, where some of the nodes disconnect during publishing, and don't acknowledge receiving the new cluster state, the user-facing action (e.g. create index request) will still consider this as an ack.	2018-05-23 08:56:32 +02:00
Tim Brooks	63a5799526	Remove http pipelining from integration test case (#30788 ) This is related to #29500. We are removing the ability to disable http pipelining. This PR removes the references to disabling pipelining in the integration test case.	2018-05-22 17:18:05 -06:00
Michael Basnight	a8cea90e10	Modify state of VerifyRepositoryResponse for bwc (#30762) The VerifyRepositoryResponse class holds a DiscoveryNode[], but the nodes themselves are not serialized to a REST API consumer. Since we do not want to put all of a DiscoveryNode over the wire, be it REST or Transport, because it is unused, this change introduces a BWC-compatible change in the ser/deser of the Response. Anything 6.4 and above will read/write a NodeView, and anything prior will read/write a DiscoveryNode. Further changes to 7.0 will be introduced to remove the BWC shim and only read/write NodeView, and hold a List<NodeView> as the VerifyRepositoryResponse internal state.	2018-05-22 14:55:20 -05:00
Jason Tedor	2984734197	Simplify number of shards setting (#30783) This is code that was leftover from the move to one shard by default. Here in index metadata we were preserving the default number of shards settings independently of the area of code where we set this value on an index that does not explicitly have a number of shards setting. This took into consideration the es.index.max_number_of_shards system property, and was used in search requests to set the default maximum number of concurrent shard requests. We set the default there based on the default number of shards so that in a one-node case a search request could concurrently hit all shards on an index with the defaults. Now that we default to one shard, we expect fewer shards in clusters and this adjustment of the node count as the max number of concurrent shard requests is no longer needed. This commit then changes the default number of shards settings to be consistent with the value used when an index is created, and removes the now unneeded adjustment in search requests.	2018-05-22 14:33:16 -04:00
Nhat Nguyen	1918a30237	Upgrade to Lucene-7.4.0-snapshot-cc2ee23050 (#30778) The new snapshot includes LUCENE-8324 which fixes missing checkpoint after a fully deleted segment is dropped on flush. This snapshot should resolve failed tests in the CorruptedFileIT suite. Closes #30741 Closes #30577	2018-05-22 13:11:48 -04:00
Tim Brooks	31251c9a6d	Make http pipelining support mandatory (#30695) This is related to #29500 and #28898. This commit removes the ability to disable http pipelining. After this commit, any elasticsearch node will support pipelined requests from a client. Additionally, it extracts some of the http pipelining work to the server module. This extracted work is used to implement pipelining for the nio plugin.	2018-05-22 09:29:31 -06:00
Adrien Grand	54740cc551	Increase the maximum number of filters that may be in the cache. (#30655 ) We added this limit because we occasionally saw cases where most of the memory usage of the cache was spent on the keys (ie. queries) rather than the values, which caused the cache to vastly underestimate its memory usage. In recent releases, we disabled caching on heavy `terms` queries, which were the main source of the problem, so putting more entries in the cache should be safer.	2018-05-22 14:57:02 +02:00
Yannick Welsch	e6a784c474	[TEST] Wait for CS to be fully applied in testDeleteCreateInOneBulk The test has an issue that exhibits only super rarely. The test sets the publish timeout to 0, then proceeds to block cluster state processing on a data node, then deletes an index and recreates it, and finally removes the cluster state processing block. Finally, it calls ensureGreen, which might now return before the data node has fully applied the cluster state that removed and readded the shard, due to the publish timeout of 0. This commit waits for the cluster state to be fully processed on the data node before doing the search. Closes #30718	2018-05-22 13:31:18 +02:00
Jim Ferenczi	da6a56f3cc	Ignore empty completion input (#30713 ) This change makes sure that an empty completion input does not throw an IAE when indexing. Instead the input is ignored and the completion field is added in the list of ignored fields for the document. Closes #23121	2018-05-22 11:04:16 +02:00
Tim Brooks	abf8c56a37	Remove logging from elasticsearch-nio jar (#30761 ) This is related to #27260. The elasticsearch-nio jar is supposed to be a library opposed to a framework. Currently it internally logs certain exceptions. This commit modifies it to not rely on logging. Instead exception handlers are passed by the applications that use the jar.	2018-05-21 20:18:12 -06:00
Michael Basnight	c6be3b4e5a	Add Delete Repository High Level REST API (#30666 ) This commit adds Delete Repository, the associated docs and tests for the high level REST API client. It also cleans up a seemingly innocuous line in the RestDeleteRepositoryAction and some naming in SnapshotIT. Relates #27205	2018-05-21 19:52:21 -05:00
Lee Hinman	42b0b45659	[TEST] Enable DEBUG logging on testAutoQueueSizingWithMax Enables debug logging on QueueResizingEsThreadPoolExecutorTests#testAutoQueueSizingWithMax Relates to #30740	2018-05-21 10:23:45 -06:00
Jason Tedor	e639036ef1	Add assertion on removing copy_settings (#30748 ) The copy_settings parameter will be removed in Elasticsearch 8.0.0. This commit adds an assertion with a message to clean up this code when master is bumped to 8.0.0.	2018-05-21 08:46:37 -04:00
Boaz Leskes	ea09667aa6	bump lucene version for 6_3_0	2018-05-21 11:49:37 +02:00
Martijn van Groningen	314cd6feaf	Add more script contexts (#30721 ) Added dedicated script contexts for: * script function score * script sorting * terms_set query Scripts for these contexts will either have a specific return value or use scoring and therefore in the future will need their own scripting classes. Relates to #30511	2018-05-20 21:31:50 +02:00
Nhat Nguyen	29f647e6f6	Mute testCorruptFileThenSnapshotAndRestore Tracked at #30577	2018-05-19 08:28:15 -04:00
Ryan Ernst	34180f2285	Scripting: Remove getDate methods from ScriptDocValues (#30690 ) The getDate() and getDates() existed prior to 5.x on long fields in scripting. In 5.x, a new Date type for ScriptDocValues was added. The getDate() and getDates() methods were left on long fields and added to date fields to ease the transition. This commit removes those methods for 7.0.	2018-05-18 21:26:26 -07:00
Nhat Nguyen	67d8fc222d	Upgrade to Lucene-7.4.0-snapshot-59f2b7aec2 (#30726 ) This snapshot resolves issues related to ShrinkIndexIT.	2018-05-18 18:21:39 -04:00
Ryan Ernst	b3f3a4312b	Plugins: Remove meta plugins (#30670 ) Meta plugins existed only for a short time, in order to enable breaking up x-pack into multiple plugins. However, now that x-pack is no longer installed as a plugin, the need for them has disappeared. This commit removes the meta plugins infrastructure.	2018-05-18 10:56:08 -07:00
Nhat Nguyen	95f52f07ce	TEST: Add engine log to testCorruptFileThenSnapshotAndRestore Relates #30577	2018-05-18 11:08:25 -04:00
Jason Tedor	d68c44b76c	Default copy settings to true and deprecate on the REST layer (#30598 ) This commit defaults the copy_settings REST parameter to the shrink and split APIs to true, and deprecates the parameter.	2018-05-18 10:12:08 -04:00
Jason Tedor	443c7014ba	Make TransportClusterStateAction abide to our style (#30697 ) I still do not like == false. However, I am so used to reading it that today I read this line of code and could not understand how it could possibly be doing the right thing. It was only when I finally noticed the ! that the code made sense. This commit changes this code to be in our style of == false. I still do not like == false.	2018-05-17 21:17:24 -04:00
tomcallahan	9be3fbd1b2	Change required version for Get Settings transport API changes to 6.4.0 (#30706 ) Get Settings API changes have now been backported to version 6.4, and therefore the latest version must send and expect the extra fields when communicating with 6.4+ code. Relates #29229 #30494	2018-05-17 20:29:08 -04:00
Mayya Sharipova	7c2fc26011	Correct typos Relates to #28725	2018-05-17 10:34:42 -04:00
Mayya Sharipova	3dfa93ef7c	Improve explanation in rescore (#30629 ) Currently in a rescore request, if window_size is smaller than the top N documents returned (N=size), the explanation of scores could be incorrect for documents that were a part of topN and not part of rescoring. This PR corrects this by saving in RescoreContext the docIDs of documents for which rescoring was applied, and adding the rescoring explanation only for these docIDs. Closes #28725	2018-05-17 07:09:18 -04:00
Christoph Büscher	b6340658f4	Deprecate `nGram` and `edgeNGram` names for ngram filters (#30209 ) The camel case name `nGram` should be removed in favour of `ngram` and similar for `edgeNGram` and `edge_ngram`. Before removal, we need to deprecate the camel case names first. This change adds deprecation warnings for indices with versions 6.4.0 and higher and logs deprecation warnings.	2018-05-17 12:52:22 +02:00
Tanguy Leroux	2ac1f9fe89	Fix _cluster/state to always return cluster_uuid (#30656 ) Since #30143, the Cluster State API should always return the current cluster_uuid in the response body, regardless of the metrics filters. This is not exactly true as it is returned only if metadata metrics and no specific indices are requested. This commit fixes the behavior to always return the cluster_uuid and adds a new test.	2018-05-17 10:58:25 +02:00
Tanguy Leroux	7915b5f7aa	[Tests] Add debug information to CorruptedFileIT This test failed but the cause is not obvious. This commit adds more debug logging traces so that if it reproduces we could gather more information. Related #30577	2018-05-17 10:57:25 +02:00
Nhat Nguyen	e0ccd4b816	Mute ShrinkIndexIT This is tracked at https://issues.apache.org/jira/browse/LUCENE-8318	2018-05-16 16:44:44 -04:00
Lee Hinman	a89073015d	Adjust serialization version in IndicesOptions The PR for the new format of serialization was backported, so this needs to be adjusted since it can now speak to 6.4 nodes in the new way.	2018-05-16 10:19:05 -06:00
Zachary Tong	94ce645969	[TEST] Fix compilation	2018-05-16 15:45:35 +00:00
Ke Li	d2b9a765cf	Remove version argument in RangeFieldType (#30411 ) The argument `indexVersionCreated` is not needed any more and can be removed.	2018-05-16 17:42:44 +02:00
Adrien Grand	41887e85df	Remove unused DirectoryUtils class. (#30582 )	2018-05-16 17:07:13 +02:00
Adrien Grand	28d4685d72	Mitigate date histogram slowdowns with non-fixed timezones. (#30534 ) Date histograms on non-fixed timezones such as `Europe/Paris` proved much slower than histograms on fixed timezones in #28727. This change mitigates the issue by using a fixed time zone instead when shard data doesn't cross a transition so that all timestamps share the same fixed offset. This should be a common case with daily indices. NOTE: Rewriting the aggregation doesn't work since the timezone is then also used on the coordinating node to create empty buckets, which might be out of the range of data that exists on the shard. NOTE: In order to be able to get a shard context in the tests, I reused code from the base query test case by creating a new parent test case for both queries and aggregations: `AbstractBuilderTestCase`. Mitigates #28727	2018-05-16 17:06:52 +02:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594 ) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
Colin Goodheart-Smithe	fa43aacd06	Removes AwaitsFix on IndicesOptionsTests	2018-05-16 15:24:58 +01:00
Jay Modi	5baadb1ff8	Template upgrades should happen in a system context (#30621 ) The TemplateUpgradeService is a system service that allows for plugins to register templates that need to be upgraded. These template upgrades should always happen in a system context as they are not a user initiated action. For security integrations, the lack of running this in a system context could lead to unexpected failures. The changes in this commit set an empty system context for the execution of the template upgrades performed by this service. Relates #30603	2018-05-16 08:21:15 -06:00
Zachary Tong	cd1ed9033c	Fix bug in BucketMetrics path traversal (#30632 ) When processing a top-level sibling pipeline, we destructively sublist the path by assigning back onto the same variable. But if aggs are specified such: A. Multi-bucket agg in the first entry of our internal list B. Regular agg as the immediate child of the multi-bucket in A C. Regular agg with the same name as B at the top level, listed as the second entry in our internal list D. Finally, a pipeline agg with the path down to B We'll get class cast exception. The first agg will sublist the path from [A,B] to [B], and then when we loop around to check agg C, the sublisted path [B] matches the name of C and it fails. The fix is simple: we just need to store the sublist in a new object so that the old path remains valid for the rest of the aggs in the loop Closes #30608	2018-05-16 10:10:26 -04:00
Colin Goodheart-Smithe	95ad9abcdb	Fixes IndiceOptionsTests to serialise correctly (#30644 ) * Fixes IndiceOptionsTests to serialise correctly Previous to this change `IndicesOptionsTests.testSerialisation()` would select a completely random version for both the `StreamOutput` and the `StreamInput`. This meant that the output could be selected as 7.0+ while the input was selected as <7.0, causing the stream to be written in the new format and read in the old format (or vice versa). This change splits the two cases into different test methods, ensuring that the Streams are at least on compatible versions even if they are on different versions. * Use same random version for input and output streams server/src/test/java/org/elasticsearch/action/support/IndicesOptionsTests.java	2018-05-16 14:17:08 +01:00
Boaz Leskes	b4ae29a192	mute IndicesOptionsTests.testSerialization See https://github.com/elastic/elasticsearch/pull/30644	2018-05-16 15:07:28 +02:00
Van0SS	4478f10a2a	Rest High Level client: Add List Tasks (#29546 ) This change adds a `listTasks` method to the high level java ClusterClient which allows listing running tasks through the task management API. Related to #27205	2018-05-16 13:31:37 +02:00
Yannick Welsch	3161386e2f	Move allocation awareness attributes to list setting (#30626 ) Allows the setting to be specified using proper array syntax, for example: "cluster.routing.allocation.awareness.attributes": [ "foo", "bar", "baz" ] Closes #30617	2018-05-16 09:57:22 +02:00
Vladimir Dolzhenko	fe3e0257ae	Allow date math for naming newly-created snapshots (#7939 ) (#30479 ) Allow date math for naming newly-created snapshots (#7939)	2018-05-16 07:23:25 +02:00
Michael Basnight	b94bc70aee	Add Create Repository High Level REST API (#30501 ) This commit adds Create Repository, the associated docs and tests for the high level REST API client. A few small changes to the PutRepository Request and Response went into the commit as well.	2018-05-15 21:21:11 -05:00
Tim Brooks	99b9ab58e2	Add nio http server transport (#29587 ) This commit is related to #28898. It adds an nio driven http server transport. Currently it only supports basic http features. CORS, pipelining, and read timeouts will need to be added in future PRs.	2018-05-15 16:37:14 -06:00
Lee Hinman	2cb71d0947	Refactor IndicesOptions to not be byte-based (#30586 ) * Refactor IndicesOptions to not be byte-based This refactors IndicesOptions to be enum/enummap based rather than using a byte as a bitmap for each of the options. This is necessary because we'd like to add additional options, but we ran out of bits. Backwards compatibility is kept for earlier versions so the option serialization does not change the options. Relates sort of to #30188	2018-05-15 15:03:08 -06:00
Nhat Nguyen	d1c28c60fc	HLRestClient: Follow-up for put index template api (#30592 ) This commit addresses some comments given after the original PR was in. Follow-up #30400	2018-05-15 08:06:58 -04:00
Simon Willnauer	b50cf3c6b0	Side-step pending deletes check (#30571 ) When we split/shrink an index we open several IndexWriter instances, causing file-deletes to be pending on Windows. This subsequently fails when we open an IW to bootstrap the index history due to pending deletes. This change sidesteps the check since we know our history goes forward in terms of files and segments. Closes #30416	2018-05-15 11:51:54 +02:00
Christoph Büscher	bf2fb210cc	[Tests] Relax allowed delta in extended_stats aggregation (#30569 ) The order in which double values are added in java can give different results for the sum, so we need to allow a certain delta in the test assertions. The current value was still a bit too low, resulting in rare test failures. This change increases the allowed margin of error by a factor of ten.	2018-05-15 10:35:16 +02:00
Tim Vernum	6517ac98eb	Fail if reading from closed KeyStoreWrapper (#30394 ) In #28255 the implementation of the elasticsearch.keystore was changed to no longer be built on top of a PKCS#12 keystore. A side effect of that change was that calling getString or getFile on a closed KeyStoreWrapper ceased to throw an exception, and would instead return a value consisting of all 0 bytes. This change restores the previous behaviour as closely as possible. It is possible to retrieve the _keys_ from a closed keystore, but any attempt to get or set the entries will throw an IllegalStateException.	2018-05-15 09:57:34 +10:00
Jason Tedor	4e33443690	Adjust versions for resize copy settings (#30578 ) Now that the change to deprecate copy settings and disallow it being explicitly set to false is backported, this commit adjusts the BWC versions in master.	2018-05-14 16:41:25 -04:00
Jack Conradson	1b0e6ee89f	Deprecate Empty Templates (#30194 ) Deprecate the use of empty templates. Bug fix allows empty templates/scripts to be loaded on start up for upgrades/restarts, but empty templates can no longer be created.	2018-05-14 13:32:09 -07:00
Yannick Welsch	d5f028e085	Auto-expand replicas only after failing nodes (#30553 ) #30423 combined auto-expansion in the same cluster state update where nodes are removed. As the auto-expansion step would run before deassociating the dead nodes from the routing table, the auto-expansion would possibly remove replicas from live nodes instead of dead ones. This commit reverses the order to ensure that when nodes leave the cluster that the auto-expand-replica functionality only triggers after failing the shards on the removed nodes. This ensures that active shards on other live nodes are not failed if the primary resided on a now dead node. Instead, one of the replicas on the live nodes first gets promoted to primary, and the auto-expansion (removing replicas) only triggers in a follow-up step (but still same cluster state update). Relates to #30456 and follow-up of #30423	2018-05-14 20:12:52 +02:00
Luca Cavanna	2f4212b80a	Fold RestGetAllSettingsAction in RestGetSettingsAction (#30561 ) We currently have a separate endpoint for retrieving settings from all indices. We introduced such endpoint when removing comma-separated feature parsing for GetIndicesAction. The RestGetAllSettingsAction duplicates the code to print out the response that we already have in GetSettingsResponse (since it became a ToXContentObject), and uses the get index API internally instead of the get settings API, but the response is the same, hence we can fold get all settings and get settings in a single API, which is what this commit does.	2018-05-14 19:56:50 +02:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
Zachary Tong	1a7110524f	[TEST] Fix typo in MovAvgIT test The second set of assertions was accidentally using the count's moving average for the error delta in the value's moving average assertion. This fixes the typo, and unmutes the test. Closes #29456	2018-05-14 13:38:36 +00:00
Martijn van Groningen	7b95470897	Moved tokenizers to analysis common module (#30538 ) The following tokenizers were moved: classic, edge_ngram, letter, lowercase, ngram, path_hierarchy, pattern, thai, uax_url_email and whitespace. Left keyword tokenizer factory in server module, because normalizers directly depend on it. This should be addressed in a follow-up change. Relates to #23658	2018-05-14 07:55:01 +02:00
Nhat Nguyen	73ec90f1b9	Mute ShrinkIndexIT suite Relates #30416	2018-05-13 15:29:31 -04:00
Jason Tedor	593fdd40ed	Deprecate not copy settings and explicitly disallow (#30404 ) We want copying settings to be the default behavior. This commit deprecates not copying settings, and disallows explicitly not copying settings. This gives users a transition path to the future default behavior.	2018-05-13 10:30:05 -04:00
Nhat Nguyen	4c130a1054	Re-enable FlushIT tests These tests failed due to in flight operations on the primary shard. Sadly, we don't have any clue on those ops. This commit unmutes these tests and logs the acquirers when checking for ongoing ops. 1> [2018-05-02T23:10:32,145][INFO ][o.e.i.f.FlushIT ] Third seal: Total shards: [2], failed: [true], reason: [[1] ongoing operations on primary], detail: [] Relates #29392	2018-05-11 22:23:52 -04:00
Yannick Welsch	323bcd84a0	Delete temporary blobs before creating index file (#30528 ) Fixes an (un-released) bug introduced in #30332. Closes #30507	2018-05-11 14:34:11 +02:00
Yannick Welsch	cdcd4a1129	Use simpler write-once semantics for FS repository (#30435 ) The writeBlob method for FsBlobContainer already opens the file with StandardOpenOption.CREATE_NEW, so there's no need for an extra blobExists(blobName) check.	2018-05-11 10:02:07 +02:00
Julie Tibshirani	73b08d937b	Mute two tests in FlushIT with @AwaitsFix. The issue is being tracked in #29392.	2018-05-10 23:16:39 -07:00
Julie Tibshirani	6129d88e07	Mute UnicastZenPingTests#testSimplePings with @AwaitsFix. This failure is being tracked in #28685.	2018-05-10 14:21:18 -07:00
Julie Tibshirani	1112fac206	Mute SharedClusterSnapshotRestoreIT#testSnapshotSucceedsAfterSnapshotFailure with @AwaitsFix. The issue is being tracked in #30507.	2018-05-10 10:20:11 -07:00
Nhat Nguyen	519768b5d3	Upgrade to Lucene-7.4-snapshot-6705632810 (#30519 ) This snapshot is to include LUCENE-8298 which allows DocValues updates to reset a value. This is needed for the Lucene rollback work.	2018-05-10 12:31:45 -04:00
Paul Sanwald	e79894aa52	add version compatibility from 6.4.0 after backport, see #30319 (#30390 )	2018-05-10 12:27:44 -04:00
Igor Motov	2a79d9234b	Add proper longitude validation in geo_polygon_query (#30497 ) Fixes longitude validation in geo_polygon_query builder. The queries with wrong longitude currently fail but only later during polygon with quite complicated error message. Fixes #30488	2018-05-10 11:14:08 -04:00
David Turner	df17f85e14	Remove Discovery.AckListener.onTimeout() (#30514 ) The MasterService takes responsibility for timeouts of the AckListeners that it creates, and the rest of the Discovery subsystem is unaware of these timeouts, so there's no need for this to appear in the Discovery.AckListener interface. Also fix a typo in the name of DelegatingAckListener.	2018-05-10 15:27:38 +01:00
Jason Tedor	bf2365d13b	Remove BWC repository test (#30500 ) This commit removes a test that we can not restore from 1.x and 2.x repository files. This test is not needed, the version of Elasticsearch that this commit targets can not even read index files from those versions.	2018-05-09 23:24:54 -04:00
Julie Tibshirani	9828e11709	Expose CommonStatsFlags directly in IndicesStatsRequest. (#30163 ) This allows us to simplify the logic in a couple places where all flags need to be accessed.	2018-05-09 14:25:28 -07:00
Jason Tedor	4defaa4f2d	Avoid deadlocks in cache (#30461 ) This commit avoids deadlocks in the cache by removing dangerous places where we try to take the LRU lock while completing a future. Instead, we block for the future to complete, and then execute the handling code under the LRU lock (for example, eviction).	2018-05-09 11:52:38 -04:00
Boaz Leskes	54122d8464	mute SplitIndexIT due to https://github.com/elastic/elasticsearch/issues/30416	2018-05-09 15:49:06 +02:00
Yu	2228e6e663	BulkProcessor to retry based on status code (#29329 ) Previously `BulkProcessor` retry logic was based on the exception type of the failed response (`EsRejectedExecutionException`). This commit changes it to be based on the returned status code. This allows us to reproduce the same retry behaviour when the `BulkProcessor` is used from the high-level REST client, which was previously not the case as we cannot rebuild the same exception type when parsing back the response. This change has no effect on the transport client. Closes #28885	2018-05-09 14:27:58 +02:00
Michael Basnight	3b9c8204a6	Add GET Repository High Level REST API (#30362 ) This commit adds the Snapshot Client with a first API call within it, the get repositories call in snapshot/restore module. This also creates a snapshot namespace for the docs, as well as get repositories docs. Relates #27205	2018-05-09 07:25:23 -05:00
Boaz Leskes	ad564240b1	add a comment explaining the need for RetryOnReplicaException on missing mappings	2018-05-09 14:19:50 +02:00
Yu	106bed90c7	Add `coordinating_only` node selector (#30313 ) Today we can execute cluster API actions on only master, data or ingest nodes using the `master:true`, `data:true` and `ingest:true` filters, but it is not so easy to select coordinating-only nodes (i.e. those nodes that are neither master nor data nor ingest nodes). This change fixes this by adding support for a `coordinating_only` filter such that `coordinating_only:true` adds all coordinating-only nodes to the set of selected nodes, and `coordinating_only:false` deletes them. Resolves #28831.	2018-05-09 12:14:07 +01:00
Ke Li	0c6789bc72	Use date format in `date_range` mapping before fallback to default (#29310 ) If the date format is not forced in query, use the format in mapping before fallback to the default format. Closes #29282	2018-05-09 09:41:44 +02:00
aditya-agrawal	27ddb4ffea	Avoid NPE in `more_like_this` when field has zero tokens (#30365 ) Fixes an edge case when using `more_like_this` where TermVectorsWriter could throw an NPE when a field produced zero tokens after analysis. This changes the implementation to use an empty list of tokens in this case. Closes #30148	2018-05-08 15:13:07 +02:00
Jack Conradson	1b22477104	Silence SplitIndexIT.testSplitIndexPrimaryTerm test failure. (#30432 )	2018-05-07 13:35:28 -07:00
Yannick Welsch	82b251adcf	Auto-expand replicas when adding or removing nodes (#30423 ) Auto-expands replicas in the same cluster state update (instead of a follow-up reroute) where nodes are added or removed. Closes #1873, fixing an issue where nodes drop their copy of auto-expanded data when coming up, only to sync it again later.	2018-05-07 22:26:31 +02:00
Jason Tedor	ec939dc012	Fix line length violation in cache tests This commit fixes a line-length violation in the cache tests that was hidden by the IDE folding the generics.	2018-05-07 14:12:38 -04:00
Igor Motov	6fb189ce47	Add stricter geohash parsing (#30376 ) Adds verification that geohashes are not empty and contain only valid characters. It fixes the issue where an empty geohash is treated as [-180, -90] and geohashes with non-geohash characters are resolved into invalid coordinates. Closes #23579	2018-05-07 13:56:39 -04:00
Jason Tedor	68760ec5da	Add failing test for core cache deadlock The core cache implementation has a deadlock bug. This commit adds a failing test case.	2018-05-07 13:01:37 -04:00
Stéphane Campinas	39623402fc	Pass the task to broadcast actions (#29672 ) Since the task is required as per line 292, give the opportunity to broadcast actions to handle tasks.	2018-05-07 13:47:31 +02:00
Tanguy Leroux	1987d6261f	Do not fail snapshot when deleting a missing snapshotted file (#30332 ) When deleting or creating a snapshot for a given shard, elasticsearch usually starts by listing all the existing snapshotted files in the repository. Then it computes a diff and deletes the snapshotted files that are not needed anymore. During this deletion, an exception is thrown if the file to be deleted does not exist anymore. This behavior is challenging with cloud based repository implementations like S3 where a file that has been deleted can still appear in the bucket for a few seconds/minutes (because the deletion can take some time to be fully replicated on S3). If the deleted file appears in the listing of files, then the following deletion will fail with a NoSuchFileException and the snapshot will be partially created/deleted. This pull request makes the deletion of these files a bit less strict, i.e. not failing if the file we want to delete does not exist anymore. It introduces a new BlobContainer.deleteIgnoringIfNotExists() method that can be used at some specific places where not failing when deleting a file is considered harmless. Closes #28322	2018-05-07 09:35:55 +02:00
Nhat Nguyen	16d6a0bfb3	AwaitsFix testCreateShrinkIndexToN Relates #30416	2018-05-06 22:07:42 -04:00
Nhat Nguyen	eed8a3b585	Add put index template api to high level rest client (#30400 ) Relates #27205	2018-05-06 09:47:36 -04:00
Boaz Leskes	b46d01d409	Relax testAckedIndexing to allow document updating The test indexes new documents and is thus correct in testing that the response result is `CREATED`. Sadly we can't guarantee exactly once delivery just yet. Relates #9967 Closes #21658	2018-05-06 13:06:16 +02:00
Jason Tedor	beee5fe004	Respect accept header on no handler (#30383 ) Today when processing a request for a URL path for which we can not find a handler we send back a plain-text response. Yet, we have the accept header in our hand and can respect the accepted media type of the request. This commit addresses this.	2018-05-04 18:13:50 -04:00
Ioannis Kakavas	21bc87a65b	Use readFully() to read bytes from CipherInputStream (#28515 ) Changes how data is read from CipherInputStream. Instead of using `read()` and checking that the bytes read are what we expect, use `readFully()`, which keeps reading until exactly the requested number of bytes has been read and throws an `EOFException` if the stream ends before all bytes can be read. This approach keeps the simplicity of using CipherInputStream while working as expected with both JCE and BCFIPS Security Providers.	2018-05-04 20:13:27 +03:00
tomcallahan	0a93956194	Add Get Settings API support to java high-level rest client (#29229 ) This PR adds support for the Get Settings API to the java high-level rest client. Furthermore, logic related to the retrieval of default settings has been moved from the rest layer into the transport layer, and default settings may now be retrieved consistently via both the rest API and the transport API.	2018-05-04 11:14:28 -04:00
Jim Ferenczi	719ab30c32	Set the new lucene version for 6.4.0	2018-05-04 12:15:51 +02:00
Jim Ferenczi	dbd857341f	Upgrade to 7.4.0-snapshot-1ed95c097b (#30357 ) Upgrade to lucene-7.4.0-snapshot-1ed95c097b This version contains: * An Analyzer for Korean * An IntervalQuery and IntervalsSource that retrieve minimum intervals of positional queries. * A new API to retrieve matches (offsets and positions) of a query for a single document. * Support for soft deletes in the index writer. * A fixed shingle filter that handles index time synonyms. * Support for emoji sequence in ICUTokenizer (with an upgrade to icu 61.1)	2018-05-04 11:44:22 +02:00
Michael Basnight	5f8101a44c	Make RepositoriesMetaData contents unmodifiable (#30361 ) This commit makes the RepositoriesMetaData backing list no longer modifiable. Ref #30333	2018-05-03 13:14:54 -05:00
Boaz Leskes	ccd791b3b4	InternalEngineTests.testConcurrentOutOfOrderDocsOnReplica should use two documents (#30121 ) We were recently looking at bugs that can only occur if two different documents were indexed concurrently. For example, what happens if the local checkpoint advances above the sequence number of a document that's being indexed. That can only happen if another concurrent operation caused the checkpoint to advance. It has to be another document to allow concurrency as we acquire a per uid lock. While our investigation proved that the suspected bug doesn't exist, we still discovered our unit testing coverage is not good enough to cover this case. This PR extends the test for concurrent out-of-order replica processing to use two documents in its history.	2018-05-03 14:57:48 +02:00
Michael Basnight	bdd43fa69f	Change signature of Get Repositories Response (#30333 ) The Get Repositories response object held a list of RepositoryMetaData entries. This object does not have the from/toXContent methods that are needed to expose this to the high level REST client. The RepositoriesMetaData, however, does, and it also contains a list of RepositoryMetaData objects within it. So rather than duplicate this logic or move it (RepositoriesMetaData is a fragment object used by cluster state), the object holding state in the Response was changed to use the RepositoriesMetaData instead. This also cleans up the read/write methods in the response, as they can now use the same read/write in RepositoriesMetaData, which also were not present in the singular class.	2018-05-03 07:22:59 -05:00
Zachary Tong	3c2d2a7d4a	Fix NPE when CumulativeSum agg encounters null/empty bucket (#29641 ) Fix NPE when CumulativeSum agg encounters null/empty bucket If the cusum agg encounters a null value, it's because the value is missing (like the first value from a derivative agg), the path is not valid, or the bucket in the path was empty. Previously cusum would just explode on the null, but this changes it so we only increment the sum if the value is non-null and finite. This is safe because even if the cusum encounters all null or empty buckets, the cumulative sum is still zero (like how the sum agg returns zero even if all the docs were missing values) I went ahead and tweaked AggregatorTestCase to allow testing pipelines, so that I could delete the IT test and reimplement it as AggTests. Closes #27544	2018-05-02 12:22:55 -07:00
Ryan Ernst	fb0aa562a5	Network: Remove http.enabled setting (#29601 ) This commit removes the http.enabled setting. While all real nodes (started with bin/elasticsearch) will always have an http binding, there are many tests that rely on the quickness of not actually needing to bind to 2 ports. For this case, the MockHttpTransport.TestPlugin provides a dummy http transport implementation which is used by default in ESIntegTestCase. closes #12792	2018-05-02 11:42:05 -07:00
James Baiera	6d6da7c661	Fix merging logic of Suggester Options (#29514 ) Suggester Options have a collate match field that is returned when the prune option is set to true. These values should be merged together in the query reduce phase, otherwise good suggestions that result in rare hits in shards with results that do not arrive first may be incorrectly marked as not matching the collate query.	2018-05-02 14:40:57 -04:00
Boaz Leskes	13917162ad	ReplicationTracker.markAllocationIdAsInSync may hang if allocation is cancelled (#30316 ) At the end of recovery, we mark the recovering shard as "in sync" on the primary. From this point on the primary will treat any replication failure on it as critical and will reach out to the master to fail the shard. To do so, we wait for the local checkpoint of the recovered shard to be above the global checkpoint (in order to maintain global checkpoint invariant). If the master decides to cancel the allocation of the recovering shard while we wait, the method can currently hang and fail to return. It will also ignore the interrupts that are triggered by the cancelled recovery due to the primary closing. Note that this is crucial as this method is called while holding a primary permit. Since the method never comes back, the permit is never released. The unreleased permit will then block any primary relocation and while the primary is trying to relocate all indexing will be blocked for 30m as it waits to acquire the missing permit.	2018-05-02 19:40:29 +02:00
Boaz Leskes	af45b4dee4	Cancelling a peer recovery on the source can leak a primary permit (#30318 ) The code in `SourceRecoveryHandler` runs under a `CancellableThreads` instance in order to allow long running operations to be interrupted when the recovery is cancelled. Sadly if this happens at just the wrong moment while acquiring a permit from the primary, that primary can be leaked and never be freed. Note that this is slightly better than it sounds - we only cancel recoveries on the source side if the primary shard itself is closed. Relates to https://github.com/elastic/elasticsearch/pull/30316	2018-05-02 18:01:29 +02:00
Ryan Ernst	916bf9d26d	Convert server javadoc to html5 (#30279 ) This commit converts the remaining javadocs in :server using html4 to html5. This was mostly converting `tt` to `{@code}`.	2018-05-02 08:08:54 -07:00
Adrien Grand	368ddc408f	Remove MapperService#types(). (#29617 ) This isn't necessary with a single type per index.	2018-05-02 11:35:12 +02:00
Adrien Grand	7358946bda	Add a new `_ignored` meta field. (#29658 ) This adds a new `_ignored` meta field which indexes and stores fields that have been ignored at index time because of the `ignore_malformed` option. It makes malformed documents easier to identify by using `exists` or `term(s)` queries on the `_ignored` field. Closes #29494	2018-05-02 10:47:02 +02:00
Paul Sanwald	00b21f886a	Fix failure for validate API on a terms query (#29483 ) * WIP commit to try calling rewrite on coordinating node during TransportSearchAction * Use re-written query instead of using the original query * fix incorrect/unused imports and wildcarding * add error handling for cases where an exception is thrown * correct exception handling such that integration tests pass successfully * fix additional case covered by IndicesOptionsIntegrationIT. * add integration test case that verifies queries are now valid * add optional value for index * address review comments: catch superclass of XContentParseException fixes #29483	2018-05-01 13:38:22 -07:00
Michael Basnight	62a9b8909e	Remove RepositoriesMetaData variadic constructor (#29569 ) The variadic constructor was only used in a few places and the RepositoriesMetaData class is backed by a List anyway, so just using a List will make it simpler to instantiate it.	2018-05-01 15:02:06 -05:00
Nhat Nguyen	038fe1151b	TEST: Add debug log to FlushIT We still don't have a strong reason for the failures of testDoNotRenewSyncedFlushWhenAllSealed and testSyncedFlushSkipOutOfSyncReplicas. This commit adds debug logging for these two tests.	2018-05-01 10:15:03 -04:00
Diwas Joshi	dd5fcb211d	index name added to snapshot restore exception (#29604 ) This PR adds index name to snapshot restore exception if index is renamed during restoring. closes [#27601](https://github.com/elastic/elasticsearch/issues/27601)	2018-05-01 15:16:38 +02:00
Jason Tedor	5de6f4ff7b	Adjust copy settings on resize BWC version This commit adjusts the BWC version for copy settings on resize operations after the behavior was backported to 6.x.	2018-05-01 08:49:16 -04:00
Jason Tedor	50535423ff	Allow copying source settings on resize operation (#30255 ) Today when an index is created from shrinking or splitting an existing index, the target index inherits almost none of the source index settings. This is surprising and a hassle for operators managing such indices. Given this is the default behavior, we can not simply change it. Instead, we start by introducing the ability to copy settings. This flag can be set on the REST API or on the transport layer and it has the behavior that it copies all settings from the source except non-copyable settings (a property of a setting introduced in this change). Additionally, settings on the request will always override. This change is the first step in our adventure: - this flag is added here in 7.0.0 and immediately deprecated - this flag will be backported to 6.4.0 and remain deprecated - then, we will remove the ability to set this flag to false in 7.0.0 - finally, in 8.0.0 we will remove this flag and the only behavior will be for settings to be copied	2018-05-01 08:48:19 -04:00
Nik Everett	99b98fab18	Core: Pick inner most parse exception as root cause (#30270 ) Just like `ElasticsearchException`, the inner most `XContentParseException` tends to contain the root cause of the exception and should be shown to the user in the `root_cause` field. This effectively undoes most of the changes that #29373 made to the `root_cause` for parsing exceptions. The `type` field still changes from `parse_exception` to `x_content_parse_exception`, but this seems like a fairly safe change. `ElasticsearchWrapperException` looks tempting to implement this but the behavior isn't quite right. `ElasticsearchWrapperException`s are entirely unwrapped until the cause no longer `implements ElasticsearchWrapperException` but `XContentParseException` should be unwrapped until its cause is no longer an `XContentParseException` but no further. In other words, `ElasticsearchWrapperException`s are unwrapped one step too far. Closes #30261	2018-05-01 07:44:58 -04:00
Luca Cavanna	acdf330a0e	Minor DocWriteResponse changes (#29675 ) Remove double if depending on the Result value. It makes little sense to pass in a boolean flag based on a Result value that we already have, if that internally is represented again as a `Result` value. Also changed the `Result` `lowercase` instance member to be computed based on `name()` instead of `toString()` which is safer and to use `Locale.ROOT` instead of `Locale.ENGLISH`	2018-05-01 09:35:09 +02:00
Boaz Leskes	4a537ef03c	Bulk operations fail to replicate operations when a mapping update times out (#30244 ) Starting with the refactoring in https://github.com/elastic/elasticsearch/pull/22778 (released in 5.3) we may fail to properly replicate operations when a mapping update on master fails. If a bulk operation needs a mapping update half way, it will send a request to the master before continuing to index the operations. If that request times out or isn't acked (i.e., even one node in the cluster didn't process it within 30s), we end up throwing the exception and aborting the entire bulk. This is a problem because all operations that were processed so far are not replicated any more to the replicas. Although these operations were never "acked" to the user (we threw an error), this causes the local checkpoint on the replicas to lag (on 6.x) and the primary and replica to diverge. This PR does a couple of things: 1) Most importantly, treat any mapping update failure as a document level failure, meaning only the relevant indexing operation will fail. 2) Removes the mapping update callbacks from `IndexShard.applyIndexOperationOnPrimary` and similar methods for simpler execution. We don't use exceptions any more when a mapping update was successful. I think we need to do more work here (the fact that a single slow node can prevent those mapping updates from being acked and thus fail operations is bad), but I want to keep this as small as I can (it is already too big).	2018-05-01 08:15:02 +02:00
Chris Earle	725a5af2c6	_cluster/state should always return cluster_uuid (#30143 ) Currently, the only way to get the REST response for the `/_cluster/state` call to return the `cluster_uuid` is to request the `metadata` metrics, which is one of the most expensive response structures. However, external monitoring agents will likely want the `cluster_uuid` to correlate the response with other API responses whether or not they want cluster metadata.	2018-04-30 10:16:11 -04:00
Jason Tedor	811f5b4efc	Do not ignore request analysis/similarity on resize (#30216 ) Today when a resize operation is performed, we copy the analysis, similarity, and sort settings from the source index. It is possible for the resize request to include additional index settings including analysis, similarity, and sort settings. We reject sort settings when validating the request. However, we silently ignore analysis and similarity settings on the request that are already set on the source index. Since it is possible to change the analysis and similarity settings on an existing index, this should be considered a bug and the sort of leniency that we abhor. This commit addresses this bug by allowing the request analysis/similarity settings to override the existing analysis/similarity settings on the target.	2018-04-30 07:31:36 -04:00
Tanguy Leroux	a6624bb742	[Test] Update test in SharedClusterSnapshotRestoreIT (#30200 ) The `testDeleteSnapshotWithMissingIndexAndShardMetadata` test uses an obsolete repository directory structure based on index names instead of UUIDs. Because it swallows exceptions when deleting test files the test never failed when the directory structure changed. This commit fixes the test to use the right directory structure and file names and to not swallow exceptions anymore.	2018-04-30 09:48:03 +02:00
Jason Tedor	0a6312a5e6	Collapse REST resize handlers (#30229 ) The REST resize handlers for shrink/split operations are effectively the same code with a minor difference. This commit collapse these handlers into a single base class.	2018-04-29 08:58:11 -04:00
Jason Tedor	bdde2b9824	Rename request variables in shrink/split handlers (#30207 ) This is a code-tidying PR, a little side adventure while working on another change. Previously only shrink request existed but when the ability to split indices was added, shrink and split were done together under a single request object: the resize request object. However, the code inherited the legacy name in the naming of some variables. This commit cleans this up.	2018-04-28 01:09:44 -04:00
Julie Tibshirani	f5978d6d33	In the field capabilities API, remove support for providing fields in the request body. (#30185 )	2018-04-27 16:14:11 -07:00
Nhat Nguyen	9c586a2f07	Do not log warn shard not-available exception in replication (#30205 ) Since #28049, only fully initialized shards receive write requests. This enhancement allows us to handle all exceptions. In #28571, we started strictly handling shard-not-available exceptions and tried to keep the way we report replication errors to users by only reporting if the error is not a shard-not-available exception. However, since then we unintentionally always log warn for all exceptions. This change restores the previous behavior, which logs warn only if an exception is not a shard-not-available exception. Relates #28049 Relates #28571	2018-04-27 16:45:42 -04:00
Nik Everett	f4ed902698	CCS: Drop http address from remote cluster info (#29568 ) They are expensive to fetch and no longer needed by Kibana so they shouldn't be needed by anyone else either. Closes #29207	2018-04-27 14:19:00 -04:00
Julie Tibshirani	d633130e1b	Convert FieldCapabilitiesResponse to a ToXContentObject. (#30182 )	2018-04-27 09:47:11 -07:00
Tanguy Leroux	63148dd9ba	Fail snapshot operations early on repository corruption (#30140 ) A NullPointerException is thrown when trying to create or delete a snapshot in a repository that has been written to by an older Elasticsearch after writing to it with a newer Elasticsearch version. This is because the way snapshots are formatted in the repository snapshots index file changed in #24477. This commit changes the parsing of the repository index file so that it now detects a corrupted index file and fails the snapshot operation early. closes #29052	2018-04-27 16:29:59 +02:00
Jim Ferenczi	c08daf2589	Build global ordinals terms bucket from matching ordinals (#30166 ) The global ordinals terms aggregator has an option to remap global ordinals to dense ordinal that match the request. This mode is automatically picked when the terms aggregator is a child of another bucket aggregator or when it needs to defer buckets to an aggregation that is used in the ordering of the terms. Though when building the final buckets, this aggregator loops over all possible global ordinals rather than using the hash map that was built to remap the ordinals. For fields with high cardinality this is highly inefficient and can lead to slow responses even when the number of terms that match the query is low. This change fixes this performance issue by using the hash table of matching ordinals to perform the pruning of the final buckets for the terms and significant_terms aggregation. I ran a simple benchmark with 1M documents containing 0 to 10 keywords randomly selected among 1M unique terms. This field is used to perform a multi-level terms aggregation using rally to collect the response times. The aggregation below is an example of a two-level terms aggregation that was used to perform the benchmark: ``` "aggregations":{ "1":{ "terms":{ "field":"keyword" }, "aggregations":{ "2":{ "terms":{ "field":"keyword" } } } } } ``` \| Levels of aggregation \| 50th percentile ms (master) \| 50th percentile ms (patch) \| \| --- \| --- \| --- \| \| 2 \| 640.41ms \| 577.499ms \| \| 3 \| 2239.66ms \| 600.154ms \| \| 4 \| 14141.2ms \| 703.512ms \| Closes #30117	2018-04-27 15:26:46 +02:00
Alexander Reelsen	e1a16a6018	REST: Remove GET support for clear cache indices (#29525 ) Clearing the cache indices can be done via GET and POST. As GET should only support read only operations, this removes the support for using GET for clearing the indices caches.	2018-04-27 08:41:36 +02:00
Julie Tibshirani	0d8aed8c2b	Fix a bug in FieldCapabilitiesRequest#equals and hashCode. (#30181 ) Also update its unit test to AbstractStreamableTestCase for better coverage.	2018-04-26 16:09:27 -07:00
Jim Ferenczi	80e0e64bfe	Fix SliceBuilderTests#testRandom failures Add missing shard context creation in a random test.	2018-04-26 22:18:39 +02:00
Julie Tibshirani	d40116d260	Add support for field capabilities to the high-level REST client. (#29664 )	2018-04-26 09:50:37 -07:00
Tanguy Leroux	e864b93abf	Fix TermsSetQueryBuilder.doEquals() method (#29629 ) Closes #29620	2018-04-26 17:47:16 +02:00
Nhat Nguyen	16490d7dfa	TEST: Update settings should go through cluster state (#29682 ) Today we update index settings directly via IndexService instead of the cluster state in IndexServiceTests. However, those changes will be lost if there is a cluster state update. In general, we should update index settings via client and limit the direct usage in only special tests. This commit replaces direct usages by the updateSettings api of client. Closes #24491	2018-04-26 09:28:14 -04:00
Jim Ferenczi	752ba2fb45	Adjust serialization versions after backport Relates #29533	2018-04-26 14:06:56 +02:00
Jim Ferenczi	8b8c0c0b4d	Add additional shards routing info in ShardSearchRequest (#29533 ) This commit propagates the preference and routing of the original SearchRequest in the ShardSearchRequest. This information is then used to fix a bug in sliced scrolls when executed with a preference (or a routing). Instead of computing the slice query from the total number of shards in the index, this commit computes this number from the number of shards per index that participate in the request. Fixes #27550	2018-04-26 09:58:17 +02:00
Julie Tibshirani	32dfb65144	In the field capabilities API, deprecate support for providing fields in the request body. (#30157 ) (cherry picked from commit d8d884b29d4aa7d01070484fee5de8d3db60cb25)	2018-04-25 23:01:53 -07:00
Nhat Nguyen	52c50e353b	Do not add noop from local translog to translog again (#29637 ) Today we always add no-ops to translog regardless of its origin, thus a noop may appear in the translog multiple times. This is not a big deal as noops are small and rare to appear. This commit ensures to add a noop to translog only if its origin is not from local translog. This restriction has been applied for index and delete.	2018-04-25 21:02:12 -04:00
Jason Tedor	2c3e71f116	Remove the suggest metric from stats APIs (#29635 ) This metric previously existed for backwards compatibility reasons although the suggest stats were folded into search stats. This metric was deprecated in 6.3.0 and this commit removes them for 7.0.0.	2018-04-24 19:03:48 -04:00
Jason Tedor	25e45a765c	Fix byte size value equals/hash code test (#29643 ) This commit fixes two issues with the byte size value equals/hash code test. The first problem is due to a test failure when the original instance is zero bytes and we pick the mutation branch where we preserve the size but change the unit. The mutation should result in a different byte size value but changing the unit on zero bytes still leaves us with zero bytes. During the course of fixing this test I discovered another problem. When we need to randomize size, we could randomly select a size that would lead to an overflow of Long.MAX_VALUE. This commit fixes both of these issues.	2018-04-24 19:01:27 -04:00
Jason Tedor	bdf241347d	Add 6.4.0 version to master (#29684 ) This commit adds the 6.4.0 version constant to the master branch.	2018-04-24 18:21:37 -04:00
Jason Tedor	3cadd5c40c	Only enable modules to have native controllers This commit removes the ability for a plugin to have a native controller, so that only modules can have a native controller.	2018-04-20 15:34:02 -07:00
Jason Tedor	d99d0fa669	Add distribution type to startup scripts This commit adds the distribution type to the startup scripts so that we can discern from log output and the main response the type of the distribution (deb/rpm/tar/zip).	2018-04-20 15:34:01 -07:00
Jason Tedor	e64e6d8996	Add distribution flavor to startup scripts This commit adds the distribution flavor (default versus oss) to the build process which is passed through the startup scripts to Elasticsearch. This change will be used to customize the message on attempting to install/remove x-pack based on the distribution flavor.	2018-04-20 15:33:58 -07:00
Ryan Ernst	fab5e21e7d	Build: Split distributions into oss and default This commit makes x-pack a module and adds it to the default distribution. It also creates distributions for zip, tar, deb and rpm which contain only oss code.	2018-04-20 15:33:57 -07:00
Yannick Welsch	6a4c5f3e93	Abort early on finding duplicate snapshot name in internal structures (#29634 ) Adds a check in BlobstoreRepository.snapshot(...) that prevents duplicate snapshot names and fails the snapshot before writing out the new index file. This ensures that you cannot end up in a situation where the index file has duplicate names and cannot be read anymore. Relates to #28906	2018-04-20 17:32:34 +02:00
Jason Tedor	0045111ce2	Deprecate the suggest metrics (#29627 ) The suggest stats were folded into the search stats as part of the indices stats API in 5.0.0. However, the suggest metric remained as a synonym for the search metric for BWC reasons. This commit deprecates usage of the suggest metric on the indices stats API. Similarly, due to the changes to fold the suggest stats into the search stats, requesting the suggest index metric on the indices metric on the nodes stats API has produced an empty object as the response since 5.0.0. This commit deprecates this index metric on the indices metric on the nodes stats API.	2018-04-20 09:47:38 -04:00
Jay Modi	dfc7ca7214	Implement Iterator#remove for Cache values iter (#29633 ) This commit implements the ability to remove values from a Cache using the values iterator. This brings the values iterator in line with the keys iterator and adds support for removing items in the cache that are not easily found by the key used for the cache.	2018-04-20 07:21:08 -06:00
Nhat Nguyen	42d81a2945	TEST: Unmute testPrimaryRelocationWhileIndexing Previously we did not put an indexing operation into a version map if that map did not require safe access, but removed the existing delete tombstone only if assertions were enabled. In #29585, we removed the side effect caused by assertions, and then this test started failing. This failure can be explained as follows: - Step 1: Index a doc then delete that doc - Step 2: The version map can switch to unsafe mode because of concurrent refreshes (implicitly called by flushes) - Step 3: Index a document - the version map won't add this version value and won't prune the tombstone (previously it did) - Step 4: Delete a document - this will return NOT_FOUND instead of DELETED because of the stale delete tombstone This failure is actually fixed by #29619, in which we never leave stale delete tombstones. Closes #29626	2018-04-19 21:35:21 -04:00
Ryan Ernst	7975280383	Remove remaining tribe node references (#29574 ) While tribe node was removed in https://github.com/elastic/elasticsearch/pull/28443, there remained a couple lingering references to it in docs and code. This commit removes those remaining references.	2018-04-19 18:02:01 -07:00
Nhat Nguyen	9cf8b01fc4	Never leave stale delete tombstones in version map (#29619 ) Today the VersionMap does not clean up a stale delete tombstone if it does not require safe access. However, in a very rare situation due to concurrent refreshes, the safe-access flag may be flipped over, and an engine may then accidentally consult that stale delete tombstone. This commit ensures that we never leave stale delete tombstones in a version map by always pruning delete tombstones when putting a new index entry, regardless of the value of the safe-access flag.	2018-04-19 20:49:56 -04:00
Jason Tedor	d1670a18e4	Do not serialize common stats flags using ordinal (#29600 ) This commit removes serializing of common stats flags via the enum ordinal and uses an explicit index defined on the enum. This is to enable us to remove an unused flag (Suggest) without ruining the ordering and thus breaking serialization.	2018-04-19 20:12:24 -04:00
Jason Tedor	a829d920ee	Remove stale comment from JVM stats (#29625 ) We removed catched throwable from the code base and left behind was a comment about catching InternalError in MemoryManagementMXBean. We are not going to catch InternalError here as we expect that to be fatal. This commit removes that stale comment.	2018-04-19 19:56:03 -04:00
Nhat Nguyen	293f85cd52	TEST: Mute testPrimaryRelocationWhileIndexing AwaitsFix #29626	2018-04-19 19:15:30 -04:00
Jason Tedor	5d767e449a	Remove bulk fallback for write thread pool (#29609 ) The name of the bulk thread pool was renamed to "write" with "bulk" as a fallback name. This change was made in 6.x for BWC reasons yet in 7.0.0 we are removing this fallback. This commit removes this fallback for the write thread pool.	2018-04-19 16:59:58 -04:00
Julie Tibshirani	113d1d3eab	Fix an incorrect reference to 'zero_terms_docs' in match_phrase queries.	2018-04-19 13:24:14 -07:00
Julie Tibshirani	48461ac143	Update the version compatibility for zero_terms_query in match_phrase. The change was just backported to 6.x.	2018-04-19 13:20:44 -07:00
Nhat Nguyen	955709b3f3	Account translog location to ram usage in version map This commit accounts a translog location's ram usage in version map.	2018-04-19 16:05:33 -04:00
Julie Tibshirani	b9e1a00213	Add support to match_phrase query for zero_terms_query. (#29598 )	2018-04-19 11:25:27 -07:00
Julie Tibshirani	00d88a5d3e	Fix incorrect references to 'zero_terms_docs' in query parsing error messages. (#29599 )	2018-04-19 11:02:49 -07:00
Nhat Nguyen	1b24d4e68b	Avoid side-effect in VersionMap when assertion enabled (#29585 ) Today when a version map does not require safe access, we will skip that document. However, if assertions are enabled, we remove the delete tombstone of that document if it exists. This side-effect may accidentally hide bugs in which a stale delete tombstone can be accessed. This change ensures putAssertionMap does not modify the tombstone maps.	2018-04-19 12:38:10 -04:00
Christoph Büscher	24763d881e	Deprecate use of `htmlStrip` as name for HtmlStripCharFilter (#27429 ) The camel case name `htmlStrip` should be removed in favour of `html_strip`, but we need to deprecate it first. This change adds deprecation warnings for indices with version starting with 6.3.0 and logs deprecation warnings in these cases.	2018-04-19 16:48:17 +02:00
Jason Tedor	c12c2a6cc9	Rename the bulk thread pool to write thread pool (#29593 ) This commit renames the bulk thread pool to the write thread pool. This is to better reflect the fact that the underlying thread pool is used to execute any document write request (single-document index/delete/update requests, and bulk requests). With this change, we add support for fallback settings thread_pool.bulk.* which will be supported until 7.0.0. We also add a system property so that the display name of the thread pool remains as "bulk" if needed to avoid breaking users.	2018-04-19 08:18:58 -04:00
Tanguy Leroux	e2d770d9b9	Fix missing node id prefix in startup logs (#29534 ) When `node.name` is not set, some log traces at startup time do not show the node id.	2018-04-19 09:40:25 +02:00
Ryan Ernst	98d776edaf	Networking: Deprecate http.enabled setting (#29591 ) This commit deprecates the http.enabled setting, in preparation for removing the feature in 7.0. relates #12792	2018-04-18 17:36:09 -07:00
Jason Tedor	2b47d67d95	Remove the index thread pool (#29556 ) Now that single-document indexing requests are executed on the bulk thread pool the index thread pool is no longer needed. This commit removes this thread pool from Elasticsearch.	2018-04-18 09:18:08 -04:00
Jim Ferenczi	9d11c7a6c1	Remove extra copy in ScriptDocValues.Strings This commit removes a BytesRef copy introduced in #29567 and not required. Relates #29567	2018-04-18 15:13:24 +02:00
Jim Ferenczi	a7c9857976	Fix binary doc values fetching in _search (#29567 ) Binary doc values are retrieved during the DocValueFetchSubPhase through an instance of ScriptDocValues. Since 6.0 ScriptDocValues instances are not allowed to reuse the object that they return (https://github.com/elastic/elasticsearch/issues/26775) but BinaryScriptDocValues doesn't follow this restriction and reuses instances of BytesRefBuilder among different documents. This results in `field` values assigned to the wrong document in the response. This commit fixes this issue by recreating the BytesRef for each value that needs to be returned. Fixes #29565	2018-04-18 13:01:06 +02:00
Jim Ferenczi	8b34066d8b	Mutes failing MovAvgIT tests Relates #29456	2018-04-18 10:54:45 +02:00
Julie Tibshirani	52858ba760	Fix the version ID for v5.6.10. (#29570 )	2018-04-17 16:04:16 -07:00
Dimitris Athanasiou	7969eb7db7	Add versions 5.6.10 and 6.2.5	2018-04-17 18:47:20 +01:00
Zachary Tong	cfc9d12acc	[TEST] test against scaled value instead of fixed epsilon in MovAvgIT When comparing doubles, fixed epsilons can fail because the absolute difference in values may be quite large, even though the relative difference is tiny (e.g. with two very large numbers). Instead, we can scale epsilon by the absolute value of the expected value. This means we are looking for a diff that is epsilon-percent away from the value, rather than just epsilon. This is basically checking the relative error using junit's assertEqual. Closes #29456, unmutes the test	2018-04-17 17:33:18 +00:00
Luca Cavanna	9c8ebb608f	Remove `flatSettings` support from request classes (#29560 ) As part of adding support for new API to the high-level REST client, we added support for the `flat_settings` parameter to some of our request classes. We added documentation that such flag is only ever read by the high-level REST client, but the truth is that it doesn't do anything given that settings are always parsed back into a `Settings` object, no matter whether they are returned in a flat format or not. It was a mistake to add support for this flag in the context of the high-level REST client, hence this commit removes it.	2018-04-17 18:18:21 +02:00
Adrien Grand	d7be9185c8	MapperService to wrap a single DocumentMapper. (#29511 ) This refactors MapperService so that it wraps a single `DocumentMapper` rather than a `Map<String, DocumentMapper>`. We will need follow-ups since I haven't fixed most APIs that still expose collections of types of mappers, but this is a start...	2018-04-17 17:11:27 +02:00
Igor Motov	983d6c15a2	Add null_value support to geo_point type (#29451 ) Adds support for null_value attribute to the geo_point types. Closes #12998	2018-04-17 10:19:54 -04:00
Nhat Nguyen	45c6c20467	Enforce translog access via engine (#29542 ) Today the translog of an engine is exposed and can be accessed directly. While this exposure offers much flexibility, it also causes these troubles: - Inconsistent behavior between translog method and engine method. For example, rolling a translog generation via an engine also trims unreferenced files, but translog's method does not. - An engine does not get notified when critical errors happen in translog as the access is direct. This change isolates translog of an engine and enforces all accesses to translog via the engine.	2018-04-17 08:03:41 -04:00
Jason Tedor	1dd0fd4874	Deprecate the index thread pool (#29540 ) The index thread pool is no longer needed as its primary use-case for single-document indexing requests has been relieved now that single-document indexing requests are converted to bulk indexing requests (with a single document payload).	2018-04-17 06:47:30 -04:00
Jason Tedor	faa7fe86c5	Introduce analyze thread pool (#29541 ) We want to remove the index thread pool as it is no longer needed since single-document indexing requests are executed as bulk requests now. Analyze requests are also executed on the index thread pool though and they need a thread pool to execute on. The bulk thread does not seem like the right thread pool, let us keep that thread pool conceptually for bulk requests and free for bulk requests. None of the existing thread pools make sense for analyze requests either. The generic thread pool would be a terrible choice since it has an unbounded queue and that is a bad idea for user-facing APIs. This commit introduces a small by default (size=1, queue_size=16) thread pool for analyze requests.	2018-04-17 06:46:15 -04:00
Adrien Grand	d223bcf7ab	Add the `include_type_name` option to the search and document APIs. (#29506 ) This commit adds the `include_type_name` option to the `index`, `update`, `delete`, `get`, `bulk` and `search` APIs. When set to `false`, the response will omit the `_type` in the response. This option doesn't work if the endpoint contains a type. For instance, the following call would succeed: ``` GET index/_doc/1?include_type_name=false ``` But the following one would fail: ``` GET index/some_type/1?include_type_name=false ``` Relates #15613	2018-04-17 11:29:08 +02:00
Nhat Nguyen	fd161d2659	TEST: Mute testEnsureWeReconnect Relates #29547	2018-04-16 18:31:34 -04:00
olcbean	b3e3b80f1b	REST high-level client: add support for Indices Update Settings API [take 2] (#29327 ) Relates to #27205	2018-04-16 21:39:11 +02:00
Jason Tedor	a8d4ee1620	Remove PipelineExecutionService#executeIndexRequest (#29537 ) With the move long ago to execute all single-document indexing requests as bulk indexing request, the method PipelineExecutionService#executeIndexRequest is unused and will never be used in production code. This commit removes this method and cuts over all tests to use PipelineExecutionService#executeBulkRequest.	2018-04-16 14:55:26 -04:00
Igor Motov	e334baf6fc	Fix overflow error in parsing of long geohashes (#29418 ) Fixes a possible overflow error that geohashes longer than 12 characters can cause during parsing. Fixes #24616	2018-04-16 12:37:38 -04:00
David Turner	34ec403a2e	Remove unused index.ttl.disable_purge setting (#29527 ) This setting does nothing, and is deprecated in the 6.x series by #29526. This change removes it entirely in 7.0.	2018-04-16 17:10:55 +01:00
Ke Li	0bfb59dcf2	Using ObjectParser in UpdateRequest (#29293 ) CRUD: Parsing changes for UpdateRequest (#29293) Use `ObjectParser` to parse `UpdateRequest` so we reject unknown fields and drop support for the `_fields` parameter because it was deprecated in 5.x.	2018-04-16 08:39:35 -04:00
Christoph Büscher	a004a33803	Prevent accidental changes of default values (#29528 ) The default percentiles values and the default highlighter per- and post-tags are currently publicly accessible and can be altered any time. This change prevents this by restricting field access.	2018-04-16 13:41:42 +02:00
Jason Tedor	00fd73acc4	Avoid self-deadlock in the translog (#29520 ) Today when reading an operation from the current generation fails tragically we attempt to close the translog. However, by invoking close before releasing the read lock we end up in self-deadlock because closing tries to acquire the write lock and the read lock can not be upgraded to a write lock. To avoid this, we move the close invocation outside of the try-with-resources that acquired the read lock. As an extra guard against this, we document the problem and add an assertion that we are not trying to invoke close while holding the read lock.	2018-04-15 16:26:09 -04:00
javanna	485d5d19bc	Mute TranslogTests#testFatalIOExceptionsWhileWritingConcurrently This test has been failing quite a few times with a suite timeout, opened #29509 for it.	2018-04-13 17:03:09 +02:00
Simon Willnauer	694e2a9970	Add remote cluster client (#29495 ) This change adds a client that is connected to a remote cluster. This allows plugins and internal structures to invoke actions on remote clusters just as if they were a local cluster. The remote cluster must be configured via the cross cluster search infrastructure.	2018-04-13 15:23:44 +02:00
Simon Willnauer	eab530ce11	Ensure flush happens on shard idle This adds 2 testcases that test if a shard goes idle pending (uncommitted) segments are committed and unreferenced files will be freed. Relates to #29482	2018-04-13 15:06:51 +02:00
Chandan83	782517b452	Adds SpanGapQueryBuilder in the query DSL (#28636 ) This change adds the support for a `span_gap` query inside the span query DSL.	2018-04-13 14:51:03 +02:00
Mayya Sharipova	5dcfdb09cb	Control max size and count of warning headers (#28427 ) Control max size and count of warning headers Add a static persistent cluster level setting "http.max_warning_header_count" to control the maximum number of warning headers in client HTTP responses. Defaults to unbounded. Add a static persistent cluster level setting "http.max_warning_header_size" to control the maximum total size of warning headers in client HTTP responses. Defaults to unbounded. With every warning header that exceeds these limits, a message will be logged in the main ES log, and any more warning headers for this response will be ignored.	2018-04-13 05:55:33 -04:00
Adrien Grand	553c718d66	Make index APIs work without types. (#29479 ) Unlike the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, the index APIs do not need the `include_type_name` option, they can work with and without types without knowing whether types are being used. Internally, `_doc` is used as a type if no type is provided, like for the `indices.put_mapping` API.	2018-04-13 09:08:45 +02:00
Adrien Grand	ebd6b5b7ba	Deprecate filtering on `_type`. (#29468 ) As indices are only allowed to have one type now, and types are going away in the future, we should deprecate filtering by `_type`. Relates #15613	2018-04-13 09:07:51 +02:00
Nhat Nguyen	f96e00badf	Add primary term to translog header (#29227 ) This change adds the current primary term to the header of the current translog file. Having a term in a translog header is a prerequisite step that allows us to trim translog operations given the max valid seq# for that term. This commit also updates tests to conform to the primary term invariant, which guarantees that all translog operations in a translog file have terms at most the term stored in the translog header.	2018-04-12 13:57:59 -04:00
Lee Hinman	14097359a4	Move TimeValue into elasticsearch-core project (#29486 ) This commit moves the `TimeValue` class into the elasticsearch-core project. This allows us to use this class in many of our other projects without relying on the entire `server` jar. Relates to #28504	2018-04-12 10:24:58 -06:00
Igor Motov	0aa19186ae	Fix NPE in InternalGeoCentroidTests#testReduceRandom (#29481 ) In some rare cases all inputs might have zero count and resulting in zero totalCount, and null in centroid causing NPE. Closes #29480	2018-04-12 10:13:40 -04:00
Martijn van Groningen	fac009630d	test: Index more docs, so that it is less likely the search request does not time out. Closes #29221	2018-04-12 11:41:41 +02:00
Nhat Nguyen	067fbb8ecd	Backport periodic flush count to v6.3.0 Relates #29360	2018-04-11 17:14:28 -04:00
Lee Hinman	263349f628	Decouple TimeValue from Elasticsearch server classes (#29454 ) * Decouple TimeValue from Elasticsearch server classes This commit decouples the `TimeValue` class from the other server classes. This is in preparation to move `TimeValue` into the `elasticsearch-core` jar, allowing us to use it from projects that cannot depend on the elasticsearch-core library. Relates to #28504	2018-04-11 14:58:15 -06:00
Nhat Nguyen	0ae627fc79	ElasticsearchMergePolicy extend from MergePolicyWrapper (#29476 ) The skeleton of ElasticsearchMergePolicy is quite similar to MergePolicyWrapper. This commit therefore makes ElasticsearchMergePolicy inherited from MergePolicyWrapper instead of MergePolicy.	2018-04-11 11:32:19 -04:00
Nhat Nguyen	4e6a8900a3	Add periodic flush count to flush stats (#29360 ) Currently, flush stats contain only the total flush count, which is the sum of manual flushes (via API) and periodic flushes (async, triggered when the uncommitted translog size exceeds the flush threshold). Sometimes, it's useful to know these two numbers independently. This commit tracks and returns a periodic flush count in the flush stats.	2018-04-11 11:15:33 -04:00
Adrien Grand	6a6c0ea5e6	Add an `include_type_name` option. (#29453 ) This adds an `include_type_name` option to the `indices.create`, `indices.get_mapping` and `indices.put_mapping` APIs, which defaults to `true`. When set to `false`, then mappings will be returned directly in the body of the `indices.get_mapping` API, without keying them by the type name, the `indices.create` will expect mappings directly under the `mappings` key, and the `indices.put_mapping` will use `_doc` as a type name and fail if a `type` is provided explicitly. Relates #15613	2018-04-11 15:54:16 +02:00
Simon Willnauer	45e7e24736	Restrict Document list access in ParseContext (#29463 ) Today we expose a mutable list of documents in ParseContext via ParseContext#docs(). This, on the one hand, places knowledge of how to access nested documents in multiple places and, on the other, allows for potential illegal access to nested-only docs after the docs are reversed. This change restricts the access and streamlines nested / non-root doc access.	2018-04-11 15:09:44 +02:00
Jim Ferenczi	1b6d5e531b	Fail _search request with trailing tokens (#29428 ) This change validates that the `_search` request does not have trailing tokens after the main object and fails the request with a parsing exception otherwise. Closes #28995	2018-04-11 13:10:22 +02:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows us to remove quite a lot of code, which in turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Andrew Odendaal	d15cad4afb	Grammar matters.. (#29462 ) Update `all indices on this node will marked read-only` to `all indices on this node will be marked read-only`	2018-04-11 09:30:33 +02:00
Jason Tedor	663a52ad55	Add useful message when no input from terminal (#29369 ) Today when a user runs a CLI tool with standard input closed and no tty attached, the result from reading is null and this usually leads to a null pointer exception when we try to parse this input. This arises for example when the user runs the plugin installer through a Docker container without leaving standard input open and attaching a tty (docker exec <container ID> bin/elasticsearch-plugin install). When we try to read that the user accepts the plugin requiring additional security permissions we will get back null. This commit addresses this for all cases by throwing an illegal state exception. The solution for the user is to leave standard input open and attach a tty (or, for some tools, use batch mode).	2018-04-10 21:50:39 -04:00
Zachary Tong	c341b41c54	[TEST] Temporarily silence MovAvgIT tests due to change in double comparisons #29409 removed the nearlyEquals() double comparison snippet, which makes these tests very flaky because they can generate very large or very small doubles which don't work well with absolute error comparison. We need to either refactor these tests to guarantee they stay in a small range (which could be difficult due to holt/holt-winters) or re-implement the more robust double comparison. Tracking issue: #29456	2018-04-10 20:45:33 +00:00
Jason Tedor	bca192a327	Simplify TranslogWriter#closeWithTragicEvent (#29412 ) This commit simplifies the exception handling in TranslogWriter#closeWithTragicEvent. When invoking this method, the inner close method could throw an exception which we always catch and suppress into the exception that led us to tragically close. This commit moves that repeated logic into closeWithTragicException and now callers simply need to catch, invoke closeWithTragicException, and rethrow.	2018-04-10 10:15:54 -04:00
Lee Hinman	0f40199d10	Remove custom PeriodType formatting from TimeValue (#29433 ) In order to decouple TimeValue from Joda, this removes the unused `format` methods. Relates to #28504	2018-04-10 08:02:56 -06:00
Adrien Grand	aeac682869	Make purely negative queries return scores of 0. (#26015 ) It would make them consistent with queries that are only made of filters. Closes #23449	2018-04-10 14:31:06 +02:00
Adrien Grand	a091d950a7	Deprecate slicing on `_uid`. (#29353 ) Deprecate slicing on `_uid`. `_id` should be used instead on 6.x.	2018-04-10 14:28:30 +02:00
Vladimir Dolzhenko	03d1a7e132	Version conflict exception message enhancement (#29432 ) Report that the doc is not found on PUT ?version=X, rather than that the current version [-1] is different than the one provided. Closes #21278	2018-04-10 13:42:59 +02:00
Christoph Büscher	13da9dd7c0	Remove 5x bwc in LocaleUtils#parse (#29417 ) Remove the special treatment of parsing the locale property for old 5.x indices since in 7.0 we only need to support reading from 6.x indices.	2018-04-10 12:40:36 +02:00
tomcallahan	ec65710926	Remove copy-pasted code (#29409 ) * Remove copy-pasted code We had two instances of copy-pasted code with a bad license from another website. The code was doing something rather simple, and that functionality already exists within junit. This PR simply leverages the junit functionality.	2018-04-09 18:32:32 -04:00
Adrien Grand	dfcce2d872	Speed up some of our slowest unit tests. (#29414 ) `BaseRandomBinaryDocValuesRangeQueryTestCase.testRandomBig` should only run with nightly tests. It doesn't make sense to make it part of every test run. `UUIDTests` had a slow test for compression, which I made a bit faster by decreasing the number of indexed docs.	2018-04-09 16:35:47 +02:00
Jim Ferenczi	d755fcfd4b	Fix date and ip sources in the composite aggregation (#29370 ) This commit fixes the formatting of the values in the composite aggregation response. `date` fields should return timestamp as longs when used in a `terms` source and `ip` fields should always be formatted as strings. This commit also fixes the parsing of the `after` key for these field types. Finally, this commit disables the index optimization for the `ip` field and any source that provides a `missing` value.	2018-04-09 10:49:29 +02:00
Jason Tedor	11a534932d	Simplify Translog#closeOnTragicEvent (#29413 ) This commit simplifies the invocations to Translog#closeOnTragicEvent. This method already catches all possible exceptions and suppresses the non-AlreadyClosedExceptions into the exception that triggered the invocation. Therefore, there is no need for callers to do this same logic (which would never execute).	2018-04-06 17:59:42 -04:00
Lee Hinman	a07ba9e400	Move Streams.copy into elasticsearch-core and make a multi-release jar (#29322 ) * Move Streams.copy into elasticsearch-core and make a multi-release jar This moves the method `Streams.copy(InputStream in, OutputStream out)` into the `elasticsearch-core` project (inside the `o.e.core.internal.io` package). It also makes this class into a multi-release class where the Java 9 equivalent uses `InputStream#transferTo`. This is a followup from https://github.com/elastic/elasticsearch/pull/29300#discussion_r178147495	2018-04-06 11:07:20 -06:00
Lee Hinman	a93c942927	Move ObjectParser into the x-content lib (#29373 ) * Move ObjectParser into the x-content lib This moves `ObjectParser`, `AbstractObjectParser`, and `ConstructingObjectParser` into the libs/x-content dependency. This decoupling allows them to be used for parsing for projects that don't want to depend on the entire Elasticsearch jar. Relates to #28504	2018-04-06 09:41:14 -06:00
Lee Hinman	160d25fcdb	Move Tuple into elasticsearch-core (#29375 ) * Move Tuple into elasticsearch-core This allows us to use Tuple from other projects that don't want to rely on the entire Elasticsearch jar. I have also added very simple tests, since there were none. Relates tangentially to #28504	2018-04-06 08:58:24 -06:00
Jason Tedor	cb3295b212	Close translog writer if exception on write channel (#29401 ) Today we close the translog writer tragically if we experience any I/O exception on a write. These tragic closes lead to us closing the translog and failing the engine. Yet, there is one case that is missed which is when we touch the write channel during a read (checking if reading from the writer would put us past what has been flushed). This commit addresses this by closing the writer tragically if we encounter an I/O exception on the write channel while reading. This becomes interesting when we consider that this method is invoked from the engine through the translog as part of getting a document from the translog. This means we have to consider closing the translog here as well which will cascade up into us finally failing the engine. Note that there is no semantic change to, for example, primary/replica resync and recovery. These actions will take a snapshot of the translog which syncs the translog to disk. If an I/O exception occurs during the sync we already close the writer tragically and once we have synced we do not ever read past the position that was synced while taking the snapshot.	2018-04-06 10:33:21 -04:00
Colin Goodheart-Smithe	55c8e80532	Fixes query_string query equals timezone check (#29406 ) * Fixes query_string query equals timezone check This change fixes a bug where two `QueryStringQueryBuilder`s were found to be equal if they had the same timezone set even if the query string in the builders were different Closes #29403 * Adds mutate function to QueryStringQueryBuilderTests * iter	2018-04-06 11:45:34 +01:00
Menno Oudshoorn	28631d7163	Fix some code smells in equals methods (#29348 ) Fixes instances of - Equals methods without type check - Equals methods where the field of `this` was compared to the same field of `this` instead of the `that` object that is compared to	2018-04-06 10:41:25 +01:00
Tanguy Leroux	ae2a9f7108	[Test] Fix SnapshotShardsServiceIT.testRetryPostingSnapshotStatusMessages This test requires a bit more time than 10 seconds for the snapshot to be completed, it is now 30s. Closes #29270	2018-04-06 10:24:55 +02:00
Jason Tedor	451a328281	Remove double space in BaseTranslogReader (#29400 ) My eyes! The goggles do nothing!	2018-04-05 17:54:59 -04:00
Jason Tedor	e9576806e8	Remove dead write checkpoint method in translog (#29402 ) This commit removes a dead method from TranslogWriter.java.	2018-04-05 17:54:47 -04:00
David Turner	fb1aba9389	Improve NodeVersionAllocationDecider messages (#29356 ) Since #26542 the NodeVersionAllocationDecider tries to explain its NO decisions as follows: ... may not support codecs or postings formats for a newer Lucene version However, this message often appears during a rolling upgrade, and experience has shown that it seems to cause more confusion and worry than it needs to. This change fixes that by removing the explanation again, reducing the message to a statement of fact about the respective nodes' versions. Additionally, the same wording was used for version incompatibilities when allocating a primary (vs its previous location) and a replica (vs its primary). This change separates these two cases so they can have separate, clearer wording. Fixes #29228	2018-04-05 15:13:48 +01:00
Alan Woodward	dccd43af47	Upgrade to lucene 7.3.0 (#29387 )	2018-04-05 10:34:44 +01:00
Igor Motov	2c20f7a164	Allow using distance measure in the geo context precision (#29273 ) Adds support for distance measure, such as "4km", "5m" in the precision field of the geo location context in context suggesters. Fixes #24807	2018-04-04 17:39:30 -04:00
Jim Ferenczi	644e5ea97a	Fixed quote_field_suffix in query_string (#29332 ) This change fixes the handling of the `quote_field_suffix` option on `query_string` query. The expansion was not applied to default fields query. Closes #29324	2018-04-04 17:29:09 +02:00
Luca Cavanna	25d411eb32	Remove undocumented action.master.force_local setting (#29351 ) `action.master.force_local` was only ever used internally and never documented. It was one of those settings that were automatically added to a tribe node, to make sure that cluster state read operations would work locally rather than failing when trying to forward the request to the master (as the tribe node never had a master). Given that we recently removed the tribe node, we can also remove this setting.	2018-04-04 14:50:23 +02:00
Jason Tedor	c95e7539e7	Enhance error for out of bounds byte size settings (#29338 ) Today when you input a byte size setting that is out of bounds for the setting, you get an error message that indicates the maximum value of the setting. The problem is that because we use ByteSize#toString, we end up with a representation of the value that does not really tell you what the bound is. For example, if the bound is 2^31 - 1 bytes, the output would be 1.9gb which does not really tell you what the limit is, as there are many byte size values that we format to the same 1.9gb with ByteSize#toString. We have a method ByteSize#getStringRep that uses the input units to the value as the output units for the string representation, so we end up with no loss if we use this to report the bound. This commit does this.	2018-04-04 07:22:13 -04:00
Stéphane Campinas	38a651e5f1	[Docs] Correct javadoc of GetIndexRequest (#29364 )	2018-04-04 12:11:29 +02:00
Yannick Welsch	1891d4f83d	Check presence of multi-types before validating new mapping (#29316 ) Before doing any kind of validation on a new mapping, we should first do the multi-type validation in order to provide better error messages. For #29313, this means that the exception message will be Rejecting mapping update to [range_index_new] as the final mapping would have more than 1 type: [_doc, mytype] instead of [expected_attendees] is defined as an object in mapping [mytype] but this name is already used for a field in other types	2018-04-04 10:26:50 +01:00
Jason Tedor	8fdca6a89a	Align cat thread pool info to thread pool config (#29195 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size A previous change addressed this for the nodes info API. This commit changes the display of thread pool info in the cat thread pool API too to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-04-03 17:27:26 -04:00
Nhat Nguyen	8e2f2be249	Track Lucene operations in engine explicitly (#29357 ) Today we rely on `IndexWriter#hasDeletions` to check if an index contains "update" operations. However, this check considers both deletes and updates. This commit replaces that check by tracking and checking Lucene operations explicitly. This would provide us stronger assertions.	2018-04-03 16:45:53 -04:00
Uwe Schindler	7c6d5cbf1f	Build: Fix Java9 MR build (#29312 ) Correctly set up classpath/dependencies and fix checkstyle task that was partly broken because of delayed setup of Java9 sourcesets. This also cleans packaging of META-INF. It also prepares forbiddenapis 2.6 upgrade. Relates #29292	2018-04-03 10:22:12 -07:00
Adrien Grand	569d0c0e89	Improve similarity integration. (#29187 ) This improves the way similarities are plugged in in order to: - reject the classic similarity on 7.x indices and emit a deprecation warning otherwise - reject unknown parameters on 7.x indices and emit a deprecation warning otherwise Even though this breaks the plugin API, I'd like to backport to 7.x so that users can get deprecation warnings when they are doing something that will become unsupported in the future. Closes #23208 Closes #29035	2018-04-03 16:45:25 +02:00
Lee Hinman	db8ed36436	Move Nullable into core (#29341 ) This moves the `Nullable` annotation into the elasticsearch-core project, so it may be used without relying entirely on the server jar. This will allow us to decouple more pieces to make them smaller. In addition, there were two different `Nullable` annotations, these have all been moved to the ES version rather than the inject version.	2018-04-03 07:57:21 -06:00
Adrien Grand	befa66ae35	Elasticsearch 6.3.0 is now on Lucene 7.3.	2018-04-03 14:21:16 +02:00
Yannick Welsch	d4538df893	Improve exception handling on TransportMasterNodeAction (#29314 ) We have seen exceptions bubble up to the uncaught exception handler. Checking the blocks can lead for example to IndexNotFoundException when the indices are resolved. In order to make TransportMasterNodeAction more resilient against such expected exceptions, this code change wraps the execution of doStart() into a try catch and informs the listener in case of failures.	2018-04-03 11:57:58 +02:00
Yannick Welsch	2dc546ccec	Don't break allocation if resize source index is missing (#29311 ) DiskThresholdDecider currently assumes that the source index of a resize operation (e.g. shrink) is available, and throws an IndexNotFoundException otherwise, thereby breaking any kind of shard allocation. This can be quite harmful if the source index is deleted during a shrink, or if the source index is unavailable during state recovery. While this behavior has been partly fixed in 6.1 and above (due to #26931), it relies on the order in which AllocationDeciders are executed (i.e. that ResizeAllocationDecider returns NO, ensuring that DiskThresholdDecider does not run, something that for example does not hold for the allocation explain API). This change adds a more complete fix, and also solves the situation for 5.6.	2018-04-03 11:51:06 +02:00
rationull	0028563aac	Pass through script params in scripted metric agg (#29154 ) * Pass script level params into scripted metric aggs (#28819) Now params that are passed at the script level and at the aggregation level are merged and can both be used in the aggregation scripts. If there are any conflicts, aggregation level params will win. This may be followed by another change detecting that case and throwing an exception to disallow such conflicts. * Disallow duplicate parameter names between scripted agg and script (#28819) If a scripted metric aggregation has aggregation params and script params which have the same name, throw an IllegalArgumentException when merging the parameter lists.	2018-04-03 09:57:49 +01:00
Adrien Grand	3bdfc8f3fb	Upgrade to lucene-7.3.0-snapshot-98a6b3d. (#29298 ) Most notable changes include: - this release doesn't have the 7.2.1 version constant so I had to create one - spatial4j and jts were upgraded	2018-04-03 09:27:14 +02:00
Jason Tedor	1df43a09b7	Remove HTTP max content length leniency (#29337 ) I am not sure why we have this leniency for HTTP max content length, it has been there since the beginning (`5ac51ee93f`) with no explanation of its source. That said, our philosophy today is different than the philosophy of the past where Elasticsearch would be quite lenient in its handling of settings and today we aim for predictability for both users and us. This commit removes leniency in the parsing of http.max_content_length.	2018-04-02 20:20:01 -04:00
Lee Hinman	6b2167f462	Begin moving XContent to a separate lib/artifact (#29300 ) * Begin moving XContent to a separate lib/artifact This commit moves a large portion of the XContent code from the `server` project to the `libs/xcontent` project. For the pieces that have been moved, some helpers have been duplicated to allow them to be decoupled from ES helper classes. In addition, `Booleans` and `CheckedFunction` have been moved to the `elasticsearch-core` project. This decoupling is a move so that we can eventually make things like the high-level REST client not rely on the entire ES jar, only the parts it needs. There are some pieces that are still not decoupled, in particular some of the XContent tests still remain in the server project, this is because they test a large portion of the pluggable xcontent pieces through `XContentElasticsearchException`. They may be decoupled in future work. Additionally, there may be more pieces that we want to move to the xcontent lib in the future that are not part of this PR, this is a starting point. Relates to #28504	2018-04-02 15:58:31 -06:00
David Turner	3be960d1c2	Minor cleanup in the InternalEngine (#29241 ) Fix a couple of minor things in the InternalEngine: * Rename loadOrGenerateHistoryUUID to reflect that it always generates a UUID * Move .acquire() call next to the associated try {} block.	2018-04-02 10:07:28 +01:00
Mayya Sharipova	e70cd35bda	Revert "REST high-level client: add support for Indices Update Settings API (#28892 )" (#29323 ) This reverts commit `b67b5b1bbd`.	2018-03-30 16:26:46 -07:00
Andy Bristol	b7e6fb9ac5	[test] remove Streamable serde assertions (#29307 ) Removes a set of assertions in the test framework that verified that Streamable objects could be serialized and deserialized across different versions. When this was discussed the consensus was that this approach has not caught many bugs in a long time and that serialization testing of objects was best left to their respective unit and integration tests. This commit also removes a transport interceptor that was used in ESIntegTestCase tests to make these assertions about objects coming in or off the wire.	2018-03-30 14:09:26 -07:00
javanna	bcc9cbfba7	Resolve unchecked cast warnings introduced with #28892	2018-03-30 10:58:40 +02:00
olcbean	b67b5b1bbd	REST high-level client: add support for Indices Update Settings API (#28892 ) Relates to #27205	2018-03-30 10:53:29 +02:00
Ryan Ernst	54f8f819ef	Search: Validate script query is run with a single script (#29304 ) The parsing code for script query currently silently skips any tokens it does not know about within its parsing loop. The only token it does not catch is an array, which means passing multiple scripts in via an array will cause only the last script to be parsed, silently dropping the others. This commit adds validation that arrays are not seen while parsing.	2018-03-29 22:10:03 -07:00
Nhat Nguyen	04dd738782	TEST: trim unsafe commits before opening engine Since #29260, unsafe commits must be trimmed before opening an engine. This makes the engine constructor follow Lucene standard semantics and use the last commit. However, we haven't fully applied this change in some tests. Relates #29260	2018-03-29 14:25:42 -04:00
Boaz Leskes	eb8b31746a	Move trimming unsafe commits from engine ctor to store (#29260 ) As follow up to #28245 , this PR removes the logic for selecting the right start commit from the Engine constructor in favor of explicitly trimming them in the Store, before the engine is opened. This makes the constructor in engine follow standard Lucene semantics and use the last commit. Relates #28245 Relates #29156	2018-03-29 13:35:57 -04:00
Igor Motov	04d0edc8ee	Fix incorrect geohash for lat 90, lon 180 (#29256 ) Due to special treatment for the 0xFFFFFF... value in GeoHashUtils' encodeLatLon method, the hashcode for lat 90, lon 180 is incorrectly encoded as `"000000000000"` instead of "zzzzzzzzzzzz". This commit removes the special treatment and fixes the issue. Closes #22163	2018-03-29 09:23:43 -04:00
Tanguy Leroux	b6568d0cfd	Do not load global state when deleting a snapshot (#29278 ) When deleting a snapshot, it is not necessary to load and to parse the global metadata of the snapshot to delete. Now indices are stored in the snapshot metadata file, we have all the information to resolve the shards files to delete. This commit removes the readSnapshotMetaData() method that was used to load both global and index metadata files. Test coverage should be enough as SharedClusterSnapshotRestoreIT already contains several deletion tests. Related to #28934	2018-03-29 09:16:53 +02:00
Nhat Nguyen	9bc167466f	TEST: add log testDoNotRenewSyncedFlushWhenAllSealed This test failed recently. This commit enables debug log and prints out seals. https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=oraclelinux/2234/console https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.x+intake/1437/console	2018-03-28 22:05:00 -04:00
Jason Tedor	4ef3de40bc	Fix handling of bad requests (#29249 ) Today we have a few problems with how we handle bad requests: - handling requests with bad encoding - handling requests with invalid value for filter_path/pretty/human - handling requests with a garbage Content-Type header There are two problems: - in every case, we give an empty response to the client - in most cases, we leak the byte buffer backing the request! These problems are caused by a broader problem: poor handling preparing the request for handling, or the channel to write to when the response is ready. This commit addresses these issues by taking a unified approach to all of them that ensures that: - we respond to the client with the exception that blew us up - we do not leak the byte buffer backing the request	2018-03-28 16:25:01 -04:00
Simon Willnauer	13e19e7428	Allow _update and upsert to read from the transaction log (#29264 ) We historically removed reading from the transaction log to get consistent results from _GET calls. There was also the motivation that the read-modify-update principle we apply should not be hidden from the user. We still agree on the fact that we should not hide these aspects but the impact on updates is quite significant especially if the same document is updated before it's written to disk and made searchable. This change adds back the ability to read from the transaction log but only for update calls. Calls to the _GET API will always do a refresh if necessary to return consistent results, i.e. if stored fields or DocValues Fields are requested. Closes #26802	2018-03-28 18:03:34 +02:00
Christoph Büscher	27e45fc552	Remove IndicesOptions bwc serialization layer (#29281 ) On master we don't need to talk to pre-6.0 nodes anymore.	2018-03-28 16:19:45 +02:00
Luca Cavanna	245dd73156	Bulk processor#awaitClose to close scheduler (#29263 ) When the `BulkProcessor` is used with the high-level REST client, a scheduler is internally created that allows to schedule tasks. Such scheduler is not exposed to users and needs to be closed once the `BulkProcessor` is closed. There are two ways to close the `BulkProcessor` though, one is the ordinary `close` method and the other one is `awaitClose`. The former closes the scheduler while the latter doesn't, leaving threads lingering.	2018-03-28 16:09:18 +02:00
Yannick Welsch	cacf759213	Remove RELOCATED index shard state (#29246 ) as this information is already covered by ReplicationTracker.primaryMode.	2018-03-28 12:25:46 +02:00
Robin Neatherway	ea8e3661d0	Fix a type check that is always false (#27726 ) DocumentParser: The checks for Text and Keyword were masked by the earlier check for String, which they are child classes of. As String field types are no longer supported, this check can be removed.	2018-03-28 10:20:20 +02:00
Tanguy Leroux	36f8531bf4	Don't load global state when only restoring indices (#29239 ) Restoring a snapshot, or getting the status of finished snapshots, currently always loads the global state metadata file from the repository even if it is not required. This slows down the restore process (or listing statuses process) and can also be an issue if the global state cannot be deserialized (because it has unknown customs for example). This commit splits the Repository.getSnapshotMetadata() method into two distinct methods: getGlobalMetadata() and getIndexMetadata() that are now called only when needed.	2018-03-28 09:35:05 +02:00
Lee Hinman	eebda6974d	Decouple NamedXContentRegistry from ElasticsearchException (#29253 ) * Decouple NamedXContentRegistry from ElasticsearchException This commit decouples `NamedXContentRegistry` from using either `ElasticsearchException`, `ParsingException`, or `UnknownNamedObjectException`. This will allow us to move NamedXContentRegistry to its own lib as part of the xcontent extraction work. Relates to #28504	2018-03-27 16:51:31 -06:00
Lee Hinman	7df66abaf5	[TEST] Fix issue with HttpInfo passed invalid parameter HttpInfo is passed the maxContentLength as a parameter, but this value should never be negative. This fixes the test to only pass a positive random value.	2018-03-27 14:20:06 -06:00
Lee Hinman	b4c78019b0	Remove all dependencies from XContentBuilder (#29225 ) * Remove all dependencies from XContentBuilder This commit removes all of the non-JDK dependencies from XContentBuilder, with the exception of `CollectionUtils.ensureNoSelfReferences`. It adds a third extension point around dealing with time-based fields and formatters to work around the Joda dependency. This decoupling allows us to be able to move XContentBuilder to a separate lib so it can be available for things like the high level rest client. Relates to #28504	2018-03-27 12:58:22 -06:00
Jim Ferenczi	3db6f1c9d5	Fix sporadic failure in CompositeValuesCollectorQueueTests This commit fixes a test bug that causes an NPE on empty segments. Closes #29269	2018-03-27 20:11:21 +02:00
Jim Ferenczi	2aaa057387	Propagate ignore_unmapped to inner_hits (#29261 ) In 5.2 `ignore_unmapped` was added to `inner_hits` in order to ignore invalid mapping. This value was automatically set to the value defined in the parent query (`nested`, `has_child`, `has_parent`) but the refactoring of the parent/child in 5.6 removed this behavior unintentionally. This commit restores this behavior but also makes sure that we always automatically enforce this value when the query builder is used directly (previously this was only done by the XContent deserialization). Closes #29071	2018-03-27 18:55:42 +02:00
Nhat Nguyen	dfc9e721d8	TEST: Increase timeout for testPrimaryReplicaResyncFailed The default timeout (e.g. 10 seconds) may not be enough for CI to re-allocate shards after the partition is healed. This commit increases the timeout to 30 seconds and enables logging in order to have more detailed information in case this test fails again. Closes #29060	2018-03-27 12:18:09 -04:00
Nhat Nguyen	d1d3edf156	TEST: Use different translog dir for a new engine In #testPruneOnlyDeletesAtMostLocalCheckpoint, we create a new engine but mistakenly use the same translog directory of the existing engine. This prevents translog files from cleaning up when closing the engines. ERROR 0.12s J2 \| InternalEngineTests.testPruneOnlyDeletesAtMostLocalCheckpoint <<< FAILURES! > Throwable #1: java.io.IOException: could not remove the following files (in the order of attempts): > translog-primary-060/translog-2.tlog: java.io.IOException: access denied: This commit makes sure to use a separate directory for each engine in this test.	2018-03-27 09:45:51 -04:00
Christoph Büscher	8d6832c5ee	Make SearchStats implement Writeable (#29258 ) Moves another class over from Streamable to Writeable. By this, also some constructors can be removed or made private.	2018-03-27 15:21:11 +02:00
Nhat Nguyen	0ac89a32cc	Do not optimize append-only if seen normal op with higher seqno (#28787 ) When processing an append-only operation, primary knows that operations can only conflict with another instance of the same operation. This is true as the id was freshly generated. However this property doesn't hold for replicas. As soon as an auto-generated ID was indexed into the primary, it can be exposed to a search and users can issue a follow up operation on it. In extremely rare cases, the follow up operation can arrive and be processed on a replica before the original append-only request. In this case we can't simply proceed with the append-only request and blindly add it to the index without consulting the version map. The following scenario can cause difference between primary and replica. 1. Primary indexes an auto-gen-id doc. (id=X, v=1, s#=20) 2. A refresh cycle happens on primary 3. The new doc is picked up and modified - say by a delete by query request - Primary gets a delete doc (id=X, v=2, s#=30) 4. Delete doc is processed first on the replica (id=X, v=2, s#=30) 5. Indexing operation arrives on the replica, since it's an auto-gen-id request and the retry marker is lower, we put it into lucene without any check. Replica has a doc the primary doesn't have. To deal with a potential conflict between an append-only operation and a normal operation on replicas, we need to rely on sequence numbers. This commit maintains the max seqno of non-append-only operations on replica and applies the optimization to an append-only operation only if its seq# is higher than the seq# of all non-append-only operations.	2018-03-26 16:56:12 -04:00
Nhat Nguyen	87957603c0	Prune only gc deletes below local checkpoint (#28790 ) Once a document is deleted and Lucene is refreshed, we will not be able to look up the `version/seq#` associated with that delete in Lucene. As conflicting operations can still be indexed, we need another mechanism to remember these deletes. Therefore deletes should still be stored in the Version Map, even after Lucene is refreshed. Obviously, we can't remember all deletes forever so a trimming mechanism is needed. Currently, we remember deletes for at least 1 minute (the default GC deletes cycle) and clean them periodically. This is, at the moment, the best we can do on the primary for user facing APIs but this arbitrary time limit is problematic for replicas. Furthermore, we can't rely on the primary and replicas doing the trimming in a synchronized manner, and failing to do so results in the replica and primary making different decisions. The following scenario can cause inconsistency between primary and replica. 1. Primary indexes doc (index, id=1, v2) 2. Network packet issue causes index operation to back off and wait 3. Primary deletes doc (delete, id=1, v3) 4. Replica processes delete (delete, id=1, v3) 5. 1+ minute passes (GC deletes runs on replica) 6. Indexing op is finally sent to the replica which now processes it because it forgot about the delete. We can rely on sequence-numbers to prevent this issue. If we prune only deletes whose seqno is at most the local checkpoint, a replica will correctly remember what it needs. The correctness is explained as follows: Suppose o1 and o2 are two operations on the same document with seq#(o1) < seq#(o2), and o2 arrives before o1 on the replica. o2 is processed normally since it arrives first; when o1 arrives it should be discarded: 1. If seq#(o1) <= LCP, then it will not be added to Lucene, as it was already previously added. 2. If seq#(o1) > LCP, then it depends on the nature of o2: - If o2 is a delete then its seq# is recorded in the VersionMap, since seq#(o2) > seq#(o1) > LCP, so a lookup can find it and determine that o1 is stale. - If o2 is an indexing then its seq# is either in Lucene (if refreshed) or the VersionMap (if not refreshed yet), so a real-time lookup can find it and determine that o1 is stale. In this PR, we prefer to deploy a single trimming strategy, which satisfies both requirements, on primary and replicas because: - It's simpler - no need to distinguish if an engine is running at primary mode or replica mode or being promoted. - If a replica subsequently is promoted, user experience is fully maintained as that replica remembers deletes for the last GC cycle. However, the version map may consume less memory if we deploy two different trimming strategies for primary and replicas.	2018-03-26 13:42:08 -04:00
Boaz Leskes	bca264699a	remove testUnassignedShardAndEmptyNodesInRoutingTable testUnassignedShardAndEmptyNodesInRoutingTable is as old as time and does a very bogus thing. It is an IT test which extracts the GatewayAllocator from the node and tells it to allocate unassigned shards, while giving it a conjured cluster state with no nodes in it (it uses DiscoveryNodes.EMPTY_NODES). This is never a cluster state we want to reroute on (we always have at least a master node in it). I'm going to just delete the test as I don't think it adds much value. Closes #21463	2018-03-26 17:10:57 +02:00
Boaz Leskes	f5d4550e93	Fold EngineDiskUtils into Store, for better lock semantics (#29156 ) #28245 has introduced the utility class `EngineDiskUtils` with a set of methods to prepare/change translog and lucene commit points. That util class bundled everything that's needed to create an empty shard, bootstrap a shard from a lucene index that was just restored etc. In order to safely do these manipulations, the util methods acquired the IndexWriter's lock. That would sometimes fail due to concurrent shard store fetching or other short activities that require the files not to be changed while they read from them. Since there is no way to wait on the index writer lock, the `Store` class has other locks to make sure that once we try to acquire the IW lock, it will succeed. To sidestep this waiting problem, this PR folds `EngineDiskUtils` into `Store`. Sadly this comes with a price - the store class doesn't and shouldn't know about the translog. As such the logic is slightly less tight and callers have to do the translog manipulations on their own.	2018-03-26 14:08:03 +02:00
Christoph Büscher	318b0af953	Remove execute mode bit from source files Some source files seem to have the execute bit (a+x) set, which doesn't really seem to hurt but is a bit odd. This change removes those, making the permissions similar to other source files in the repository.	2018-03-26 13:37:55 +02:00
Jim Ferenczi	5288235ca3	Optimize the composite aggregation for match_all and range queries (#28745 ) This change refactors the composite aggregation to add an execution mode that visits documents in the order of the values present in the leading source of the composite definition. This mode does not need to visit all documents since it can early terminate the collection when the leading source value is greater than the lowest value in the queue. Instead of collecting the documents in the order of their doc_id, this mode uses the inverted lists (or the bkd tree for numerics) to collect documents in the order of the values present in the leading source. For instance the following aggregation: ``` "composite" : { "sources" : [ { "value1": { "terms" : { "field": "timestamp", "order": "asc" } } } ], "size": 10 } ``` ... can use the field `timestamp` to collect the documents with the 10 lowest values for the field instead of visiting all documents. For composite aggregation with more than one source the execution can early terminate as soon as one of the 10 lowest values produces enough composite buckets. For instance if visiting the first two lowest timestamp created 10 composite buckets we can early terminate the collection since it is guaranteed that the third lowest timestamp cannot create a composite key that compares lower than the one already visited. This mode can execute iff: * The leading source in the composite definition uses an indexed field of type `date` (works also with `date_histogram` source), `integer`, `long` or `keyword`. * The query is a match_all query or a range query over the field that is used as the leading source in the composite definition. * The sort order of the leading source is the natural order (ascending since postings and numerics are sorted in ascending order only). If these conditions are not met this aggregation visits each document like any other agg.	2018-03-26 09:51:37 +02:00
Nicholas Knize	fede633563	Add Z value support to geo_shape This enhancement adds Z value support (source only) to geo_shape fields. If vertices are provided with a third dimension, the third dimension is ignored for indexing but returned as part of source. Like before, any values greater than the 3rd dimension are ignored. closes #23747	2018-03-23 08:50:55 -05:00
Nhat Nguyen	794de63232	Remove type casts in logging in server component (#28807 ) This commit removes type-casts in logging in the server component (other components will be done later). This also adds a parameterized message test which would catch breaking-changes related to lambdas in Log4J.	2018-03-23 07:35:50 -04:00
Yu	4a8099c696	Change BroadcastResponse from ToXContentFragment to ToXContentObject (#28878 ) While working on #27799, we find that it might make sense to change BroadcastResponse from ToXContentFragment to ToXContentObject, seeing that it's rather a complete XContent object and also the other Responses are normally ToXContentObject. By doing this, we can also move the XContent build logic of BroadcastResponse's subclasses, from Rest Layer to the concrete classes themselves. Relates to #3889	2018-03-23 10:53:37 +01:00
Milan Chovatiya	8328b9c5cd	REST : Split `RestUpgradeAction` into two actions (#29124 ) Closes #29062	2018-03-23 10:37:31 +01:00
Nhat Nguyen	14157c8705	Harden periodically check to avoid endless flush loop (#29125 ) In #28350, we fixed an endless flushing loop which may happen on replicas by tightening the relation between the flush action and the periodically flush condition. 1. The periodically flush condition is enabled only if it is disabled after a flush. 2. If the periodically flush condition is enabled then a flush will actually happen regardless of Lucene state. (1) and (2) guarantee that a flushing loop will be terminated. Sadly, the condition 1 can be violated in edge cases as we used two different algorithms to evaluate the current and future uncommitted translog size. - We use method `uncommittedSizeInBytes` to calculate current uncommitted size. It is the sum of translogs whose generation is at least the minGen (determined by a given seqno). We pick a continuous range of translogs since the minGen to evaluate the current uncommitted size. - We use method `sizeOfGensAboveSeqNoInBytes` to calculate the future uncommitted size. It is the sum of translogs whose maxSeqNo is at least the given seqNo. Here we don't pick a range but select translog one by one. Suppose we have 3 translogs `gen1={#1,#2}, gen2={}, gen3={#3} and seqno=#1`, `uncommittedSizeInBytes` is the sum of gen1, gen2, and gen3 while `sizeOfGensAboveSeqNoInBytes` is the sum of gen1 and gen3. Gen2 is excluded because its maxSeqno is still -1. This commit removes both `sizeOfGensAboveSeqNoInBytes` and `uncommittedSizeInBytes` methods, then enforces an engine to use only `sizeInBytesByMinGen` method to evaluate the periodically flush condition. Closes #29097 Relates #28350	2018-03-22 14:31:15 -04:00
Jim Ferenczi	c93c7f3121	Remove deprecated options for query_string (#29203 ) This commit removes some parameters deprecated in 6.x (or 5.x): `use_dismax`, `split_on_whitespace`, `all_fields` and `lowercase_expanded_terms`. Closes #25551	2018-03-22 18:37:08 +01:00
Yu	24c8d8f5ef	REST high-level client: add force merge API (#28896 ) Relates to #27205	2018-03-22 17:17:16 +01:00
Lee Hinman	7d1de890b8	Decouple more classes from XContentBuilder and make builder strict (#29197 ) This commit decouples `BytesRef`, `Releaseable`, and `TimeValue` from XContentBuilder, and paves the way for decoupling `ByteSizeValue` as well. It moves much of the Lucene and Joda encoding into a new SPI extension that is loaded by XContentBuilder to know how to encode these values. Part of doing this also allows us to make JSON encoding strict, as we no longer allow just any old object to be passed (in the past it was possible to get json that was `"field": "java.lang.Object@d8355a8"` if no one was careful about what was passed in). Relates to #28504	2018-03-22 08:18:55 -06:00
Christoph Büscher	d6d3fb3c73	Use EnumMap in ClusterBlocks (#29112 ) By using EnumMap instead of an ImmutableLevelHolder array we can avoid using enum ordinals to index into the array.	2018-03-22 11:14:24 +01:00
Tanguy Leroux	edf27a599e	Add new setting to disable persistent tasks allocations (#29137 ) This commit adds a new setting `cluster.persistent_tasks.allocation.enable` that can be used to enable or disable the allocation of persistent tasks. The setting accepts the values `all` (default) or `none`. When set to `none`, the persistent tasks that are created (or that must be reassigned) won't be assigned to a node but will reside in the cluster state with no "executor node" and a reason describing why they are not assigned: ``` "assignment" : { "executor_node" : null, "explanation" : "persistent task [foo/bar] cannot be assigned [no persistent task assignments are allowed due to cluster settings]" } ```	2018-03-22 09:18:07 +01:00
Nhat Nguyen	7d44d75774	Adjust PreSyncedFlushResponse bwc versions We discussed and agreed to include the synced-flush change in 6.3.0+ but not in 5.6.9. We will re-evaluate the urgency and importance of the issue, then decide which versions the change should be included in.	2018-03-21 16:50:35 -04:00
markharwood	93ff973afc	Tests - fix incorrect test assumption that zero-doc buckets will be returned by the adjacency matrix aggregation. Closes #29159 (#29167 )	2018-03-21 10:42:14 +00:00
Jason Tedor	2f6c77337e	Remove 6.1.5 version constant The assumption here is that we will no longer be making a release from the 6.1 branch. Since we assume that all versions on this branch are actually released, we do not want to leave behind any versions that would require a snapshot build. We do have a test that verifies that all released versions are present here, so if another release is performed from the 6.1 branch, that test will fail and we will know to add the version constant at that time.	2018-03-21 06:28:17 -04:00
Adrien Grand	8f9d2ee4e2	Reject updates to the `_default_` mapping. (#29165 ) This will reject mapping updates to the `_default_` mapping with 7.x indices and still emit a deprecation warning with 6.x indices. Relates #15613 Supersedes #28248	2018-03-21 10:44:11 +01:00
Nhat Nguyen	f938c4267e	Fix BWC issue for PreSyncedFlushResponse I misunderstood how the bwc versions work. If we backport to 5.x, we need to backport to all supported 6.*. This commit corrects the BWC versions for PreSyncedFlushResponse. Relates #29103	2018-03-20 13:56:15 -04:00
Lee Hinman	b4af451ec5	Remove BytesArray and BytesReference usage from XContentFactory (#29151 ) * Remove BytesArray and BytesReference usage from XContentFactory This removes the usage of `BytesArray` and `BytesReference` from `XContentFactory`. Instead, a regular `byte[]` should be passed. To assist with this a helper has been added to `XContentHelper` that will preserve the offset and length from the underlying BytesReference. This is part of ongoing work to separate the XContent parts from ES so they can be factored into their own jar. Relates to #28504	2018-03-20 11:52:26 -06:00
Lee Hinman	4bd217c94f	Add pluggable XContentBuilder writers and human readable writers (#29120 ) * Add pluggable XContentBuilder writers and human readable writers This adds the ability to use SPI to plug in writers for XContentBuilder. By implementing the XContentBuilderProvider class we can allow Elasticsearch to plug in different ways to encode types to JSON. Important caveat for this, we should always try to have the class implement `ToXContentFragment` first, however, in the case of classes from our dependencies (think Joda classes or Lucene classes) we need a way to specify writers for these classes. This also makes the human-readable field writers generic and pluggable, so that we no longer need to tie XContentBuilder to things like `TimeValue` and `ByteSizeValue`. Contained as part of this moves all the TimeValue human readable fields to the new `humanReadableField` method. A future commit will move the `ByteSizeValue` calls over to this method. Relates to #28504	2018-03-20 11:39:24 -06:00
Christoph Büscher	701625b065	Add unreleased version 6.2.4 (#29171 )	2018-03-20 18:38:06 +01:00
Christoph Büscher	5a97fe75da	Add unreleased version 6.1.5 (#29168 )	2018-03-20 18:31:59 +01:00
Luca Cavanna	ff09c82319	REST high-level client: add clear cache API (#28866 ) * REST high-level client: add clear cache API Relates to #27205 Also Closes #26947 (rest-spec were outdated)	2018-03-20 10:39:36 +01:00
Lee Hinman	687577a516	Fix javadoc warning in Strings for missing parameter description Fixes a parameter in `Strings` that had a javadoc annotation but was missing the description, causing warnings in the build.	2018-03-19 12:28:15 -06:00
Lee Hinman	3025295f7e	Decouple Text and Geopoint from XContentBuilder (#29119 ) This removes the `Text` and `Geopoint` special handling from `XContentBuilder`. Instead, these classes now implement `ToXContentFragment` and render themselves accordingly. This allows us to further decouple XContentBuilder from Elasticsearch-specific classes so it can be factored into a standalone lib at a later time. Relates to #28504	2018-03-19 08:54:10 -06:00
Nik Everett	bf05c600c4	REST: Include suppressed exceptions on failures (#29115 ) This modifies xcontent serialization of Exceptions to contain suppressed exceptions. If there are any suppressed exceptions they are included in the exception response by default. The reasoning here is that they are fairly rare but when they exist they almost always add extra useful information. Take, for example, the response when you specify two broken ingest pipelines: ``` { "error" : { "root_cause" : ...snip... "type" : "parse_exception", "reason" : "[field] required property is missing", "header" : { "processor_type" : "set", "property_name" : "field" }, "suppressed" : [ { "type" : "parse_exception", "reason" : "[field] required property is missing", "header" : { "processor_type" : "convert", "property_name" : "field" } } ] }, "status" : 400 } ``` Moreover, when suppressed exceptions come from 500 level errors they should give us more useful debugging information. Closes #23392	2018-03-19 10:52:50 -04:00
Tanguy Leroux	0f93b7abdf	Fix compilation errors in ML integration tests After elastic/elasticsearch#29109, the `needsReassignment` method has been moved to the PersistentTasksClusterService. This commit fixes some compilation in tests I introduced.	2018-03-19 09:46:53 +01:00
Tanguy Leroux	b57bd695f2	Small code cleanups and refactorings in persistent tasks (#29109 ) This commit consists of small code cleanups and refactorings in the persistent tasks framework. Most changes are in PersistentTasksClusterService where some methods have been renamed or merged together, documentation has been added, unused code removed in order to improve readability of the code.	2018-03-19 09:26:17 +01:00
Nhat Nguyen	f1029aaad5	getMinGenerationForSeqNo should acquire read lock (#29126 ) The method Translog#getMinGenerationForSeqNo does not modify the current translog but only accesses it; it should therefore acquire the readLock instead of the writeLock.	2018-03-17 17:43:20 -04:00
Nhat Nguyen	c9749180a1	Backport - Do not renew sync-id PR to 5.6 and 6.3 Relates #29103	2018-03-17 11:38:22 -04:00
Jason Tedor	2e93a9158f	Align thread pool info to thread pool configuration (#29123 ) Today we report thread pool info using a common object. This means that we use a shared set of terminology that is not consistent with the terminology used to configure thread pools. This holds in particular for the minimum and maximum number of threads in the thread pool where we use the following terminology: thread pool info \| fixed \| scaling min core size max max size This commit changes the display of thread pool info to be dependent on the type of the thread pool so that we can align the terminology in the output of thread pool info with the terminology used to configure a thread pool.	2018-03-16 22:47:06 -04:00
Nhat Nguyen	22ad52a288	TEST: Adjust translog size assumption in new engine A new engine now can have more than one empty translog since #28676. This causes #testShouldPeriodicallyFlush to fail because in the test we assume an engine should have one empty translog. This commit takes into account the extra translog size of a new engine.	2018-03-16 21:50:31 -04:00
olcbean	47211c00e9	REST: Clear Indices Cache API simplify param parsing (#29111 ) Simplify the parsing of the params in Clear Indices Cache API, as a follow up to the removing of the deprecated parameter names.	2018-03-16 16:50:34 -04:00
Jason Tedor	4d62640bf1	Fix typo in ExceptionSerializationTests This commit fixes a little typo in ExceptionSerializationTests.java replacing "weas" by "was".	2018-03-16 15:52:39 -04:00
Jason Tedor	1f1a4d17b4	Remove BWC layer for rejected execution exception The serialization changes for rejected execution exceptions have been backported to 6.x with the intention to appear in all versions since 6.3.0. Therefore, this BWC layer is no longer needed in master since master would never speak to a node that does not speak the same serialization.	2018-03-16 14:40:17 -04:00
Jason Tedor	6bf742dd1b	Fix EsAbortPolicy to conform to API (#29075 ) The rejected execution handler API says that rejectedExecution(Runnable, ThreadPoolExecutor) throws a RejectedExecutionException if the task must be rejected due to capacity on the executor. We do throw something that smells like a RejectedExecutionException (it is named EsRejectedExecutionException) yet we violate the API because EsRejectedExecutionException is not a RejectedExecutionException. This has caused problems before where we try to catch RejectedExecution when invoking rejectedExecution but this causes EsRejectedExecutionException to go uncaught. This commit addresses this by modifying EsRejectedExecutionException to extend RejectedExecutionException.	2018-03-16 14:34:36 -04:00
David Turner	158bb23887	Remove usages of obsolete settings (#29087 ) The settings `indices.recovery.concurrent_streams` and `indices.recovery.concurrent_small_file_streams` were removed in `f5e4cd4616`. This commit removes their last traces from the codebase.	2018-03-16 15:35:40 +00:00
Nhat Nguyen	2c1ef3d4c6	Do not renew sync-id if all shards are sealed (#29103 ) Today the synced-flush always issues a new sync-id even though all shards haven't been changed since the last seal. This causes active shards to have a different sync-id from offline shards even though all were sealed and there have been no writes since then. This commit adjusts not to renew sync-id if all active shards are sealed with the same sync-id. Closes #27838	2018-03-16 11:16:30 -04:00
Adrien Grand	0755ff425f	Clarify requirements of strict date formats. (#29090 ) Closes #29014	2018-03-16 14:39:36 +01:00
Alan Woodward	a2d5cf6514	Compilation fix for #29067	2018-03-16 13:33:25 +00:00
Alan Woodward	986e518170	Store offsets in index prefix fields when stored in the parent field (#29067 ) The index prefix field is normally indexed as docs-only, given that it cannot be used in phrases. However, in the case that the parent field has been indexed with offsets, or has term-vector offsets, we should also store this in the index prefix field for highlighting. Note that this commit does not implement highlighting on prefix fields, but rather ensures that future work can implement this without a backwards-break in index data. Closes #28994	2018-03-16 11:39:46 +00:00
Tanguy Leroux	f14146982f	Use removeTask instead of finishTask in PersistentTasksClusterService (#29055 ) The method `PersistentTasksClusterService.finishTask()` has been modified since it was added and does not use any `removeOncompletion` flag anymore. Its behavior is now similar to `removeTask()` and can be replaced by this one. When a non existing task is removed, the cluster state update task will fail and its `source` will still indicate `finish persistent task`/`remove persistent task`.	2018-03-16 10:20:56 +01:00
Yogesh Gaikwad	a685784cea	CLI: Close subcommands in MultiCommand (#28954 ) * CLI Command: MultiCommand must close subcommands to release resources properly - Changes are done to override the close method and call close on subcommands using IOUtils#close - Unit Test Closes #28953	2018-03-16 09:59:23 +11:00
Nhat Nguyen	c75790e7c0	TEST: write ops should execute under shard permit (#28966 ) Currently ESIndexLevelReplicationTestCase executes write operations without acquiring index shard permit. This may prevent the primary term on replica from being updated or cause a race between resync and indexing on primary. This commit ensures that write operations are always executed under shard permit like the production code.	2018-03-15 14:42:15 -04:00
Mayya Sharipova	8cb3d18eac	Revert "Improve error message for installing plugin (#28298 )" This reverts commit `0cc1ffdf20`. The reason is that Windows tests are failing because of the incorrect path for the plugin	2018-03-15 10:47:50 -07:00
Adrien Grand	404e776a45	Validate regular expressions in dynamic templates. (#29013 ) Today you would only get these errors at index time. Relates #24749	2018-03-15 16:43:56 +01:00
Christoph Büscher	312ccc05d5	[Tests] Fix GetResultTests and DocumentFieldTests failures (#29083 ) Changes made in #28972 seem to have changed some assumptions about how SMILE and CBOR write byte[] values and how this is tested. This changes the generation of the randomized DocumentField values back to BytesArray while expecting the JSON and YAML deserialisation to produce Base64 encoded strings and SMILE and CBOR to parse back BytesArray instances. Closes #29080	2018-03-15 16:42:26 +01:00

1044 Commits