OpenSearch

Commit Graph

Author	SHA1	Message	Date
Ryan Ernst	72d818c304	Tests: Fix DateFormatter equals tests with locale (#34435 ) This commit removes randomization of locale for DateFormatter equals tests, instead using explicit locales. The test framework already randomizes locales, so the random choice of the second locale can sometimes be equal to the already chosen locale. Randomization also does not provide any extra protection, as the equality of DateFormatter does not implement equality of the locales itself. closes #34337	2018-10-14 23:54:49 +01:00
Jay Modi	6d99d7dafc	ListenableFuture should preserve ThreadContext (#34394 ) ListenableFuture may run a listener on the same thread that called the addListener method or it may execute on another thread after the future has completed. Whenever the ListenableFuture stores the listener for execution later, it should preserve the thread context which is what this change does.	2018-10-11 15:24:38 +01:00
Nhat Nguyen	33791ac27c	CCR: Following primary should process operations once (#34288 ) Today we rewrite the operations from the leader with the term of the following primary because the follower should own its history. The problem is that a newly promoted primary may re-assign its term to operations which were replicated to replicas before by the previous primary. If this happens, some operations with the same seq_no may be assigned different terms. This is not good for the future optimistic locking using a combination of seqno and term. This change ensures that the primary of a follower only processes an operation if that operation was not processed before. The skipped operations are guaranteed to be delivered to replicas via either primary-replica resync or peer-recovery. However, the primary must not acknowledge until the global checkpoint is at least the highest seqno of all skipped ops (i.e., they all have been processed on every replica). Relates #31751 Relates #31113	2018-10-10 15:39:57 -04:00
Simon Willnauer	34b935ae57	Improve `getRestHandlerWrapper` JavaDocs (#34376 ) Questions on how to work with `ActionPlugin#getRestHandlerWrapper()` come up in discuss forums all the time. This change adds an example to the javadoc how this method should/could be used.	2018-10-10 17:28:07 +01:00
Yannick Welsch	49cbcaff4f	Allow excluding folder names when scanning for dangling indices (#34349 ) ES is scanning for dangling indices on every cluster state update. For this, it lists the subfolders of the indices directory to determine which extra index directories exist on the node where there's no corresponding index in the cluster state. These are potential targets for dangling index import. On certain machine types, and with large number of indices, this subfolder listing can be horribly slow. This means that every cluster state update will be slowed down by potentially hundreds of milliseconds. One of the reasons for this poor performance is that Files.isDirectory() is a relatively expensive call on some OS and JDK versions. There is no need though to do all these isDirectory calls for folders which we know we are going to discard anyhow in the next step of the dangling indices logic. This commit allows adding an exclusion predicate to the availableIndexFolders methods which can dramatically speed up this method when scanning for dangling indices.	2018-10-08 15:35:50 +02:00
Nik Everett	06993e0c35	Logging: Make ESLoggerFactory package private (#34199 ) Since all calls to `ESLoggerFactory` outside of the logging package were deprecated, it seemed like it'd simplify things to migrate all of the deprecated calls and declare `ESLoggerFactory` to be package private. This does that.	2018-10-06 09:54:08 -04:00
Daniel Mitterdorfer	7d826916b9	Adjust size of BigArrays in circuit breaker test With this commit we restore the previous behavior in `BigArraysTests#testMaxSizeExceededOnResize` but lower the sizes that are tested to the range between 256 bytes to 16 kB so the test does not produce a whole lot of garbage. The previous attempt to reduce the amount of garbage produced by that test was to properly size the array initially but it failed to account for object alignment which lead to test failures in some cases. While it would be possible to account for object alignment, we would need to open up BigArrays or directly use the underlying Lucene API which would require us to allocate an array upfront only to find its size (incl. object alignment). Instead we have fixed this issue by conservatively sizing the array initially (so the initial allocation will never trip the circuit breaker) and reduce garbage by reducing the circuit breaker's upper bound as described previously. Closes #33750 Relates #34325	2018-10-05 15:39:08 +02:00
Jim Ferenczi	5c7b52e930	Adapt bwc version after backport Relates #33587	2018-10-05 13:07:39 +02:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
Dimitris Athanasiou	4dacfa95d2	[ML] Allow asynchronous job deletion (#34058 ) This changes the delete job API by adding the choice to delete a job asynchronously. The commit adds a `wait_for_completion` parameter to the delete job request. When set to `false`, the action returns immediately and the response contains the task id. This also changes the handling of subsequent delete requests for a job that is already being deleted. It now uses the task framework to check if the job is being deleted instead of the cluster state. This is a beneficial for it is going to also be working once the job configs are moved out of the cluster state and into an index. Also, force delete requests that are waiting for the job to be deleted will not proceed with the deletion if the first task fails. This will prevent overloading the cluster. Instead, the failure is communicated better via notifications so that the user may retry. Finally, this makes the `deleting` property of the job visible (also it was renamed from `deleted`). This allows a client to render a deleting job differently. Closes #32836	2018-10-05 02:41:28 +03:00
Nik Everett	09aaed4fe4	Tasks: Document that status is not semvered (#34270 ) The `status` part of the tasks API reflects the internal status of a running task. In general, we do not make backwards breaking changes to the `status` but because it is internal we reserve the right to do so. I suspect we will very rarely excercise that right but it is important that we have it so we're not boxed into any particular implementation for a request. In some sense this is policy making by documentation change. In another it is clarification of the way we've always thought of this field. I also reflect the documentation change into the Javadoc in a few places. There I acknowledge Kibana's "special relationship" with Elasticsearch. Kibana parses `_reindex`'s `status` field and, because we're friends with those folks, we should talk to them before we make backwards breaking changes to it. We want to be friends with everyone but there is only so much time in the day and we don't want to make backwards breaking fields to `status` at all anyway. So we hope that breaking changes documentation should be enough for other folks. Relates to #34245.	2018-10-04 14:42:37 -04:00
Vladimir Dolzhenko	dcfe64e0e4	[CI] Fix bogus ScheduleWithFixedDelayTests.testRunnableRunsAtMostOnceAfterCancellation Closes #34004	2018-10-04 16:31:56 +02:00
Armin Braun	3ccfc3de58	SCRIPTING: Terms set query expression (#33856 ) * SCRIPTING: Add Expr. Compile for TermSetQuery Ctx. * Follow up to #33602 adding the ability to compile TermsSetQuery scripts with the expressions engine in the same way we support SearchScript in Expressions * Duplicated the code here for now to make the change less complex, the only difference to SearchScript is that `_score` and `_value` are not handled for TermsSetQuery * remove redundant check	2018-10-04 16:03:57 +02:00
Nik Everett	ab8a5563f2	Logging: Drop remaining Settings log ctor (#34149 ) Drops the last logging constructor that takes `Settings` because it is no longer needed. Watcher goes through a lot of effort to pass `Settings` to `Logger` constructors and dropping `Settings` from all of those calls allowed us to remove quite a bit of log-based ceremony from watcher.	2018-10-04 09:18:04 -04:00
Jim Ferenczi	e8b986cc37	Fix sporadic failure in NestedObjectMapperTests Relates #34225	2018-10-04 07:40:46 +02:00
Nhat Nguyen	6dd716b0c4	Replace version with reader cache key in IndicesRequestCache (#34189 ) Today we use the version of a DirectoryReader as a component of the key of IndicesRequestCache. This usage is perfectly fine since the version is advanced every time a new change is made into IndexWriter. In other words, two DirectoryReaders with the same version should have the same content. However, this invariant is only guaranteed in the context of a single IndexWriter because the version is reset to the committed version value when IndexWriter is re-opened. Since #33473, each IndexShard may have more than one IndexWriter, and using the version of a DirectoryReader as a part of the cache key can cause IndicesRequestCache to return stale cached values. For example, in #27650, we rollback the engine (i.e., re-open IndexWriter), index new documents, refresh, then make a count request, but the search layer mistakenly returns the count of the DirectoryReader of the previous IndexWriter because the current DirectoryReader has the same version of the old DirectoryReader even their documents are different. This is possible because these two readers come from different IndexWriters. This commit replaces the the version with the reader cache key of IndexReader as a component of the cache key of IndicesRequestCache. Closes #27650 Relates #33473	2018-10-03 21:03:24 -04:00
Kazuhiro Sera	d45fe43a68	Fix a variety of typos and misspelled words (#32792 )	2018-10-03 18:11:38 +01:00
Jim Ferenczi	ee21067a41	Add early termination support for min/max aggregations (#33375 ) This commit adds the support to early terminate the collection of a leaf in the min/max aggregator. If the query matches all documents the min and max value for a numeric field can be retrieved efficiently in the points reader. This change applies this optimization when possible.	2018-10-03 18:33:39 +02:00
Lee Hinman	90c55f5e36	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-03 09:11:28 -06:00
albendz	f09190c14d	Require combine and reduce scripts in scripted metrics aggregation (#33452 ) * Make text message not required in constructor for slack * Remove unnecessary comments in test file * Throw exception when reduce or combine is not provided; update tests * Update integration tests for scripted metrics to always include reduce and combine * Remove some old changes from previous branches * Rearrange script presence checks to be earlier in build * Change null check order in script builder for aggregated metrics; correct test scripts in IT * Add breaking change details to PR	2018-10-03 15:22:01 +01:00
Jim Ferenczi	41528c0813	Adapt bwc version after backport (bis) Relates #34225	2018-10-03 14:24:01 +02:00
Jim Ferenczi	1aa8e72be7	Adapt bwc version after backport Relates #34225	2018-10-03 12:24:07 +02:00
Jim Ferenczi	5a3e031831	Preserve the order of nested documents in the Lucene index (#34225 ) Today we reverse the initial order of the nested documents when we index them in order to ensure that parents documents appear after their children. This means that a query will always match nested documents in the reverse order of their offsets in the source document. Reversing all documents is not needed so this change ensures that parents documents appear after their children without modifying the initial order in each nested level. This allows to match children in the order of their appearance in the source document which is a requirement to efficiently implement #33587. Old indices created before this change will continue to reverse the order of nested documents to ensure backwark compatibility.	2018-10-03 11:55:30 +02:00
Colin Goodheart-Smithe	2d64e3db9a	Adds trace logging to IndicesRequestCache (#34180 ) * Adds trace logging to IndicesRequestCache This change adds trace level logging to `IndicesrrequestCache` witht eh primary aim of helping to identify the cause of teh failures in https://github.com/elastic/elasticsearch/issues/32827. The cache will log at trace level when a cache hit or miss occurs including the reader version and the cache key. Note that this change adds a `cacheKeyRenderer` whcih supplies a human readable String of the cache key since the actual cache key itself is a `BytesReference` containing the wire protocol serialised form of the request. Logging is also added for the case where a search timeout occurs and fr that reason the cache entry is invalidated. * Adds comment to remaind us to remove cacheKeyRenderer	2018-10-03 08:58:33 +01:00
Gordon Brown	fb907706ec	Merge branch 'master' into index-lifecycle	2018-10-02 13:43:46 -06:00
Dimitrios Liappis	f12e0a8398	Add ES version 6.4.3 (#34239 ) Version bump	2018-10-02 21:15:58 +03:00
David Turner	a7ce4b31ed	Fix logging of cluster state update descriptions (#34182 ) In #28941 we changed the computation of cluster state task descriptions but this introduced a bug in which we only log the empty descriptions (rather than the non-empty ones). This change fixes that.	2018-10-02 19:08:19 +01:00
Christoph Büscher	5183ea3d68	Use OptionalInt instead of Optional<Integer> (#34220 ) Optionals containing boxed primitive types are prohibitively costly because they have two level of boxing. For Optional<Integer> the analogous OptionalInt can be used to avoid the boxing of the contained int value.	2018-10-02 15:58:07 +02:00
Jim Ferenczi	ead6ffce54	Fix cross fields mode of the query_string query (#34216 ) This change fixes a bug in the cross fields mode of the `query_string` query. The multi fields query builder must be reseted before parsing in order to clear the list of expanded fields coming from the previous text block. Closes #34215	2018-10-02 14:53:26 +02:00
Przemyslaw Gomulka	3f8cc89c9f	Completion types with multi-fields support (#34081 ) Mappings with completion type and multi-fields, were not able to index array or object format on completion fields. Only string format was supported. This is fixed by providing multiField parser with externalValueContext with already parsed object closes #15115	2018-10-02 14:32:56 +02:00
Alexander Reelsen	b1b0f3276b	Core: Add methods to get locale/timezone in DateFormatter (#34113 ) This adds some method into the `DateFormatter` interface, namely * `withLocale()` to change the locale of a date formatter * `getLocale()` * `getZone()` * `hashCode()` * `equals()` These methods will be needed for aggregations and mapping changes, where zones and locales can be specified in the mapping or in search/aggs parts of a search request.	2018-10-02 14:13:30 +02:00
Jim Ferenczi	aba4a59d0d	Handle terms query when detecting if a query can match nested docs (#34072 ) When nested objects are present in the mappings, we add a filter in queries to exclude them if there is no evidence that the query cannot match in this space. In 6x we visit the query in order to find a mandatory clause that can match root documents only. If we find one we can omit the nested documents filter. Currently only `term` and `range` queries are checked, this change adds the support for `terms` query to effectively remove the nested filter if a mandatory `terms` clause targets a non-nested field. Closes #34067	2018-10-02 09:30:23 +02:00
David Turner	2aff005a69	Clean up TransportMasterNodeAction (#34076 ) Mainly this fixes a warning by replacing the unchecked `new ActionListener` with the checked `new ActionListener<Response>`, and it also fixes the line length violations in this class.	2018-10-02 03:17:55 +01:00
Lee Hinman	2d9cb21490	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-10-01 14:10:09 -06:00
Christophe Bismuth	2923fb5b31	Disallow "enabled" attribute change for types in mapping update (#33933 ) This commit adds a check for "enabled" attribute change for types when a RestPutMappingAction is received. A MappingException is thrown when such a change is detected. Change are prevented in both ways: "false -> true" and "true -> false". Closes #33566	2018-10-01 20:49:08 +02:00
Vladimir Dolzhenko	2e2ae19b97	drop elasticsearch-translog for 7.0 (#33373 ) #32281 adds elasticsearch-shard to provide bwc version of elasticsearch-translog for 6.x; have to remove elasticsearch-translog for 7.0 Relates to #31389	2018-10-01 16:21:14 +02:00
Christoph Büscher	17e6932bf3	[Tests] Rename DocumentMapperMergeTests (#34121 ) Renaming to simply DocumentMapperTests to indicate this is where other unit tests should go. Also removing outdates Todo in DocumentMapperParserTests.	2018-10-01 10:29:19 +02:00
Jason Tedor	e2bd2028d8	Allow specifying shard changes batch sizes in bytes (#34168 ) This commit changes the shard changes requests from using a raw byte value to being able to be specified using bytes units (e.g., 4mb).	2018-09-30 14:22:22 -04:00
Martijn van Groningen	b1a27b2e6b	[CCR] Add unfollow API (#34132 ) The unfollow API changes a follower index into a regular index, so that it will accept write requests from clients. For the unfollow api to work the index follow needs to be stopped and the index needs to be closed. Closes #33931	2018-09-30 19:19:34 +02:00
Nhat Nguyen	ad61398879	CCR: Optimize indexing ops using seq_no on followers (#34099 ) This change introduces the indexing optimization using sequence numbers in the FollowingEngine. This optimization uses the max_seq_no_updates which is tracked on the primary of the leader and replicated to replicas and followers. Relates #33656	2018-09-28 20:42:26 -04:00
Ryan Ernst	47cbae9b26	Scripting: Remove ExecutableScript (#34154 ) This commit removes the legacy ExecutableScript, which was no longer used except in tests. All uses have previously been converted to script contexts.	2018-09-28 17:13:08 -07:00
Lee Hinman	6ea396a476	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-28 15:40:12 -06:00
Armin Braun	76dd3948f3	TESTS: Relax Assertion About Deleting Shard Dir (#34120 ) * TESTS: Relax Assertion About Deleting Shard Dir * Allow empty state directory to prevent test from failing * Closes #32686	2018-09-28 19:09:49 +02:00
Ryan Ernst	95977f4db9	Scripting: Add watcher script contexts (#34059 ) This commit removes the use of ExecutableScript from watcher in favor of custom script contexts for both watcher condition scripts and transform scripts.	2018-09-28 07:58:17 -07:00
Hendrik Muhs	e2f310b56c	Fix AggregationFactories.Builder equality and hash regarding order (#34005 ) Fixes the equals and hash function to ignore the order of aggregations to ensure equality after serialization and deserialization. This ensures storing configs with aggregation works properly. This also addresses a potential issue in caching when the same query contains aggregations but in different order. 1st it will not hit in the cache, 2nd cache objects which shall be equal might end up twice in the cache.	2018-09-28 13:30:50 +02:00
Armin Braun	c4b831645c	MINOR: Remove some deadcode in NodeEnv and Related (#34133 )	2018-09-28 12:40:20 +02:00
Alexander Reelsen	bc7d69f74a	Core: Don't rely on java time for epoch seconds formatting (#34086 ) In order to be compatible with joda time, this adds an epoch seconds formatter, that is able to parse floating point values. However joda time discards the floating point values, but still parses the data, where as this one is able to parse the whole value including milliseconds.	2018-09-28 10:53:33 +02:00
Alan Woodward	f243d75f59	Remove special-casing of Synonym filters in AnalysisRegistry (#34034 ) The synonym filters no longer need access to the AnalysisRegistry in their constructors, so we can remove the special-case code and move them to the common analysis module. This commit means that synonyms are no longer available for `server` integration tests, so several of these are either rewritten or migrated to the common analysis module as rest-spec-api tests	2018-09-28 09:02:47 +01:00
Julie Tibshirani	9cd4f70a67	Support 'string'-style queries on metadata fields when reasonable. (#34089 ) * Make sure 'ignored' and 'routing' field types inherit from StringFieldType. * Add tests for prefix and regexp queries. * Support prefix and regexp queries on _index fields.	2018-09-27 20:59:03 -07:00
Ryan Ernst	a2c941806b	Tests: Add support for custom contexts to mock scripts (#34100 ) This commit adds the ability to plug in compilation of custom contexts in mock script engine. This is needed for testing plugins which add custom contexts like watcher.	2018-09-27 12:23:59 -07:00
Jake Landis	73ee721b29	ingest: correctly measure chained pipeline stats (#33912 ) Prior to this change when a pipeline processor called another pipeline, only the stats for the first processor were recorded. The stats for the subsequent pipelines were ignored. This change properly accounts for pipelines irregardless if they are the first or subsequently called pipelines. This change moves the state of the stats from the IngestService to the pipeline itself. Cluster updates are safe since the pipelines map is atomically swapped, and if a cluster update happens while iterating over stats (now read directly from the pipeline) a slightly stale view of stats may be shown.	2018-09-27 13:54:26 -05:00
Lee Hinman	a26cc1a242	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-27 11:00:37 -06:00
Jason Tedor	899a7c7d99	Fix remote cluster seeds fallback (#34090 ) Recently we introduced the settings cluster.remote to take the place of search.remote for configuring remote cluster connections. We made this change due to the fact that we have generalized the remote cluster infrastructure to also be used within cross-cluster replication and not only cross-cluster search. For backwards compatibility, when we made this change, we allowed that cluster.remote would fallback to search.remote. Alas, the initial change for this contained a bug for handling the proxy and seeds settings. The bug for the seeds settings arose because we were manually iterating over the concrete settings only for cluster.remote seeds but not for search.remote seeds. This commit addresses this by iterating over both cluster.remote seeds and search.remote seeds. Additionally, when checking for existence of proxy settings, we have to not only check cluster.remote proxy settings, but also fallback to search.remote proxy settings. This commit addresses both issues, and adds tests for these situations.	2018-09-27 09:47:51 -04:00
Jim Ferenczi	269ae0bc15	Handle MatchNoDocsQuery in span query wrappers (#34106 ) * Handle MatchNoDocsQuery in span query wrappers This change adds a new SpanMatchNoDocsQuery query that replaces MatchNoDocsQuery in the span query wrappers. The `wildcard` query now returns MatchNoDocsQuery if the target field is not in the mapping (#34093) so we need the equivalent span query in order to be able to pass it to other span wrappers. Closes #34105	2018-09-27 14:19:08 +02:00
Christoph Büscher	cb4cdf17f0	Update MovAvgIT AwaitsFix bug url	2018-09-27 11:11:21 +02:00
Simon Willnauer	bda7bc145b	Fold EngineSearcher into Engine.Searcher (#34082 ) EngineSearcher can be easily folded into Engine.Searcher which removes a level of inheritance that is necessary for most of it's subclasses. This change folds it into Engine.Searcher and removes the dependency on ReferenceManager.	2018-09-27 09:06:04 +02:00
Armin Braun	acd80a1e07	TESTS: Enable DEBUG Logging in Flaky Test (#34091 ) * This should surface what errors are thrown on CI and in org.elasticsearch.transport.RemoteClusterConnection.ConnectHandler#collectRemoteNodes (the sequence of caught error in the last catch block and moving on to the next seed node seems to be the only path by which the errors logged in #33756 could come about) * Relates #33756	2018-09-27 06:02:24 +02:00
Nhat Nguyen	ea9b33527e	TEST: Add engine is closed as expected failure msg This commit adds "engine is closed" as an expected failure message. This change is due to #33967 in which we might access a closed engine on promotion. Relates #33967	2018-09-26 22:38:55 -04:00
Nhat Nguyen	12d94e44b8	Adjust bwc version for max_seq_no_of_updates Relates #33967 Relates #33842	2018-09-26 22:12:19 -04:00
Simon Willnauer	ae8e54493d	Build DocStats from SegmentInfos in ReadOnlyEngine (#34079 ) This change is related to #33903 that ports the DocStats simplification to the master branch. This change builds the docStats in the ReadOnlyEngine from the last committed segment infos rather than the reader. Co-authored-by: Tanguy Leroux <tlrx.dev@gmail.com>	2018-09-27 00:16:17 +02:00
Julie Tibshirani	1d08f63eff	When creating wildcard queries, use MatchNoDocsQuery when the field type doesn't exist. (#34093 )	2018-09-26 15:08:35 -07:00
Simon Willnauer	2b730d1b9d	Mute MovAvgIT#testHoltWintersNotEnoughData Relates to #34098	2018-09-26 23:50:31 +02:00
Mayya Sharipova	80c5d30f30	XContentBuilder to handle BigInteger and BigDecimal (#32888 ) Although we allow to index BigInteger and BigDecimal into a keyword field, source filtering on these fields would fail as XContentBuilder was not able to deserialize BigInteger and BigDecimal to json. This modifies XContentBuilder to allow to handle BigInteger and BigDecimal. Closes #32395	2018-09-26 14:24:31 -04:00
Julie Tibshirani	de8bfb908f	Delegate wildcard query creation to MappedFieldType. (#34062 ) * Delegate wildcard query creation to MappedFieldType. * Disallow wildcard queries on collation fields. * Disallow wildcard queries on non-string fields.	2018-09-26 09:36:41 -07:00
Nik Everett	ddce9704d4	Logging: Drop two deprecated methods (#34055 ) This drops two deprecated methods from `ESLoggerFactory`, switching all calls to those methods to calls to methods of the same name on `LogManager`.	2018-09-26 11:20:52 -04:00
Ryan Ernst	7800b4fa91	Core: Abstract DateMathParser in an interface (#33905 ) This commits creates a DateMathParser interface, which is already implemented for both joda and java time. While currently the java time DateMathParser is not used, this change will allow a followup which will create a DateMathParser from a DateFormatter, so the caller does not need to know the internals of the DateFormatter they have.	2018-09-26 07:56:25 -07:00
Zachary Tong	25d74bd0cb	Prefer mapped aggs to lead reductions (#33528 ) Previously, unmapped aggs try to delegate reduction to a sibling agg that is mapped. That delegated agg will run the reductions, and also reduce any pipeline aggs. But because delegation comes before running pipelines, the unmapped agg _also_ tries to run pipeline aggs. This causes the pipeline to run twice, and potentially double it's output in buckets which can create invalid JSON (e.g. same key multiple times) and break when converting to maps. This fixes by sorting the list of aggregations ahead of time so that mapped aggs appear first, meaning they preferentially lead the reduction. If all aggs are unmapped, the first unmapped agg simply creates a new unmapped object and returns that for the reduction. This means that unmapped aggs no longer defer and there is no chance for a secondary execution of pipelines (or other side effects caused by deferring execution). Closes #33514	2018-09-26 10:09:31 -04:00
Nik Everett	1871e7f7e9	Search: Simply SingleFieldsVisitor (#34052 ) `SingleFieldsVisitor` is meant to load a single stored field but it manages to be quite complex to reason about because it inherits from our "basic" `FieldsVisitor` which is designed to load many fields. This breaks that inheritance and adds logic to `SingleFieldsVisitor` so it can be properly stand alone. While this amounts to more lines of code they ought to be significantly easier to reason about.	2018-09-26 09:48:15 -04:00
David Roberts	1413ace74f	Mute testSplitFromOneToN and testCreateShrinkIndexToN on Windows Relates #34080	2018-09-26 14:02:14 +01:00
Christoph Büscher	ba3ceeaccf	Clean up "unused variable" warnings (#31876 ) This change cleans up "unused variable" warnings. There are several cases were we most likely want to suppress the warnings (especially in the client documentation test where the snippets contain many unused variables). In a lot of cases the unused variables can just be deleted though.	2018-09-26 14:09:32 +02:00
Jim Ferenczi	a255880497	Add nested and object fields to field capabilities response (#33803 ) This commit adds nested and object fields to the field capabilities response. Closes #33237	2018-09-26 08:59:41 +02:00
Ryan Ernst	be8475955e	Scripting: Use ParameterMap for deprecated ctx var in update scripts (#34065 ) This commit removes the sysprop controlling whether ctx is in params for update scripts and replaces it with use of the new ParameterMap, which outputs a deprecation warning whenever params.ctx is used.	2018-09-25 22:08:02 -07:00
Nhat Nguyen	8a56369f5b	Move max_unsafe_auto_id_timestamp constant to Engine (#34025 ) We should not access InternalEngine in other classes.	2018-09-25 19:20:00 -04:00
Jim Ferenczi	0f878eff19	Add a limit for graph phrase query expansion (#34031 ) Today query parsers throw TooManyClauses exception when a query creates too many clauses. However graph phrase queries do not respect this limit. This change adds a protection against crazy expansions that can happen when building a graph phrase query. This is a temporary copy of the fix available in https://issues.apache.org/jira/browse/LUCENE-8479 but not merged yet. This logic will be removed when we integrate the Lucene patch in a future release.	2018-09-25 21:38:47 +02:00
Igor Motov	1e6780d703	Mute AckClusterUpdateSettingsIT Tracked by #33673	2018-09-25 14:16:47 -04:00
Armin Braun	0ba1855740	INGEST: Tests for Drop Processor (#33430 ) * INGEST: Tests for Drop Processor * UT for behavior of dropped callback and drop processor * Moved drop processor to `server` project to enable this test * Simple IT * Relates #32278	2018-09-25 19:29:22 +02:00
Christoph Büscher	ecc087a5bb	Remove Join utility class (#34037 ) The functionality can be replaces with String.join in new Java versions.	2018-09-25 15:25:54 +02:00
David Turner	7c63f5455b	Use a threadsafe map in SearchAsyncActionTests (#33700 ) Today `SearchAsyncActionTests#testFanOutAndCollect` uses a simple `HashMap` for the `nodeToContextMap` variable, which is then accessed from multiple threads without, apparently, explicit synchronisation. This provides an explanation for the test failure identified in #29242 in which `.toString()` returns `"[]"` just before `.isEmpty` returns `false`, without any concurrent modifications. This change converts `nodeToContextMap` to a `newConcurrentMap()` so that this cannot occur. It also fixes a race condition in the detection of double-calling the subsequent search phase. Closes #29242.	2018-09-25 13:58:05 +01:00
Nhat Nguyen	5166dd0a4c	Replicate max seq_no of updates to replicas (#33967 ) We start tracking max seq_no_of_updates on the primary in #33842. This commit replicates that value from a primary to its replicas in replication requests or the translog phase of peer-recovery. With this change, we guarantee that the value of max seq_no_of_updates on a replica when any index/delete operation is performed at least the max_seq_no_of_updates on the primary when that operation was executed. Relates #33656	2018-09-25 08:07:57 -04:00
Luca Cavanna	970407c663	[DOCS] add comment to clarify cluster name resolution (#34014 ) We currently fallback to local indices whenever a remote cluster is not found, as there may still be indices / aliases with the same name. Such behaviour is lenient but needs to be kept for backwards compatibility. Clarified that in the code so we don't forget. Relates to #26247	2018-09-25 14:03:07 +02:00
Adrien Grand	612201aee0	Fix created version for similarity validation. (#33890 ) It mistakenly uses the Elasticsearch major version instead of the Lucene major version. I noticed it when backporting, it is not noticeable on master because the only two Lucene versions that are supported, 7 and 8, encode norms the same way, unlike Lucene 6.	2018-09-25 13:48:25 +02:00
Hendrik Muhs	bf6cf6b6d9	refactor CompositeValuesSourceParserHelper for reusage by making it public (#33945 ) refactor CompositeValuesSourceParserHelper for reusage by making it public and moving toXContent into it	2018-09-25 09:15:52 +02:00
David Turner	3af8fc74c7	Make TransportService more test-friendly (#33869 ) Today, TransportService uses System.currentTimeMillis() to get the current time to report on things like timeouts, and enqueues lambdas for future execution. However, in tests it is useful to be able to fake out the current time and to see what all these enqueued lambdas are really for. This change alters the situation so that we can obtain the time from the more easily-faked ThreadPool#relativeTimeInMillis(), and implements some friendlier toString() methods on the various Runnables so we can see what they are later.	2018-09-25 07:50:18 +01:00
Lee Hinman	243e863f6e	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-24 10:33:51 -06:00
Armin Braun	25bc8c4b5a	Fix typo `NodeEnvironment#assertPathsDoNotExist` (#33996 ) * We want to check the individual paths here one by one to get a better to interpret assertion message	2018-09-24 17:57:27 +02:00
Julie Tibshirani	8e8bd56cc7	In MatchQuery, remove a check for fragile search analyzers. (#33927 ) As far as I can tell this guard against fragile analyzers is no longer relevant, since we stopped setting special analyzers on numeric fields (3bf6f4). Instead of removing the guard completely, I opted to keep a check for untokenized + unnormalized fields to avoid going through the analysis process unnecessarily. My motivation for simplifying this check is that I'd like to add support for `split_queries_on_whitespace` to the new 'queryable object' fields. As it stands, I would have to add a dedicated instanceof check for the new mapper, which is not optimal.	2018-09-24 08:56:13 -07:00
Tim Brooks	78e483e8d8	Introduce abstract security transport testcase (#33878 ) This commit introduces an AbstractSimpleSecurityTransportTestCase for security transports. This classes provides transport tests that are specific for security transports. Additionally, it fixes the tests referenced in #33285.	2018-09-24 09:44:44 -06:00
Ignacio Vera	df333ca305	TESTS: Make score Float#NaN when there is no max score (#33997 ) * TESTS: Make score Float#NaN when there is no max score Fixes test failure due to maxScore set to Float#MinValue instead on Float#NaN. In addition the initial value for maxScore is set to Float#NEGATIVE_INFINITY so it is an illegal value. Closes #33993	2018-09-24 17:36:48 +02:00
Luca Cavanna	e389d9e296	Clarify RemoteClusterService#groupIndices behaviour (#33899 ) When executing a cross-cluster search, we need to search against all local indices (and no remote indices) in case no indices are specified. Also, if only remote indices are specified, no local indices will be queried. We previously added empty local indices whenever they were not present in the map of the grouped indices, then we would act differently later based on the extracted remote indices. Instead, we now add the empty array for local indices only in case we need to search all local indices; the entry for local indices is not added when local indices should not be searched. This way the grouped indices reflect reality and provide a better indication of what indices will be searched.	2018-09-24 11:45:33 +02:00
Christophe Bismuth	47ed6c79ee	[TEST] Add validate query tests for empty and malformed queries (#33862 ) Relates to #33095	2018-09-24 11:21:47 +02:00
Simon Willnauer	7d703c2f92	Fix AutoQueueAdjustingExecutorBuilder settings validation (#33922 ) Settings validation in AutoQueueAdjustingExecutorBuilder always checked against a default value which means that we never can change a max queue size that is lower than the default. This change adds tests and fixes this validation.	2018-09-24 07:45:50 +02:00
Nhat Nguyen	432e61c971	Adjust bwc for resync request (#33964 ) Relates #33964	2018-09-22 19:29:38 -04:00
Nhat Nguyen	f2f08dd6c5	Adjust bwc for recovery request (#33693 ) Relates #33693	2018-09-22 19:28:20 -04:00
Nhat Nguyen	e7ae2f9d36	Propagate auto_id_timestamp in primary-replica resync (#33964 ) A follow-up of #33693 to propagate max_seen_auto_id_timestamp in a primary-replica resync. Relates #33693	2018-09-22 11:40:10 -04:00
Nhat Nguyen	7944a0cb25	Track max seq_no of updates or deletes on primary (#33842 ) This PR is the first step to use seq_no to optimize indexing operations. The idea is to track the max seq_no of either update or delete ops on a primary, and transfer this information to replicas, and replicas use it to optimize indexing plan for index operations (with assigned seq_no). The max_seq_no_of_updates on primary is initialized once when a primary finishes its local recovery or peer recovery in relocation or being promoted. After that, the max_seq_no_of_updates is only advanced internally inside an engine when processing update or delete operations. Relates #33656	2018-09-22 08:02:57 -04:00
Vladimir Dolzhenko	9c0316869b	Store: keep IndexFormatTooOldException and IndexFormatTooNewException in corruption marker (#33920 ) Closes #33916	2018-09-21 14:00:02 +02:00
Nik Everett	cac93949fe	API: Drop deprecated methods from Retry (#33925 ) We deprecated the `Retry.withBackoff` flavors with `Settings` in 6.5 because they were no longer needed. This drops them form 7.0.	2018-09-21 07:55:50 -04:00
Christoph Büscher	b654d986d7	Add OneStatementPerLineCheck to Checkstyle rules (#33682 ) This change adds the OneStatementPerLineCheck to our checkstyle precommit checks. This rule restricts the number of statements per line to one. The resoning behind this is that it is very difficult to read multiple statements on one line. People seem to mostly use it in short lambdas and switch statements in our code base, but just going through the changes already uncovered some actual problems in randomization in test code, so I think its worth it.	2018-09-21 11:52:31 +02:00
Nhat Nguyen	5f7f793f43	Propagate max_auto_id_timestamp in peer recovery (#33693 ) Today we don't store the auto-generated timestamp of append-only operations in Lucene; and assign -1 to every index operations constructed from LuceneChangesSnapshot. This looks innocent but it generates duplicate documents on a replica if a retry append-only arrives first via peer-recovery; then an original append-only arrives via replication. Since the retry append-only (delivered via recovery) does not have timestamp, the replica will happily optimizes the original request while it should not. This change transmits the max auto-generated timestamp from the primary to replicas before translog phase in peer recovery. This timestamp will prevent replicas from optimizing append-only requests if retry counterparts have been processed. Relates #33656 Relates #33222	2018-09-20 19:53:30 -04:00
Vladimir Dolzhenko	dbe6405354	mute RemoveCorruptedShardDataCommandTests.testCorruptedIndex	2018-09-20 21:30:40 +02:00
Nhat Nguyen	76a1a863e3	TEST: stop assertSeqNos if shards movement (#33875 ) Currently, assertSeqNos assumes that the cluster is stable at the end of the test (i.e., no more shard movement). However, this assumption does not always hold. In these cases, we can stop the assertion instead of failing a test. Closes #33704	2018-09-20 13:44:26 -04:00
Christoph Büscher	28b1d41007	Fix unused import checktyle issue	2018-09-20 19:42:15 +02:00
Nhat Nguyen	002f763c48	Restore local history from translog on promotion (#33616 ) If a shard was serving as a replica when another shard was promoted to primary, then its Lucene index was reset to the global checkpoint. However, if the new primary fails before the primary/replica resync completes and we are now being promoted, we have to restore the reverted operations by replaying the translog to avoid losing acknowledged writes. Relates #33473 Relates #32867	2018-09-20 13:21:11 -04:00
Nhat Nguyen	b13a434f59	Remove wrong assert in LocalCheckpointTrackerTests It's possible for the set "seqNos" to contain only the "unFinishedSeq" in the testConcurrentReplica test. If this is the case, the call `randomValueOtherThan` won't make any progress because the predicate will never be false. This commit removes this expectation because it's incorrect and it's no longer needed as we have a dedicated test to verify the contains method. Relates #33871	2018-09-20 13:12:19 -04:00
Alan Woodward	b33c18d316	Move SoraniNormalizationFilterFactory to the common analysis plugin (#33892 ) Follow up to #25715	2018-09-20 17:31:41 +01:00
Yannick Welsch	db327818dd	[TEST] Enable DEBUG logging on testCreateShrinkIndexToN	2018-09-20 18:16:20 +02:00
Nik Everett	f963c29876	Logging: Drop Settings from some logger lookups (#33859 ) Drops `Settings` from some of the methods to lookup loggers and deprecates another logger lookup that takes `Settings` because `Settings` is no longer required to build a logger.	2018-09-20 10:42:48 -04:00
Jake Landis	e37e5dfc04	ingest: support simulate with verbose for pipeline processor (#33839 ) * ingest: support simulate with verbose for pipeline processor This change better supports the use of simulate?verbose with the pipeline processor. Prior to this change any pipeline processors executed with simulate?verbose would not show all intermediate processors for the inner pipelines. This changes also moves the PipelineProcess and TrackingResultProcessor classes to enable instance checks and to avoid overly public classes. As well this updates the error message for when cycles are detected in pipelines calling other pipelines.	2018-09-20 08:33:07 -05:00
Simon Willnauer	3522b9084b	Introduce a `search_throttled` threadpool (#33732 ) Today all searches happen on the search threadpool which is the correct behavior in almost any case. Yet, there are exceptions where for instance searches searches should be passed through a single-thread thread-pool to reduce impact on a node. This change adds a index-private setting that allows to mark an index as throttled for searches and forks off all non-stats searcher access to this thread-pool for indices that are marked as `index.search.throttled`	2018-09-20 13:43:11 +02:00
David Turner	c041e94349	Test that transient settings beat persistent ones (#33818 ) Transient settings override persistent settings, but in fact all of the tests that run as part of `:server:test` and `:server:integTest` will pass if the precedence is changed to be the other way round. This change adds a test that verifies the precedence is as documented.	2018-09-20 11:17:19 +01:00
Tim Vernum	8d50c10208	Mute ShrinkIndexIT.testCreateShrinkIndexToN on Windows Relates: #33857	2018-09-20 18:21:15 +10:00
Daniel Mitterdorfer	b1cc58e425	Allow to clear the fielddata cache per field With this commit we clear the fielddata cache per field as it is supposed to be. Previously we retrieved the proper field from the cache but then cleared the entire cache anyway. Closes #33798 Relates #33807	2018-09-20 08:59:53 +02:00
Tim Vernum	1f1ebb4656	Add additional null check in _cat/shards The target of the func lambda may be null (e.g. in a mixed cluster where older nodes lack some of the values) Relates: #33858 / 331caba Closes #33877	2018-09-20 06:44:13 +02:00
Nhat Nguyen	05bf9dc2e8	Add contains method to LocalCheckpointTracker (#33871 ) This change adds "contains" method to LocalCheckpointTracker. One of the use cases is to check if a given operation has been processed in an engine or not by looking up its seq_no in LocalCheckpointTracker. Relates #33656	2018-09-19 20:29:36 -04:00
Gordon Brown	90de436e55	Use custom index metadata for ILM state (#33783 ) Using index settings for ILM state is fragile and exposes too much information that doesn't need to be exposed. Using custom index metadata is more resilient and allows more controlled access to internal information. As part of these changes, moves away from using defaults for ILM-related values, in favor of using null values to clearly indicate that the value is not present.	2018-09-19 14:50:48 -06:00
Nik Everett	26c4f1fb6c	Core: Default node.name to the hostname (#33677 ) Changes the default of the `node.name` setting to the hostname of the machine on which Elasticsearch is running. Previously it was the first 8 characters of the node id. This had the advantage of producing a unique name even when the node name isn't configured but the disadvantage of being unrecognizable and not being available until fairly late in the startup process. Of particular interest is that it isn't available until after logging is configured. This forces us to use a volatile read whenever we add the node name to the log. Using the hostname is available immediately on startup and is generally recognizable but has the disadvantage of not being unique when run on machines that don't set their hostname or when multiple elasticsearch processes are run on the same host. I believe that, taken together, it is better to default to the hostname. 1. Running multiple copies of Elasticsearch on the same node is a fairly advanced feature. We do it all the as part of the elasticsearch build for testing but we make sure to set the node name then. 2. That the node.name defaults to some flavor of "localhost" on an unconfigured box feels like it isn't going to come up too much in production. I expect most production deployments to at least set the hostname. As a bonus, production deployments need no longer set the node name in most cases. At least in my experience most folks set it to the hostname anyway.	2018-09-19 15:21:29 -04:00
Simon Willnauer	a92dda2e7e	Move CompletionStats into the Engine (#33847 ) By moving CompletionStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibiltiy out of IndexShard which has quiet some complexity already. Relates to #33835	2018-09-19 20:35:57 +02:00
Simon Willnauer	0fa5758bc6	Fix potential NPE in `_cat/shards/` with partial CommonStats (#33858 ) Today if we fetch common stats from a shard we might get a partial response if the shard is closed while we fetch the stats. This causes hard to track and reproduce NPEs. This change streamlines null checking to ensure we only render stats we actually received.	2018-09-19 20:34:54 +02:00
Nik Everett	3ede13a454	Test framework fall cleaning (#33423 ) Wraps all lines in our test framework at 140 characters because that is our standard line length and removes all of the checkstyle suppressions for the test framework. Drops most of `ModuleTestCase` because it isn't used and we're moving away from using guice in the way that it wants to test anyway. Also switches a few classes that extend it but don't use it to extend `ESTestCase` instead.	2018-09-19 14:34:02 -04:00
Lee Hinman	81e9150c7a	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-19 09:43:26 -06:00
Simon Willnauer	6ec12bef0d	Add missing IndexShard#readAllowed() This was lost in #33835	2018-09-19 17:07:13 +02:00
Alan Woodward	5107949402	Allow TokenFilterFactories to rewrite themselves against their preceding chain (#33702 ) We currently special-case SynonymFilterFactory and SynonymGraphFilterFactory, which need to know their predecessors in the analysis chain in order to correctly analyze their synonym lists. This special-casing doesn't work with Referring filter factories, such as the Multiplexer or Conditional filters. We also have a number of filters (eg the Multiplexer) that will break synonyms when they appear before them in a chain, because they produce multiple tokens at the same position. This commit adds two methods to the TokenFilterFactory interface. * `getChainAwareTokenFilterFactory()` allows a filter factory to rewrite itself against its preceding filter chain, or to resolve references to other filters. It replaces `ReferringFilterFactory` and `CustomAnalyzerProvider.checkAndApplySynonymFilter`, and by default returns `this`. * `getSynonymFilter()` defines whether or not a filter should be applied when building a synonym list `Analyzer`. By default it returns `true`. Fixes #33609	2018-09-19 15:52:14 +01:00
Christoph Büscher	546e7361ed	[Tests] Nudge wait time in RemoteClusterServiceTests (#33853 ) This test occasionally fails in `testCollectSearchShards` waiting on what seems to be a search request to a remote cluster for one second. Given that the test fails here very rarely I suspect maybe one second is very rarely not enough so we could fix it by increasing the max wait time slightly. Closes #33852	2018-09-19 15:58:35 +02:00
Simon Willnauer	0c77f45dc6	Move DocsStats into Engine (#33835 ) By moving DocStats into the engine we can easily cache the stats for read-only engines if necessary. It also moves the responsibility out of IndexShard which has quiet some complexity already.	2018-09-19 11:03:11 +02:00
Vladimir Dolzhenko	a3e8b831ee	add elasticsearch-shard tool (#32281 ) Relates #31389	2018-09-19 10:28:22 +02:00
Simon Willnauer	251489d59a	Cut over to unwrap segment reader (#33843 ) The fix in #33757 introduces some workaround since FilterCodecReader didn't support unwrapping. This cuts over to a more elegant fix to access the readers segment infos.	2018-09-19 10:18:03 +02:00
Jim Ferenczi	61e1df0274	Use the global doc id to generate a random score (#33599 ) This commit changes the random_score function to use the global docID of the document rather than the segment docID to generate random scores. As a result documents that have the same segment docID within the shard will generate different scores.	2018-09-19 09:28:38 +02:00
Adrien Grand	c4261bab44	Add minimal sanity checks to custom/scripted similarities. (#33564 ) Add minimal sanity checks to custom/scripted similarities. Lucene 8 introduced more constraints on similarities, in particular: - scores must not be negative, - scores must not decrease when term freq increases, - scores must not increase when norm (interpreted as an unsigned long) increases. We can't check every single case, but could at least run some sanity checks. Relates #33309	2018-09-19 09:19:13 +02:00
Ignacio Vera	7f473b683d	Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. (#33196 ) * Profiler: Don’t profile NEXTDOC for ConstantScoreQuery. A ConstantScore query will return the iterator of its inner query. However, when profiling, the constant score query is wrapped separately from its inner query, which distorts the times emitted by the profiler. Return the iterator directly in such a case. Closes #23430	2018-09-18 23:32:16 -07:00
Lee Hinman	c87cff22b4	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-18 13:57:41 -06:00
Zachary Tong	f4cbbcf98b	Add ES version 6.4.2 (#33831 ) Version and properties files	2018-09-18 15:25:20 -04:00
Armin Braun	c6462057a1	MINOR: Remove Some Dead Code in Scripting (#33800 ) * The is default check method is not used in ScriptType * The removed vars on ExpressionSearchScript are unused	2018-09-18 20:43:31 +02:00
Simon Willnauer	9026c3ee92	Ensure realtime `_get` and `_termvectors` don't run on the network thread (#33814 ) The change in #27500 introduces this regression that causes `_get` and `_term_vector` actions to run on the network thread if the realtime flag is set. This fixes the issue by delegating to the super method forking on the corresponding threadpool.	2018-09-18 19:53:42 +02:00
Simon Willnauer	98ccd94962	Factor out a ChannelActionListener (#33819 ) We use similar / same concepts in SerachTransportService and HandledTransportAction but both duplicate the efforts with slightly different implementation details. This streamlines sending responses / exceptions back to a channel in an ActionListener with appropriate logging.	2018-09-18 19:53:26 +02:00
Jim Ferenczi	241c74efb2	upgrade to a new snapshot of Lucene 8 (7d0a7782fa) (#33812 )	2018-09-18 18:16:40 +02:00
David Turner	421f58e172	Remove discovery-file plugin (#33257 ) In #33241 we moved the file-based discovery functionality to core Elasticsearch, but preserved the `discovery-file` plugin, and support for the existing location of the `unicast_hosts.txt` file, for BWC reasons. This commit completes the removal of this plugin.	2018-09-18 12:01:16 +01:00
markharwood	2fa09f062e	New plugin - Annotated_text field type (#30364 ) New plugin for annotated_text field type. Largely a copy of `text` field type but adds ability to include markdown-like syntax in the text. The “AnnotatedText” class parses text+markup and converts into plain text and AnnotationTokens. The annotation token values are injected unchanged alongside the regular text tokens to provide a form of additional indexed overlay useful in positional searches and highlighting. Annotated_text fields do not support fielddata as we want to phase this out. Also includes a new "annotated" highlighter type that retains annotations and merges in search hits as additional annotation markup. Closes #29467	2018-09-18 10:25:27 +01:00
Armin Braun	87cedef3cf	NETWORKING:Def CName in Http Publish Addr to True (#33631 ) * Follow up to #32806 setting the setting to true for 7.x	2018-09-18 10:29:02 +02:00
Armin Braun	615f494c77	MINOR: Drop Redundant Ctx. Check in ScriptService (#33782 ) * MINOR: Drop Redundant Ctx. Check in ScriptService * This check is completely redundant, the expression script engine will throw anyway (and with a similar message) for those contexts that it cannot compile. Moreover, the update context is not the only context that is not suported by the expression engine at this point so handling the update context separately here makes no sense.	2018-09-18 07:25:22 +02:00
Or Bin	a5bad4d92c	Docs: Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' (#33744 ) Fixed a grammatical mistake: 'a HTTP ...' -> 'an HTTP ...' Closes #33728	2018-09-17 15:35:54 -04:00
Lee Hinman	7ff11b4ae1	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-17 10:41:10 -06:00
Vladimir Dolzhenko	4d0bea705c	Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641 ) Do not report negative free bytes for DiskThresholdDecider#canAllocate (#33641) Closes #33596	2018-09-17 17:56:47 +02:00
Armin Braun	a654f21599	TESTS: Fix Concurent Remote Connection Updates (#33707 ) * Same fix idea as in #10666a4 to prevent background threads trying to reconnect after the tests are done from throwing `ExecutionCancelledException` and breaking the test * Closes #30714	2018-09-17 16:38:44 +02:00
Bukhtawar	14d57c1115	Skip rebalancing when cluster_concurrent_rebalance threshold reached (#33329 ) Allows to skip shard balancing when the cluster_concurrent_rebalance threshold is already reached, which cuts down the time spent in the rebalance method of BalancedShardsAllocator.	2018-09-17 13:13:44 +02:00
Adrien Grand	b06a082725	Improve reproducibility of BigArraysTests. Close #33750	2018-09-17 11:59:15 +02:00
Christoph Büscher	1f2a90cb39	Mute DateTimeUnitTests.testConversion	2018-09-17 11:16:50 +02:00
Martijn van Groningen	34379887b4	Make custom index metadata completely immutable (#33735 ) Currently `IndexMetadata#getCustomData(...)` wraps the custom metadata in an unmodifiable map, but in case there is no entry for the specified key then a NPE is thrown by Collections.unmodifiableMap(...). This is not ideal in case callers like to throw an exception with a specific message. (like in the case for ccr to indicate that the follow index was not created by the create_and_follow api and therefor incompatible as follow index) I think making `DiffableStringMap` itself immutable is better then just wrapping custom metadata with `Collections.unmodifiableMap(...)` in all methods that access it. Also removed the `equals()`, `hashcode()` and to `toString()` methods of `DiffableStringMap`, because `AbstractMap` already implements these methods.	2018-09-17 07:51:34 +02:00
Ryan Ernst	3046656ab1	Scripting: Rework joda time backcompat (#33486 ) This commit switches the joda time backcompat in scripting to use augmentation over ZonedDateTime. The augmentation methods provide compatibility with the missing methods between joda's DateTime and java's ZonedDateTime. Due to getDayOfWeek returning an enum in the java API, ZonedDateTime is wrapped so that the method can return int like the joda time does. The java time api version is renamed to getDayOfWeekEnum, which will be kept through 7.x for compatibility while users switch back to getDayOfWeek once joda compatibility is removed.	2018-09-16 19:18:00 -07:00
Ryan Ernst	e5d82c3dea	Test: Fix dv date bwc tests when no docs have a value (#32798 ) This commit adds a guard around the rare case that no documents in the 10 iterations actually have any values, thus making the warning check incorrect. closes #32779	2018-09-16 11:11:51 -07:00
Lee Hinman	e6cbaa5a78	Merge remote-tracking branch 'origin/master' into index-lifecycle	2018-09-14 16:27:37 -06:00
Jason Tedor	a0f0d7860e	Cleanup assertions in global checkpoint listeners (#33722 ) This commit is a cleanup of the assertions in global checkpoint listeners, simplifying them and adding some messages to them in case the assertions trip.	2018-09-14 14:45:58 -04:00
Christoph Büscher	bcbbbdf660	[Tests] Fix randomization in StringTermsIT (#33678 ) It looks like the COLLECT_SEGMENT_ORDS flag should be randomized.	2018-09-14 15:52:47 +02:00
Jason Tedor	39191331d1	Only notify ready global checkpoint listeners (#33690 ) When we add a global checkpoint listener, it is also carries along with it a value that it thinks is the current global checkpoint. This value can be above the actual global checkpoint on a shard if the listener knows the global checkpoint from another shard copy (e.g., the primary), and the current shard copy is lagging behind. Today we notify the listener whenever the global checkpoint advances, regardless if it goes above the current global checkpoint known to the listener. This commit reworks this implementation. Rather than thinking of the value associated with the listener as the current global checkpoint known to the listener, we think of it as the value that the listener is waiting for the global checkpoint to advance to (inclusive). Now instead of notifying all waiting listeners when the global checkpoint advances, we only notify those that are waiting for a value not larger than the actual global checkpoint that we advanced to.	2018-09-14 09:32:03 -04:00
Adrien Grand	4f68104865	Don't count hits via the collector if the hit count can be computed from index stats. (#33701 ) This is something that we were already doing when sorting by field, which is now also done when sorting by score. As-is this change will speed up top-k `term` queries. This could work for `match_all` queries as well when we implement the `setMinCompetitiveScore` API on their Scorer.	2018-09-14 14:59:16 +02:00
Alexander Reelsen	faa3c16241	Core: Add DateFormatter interface for java time parsing (#33467 ) The existing approach used date formatters when a format based string like `date_time\|\|epoch_millis` was used, instead of the custom code. In order to properly solve this, a new interface called `DateFormatter` has been added, which now can be implemented for custom formatters. Currently there are two implementations, one using java time and one doing the epoch_millis formatter, which simply parses a number and then converts it to a date in UTC timezone. The DateFormatter interface now also has a method to retrieve the name of the formatter pattern, which is needed for mapping changes anyway. The existing `CompoundDateTimeFormatter` class has been removed, the name was not really nice anyway. One more minor change is the fact, that the new java time using FormatDateFormatter does not try to parse the date with its printer implementation first (which might be a strict one and fail), but a printer can now be specified in addition. This saves one potential failure/exception when parsing less strict dates. If only a printer is specified, the printer will also be used as a parser.	2018-09-14 13:55:16 +02:00
Igor Motov	b8fb83d7a4	Mute ClusterDisruptionIT#testSendingShardFailure Tracked by #33704	2018-09-14 14:24:06 +04:00
Armin Braun	0b4960ff6b	SCRIPTING: Move terms_set Context to its Own Class (#33602 ) * SCRIPTING: Move terms_set Context to its Own Class * Extracted TermsSetQueryScript * Kept mechanics close to what they were with SearchScript	2018-09-14 06:21:18 +02:00
Armin Braun	040695b64e	CORE: Disable Setting Type Validation (#33660 ) (#33669 ) * Reverts setting type validation introduced in #33503	2018-09-13 20:45:48 +02:00
Jason Tedor	e4eb631b8e	Revert "Use serializable exception in GCP listeners (#33657 )" This reverts commit `6dfe54c838`.	2018-09-13 13:55:19 -04:00
Nhat Nguyen	b3071133d4	TEST: decrease logging level in the flush test Relates #31629	2018-09-13 11:18:03 -04:00
Jason Tedor	d806a0e59d	Fix race in global checkpoint listeners test This race can occur if the latch from the listener notifies the test thread and the test thread races ahead before the scheduler thread has a chance to emit the log message. This commit fixes this test by not counting down the latch until after the log message we are going to assert on has been emitted.	2018-09-13 07:00:40 -04:00
Jason Tedor	6dfe54c838	Use serializable exception in GCP listeners (#33657 ) We used TimeoutException here but that's not serializable. This commit switches to a serializable exception so that we can test for the exception type on the remote side.	2018-09-13 06:35:36 -04:00
Colin Goodheart-Smithe	8e59de3eb2	Merge branch 'master' into index-lifecycle	2018-09-13 09:46:14 +01:00
Jim Ferenczi	6ca36bba15	Fix field mapping updates with similarity (#33634 ) This change fixes a bug introduced in 6.3 that prevents fields with an explicit similarity to be updated. It also adds a test that checks this case for similarities but also for analyzers since they could suffer from the same problem. Closes #33611	2018-09-13 09:21:27 +02:00
David Turner	5a3fd8e4e7	Use file-based discovery not MockUncasedHostsProvider (#33554 ) Today we use a special unicast hosts provider, the `MockUncasedHostsProvider`, in many integration tests, to deal with the dynamic nature of the allocation of ports to nodes. However #33241 allows us to use file-based discovery to achieve the same goal, so the special test-only `MockUncasedHostsProvider` is no longer required. This change removes `MockUncasedHostProvider` and replaces it with file-based discovery in tests based on `EsIntegTestCase`.	2018-09-13 07:37:15 +02:00
Nhat Nguyen	b097eff342	Resync fails to notify on unavaiable exceptions (#33615 ) We fail to notify the resync listener if the resync replication hits a shard unavailable exception. Moreover, we no longer need to swallow these unavailable exceptions. Relates #28571 Closes #33613	2018-09-12 21:27:59 -04:00
Jason Tedor	9b8fe85edb	Remove volatile from global checkpoint listeners (#33636 ) This field does not need to be volatile because all accesses are done under a lock. This commit removes the unnecessary volatile modifier from this field.	2018-09-12 14:38:24 -04:00
Jason Tedor	c023f67c5d	Add migration note for remote cluster settings (#33632 ) The remote cluster settings search.remote.* have been renamed to cluster.remote.* and are automatically upgraded in the cluster state on gateway recovery, and on put. This commit adds a note to the migration docs for these changes.	2018-09-12 13:37:11 -04:00
Simon Willnauer	c783488e97	Add `_source`-only snapshot repository (#32844 ) This change adds a `_source` only snapshot repository that allows to wrap any existing repository as a _backend_ to snapshot only the `_source` part including live docs markers. Snapshots taken with the `source` repository won't include any indices, doc-values or points. The snapshot will be reduced in size and functionality such that it requires full re-indexing after it's successfully restored. The restore process will copy the `_source` data locally starts a special shard and engine to allow `match_all` scrolls and searches. Any other query, or get call will fail with and unsupported operation exception. The restored index is also marked as read-only. This feature aims mainly for disaster recovery use-cases where snapshot size is a concern or where time to restore is less of an issue. NOTE: The snapshot produced by this repository is still a valid lucene index. This change doesn't allow for any longer retention policies which is out of scope for this change.	2018-09-12 17:47:10 +02:00
Jason Tedor	36ba3cda7e	Enable global checkpoint listeners to timeout (#33620 ) In cross-cluster replication, we will use global checkpoint listeners to long poll for updates to a shard. However, we do not want these polls to wait indefinitely as it could be difficult to discern if the listener is still waiting for updates versus something has gone horribly wrong and cross-cluster replication is stuck. Instead, we want these listeners to timeout after some period (for example, one minute) so that they are notified and we can update status on the following side that cross-cluster replication is still active. After this, we will immediately enter back into a poll mode. To do this, we need the ability to associate a timeout with a global checkpoint listener. This commit adds this capability.	2018-09-12 10:53:22 -04:00
Nhat Nguyen	d9bbb89b26	TEST: Adjust rollback condition when shard is empty If a shard is empty, it won't rollback its engine on promotion. This commit adjusts the expectation in the rollback test. Relates #33473	2018-09-12 08:26:02 -04:00
lipsill	c92ec1c5d7	Forbid negative `weight` in Function Score Query (#33390 ) This change forbids negative `weight` in Function Score query. Negative scores are forbidden in Lucene 8.	2018-09-12 09:16:40 +02:00
Jim Ferenczi	4561c5ee83	Clarify context suggestions filtering and boosting (#33601 ) This change clarifies the documentation of the context completion suggester regarding filtering and boosting with contexts. Unlike the suggester v1, filtering on multiple contexts works as a disjunction, a suggestion matches if it contains at least one of the provided context values and boosting selects the maximum score among the matching contexts. This commit also adapts an old test that was written for the v1 suggester and commented out for version 2 because the behavior changed.	2018-09-12 08:47:32 +02:00
Jason Tedor	c74c46edc3	Upgrade remote cluster settings (#33537 ) This commit adds settings upgraders for the search.remote.* settings that can be in the cluster state to automatically upgrade these settings to cluster.remote.*. Because of the infrastructure that we have here, these settings can be upgraded when recovering the cluster state, but also when a user tries to make a dynamic update for these settings.	2018-09-12 01:14:43 -04:00
Armin Braun	94cdf0ceba	NETWORKING: http.publish_host Should Contain CNAME (#32806 ) * NETWORKING: http.publish_host Should Contain CNAME * Closes #22029	2018-09-12 06:15:36 +02:00
Jason Tedor	9752540866	Add test coverage for global checkpoint listeners This commit adds test coverage for two cases not previously covered by the existing testing. Namely, we add coverage ensuring that the executor is used to notify listeners being added that are immediately notified because the shard is closed or because the global checkpoint is already beyond what the listener knows.	2018-09-11 23:19:27 -04:00
Nhat Nguyen	743327efc2	Reset replica engine to global checkpoint on promotion (#33473 ) When a replica starts following a newly promoted primary, it may have some operations which don't exist on the new primary. Thus we need to throw those operations to align a replica with the new primary. This can be done by first resetting an engine from the safe commit, then replaying the local translog up to the global checkpoint. Relates #32867	2018-09-11 22:09:37 -04:00
Nhat Nguyen	1e577d3ce8	Mute testIndexDeletionWhenNodeRejoins Tracked at #33613	2018-09-11 16:23:12 -04:00
Colin Goodheart-Smithe	624b84f897	Improves doc values format deprecation message (#33576 ) * Improves doc values format deprecation message This changes the deprecation message when doc values fields do not supply a format form logging a deprecation warning for each offending field individually to logging a single message which lists all offending fields Closes #33572 * Updates YAML test with new deprecation message Also adds a test to ensure multiple deprecation warnings are collated into one message * Condenses collection of fields without format check Moves the collection of fields that don't have a format to a separate loop and moves the logging of the deprecation warning to be next to it at the expesnse of looping through the field list twice * fixes typo * Fixes test	2018-09-11 14:32:43 +01:00
Alan Woodward	36bdad4895	Use IndexWriter.getFlushingBytes() rather than tracking it ourselves (#33582 ) Currently we keep track of how many bytes are currently being written to disk in an AtomicLong within InternalEngine, updating it on refresh. The IndexWriter has its own accounting for this, and exposes it via a getFlushingBytes method in the latest lucene 8 snapshot. This commit removes the InternalEngine tracking in favour of just using the IndexWriter method.	2018-09-11 13:38:44 +01:00
Jason Tedor	ad4b5e4270	Fix upgrading of list settings (#33589 ) Upgrading list settings is broken because of the conversion that we do to strings, and then when we try to put back the upgraded value we do not know that it is a representation of a list. This commit addresses this by adding special handling for list settings.	2018-09-11 08:35:42 -04:00
Simon Willnauer	517cfc3cc0	Add read-only Engine (#33563 ) This change adds an engine implementation that opens a reader on an existing index but doesn't permit any refreshes or modifications to the index. Relates to #32867 Relates to #32844	2018-09-11 14:05:14 +02:00
Armin Braun	6075e159e5	Validate list values for settings (#33503 ) When we see a settings value, it could be a list. Yet this should only happen if the underlying setting type is a list setting type. This commit adds validation that when we get a setting value that is a list, that the setting that we are getting is a list setting. And similarly, if we get a value for a list setting, the underlying value should be a list.	2018-09-10 19:24:17 -04:00
Nhat Nguyen	624b6bb487	Copy and validatie soft-deletes setting on resize (#33517 ) This change copies and validates the soft-deletes setting during resize. If the source enables soft-deletes, the target must also enable it. Closes #33321	2018-09-10 17:38:58 -04:00
Colin Goodheart-Smithe	cdc4f57a77	Merge branch 'master' into index-lifecycle	2018-09-10 21:30:44 +01:00
Alan Woodward	39c3234c2f	Upgrade to latest Lucene snapshot (#33505 ) * LeafCollector.setScorer() now takes a Scorable * Scorers may not have null Weights * IndexWriter.getFlushingBytes() reports how much memory is being used by IW threads writing to disk	2018-09-10 20:51:55 +01:00
Armin Braun	9a2c77d1c3	MINOR: Remove Dead Code in SearchScript (#33569 ) * `lookup` is not used anywhere * `getLeafContext` is not used anywhere	2018-09-10 18:56:21 +02:00
Tanguy Leroux	079d130d8c	[Test] Remove duplicate method in TestShardRouting (#32815 )	2018-09-10 18:29:00 +02:00
David Turner	284c45a6ff	Strengthen FilterRoutingTests (#33149 ) Today the FilterRoutingTests take the belt-and-braces approach of excluding some node attribute values and including some others. This means that we don't really test that both inclusion and exclusion work correctly: as long as one of them works as expected then the test will pass. This change improves these tests by only using one approach at once, demonstrating that both do indeed work, and adds tests for various other scenarios too.	2018-09-10 11:23:05 +02:00
Nhat Nguyen	e6ca55bca6	Adjust bwc for stale primary recovery source (#33432 ) Relates #33432	2018-09-09 21:34:32 -04:00
Jason Tedor	6bb817004b	Add infrastructure to upgrade settings (#33536 ) In some cases we want to deprecate a setting, and then automatically upgrade uses of that setting to a replacement setting. This commit adds infrastructure for this so that we can upgrade settings when recovering the cluster state, as well as when such settings are dynamically applied on cluster update settings requests. This commit only focuses on cluster settings, index settings can build on this infrastructure in a follow-up.	2018-09-09 20:49:19 -04:00
Armin Braun	d4b212c4c9	CORE: Make Pattern Exclusion Work with Aliases (#33518 ) * CORE: Make Pattern Exclusion Work with Aliases * Adds the pattern exclusion logic to finding aliases * Closes #33395	2018-09-09 17:31:02 +02:00
S.Y. Wang	9073dbefd6	HLRC: Add put stored script support to high-level rest client (#31323 ) Relates to #27205	2018-09-09 13:47:47 +02:00
Nhat Nguyen	94e4cb64c2	Bootstrap a new history_uuid when force allocating a stale primary (#33432 ) This commit ensures that we bootstrap a new history_uuid when force allocating a stale primary. A stale primary should never be the source of an operation-based recovery to another shard which exists before the forced-allocation. Closes #26712	2018-09-08 19:29:31 -04:00
Armin Braun	f27c3dcf88	INGEST: Remove Outdated TODOs (#33458 ) * CompoundProcessor is in the ingest package now -> resolved * Java generics don't offer type checking so nothing can be done here -> remvoed TODO and test * #16019 was closed and not acted on -> todo can go away	2018-09-08 10:18:45 +02:00
Jason Tedor	9a404f3def	Include fallback settings when checking dependencies (#33522 ) Today when checking settings dependencies, we do not check if fallback settings are present. This means, for example, that if cluster.remote..seeds falls back to search.remote..seeds, and cluster.remote..skip_unavailable and search.remote..skip_unavailable depend on cluster.remote..seeds, and we have set search.remote..seeds and search.remote..skip_unavailable, then validation will fail because it is expected that cluster.ermote..seeds is set here. This commit addresses this by also checking fallback settings when validating dependencies. To do this, we adjust the settings exist method to also check for fallback settings, a case that it was not handling previously.	2018-09-07 20:09:53 -04:00
Nik Everett	190ea9a6de	Logging: Configure the node name when we have it (#32983 ) Change the logging infrastructure to handle when the node name isn't available in `elasticsearch.yml`. In that case the node name is not available until long after logging is configured. The biggest change is that the node name logging no longer fixed at pattern build time. Instead it is read from a `SetOnce` on every print. If it is unset it is printed as `unknown` so we have something that fits in the pattern. On normal startup we don't log anything until the node name is available so we never see the `unknown`s.	2018-09-07 14:31:23 -04:00
Nhat Nguyen	ab7e696108	TEST: Ensure merge triggered in _source retention test (#33487 ) We invoke force merge twice in the test to verify that recovery sources are pruned when the global checkpoint advanced. However, if the global checkpoint equals to the local checkpoint in the first force-merge, the second force-merge will be a noop because all deleted docs are expunged in the first merge already. We need to flush a new segment to make merge happen so we can verify that all recovery sources are pruned.	2018-09-07 12:58:00 -04:00
Simon Willnauer	c12d232215	Pass Directory instead of DirectoryService to Store (#33466 ) Instead of passing DirectoryService which causes yet another dependency on Store we can just pass in a Directory since we will just call `DirectoryService#newDirectory()` on it anyway.	2018-09-07 14:00:24 +02:00
Colin Goodheart-Smithe	017ffe5d12	Merge branch 'master' into index-lifecycle	2018-09-07 10:59:10 +01:00
Jim Ferenczi	79cd6385fe	Collapse package structure for metrics aggs (#33463 ) This change collapses all metrics aggregations classes into a single package `org.elasticsearch.aggregations.metrics`. It also restricts the visibility of some classes (aggregators and factories) that should not be used outside of the package. Relates #22868	2018-09-07 10:58:06 +02:00
Jim Ferenczi	34859414a0	Fix bwc serialization of total hits when track_total_hits is false	2018-09-07 10:30:53 +02:00
Nik Everett	0d45752e50	Fix IndexMetaData loads after rollover (#33394 ) When we rollover and index we write the conditions of the rollover that the old index met into the old index. Loading this index metadata requires a working `NamedXContentRegistry` that has been populated with parsers from the rollover infrastructure. We had a few loads that didn't use a working `NamedXContentRegistry` and so would fail if they ever encountered an index that had been rolled over. Here are the locations of the loads and how I fixed them: * IndexFolderUpgrader - removed entirely. It existed to support opening indices made in Elasticsearch 2.x. Since we only need this change as far back as 6.4.1 which will supports reading from indices created as far back as 5.0.0 we should be good here. * TransportNodesListGatewayStartedShards - wired the `NamedXContentRegistry` into place. * TransportNodesListShardStoreMetaData - wired the `NamedXContentRegistry` into place. * OldIndexUtils - removed entirely. It existed to support the zip based index backwards compatibility tests which we've since replaced with code that actually runs old versions of Elasticsearch. In addition to fixing the actual problem I added full cluster restart integration tests for rollover which would have caught this problem and I added an extra assertion to IndexMetaData's deserialization code which will trip if we try to deserialize and index's metadata without a fully formed `NamedXContentRegistry`. It won't catch if use the wrong `NamedXContentRegistry` but it is better than nothing. Closes #33316	2018-09-06 17:55:24 -04:00
Simon Willnauer	c6c456e8cb	Move up acquireSearcher logic to Engine (#33453 ) By moving the logic to acquire the searcher up to the engine it's simpler to build new engines that are for instance read-only.	2018-09-06 18:48:05 +02:00
Nhat Nguyen	8afe09a749	Pass TranslogRecoveryRunner to engine from outside (#33449 ) This commit allows us to use different TranslogRecoveryRunner when recovering an engine from its local translog. This change is a prerequisite for the commit-based rollback PR. Relates #32867	2018-09-06 11:59:16 -04:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Alan Woodward	e134f9b5f3	Fix generics in ScriptPlugin#getContexts() (#33426 ) Changes the return value from List<ScriptContext> to List<ScriptContext<?>> to remove raw-types warnings.	2018-09-06 09:04:22 +01:00
Alexander Reelsen	82fab40099	Core: Fix IndicesSegmentResponse.toXcontent() serialization (#33414 ) When index sorting is enabled, toXContent tried to serialize an SortField object, resulting in an exception, when using the _segments endpoint. Relates #29120	2018-09-06 09:56:20 +02:00
Daniel Mitterdorfer	5236f2b1af	Improve reproducability of RestControllerTests With this commit we use the classic parent circuit breaker which does not account for real memory usage. In those tests we want to have reproducible results and hence it makes sense to disable the real memory circuit breaker there.	2018-09-06 09:44:05 +02:00
Colin Goodheart-Smithe	b1257d873b	Merge branch 'master' into index-lifecycle	2018-09-06 08:17:40 +01:00
Martijn van Groningen	a721d09c81	[CCR] Added auto follow patterns feature (#33118 ) Auto Following Patterns is a cross cluster replication feature that keeps track whether in the leader cluster indices are being created with names that match with a specific pattern and if so automatically let the follower cluster follow these newly created indices. This change adds an `AutoFollowCoordinator` component that is only active on the elected master node. Periodically this component checks the the cluster state of remote clusters if there new leader indices that match with configured auto follow patterns that have been defined in `AutoFollowMetadata` custom metadata. This change also adds two new APIs to manage auto follow patterns. A put auto follow pattern api: ``` PUT /_ccr/_autofollow/{{remote_cluster}} { "leader_index_pattern": ["logs-*", ...], "follow_index_pattern": "{{leader_index}}-copy", "max_concurrent_read_batches": 2 ... // other optional parameters } ``` and delete auto follow pattern api: ``` DELETE /_ccr/_autofollow/{{remote_cluster_alias}} ``` The auto follow patterns are directly tied to the remote cluster aliases configured in the follow cluster. Relates to #33007 Co-authored-by: Jason Tedor jason@tedor.me	2018-09-06 08:01:58 +02:00
Jason Tedor	d71ced1b00	Generalize search.remote settings to cluster.remote (#33413 ) With features like CCR building on the CCS infrastructure, the settings prefix search.remote makes less sense as the namespace for these remote cluster settings than does a more general namespace like cluster.remote. This commit replaces these settings with cluster.remote with a fallback to the deprecated settings search.remote.	2018-09-05 20:43:44 -04:00
Nhat Nguyen	39e3bd93c7	TEST: Create following engines in the main thread (#33391 ) There are two races in the testUpdateAndReadChangesConcurrently if the following engines are created in the worker threads. We fixed the translog issue in #33352, but there is still another race with createStore. This commit ensures that we create all engines in the main thread. Relates #33352 Closes #33344	2018-09-05 19:05:41 -04:00
Nhat Nguyen	41839cf9a8	Acquire seacher on closing engine should throw ACE (#33331 ) Closes #33330	2018-09-05 19:03:34 -04:00
Tim Brooks	b697f485bb	Introduce `TransportLogger` for common logging (#32725 ) Historically we have had a ESLoggingHandler in the netty module that logs low-level connection operations. This class just extends the netty logging handler with some (broken) message deserialization. This commit fixes this message serialization and moves the class to server. This new logger logs inbound and outbound messages. Eventually, we should move other event logging to this class (connect, close, flush). That way we will have consistent logging regards of which transport is loaded. Resolves #27306 on master. Older branches will need a different fix.	2018-09-05 16:12:37 -06:00
Tim Brooks	88c178dca6	Add sni name to SSLEngine in netty transport (#33144 ) This commit is related to #32517. It allows an "server_name" attribute on a DiscoveryNode to be propagated to the server using the TLS SNI extentsion. This functionality is only implemented for the netty security transport.	2018-09-05 16:12:10 -06:00
Armin Braun	ef1066d7f8	INGEST: Allow Repeated Invocation of Pipeline (#33419 ) * Allows repeated, non-recursive invocation of the same pipeline	2018-09-05 22:04:53 +02:00
Tal Levy	b5f7fb6882	Merge branch 'master' into index-lifecycle	2018-09-05 12:56:58 -07:00
Jim Ferenczi	50e07dd413	Add an index setting to control TieredMergePolicy#deletesPctAllowed (#32907 ) This change adds an expert index setting called `index.merge.policy.deletes_pct_allowed`. It controls the maximum percentage of deleted documents that is tolerated in the index. Lower values make the index more space efficient at the expense of increased CPU and I/O activity. Values must be between `20` and `50`. Default value is `33`.	2018-09-05 19:57:36 +02:00
Nik Everett	5c624bc55b	Logging: Further clean up logging ctors (#33378 ) Drops and unused logging constructor, simplifies a rarely used one, and removes `Settings` from a third. There is now only a single logging ctor that takes `Settings` and we'll remove that one in a follow up change.	2018-09-05 13:04:26 -04:00
Adrien Grand	46ac8d1a51	Make test less GC-intensive.	2018-09-05 18:59:43 +02:00
Christoph Büscher	eafc2a5470	Don't count metadata fields towards index.mapping.total_fields.limit (#33386 ) The maximum number of fields per index is limited to 1000 by default by the `index.mapping.total_fields.limit` setting to prevent accidental mapping explosions due to too many fields. Currently all metadata fields also count towards this limit, which can lead to some confusion when using lower limits. It is not obvious for users that they cannot actually add as many fields as are specified by the limit in this case. This change takes the number of metadata fields out of the field count that we check against the field limit. It also adds tests that check that we can add fields up to the specified limit, but throw an exception for any additional field added. Closes #24096	2018-09-05 18:27:21 +02:00
Jason Tedor	23934e39d2	Fix deprecated setting specializations (#33412 ) Deprecating a some setting specializations (e.g., list settings) does not cause deprecation warning headers and deprecation log messages to appear. This is due to a missed check for deprecation. This commit fixes this for all setting specializations, and ensures that this can not be missed again.	2018-09-05 11:01:58 -04:00
Adrien Grand	913d5fd820	Disable IndexRecoveryIT.testRerouteRecovery. Relates #32686.	2018-09-05 14:53:22 +02:00
Armin Braun	46774098d9	INGEST: Implement Drop Processor (#32278 ) * INGEST: Implement Drop Processor * Adjust Processor API * Implement Drop Processor * Closes #23726	2018-09-05 14:25:29 +02:00
Paul Sanwald	c303006e6b	Add interval response parameter to AutoDateInterval histogram (#33254 ) Adds the interval used to the aggregation response.	2018-09-05 07:35:59 -04:00
Armin Braun	4156cc3fae	MINOR+CORE: Remove Dead Methods ClusterService (#33346 ) * None of these methods are used anywhere	2018-09-05 12:08:28 +02:00
Colin Goodheart-Smithe	f00a28a909	Merge branch 'master' into index-lifecycle	2018-09-05 09:48:48 +01:00
Gordon Brown	cfd3fa72ed	Add user-defined cluster metadata (#33325 ) Adds a place for users to store cluster-wide data they wish to associate with the cluster via the Cluster Settings API. This is strictly for user-defined data, Elasticsearch makes no other other use of these settings.	2018-09-04 16:14:18 -06:00
Jim Ferenczi	dbc7102c86	Fix inner hits retrieval when stored fields are disabled (_none_) (#33018 ) Now that types are unique per mapping we can retrieve the document mapper without referencing the type. This fixes an NPE when stored fields are disabled. For 6x we'll need a different fix since mappings can still have multiple types. Relates #32941	2018-09-04 16:25:52 +02:00
Sohaib Iftikhar	761e8c461f	HLRC: Add delete by query API (#32782 ) Adds the delete-by-query API to the High Level REST Client.	2018-09-04 08:56:26 -04:00
Colin Goodheart-Smithe	92ab442aee	Merge branch 'master' into index-lifecycle	2018-09-04 10:34:49 +01:00
Julie Tibshirani	78df00ff24	Simplify the return type of FieldMapper#parse. (#32654 )	2018-09-04 01:15:19 +00:00
Jason Tedor	09bf4e5f00	Introduce private settings (#33327 ) This commit introduces the formal notion of a private setting. This enables us to register some settings that we had previously not registered as fully-fledged settings to avoid them being exposed via APIs such as the create index API. For example, we had hacks in the codebase to allow index.version.created to be passed around inside of settings objects, but was not registered as a setting so that if a user tried to use the setting on any API then they would get an exception. This prevented users from setting index.version.created on index creation, or updating it via the index settings API. By introducing private settings, we can continue to reject these attempts, yet now we can represent these settings as actual settings. In this change, we register index.version.created as an actual setting. We do not cutover all settings that we had been treating as private in this pull request, it is already quite large due to moving some tests around to account for the fact that some tests need to be able to set the index.version.created. This can be done in a follow-up change.	2018-09-03 19:17:57 -04:00
Armin Braun	1f046617bf	TESTS: Fix Race Condition in Temp Path Creation (#33352 ) * TESTS: Fix Race Condition in Temp Path Creation * Calling `createTempDir` concurrently here in the `Follower`s causes collisions at times which lead to `createEngine` throwing because of unexpected files in the newly created temp dir * Fixed by creating all temp dirs in the main test thread * closes #33344	2018-09-03 19:55:59 +02:00
Nhat Nguyen	24d60c7f4b	Fix from_range in search_after in changes snapshot (#33335 ) We can have multiple documents in Lucene with the same seq_no for parent-child documents (or without rollback). In this case, the usage "lastSeenSeqNo + 1" is an off-by-one error as it may miss some documents. This error merely affects the `skippedOperations` contract. See: https://github.com/elastic/elasticsearch/pull/33222#discussion_r213842257 Closes #33318	2018-09-03 11:58:49 -04:00
Armin Braun	42424aff21	TESTS+DISTR.: Fix testIndexCheckOnStartup Flake (#33349 ) * Ignore all `RuntimeException` since random file corruption triggers other RTE in addition to the randomly caught one * closes #33345	2018-09-03 17:06:12 +02:00
tony-dillon	a9d2b1dde8	Null completion field should not throw IAE (#33268 ) Ignore null value on the completion field Closes #33200	2018-09-03 16:49:53 +02:00
Colin Goodheart-Smithe	0bf36253a9	Adds code to help with IndicesRequestCacheIT failures (#33313 ) * Adds code to help with IndicesRequestCacheIT failures Relates to #32827 * Adds comment * Fixes test failure	2018-09-03 14:54:17 +01:00
Alexander Reelsen	246a7df8c2	Core: Fix epoch millis java time formatter (#33302 ) The existing implemention could not deal with negative numbers as well as +- 999 milliseconds around the epoch. This commit uses Instant.ofEpochMilli() and parses the input to a number instead of using a date formatter.	2018-09-03 13:13:19 +02:00
Colin Goodheart-Smithe	e2c1beb1be	Merge branch 'master' into index-lifecycle	2018-09-03 10:01:16 +01:00
Jim Ferenczi	9310d2eaf3	[CI] Mute IndexShardTests#testIndexCheckOnStartup fails #33345	2018-09-03 10:27:42 +02:00
Jim Ferenczi	2fa75b4438	[CI] Mute LuceneChangesSnapshotTests#testUpdateAndReadChangesConcurrently	2018-09-03 10:14:00 +02:00
Jim Ferenczi	713c07e14d	Add early termination support to BucketCollector (#33279 ) This commit adds the support to early terminate the collection of a leaf in the aggregation framework. This change introduces a MultiBucketCollector which handles CollectionTerminatedException exactly like the Lucene MultiCollector. Any aggregator can now throw a CollectionTerminatedException without stopping the collection of a sibling aggregator. This is useful for aggregators that can infer their result without visiting all documents (e.g.: a min/max aggregation on a match_all query).	2018-09-03 09:34:35 +02:00
Nik Everett	f8b7a4dbc8	Logging: Drop Settings from some logging ctors (#33332 ) Drops `Settings` from some logging ctors now that they are no longer needed. This should allow us to stop passing `Settings` around to quite as many places.	2018-09-02 16:51:26 -04:00
Jason Tedor	ea4eef8641	Merge branch 'master' into ccr * master: HLREST: add update by query API (#32760)	2018-09-02 16:07:50 -04:00
Sohaib Iftikhar	389bf67275	HLREST: add update by query API (#32760 ) Adds update by query to the high level rest client.	2018-09-02 15:15:00 -04:00
Nhat Nguyen	3197a6bbdd	Merge branch 'master' into ccr * master: HLRC: ML Flush job (#33187) HLRC: Adding ML Job stats (#33183) LLREST: Drop deprecated methods (#33223) Mute testSyncerOnClosingShard [DOCS] Moves machine learning APIs to docs folder (#31118)	2018-09-02 09:30:51 -04:00
Nhat Nguyen	ce635f5f15	Mute testSyncerOnClosingShard Tracked at #33330	2018-09-01 09:53:31 -04:00
Nhat Nguyen	b93507608a	Merge branch 'master' into ccr * master: Mute test watcher usage stats output [Rollup] Fix FullClusterRestart test Adjust soft-deletes version after backport into 6.5 completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194) Fix AwaitsFix issue number Mute SmokeTestWatcherWithSecurityIT testsi drop `index.shard.check_on_startup: fix` (#32279) tracked at [DOCS] Moves ml folder from x-pack/docs to docs (#33248) [DOCS] Move rollup APIs to docs (#31450) [DOCS] Rename X-Pack Commands section (#33005) TEST: Disable soft-deletes in ParentChildTestCase Fixes SecurityIntegTestCase so it always adds at least one alias (#33296) Fix pom for build-tools (#33300) Lazy evaluate java9home (#33301) SQL: test coverage for JdbcResultSet (#32813) Work around to be able to generate eclipse projects (#33295) Highlight that index_phrases only works if no slop is used (#33303) Different handling for security specific errors in the CLI. Fix for https://github.com/elastic/elasticsearch/issues/33230 (#33255) [ML] Refactor delimited file structure detection (#33233) SQL: Support multi-index format as table identifier (#33278) MINOR: Remove Dead Code from PathTrie (#33280) Enable forbiddenapis server java9 (#33245)	2018-08-31 19:03:04 -04:00
Nhat Nguyen	08b9247ce2	Adjust soft-deletes version after backport into 6.5 Relates #33222	2018-08-31 16:50:08 -04:00
Vladimir Dolzhenko	00b272af32	completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194 ) Relates to #32279	2018-08-31 22:08:28 +02:00
Vladimir Dolzhenko	3d82a30fad	drop `index.shard.check_on_startup: fix` (#32279 ) drop `index.shard.check_on_startup: fix` Relates #31389	2018-08-31 21:29:06 +02:00
Colin Goodheart-Smithe	3eef74d5d5	Merge branch 'master' into index-lifecycle	2018-08-31 14:45:22 +01:00
Armin Braun	c6cfa08a61	MINOR: Remove Dead Code from PathTrie (#33280 ) * The array size checks are redundant since the array sizes are checked earlier in those methods too * The removed methods are just not used anywhere	2018-08-31 08:40:27 +02:00
Alpar Torok	44ed5f6306	Enable forbiddenapis server java9 (#33245 )	2018-08-31 09:31:55 +03:00
Nhat Nguyen	ad4dd086d2	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand <jpountz@gmail.com> Co-authored-by: Boaz Leskes <b.leskes@gmail.com> Co-authored-by: Jason Tedor <jason@tedor.me> Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co> Co-authored-by: Simon Willnauer <simonw@apache.org>	2018-08-30 23:46:07 -04:00
Nhat Nguyen	547de71d59	Revert "Integrates soft-deletes into Elasticsearch (#33222 )" Revert to correct co-author tags. This reverts commit `6dd0aa54f6`.	2018-08-30 23:44:57 -04:00
Nhat Nguyen	d3f32273eb	Merge branch 'master' into ccr	2018-08-30 23:22:58 -04:00
Nhat Nguyen	6dd0aa54f6	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand jpountz@gmail.com Co-authored-by: Boaz Leskes b.leskes@gmail.com Co-authored-by: Jason Tedor jason@tedor.me Co-authored-by: Martijn van Groningen martijn.v.groningen@gmail.com Co-authored-by: Nhat Nguyen nhat.nguyen@elastic.co Co-authored-by: Simon Willnauer simonw@apache.org	2018-08-30 22:11:23 -04:00
Tal Levy	13a0d822d0	Merge branch 'master' into index-lifecycle	2018-08-30 15:04:17 -07:00
Lee Hinman	8a2d154bad	Update serialization versions for custom IndexMetaData backport	2018-08-30 15:56:53 -06:00
Igor Motov	001b78f704	Replace IndexMetaData.Custom with Map-based custom metadata (#32749 ) This PR removes the deprecated `Custom` class in `IndexMetaData`, in favor of a `Map<String, DiffableStringMap>` that is used to store custom index metadata. As part of this, there is now no way to set this metadata in a template or create index request (since it's only set by plugins, or dedicated REST endpoints). The `Map<String, DiffableStringMap>` is intended to be a namespaced `Map<String, String>` (`DiffableStringMap` implements `Map<String, String>`, so the signature is more like `Map<String, Map<String, String>>`). This is so we can do things like: ``` java Map<String, String> ccrMeta = indexMetaData.getCustom("ccr"); ``` And then have complete control over the metadata. This also means any plugin/feature that uses this has to manage its own BWC, as the map is just serialized as a map. It also means that if metadata is put in the map that isn't used (for instance, if a plugin were removed), it causes no failures the way an unregistered `Setting` would. The reason I use a custom `DiffableStringMap` here rather than a plain `Map<String, String>` is so the map can be diffed with previous cluster state updates for serialization. Supersedes #32683	2018-08-30 13:57:00 -06:00
Simon Willnauer	af2eaf2a6c	Remove usage of `index.shrink.source.` in 7.x (#33271 ) We cut over to `index.resize.source.` but still have these constants being public in `IndexMetaData`. Those Settings and constants are not needed in 7.x while we still need to keep the keys known to private settings since they might be part of the index settings of old indices. We can remove that in 8.0. Yet, we should remove the settings to make sure they are not used again.	2018-08-30 21:08:35 +02:00
Jim Ferenczi	d0630093cd	Fix serialization of empty field capabilities response (#33263 ) Fix serialization of empty field capabilities response When no response are required (no indices match the requested patterns) the empty response throws an NPE in the transport serialization (writeTo).	2018-08-30 18:07:58 +02:00
Jim Ferenczi	1404dd2a42	Fix nested _source retrieval with includes/excludes (#33180 ) If an exclude or an include clause removes an entry to a nested field in the original source at query time, the creation of nested hits fails with an NPE. This change fixes this exception and replaces the nested document source with an empty map. Closes #33163 Closes #33170	2018-08-30 15:15:50 +02:00
Nhat Nguyen	13261996ce	Add NoOps to Lucene for failed delete ops (#33217 ) Today we add a NoOp to Lucene and translog if we fail to process an indexing operation. However, we are only adding NoOps to translog for delete operations. In order to have a complete history in Lucene, we should add NoOps of failed delete operations to both Lucene and translog. Relates #29530	2018-08-30 07:55:13 -04:00
David Turner	47859e56ac	Move file-based discovery to core (#33241 ) Today we support a static list of seed hosts in core Elasticsearch, and allow a dynamic list of seed hosts to be provided via a file using the `discovery-file` plugin. In fact the ability to provide a dynamic list of seed hosts is increasingly useful, so this change moves this functionality to core Elasticsearch to avoid the need for a plugin. Furthermore, in order to start up nodes in integration tests we currently assign a known port to each node before startup, which unfortunately sometimes fails if another process grabs the selected port in the meantime. By moving the `discovery-file` functionality into the core product we can use it to avoid this race. This change also moves the expected path to the file from `$ES_PATH_CONF/discovery-file/unicast_hosts.txt` to `$ES_PATH_CONF/unicast_hosts.txt`. An example of this file is not included in distributions. For BWC purposes the plugin still exists, but does nothing more than create the example file in the old location, and issue a warning when it is used. We also continue to support the old location for the file, but warn about its deprecation. Relates #29244 Closes #33030	2018-08-30 06:43:04 +01:00
Armin Braun	cc4d7059bf	Ingest: Add conditional per processor (#32398 ) * Ingest: Add conditional per processor * closes #21248	2018-08-30 03:46:39 +02:00
Jason Tedor	0f22dbb1cc	Apply settings filter to get cluster settings API (#33247 ) Some settings have filters applied to them and we use this in logs and the get nodes info API. For consistency, we should apply this in the get cluster settings API too.	2018-08-29 15:56:13 -04:00
Nhat Nguyen	5632e31c74	Merge branch 'master' into ccr * master: Painless: Add Bindings (#33042) Update version after client credentials backport Fix forbidden apis on FIPS (#33202) Remote 6.x transport BWC Layer for `_shrink` (#33236) Test fix - Graph HLRC tests needed another field adding to randomisation exception list HLRC: Add ML Get Records API (#33085) [ML] Fix character set finder bug with unencodable charsets (#33234) TESTS: Fix overly long lines (#33240) Test fix - Graph HLRC test was missing field name to be excluded from randomisation logic Remove unsupported group_shard_failures parameter (#33208) Update BucketUtils#suggestShardSideQueueSize signature (#33210) Parse PEM Key files leniantly (#33173) INGEST: Add Pipeline Processor (#32473) Core: Add java time xcontent serializers (#33120) Consider multi release jars when running third party audit (#33206) Update MSI documentation (#31950) HLRC: create base timed request class (#33216) [DOCS] Fixes command page titles HLRC: Move ML protocol classes into client ml package (#33203) Scroll queries asking for rescore are considered invalid (#32918) Painless: Fix Semicolon Regression (#33212) ingest: minor - update test to include dissect (#33211) Switch remaining LLREST usage to new style Requests (#33171) HLREST: add reindex API (#32679)	2018-08-29 12:30:24 -04:00
Simon Willnauer	6a0d4b4a77	Remote 6.x transport BWC Layer for `_shrink` (#33236 ) The shrink action was renamed to `_resize` with the addition or split. This bwc layer is unnecessary on 7.x since 6.latest will always use the resize action.	2018-08-29 16:43:13 +02:00
Gordon Brown	454ce99b01	Merge branch 'master' into index-lifecycle	2018-08-29 08:28:23 -06:00
Luca Cavanna	49109187e2	Remove unsupported group_shard_failures parameter (#33208 ) We have had support for the `group_shard_failures` parameter in our code for a while, since we introduced failures grouping. When we introduced validation of parameters at REST, we seem to have forgotten to expose such parameter. Given that the parameter is effectively not supported for many months now, that no user has complained about that and that grouping is the expected behaviour, this commit removes support for the parameter.	2018-08-29 14:05:41 +02:00
Luca Cavanna	034fdbca28	Update BucketUtils#suggestShardSideQueueSize signature (#33210 ) `BucketUtils#suggestShardSideQueueSize` used to calculate the shard_size based on the number of shards. It returns now a different value only based on whether we are querying a single shard or multiple shards. This commit replaces the numberOfShards argument with a boolean that tells whether we are querying a single shard or not.	2018-08-29 13:51:54 +02:00
Armin Braun	f690b492e7	INGEST: Add Pipeline Processor (#32473 ) * INGEST: Add Pipeline Processor * Adds Processor capable of invoking other pipelines * Closes #31842	2018-08-29 11:03:10 +02:00
Alexander Reelsen	48b388ce82	Core: Add java time xcontent serializers (#33120 ) This ensures that the java time class exposed by painless have proper serialization/string representations. Closes #31853	2018-08-29 10:00:16 +02:00
Alpar Torok	f29f0af7bc	Consider multi release jars when running third party audit (#33206 ) Exclude classes meant for newer versions than what we are auditing against, those classes won't be found. There's no reason to exclude JDK classes from newer versions, with this PR, we will not extract them in the first place.	2018-08-29 09:53:04 +03:00
Mark Tozzi	84b61d0738	Scroll queries asking for rescore are considered invalid (#32918 ) This PR changes our behavior from silently ignoring rescore in a scroll query to instead report to the user that such a query is invalid. Closes #31775	2018-08-28 15:48:23 -04:00
Nhat Nguyen	c42dc77896	Merge branch 'master' into ccr * master: [Rollup] Better error message when trying to set non-rollup index (#32965) HLRC: Use Optional in validation logic (#33104) Remove unused User class from protocol (#33137) ingest: Introduce the dissect processor (#32884) [Docs] Add link to es-kotlin-wrapper-client (#32618) [Docs] Remove repeating words (#33087) Minor spelling and grammar fix (#32931) Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979) Watcher: Simplify finding next date in cron schedule (#33015) Run Third party audit with forbidden APIs CLI (part3/3) (#33052) Fix plugin build test on Windows (#33078) HLRC+MINOR: Remove Unused Private Method (#33165) Remove old unused test script files (#32970) Build analysis-icu client JAR (#33184) Ensure to generate identical NoOp for the same failure (#33141) ShardSearchFailure#readFrom to set index and shardId (#33161)	2018-08-28 13:56:38 -04:00
Sohaib Iftikhar	7f5e29ddb2	HLREST: add reindex API (#32679 ) Adds the reindex API to the high level REST client.	2018-08-28 13:02:23 -04:00
Nhat Nguyen	e39689a198	Send only ops after checkpoint in file-based recovery with soft-deletes (#33190 ) Today a file-based recovery will replay all existing translog operations from the primary on a replica so that that replica can have a full history in translog as the primary. However, with soft-deletes enabled, we should not do it because: 1. All operations before the local checkpoint of the safe commit exist in the commit already. 2. The number of operations before the local checkpoint may be considerable and requires a significant amount of time to replay on a replica. Relates #30522 Relates #29530	2018-08-28 12:32:09 -04:00
Nhat Nguyen	e2b931e80b	Use Lucene history in primary-replica resync (#33178 ) This commit makes primary-replica resyncer use Lucene as the source of history operation instead of translog if soft-deletes is enabled. With this change, we no longer expose translog snapshot directly in IndexShard. Relates #29530	2018-08-28 10:44:15 -04:00
Nhat Nguyen	d8a1b7cb17	Make soft-deletes settings final (#33172 ) For now, we do not support changing the soft-deletes setting even with closed indices. Therefore we should make it a final setting. Relates #29530	2018-08-28 08:48:42 -04:00
Jonathan Little	9d92a87ae6	Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979 )	2018-08-28 09:27:43 +01:00
Alpar Torok	2cc611604f	Run Third party audit with forbidden APIs CLI (part3/3) (#33052 ) The new implementation is functional equivalent with the old, ant based one. It parses task standard error to get the missing classes and violations in the same way. I considered re-using ForbiddenApisCliTask but Gradle makes it hard to build inheritance with tasks that have task actions , since the order of the task actions can't be controlled. This inheritance isn't dully desired either as the third party audit task is much more opinionated and we don't want to expose some of the configuration. We could probably extract a common base class without any task actions, but probably more trouble than it's worth. Closes #31715	2018-08-28 10:03:30 +03:00
Gordon Brown	50368656ee	Merge branch 'master' into index-lifecycle	2018-08-27 15:35:19 -06:00
Nhat Nguyen	014b3236dc	Ensure to generate identical NoOp for the same failure (#33141 ) We generate slightly different NoOps in InternalEngine and TransportShardBulkAction for the same failure. 1. InternalEngine uses Exception#getFailure to generate a message without the class name: newOp [NoOp{seqNo=1, primaryTerm=1, reason='Contexts are mandatory in context enabled completion field [suggest_context]'}]. 2. TransportShardBulkAction uses Exception#toString to generate a message with the class name: NoOp{seqNo=1, primaryTerm=1, reason='java.lang.IllegalArgumentException: Contexts are mandatory in context enabled completion field [suggest_context]'}. If a write operation fails while a replica is recovering, that replica will possibly receive two different NoOps: one from recovery and one from replication. These two different NoOps will trip TranslogWriter#assertNoSeqNumberConflict assertion. This commit ensures that we generate the same Noop for the same failure. Closes #32986	2018-08-27 15:59:42 -04:00
Luca Cavanna	ed0571e16c	ShardSearchFailure#readFrom to set index and shardId (#33161 ) As part of recent changes made to `ShardOperationFailedException` we introduced `index` and `shardId` members to the base class, but the subclasses are entirely responsible for the serialization of such fields. In the case of `ShardSearchFailure`, we have an additional `SearchShardTarget` instance member which also holds the index and the shardId, hence they get serialized as part of `SearchShardTarget` itself. When de-serializing a `ShardSearchFailure` though, we need to remember to also set the parent class `index` and `shardId` fields otherwise they get lost Relates to #32640	2018-08-27 20:31:27 +02:00
Jason Tedor	0e5d42ca38	Merge branch 'master' into ccr * master: Adjust BWC version on mapping version Token API supports the client_credentials grant (#33106) Build: forked compiler max memory matches jvmArgs (#33138) Introduce mapping version to index metadata (#33147) SQL: Enable aggregations to create a separate bucket for missing values (#32832) Fix grammar in contributing docs SECURITY: Fix Compile Error in ReservedRealmTests (#33166) APM server monitoring (#32515) Support only string `format` in date, root object & date range (#28117) [Rollup] Move toBuilders() methods out of rollup config objects (#32585) Fix forbiddenapis on java 11 (#33116) Apply publishing to genreate pom (#33094) Have circuit breaker succeed on unknown mem usage Do not lose default mapper on metadata updates (#33153) Fix a mappings update test (#33146) Reload Secure Settings REST specs & docs (#32990) Refactor CachingUsernamePassword realm (#32646)	2018-08-27 13:49:59 -04:00
Jason Tedor	318df2a107	Adjust BWC version on mapping version The introduction of mapping version on index metadata has been backported to 6.x. This commit adjusts the BWC version around mapping version to account for this backport.	2018-08-27 13:17:15 -04:00
Jason Tedor	2aef7e0900	Introduce mapping version to index metadata (#33147 ) This commit introduces mapping version to index metadata. This value is monotonically increasing and is updated on mapping updates. This will be useful in cross-cluster replication so that we can request mapping updates from the leader only when there is a mapping update as opposed to the strategy we employ today which is to request a mapping update any time there is an index metadata update. As index metadata updates can occur for many reasons other than mapping updates, this leads to some unnecessary requests and work in cross-cluster replication.	2018-08-27 12:21:11 -04:00
Tal Levy	5783545222	Merge branch 'master' into index-lifecycle	2018-08-27 08:19:05 -07:00
Mikita Karaliou	f1f6d4ed33	Support only string `format` in date, root object & date range (#28117 ) Limit date `format` attribute to String values only. Closes #23650	2018-08-27 12:24:51 +02:00
Daniel Mitterdorfer	06c0055c0f	Have circuit breaker succeed on unknown mem usage With this commit we implement a workaround for https://bugs.openjdk.java.net/browse/JDK-8207200 which is a race condition in the JVM that results in `IllegalArgumentException` to be thrown in rare cases when we determine memory usage via `MemoryMXBean`. As we do not want to fail requests in those cases we always return zero memory usage. Relates #31767 Relates #33125	2018-08-27 07:09:27 +02:00
Jason Tedor	143cd9bbaa	Do not lose default mapper on metadata updates (#33153 ) When applying index metadata updates we run through the mappings updating them if needed. Today if there is not an update to the default mapper, we can lose the default mapping. This means that, for example, if we apply a settings update to an index we will lose the default mapper. This happens because we were not guarding updating the default mapping with a check that the default mapping was updated in the metadata update. When there is no update in the metadata update, we need to continue to preserve the previous default mapping. This commit achieves this by moving the updating of the default mapping under the same guard that we use for updating the default mapping source. We add a test that fails before putting the update under a guard and now passes after moving the update under the guard.	2018-08-26 15:57:52 -04:00
Jason Tedor	f8b07a0d84	Fix a mappings update test (#33146 ) This commit fixes a mappings update test. The test is broken in the sense that it passes, but for the wrong reason. The test here is testing that if we make a mapping update but do not commit that mapping update then the mapper service still maintains the previous document mapper. This was not the case long, long ago when a mapping update would update the in-memory state before the cluster state update was committed. This test was passing, but it was passing because the mapping update was never even updated. It was never even updated because it was encountering a null pointer exception. Of course the in-memory state is not going to be updated in that case, we are simply going to end up with a failed cluster state update. Fixing that leads to another issue which is that the mapping source does not even parse so again we would, of course, end up with the in-memory state not being modified. We fix these issues, assert that the result cluster state task completed successfully, and finally that the in-memory state was not updated since we never committed the resulting cluster state.	2018-08-26 09:36:17 -04:00
Nhat Nguyen	75304f405b	Merge branch 'master' into ccr * master: Add proxy support to RemoteClusterConnection (#33062) TEST: Skip assertSeqNos for closed shards (#33130) TEST: resync operation on replica should acquire shard permit (#33103) Switch remaining x-pack tests to new style Requests (#33108) Switch remaining tests to new style Requests (#33109) Switch remaining ml tests to new style Requests (#33107) Build: Line up IDE detection logic Security index expands to a single replica (#33131) HLRC: request/response homogeneity and JavaDoc improvements (#33133) Checkstyle! [Test] Fix sporadic failure in MembershipActionTests Revert "Do NOT allow termvectors on nested fields (#32728)" [Rollup] Move toAggCap() methods out of rollup config objects (#32583) Fix race condition in scheduler engine test	2018-08-25 21:41:53 -04:00
Simon Willnauer	3376922e8b	Add proxy support to RemoteClusterConnection (#33062 ) This adds support for connecting to a remote cluster through a tcp proxy. A remote cluster can configured with an additional `search.remote.$clustername.proxy` setting. This proxy will be used to connect to remote nodes for every node connection established. We still try to sniff the remote clsuter and connect to nodes directly through the proxy which has to support some kind of routing to these nodes. Yet, this routing mechanism requires the handshake request to include some kind of information where to route to which is not yet implemented. The effort to use the hostname and an optional node attribute for routing is tracked in #32517 Closes #31840	2018-08-25 20:41:32 +02:00
Nhat Nguyen	9dad82ece8	TEST: Skip assertSeqNos for closed shards (#33130 ) If a shard was closed, we return null for SeqNoStats. Therefore the assertion assertSeqNos will hit NPE when it verifies a closed shard. This commit skips closed shards in assertSeqNos and enables this assertion in AbstractDisruptionTestCase.	2018-08-24 21:02:13 -04:00

... 4 5 6 7 8 ...

1752 Commits