OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jason Tedor	09bf4e5f00	Introduce private settings (#33327 ) This commit introduces the formal notion of a private setting. This enables us to register some settings that we had previously not registered as fully-fledged settings to avoid them being exposed via APIs such as the create index API. For example, we had hacks in the codebase to allow index.version.created to be passed around inside of settings objects, but was not registered as a setting so that if a user tried to use the setting on any API then they would get an exception. This prevented users from setting index.version.created on index creation, or updating it via the index settings API. By introducing private settings, we can continue to reject these attempts, yet now we can represent these settings as actual settings. In this change, we register index.version.created as an actual setting. We do not cutover all settings that we had been treating as private in this pull request, it is already quite large due to moving some tests around to account for the fact that some tests need to be able to set the index.version.created. This can be done in a follow-up change.	2018-09-03 19:17:57 -04:00
Armin Braun	1f046617bf	TESTS: Fix Race Condition in Temp Path Creation (#33352 ) * TESTS: Fix Race Condition in Temp Path Creation * Calling `createTempDir` concurrently here in the `Follower`s causes collisions at times which lead to `createEngine` throwing because of unexpected files in the newly created temp dir * Fixed by creating all temp dirs in the main test thread * closes #33344	2018-09-03 19:55:59 +02:00
Nhat Nguyen	24d60c7f4b	Fix from_range in search_after in changes snapshot (#33335 ) We can have multiple documents in Lucene with the same seq_no for parent-child documents (or without rollback). In this case, the usage "lastSeenSeqNo + 1" is an off-by-one error as it may miss some documents. This error merely affects the `skippedOperations` contract. See: https://github.com/elastic/elasticsearch/pull/33222#discussion_r213842257 Closes #33318	2018-09-03 11:58:49 -04:00
Armin Braun	42424aff21	TESTS+DISTR.: Fix testIndexCheckOnStartup Flake (#33349 ) * Ignore all `RuntimeException` since random file corruption triggers other RTE in addition to the randomly caught one * closes #33345	2018-09-03 17:06:12 +02:00
tony-dillon	a9d2b1dde8	Null completion field should not throw IAE (#33268 ) Ignore null value on the completion field Closes #33200	2018-09-03 16:49:53 +02:00
Colin Goodheart-Smithe	0bf36253a9	Adds code to help with IndicesRequestCacheIT failures (#33313 ) * Adds code to help with IndicesRequestCacheIT failures Relates to #32827 * Adds comment * Fixes test failure	2018-09-03 14:54:17 +01:00
Alexander Reelsen	246a7df8c2	Core: Fix epoch millis java time formatter (#33302 ) The existing implemention could not deal with negative numbers as well as +- 999 milliseconds around the epoch. This commit uses Instant.ofEpochMilli() and parses the input to a number instead of using a date formatter.	2018-09-03 13:13:19 +02:00
Jim Ferenczi	9310d2eaf3	[CI] Mute IndexShardTests#testIndexCheckOnStartup fails #33345	2018-09-03 10:27:42 +02:00
Jim Ferenczi	2fa75b4438	[CI] Mute LuceneChangesSnapshotTests#testUpdateAndReadChangesConcurrently	2018-09-03 10:14:00 +02:00
Jim Ferenczi	713c07e14d	Add early termination support to BucketCollector (#33279 ) This commit adds the support to early terminate the collection of a leaf in the aggregation framework. This change introduces a MultiBucketCollector which handles CollectionTerminatedException exactly like the Lucene MultiCollector. Any aggregator can now throw a CollectionTerminatedException without stopping the collection of a sibling aggregator. This is useful for aggregators that can infer their result without visiting all documents (e.g.: a min/max aggregation on a match_all query).	2018-09-03 09:34:35 +02:00
Nik Everett	f8b7a4dbc8	Logging: Drop Settings from some logging ctors (#33332 ) Drops `Settings` from some logging ctors now that they are no longer needed. This should allow us to stop passing `Settings` around to quite as many places.	2018-09-02 16:51:26 -04:00
Jason Tedor	ea4eef8641	Merge branch 'master' into ccr * master: HLREST: add update by query API (#32760)	2018-09-02 16:07:50 -04:00
Sohaib Iftikhar	389bf67275	HLREST: add update by query API (#32760 ) Adds update by query to the high level rest client.	2018-09-02 15:15:00 -04:00
Nhat Nguyen	3197a6bbdd	Merge branch 'master' into ccr * master: HLRC: ML Flush job (#33187) HLRC: Adding ML Job stats (#33183) LLREST: Drop deprecated methods (#33223) Mute testSyncerOnClosingShard [DOCS] Moves machine learning APIs to docs folder (#31118)	2018-09-02 09:30:51 -04:00
Nhat Nguyen	ce635f5f15	Mute testSyncerOnClosingShard Tracked at #33330	2018-09-01 09:53:31 -04:00
Nhat Nguyen	b93507608a	Merge branch 'master' into ccr * master: Mute test watcher usage stats output [Rollup] Fix FullClusterRestart test Adjust soft-deletes version after backport into 6.5 completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194) Fix AwaitsFix issue number Mute SmokeTestWatcherWithSecurityIT testsi drop `index.shard.check_on_startup: fix` (#32279) tracked at [DOCS] Moves ml folder from x-pack/docs to docs (#33248) [DOCS] Move rollup APIs to docs (#31450) [DOCS] Rename X-Pack Commands section (#33005) TEST: Disable soft-deletes in ParentChildTestCase Fixes SecurityIntegTestCase so it always adds at least one alias (#33296) Fix pom for build-tools (#33300) Lazy evaluate java9home (#33301) SQL: test coverage for JdbcResultSet (#32813) Work around to be able to generate eclipse projects (#33295) Highlight that index_phrases only works if no slop is used (#33303) Different handling for security specific errors in the CLI. Fix for https://github.com/elastic/elasticsearch/issues/33230 (#33255) [ML] Refactor delimited file structure detection (#33233) SQL: Support multi-index format as table identifier (#33278) MINOR: Remove Dead Code from PathTrie (#33280) Enable forbiddenapis server java9 (#33245)	2018-08-31 19:03:04 -04:00
Nhat Nguyen	08b9247ce2	Adjust soft-deletes version after backport into 6.5 Relates #33222	2018-08-31 16:50:08 -04:00
Vladimir Dolzhenko	00b272af32	completely drop `index.shard.check_on_startup: fix` for 7.0 (#33194 ) Relates to #32279	2018-08-31 22:08:28 +02:00
Vladimir Dolzhenko	3d82a30fad	drop `index.shard.check_on_startup: fix` (#32279 ) drop `index.shard.check_on_startup: fix` Relates #31389	2018-08-31 21:29:06 +02:00
Armin Braun	c6cfa08a61	MINOR: Remove Dead Code from PathTrie (#33280 ) * The array size checks are redundant since the array sizes are checked earlier in those methods too * The removed methods are just not used anywhere	2018-08-31 08:40:27 +02:00
Alpar Torok	44ed5f6306	Enable forbiddenapis server java9 (#33245 )	2018-08-31 09:31:55 +03:00
Nhat Nguyen	ad4dd086d2	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand <jpountz@gmail.com> Co-authored-by: Boaz Leskes <b.leskes@gmail.com> Co-authored-by: Jason Tedor <jason@tedor.me> Co-authored-by: Martijn van Groningen <martijn.v.groningen@gmail.com> Co-authored-by: Nhat Nguyen <nhat.nguyen@elastic.co> Co-authored-by: Simon Willnauer <simonw@apache.org>	2018-08-30 23:46:07 -04:00
Nhat Nguyen	547de71d59	Revert "Integrates soft-deletes into Elasticsearch (#33222 )" Revert to correct co-author tags. This reverts commit `6dd0aa54f6`.	2018-08-30 23:44:57 -04:00
Nhat Nguyen	d3f32273eb	Merge branch 'master' into ccr	2018-08-30 23:22:58 -04:00
Nhat Nguyen	6dd0aa54f6	Integrates soft-deletes into Elasticsearch (#33222 ) This PR integrates Lucene soft-deletes(LUCENE-8200) into Elasticsearch. Highlight works in this PR include: - Replace hard-deletes by soft-deletes in InternalEngine - Use _recovery_source if _source is disabled or modified (#31106) - Soft-deletes retention policy based on the global checkpoint (#30335) - Read operation history from Lucene instead of translog (#30120) - Use Lucene history in peer-recovery (#30522) Relates #30086 Closes #29530 --- These works have been done by the whole team; however, these individuals (lexical order) have significant contribution in coding and reviewing: Co-authored-by: Adrien Grand jpountz@gmail.com Co-authored-by: Boaz Leskes b.leskes@gmail.com Co-authored-by: Jason Tedor jason@tedor.me Co-authored-by: Martijn van Groningen martijn.v.groningen@gmail.com Co-authored-by: Nhat Nguyen nhat.nguyen@elastic.co Co-authored-by: Simon Willnauer simonw@apache.org	2018-08-30 22:11:23 -04:00
Lee Hinman	8a2d154bad	Update serialization versions for custom IndexMetaData backport	2018-08-30 15:56:53 -06:00
Igor Motov	001b78f704	Replace IndexMetaData.Custom with Map-based custom metadata (#32749 ) This PR removes the deprecated `Custom` class in `IndexMetaData`, in favor of a `Map<String, DiffableStringMap>` that is used to store custom index metadata. As part of this, there is now no way to set this metadata in a template or create index request (since it's only set by plugins, or dedicated REST endpoints). The `Map<String, DiffableStringMap>` is intended to be a namespaced `Map<String, String>` (`DiffableStringMap` implements `Map<String, String>`, so the signature is more like `Map<String, Map<String, String>>`). This is so we can do things like: ``` java Map<String, String> ccrMeta = indexMetaData.getCustom("ccr"); ``` And then have complete control over the metadata. This also means any plugin/feature that uses this has to manage its own BWC, as the map is just serialized as a map. It also means that if metadata is put in the map that isn't used (for instance, if a plugin were removed), it causes no failures the way an unregistered `Setting` would. The reason I use a custom `DiffableStringMap` here rather than a plain `Map<String, String>` is so the map can be diffed with previous cluster state updates for serialization. Supersedes #32683	2018-08-30 13:57:00 -06:00
Simon Willnauer	af2eaf2a6c	Remove usage of `index.shrink.source.` in 7.x (#33271 ) We cut over to `index.resize.source.` but still have these constants being public in `IndexMetaData`. Those Settings and constants are not needed in 7.x while we still need to keep the keys known to private settings since they might be part of the index settings of old indices. We can remove that in 8.0. Yet, we should remove the settings to make sure they are not used again.	2018-08-30 21:08:35 +02:00
Jim Ferenczi	d0630093cd	Fix serialization of empty field capabilities response (#33263 ) Fix serialization of empty field capabilities response When no response are required (no indices match the requested patterns) the empty response throws an NPE in the transport serialization (writeTo).	2018-08-30 18:07:58 +02:00
Jim Ferenczi	1404dd2a42	Fix nested _source retrieval with includes/excludes (#33180 ) If an exclude or an include clause removes an entry to a nested field in the original source at query time, the creation of nested hits fails with an NPE. This change fixes this exception and replaces the nested document source with an empty map. Closes #33163 Closes #33170	2018-08-30 15:15:50 +02:00
Nhat Nguyen	13261996ce	Add NoOps to Lucene for failed delete ops (#33217 ) Today we add a NoOp to Lucene and translog if we fail to process an indexing operation. However, we are only adding NoOps to translog for delete operations. In order to have a complete history in Lucene, we should add NoOps of failed delete operations to both Lucene and translog. Relates #29530	2018-08-30 07:55:13 -04:00
David Turner	47859e56ac	Move file-based discovery to core (#33241 ) Today we support a static list of seed hosts in core Elasticsearch, and allow a dynamic list of seed hosts to be provided via a file using the `discovery-file` plugin. In fact the ability to provide a dynamic list of seed hosts is increasingly useful, so this change moves this functionality to core Elasticsearch to avoid the need for a plugin. Furthermore, in order to start up nodes in integration tests we currently assign a known port to each node before startup, which unfortunately sometimes fails if another process grabs the selected port in the meantime. By moving the `discovery-file` functionality into the core product we can use it to avoid this race. This change also moves the expected path to the file from `$ES_PATH_CONF/discovery-file/unicast_hosts.txt` to `$ES_PATH_CONF/unicast_hosts.txt`. An example of this file is not included in distributions. For BWC purposes the plugin still exists, but does nothing more than create the example file in the old location, and issue a warning when it is used. We also continue to support the old location for the file, but warn about its deprecation. Relates #29244 Closes #33030	2018-08-30 06:43:04 +01:00
Armin Braun	cc4d7059bf	Ingest: Add conditional per processor (#32398 ) * Ingest: Add conditional per processor * closes #21248	2018-08-30 03:46:39 +02:00
Jason Tedor	0f22dbb1cc	Apply settings filter to get cluster settings API (#33247 ) Some settings have filters applied to them and we use this in logs and the get nodes info API. For consistency, we should apply this in the get cluster settings API too.	2018-08-29 15:56:13 -04:00
Nhat Nguyen	5632e31c74	Merge branch 'master' into ccr * master: Painless: Add Bindings (#33042) Update version after client credentials backport Fix forbidden apis on FIPS (#33202) Remote 6.x transport BWC Layer for `_shrink` (#33236) Test fix - Graph HLRC tests needed another field adding to randomisation exception list HLRC: Add ML Get Records API (#33085) [ML] Fix character set finder bug with unencodable charsets (#33234) TESTS: Fix overly long lines (#33240) Test fix - Graph HLRC test was missing field name to be excluded from randomisation logic Remove unsupported group_shard_failures parameter (#33208) Update BucketUtils#suggestShardSideQueueSize signature (#33210) Parse PEM Key files leniantly (#33173) INGEST: Add Pipeline Processor (#32473) Core: Add java time xcontent serializers (#33120) Consider multi release jars when running third party audit (#33206) Update MSI documentation (#31950) HLRC: create base timed request class (#33216) [DOCS] Fixes command page titles HLRC: Move ML protocol classes into client ml package (#33203) Scroll queries asking for rescore are considered invalid (#32918) Painless: Fix Semicolon Regression (#33212) ingest: minor - update test to include dissect (#33211) Switch remaining LLREST usage to new style Requests (#33171) HLREST: add reindex API (#32679)	2018-08-29 12:30:24 -04:00
Simon Willnauer	6a0d4b4a77	Remote 6.x transport BWC Layer for `_shrink` (#33236 ) The shrink action was renamed to `_resize` with the addition or split. This bwc layer is unnecessary on 7.x since 6.latest will always use the resize action.	2018-08-29 16:43:13 +02:00
Luca Cavanna	49109187e2	Remove unsupported group_shard_failures parameter (#33208 ) We have had support for the `group_shard_failures` parameter in our code for a while, since we introduced failures grouping. When we introduced validation of parameters at REST, we seem to have forgotten to expose such parameter. Given that the parameter is effectively not supported for many months now, that no user has complained about that and that grouping is the expected behaviour, this commit removes support for the parameter.	2018-08-29 14:05:41 +02:00
Luca Cavanna	034fdbca28	Update BucketUtils#suggestShardSideQueueSize signature (#33210 ) `BucketUtils#suggestShardSideQueueSize` used to calculate the shard_size based on the number of shards. It returns now a different value only based on whether we are querying a single shard or multiple shards. This commit replaces the numberOfShards argument with a boolean that tells whether we are querying a single shard or not.	2018-08-29 13:51:54 +02:00
Armin Braun	f690b492e7	INGEST: Add Pipeline Processor (#32473 ) * INGEST: Add Pipeline Processor * Adds Processor capable of invoking other pipelines * Closes #31842	2018-08-29 11:03:10 +02:00
Alexander Reelsen	48b388ce82	Core: Add java time xcontent serializers (#33120 ) This ensures that the java time class exposed by painless have proper serialization/string representations. Closes #31853	2018-08-29 10:00:16 +02:00
Alpar Torok	f29f0af7bc	Consider multi release jars when running third party audit (#33206 ) Exclude classes meant for newer versions than what we are auditing against, those classes won't be found. There's no reason to exclude JDK classes from newer versions, with this PR, we will not extract them in the first place.	2018-08-29 09:53:04 +03:00
Mark Tozzi	84b61d0738	Scroll queries asking for rescore are considered invalid (#32918 ) This PR changes our behavior from silently ignoring rescore in a scroll query to instead report to the user that such a query is invalid. Closes #31775	2018-08-28 15:48:23 -04:00
Nhat Nguyen	c42dc77896	Merge branch 'master' into ccr * master: [Rollup] Better error message when trying to set non-rollup index (#32965) HLRC: Use Optional in validation logic (#33104) Remove unused User class from protocol (#33137) ingest: Introduce the dissect processor (#32884) [Docs] Add link to es-kotlin-wrapper-client (#32618) [Docs] Remove repeating words (#33087) Minor spelling and grammar fix (#32931) Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979) Watcher: Simplify finding next date in cron schedule (#33015) Run Third party audit with forbidden APIs CLI (part3/3) (#33052) Fix plugin build test on Windows (#33078) HLRC+MINOR: Remove Unused Private Method (#33165) Remove old unused test script files (#32970) Build analysis-icu client JAR (#33184) Ensure to generate identical NoOp for the same failure (#33141) ShardSearchFailure#readFrom to set index and shardId (#33161)	2018-08-28 13:56:38 -04:00
Sohaib Iftikhar	7f5e29ddb2	HLREST: add reindex API (#32679 ) Adds the reindex API to the high level REST client.	2018-08-28 13:02:23 -04:00
Nhat Nguyen	e39689a198	Send only ops after checkpoint in file-based recovery with soft-deletes (#33190 ) Today a file-based recovery will replay all existing translog operations from the primary on a replica so that that replica can have a full history in translog as the primary. However, with soft-deletes enabled, we should not do it because: 1. All operations before the local checkpoint of the safe commit exist in the commit already. 2. The number of operations before the local checkpoint may be considerable and requires a significant amount of time to replay on a replica. Relates #30522 Relates #29530	2018-08-28 12:32:09 -04:00
Nhat Nguyen	e2b931e80b	Use Lucene history in primary-replica resync (#33178 ) This commit makes primary-replica resyncer use Lucene as the source of history operation instead of translog if soft-deletes is enabled. With this change, we no longer expose translog snapshot directly in IndexShard. Relates #29530	2018-08-28 10:44:15 -04:00
Nhat Nguyen	d8a1b7cb17	Make soft-deletes settings final (#33172 ) For now, we do not support changing the soft-deletes setting even with closed indices. Therefore we should make it a final setting. Relates #29530	2018-08-28 08:48:42 -04:00
Jonathan Little	9d92a87ae6	Remove support for deprecated params._agg/_aggs for scripted metric aggregations (#32979 )	2018-08-28 09:27:43 +01:00
Alpar Torok	2cc611604f	Run Third party audit with forbidden APIs CLI (part3/3) (#33052 ) The new implementation is functional equivalent with the old, ant based one. It parses task standard error to get the missing classes and violations in the same way. I considered re-using ForbiddenApisCliTask but Gradle makes it hard to build inheritance with tasks that have task actions , since the order of the task actions can't be controlled. This inheritance isn't dully desired either as the third party audit task is much more opinionated and we don't want to expose some of the configuration. We could probably extract a common base class without any task actions, but probably more trouble than it's worth. Closes #31715	2018-08-28 10:03:30 +03:00
Nhat Nguyen	014b3236dc	Ensure to generate identical NoOp for the same failure (#33141 ) We generate slightly different NoOps in InternalEngine and TransportShardBulkAction for the same failure. 1. InternalEngine uses Exception#getFailure to generate a message without the class name: newOp [NoOp{seqNo=1, primaryTerm=1, reason='Contexts are mandatory in context enabled completion field [suggest_context]'}]. 2. TransportShardBulkAction uses Exception#toString to generate a message with the class name: NoOp{seqNo=1, primaryTerm=1, reason='java.lang.IllegalArgumentException: Contexts are mandatory in context enabled completion field [suggest_context]'}. If a write operation fails while a replica is recovering, that replica will possibly receive two different NoOps: one from recovery and one from replication. These two different NoOps will trip TranslogWriter#assertNoSeqNumberConflict assertion. This commit ensures that we generate the same Noop for the same failure. Closes #32986	2018-08-27 15:59:42 -04:00

1 2 3 4 5 ...

1243 Commits