OpenSearch

Commit Graph

Author	SHA1	Message	Date
Marios Trivyzas	c7efbc1b83	SQL: Implement DATE_PARSE function for parsing strings into DATE values (#57391 ) (#59699 ) Implement the DATE_PARSE(<date_str>, <pattern_str>) function, which parses a date string into a date object according to the specified pattern. The allowed patterns are those of java.time.format.DateTimeFormatter. Closes #54962 Co-authored-by: Marios Trivyzas <matriv@users.noreply.github.com> Co-authored-by: Patrick Jiang(白泽) <dreamlike.sky@foxmail.com> (cherry picked from commit 647a413d9b21bd3938f1716bb19f8407e1334125)	2020-07-16 17:24:30 +02:00
Adam Locke	305b46c7cd	[DOCS] Adding get snapshot status API docs (#59355 ) (#59670 ) * Adding get snapshot status API docs. * Adding more fields and a link to the new page. * Adding missing spaces in TESTRESPONSES * Adding more parameters and making some edits. * Marking snapshot as optional * Marking repository as optional * Add data type for stats * Add data type for shard_stats * Incorporating review feedback. * Lots of review feedback incorporated. * Fixing tests to unbreak CI builds. * Changing indices to index.	2020-07-16 11:21:17 -04:00
Benjamin Trent	a28547c4b4	[7.x] [ML] add new `custom` field to trained model processors (#59542 ) (#59700 ) * [ML] add new `custom` field to trained model processors (#59542) This commit adds the new configurable field `custom`. `custom` indicates if the preprocessor was submitted by a user or automatically created by the analytics job. Eventually, this field will be used in calculating feature importance. When `custom` is true, the feature importance for the processed fields is calculated. When `false` the current behavior is the same (we calculate the importance for the originating field/feature). This also adds new required methods to the preprocessor interface. If users are to supply their own preprocessors in the analytics job configuration, we need to know the input and output field names.	2020-07-16 10:57:38 -04:00
Rene Groeschke	3a228906a9	Fix license header checks on buildSrc (#58457 ) (#59705 )	2020-07-16 16:33:30 +02:00
Howard	c0d429863c	remove unused cluster name in environment. (backport of #59605 ) (#59681 ) removes an unused variable	2020-07-16 09:25:55 -04:00
Nik Everett	343053c0a7	Fix compilation in Eclipse (backport #59675 ) Eclipse was confused by #59583. It can't see the public inner interface within the superclass. This time. Usually that is fine, but the Eclipse gods don't like this particular code, I guess.	2020-07-16 08:25:12 -04:00
István Zoltán Szabó	76fbe0a6d9	[DOCS] Sorts agg and grouping names alphabetically in PUT Transforms API docs. (#59688 )	2020-07-16 12:45:29 +02:00
Alan Woodward	27067de699	Make MappedFieldType#meta final (#59383 ) The MappedFieldType#updateMeta method was used for testing equality checks, but we no longer need these after #59212 , so we can remove this method and make meta final.	2020-07-16 09:45:55 +01:00
David Kyle	c349fdcb89	Mute RegressionIT testWithDataStream (#59687 ) For #59664	2020-07-16 09:45:29 +01:00
Przemysław Witek	df4fea79cb	Add a "verbose" option to the data frame analytics stats endpoint (#59589 ) (#59621 )	2020-07-16 09:51:31 +02:00
Armin Braun	6db481f49e	Fix ConcurrentSnapshotsIT.testEquivalentDeletesAreDeduplicated (#59611 ) (#59653 ) Trying to queue up snapshot deletes by blocking the delete of the latest index-N doesn't work here. The first delete will block on the delete operation but only do so after having already written the updated repository data. Since that repository data will contain no snapshots, the subsequent deletes for `*` will just fall through and complete instead of queueing up. => Fixed by simply waiting on all files on master so that we block before updating the repository data and get to test the queueing of equivalent operations. Closes #59608	2020-07-16 09:28:36 +02:00
Nhat Nguyen	b599f7a9c0	Fix estimate size of translog operations (#59206 ) Make sure that the estimateSize method includes all fields of translog operations.	2020-07-16 00:19:30 -04:00
Yang Wang	067db1fc3b	Fix test of API key creation in a mixed cluster (#59680 ) RoleDescriptors are mandatory prior to v7.3 Relates: #59425	2020-07-16 12:44:17 +10:00
lcawl	4ad8bef33b	[DOCS] Removes docs PR from release notes	2020-07-15 16:07:43 -07:00
Costin Leau	5f2285a8b3	EQL: Fix bug in returning results (#59673 ) Using serialization/deserialization when dealing with non-trivial documents causes the process to get stuck, not to mention that it is expensive. Use a much simpler approach at the expense of losing information (we're just interested in the source after all). (cherry picked from commit e1659822db7ce1390ba9bbfb21768e24a0907dff)	2020-07-16 01:01:13 +03:00
James Rodewig	43481441e9	[DOCS] EQL: Update EQL search response format (#59554 ) (#59668 )	2020-07-15 17:23:48 -04:00
Julie Tibshirani	2b70758a05	Correct type parametrization in geo mappers. (#59583 ) Previously the concrete type parameters for the MappedFieldType didn't always match those for the FieldMapper. This PR updates the mappers so that the type parameters always match, which makes the design easier to follow.	2020-07-15 14:10:47 -07:00
Martijn van Groningen	f1028fbbcc	Only install stack templates via elected master node (#59624 ) (#59657 ) to avoid many error stacktraces in logs during a rolling upgrade. Stack templates use the composable index template and component APIs. These APIs aren't supported in 7.7 and earlier, and in mixed cluster environments this can cause a lot of ActionNotFoundTransportException errors in the logs during rolling upgrades. If these templates are only installed via the elected master node, then the APIs are always there and the ActionNotFoundTransportException errors are prevented.	2020-07-15 22:22:01 +02:00
Lee Hinman	74372df824	Mute {p0=mixed_cluster/120_api_key_auth/Test API key authentication will work in a mixed cluster} (#59663 ) Relates to #59425	2020-07-15 14:14:33 -06:00
Boice Huang	ef26c1739b	fix typo in Exception Response in GeoJson (#59270 )	2020-07-15 20:15:18 +01:00
Boice Huang	07a58d915d	Fix typo in AggregationProfiler (#59269 )	2020-07-15 20:14:19 +01:00
James Rodewig	e30af2fc35	[DOCS] Fix syntax and wording in EQL docs (#59623 ) (#59650 )	2020-07-15 14:45:56 -04:00
Nhat Nguyen	93d419b9c8	Mute CcrRollingUpgradeIT Tracked at #59625	2020-07-15 14:43:32 -04:00
Armin Braun	cc7093645c	Cleanup some Serialization Code around Snapshots (#59532 ) (#59606 ) A number of obvious possible simplifications that also improve efficiency in some cases (better empty collection handling and size hint use). Also, added a shortcut for writing and reading immutable open maps that can be used to dry up additional spots.	2020-07-15 20:40:43 +02:00
Adam Locke	776e9507fb	[DOCS] Update similarity.asciidoc (#59400 ) (#59644 ) Community contribution to fix linking issues in the Similarity module docs. Co-authored-by: Xin Yan <SHU_Yanx@hotmail.com>	2020-07-15 14:12:00 -04:00
David Kyle	df7fc8f967	Accounting for model size when models are not cached (#59607 ) When an inference model is loaded it is accounted for in circuit breaker and should not be released until there are no users of the model. Adds a reference count to the model to track usage.	2020-07-15 18:06:15 +01:00
James Rodewig	ef9b14b07e	[DOCS] Add `write_index_only` param to ds mapping tutorials (#59618 ) (#59639 )	2020-07-15 13:02:01 -04:00
David Turner	67e7c3f60e	Fix failing test introduced in #59601	2020-07-15 17:44:27 +01:00
Rory Hunter	b8d73a1e7e	Default gateway.auto_import_dangling_indices to false (#59302 ) Backport of #58898. Part of #48366. Now that there is a dedicated API for dangling indices, the auto-import behaviour can default to off. Also add a note to the breaking changes for 7.9.0.	2020-07-15 17:10:42 +01:00
David Turner	691759fb1f	Validate snapshot UUID during restore (#59601 ) Today when mounting a searchable snapshot we obtain the snapshot/index UUIDs and then assume that these are the UUIDs used during the subsequent restore. If you concurrently delete the snapshot and replace it with one with the same name then this assumption is violated, with chaotic consequences. This commit introduces a check that ensures that the snapshot UUID does not change during the mount process. If the snapshot remains in place then the index UUID necessarily does not change either. Relates #50999	2020-07-15 16:23:20 +01:00
Costin Leau	6b75525efb	EQL: Improve testing spec (#59615 ) Case sensitivity is incorporated as a test dimension - instead of running the same test twice, two different tests are created. Clean-up the test invocation by removing unused parameters. Fix #59294 (cherry picked from commit 72c8a3582d8e8a4a663d82814a17a1a3d2757292)	2020-07-15 18:07:24 +03:00
Igor Motov	b5ab447b3e	EQL: Fix async EQL Rest test (#59556 ) (#59620 ) Unfortunately, we cannot guarantee that the execution will be truly async even with 0ms timeout since we cannot block the execution. So, we need to modify the test to work in both async and non-async mode. Closes #59416	2020-07-15 11:02:33 -04:00
Martijn van Groningen	2a89e13e43	Move data stream transport and rest action to xpack (#59593 ) Backport of #59525 to 7.x branch. * Actions are moved to xpack core. * Transport and rest actions are moved to the data-streams module. * Removed data streams methods from Client interface. * Adjusted tests to use client.execute(...) instead of data stream specific methods. * Only attempt to delete all data streams if xpack is installed in rest tests. * Now that ds apis are in xpack and ESIntegTestCase no longer deletes all ds, do that in the MlNativeIntegTestCase class for ml tests.	2020-07-15 16:50:44 +02:00
James Rodewig	8cac702171	[DOCS] Note that EQL timestamp field can also be date_nanos	2020-07-15 09:55:55 -04:00
James Rodewig	4e58f967de	[DOCS] Update ds overview for optional `@timestamp` mapping (#59558 ) (#59614 )	2020-07-15 09:46:55 -04:00
Martijn van Groningen	53249dcca8	No need to select only < 7.9 nodes in 7.x branch. (#59609 )	2020-07-15 15:23:16 +02:00
Rory Hunter	2e05ce5f88	Bump version to 7.10.0	2020-07-15 11:56:45 +01:00
Ignacio Vera	f8037abf47	upgrade to lucene-8.6.0 release (#59596 ) (#59599 )	2020-07-15 12:40:57 +02:00
Tanguy Leroux	604f22db79	Use a dedicated thread pool for searchable snapshot cache prewarming (#59313 ) (#59590 ) Since #58728 writing operations on searchable snapshot directory cache files are executed in an asynchronous manner using a dedicated thread pool. The thread pool used is searchable_snapshots which has been created to execute prewarming tasks. Reusing the same thread pool wasn't a good idea as it can lead to deadlock situations. One of these situations arose in a test failure where the thread pool was full of prewarming tasks, all waiting for a cache file to be accessible, while the cache file was being evicted by the cache service. But such an eviction can only be processed when all read/write operations on the cache file are completed and in this case the deadlock occurred because the cache file was actively being read by a concurrent search which also won the privilege to write the range of bytes in cache... and this writing operation could never have been completed because of the prewarming tasks making no progress and filling up the thread pool. This commit renames the searchable_snapshots thread pool to searchable_snapshots_cache_fetch_async. Assertions are added to assert that cache writes are executed using this thread pool and to assert that reads on cached index inputs are executed using a different thread pool to avoid potential deadlock situations. This commit also adds a searchable_snapshots_cache_prewarming thread pool that is used to execute prewarming tasks. It also converts the existing cache prewarming test into a more complete integration test that creates multiple searchable snapshot indices concurrently with randomized thread pool sizes, and verifies that all files have been correctly prewarmed.	2020-07-15 11:45:52 +02:00
Martijn Laarman	a699c89133	[DOCS] Add release notes for 7.8.1 (#59594 ) (cherry picked from commit f43a233948f13e487d4d0f4be668687c404a71f4)	2020-07-15 11:42:03 +02:00
David Turner	0c2510dc68	Don't request cluster metadata in _cat/shards impl (#59548 ) Today `GET _cat/shards` requests the nodes, routing table, and metadata from the cluster state, but it does not use any information from the metadata portion of the response. Metadata includes things like mappings and templates that may be substantial in size. This commit drops the unnecessary metadata portion of this cluster state request.	2020-07-15 10:14:48 +01:00
Francisco Fernández Castaño	66ef1cdad7	Add the possibility to inject a custom RecoveryState factory to IndexStorePlugin implementations (#59124 ) Add a custom factory for recovery state into IndexStorePlugin that allows different implementors to provide their own RecoveryState implementation. Backport of #59038	2020-07-15 11:11:07 +02:00
Yannick Welsch	bc11503dc3	Wait for active license in CcrRestIT (#59543 ) Relates #53966 Closes #59486	2020-07-15 09:38:08 +02:00
Armin Braun	96f52a028f	Fix Snapshot not Starting in Partial Snapshot Corner Case (#59428 ) (#59584 ) We were not handling the case where during a partial snapshot all shards would enter a failed state right off the bat. Closes #59384	2020-07-15 07:59:22 +02:00
Armin Braun	ecf97e9415	Remove Outdated Documentation On Snapshots (#59358 ) (#59585 ) * We now have concurrent repository operations so the one at a time limit does not apply any longer * Initialization was never slow solely due to loading information about all existing snapshots (though this contributed) but also because two cluster state updates and a few writes to the repository had to happen before initialization could return * Repo data necessary for a snapshot create operation is now cached on heap so loading it is effectively instant * Snapshot initialization is just a single CS update now * Initialization does no writes to the repository whatsoever * Fixed missing `repository`	2020-07-15 07:49:18 +02:00
Tal Levy	4bb91b61e8	Adds support for date_nanos in Rollup Metric and DateHistogram Configs (#59349 ) (#59577 ) Closes #44505.	2020-07-14 22:37:48 -07:00
Armin Braun	2dd086445c	Enable Fully Concurrent Snapshot Operations (#56911 ) (#59578 ) Enables fully concurrent snapshot operations: * Snapshot create- and delete operations can be started in any order * Delete operations wait for snapshot finalization to finish, are batched as much as possible to improve efficiency and once enqueued in the cluster state prevent new snapshots from starting on data nodes until executed * We could be even more concurrent here in a follow-up by interleaving deletes and snapshots on a per-shard level. I decided not to do this for now since it seemed not worth the added complexity yet. Due to batching+deduplicating of deletes the pain of having a delete stuck behind a long-running snapshot seemed manageable (dropped client connections + resulting retries don't cause issues due to deduplication of delete jobs, batching of deletes allows enqueuing more and more deletes even if a snapshot blocks for a long time that will all be executed in essentially constant time (due to bulk snapshot deletion, deleting multiple snapshots is mostly about as fast as deleting a single one)) * Snapshot creation is completely concurrent across shards, but per shard snapshots are linearized for each repository as are snapshot finalizations See updated JavaDoc and added test cases for more details and illustration on the functionality. Some notes: The queuing of snapshot finalizations and deletes and the related locking/synchronization is a little awkward in this version but can be much simplified with some refactoring. The problem is that snapshot finalizations resolve their listeners on the `SNAPSHOT` pool while deletes resolve the listener on the master update thread. With some refactoring both of these could be moved to the master update thread, effectively removing the need for any synchronization around the `SnapshotService` state. I didn't do this refactoring here because it's a fairly large change and not necessary for the functionality but plan to do so in a follow-up. This change allows for completely removing any trickery around synchronizing deletes and snapshots from SLM and 100% does away with SLM errors from collisions between deletes and snapshots. Snapshotting a single index in parallel to a long running full backup will execute without having to wait for the long running backup as required by the ILM/SLM use case of moving indices to "snapshot tier". Finalizations are linearized but ordered according to which snapshot saw all of its shards complete first	2020-07-15 03:42:31 +02:00
Armin Braun	06d94cbb2a	Fix TODO about Spurious FAILED Snapshots (#58994 ) (#59576 ) There is no point in writing out snapshots that contain no data that can be restored whatsoever. It may have made sense to do so in the past when there was an `INIT` snapshot step that wrote data to the repository that would've otherwise become unreferenced, but in the current day state machine without the `INIT` step there is no point in doing so.	2020-07-15 00:54:30 +02:00
Armin Braun	e1014038e9	Simplify Repository.finalizeSnapshot Signature (#58834 ) (#59574 ) Many of the parameters we pass into this method were only used to build the `SnapshotInfo` instance to write. This change simplifies the signature. Also, it seems less error prone to build `SnapshotInfo` in `SnapshotsService` instead of relying on the fact that each repository implementation will build the correct `SnapshotInfo`.	2020-07-15 00:14:28 +02:00
Armin Braun	16a47e0d08	Simplify SnapshotsInProgress Construction (#58893 ) (#59573 ) With parallel snapshots incoming (but also in isolation) it makes sense to clean up `SnapshotsInProgress` construction. We don't need to pre-compute the waiting shards for every entry. We rarely use this information (only on routing changes) and in the one spot we did we now simply spend the extra cycles for looping over all shards instead of just the waiting ones once per routing change tops instead of on every change to `SnapshotsInProgress` (moreover, we would burn the cycles for looping on all nodes even though only the current master cares about the information). In addition to that change I removed some dead code constructors and slightly optimized deserialization.	2020-07-15 00:00:53 +02:00
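
Commit c7efbc1b83 in the table above says that DATE_PARSE accepts the patterns of java.time.format.DateTimeFormatter. The following is a minimal sketch of how such a pattern parses a date string in plain Java. It is not the SQL function's implementation; the class name DateParseSketch and the method parseDate are hypothetical and exist only for illustration.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;

// Hypothetical helper class, not part of the commit described above.
final class DateParseSketch {

    // Parses dateStr according to a java.time.format.DateTimeFormatter pattern.
    // Throws DateTimeParseException if the string does not match the pattern.
    static LocalDate parseDate(String dateStr, String patternStr) {
        DateTimeFormatter formatter = DateTimeFormatter.ofPattern(patternStr);
        return LocalDate.parse(dateStr, formatter);
    }

    public static void main(String[] args) {
        // A day/month/year string parsed with a matching pattern.
        System.out.println(parseDate("16/07/2020", "dd/MM/yyyy"));

        // A string that does not match the pattern is rejected.
        try {
            parseDate("2020-07-16", "dd/MM/yyyy");
        } catch (DateTimeParseException e) {
            System.out.println("mismatch rejected");
        }
    }
}
```

The pattern letters (dd, MM, yyyy) are the standard DateTimeFormatter ones; how the SQL function reports a mismatch to the user is not stated in the commit message and is not modelled here.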
