OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-25 01:19:02 +00:00

Author	SHA1	Message	Date
Lee Hinman	4315a55a1c	[7.x] Initial documentation for index templates V2 (#55755 ) (#55898 ) Backports the following commits to 7.x: - Initial documentation for index templates V2 (#55755)	2020-04-28 16:10:50 -06:00
Ryan Ernst	f8db1a56f8	Guard java9+ warn option in test config	2020-04-28 14:32:40 -07:00
Ryan Ernst	3f1a983ecb	Fix spotless...whitespace	2020-04-28 14:10:10 -07:00
Ryan Ernst	07f8c0368e	Split java plugin elements out of BuildPlugin (#55834 ) BuildPlugin is a catch all for any elasticsearch common build infrastructure. Unfortunately that makes reusing parts of it difficult. This commit splits the parts specific to all java based projects out to our own elasticsearch.java plugin.	2020-04-28 13:50:40 -07:00
Nik Everett	a5d0409a8f	Save memory on aggs in async search (#55683 ) (#55879 ) This replaces a reference to the result of partially reducing aggregations that async search keeps with a reference to the serialized form of the result of the partial reduction, which we need to keep anyway.	2020-04-28 16:23:30 -04:00
Ryan Ernst	fed296ebb7	Add method to check if object is generically writeable in stream (#54936 ) (#55561 ) When calling scripts in metric aggregation, the returned metric state is passed along to the coordinating node to do the final reduce. However, it is possible the object could contain nested state which is unknown to StreamOutput/StreamInput. This would then result in the node crashing as exceptions are not expected in the middle of serialization. This commit adds a method to StreamOutput that can determine if an object is writeable by the stream. It uses the same logic as writeGenericValue, special casing each of the supported collection types to recursively determine if each contained value is itself writeable. relates #54708	2020-04-28 13:08:41 -07:00
Tim Brooks	9e376589a6	Fully stop RetryableAction when cancelled (#55614 ) Currently cancelling the RetryableAction does not stop one last run from being executed. This commit makes a best effort attempt to cancel a scheduled retry and guards future executions from the action already being completed.	2020-04-28 13:54:00 -06:00
Tim Brooks	cd228095df	Retry failed peer recovery due to transient errors (#55883 ) Currently a failed peer recovery action will fail a recovery. This includes when the recovery fails due to potentially short lived transient issues such as rejected exceptions or circuit breaking errors. This commit adds the concept of a retryable action. A retryable action will be retried in the face of certain errors. The action will be retried after an exponentially increasing backoff period. After a defined time, the action will time out. This commit only implements retries for responses that indicate the target node has NOT executed the action.	2020-04-28 13:52:49 -06:00
Lee Hinman	1c73fcfc86	Mark ITv2 APIs as experimental (#55874 ) This commit marks the V2 index and component template APIs experimental, with intent to mark them as "stable" in 7.9.0. Relates to #53101	2020-04-28 11:27:34 -06:00
Nhat Nguyen	ad6221c0cb	Fix testKeepTranslogAfterGlobalCheckpoint (#55868 ) If we advance the global checkpoint during commit and sync that checkpoint after commit, then the assertions in the test won't hold because the deletion policy did not see the latest global checkpoint but only the value before committing. Closes #55680	2020-04-28 12:50:41 -04:00
Henning Andersen	cab7bcc156	Disk decider respect watermarks for single data node (#55805 ) (#55847 ) The disk decider had special handling for the single data node case, allowing any allocation (skipping watermark checks) for such clusters. This special handling can now be avoided via a setting.	2020-04-28 18:46:22 +02:00
Lee Hinman	777caf0725	[7.x] Add support for V2 index templates to /_cat/templates (#55829 ) (#55866 ) Backports the following commits to 7.x: - Add support for V2 index templates to /_cat/templates (#55829)	2020-04-28 10:14:19 -06:00
Mark Tozzi	bebbc375ae	Wire up IpRangeAggregation to ValuesSourceRegistry (#55831 ) (#55859 )	2020-04-28 12:10:21 -04:00
Armin Braun	f38385ee25	Fix Leaking Listener When Closing NodeClient (#55676 ) (#55864 ) If a node client (or rather its underlying node) is closed then any executions on it will just quietly fail as happens in #55660 via closing the nodes on the test thread and asynchronously using a node client. Closes #55660	2020-04-28 17:27:58 +02:00
Lee Hinman	3b211c1212	Downgrade template update error to a warning for v1 templates (#55611 ) For 7.x, we already implemented the `?prefer_v2_templates` flag and made V2 templates opt-in, so we can relax the error when updating V1 templates to just a warning. This will still be a hard error for 8.0+ Relates to #53101	2020-04-28 09:16:08 -06:00
Armin Braun	51a94102e8	Improve some Byte Array Handling Spots (#55844 ) (#55856 ) Some small memory-saving improvements in `byte[]` handling.	2020-04-28 16:38:48 +02:00
Larry Gregory	47d252424b	Backport: Deprecate the kibana reserved user (#54967 ) (#55822 )	2020-04-28 10:30:25 -04:00
James Rodewig	ddc7305ac9	[DOCS] Correct search API's timeout parm default (#55855 )	2020-04-28 09:44:50 -04:00
James Rodewig	386fb16409	[DOCS] SQL: Update link for supported regex in `RLIKE` docs (#55830 ) The `RLIKE` function docs point users to [Java’s Pattern class doc][0] for regular expression syntax. However, these docs include shorthand character classes, such as `[\d]`, `[\s]`, and `[\w]`. These character classes are not supported in Elasticsearch, which may confuse users. This updates the SQL `RLIKE` docs to refer to the ES [regular expression syntax docs][1], which only document supported syntax. [0]: https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/regex/Pattern.html [1]: https://www.elastic.co/guide/en/elasticsearch/reference/master/regexp-syntax.html Relates to #55231	2020-04-28 09:25:51 -04:00
James Rodewig	452be22a4d	[DOCS] Warn about searching across all fields wt. `query_string` (#55853 ) Warn about potential performance impact when a large number of fields is used with query string query and no default field. Re-adds content from #35570. That content was erroneously removed in #45296. Co-authored-by: Peter Dyson <peter.dyson@geekpete.com>	2020-04-28 09:20:21 -04:00
Christos Soulios	fae9ec13dd	Removed ValuesSourceRegistry.registerAny() (#55846 ) * Backports #55747 to 7.x * All ValuesSourceTypes must be registered explicitly * Removed lambdas in ValuesSourceRegistry	2020-04-28 15:44:42 +03:00
Adrien Grand	58c3bb5ae1	Repurpose `ignore_throttled` to be only about frozen indices. (#55047 ) (#55852 ) This has no practical impact on users since frozen indices are the only throttled indices today. However this has an impact on upcoming features that would use search throttling. Filtering out throttled indices made sense a couple years ago, but as we're now improving support for slow requests with `_async_search` and exploring ways to reduce storage costs, this feature has most likely become a trap, that we'd like to not have with upcoming features that would use search throttling. Relates #54058	2020-04-28 14:31:54 +02:00
David Turner	3f2d10d8fc	Permit searches to be concurrent to prewarming (#55795 ) Today when prewarming a searchable snapshot we use the `SparseFileTracker` to lock each (part of a) snapshotted blob, blocking any other readers from accessing this data until the whole part is available. This commit changes this strategy: instead we optimistically start to download the blob without any locking, and then lock much smaller ranges after each individual `read()` call. This may mean that some bytes are downloaded twice, but reduces the time that other readers may need to wait before the data they need is available. As a best-effort optimisation we try to request the smallest possible single range of missing bytes in the part by first checking how many of the initial and terminal bytes of the part are already present in cache. In particular if the part is already fully cached before prewarming then this check means we skip the part entirely.	2020-04-28 10:44:05 +01:00
Amit Khandelwal	126e4acca8	Expose `preserve_original` in `edge_ngram` token filter (#55766 ) The Lucene `preserve_original` setting is currently not supported in the `edge_ngram` token filter. This change adds it with a default value of `false`. Closes #55767	2020-04-28 10:24:27 +02:00
István Zoltán Szabó	a5cf4712e5	[DOCS] Changes feature importance links to point to the new page (#55531 ) * [DOCS] Changes feature importance links to point to the new page. * [DOCS] Fixes line breaks.	2020-04-28 09:03:43 +02:00
Tim Brooks	80662f31a1	Introduce mechanism to stub request handling (#55832 ) Currently there is a clear mechanism to stub sending a request through the transport. However, this is limited to testing exceptions on the sender side. This commit reworks our transport related testing infrastructure to allow stubbing request handling on the receiving side.	2020-04-27 16:57:15 -06:00
Igor Motov	2ff858b290	Fix error message for unknown value type (#55821 ) (#55825 ) Fixes confusing error message when unknown value type is specified in a terms aggregation. Adds support for parsing "numeric" and "number" value types. Fixes #55727	2020-04-27 18:34:43 -04:00
weizijun	08d328333a	Append indices to update index setting task name (#55714 ) This change adds index names to the name of the update index setting task so we have more information about the pending tasks.	2020-04-27 17:50:36 -04:00
James Rodewig	c16b1edae0	[DOCS] EQL: Fix whitespace in `stringContains` docs	2020-04-27 15:53:59 -04:00
Julie Tibshirani	4bfd65a375	Remove TODO around aggregating on _index. The _index field can in fact be used in aggregations.	2020-04-27 12:48:20 -07:00
Ryan Ernst	70b499b7aa	Simplify java home verification (#55635 ) * Simplify java home verification At one time, all uses of java home were found through the getJavaHome utility method on BuildPlugin. However, that was changed many refactorings ago, but the complex support for registering a java home version needed that fails at configuration time still exists. The only remaining use of grabbing java home is within bwc tests, and must be at runtime since that is when we have the checkout and know what version is needed. This commit consolidates the java home finding method into a utility unassociated with BuildPlugin. * fix checkstyle * address feedback	2020-04-27 12:43:32 -07:00
Tal Levy	6ba5148ead	Add geo_shape support for the geo_centroid aggregation (#55602 ) (#55819 ) This commit leverages the new geo_shape doc values to register a new geo_centroid aggregator that works on geo_shape fields.	2020-04-27 12:16:10 -07:00
James Rodewig	8df5cff9c1	[DOCS] Correct stemmer token filters anchor	2020-04-27 14:57:59 -04:00
James Rodewig	5b8a18c756	[DOCS] Correct stemmer token filter anchor	2020-04-27 14:51:51 -04:00
Ioannis Kakavas	ca5d677130	Mute-55816 (#55818 ) See #55816	2020-04-27 21:26:02 +03:00
Hendrik Muhs	4b93f17b24	[Transform] improve TransformRestTestCase robustness (#55786 ) handles/retries temporary SearchPhaseExecutionErrors fixes #54810	2020-04-27 17:17:53 +02:00
Jake Landis	6f392cf5b9	[7.x] json spec - add description for searchable snapshots (#55746 ) (#55809 )	2020-04-27 10:08:09 -05:00
Mark Tozzi	22a98ec279	Aggregation support for Value Scripts that change types (#54830 ) (#55752 )	2020-04-27 09:57:05 -04:00
Jake Landis	7b4bacebb5	[7.x] fix the schema validation for scripts_painless_context (#55738 ) (#55751 )	2020-04-27 08:39:56 -05:00
Jim Ferenczi	b5916ac455	Ignore closed exception on refresh pending location listener (#55799 ) This newly added listener should catch closed exceptions when accessing the internal engine. Closes #55792	2020-04-27 15:06:35 +02:00
Dimitris Athanasiou	abab4c4d4f	[7.x][ML] Do not fail DFA task when it's stopped whilst reindexing (#55797 ) (#55800 ) Adding to #55659, we missed another way we could set the task to failed due to task cancellation. CI revealed that we might also get a `SearchPhaseExecutionException` whose cause is a `TaskCancelledException`. That exception is not wrapped so unwrapping it will not return the underlying `TaskCancelledException`. Thus to be complete in catching this, we also need to check the error's cause. Closes #55068 Backport of #55797	2020-04-27 16:03:57 +03:00
Dimitris Athanasiou	7f100c1196	[7.x][ML] Allow analytics process define its own progress phases (#55763 ) (#55791 ) This is a continuation from #55580. Now that we're parsing phase progresses from the analytics process we change `ProgressTracker` to allow for custom phases between the `loading_data` and `writing_results` phases. Each `DataFrameAnalysis` may declare its own phases. This commit sets things in place for the analytics process to start reporting different phases per analysis type. However, this is still preserving existing behaviour as all analyses currently declare a single `analyzing` phase. Backport of #55763	2020-04-27 13:30:05 +03:00
Armin Braun	fe9904fbea	More Efficient Blobstore Metadata IO (#55777 ) (#55788 ) No need to copy all these bytes multiple times, especially not when writing a multi-MB global cluster state snapshot through this method.	2020-04-27 11:48:53 +02:00
Ioannis Kakavas	d56f25acb4	Validate hashing algorithm in users tool (#55628 ) (#55734 ) This change adds validation when running the users tool so that if Elasticsearch is expected to run in a JVM that is configured to be in FIPS 140 mode and the password hashing algorithm is not compliant, we would throw an error. The users tool uses the configuration from the node, and this validation would also happen upon node startup, but users might be added in the file realm before the node is started, so this gives us the opportunity to notify the user of the misconfiguration earlier. The changes in #55544 make this much less likely to happen in 8 since the default algorithm will be compliant, but this change can act as a fallback in any case and makes for a better user experience.	2020-04-27 12:23:41 +03:00
Ioannis Kakavas	38b55f06ba	Fix concurrent refresh of tokens (#55114 ) (#55733 ) Our handling for concurrent refresh of access tokens suffered from a race condition where: 1. Thread A has just finished updating the existing token document, but hasn't stored the new tokens in a new document yet. 2. Thread B attempts to refresh the same token and, since the original token document is marked as refreshed, it decrypts and gets the new access token and refresh token and returns that to the caller of the API. 3. The caller attempts to use the newly refreshed access token immediately and gets an authentication error since thread A still hasn't finished writing the document. This commit changes the behavior so that Thread B would first try to do a Get request for the token document where it expects that the access token it decrypted is stored (with exponential backoff) and will not respond until it can verify that it reads it in the tokens index. That ensures that we only ever return tokens in a response if they are already valid and can be used immediately. It also adjusts TokenAuthIntegTests to test authenticating with the tokens each thread receives, which would fail without the fix. Resolves: #54289	2020-04-27 12:23:17 +03:00
Adrien Grand	0753d9a35c	Exists queries to MatchNoneQueryBuilder when the field is unmapped (#55785 ) Co-authored-by: Sivagurunathan Velayutham <sivadeva.93@gmail.com> Closes #54062	2020-04-27 11:06:50 +02:00
Armin Braun	4403b69048	Fix NPE in Partial Snapshot Without Global State (#55776 ) (#55783 ) We make sure to filter shard generations for indices that are missing from the metadata when finalizing a partial snapshot (from concurrent index deletion) but we failed to account for the case where we manually build a fake metadata instance for snapshots without the global state. Fixed this by handling missing indices by skipping, same way we do it for filtering the shard generations. Relates #50234	2020-04-27 10:07:09 +02:00
Nhat Nguyen	1a3f9e5a07	Return true for can_match on idle search shards (#55428 ) With this change, we will always return true for can_match requests on idle search shards; otherwise, some shards will never get refreshed if all search requests perform the can_match phase (i.e., total shards > pre_filter_shard_size). Relates #27500 Relates #50043	2020-04-26 22:21:42 -04:00
David Roberts	3ba44a5af8	[ML] Adding failed_category_count to model_size_stats (#55761 ) The failed_category_count statistic records the number of times categorization wanted to create a new category but couldn't because the job had reached its model_memory_limit. Backport of #55716	2020-04-25 10:36:49 +01:00
Aleksandr Maus	ad54cca823	EQL: implement math functions: add, divide, modulo, multiply, subtract (#55137 ) (#55737 ) * EQL: implement math functions: add, divide, modulo, multiply, subtract	2020-04-24 15:52:27 -04:00
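
The retry behaviour described in commit cd228095df (retry on certain errors, exponentially increasing backoff, give up after a defined time) can be illustrated with a minimal sketch. This is not the `RetryableAction` class from the repository; it is a Python illustration under stated assumptions, and the names `run_with_retries`, `RetryTimeout`, `initial_delay`, `timeout`, `sleep`, and `clock` are hypothetical.

```python
import time
from typing import Callable, Tuple, Type


class RetryTimeout(Exception):
    """Raised when the retry budget runs out (hypothetical name)."""


def run_with_retries(
    action: Callable[[], object],
    retryable: Tuple[Type[BaseException], ...],
    initial_delay: float = 0.05,
    timeout: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> object:
    """Run `action`, retrying on `retryable` errors with doubling delays.

    Assumption: only the exception types listed in `retryable` are treated
    as transient; anything else propagates immediately.
    """
    deadline = clock() + timeout
    delay = initial_delay
    while True:
        try:
            return action()
        except retryable as exc:
            # Give up if the next wait would run past the deadline.
            if clock() + delay > deadline:
                raise RetryTimeout("gave up after timeout") from exc
            sleep(delay)
            delay *= 2  # exponentially increasing backoff
```

The `sleep` and `clock` parameters exist only so the backoff schedule can be checked without real waiting; a caller would normally leave them at their defaults.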


51340 Commits